Pay Less Attention to Function Words for Free Robustness of Vision-Language Models Paper • 2512.07222 • Published about 1 month ago • 1
Adversarial Video Promotion Against Text-to-Video Retrieval Paper • 2508.06964 • Published Aug 9, 2025 • 9
GAID: Frame-Level Gated Audio-Visual Integration with Directional Perturbation for Text-Video Retrieval Paper • 2508.01711 • Published Aug 3, 2025 • 1