Enhancing Long Video Understanding via Hierarchical Event-Based Memory Paper • 2409.06299 • Published Sep 10, 2024
Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM Paper • 2505.18110 • Published May 23 • 1
G$^2$RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance Paper • 2508.13023 • Published Aug 18 • 1
TRACE: Temporal Grounding Video LLM via Causal Event Modeling Paper • 2410.05643 • Published Oct 8, 2024 • 9
FedRC: Tackling Diverse Distribution Shifts Challenge in Federated Learning by Robust Clustering Paper • 2301.12379 • Published Jan 29, 2023
VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding Paper • 2405.13382 • Published May 22, 2024 • 1
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models Paper • 2405.14297 • Published May 23, 2024 • 3
FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction Paper • 2205.13462 • Published May 26, 2022