SpatialVID: A Large-Scale Video Dataset with Spatial Annotations Paper • 2509.09676 • Published Sep 11 • 32
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Paper • 2406.07502 • Published Jun 11, 2024 • 1
FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Paper • 2408.12168 • Published Aug 22, 2024
EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning Paper • 2505.16312 • Published May 22 • 3 • 5
EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning Paper • 2505.16312 • Published May 22 • 3
Backdoor Cleaning without External Guidance in MLLM Fine-tuning Paper • 2505.16916 • Published May 22 • 17