arxiv:2410.19702
Yansong Shi
nanamma
AI & ML interests
multimodality, video understanding, robotics
Recent Activity
authored a paper 29 days ago
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
authored a paper 29 days ago
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
new activity 2 months ago
qiukingballball/RoboCerebra: how to test