Yansong Shi
nanamma
AI & ML interests
multimodality, video understanding, robotics
Recent Activity
authored a paper about 1 month ago
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
authored a paper about 1 month ago
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
new activity 2 months ago
qiukingballball/RoboCerebra: how to test