Yansong Shi's picture

7 3

Yansong Shi

nanamma

·

https://huggingface.co/nanamma

AI & ML interests

multi modality, video understanding, robotics

Recent Activity

authored a paper about 1 month ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

authored a paper about 1 month ago

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

new activity 2 months ago

qiukingballball/RoboCerebra:how to test

View all activity

Organizations

New activity in qiukingballball/RoboCerebra 2 months ago

how to test

#4 opened 2 months ago by

New activity in Enxin/MovieChat-1K_train about 1 year ago

so many quote '"' in captions in json files

#2 opened about 1 year ago by

New activity in openbmb/RLHF-V about 1 year ago

key_error "beit3_llava"

#4 opened about 1 year ago by

New activity in liuhaotian/LLaVA-Instruct-150K over 1 year ago

请问哪里可以下载 LLaVA-150K 的图片

#4 opened about 2 years ago by