Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DAMO-NLP-SG
/
VideoRefer-VideoLLaMA3-7B
like
11
Follow
Language Technology Lab at Alibaba DAMO Academy
153
Video-Text-to-Text
Transformers
Safetensors
English
videollama3_qwen2
text-generation
multimodal large language model
large video-language model
custom_code
arxiv:
4 papers
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
e94edba
VideoRefer-VideoLLaMA3-7B
Commit History
Update README.md
e94edba
verified
CircleRadon
commited on
Jun 18
Update config.json
5a71ce0
verified
CircleRadon
commited on
Jun 17
Upload tokenizer
565a591
verified
CircleRadon
commited on
Jun 17
Upload Videollama3Qwen2ForCausalLM
b899a04
verified
CircleRadon
commited on
Jun 17
initial commit
e07d466
verified
CircleRadon
commited on
Jun 17