zhanghang's picture

zhanghang

hangzhang-nlp

·

hangzhang-nlp

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Qwen3-VL Technical Report

liked a model about 2 months ago

Qwen/Qwen3-VL-2B-Thinking

liked a model about 2 months ago

Qwen/Qwen3-VL-2B-Instruct

View all activity

Organizations

upvoted a paper 12 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 19 days ago • 124

liked 13 models about 2 months ago

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20 • 39.4k • 91

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23 • 540k • 231

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated Oct 15 • 820k • 268

Qwen/Qwen3-VL-4B-Thinking

Image-Text-to-Text • 4B • Updated Oct 15 • 51.2k • 89

Qwen/Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated Oct 15 • 2.65M • • 553

Qwen/Qwen3-VL-8B-Thinking

Image-Text-to-Text • 9B • Updated 20 days ago • 183k • 154

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

Image-Text-to-Text • 31B • Updated 20 days ago • 131k • 91

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated 20 days ago • 1.35M • • 443

Qwen/Qwen3-VL-30B-A3B-Thinking

Image-Text-to-Text • 31B • Updated 20 days ago • 53.8k • • 164

Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

Image-Text-to-Text • 236B • Updated 20 days ago • 311k • 32

Qwen/Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text • 236B • Updated 20 days ago • 8.3k • 24

Qwen/Qwen3-VL-235B-A22B-Instruct

Image-Text-to-Text • 236B • Updated 20 days ago • 146k • • 334

Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated 20 days ago • 6.42k • • 344

liked a Space 6 months ago

VideoRefer VideoLLaMA3

VideoRefer x VideoLLaMA3

upvoted a paper 6 months ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 114

upvoted a paper 8 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 303

upvoted a paper 9 months ago

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Paper • 2406.07476 • Published Jun 11, 2024 • 37

upvoted 2 papers 10 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published Feb 19 • 28