Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lmms-lab
's Collections
LLaVA-OneVision-1.5
LLaVA-Critic-R1
MMSearch-R1
Aero-1-Audio
EgoLife
VideoMMMU
Multimodal-SAE
LLaVA-Critic
LLaVA-Video
LLaVA-OneVision
LMMs-Eval
LongVA
LLaVA-Next-Interleave
LLaVA-NeXT
LMMs-Eval-Lite
LLaVA-OneVision
updated
Sep 17
a model good at arbitrary types of visual input
Upvote
31
+21
LLaVA-OneVision: Easy Visual Task Transfer
Paper
•
2408.03326
•
Published
Aug 6, 2024
•
60
lmms-lab/LLaVA-OneVision-Mid-Data
Viewer
•
Updated
Aug 26, 2024
•
563k
•
268
•
21
lmms-lab/LLaVA-OneVision-Data
Viewer
•
Updated
May 24
•
3.94M
•
18.4k
•
220
lmms-lab/LLaVA-NeXT-Data
Viewer
•
Updated
Aug 30, 2024
•
779k
•
2.68k
•
41
lmms-lab/llavanext-qwen-siglip-tokenizer
Text Generation
•
Updated
Jul 11, 2024
•
10
•
3
lmms-lab/llava-onevision-qwen2-0.5b-si
Text Generation
•
0.9B
•
Updated
Sep 2, 2024
•
2.42k
•
14
lmms-lab/llava-onevision-qwen2-0.5b-ov
Text Generation
•
0.9B
•
Updated
Sep 2, 2024
•
25.8k
•
24
lmms-lab/llava-onevision-qwen2-7b-si
Text Generation
•
8B
•
Updated
Sep 2, 2024
•
4.14k
•
12
lmms-lab/llava-onevision-qwen2-7b-ov
Text Generation
•
8B
•
Updated
Sep 2, 2024
•
95.9k
•
57
lmms-lab/llava-onevision-qwen2-72b-si
Text Generation
•
73B
•
Updated
Sep 2, 2024
•
10
•
1
lmms-lab/llava-onevision-qwen2-72b-ov-sft
Text Generation
•
73B
•
Updated
Sep 2, 2024
•
595
•
14
lmms-lab/llava-onevision-qwen2-72b-ov-chat
Image-Text-to-Text
•
73B
•
Updated
Oct 9, 2024
•
116
•
9
lmms-lab/llava-onevision-projectors
Updated
Aug 14, 2024
•
3
lmms-lab/llava-onevision-qwen2-0.5b-mid-stage-a4
1B
•
Updated
Aug 6, 2024
•
200
lmms-lab/llava-onevision-qwen2-7b-mid-stage-a4
8B
•
Updated
Aug 6, 2024
•
101
lmms-lab/LLaVA-OneVision-1.5-8B-Instruct
Image-Text-to-Text
•
9B
•
Updated
3 days ago
•
4.82k
•
46
lmms-lab/LLaVA-OneVision-1.5-8B-stage0
9B
•
Updated
24 days ago
•
32
•
2
Upvote
31
+27
Share collection
View history
Collection guide
Browse collections