meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 324k • 1.54k
Open ASR Leaderboard 🏆 Display and request speech recognition model benchmarks • Running on CPU Upgrade • 1.12k
Article Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B By nvidia and 9 others • Aug 18 • 30
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 21 items • Updated 11 days ago • 116
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit Image-to-Text • 6B • Updated Dec 10, 2024 • 12.1k • 79
unsloth/Llama-3.2-11B-Vision-Instruct-unsloth-bnb-4bit Image-to-Text • 6B • Updated Dec 4, 2024 • 7.06k • 28
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16, 2024 • 154
NVLM 1.0 Collection A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 12 days ago • 52