microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 14 days ago • 304k • 1.55k
sdasd112132/Vision-8B-MiniCPM-2_5-Uncensored-and-Detailed-4bit Visual Question Answering • 9B • Updated Jun 1, 2024 • 26 • 32
microsoft/Phi-4-multimodal-instruct-onnx Automatic Speech Recognition • Updated 24 days ago • 156 • 84
google/pix2struct-widget-captioning-large Visual Question Answering • 1B • Updated Apr 10, 2024 • 74 • 20