microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 25 days ago • 216k • 1.56k
microsoft/Phi-4-multimodal-instruct-onnx Automatic Speech Recognition • Updated Nov 30, 2025 • 140 • 86
google/pix2struct-widget-captioning-large Visual Question Answering • 1B • Updated Apr 10, 2024 • 48 • 20