Audio stabilityai/stable-audio-open-small Text-to-Audio • Updated May 27 • 1.65k • 236 Running Featured 81 ONNX Model Explorer 🔍 81 Explore ONNX models interactively microsoft/VibeVoice-1.5B Text-to-Speech • 3B • Updated Sep 1 • 169k • 1.99k
Play-Ground Running on CPU Upgrade 233 Inference Playground 🔋 233 Set theme for Hugging Face Playground
OCR SkalskiP/paligemma2_latex_ocr_v5 Updated Dec 11, 2024 • 4 • 2 nanonets/Nanonets-OCR-s Image-Text-to-Text • 4B • Updated Jun 20 • 113k • 1.56k
Multimode microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 420k • 1.54k ByteDance/Sa2VA-8B Image-Text-to-Text • 8B • Updated Sep 8 • 1.24k • 65
Speako ibm-granite/granite-speech-3.2-8b Automatic Speech Recognition • 8B • Updated Apr 16 • 110 • 84 ByteDance/MegaTTS3 Text-to-Speech • Updated Apr 4 • 164 • 412 Sleeping Demo 🚀 Transcribe audio/video to text
Audio stabilityai/stable-audio-open-small Text-to-Audio • Updated May 27 • 1.65k • 236 Running Featured 81 ONNX Model Explorer 🔍 81 Explore ONNX models interactively microsoft/VibeVoice-1.5B Text-to-Speech • 3B • Updated Sep 1 • 169k • 1.99k
Speako ibm-granite/granite-speech-3.2-8b Automatic Speech Recognition • 8B • Updated Apr 16 • 110 • 84 ByteDance/MegaTTS3 Text-to-Speech • Updated Apr 4 • 164 • 412 Sleeping Demo 🚀 Transcribe audio/video to text
Play-Ground Running on CPU Upgrade 233 Inference Playground 🔋 233 Set theme for Hugging Face Playground
OCR SkalskiP/paligemma2_latex_ocr_v5 Updated Dec 11, 2024 • 4 • 2 nanonets/Nanonets-OCR-s Image-Text-to-Text • 4B • Updated Jun 20 • 113k • 1.56k
Multimode microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 420k • 1.54k ByteDance/Sa2VA-8B Image-Text-to-Text • 8B • Updated Sep 8 • 1.24k • 65