Marco
AI & ML interests
Recent Activity
Organizations
-
Running176176
Qwen3 Omni Demo
⚡Interact with a multimodal chatbot using text, audio, images, or video
-
Running4646
Qwen3 Omni Captioner Demo
🐠Generate captions from audio
-
Qwen/Qwen3-Omni-30B-A3B-Thinking
Any-to-Any • 32B • Updated • 65.2k • 222 -
Qwen/Qwen3-Omni-30B-A3B-Instruct
Any-to-Any • 35B • Updated • 325k • 690
-
RunningMCP119119
Consilium MCP Server
🏢Multi-AI Expert Consensus Platform
-
SleepingMCP22
MCP Hackathon Deepfake Watchdog
🛡Upload your image and/or voice to scan for deepfake misuse o
-
Running3535
VulnBuster
🛡AI Security Agent: Multi-MCP Code Vulnerability Scanner
-
RunningMCP188188
AI Marketing Content Generator
🎨An AI-powered tool made for content creators and marketers
-
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition • Updated • 1.91M • 1.35k -
Running on Zero440440
Parakeet-TDT-0.6b-V2
Transcribe audio to text with timestamps
-
Running on CPU Upgrade3333
Blazing Fast Whisper
👁Blazing Fast Whisper Deployed on HF Inference Endpoints
-
Running on CPU Upgrade1.12k1.12k
Open ASR Leaderboard
🏆Display and request speech recognition model benchmarks
-
Running on T48181
RF-DETR
🔥SOTA real-time object detection model
-
Running on CPU Upgrade4949
YOLO ARENA
🏟compare performance of top object detectors
-
Running on Zero8888
D-Fine - SOTA Real-Time Object Detector
⚡Object Detection on Images and Video
-
Running on ZeroMCP2828
Gaze LLE
👀Gaze Target Estimation
-
Running on ZeroMCP537537
LatentSync
👄Audio Conditioned LipSync with Latent Diffusion Models
-
Running on Zero214214
BEN2
🚀Remove background from images and videos
-
Build error8181
SmolVLM
📊Generate answers by combining text and images
-
Runtime error5858
SmolVLM2 HighlightGenerator
🐨Generate video highlights from uploaded video
-
onnx-community/Kokoro-82M-ONNX
Text-to-Speech • Updated • 13.3k • 152 -
Running218218
Kokoro Text-to-Speech
🗣High-quality speech synthesis powered by Kokoro TTS
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text • 8B • Updated • 6.98k • 160 -
jonatasgrosman/wav2vec2-large-xlsr-53-english
Automatic Speech Recognition • 0.3B • Updated • 303k • 475
-
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text • 0.6B • Updated • 54.3k • 217 -
Running on Zero8484
GOT OCR Transformers
📷Demo of GOT-OCR 2.0's Transformers implementation
-
allenai/olmOCR-7B-0225-preview
Image-to-Text • 8B • Updated • 11.3k • 703 -
allenai/olmOCR-mix-0225
Viewer • Updated • 259k • 1.12k • 167
-
Running on CPU Upgrade11.4k11.4k
Stable Diffusion 2-1
🔥Generate images from text prompts
-
Running6060
SmolVLM 256M Instruct WebGPU
🐨Find answers by describing images
-
deepseek-ai/Janus-Pro-7B
Any-to-Any • Updated • 86.7k • 3.52k -
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text • 73B • Updated • 633k • • 555
-
Running553553
DeepSeek-R1 WebGPU
🧠Next-generation reasoning model that runs locally in-browser
-
Running9494
Qwen2.5-1M Demo
💻Upload documents and ask questions
-
mistralai/Mistral-Small-24B-Base-2501
24B • Updated • 14.8k • 258 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 14.5k • 166
-
Running176176
Qwen3 Omni Demo
⚡Interact with a multimodal chatbot using text, audio, images, or video
-
Running4646
Qwen3 Omni Captioner Demo
🐠Generate captions from audio
-
Qwen/Qwen3-Omni-30B-A3B-Thinking
Any-to-Any • 32B • Updated • 65.2k • 222 -
Qwen/Qwen3-Omni-30B-A3B-Instruct
Any-to-Any • 35B • Updated • 325k • 690
-
RunningMCP119119
Consilium MCP Server
🏢Multi-AI Expert Consensus Platform
-
SleepingMCP22
MCP Hackathon Deepfake Watchdog
🛡Upload your image and/or voice to scan for deepfake misuse o
-
Running3535
VulnBuster
🛡AI Security Agent: Multi-MCP Code Vulnerability Scanner
-
RunningMCP188188
AI Marketing Content Generator
🎨An AI-powered tool made for content creators and marketers
-
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition • Updated • 1.91M • 1.35k -
Running on Zero440440
Parakeet-TDT-0.6b-V2
Transcribe audio to text with timestamps
-
Running on CPU Upgrade3333
Blazing Fast Whisper
👁Blazing Fast Whisper Deployed on HF Inference Endpoints
-
Running on CPU Upgrade1.12k1.12k
Open ASR Leaderboard
🏆Display and request speech recognition model benchmarks
-
Running on T48181
RF-DETR
🔥SOTA real-time object detection model
-
Running on CPU Upgrade4949
YOLO ARENA
🏟compare performance of top object detectors
-
Running on Zero8888
D-Fine - SOTA Real-Time Object Detector
⚡Object Detection on Images and Video
-
Running on ZeroMCP2828
Gaze LLE
👀Gaze Target Estimation
-
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text • 0.6B • Updated • 54.3k • 217 -
Running on Zero8484
GOT OCR Transformers
📷Demo of GOT-OCR 2.0's Transformers implementation
-
allenai/olmOCR-7B-0225-preview
Image-to-Text • 8B • Updated • 11.3k • 703 -
allenai/olmOCR-mix-0225
Viewer • Updated • 259k • 1.12k • 167
-
Running on ZeroMCP537537
LatentSync
👄Audio Conditioned LipSync with Latent Diffusion Models
-
Running on Zero214214
BEN2
🚀Remove background from images and videos
-
Build error8181
SmolVLM
📊Generate answers by combining text and images
-
Runtime error5858
SmolVLM2 HighlightGenerator
🐨Generate video highlights from uploaded video
-
Running on CPU Upgrade11.4k11.4k
Stable Diffusion 2-1
🔥Generate images from text prompts
-
Running6060
SmolVLM 256M Instruct WebGPU
🐨Find answers by describing images
-
deepseek-ai/Janus-Pro-7B
Any-to-Any • Updated • 86.7k • 3.52k -
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text • 73B • Updated • 633k • • 555
-
Running553553
DeepSeek-R1 WebGPU
🧠Next-generation reasoning model that runs locally in-browser
-
Running9494
Qwen2.5-1M Demo
💻Upload documents and ask questions
-
mistralai/Mistral-Small-24B-Base-2501
24B • Updated • 14.8k • 258 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 14.5k • 166
-
onnx-community/Kokoro-82M-ONNX
Text-to-Speech • Updated • 13.3k • 152 -
Running218218
Kokoro Text-to-Speech
🗣High-quality speech synthesis powered by Kokoro TTS
-
NexaAI/Qwen2-Audio-7B-GGUF
Audio-Text-to-Text • 8B • Updated • 6.98k • 160 -
jonatasgrosman/wav2vec2-large-xlsr-53-english
Automatic Speech Recognition • 0.3B • Updated • 303k • 475