falamarcao's Collections
computer-use
Updated Jun 1
Computer Agent 🖥 • Paused • Featured • 981
Interact with an AI agent to perform web tasks
Jedi 🎯 • Running on Zero • Featured • 56
Select elements in images using text instructions
xlangai/Jedi-7B-1080p
Image-Text-to-Text • 8B • Updated Jun 18 • 114 • 29