Guilherme34/Samantha-omni
Any-to-Any
Transformers
Safetensors
openbmb/RLAIF-V-Dataset
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
arxiv:2408.01800
Samantha-omni at f2c0120 | 65.6 MB | 1 contributor | History: 14 commits
Latest commit: Guilherme34 | Upload assets/input_examples/assistant_male_voice.wav with huggingface_hub | f2c0120 (verified) | 3 months ago
assets | Upload assets/input_examples/assistant_male_voice.wav with huggingface_hub | 3 months ago
.gitattributes | 1.82 kB | Upload assets/input_examples/Trump_WEF_2018_10s.mp3 with huggingface_hub | 3 months ago
README.md | Safe | 50.4 kB | Upload README.md with huggingface_hub | 3 months ago
added_tokens.json | Safe | 1.41 kB | Upload added_tokens.json with huggingface_hub | 3 months ago