Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Guilherme34
/
Samantha-omni
like
3
Any-to-Any
Transformers
Safetensors
openbmb/RLAIF-V-Dataset
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
arxiv:
2408.01800
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
f2c0120
Samantha-omni
/
assets
/
input_examples
764 kB
1 contributor
History:
4 commits
Guilherme34
Upload assets/input_examples/assistant_male_voice.wav with huggingface_hub
f2c0120
verified
3 months ago
Trump_WEF_2018_10s.mp3
161 kB
xet
Upload assets/input_examples/Trump_WEF_2018_10s.mp3 with huggingface_hub
3 months ago
assistant_default_female_voice.wav
Safe
224 kB
xet
Upload assets/input_examples/assistant_default_female_voice.wav with huggingface_hub
3 months ago
assistant_female_voice.wav
Safe
235 kB
xet
Upload assets/input_examples/assistant_female_voice.wav with huggingface_hub
3 months ago
assistant_male_voice.wav
144 kB
xet
Upload assets/input_examples/assistant_male_voice.wav with huggingface_hub
3 months ago