nguyenvulebinh/AVSRCocktail
Automatic Speech Recognition
Transformers
Safetensors
PyTorch
nguyenvulebinh/AVYT
English
avhubert_avsr
audio-visual-speech-recognition
multimodal
speech-recognition
lip-reading
cocktail-party
noise-robust
av-hubert
transformer
audio
video
english
lrs2
voxceleb2
ctc
attention
beam-search
multi-speaker
noisy-speech
arxiv:2506.02178
main
AVSRCocktail
Commit History
Update README.md
ae29b16
verified
nguyenvulebinh
committed on
Jul 7, 2025
Upload AVHubertAVSR
67bfcfe
verified
nguyenvulebinh
committed on
Jul 4, 2025
initial commit
db84d10
verified
nguyenvulebinh
committed on
Jul 4, 2025