
Heng-Jui Chang

vectominist
https://people.csail.mit.edu/hengjui/
  • hjchang87
  • vectominist

AI & ML interests

Speech Processing, Multimodal Learning, Self-supervised Learning

Recent Activity

authored a paper 1 day ago
Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning
upvoted a collection 4 days ago
perception-encoder-audio-visual
liked a model 29 days ago
nvidia/audio-flamingo-3-hf

Organizations

  • Massachusetts Institute of Technology
  • ESPnet
  • NTU Speech Processing & Machine Learning Lab
  • s3prl
  • Meta Llama
  • Spoken Language Systems

authored a paper 1 day ago

Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning

Paper • 2512.19687 • Published 5 days ago • 1
authored 3 papers 6 months ago

DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT

Paper • 2110.01900 • Published Oct 5, 2021

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

Paper • 2210.00705 • Published Oct 3, 2022

USAD: Universal Speech and Audio Representation via Distillation

Paper • 2506.18843 • Published Jun 23 • 12
authored a paper over 2 years ago

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

Paper • 2305.10005 • Published May 17, 2023 • 3