Heng-Jui Chang's picture

4 4 3

Heng-Jui Chang

vectominist

·

https://people.csail.mit.edu/hengjui/

AI & ML interests

Speech Processing, Multimodal Learning, Self-supervised Learning

Recent Activity

authored a paper 1 day ago

Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning

upvoted a collection 4 days ago

perception-encoder-audio-visual

liked a model 29 days ago

nvidia/audio-flamingo-3-hf

View all activity

Organizations

authored a paper 1 day ago

Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning

Paper • 2512.19687 • Published 5 days ago • 1

authored 3 papers 6 months ago

DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT

Paper • 2110.01900 • Published Oct 5, 2021

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model

Paper • 2210.00705 • Published Oct 3, 2022

USAD: Universal Speech and Audio Representation via Distillation

Paper • 2506.18843 • Published Jun 23 • 12

authored a paper over 2 years ago

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

Paper • 2305.10005 • Published May 17, 2023 • 3