Sung-Feng Huang's picture

1 10 2

Sung-Feng Huang

sungfengh

·

https://sungfeng-huang.github.io

SungFeng-Huang

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

upvoted a paper 2 days ago

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

upvoted a paper 2 days ago

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

View all activity

Organizations

upvoted 3 papers 2 days ago

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published 3 days ago • 24

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

Paper • 2512.20927 • Published 24 days ago • 15

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published 3 days ago • 44

upvoted a paper 8 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 9 days ago • 191

upvoted a paper 18 days ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published 25 days ago • 25

upvoted a paper 26 days ago

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

Paper • 2512.17012 • Published 30 days ago • 43

upvoted a collection 3 months ago

Awesome papers from 臺大李宏毅 (Hung-yi Lee)

Recent papers authored by Hung-yi Lee. Sorted by ID • 8 items • Updated Oct 24, 2025 • 17

upvoted 2 papers 3 months ago

IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling

Paper • 2506.00736 • Published May 31, 2025 • 10

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published Jul 3, 2025 • 18

upvoted a collection 9 months ago

NeMo Audio Codecs

A series of Neural Audio Codecs • 8 items • Updated 1 day ago • 15