quangdq's picture

13 2

quangdq

kaidduong

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Trace Anything: Representing Any Video in 4D via Trajectory Fields

upvoted a paper 21 days ago

TTT3R: 3D Reconstruction as Test-Time Training

upvoted a paper about 1 month ago

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Trace Anything: Representing Any Video in 4D via Trajectory Fields

Paper • 2510.13802 • Published 11 days ago • 30

upvoted a paper 21 days ago

TTT3R: 3D Reconstruction as Test-Time Training

Paper • 2509.26645 • Published 26 days ago • 14

upvoted 6 papers about 1 month ago

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Paper • 2509.09595 • Published Sep 11 • 48

InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis

Paper • 2509.10441 • Published Sep 12 • 30

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 258

DINOv3

Paper • 2508.10104 • Published Aug 13 • 274

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10 • 126

From Editor to Dense Geometry Estimator

Paper • 2509.04338 • Published Sep 4 • 91

upvoted 4 papers about 2 months ago

GUAVA: Generalizable Upper Body 3D Gaussian Avatar

Paper • 2505.03351 • Published May 6 • 2

Planning with Reasoning using Vision Language World Model

Paper • 2509.02722 • Published Sep 2 • 22

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation

Paper • 2509.00428 • Published Aug 30 • 17

Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

Paper • 2509.04406 • Published Sep 4 • 11

upvoted a paper 2 months ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published Aug 6 • 57