2 25

zhijie deng PRO

zhijie3

https://thudzj.github.io/

thudzj

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

published a Space 7 days ago

zhijie3/think-then-generate

updated a Space 7 days ago

zhijie3/think-then-generate

View all activity

Organizations

upvoted a paper 3 days ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published 4 days ago • 61

upvoted a paper 11 days ago

LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

Paper • 2512.16229 • Published 16 days ago • 15

upvoted 2 papers 16 days ago

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Paper • 2512.14681 • Published 17 days ago • 39

DEER: Draft with Diffusion, Verify with Autoregressive Models

Paper • 2512.15176 • Published 17 days ago • 41

upvoted a paper about 1 month ago

Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

Paper • 2511.16175 • Published Nov 20, 2025 • 12

upvoted a paper 2 months ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published Oct 20, 2025 • 122

upvoted 2 papers 5 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 145

Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing

Paper • 2508.09192 • Published Aug 8, 2025 • 30

upvoted a paper 6 months ago

Scaling Speculative Decoding with Lookahead Reasoning

Paper • 2506.19830 • Published Jun 24, 2025 • 12

upvoted 3 papers 7 months ago

LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks

Paper • 2506.00411 • Published May 31, 2025 • 31

Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions

Paper • 2505.19949 • Published May 26, 2025 • 16

Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition

Paper • 2505.19788 • Published May 26, 2025 • 13

upvoted 2 papers 9 months ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published Apr 21, 2025 • 47

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published Apr 1, 2025 • 67

upvoted a paper 10 months ago

SIFT: Grounding LLM Reasoning in Contexts via Stickers

Paper • 2502.14922 • Published Feb 19, 2025 • 32

upvoted 4 papers 11 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13, 2025 • 191

Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

Paper • 2502.05415 • Published Feb 8, 2025 • 20

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Paper • 2502.06155 • Published Feb 10, 2025 • 10

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6, 2025 • 51

upvoted a paper 12 months ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16, 2025 • 71

zhijie deng PRO

AI & ML interests

Recent Activity

Organizations

zhijie3's activity