HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published 2 days ago • 15
Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations Paper • 2512.21004 • Published 3 days ago • 10
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published 9 days ago • 63
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 4 days ago • 60
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing Paper • 2512.17909 • Published 7 days ago • 35
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 8 days ago • 78
REGLUE Your Latents with Global and Local Semantics for Entangled Diffusion Paper • 2512.16636 • Published 8 days ago • 25
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 10 days ago • 41
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing Paper • 2512.14681 • Published 10 days ago • 39
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 10 days ago • 64
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 11 days ago • 95
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 11 days ago • 70
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 18 days ago • 110
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder Paper • 2512.11749 • Published 14 days ago • 36
MagicWorld: Interactive Geometry-driven Video World Exploration Paper • 2511.18886 • Published Nov 24 • 19
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper • 2512.10881 • Published 15 days ago • 29
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published 17 days ago • 70