Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published 12 days ago • 157
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Paper • 2510.02283 • Published 23 days ago • 91
LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling Paper • 2510.06915 • Published 17 days ago • 14
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer Paper • 2510.06590 • Published 18 days ago • 69
Revisiting Long-context Modeling from Context Denoising Perspective Paper • 2510.05862 • Published 18 days ago • 20
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published 29 days ago • 176
MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks Paper • 2509.14638 • Published Sep 18 • 11
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization Paper • 2509.13313 • Published Sep 16 • 78
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents Paper • 2509.13309 • Published Sep 16 • 66
Towards General Agentic Intelligence via Environment Scaling Paper • 2509.13311 • Published Sep 16 • 70
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning Paper • 2509.13305 • Published Sep 16 • 88
WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research Paper • 2509.13312 • Published Sep 16 • 104
FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark Paper • 2509.09680 • Published Sep 11 • 42
LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework Paper • 2507.04723 • Published Jul 7 • 10
OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion Paper • 2507.06165 • Published Jul 8 • 57