SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting Paper • 2512.07197 • Published 3 days ago • 2
TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels Paper • 2512.08358 • Published 1 day ago • 2
Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation Paper • 2512.08186 • Published 2 days ago • 3
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform Paper • 2512.08478 • Published 1 day ago • 65
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper • 2512.04987 • Published 6 days ago • 71
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators Paper • 2512.06963 • Published 3 days ago • 2
ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation Paper • 2512.03621 • Published 8 days ago • 8
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image Paper • 2512.05044 • Published 6 days ago • 15
NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation Paper • 2512.05106 • Published 6 days ago • 15
Generative Neural Video Compression via Video Diffusion Prior Paper • 2512.05016 • Published 6 days ago • 8
SIMA 2: A Generalist Embodied Agent for Virtual Worlds Paper • 2512.04797 • Published 7 days ago • 17
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling Paper • 2512.03000 • Published 8 days ago • 34
SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization Paper • 2512.02631 • Published 9 days ago • 7
On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral Paper • 2512.04220 • Published 7 days ago • 11
SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment Paper • 2512.02807 • Published 8 days ago • 7
RELIC: Interactive Video World Model with Long-Horizon Memory Paper • 2512.04040 • Published 7 days ago • 22
Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach Paper • 2512.02834 • Published 8 days ago • 39
GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies Paper • 2512.02581 • Published 9 days ago • 13
Visual Sync: Multi-Camera Synchronization via Cross-View Object Motion Paper • 2512.02017 • Published 9 days ago • 3