Self-Evaluation Unlocks Any-Step Text-to-Image Generation Paper • 2512.22374 • Published 7 days ago • 14
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published Oct 27, 2025 • 177
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 176
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements Paper • 2506.22419 • Published Jun 27, 2025 • 15
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models Paper • 2507.07104 • Published Jul 9, 2025 • 45
Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation Paper • 2507.08441 • Published Jul 11, 2025 • 61
Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation Paper • 2507.08441 • Published Jul 11, 2025 • 61
Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation Paper • 2507.08441 • Published Jul 11, 2025 • 61 • 2
Holistic Tokenizer for Autoregressive Image Generation Paper • 2507.02358 • Published Jul 3, 2025 • 4
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning Paper • 2503.06960 • Published Mar 10, 2025 • 3
"Principal Components" Enable A New Language of Images Paper • 2503.08685 • Published Mar 11, 2025 • 12
"Principal Components" Enable A New Language of Images Paper • 2503.08685 • Published Mar 11, 2025 • 12
"Principal Components" Enable A New Language of Images Paper • 2503.08685 • Published Mar 11, 2025 • 12 • 2
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning Paper • 2503.06960 • Published Mar 10, 2025 • 3
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning Paper • 2503.06960 • Published Mar 10, 2025 • 3 • 2