Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views Paper • 2510.18632 • Published 5 days ago • 20
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding Paper • 2508.02215 • Published Aug 4 • 12
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5 • 70
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs Paper • 2505.12929 • Published May 19 • 3