Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published 5 days ago • 104
Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues Paper • 2510.19028 • Published 4 days ago • 6
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 4 days ago • 90
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published 4 days ago • 100
LightReasoner: Can Small Language Models Teach Large Language Models Reasoning? Paper • 2510.07962 • Published 17 days ago • 8
LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild Paper • 2510.14240 • Published 10 days ago • 11
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published 19 days ago • 91
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published 12 days ago • 165
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 20 days ago • 443
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published 12 days ago • 157
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions Paper • 2510.10666 • Published 14 days ago • 27