Diversity-Incentivized Exploration for Versatile Reasoning Paper • 2509.26209 • Published Sep 30 • 16
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning Paper • 2505.19761 • Published May 26
Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision Paper • 2504.15046 • Published Apr 21
Attention-Guided Contrastive Role Representations for Multi-Agent Reinforcement Learning Paper • 2312.04819 • Published Dec 8, 2023