Steering Autoregressive Music Generation with Recursive Feature Machines Paper • 2510.19127 • Published Oct 21
Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity Paper • 2505.11107 • Published May 16
DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models Paper • 2504.02882 • Published Apr 2
Communicate to Play: Pragmatic Reasoning for Efficient Cross-Cultural Communication in Codenames Paper • 2408.04900 • Published Aug 9, 2024
Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning Paper • 2504.17950 • Published Apr 24