Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization Paper • 2511.14846 • Published 22 days ago
Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning Paper • 2410.03103 • Published Oct 4, 2024 • 9
XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts Paper • 2404.15247 • Published Apr 23, 2024 • 3