LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 81
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models Paper • 2402.13064 • Published Feb 20, 2024 • 50
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token Paper • 2405.13792 • Published May 22, 2024 • 1
Preference Optimization for Reasoning with Pseudo Feedback Paper • 2411.16345 • Published Nov 25, 2024 • 1
RedStone: Curating General, Code, Math, and QA Data for Large Language Models Paper • 2412.03398 • Published Dec 4, 2024 • 2
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective Paper • 2501.11110 • Published Jan 19 • 4
WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale Paper • 2502.16684 • Published Feb 23 • 1
Think Only When You Need with Large Hybrid-Reasoning Models Paper • 2505.14631 • Published May 20 • 20
QueST: Incentivizing LLMs to Generate Difficult Problems Paper • 2510.17715 • Published 6 days ago • 29
QueST: Incentivizing LLMs to Generate Difficult Problems Paper • 2510.17715 • Published 6 days ago • 29