Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models Paper • 2503.09567 • Published Mar 12
Efficient Process Reward Model Training via Active Learning Paper • 2504.10559 • Published Apr 14 • 13
End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions Paper • 2311.09008 • Published Nov 15, 2023
OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation Paper • 2505.23885 • Published May 29
AI4Research: A Survey of Artificial Intelligence for Scientific Research Paper • 2507.01903 • Published Jul 2 • 4
Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages Paper • 2310.14799 • Published Oct 23, 2023
CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models Paper • 2505.19108 • Published May 25
AutoPR: Let's Automate Your Academic Promotion! Paper • 2510.09558 • Published 16 days ago • 49
Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in Large Language Models Paper • 2508.11582 • Published Aug 15 • 1
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures Paper • 2510.14616 • Published 11 days ago • 10
COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes Paper • 2510.14763 • Published 11 days ago • 13
ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model Paper • 2502.03325 • Published Feb 5 • 1
CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models Paper • 2412.12932 • Published Dec 17, 2024 • 1
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Paper • 2502.13092 • Published Feb 18 • 13
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models Paper • 2310.08582 • Published Oct 12, 2023 • 2
Through the Lens of Core Competency: Survey on Evaluation of Large Language Models Paper • 2308.07902 • Published Aug 15, 2023
M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought Paper • 2405.16473 • Published May 26, 2024
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model Paper • 2408.09559 • Published Aug 18, 2024
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices Paper • 2409.01893 • Published Sep 3, 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers Paper • 2404.04925 • Published Apr 7, 2024 • 1