SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning • Paper • 2510.10047 • Published 16 days ago • 13
KnowledgeMath: Knowledge-Intensive Math Word Problem Solving in Finance Domains • Paper • 2311.09797 • Published Nov 16, 2023 • 1
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data • Paper • 2311.09805 • Published Nov 16, 2023 • 3
PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles • Paper • 2510.06475 • Published 19 days ago • 1
SUCEA: Reasoning-Intensive Retrieval for Adversarial Fact-checking through Claim Decomposition and Editing • Paper • 2506.04583 • Published Jun 5
MedMMV: A Controllable Multimodal Multi-Agent Framework for Reliable and Verifiable Clinical Reasoning • Paper • 2509.24314 • Published 28 days ago • 1
Superclass-Guided Representation Disentanglement for Spurious Correlation Mitigation • Paper • 2508.08570 • Published Aug 12
FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents • Paper • 2411.05764 • Published Nov 8, 2024