view article Article Back to The Future: Evaluating AI Agents on Predicting Future Events Jul 17 • 44
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! Jun 6 • 54
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published May 23 • 81
view article Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova • May 16 • 30