You Don't Know Until You Click: Automated GUI Testing for Production-Ready Software Evaluation Paper • 2508.14104 • Published Aug 17 • 1
Don't Just Fine-tune the Agent, Tune the Environment Paper • 2510.10197 • Published 22 days ago • 28 • 3
Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution Paper • 2509.21072 • Published Sep 25 • 15
PreAct: Predicting Future in ReAct Enhances Agent's Planning Ability Paper • 2402.11534 • Published Feb 18, 2024 • 1