WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment Paper โข 2512.12692 โข Published 14 days ago โข 13
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning Paper โข 2512.03244 โข Published 26 days ago โข 16