Computer Use Agent Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 26
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 26
Learning from examples - training/inference ExGRPO: Learning to Reason from Experience Paper • 2510.02245 • Published 21 days ago • 76 A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 22 days ago • 5 Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 17 days ago • 104 MixReasoning: Switching Modes to Think Paper • 2510.06052 • Published 16 days ago • 21
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 22 days ago • 5
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 17 days ago • 104
Computer Use Agent Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 26
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents Paper • 2504.00906 • Published Apr 1 • 26
Learning from examples - training/inference ExGRPO: Learning to Reason from Experience Paper • 2510.02245 • Published 21 days ago • 76 A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 22 days ago • 5 Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 17 days ago • 104 MixReasoning: Switching Modes to Think Paper • 2510.06052 • Published 16 days ago • 21
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published 22 days ago • 5
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 17 days ago • 104