Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 70
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning Paper • 2505.12370 • Published May 18
UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning Paper • 2505.12493 • Published May 18
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices Paper • 2406.08451 • Published Jun 12, 2024 • 25
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior Paper • 2506.08012 • Published Jun 9 • 7