Policy2026-05-05
Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding
Source: Arxiv CS.AI
arXiv:2605.00642v1 Announce Type: new Abstract: Graphical User Interface (GUI) grounding maps natural language instructions to the visual coordinates of target elements and serves as a core capability for autonomous GUI agents. Recent reinforcement learning methods (e.g., GRPO) have achieved strong...
arxivpapers