Research 2026-05-14
MobiBench: Multi-Branch, Modular Benchmark for Mobile GUI Agents
Source: arXiv cs.AI
arXiv:2512.12634v3 Announce Type: replace Abstract: Mobile GUI Agents, AI agents capable of interacting with mobile applications on behalf of users, have the potential to transform human-computer interaction. However, current evaluation practices for GUI agents face two fundamental limitations....
arxiv, papers, agents, benchmark