BeClaude
Research | 2026-04-17

UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding

Source: arXiv cs.AI

arXiv:2604.14113v1 (announce type: cross)

Abstract: GUI grounding, which localizes interface elements from screenshots given natural language queries, remains challenging for small icons and dense layouts. Test-time zoom-in methods improve localization by cropping and re-running inference at higher...
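As a rough illustration of the generic crop-and-re-run idea mentioned in the abstract, the sketch below shows only the coordinate bookkeeping: choosing a crop window around a first-pass prediction and mapping a point predicted on the upscaled crop back to the full screenshot. It is not the uncertainty-driven procedure of UI-Zoomer, which the truncated abstract does not describe. The function names `crop_window` and `to_full_image`, the fixed crop size, and the single `scale` factor are all assumptions made for this example.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (left, top, right, bottom) in pixels
Point = Tuple[float, float]


def crop_window(point: Point, image_size: Point, crop_size: Point) -> Box:
    """Hypothetical helper: a crop_size window centered on `point`,
    shifted as needed so it stays inside the image."""
    (x, y), (img_w, img_h) = point, image_size
    # Never request a crop larger than the image itself.
    crop_w, crop_h = min(crop_size[0], img_w), min(crop_size[1], img_h)
    left = min(max(x - crop_w / 2, 0), img_w - crop_w)
    top = min(max(y - crop_h / 2, 0), img_h - crop_h)
    return (left, top, left + crop_w, top + crop_h)


def to_full_image(point_in_crop: Point, window: Box, scale: float) -> Point:
    """Hypothetical helper: map a point predicted on the upscaled crop
    back to full-screenshot coordinates."""
    px, py = point_in_crop
    left, top, _, _ = window
    # Undo the upscaling, then undo the crop offset.
    return (left + px / scale, top + py / scale)


# First-pass guess near the top-right corner of a 1920x1080 screenshot.
window = crop_window((1900, 50), (1920, 1080), (400, 400))
# Suppose the second pass, run on the crop upscaled 2x, predicts (760, 100).
refined = to_full_image((760, 100), window, scale=2.0)
```

In a real pipeline the second-pass prediction would come from re-running the grounding model on the cropped, upscaled region; here it is a hard-coded value so that only the geometry is exercised.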

arxiv, papers