Research2026-04-20
Zoom Consistency: A Free Confidence Signal in Multi-Step Visual Grounding Pipelines
Source: Arxiv CS.AI
arXiv:2604.15376v1 Announce Type: cross Abstract: Multi-step zoom-in pipelines are widely used for GUI grounding, yet the intermediate predictions they produce are typically discarded after coordinate remapping. We observe that these intermediate outputs contain a useful confidence signal for free:...
arxivpapers