Research | 2026-05-12

Investigating Anisotropy in Visual Grounding under Controlled Counterfactual Perturbations

arXiv:2605.09090v1 | Announce Type: cross

Abstract: Visual Grounding benchmarks assume that the object described by a referring expression is always present in the image, and grounding models are therefore rarely evaluated under semantically mismatched captions. In such cases, models frequently...

Read Original Article on arXiv cs.AI