Research2026-05-12
Investigating Anisotropy in Visual Grounding under Controlled Counterfactual Perturbations
Source: Arxiv CS.AI
arXiv:2605.09090v1 Announce Type: cross Abstract: Visual Grounding benchmarks assume that the object described by a referring expression is always present in the image, and grounding models are therefore rarely evaluated under semantically mismatched captions. In such cases, models frequently...
arxivpapers