BeClaude
Research2026-05-12

Investigating Anisotropy in Visual Grounding under Controlled Counterfactual Perturbations

Source: Arxiv CS.AI

arXiv:2605.09090v1 Announce Type: cross Abstract: Visual Grounding benchmarks assume that the object described by a referring expression is always present in the image, and grounding models are therefore rarely evaluated under semantically mismatched captions. In such cases, models frequently...

arxivpapers