BeClaude
Research | 2026-05-14

Skill-Conditioned Visual Geolocation for Vision-Language Models

Source: arXiv cs.AI

arXiv:2604.09025v2
Announce Type: replace-cross

Abstract: Vision-language models (VLMs) have shown a promising ability in image geolocation, but they still lack structured geographic reasoning and the capacity for autonomous self-evolution. Existing methods predominantly rely on implicit parametric...

arxiv, papers, vision