Research 2026-04-28
A systematic evaluation of vision-language models for observational astronomical reasoning tasks
Source: arXiv cs.AI
arXiv:2604.24589v1 (announce type: new)
Abstract: Vision-language models (VLMs) are increasingly proposed as general-purpose tools for scientific data interpretation, yet their reliability on real astronomical observations across diverse modalities remains untested. We present AstroVLBench, a...
arxiv, papers, reasoning, vision