Research2026-04-22
A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding
Source: Arxiv CS.AI
arXiv:2604.19689v1 Announce Type: new Abstract: Understanding artworks requires multi-step reasoning over visual content and cultural, historical, and stylistic context. While recent multimodal large language models show promise in artwork explanation, they rely on implicit reasoning and...
arxivpapersagentsmultimodal