Research2026-04-22
Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic
Source: Arxiv CS.AI
arXiv:2604.19567v1 Announce Type: new Abstract: Reinforcement learning (RL) as post-training is crucial for enhancing the reasoning ability of large language models (LLMs) in coding and math. However, their capacity for visual semantic arithmetic, inferring relationships from images, remains...
arxivpapersreasoning