Research2026-05-14
MMSkills: Towards Multimodal Skills for General Visual Agents
Source: Arxiv CS.AI
arXiv:2605.13527v1 Announce Type: new Abstract: Reusable skills have become a core substrate for improving agent capabilities, yet most existing skill packages encode reusable behavior primarily as textual prompts, executable code, or learned routines. For visual agents, however, procedural...
arxivpapersagentsmultimodal