BeClaude
Research2026-05-14

MMSkills: Towards Multimodal Skills for General Visual Agents

Source: Arxiv CS.AI

arXiv:2605.13527v1 Announce Type: new Abstract: Reusable skills have become a core substrate for improving agent capabilities, yet most existing skill packages encode reusable behavior primarily as textual prompts, executable code, or learned routines. For visual agents, however, procedural...

arxivpapersagentsmultimodal