Research2026-04-30
MINOS: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text
Source: Arxiv CS.AI
arXiv:2506.02494v2 Announce Type: replace-cross Abstract: Evaluation is important for multimodal generation tasks, while traditional multimodal evaluation metrics suffer from several limitations. With the rapid progress of MLLMs, there is growing interest in applying MLLMs to build general...
arxivpapersmultimodal