Research2026-05-06
IPS: In-Prompt Process Supervision for Short Video Content Moderation
Source: Arxiv CS.AI
arXiv:2412.15251v3 Announce Type: replace-cross Abstract: Multimodal large language models (MLLMs) are effective at capturing the semantics of short video content; however, they often fail to attend to the policy-specific details required for reliable content moderation. To address this limitation,...
arxivpaperspromptingvision