Research | 2026-04-28
StoryTR: Narrative-Centric Video Temporal Retrieval with Theory of Mind Reasoning
Source: arXiv cs.AI
arXiv:2604.23198v1 (new submission). Abstract: Current video moment retrieval excels at action-centric tasks but struggles with narrative content. Models can see what is happening but fail to reason why it matters. This semantic gap stems from the lack of Theory of Mind...
Tags: arxiv, papers, reasoning