BeClaude
Research2026-04-28

StoryTR: Narrative-Centric Video Temporal Retrieval with Theory of Mind Reasoning

Source: Arxiv CS.AI

arXiv:2604.23198v1 Announce Type: new Abstract: Current video moment retrieval excels at action-centric tasks but struggles with narrative content. Models can see \textit{what is happening} but fail to reason \textit{why it matters}. This semantic gap stems from the lack of \textbf{Theory of Mind...

arxivpapersreasoning