BeClaude
Research2026-05-06

STAGE: A Full-Screenplay Benchmark for Reasoning over Evolving Storie

Source: Arxiv CS.AI

arXiv:2601.08510v3 Announce Type: replace-cross Abstract: Movie screenplays are rich long-form narratives that interleave complex character relationships, temporally ordered events, and dialogue-driven interactions. While prior benchmarks target individual subtasks such as question answering or...

arxivpapersreasoningbenchmark