Research2026-05-01
TransVLM: A Vision-Language Framework and Benchmark for Detecting Any Shot Transitions
Source: Arxiv CS.AI
arXiv:2604.27975v1 Announce Type: cross Abstract: Traditional Shot Boundary Detection (SBD) inherently struggles with complex transitions by formulating the task around isolated cut points, frequently yielding corrupted video shots. We address this fundamental limitation by formalizing the Shot...
arxivpapersbenchmarkvision