Research | 2026-05-12
One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory
Source: arXiv cs.AI
arXiv:2505.23617v3 (announce type: replace-cross)
Abstract: Effective video tokenization is critical for scaling transformer models for long videos. Current approaches tokenize videos using space-time patches, leading to excessive tokens and computational inefficiencies. The best token reduction...
arxiv, papers