Research2026-05-14
STAR: Semantic-Temporal Adaptive Representation Learning for Few-Shot Action Recognition
Source: Arxiv CS.AI
arXiv:2605.13202v1 Announce Type: cross Abstract: Few-shot action recognition (FSAR) requires models to generalize to novel action categories from only a handful of annotated samples. Despite progress with vision-language models, existing approaches still suffer from semantic-temporal misalignment,...
arxivpapers