Research2026-04-27
From Global to Local: Rethinking CLIP Feature Aggregation for Person Re-Identification
Source: Arxiv CS.AI
arXiv:2604.22190v1 Announce Type: cross Abstract: CLIP-based person re-identification (ReID) methods aggregate spatial features into a single global \texttt{[CLS]} token optimized for image-text alignment rather than spatial selectivity, making representations fragile under occlusion and...
arxivpapers