Research 2026-05-12
Key-Value Means
Source: arXiv cs.AI
arXiv:2605.09877v1 (announce type: cross)
Abstract: We present Key-Value Means ("KVM"), a novel block-recurrence for attention that can accommodate either fixed-size or growing state. Equipping a strong transformer baseline with fixed-size KVM attention layers yields a strong $O(N)$ chunked RNN,...