BeClaude
Research2026-04-30

Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective

Source: Arxiv CS.AI

arXiv:2604.25975v1 Announce Type: cross Abstract: Key-value (KV) caching is essential for large language model inference, yet its memory overhead poses a critical bottleneck for long-context generation. Existing eviction policies predominantly rely on empirical heuristics, lacking a rigorous...

arxivpapers