pith. machine review for the scientific record. sign in

Keep the cost down: A review on methods to optimize llm’s kv-cache consumption

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

years

2026 8 2025 1

representative citing papers

Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective

cs.LG · 2026-04-28 · unverdicted · novelty 7.0

KV cache eviction is unified under an information capacity maximization principle derived from a linear-Gaussian attention surrogate, with CapKV proposed as a leverage-score based implementation that outperforms prior heuristics in experiments.

citing papers explorer

Showing 9 of 9 citing papers.