Compactor: Calibrated Query-Agnostic

Vivek Chari, Benjamin Van Durme , year = · 2025 · arXiv 2507.08143

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Still: Amortized KV Cache Compaction in a Single Forward Pass

cs.LG · 2026-06-05 · unverdicted · novelty 6.0

Still is an amortized per-layer Perceiver that synthesizes compact KV caches in one forward pass, outperforming selection and per-context baselines on RULER, HELMET, and LongBench at 8-200x compression.

Cartridges at Scale: Training Modular KV Caches over Large Document Collections

cs.CL · 2026-06-03 · unverdicted · novelty 6.0

CAS trains composable per-document KV cache cartridges via dynamic distractor mixing and a rotating budget manager, scaling to million-token collections with 10-31 point gains over monolithic cartridges and matching RAG at 3-4x lower token cost.

Value-Aware Stochastic KV Cache Eviction for Reasoning Models

cs.LG · 2026-06-02 · unverdicted · novelty 6.0

VaSE improves KV cache eviction accuracy for reasoning models by over 4% versus prior eviction methods at 4x compression through value-magnitude protection and stochastic diversity.

Rethinking LoRA Memory Through the Lens of KV Cache Compression

cs.CL · 2026-06-04 · unverdicted · novelty 5.0

Document LoRA acts as decoding-time parametric memory that recovers 13-21 ROUGE-L points under heavy KV cache compression in QA, performing best when the base model encodes the document and the adapter is used only at generation with QA supervision.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Still: Amortized KV Cache Compaction in a Single Forward Pass cs.LG · 2026-06-05 · unverdicted · none · ref 14
Still is an amortized per-layer Perceiver that synthesizes compact KV caches in one forward pass, outperforming selection and per-context baselines on RULER, HELMET, and LongBench at 8-200x compression.
Value-Aware Stochastic KV Cache Eviction for Reasoning Models cs.LG · 2026-06-02 · unverdicted · none · ref 39
VaSE improves KV cache eviction accuracy for reasoning models by over 4% versus prior eviction methods at 4x compression through value-magnitude protection and stochastic diversity.

Compactor: Calibrated Query-Agnostic

fields

years

verdicts

representative citing papers

citing papers explorer