(13) and (14) gives ∥E∥2 ≤V e2ε −1 +δ v, whereε=Qδ k/√dh

Combing Eqn · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Latent-Condensed Transformer for Efficient Long Context Modeling

cs.CL · 2026-04-14 · unverdicted · novelty 6.0

Latent-Condensed Attention condenses context in MLA's latent space via query-aware semantic pooling and positional anchor selection, delivering up to 2.5x prefilling speedup and 90% KV cache reduction at 128K length with a length-independent error bound.

citing papers explorer

Showing 1 of 1 citing paper.

Latent-Condensed Transformer for Efficient Long Context Modeling cs.CL · 2026-04-14 · unverdicted · none · ref 9
Latent-Condensed Attention condenses context in MLA's latent space via query-aware semantic pooling and positional anchor selection, delivering up to 2.5x prefilling speedup and 90% KV cache reduction at 128K length with a length-independent error bound.

(13) and (14) gives ∥E∥2 ≤V e2ε −1 +δ v, whereε=Qδ k/√dh

fields

years

verdicts

representative citing papers

citing papers explorer