GemFilter: Discovering Gems in Early Layers for Accelerated Long-Context LLMs,

· 2024 · arXiv 2409.17422

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

StructKV: Preserving the Structural Skeleton for Scalable Long-Context Inference

cs.CL · 2026-04-08 · unverdicted · novelty 6.0

StructKV compresses LLM KV caches by tracking global in-degree centrality across network depth and dynamically selecting compression layers to preserve long-range dependencies better than local pruning methods.

Correctness-Aware Repository Filtering Under Maximum Effective Context Window Constraints

cs.SE · 2026-05-14 · unverdicted · novelty 5.0

A pre-execution size filter cuts repository tokens by 80-89% at sub-millisecond cost and raises file-level accuracy from 25% to 72% in a small CodeLlama evaluation.

citing papers explorer

Showing 2 of 2 citing papers.

StructKV: Preserving the Structural Skeleton for Scalable Long-Context Inference cs.CL · 2026-04-08 · unverdicted · none · ref 15
StructKV compresses LLM KV caches by tracking global in-degree centrality across network depth and dynamically selecting compression layers to preserve long-range dependencies better than local pruning methods.
Correctness-Aware Repository Filtering Under Maximum Effective Context Window Constraints cs.SE · 2026-05-14 · unverdicted · none · ref 19
A pre-execution size filter cuts repository tokens by 80-89% at sub-millisecond cost and raises file-level accuracy from 25% to 72% in a small CodeLlama evaluation.

GemFilter: Discovering Gems in Early Layers for Accelerated Long-Context LLMs,

fields

years

verdicts

representative citing papers

citing papers explorer