Compressed Sensing for Capability Localization in Large Language Models

Anna Bair; J. Zico Kolter; Mingjie Sun; Yixuan Even Xu

arxiv: 2603.03335 · v2 · pith:PO2EHLWZnew · submitted 2026-02-11 · 💻 cs.CL

Compressed Sensing for Capability Localization in Large Language Models

Anna Bair , Yixuan Even Xu , Mingjie Sun , J. Zico Kolter This is my paper

classification 💻 cs.CL

keywords capabilitiesmodelscapabilitycodeheadslanguagecompressedgeneration

0 comments

read the original abstract

Large language models (LLMs) exhibit a wide range of capabilities, including mathematical reasoning, code generation, and linguistic behaviors. We show that Transformer architectures contain small subsets of attention heads that are necessary for certain capabilities. Zeroing out as few as five task-specific heads can degrade performance by up to $60\%$ on standard benchmarks measuring the capability of interest, while largely preserving performance on unrelated tasks. We introduce a compressed sensing-based method that exploits the sparsity of these heads to identify them via strategic knockouts and a small number of model evaluations. We validate these findings across Llama and Qwen models ranging from 1B to 14B parameters and a diverse set of capabilities including mathematical abilities and code generation, revealing a modular organization in which specialized capabilities are dependent on sparse, functionally distinct components. Overall, our results suggest that capability localization is a general organizational principle of Transformer language models, with implications for interpretability, model editing, and AI safety. Code is released at https://github.com/locuslab/llm-components.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

See What I See, Know What I Think: Dense Latent Communication Across Heterogeneous Agents
cs.MA 2026-06 unverdicted novelty 7.0

Heterogeneous agents achieve dense latent KV-cache communication via lightweight cross-model transformation and two-phase training, outperforming text at lower compute in context-aware settings and enabling context-un...
Ablation-Reversible Heads Don't Transfer: A Stress Test for Mechanistic Role Claims in Transformers
cs.AI 2026-06 unverdicted novelty 6.0

Standard tests for mechanistic roles in transformer attention heads are insufficient because heads that pass them fail to transfer computations across prompts under matched controls.