Analyzing the structure of attention in a transformer language model

Vig, Jesse, Belinkov, Yonatan · 2019 · DOI 10.18653/v1/w19-4808

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Power-Softmax: Towards Secure LLM Inference over Encrypted Data

cs.LG · 2024-10-12 · unverdicted · novelty 7.0

Power-Softmax is a new HE-compatible attention variant that permits training and inference of billion-parameter polynomial LLMs with performance matching standard transformers.

LMs as Task-Specific Knowledge Bases: An Interpretability Analysis

cs.CL · 2026-06-25 · unverdicted · novelty 6.0

LMs store facts in task-specific parameter subsets, shown by inconsistent emergence across tasks during training and distinct localized parameters for the same fact.

G-Long: Graph-Enhanced Memory Management for Efficient Long-Term Dialogue Agents

cs.CL · 2026-06-11 · unverdicted · novelty 5.0

G-Long uses graph-enhanced triplet memory and attention-aware scoring from a T5 summarizer to achieve up to 9.8% better response quality on MSC and 40.8% better retrieval recall on LME with lower overhead.

Discovering Millions of Interpretable Features with Sparse Autoencoders

cs.LG · 2026-06-25 · unverdicted · novelty 3.0

Trains and releases SAEs for Qwen3-1.7B/4B/8B models with layer-wise coverage and demonstrates causal steering of refusal via selected features.

citing papers explorer

Showing 4 of 4 citing papers.

Power-Softmax: Towards Secure LLM Inference over Encrypted Data · cs.LG · 2024-10-12 · unverdicted · none · ref 33
LMs as Task-Specific Knowledge Bases: An Interpretability Analysis · cs.CL · 2026-06-25 · unverdicted · none · ref 57
G-Long: Graph-Enhanced Memory Management for Efficient Long-Term Dialogue Agents · cs.CL · 2026-06-11 · unverdicted · none · ref 44
Discovering Millions of Interpretable Features with Sparse Autoencoders · cs.LG · 2026-06-25 · unverdicted · none · ref 86
