Locating and editing factual associations in gpt

Kevin Meng, David Bau, Alex Andonian, Yonatan Belinkov · 2023

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

SteeringDiffusion: A Bottlenecked Activation Control Interface for Diffusion Models

cs.CV · 2026-05-03 · unverdicted · novelty 7.0

SteeringDiffusion supplies a bottlenecked, prompt-conditioned activation interface for frozen diffusion models that delivers smooth monotonic content-style control via one runtime scalar and timestep gating.

Linear Representations of Sentiment in Large Language Models

cs.LG · 2023-10-23 · unverdicted · novelty 6.0

Sentiment is represented as a single linear direction in LLM activation space that is causally relevant across tasks and is summarized at punctuation and names in addition to charged words.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Linear Representations of Sentiment in Large Language Models cs.LG · 2023-10-23 · unverdicted · none · ref 104
Sentiment is represented as a single linear direction in LLM activation space that is causally relevant across tasks and is summarized at punctuation and names in addition to charged words.

Locating and editing factual associations in gpt

fields

years

verdicts

representative citing papers

citing papers explorer