Proceedings of the 40th International Conference on Machine Learning , articleno =

Biderman, Stella, Schoelkopf, Hailey, Anthony, Quentin, Bradley, Herbie, O'Brien, Kyle, Hallahan, Eric · 2023

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Polar probe linearly decodes semantic structures from LLMs

cs.CL · 2026-05-13 · unverdicted · novelty 6.0 · 2 refs

LLMs represent semantic relations geometrically via embedding distance and direction; a linear Polar Probe decodes these structures from middle-layer activations and generalizes to new entities.

On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference

cs.CR · 2026-05-06 · conditional · novelty 6.0

An attack aligns differently shuffled intermediate activations from secure Transformer inference queries to recover model weights with low error using roughly one dollar of queries.

citing papers explorer

Showing 2 of 2 citing papers.

Polar probe linearly decodes semantic structures from LLMs cs.CL · 2026-05-13 · unverdicted · none · ref 78 · 2 links
LLMs represent semantic relations geometrically via embedding distance and direction; a linear Polar Probe decodes these structures from middle-layer activations and generalizes to new entities.
On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference cs.CR · 2026-05-06 · conditional · none · ref 195
An attack aligns differently shuffled intermediate activations from secure Transformer inference queries to recover model weights with low error using roughly one dollar of queries.

Proceedings of the 40th International Conference on Machine Learning , articleno =

fields

years

verdicts

representative citing papers

citing papers explorer