Emergent linear representations in world models of self-supervised sequence models

10 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Predicting Where Steering Vectors Succeed

cs.LG · 2026-04-16 · unverdicted · novelty 6.0

The Linear Accessibility Profile, computed from unembedding projections of intermediate states, predicts steering vector effectiveness and optimal layers with Spearman correlations of 0.86 to 0.91 across multiple models and concepts.
