Dynamics of transient structure in in-context linear regression transformers

Liam Carroll, Jesse Hoogland, Matthew Farrugia-Roberts, Daniel Murfet · 2025 · arXiv 2501.17745

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior

cs.LG · 2026-05-06 · unverdicted · novelty 7.0

Manifold steering along activation geometry induces behavioral trajectories matching the natural manifold of outputs, while linear steering produces off-manifold unnatural behaviors.

Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers

cs.LG · 2026-05-05 · unverdicted · novelty 7.0

In a controlled synthetic setting, transformers implement in-distribution task inference via convex combinations of task vectors and out-of-distribution inference via nearly orthogonal extrapolative representations.

Temporal Task Diversity: Inductive Biases Under Non-Stationarity in Synthetic Sequence Modelling

cs.LG · 2026-05-18 · unverdicted · novelty 6.0

Temporal diversity in task distribution during training increases generalization bias over memorization in transformers for in-context linear regression.

Structure and Scale in Simplicial Sequence Modelling

cs.LG · 2026-05-31 · unverdicted · novelty 5.0

Small transformers on HMM prediction tasks exhibit correlated scaling between performance and linear encoding of belief distributions in residual activations.

High-Dimensional Statistics: Reflections on Progress and Open Problems

math.ST · 2026-05-06 · unverdicted · novelty 2.0 · 2 refs

This review synthesizes representative advances in high-dimensional statistics, highlights common themes and open problems, and points to key entry works.

Mechanistic Anomaly Detection via Functional Attribution

cs.LG · 2026-04-21

citing papers explorer

Showing 6 of 6 citing papers.

Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior cs.LG · 2026-05-06 · unverdicted · none · ref 256
Manifold steering along activation geometry induces behavioral trajectories matching the natural manifold of outputs, while linear steering produces off-manifold unnatural behaviors.
Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers cs.LG · 2026-05-05 · unverdicted · none · ref 6
In a controlled synthetic setting, transformers implement in-distribution task inference via convex combinations of task vectors and out-of-distribution inference via nearly orthogonal extrapolative representations.
Temporal Task Diversity: Inductive Biases Under Non-Stationarity in Synthetic Sequence Modelling cs.LG · 2026-05-18 · unverdicted · none · ref 6
Temporal diversity in task distribution during training increases generalization bias over memorization in transformers for in-context linear regression.
Structure and Scale in Simplicial Sequence Modelling cs.LG · 2026-05-31 · unverdicted · none · ref 10
Small transformers on HMM prediction tasks exhibit correlated scaling between performance and linear encoding of belief distributions in residual activations.
High-Dimensional Statistics: Reflections on Progress and Open Problems math.ST · 2026-05-06 · unverdicted · none · ref 15 · 2 links
This review synthesizes representative advances in high-dimensional statistics, highlights common themes and open problems, and points to key entry works.
Mechanistic Anomaly Detection via Functional Attribution cs.LG · 2026-04-21 · unreviewed · ref 67

Dynamics of transient structure in in-context linear regression transformers

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer