Training dynamics of in-context learning in linear attention

Yedi Zhang, Freya Behrens, Florent Krzakala · 2025 · arXiv 2501.16265

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Transformers Can Implement Preconditioned Richardson Iteration for In-Context Gaussian Kernel Regression

cs.LG · 2026-05-08 · unverdicted · novelty 7.0 · 2 refs

A single-head softmax transformer with O(log(1/ε)) blocks and O(√(N/ε)) MLP width implements preconditioned Richardson iteration to achieve ε-accurate Gaussian KRR predictions on length-N prompts under bounded data.

Learning to Adapt: In-Context Learning Beyond Stationarity

cs.LG · 2026-04-13 · unverdicted · novelty 6.0

Gated linear attention enables lower training and test errors in non-stationary in-context learning by adaptively modulating past inputs through a learnable recency bias under an autoregressive model of task evolution.

Understanding LoRA as Knowledge Memory: An Empirical Analysis

cs.LG · 2026-03-01

citing papers explorer

Showing 3 of 3 citing papers.

Transformers Can Implement Preconditioned Richardson Iteration for In-Context Gaussian Kernel Regression
cs.LG · 2026-05-08 · unverdicted · none · ref 44 · 2 links
Learning to Adapt: In-Context Learning Beyond Stationarity
cs.LG · 2026-04-13 · unverdicted · none · ref 55
Understanding LoRA as Knowledge Memory: An Empirical Analysis
cs.LG · 2026-03-01 · unreviewed · ref 4
