Maheep Chaudhary and Atticus Geiger

Maheep Chaudhary, Atticus Geiger · 2024 · arXiv 2409.04478

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

PLOT: Progressive Localization via Optimal Transport in Neural Causal Abstraction

cs.LG · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

PLOT localizes causal variables in neural networks by fitting optimal transport couplings between abstract and neural intervention effect geometries, enabling fast handles or guided search.

Why Limit the Residual Stream to Layers and Not Tokens? Persistent Memory for Continuous Latent Reasoning

cs.AI · 2026-06-05 · unverdicted · novelty 6.0

AGCLR extends CoCoNuT with a gated concept stream for persistent memory to fix fact loss in latent reasoning, yielding improvements on reasoning benchmarks as depth increases.

Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

RL preserves a larger fraction of base model circuits than SFT during fine-tuning on scientific QA, per a new head-level differential circuit vulnerability metric, at the cost of slower adaptation.

SAERec: Constructing Fine-grained Interpretable Intents Priors via Sparse Autoencoders for Recommendation

cs.IR · 2026-06-17 · unverdicted · novelty 5.0

SAERec extracts fine-grained interpretable intents from LLM embeddings via sparse autoencoders and integrates them as priors into sequence recommendation using multi-branch attention, outperforming baselines on public datasets.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Why Limit the Residual Stream to Layers and Not Tokens? Persistent Memory for Continuous Latent Reasoning cs.AI · 2026-06-05 · unverdicted · none · ref 35
AGCLR extends CoCoNuT with a gated concept stream for persistent memory to fix fact loss in latent reasoning, yielding improvements on reasoning benchmarks as depth increases.

Maheep Chaudhary and Atticus Geiger

fields

years

verdicts

representative citing papers

citing papers explorer