Title resolution pending

Avichal Goel, Yoon Kim, Nir Shavit, Tony T · 2025 · arXiv 2510.05092

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Introspective Coupling: Self-Explanation Training Tracks Behavioral Change Despite Fixed Supervision

cs.CL · 2026-06-30 · unverdicted · novelty 6.0

Fixed counterfactual explanation datasets train LMs such that generated explanations track the model's evolving behavior rather than the fixed targets, due to persistent correlation during training.

Building Better Activation Oracles

cs.LG · 2026-05-23 · unverdicted · novelty 3.0

Four changes to Activation Oracle training yield marginal capability gains but better practical quality, plus an open-sourced evaluation suite AObench.

citing papers explorer

Showing 2 of 2 citing papers.

Introspective Coupling: Self-Explanation Training Tracks Behavioral Change Despite Fixed Supervision cs.CL · 2026-06-30 · unverdicted · none · ref 12
Fixed counterfactual explanation datasets train LMs such that generated explanations track the model's evolving behavior rather than the fixed targets, due to persistent correlation during training.
Building Better Activation Oracles cs.LG · 2026-05-23 · unverdicted · none · ref 4
Four changes to Activation Oracle training yield marginal capability gains but better practical quality, plus an open-sourced evaluation suite AObench.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer