CLadder: Assessing causal reasoning in language models

Zhijing Jin, Yuen Chen, Felix Leeb, Luigi Gresele, Ojasv Kamal, Zhiheng Lyu, Kevin Blin, Fernando Gonzalez Adauto, Max Kleiman-Weiner, Mrinmaya Sachan, Bernhard Schölkopf · 2024 · arXiv 2312.04350

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Reading the Finetuning Prior: Verbatim Content Recovery via Contrastive Decoding Diffing

cs.LG · 2026-05-25 · unverdicted · novelty 7.0

Contrastive Decoding Diffing recovers exact implanted facts from finetuned LLMs via logit-space differences between finetuned and base models, outperforming white-box baselines with less access.

Instrumented data for causal scientific machine learning

cs.LG · 2026-06-05 · unverdicted · novelty 5.0

Instrumented data augments observations with mechanistic models, uncertainty, and counterfactuals to enable causal interventions via Pearl's do-operator in scientific machine learning.

citing papers explorer

Showing 2 of 2 citing papers.

Reading the Finetuning Prior: Verbatim Content Recovery via Contrastive Decoding Diffing · cs.LG · 2026-05-25 · unverdicted · none · ref 10
Instrumented data for causal scientific machine learning · cs.LG · 2026-06-05 · unverdicted · none · ref 26
