Use What You Know: Causal Foundation Models with Partial Graphs

Adrian Weller; Anish Dhir; Arik Reuter; Bernhard Schölkopf; Cristiana Diaconu; Frank Hutter; Jake Robertson; Mark van der Wilk; Ole Ossen

arxiv: 2602.14972 · v2 · pith:ZNEALS6Gnew · submitted 2026-02-16 · 💻 cs.LG

Arik Reuter, Anish Dhir, Cristiana Diaconu, Jake Robertson, Ole Ossen, Frank Hutter, Adrian Weller, Mark van der Wilk, Bernhard Schölkopf

classification 💻 cs.LG

keywords causal, information, models, approach, foundation, partial, cfms, conditioning

Estimating causal quantities traditionally relies on bespoke estimators tailored to specific assumptions. Recently proposed Causal Foundation Models (CFMs) promise a more unified approach by amortising causal discovery and inference in a single step. However, in their current state, they do not allow for the incorporation of any domain knowledge, which can lead to suboptimal predictions. We bridge this gap by introducing methods to condition CFMs on causal information, such as the causal graph or more readily available ancestral information. When access to complete causal graph information is too strict a requirement, our approach also effectively leverages partial causal information. We systematically evaluate conditioning strategies and find that injecting learnable biases into the attention mechanism, together with a graph-convolutional encoder, is a highly effective method to utilise full and partial causal information. Our experiments show that this conditioning allows a general-purpose CFM to match the performance of specialised models trained on specific causal structures. Overall, our approach addresses a central hurdle on the path towards all-in-one causal foundation models: the capability to answer causal queries in a data-driven manner while effectively leveraging any amount of domain expertise.
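
The abstract names learnable attention biases as the most effective way to inject full or partial graph information. The sketch below illustrates that general idea only and is not the authors' implementation: it assumes a hypothetical three-state edge encoding (known absent, known present, unknown) and one scalar bias per state, added to the attention scores before the softmax. The names `biased_attention`, `edge_state`, and `bias_table` are illustrative, the bias values are fixed by hand rather than learned, and the graph-convolutional encoder mentioned in the abstract is not modelled.

```python
import numpy as np

# Hypothetical encoding of partial graph knowledge (an assumption of this sketch).
NO_EDGE, EDGE, UNKNOWN = 0, 1, 2


def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def biased_attention(q, k, v, edge_state, bias_table):
    """Single-head attention over variables with an additive per-edge-state bias.

    q, k, v:     (n_vars, d) arrays of queries, keys, and values.
    edge_state:  (n_vars, n_vars) integer array; entry [i, j] says what is
                 known about the edge j -> i (variable i attends to j).
    bias_table:  (3,) array with one scalar per edge state. In a trained
                 model these would be learned parameters.
    """
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    # Look up one bias per (i, j) pair and add it to the raw scores.
    scores = scores + bias_table[edge_state]
    weights = softmax(scores, axis=-1)
    return weights @ v, weights


rng = np.random.default_rng(0)
n_vars, d = 3, 4
q, k, v = (rng.normal(size=(n_vars, d)) for _ in range(3))

# Partial knowledge: some edges known present, some known absent, the rest unknown.
edge_state = np.array([
    [UNKNOWN, NO_EDGE, NO_EDGE],
    [EDGE, UNKNOWN, UNKNOWN],
    [UNKNOWN, EDGE, UNKNOWN],
])
# Hand-picked values for illustration; unknown edges get a neutral bias of 0.
bias_table = np.array([-4.0, 2.0, 0.0])

out, weights = biased_attention(q, k, v, edge_state, bias_table)
```

With an all-zero `bias_table` the function reduces to ordinary scaled dot-product attention, so in this sketch a model given no graph information behaves like an unconditioned one.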

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

FoundCause: Causal Discovery with Latent Confounders from Observational Data
cs.LG · 2026-06 · unverdicted · novelty 7.0

FoundCause is a transformer-based amortized model for causal graph discovery that explicitly models latent confounders via learnable tokens and reports better performance than prior methods on 15 real-world datasets.

Learning Causal Orderings for In-Context Tabular Prediction
cs.LG · 2026-05 · unverdicted · novelty 7.0

TabOrder learns unsupervised causal variable orderings and enforces them with order-constrained attention for tabular prediction and imputation under distribution shifts.