Causal Atlases from Entropic Inference: Bayesian Networks beyond Optimal DAGs

Greg van Anders; Hazhir Aliahmadi; Irina Babayan

arxiv: 2606.06440 · v1 · pith:D64USFODnew · submitted 2026-06-04 · 💻 cs.LG · stat.ML

Causal Atlases from Entropic Inference: Bayesian Networks beyond Optimal DAGs

Hazhir Aliahmadi , Irina Babayan , Greg van Anders This is my paper

Pith reviewed 2026-06-28 02:23 UTC · model grok-4.3

classification 💻 cs.LG stat.ML

keywords causal inferenceBayesian networksmaximum entropydirected acyclic graphsstructural ambiguityentropic inferencecausal atlasesstructural equation models

0 comments

The pith

Entropy-based inference generates atlases of multiple causal graphs consistent with data rather than single optimized DAGs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that optimization methods for Bayesian networks often produce one directed acyclic graph even when data supports several causal structures. It proposes entropy-based inference to sample a maximum-entropy ensemble of graphs instead. Tests on noisy simulated data from 2-node and 20-node linear structural equation models show this ensemble quantifies structural ambiguity and reveals that optimized DAGs can embed causal links absent from other equally accurate graphs. A reader would care because real data frequently admits multiple chains of causation, and a single graph can misrepresent that variability. The approach aims to produce more data-faithful representations of causal relationships.

Core claim

Entropy-based inference generates atlases of plausible causal relationships that are consistent with underlying data. On simulated noisy data of 2- and 20-node linear structural equation models, sampling a maximum-entropy ensemble of graphs quantifies the inherent structural ambiguity in underlying causal relationships. The method shows that optimized DAGs can contain causal artifacts not consistent across equivalently accurate topologies.

What carries the argument

maximum-entropy ensemble of graphs, which samples multiple directed acyclic graphs to quantify structural ambiguity instead of selecting one optimized network.

If this is right

Optimized single DAGs may include causal artifacts that do not appear in other topologies of equivalent accuracy.
The ensemble approach quantifies the degree of structural ambiguity present in noisy observations of linear structural equation models.
Multiple causal maps can be produced that remain consistent with the variability in the underlying data.
Bayesian networks need not be limited to one graph when data admit several plausible causal orderings.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The ensemble method could be tested on observational datasets with partial ground-truth causal information to check whether the atlases align with known relations.
Extending the sampling to nonlinear or non-Gaussian data would show whether the ambiguity quantification generalizes beyond the linear case studied.
Comparing the entropy-derived atlases against uncertainty estimates from other causal discovery algorithms on the same simulations would clarify relative strengths.

Load-bearing premise

The sampling from a maximum-entropy ensemble of graphs accurately captures the inherent structural ambiguity in the causal relationships without bias from the entropy measure or graph prior.

What would settle it

Applying the sampling procedure to data generated from a single known causal structure and finding that the resulting ensemble spreads probability across many inconsistent graphs instead of concentrating near the true structure.

Figures

Figures reproduced from arXiv: 2606.06440 by Greg van Anders, Hazhir Aliahmadi, Irina Babayan.

**Figure 2.** Figure 2: b in Fig. 2c–f). Optimizing the total energy directly yields only one graph with one set of connection strengths (represented by edge thickness in Fig.2a, generated by using the DAGMA methodology7 on the problem setup described above). Using our sampler, we gain access to multiple uncertainty quantification measures without compromising the acyclicity requirement. For this problem, under finite-temperature… view at source ↗

read the original abstract

Data-driven causal relationship identification is pertinent to advancing understanding of complex systems both within and beyond science. Bayesian networks offer a probabilistic method for modelling generic causal relationships via directed acyclic graphs (DAGs). However, typical techniques for constructing Bayesian networks rely on optimization, which can be ill-suited for learning causal relationships because the underlying data may admit multiple chains of causation. More data-faithful representations of causal relationships would provide frameworks for constructing multiple causal maps that are consistent with the variability that is inherent in underlying data. Here, we show that entropy-based inference generates atlases of plausible causal relationships that are consistent with underlying data. On simulated noisy data of 2- and 20-node linear structural equation models, we sample a maximum-entropy ensemble of graphs that allow us to quantify the inherent structural ambiguity in underlying causal relationships. Our method shows that "optimized" DAGs can contain causal artifacts are not consistent across equivalently accurate topologies.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper's core move—sampling max-ent ensembles of DAGs to map causal ambiguity instead of optimizing one graph—is worth a look, but the simulations give no evidence the ensembles are robust to entropy choice or prior.

read the letter

The paper's main claim is that maximum-entropy sampling can produce an atlas of plausible DAGs that reveals structural ambiguity in causal data, and that single optimized DAGs often contain relations that do not appear consistently across other high-accuracy graphs.

On the simulated 2-node and 20-node linear SEMs with noise, they generate these ensembles and use them to flag inconsistencies in the optimized outputs. That demonstration is the concrete piece of work here.

The approach is new in its direct use of entropic inference for this purpose. It correctly identifies a limitation of standard optimization methods when the data underdetermines the causal structure.

The soft spot is exactly the one flagged in the stress-test note. Nothing in the description shows that the edge marginals or ambiguity measures stay stable under different entropy functionals or different graph priors. If those choices shift the atlas, then the reported artifacts in optimized DAGs could be method-dependent rather than a property of the data. The abstract also gives no quantitative results, no error bars, and no sensitivity checks, so the strength of the evidence is hard to judge.

This is for people working on causal discovery who already see the limits of single-DAG outputs and want ensemble-style alternatives. A reader looking for a new technical framework would find the direction useful even if the current implementation is preliminary.

It deserves peer review. The idea is distinct enough and the basic simulation setup is clear enough that referees can ask for the missing robustness tests and quantitative details without starting from zero.

Referee Report

2 major / 2 minor

Summary. The paper claims that entropy-based inference generates 'causal atlases'—ensembles of plausible DAGs consistent with data—allowing quantification of inherent structural ambiguity in causal relationships. On simulated noisy data from 2- and 20-node linear structural equation models, a maximum-entropy ensemble of graphs is sampled to show that single 'optimized' DAGs can contain causal artifacts not consistent across equivalently accurate topologies.

Significance. If the maximum-entropy ensemble is shown to faithfully capture data-consistent ambiguity independent of the specific entropy functional and graph prior, the approach would offer a principled way to represent causal uncertainty beyond point estimates from optimization, with potential utility in domains where multiple causal structures fit the data equally well.

major comments (2)

[Methods (ensemble sampling procedure)] The central claim that the sampled maximum-entropy ensemble quantifies inherent structural ambiguity (rather than artifacts of the chosen entropy measure or implicit prior) is load-bearing, yet the manuscript provides no sensitivity analysis on alternative sufficient statistics for the entropy or on different graph priors; without such checks, the reported inconsistency of optimized DAGs with the ensemble cannot be distinguished from method dependence.
[Abstract and Results] Abstract and results sections: the claims rest on simulations of 2- and 20-node linear SEMs, but no quantitative metrics (edge marginals, ambiguity measures, error bars, or comparison to ground-truth ambiguity) are reported, preventing assessment of whether the ensemble actually supports the stated conclusions about structural ambiguity.

minor comments (2)

[Abstract] Abstract: grammatical error in 'causal artifacts are not consistent across equivalently accurate topologies' (missing 'that').
[Methods] Notation for the entropy functional and the precise definition of the graph prior should be stated explicitly in the methods to allow reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which highlight important aspects for strengthening our work on causal atlases via entropic inference. We provide point-by-point responses to the major comments and indicate the revisions we will make.

read point-by-point responses

Referee: [Methods (ensemble sampling procedure)] The central claim that the sampled maximum-entropy ensemble quantifies inherent structural ambiguity (rather than artifacts of the chosen entropy measure or implicit prior) is load-bearing, yet the manuscript provides no sensitivity analysis on alternative sufficient statistics for the entropy or on different graph priors; without such checks, the reported inconsistency of optimized DAGs with the ensemble cannot be distinguished from method dependence.

Authors: We recognize the importance of verifying that the observed structural ambiguity is not dependent on the specific entropy measure or graph prior used in our sampling procedure. Although our current implementation relies on a standard maximum-entropy formulation with particular sufficient statistics derived from the data, we agree that additional checks would bolster the claim. In the revised version, we will conduct sensitivity analyses by varying the sufficient statistics and employing alternative graph priors, reporting how these affect the ensemble properties and the inconsistencies with optimized DAGs. This will help confirm the robustness of our findings. revision: yes
Referee: [Abstract and Results] Abstract and results sections: the claims rest on simulations of 2- and 20-node linear SEMs, but no quantitative metrics (edge marginals, ambiguity measures, error bars, or comparison to ground-truth ambiguity) are reported, preventing assessment of whether the ensemble actually supports the stated conclusions about structural ambiguity.

Authors: We concur that incorporating quantitative metrics would enhance the clarity and assessability of our results. The simulations in the manuscript demonstrate the concept qualitatively through examples of inconsistencies, but to provide stronger evidence, we will add in the revision: computations of edge marginals from the ensemble, quantitative ambiguity measures such as the entropy of the graph distribution, error bars from repeated sampling, and where possible, comparisons against the known ground-truth ambiguity from the simulated SEMs. These additions will be reflected in both the abstract and results sections. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation applies standard max-ent sampling independently to data

full rationale

The paper's core method samples a maximum-entropy ensemble of DAGs consistent with simulated linear SEM data to quantify structural ambiguity, without any quoted equations or steps that reduce by construction to fitted inputs, self-definitions, or load-bearing self-citations. The abstract and description present the entropy principle as an external tool applied to data, with no renaming of known results or ansatz smuggling; the claim that optimized DAGs contain inconsistent artifacts follows directly from comparing the ensemble to point estimates, remaining self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no details on specific parameters, axioms or new entities; the method is based on standard maximum-entropy principles applied to graph ensembles.

pith-pipeline@v0.9.1-grok · 5692 in / 1180 out tokens · 59799 ms · 2026-06-28T02:23:38.440345+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

36 extracted references · 13 canonical work pages · 1 internal anchor

[2]

Conformal

Franca, Guilherme and Sulam, Jeremias and Robinson, Daniel and Vidal, Rene , year = 2020, volume =. Conformal. Advances in

2020
[3]

Understanding Molecular Simulation:

Frenkel, Daan and Smit, Berend , year = 1996, publisher =. Understanding Molecular Simulation:

1996
[6]

Counterfactual

Kusner, Matt J and Loftus, Joshua and Russell, Chris and Silva, Ricardo , year = 2017, volume =. Counterfactual. Advances in

2017
[12]

Advances in

Wei, Dennis and Gao, Tian and Yu, Yue , year = 2020, volume =. Advances in

2020
[13]

Proceedings of the 36th

Yu, Yue and Chen, Jie and Gao, Tian and Yu, Mo , year = 2019, month = may, pages =. Proceedings of the 36th

2019
[16]

Nonnegative Matrices in the Mathematical Sciences , author =
[17]

Journal of Optimization Theory and Applications , volume =

The Essence of Invexity , author =. Journal of Optimization Theory and Applications , volume =. doi:10.1007/BF00941316 , urldate =

work page doi:10.1007/bf00941316
[19]

Annadani, Yashas and Pawlowski, Nick and Jennings, Joel and Bauer, Stefan and Zhang, Cheng and Gong, Wenbo , year = 2023, month = dec, journal =

2023
[21]

Advances in

Lorch, Lars and Rothfuss, Jonas and Sch. Advances in
[22]

Ng, Ignavier and Ghassami, AmirEmad and Zhang, Kun , year = 2020, volume =. On the. Advances in

2020
[23]

and Kohn, Robert , year = 2025, month = oct, urldate =

Thompson, Ryan and Bonilla, Edwin V. and Kohn, Robert , year = 2025, month = oct, urldate =. The

2025
[24]

Proceedings of the 38th

Yu, Yue and Gao, Tian and Yin, Naiyu and Ji, Qiang , year = 2021, month = jul, pages =. Proceedings of the 38th

2021
[25]

Learning

Zheng, Xun and Dan, Chen and Aragam, Bryon and Ravikumar, Pradeep and Xing, Eric , year = 2020, month = jun, pages =. Learning. Proceedings of the

2020
[26]

Causal protein-signaling networks derived from multiparameter single-cell data

author author K. Sachs , author O. Perez , author D. Pe'er , author D. A. \ Lauffenburger ,\ and\ author G. P. \ Nolan ,\ title title Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , \ https://doi.org/10.1126/science.1105809 journal journal Science \ volume 308 ,\ pages 523--529 ( year 2005 ) NoStop

work page doi:10.1126/science.1105809 2005
[27]

Zhang , author C

author author B. Zhang , author C. Gaiteri , author L.-G. \ Bodea , author Z. Wang , author J. McElwee , author A. A. \ Podtelezhnikov , author C. Zhang , author T. Xie , author L. Tran , author R. Dobrin , author E. Fluder , author B. Clurman , author S. Melquist , author M. Narayanan , author C. Suver , author H. Shah , author M. Mahajan , author T. Gil...

work page doi:10.1016/j.cell.2013.03.030 2013
[28]

Spirtes , author C

author author P. Spirtes , author C. Glymour ,\ and\ author R. Scheines ,\ https://doi.org/10.1007/978-1-4612-2748-9 title Causation, Prediction , and Search ,\ edited by\ editor J. Berger , editor S. Fienberg , editor J. Gani , editor K. Krickeberg , editor I. Olkin ,\ and\ editor B. Singer ,\ series Lecture Notes in Statistics , Vol. volume 81 \ ( publi...

work page doi:10.1007/978-1-4612-2748-9 1993
[29]

author author M. J. \ Kusner , author J. Loftus , author C. Russell ,\ and\ author R. Silva ,\ title title Counterfactual Fairness , \ in\ @noop booktitle Advances in Neural Information Processing Systems ,\ Vol. volume 30 \ ( publisher Curran Associates, Inc. ,\ year 2017 ) NoStop

2017
[30]

author author A. D. \ Sanford \ and\ author I. A. \ Moosa ,\ title title A Bayesian network structure for operational risk modelling in structured finance operations , \ https://doi.org/10.1057/jors.2011.7 journal journal Journal of the Operational Research Society \ volume 63 ,\ pages 431--444 ( year 2012 ) NoStop

work page doi:10.1057/jors.2011.7 2011
[31]

DAGs with NO TEARS: Continuous Optimization for Structure Learning

author author X. Zheng , author B. Aragam , author P. Ravikumar ,\ and\ author E. P. \ Xing ,\ https://doi.org/10.48550/arXiv.1803.01422 title DAGs with NO TEARS : Continuous Optimization for Structure Learning , \ ( year 2018 ),\ https://arxiv.org/abs/1803.01422 arXiv:1803.01422 [stat] NoStop

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1803.01422 2018
[32]

Bello , author B

author author K. Bello , author B. Aragam ,\ and\ author P. Ravikumar ,\ https://doi.org/10.48550/arXiv.2209.08037 title DAGMA : Learning DAGs via M-matrices and a Log-Determinant Acyclicity Characterization , \ ( year 2023 ),\ https://arxiv.org/abs/2209.08037 arXiv:2209.08037 [cs] NoStop

work page doi:10.48550/arxiv.2209.08037 2023
[33]

Husmeier , editor R

editor D. Husmeier , editor R. Dybowski ,\ and\ editor S. Roberts ,\ eds.,\ https://doi.org/10.1007/b138794 title Probabilistic Modeling in Bioinformatics and Medical Informatics ,\ Advanced Information and Knowledge Processing \ ( publisher Springer London ,\ address London ,\ year 2005 ) NoStop

work page doi:10.1007/b138794 2005
[34]

Yu , author J

author author Y. Yu , author J. Chen , author T. Gao ,\ and\ author M. Yu ,\ title title DAG-GNN : DAG Structure Learning with Graph Neural Networks , \ in\ @noop booktitle Proceedings of the 36th International Conference on Machine Learning \ ( publisher PMLR ,\ year 2019 )\ pp.\ pages 7154--7163 NoStop

2019
[35]

Zheng , author C

author author X. Zheng , author C. Dan , author B. Aragam , author P. Ravikumar ,\ and\ author E. Xing ,\ title title Learning Sparse Nonparametric DAGs , \ in\ @noop booktitle Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics \ ( publisher PMLR ,\ year 2020 )\ pp.\ pages 3414--3425 NoStop

2020
[36]

Ng , author A

author author I. Ng , author A. Ghassami ,\ and\ author K. Zhang ,\ title title On the Role of Sparsity and DAG Constraints for Learning Linear DAGs , \ in\ @noop booktitle Advances in Neural Information Processing Systems ,\ Vol. volume 33 \ ( publisher Curran Associates, Inc. ,\ year 2020 )\ pp.\ pages 17943--17954 NoStop

2020
[37]

Yu , author T

author author Y. Yu , author T. Gao , author N. Yin ,\ and\ author Q. Ji ,\ title title DAGs with No Curl : An Efficient DAG Structure Learning Approach , \ in\ @noop booktitle Proceedings of the 38th International Conference on Machine Learning \ ( publisher PMLR ,\ year 2021 )\ pp.\ pages 12156--12166 NoStop

2021
[38]

Annadani , author J

author author Y. Annadani , author J. Rothfuss , author A. Lacoste , author N. Scherrer , author A. Goyal , author Y. Bengio ,\ and\ author S. Bauer ,\ https://doi.org/10.48550/arXiv.2106.07635 title Variational Causal Networks : Approximate Bayesian Inference over Causal Structures , \ ( year 2021 ),\ https://arxiv.org/abs/2106.07635 arXiv:2106.07635 [cs...

work page doi:10.48550/arxiv.2106.07635 2021
[39]

Lorch , author J

author author L. Lorch , author J. Rothfuss , author B. Sch \"o lkopf ,\ and\ author A. Krause ,\ title title DiBS : Differentiable Bayesian Structure Learning , \ in\ @noop booktitle Advances in Neural Information Processing Systems ,\ Vol. volume 34 \ ( publisher Curran Associates, Inc. ,\ year 2021 )\ pp.\ pages 24111--24123 NoStop

2021
[40]

Annadani , author N

author author Y. Annadani , author N. Pawlowski , author J. Jennings , author S. Bauer , author C. Zhang ,\ and\ author W. Gong ,\ title title BayesDAG : Gradient-Based Posterior Inference for Causal Discovery , \ @noop journal journal Advances in Neural Information Processing Systems \ volume 36 ,\ pages 1738--1763 ( year 2023 ) NoStop

2023
[41]

Thompson , author E

author author R. Thompson , author E. V. \ Bonilla ,\ and\ author R. Kohn ,\ title title ProDAG : Projected Variational Inference for Directed Acyclic Graphs , \ in\ @noop booktitle The Thirty-ninth Annual Conference on Neural Information Processing Systems \ ( year 2025 ) NoStop

2025
[42]

Babayan , author H

author author I. Babayan , author H. Aliahmadi ,\ and\ author G. Van Anders ,\ title title Sufficient is better than optimal for training neural networks , \ https://doi.org/10.1038/s41467-025-66983-3 journal journal Nature Communications \ volume 17 ,\ pages 271 ( year 2025 ) NoStop

work page doi:10.1038/s41467-025-66983-3 2025
[43]

author author G. J. \ Martyna , author M. E. \ Tuckerman , author D. J. \ Tobias ,\ and\ author M. L. \ Klein ,\ title title Explicit reversible integrators for extended systems dynamics , \ https://doi.org/10.1080/00268979600100761 journal journal Molecular Physics \ volume 87 ,\ pages 1117--1157 ( year 1996 ) NoStop

work page doi:10.1080/00268979600100761 1996
[44]

Tuckerman, B

author author M. Tuckerman , author B. J. \ Berne ,\ and\ author G. J. \ Martyna ,\ title title Reversible multiple time scale molecular dynamics , \ https://doi.org/10.1063/1.463137 journal journal The Journal of Chemical Physics \ volume 97 ,\ pages 1990--2001 ( year 1992 ) ,\ https://arxiv.org/abs/https://doi.org/10.1063/1.463137 https://doi.org/10.106...

work page doi:10.1063/1.463137 1990
[45]

Frenkel \ and\ author B

author author D. Frenkel \ and\ author B. Smit ,\ @noop title Understanding Molecular Simulation: From Algorithms to Applications \ ( publisher Academic Press ,\ address San Diego ,\ year 1996 ) NoStop

1996
[46]

Jang \ and\ author G

author author S. Jang \ and\ author G. A. \ Voth ,\ title title Simple reversible molecular dynamics algorithms for Nos\'e -- Hoover chain dynamics , \ https://doi.org/10.1063/1.475247 journal journal The Journal of Chemical Physics \ volume 107 ,\ pages 9514--9526 ( year 1997 ) NoStop

work page doi:10.1063/1.475247 1997
[47]

Franca , author J

author author G. Franca , author J. Sulam , author D. Robinson ,\ and\ author R. Vidal ,\ title title Conformal Symplectic and Relativistic Optimization , \ in\ @noop booktitle Advances in Neural Information Processing Systems ,\ Vol. volume 33 \ ( publisher Curran Associates, Inc. ,\ year 2020 )\ pp.\ pages 16916--16926 NoStop

2020
[48]

Berman \ and\ author R

author author A. Berman \ and\ author R. J. \ Plemmons ,\ @noop title Nonnegative Matrices in the Mathematical Sciences ,\ Computer Science and Applied Mathematics\ ( publisher Academic Press ,\ address New York ,\ year 1979 ) NoStop

1979

[1] [2]

Conformal

Franca, Guilherme and Sulam, Jeremias and Robinson, Daniel and Vidal, Rene , year = 2020, volume =. Conformal. Advances in

2020

[2] [3]

Understanding Molecular Simulation:

Frenkel, Daan and Smit, Berend , year = 1996, publisher =. Understanding Molecular Simulation:

1996

[3] [6]

Counterfactual

Kusner, Matt J and Loftus, Joshua and Russell, Chris and Silva, Ricardo , year = 2017, volume =. Counterfactual. Advances in

2017

[4] [12]

Advances in

Wei, Dennis and Gao, Tian and Yu, Yue , year = 2020, volume =. Advances in

2020

[5] [13]

Proceedings of the 36th

Yu, Yue and Chen, Jie and Gao, Tian and Yu, Mo , year = 2019, month = may, pages =. Proceedings of the 36th

2019

[6] [16]

Nonnegative Matrices in the Mathematical Sciences , author =

[7] [17]

Journal of Optimization Theory and Applications , volume =

The Essence of Invexity , author =. Journal of Optimization Theory and Applications , volume =. doi:10.1007/BF00941316 , urldate =

work page doi:10.1007/bf00941316

[8] [19]

Annadani, Yashas and Pawlowski, Nick and Jennings, Joel and Bauer, Stefan and Zhang, Cheng and Gong, Wenbo , year = 2023, month = dec, journal =

2023

[9] [21]

Advances in

Lorch, Lars and Rothfuss, Jonas and Sch. Advances in

[10] [22]

Ng, Ignavier and Ghassami, AmirEmad and Zhang, Kun , year = 2020, volume =. On the. Advances in

2020

[11] [23]

and Kohn, Robert , year = 2025, month = oct, urldate =

Thompson, Ryan and Bonilla, Edwin V. and Kohn, Robert , year = 2025, month = oct, urldate =. The

2025

[12] [24]

Proceedings of the 38th

Yu, Yue and Gao, Tian and Yin, Naiyu and Ji, Qiang , year = 2021, month = jul, pages =. Proceedings of the 38th

2021

[13] [25]

Learning

Zheng, Xun and Dan, Chen and Aragam, Bryon and Ravikumar, Pradeep and Xing, Eric , year = 2020, month = jun, pages =. Learning. Proceedings of the

2020

[14] [26]

Causal protein-signaling networks derived from multiparameter single-cell data

author author K. Sachs , author O. Perez , author D. Pe'er , author D. A. \ Lauffenburger ,\ and\ author G. P. \ Nolan ,\ title title Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , \ https://doi.org/10.1126/science.1105809 journal journal Science \ volume 308 ,\ pages 523--529 ( year 2005 ) NoStop

work page doi:10.1126/science.1105809 2005

[15] [27]

Zhang , author C

author author B. Zhang , author C. Gaiteri , author L.-G. \ Bodea , author Z. Wang , author J. McElwee , author A. A. \ Podtelezhnikov , author C. Zhang , author T. Xie , author L. Tran , author R. Dobrin , author E. Fluder , author B. Clurman , author S. Melquist , author M. Narayanan , author C. Suver , author H. Shah , author M. Mahajan , author T. Gil...

work page doi:10.1016/j.cell.2013.03.030 2013

[16] [28]

Spirtes , author C

author author P. Spirtes , author C. Glymour ,\ and\ author R. Scheines ,\ https://doi.org/10.1007/978-1-4612-2748-9 title Causation, Prediction , and Search ,\ edited by\ editor J. Berger , editor S. Fienberg , editor J. Gani , editor K. Krickeberg , editor I. Olkin ,\ and\ editor B. Singer ,\ series Lecture Notes in Statistics , Vol. volume 81 \ ( publi...

work page doi:10.1007/978-1-4612-2748-9 1993

[17] [29]

author author M. J. \ Kusner , author J. Loftus , author C. Russell ,\ and\ author R. Silva ,\ title title Counterfactual Fairness , \ in\ @noop booktitle Advances in Neural Information Processing Systems ,\ Vol. volume 30 \ ( publisher Curran Associates, Inc. ,\ year 2017 ) NoStop

2017

[18] [30]

author author A. D. \ Sanford \ and\ author I. A. \ Moosa ,\ title title A Bayesian network structure for operational risk modelling in structured finance operations , \ https://doi.org/10.1057/jors.2011.7 journal journal Journal of the Operational Research Society \ volume 63 ,\ pages 431--444 ( year 2012 ) NoStop

work page doi:10.1057/jors.2011.7 2011

[19] [31]

DAGs with NO TEARS: Continuous Optimization for Structure Learning

author author X. Zheng , author B. Aragam , author P. Ravikumar ,\ and\ author E. P. \ Xing ,\ https://doi.org/10.48550/arXiv.1803.01422 title DAGs with NO TEARS : Continuous Optimization for Structure Learning , \ ( year 2018 ),\ https://arxiv.org/abs/1803.01422 arXiv:1803.01422 [stat] NoStop

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1803.01422 2018

[20] [32]

Bello , author B

author author K. Bello , author B. Aragam ,\ and\ author P. Ravikumar ,\ https://doi.org/10.48550/arXiv.2209.08037 title DAGMA : Learning DAGs via M-matrices and a Log-Determinant Acyclicity Characterization , \ ( year 2023 ),\ https://arxiv.org/abs/2209.08037 arXiv:2209.08037 [cs] NoStop

work page doi:10.48550/arxiv.2209.08037 2023

[21] [33]

Husmeier , editor R

editor D. Husmeier , editor R. Dybowski ,\ and\ editor S. Roberts ,\ eds.,\ https://doi.org/10.1007/b138794 title Probabilistic Modeling in Bioinformatics and Medical Informatics ,\ Advanced Information and Knowledge Processing \ ( publisher Springer London ,\ address London ,\ year 2005 ) NoStop

work page doi:10.1007/b138794 2005

[22] [34]

Yu , author J

author author Y. Yu , author J. Chen , author T. Gao ,\ and\ author M. Yu ,\ title title DAG-GNN : DAG Structure Learning with Graph Neural Networks , \ in\ @noop booktitle Proceedings of the 36th International Conference on Machine Learning \ ( publisher PMLR ,\ year 2019 )\ pp.\ pages 7154--7163 NoStop

2019

[23] [35]

Zheng , author C

author author X. Zheng , author C. Dan , author B. Aragam , author P. Ravikumar ,\ and\ author E. Xing ,\ title title Learning Sparse Nonparametric DAGs , \ in\ @noop booktitle Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics \ ( publisher PMLR ,\ year 2020 )\ pp.\ pages 3414--3425 NoStop

2020

[24] [36]

Ng , author A

author author I. Ng , author A. Ghassami ,\ and\ author K. Zhang ,\ title title On the Role of Sparsity and DAG Constraints for Learning Linear DAGs , \ in\ @noop booktitle Advances in Neural Information Processing Systems ,\ Vol. volume 33 \ ( publisher Curran Associates, Inc. ,\ year 2020 )\ pp.\ pages 17943--17954 NoStop

2020

[25] [37]

Yu , author T

author author Y. Yu , author T. Gao , author N. Yin ,\ and\ author Q. Ji ,\ title title DAGs with No Curl : An Efficient DAG Structure Learning Approach , \ in\ @noop booktitle Proceedings of the 38th International Conference on Machine Learning \ ( publisher PMLR ,\ year 2021 )\ pp.\ pages 12156--12166 NoStop

2021

[26] [38]

Annadani , author J

author author Y. Annadani , author J. Rothfuss , author A. Lacoste , author N. Scherrer , author A. Goyal , author Y. Bengio ,\ and\ author S. Bauer ,\ https://doi.org/10.48550/arXiv.2106.07635 title Variational Causal Networks : Approximate Bayesian Inference over Causal Structures , \ ( year 2021 ),\ https://arxiv.org/abs/2106.07635 arXiv:2106.07635 [cs...

work page doi:10.48550/arxiv.2106.07635 2021

[27] [39]

Lorch , author J

author author L. Lorch , author J. Rothfuss , author B. Sch \"o lkopf ,\ and\ author A. Krause ,\ title title DiBS : Differentiable Bayesian Structure Learning , \ in\ @noop booktitle Advances in Neural Information Processing Systems ,\ Vol. volume 34 \ ( publisher Curran Associates, Inc. ,\ year 2021 )\ pp.\ pages 24111--24123 NoStop

2021

[28] [40]

Annadani , author N

author author Y. Annadani , author N. Pawlowski , author J. Jennings , author S. Bauer , author C. Zhang ,\ and\ author W. Gong ,\ title title BayesDAG : Gradient-Based Posterior Inference for Causal Discovery , \ @noop journal journal Advances in Neural Information Processing Systems \ volume 36 ,\ pages 1738--1763 ( year 2023 ) NoStop

2023

[29] [41]

Thompson , author E

author author R. Thompson , author E. V. \ Bonilla ,\ and\ author R. Kohn ,\ title title ProDAG : Projected Variational Inference for Directed Acyclic Graphs , \ in\ @noop booktitle The Thirty-ninth Annual Conference on Neural Information Processing Systems \ ( year 2025 ) NoStop

2025

[30] [42]

Babayan , author H

author author I. Babayan , author H. Aliahmadi ,\ and\ author G. Van Anders ,\ title title Sufficient is better than optimal for training neural networks , \ https://doi.org/10.1038/s41467-025-66983-3 journal journal Nature Communications \ volume 17 ,\ pages 271 ( year 2025 ) NoStop

work page doi:10.1038/s41467-025-66983-3 2025

[31] [43]

author author G. J. \ Martyna , author M. E. \ Tuckerman , author D. J. \ Tobias ,\ and\ author M. L. \ Klein ,\ title title Explicit reversible integrators for extended systems dynamics , \ https://doi.org/10.1080/00268979600100761 journal journal Molecular Physics \ volume 87 ,\ pages 1117--1157 ( year 1996 ) NoStop

work page doi:10.1080/00268979600100761 1996

[32] [44]

Tuckerman, B

author author M. Tuckerman , author B. J. \ Berne ,\ and\ author G. J. \ Martyna ,\ title title Reversible multiple time scale molecular dynamics , \ https://doi.org/10.1063/1.463137 journal journal The Journal of Chemical Physics \ volume 97 ,\ pages 1990--2001 ( year 1992 ) ,\ https://arxiv.org/abs/https://doi.org/10.1063/1.463137 https://doi.org/10.106...

work page doi:10.1063/1.463137 1990

[33] [45]

Frenkel \ and\ author B

author author D. Frenkel \ and\ author B. Smit ,\ @noop title Understanding Molecular Simulation: From Algorithms to Applications \ ( publisher Academic Press ,\ address San Diego ,\ year 1996 ) NoStop

1996

[34] [46]

Jang \ and\ author G

author author S. Jang \ and\ author G. A. \ Voth ,\ title title Simple reversible molecular dynamics algorithms for Nos\'e -- Hoover chain dynamics , \ https://doi.org/10.1063/1.475247 journal journal The Journal of Chemical Physics \ volume 107 ,\ pages 9514--9526 ( year 1997 ) NoStop

work page doi:10.1063/1.475247 1997

[35] [47]

Franca , author J

author author G. Franca , author J. Sulam , author D. Robinson ,\ and\ author R. Vidal ,\ title title Conformal Symplectic and Relativistic Optimization , \ in\ @noop booktitle Advances in Neural Information Processing Systems ,\ Vol. volume 33 \ ( publisher Curran Associates, Inc. ,\ year 2020 )\ pp.\ pages 16916--16926 NoStop

2020

[36] [48]

Berman \ and\ author R

author author A. Berman \ and\ author R. J. \ Plemmons ,\ @noop title Nonnegative Matrices in the Mathematical Sciences ,\ Computer Science and Applied Mathematics\ ( publisher Academic Press ,\ address New York ,\ year 1979 ) NoStop

1979