Dataset-aware entropy-maximized active learning for machine-learned interatomic potentials

Li Zhu; Meiyan Wang; Rishi Rao

arxiv: 2605.20384 · v1 · pith:NZAFJYWGnew · submitted 2026-05-19 · ❄️ cond-mat.mtrl-sci

Dataset-aware entropy-maximized active learning for machine-learned interatomic potentials

Meiyan Wang , Rishi Rao , Li Zhu This is my paper

Pith reviewed 2026-05-21 07:04 UTC · model grok-4.3

classification ❄️ cond-mat.mtrl-sci

keywords active learninginteratomic potentialsentropy maximizationmachine learning potentialsmolecular dynamicsfingerprint selectionDFT dataconfiguration space

0 comments

The pith

Entropy-driven active learning reduces energy errors in interatomic potentials by factors of 3 to 10 at 800 structures

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper is trying to establish that an active learning strategy based on entropy maximization can generate more efficient training data for machine-learned interatomic potentials than standard random sampling does. It combines local entropy biasing in MD trajectories with global selection using the log-determinant of the fingerprint covariance matrix and dual covariance modes for different structure types. This produces training sets of a few hundred structures that achieve low errors on holdout data for covalent, metallic, and ionic materials. A sympathetic reader would care because it suggests a path to accurate potentials with far fewer expensive quantum mechanical calculations.

Core claim

By using per-configuration entropy to bias molecular dynamics trajectories and the log-determinant of the fingerprint covariance to filter for new information, with dual per-atom and per-config modes, the method generates training data that yields a factor of 3 to 10 lower energy mean absolute error at N=800 compared to random MD baselines on in-distribution tests for carbon, silicon, and NaCl.

What carries the argument

The dataset-aware entropy maximization procedure that pairs local trajectory biasing with global covariance log-determinant selection

If this is right

Achieves near-meV per atom accuracy with training sets of order 100 to 1000 structures.
Outperforms random MD sampling by factors of 3-10 in energy MAE at N=800 for covalent, metallic, and ionic bonded systems.
Handles pressure-driven phase transitions through broad coverage of configuration space.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method may enable rapid development of specialized potentials for unexplored materials by minimizing the number of required DFT calculations.
Similar entropy-based selection could apply to active learning in other scientific machine learning domains like molecular properties.
Further tests could check whether the gains persist for out-of-distribution predictions or much larger systems.

Load-bearing premise

The method assumes that maximizing local and global entropy measures will select configurations that add genuinely new and unbiased information to the training dataset.

What would settle it

Finding a material system or phase where the entropy-selected training set produces larger errors on holdout data than a random sample of the same size would indicate a failure to cover critical configuration space regions.

read the original abstract

We present an active learning framework for efficiently generating training data for machine-learned interatomic potentials (MLIPs). The method combines local entropy-driven molecular dynamics with global dataset-aware filtering: a per-configuration entropy term biases MD trajectories toward structurally diverse snapshots, while a global entropy measure, the log-determinant of the fingerprint covariance matrix of the entire dataset, selects only those configurations that provide genuinely new information. We employ dual covariance modes (per-atom for disordered structures and per-config for ordered phases) to achieve broad coverage of configuration space. Combined with a pre-trained foundation model (Allegro-OAM-L) and analytical fingerprint gradients from Gaussian overlap matrix eigenvalues, the framework produces high-quality domain-specific potentials with near- or sub-meV/atom accuracy on test data drawn from the same distribution at training-set sizes of order $10^{2}$ to $10^{3}$ entropy-selected DFT-labeled structures. We demonstrate the method on three systems spanning diverse bonding types and pressure-driven phase transitions: carbon (covalent), silicon (covalent/metallic), and NaCl (ionic). In learning curve comparisons against random molecular dynamics sampling at matched training set sizes ($N = 100$ to $800$), evaluated over three independent training-set draws per condition, entropy-driven sampling achieves a factor of approximately $3$ to $10$ lower energy MAE at $N = 800$ on in-distribution holdouts across the three systems, with the magnitude of the gain depending on the bonding type and the size at which the random-MD baseline saturates.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Entropy-driven active learning cuts MLIP training data needs by 3-10x on the tested systems, but the bias toward high-entropy states risks missing critical low-entropy configurations in phase transitions.

read the letter

This paper combines local entropy-driven molecular dynamics with global log-determinant selection on fingerprint covariances to build training sets for ML interatomic potentials more efficiently. The dual covariance modes for per-atom disordered structures and per-config ordered phases are the concrete new element, paired with analytical gradients and a pre-trained foundation model. It applies this to carbon, silicon, and NaCl, which cover different bonding types and pressure-driven transitions. The learning curves show consistent gains over random MD sampling at matched sizes from 100 to 800, with three independent draws each, reaching factors of roughly 3 to 10 lower energy MAE at N=800 on in-distribution holdouts. That empirical control is useful and worth noting for anyone who has to pay for DFT labels. The main soft spot is the sampling bias concern. Entropy terms push trajectories toward high-variance disordered states, which could under-represent saddle points or coexistence regions that matter for phase transitions in silicon and NaCl. If the holdout sets are drawn from the same biased distribution, the reported MAE drops may reflect distribution matching more than genuine information gain across the full space the potentials will be used on. The abstract also omits error bars, exact fingerprint definitions, and implementation choices, so statistical robustness and reproducibility are not yet clear. This work is for computational materials scientists who train ML interatomic potentials and want to reduce the number of expensive calculations. Readers focused on active learning pipelines or data-efficient methods for atomistic models will find the comparisons practical. It deserves a serious referee because the controlled experiments are present and the method is described enough to be checked. I recommend sending it to peer review; the gains are concrete enough that feedback on coverage of critical configurations would strengthen the result.

Referee Report

2 major / 2 minor

Summary. The paper presents an active learning framework for ML interatomic potentials that integrates per-configuration entropy biasing in MD trajectories with global log-determinant selection on the fingerprint covariance matrix of the accumulating dataset. Dual covariance modes (per-atom for disordered structures, per-config for ordered phases) are used together with a pre-trained foundation model and analytical fingerprint gradients. On carbon, silicon, and NaCl, the method is reported to yield 3–10× lower energy MAE than random-MD sampling at matched training-set sizes N=100–800 (three independent draws), reaching near- or sub-meV/atom accuracy on in-distribution holdouts.

Significance. If the reported gains are robust and free of sampling bias, the approach would meaningfully reduce the number of expensive DFT labels needed to reach target accuracy for MLIPs, especially for systems with pressure-driven phase transitions. The combination of entropy-driven local sampling and dataset-aware global filtering, together with reuse of a foundation model, is a concrete strength that could be adopted more broadly if the central assumption about coverage of configuration space is verified.

major comments (2)

Abstract: the headline claim of a factor of 3–10 lower energy MAE at N=800 rests on the unverified assumption that entropy biasing plus log-determinant selection will not systematically under-represent low-entropy but critical saddle-point or coexistence configurations in silicon and NaCl. The description of the dual covariance modes does not include an explicit check (e.g., order-parameter histograms or phase-coexistence sampling statistics) that such regions are adequately covered relative to a uniform random-MD baseline; without this, the observed MAE reduction on holdouts drawn from the same biased distribution does not yet demonstrate superior information gain for the intended MLIP use case.
Results section (learning-curve comparisons): the abstract states that evaluations were performed over three independent training-set draws, yet no error bars, standard deviations, or statistical significance tests on the MAE ratios are mentioned. This omission makes it impossible to judge whether the reported factor-of-3–10 improvement is robust or could be explained by variance in the random-MD baseline.

minor comments (2)

Abstract: the exact definitions of the per-atom and per-config fingerprint covariance matrices, the numerical value of the entropy temperature, and the precise form of the log-determinant selection criterion are not stated; these details are needed for reproducibility.
The manuscript should clarify whether the holdout sets are drawn from the same entropy-biased trajectories or from independent uniform MD runs; the current wording leaves this ambiguous.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive review. The comments highlight important aspects of validation and statistical reporting that will improve the clarity and robustness of our claims. We address each major comment below and outline the corresponding revisions.

read point-by-point responses

Referee: Abstract: the headline claim of a factor of 3–10 lower energy MAE at N=800 rests on the unverified assumption that entropy biasing plus log-determinant selection will not systematically under-represent low-entropy but critical saddle-point or coexistence configurations in silicon and NaCl. The description of the dual covariance modes does not include an explicit check (e.g., order-parameter histograms or phase-coexistence sampling statistics) that such regions are adequately covered relative to a uniform random-MD baseline; without this, the observed MAE reduction on holdouts drawn from the same biased distribution does not yet demonstrate superior information gain for the intended MLIP use case.

Authors: We agree that explicit verification of coverage for low-entropy critical regions strengthens the interpretation of the reported gains. Although the dual covariance modes and entropy biasing are designed to promote structural diversity, the original manuscript does not include order-parameter histograms or phase-coexistence sampling statistics. In the revised version we will add these analyses for silicon and NaCl, comparing the sampled distributions against the random-MD baseline to confirm that saddle-point and coexistence configurations are not systematically under-represented. revision: yes
Referee: Results section (learning-curve comparisons): the abstract states that evaluations were performed over three independent training-set draws, yet no error bars, standard deviations, or statistical significance tests on the MAE ratios are mentioned. This omission makes it impossible to judge whether the reported factor-of-3–10 improvement is robust or could be explained by variance in the random-MD baseline.

Authors: We acknowledge that the absence of error bars and statistical tests limits assessment of robustness. The three independent draws were performed, yet standard deviations and significance tests were not reported. In the revised manuscript we will include error bars on all learning curves, report the standard deviation of the MAE ratios across the three draws, and add a brief statistical comparison (e.g., paired t-test or Wilcoxon test) to quantify the significance of the observed improvements. revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical validation independent of inputs

full rationale

The paper presents an algorithmic framework for active learning in MLIPs that combines per-configuration entropy biasing of MD trajectories with global log-determinant selection on fingerprint covariance matrices (including dual per-atom/per-config modes). Performance is assessed via direct empirical learning-curve comparisons to random-MD baselines at fixed N=100–800, with MAE measured on in-distribution holdouts across three chemically distinct systems. No derivation reduces a claimed result to a fitted parameter or self-referential definition; the reported 3–10× MAE gains are outcomes of the sampling procedure rather than being forced by construction. No load-bearing self-citation chain or uniqueness theorem is invoked to justify the central claims. The framework is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the unproven effectiveness of the entropy measures for selecting informative configurations and on the assumption that the pre-trained Allegro-OAM-L foundation model plus Gaussian overlap fingerprints provide a sufficiently rich representation; no free parameters are numerically specified in the abstract.

axioms (1)

domain assumption The per-configuration entropy term and global log-determinant of fingerprint covariance matrix reliably quantify structural novelty and information gain.
Invoked throughout the method description as the basis for biasing MD and filtering snapshots.

pith-pipeline@v0.9.0 · 5813 in / 1423 out tokens · 60209 ms · 2026-05-21T07:04:47.113339+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

a per-configuration entropy term biases MD trajectories toward structurally diverse snapshots, while a global entropy measure, the log-determinant of the fingerprint covariance matrix of the entire dataset, selects only those configurations that provide genuinely new information
IndisputableMonolith/Foundation/AlexanderDuality.lean alexander_duality_circle_linking unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We employ dual covariance modes (per-atom for disordered structures and per-config for ordered phases)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

53 extracted references · 53 canonical work pages

[1]

Behler, J.: Perspective: Machine learning potentials for atomistic simulations. J. Chem. Phys.145, 170201 (2016)

work page 2016
[2]

Deringer, V.L., Caro, M.A., Cs´ anyi, G.: Machine learning interatomic potentials as emerging tools for materials science. Adv. Mater.31, 1902765 (2019)

work page 2019
[3]

Unke, O.T., Chmiela, S., Sauceda, H.E., Gastegger, M., Poltavsky, I., Sch¨ utt, K.T., Tkatchenko, A., M¨ uller, K.-R.: Machine learning force fields. Chem. Rev. 121, 10142 (2021) 25

work page 2021
[4]

Behler, J., Parrinello, M.: Generalized neural-network representation of high- dimensional potential-energy surfaces. Phys. Rev. Lett.98, 146401 (2007)

work page 2007
[5]

Bart´ ok, A.P., Payne, M.C., Kondor, R., Cs´ anyi, G.: Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett.104, 136403 (2010)

work page 2010
[6]

Bart´ ok, A.P., Kondor, R., Cs´ anyi, G.: On representing chemical environments. Phys. Rev. B87, 184115 (2013)

work page 2013
[7]

Drautz, R.: Atomic cluster expansion for accurate and transferable interatomic potentials. Phys. Rev. B99, 014104 (2019)

work page 2019
[8]

Sch¨ utt, K.T., Sauceda, H.E., Kindermans, P.-J., Tkatchenko, A., M¨ uller, K.-R.: SchNet: A deep learning architecture for molecules and materials. J. Chem. Phys. 148, 241722 (2018)

work page 2018
[9]

Wang, H., Zhang, L., Han, J., E, W.: DeePMD-kit: A deep learning package for many-body potential energy representation and molecular dynamics. Comput. Phys. Commun.228, 178 (2018)

work page 2018
[10]

Batzner, S., Musaelian, A., Sun, L., Geiger, M., Mailoa, J.P., Kornbluth, M., Molinari, N., Smidt, T.E., Kozinsky, B.: E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun.13, 2453 (2022)

work page 2022
[11]

In: Advances in Neural Information Processing Systems, vol

Batatia, I., Kov´ acs, D.P., Simm, G.N.C., Ortner, C., Cs´ anyi, G.: MACE: Higher order equivariant message passing neural networks for fast and accurate force fields. In: Advances in Neural Information Processing Systems, vol. 35, p. 11423 (2022)

work page 2022
[12]

Musaelian, A., Batzner, S., Johansson, A., Sun, L., Owen, C.J., Kornbluth, M., Kozinsky, B.: Learning local equivariant representations for large-scale atomistic dynamics. Nat. Commun.14, 579 (2023)

work page 2023
[13]

Batatia, I., Benner, P., Chiang, Y., Elena, A.M., Kov´ acs, D.P., Riebesell, J., Advincula, X.R., Asta, M., Avaylon, M., Baldwin, W.J., et al.: A foundation model for atomistic materials chemistry (2024)

work page 2024
[14]

Deng, B., Zhong, P., Jun, K., Riebesell, J., Han, K., Bartel, C.J., Ceder, G.: CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling. Nat. Mach. Intell.5, 1031 (2023)

work page 2023
[15]

Yang, H., Hu, C., Zhou, Y., Liu, X., Shi, Y., Li, J., Li, G., Chen, Z., Chen, S., Zeni, C., Horton, M., Pinsler, R., Fowler, A., Z¨ ugner, D., Xie, T., Smith, J., Sun, L., Wang, Q., Kong, L., Liu, C., Hao, H., Lu, Z.: MatterSim: A deep learning atomistic model across elements, temperatures and pressures (2024) 26

work page 2024
[16]

Nature624, 80 (2023)

Merchant, A., Batzner, S., Schoenholz, S.S., Aykol, M., Cheon, G., Cubuk, E.D.: Scaling deep learning for materials discovery. Nature624, 80 (2023)

work page 2023
[17]

Releases the Allegro-OAM-L foundation potential; pretrained on OMat24 and fine-tuned on MPtrj+sAlex

Tan, C.W., Descoteaux, M.L., Kotak, M., Miranda Nascimento, G., Kavanagh, S.R., Zichi, L., Wang, M., Saluja, A., Hu, Y.R., Smidt, T., Johansson, A., Witt, W.C., Kozinsky, B., Musaelian, A.: High-performance training and inference for deep equivariant interatomic potentials. Releases the Allegro-OAM-L foundation potential; pretrained on OMat24 and fine-tun...

work page 2025
[18]

Karabin, M., Perez, D.: An entropy-maximization approach to automated training set generation for interatomic potentials. J. Chem. Phys.153, 094110 (2020)

work page 2020
[19]

Zhang, Y., Wang, H., Chen, W., Zeng, J., Zhang, L., Wang, H., E, W.: DP-GEN: A concurrent learning platform for the generation of reliable deep learning based potential energy models. Comput. Phys. Commun.253, 107206 (2020)

work page 2020
[20]

Schran, C., Brezina, K., Marsalek, O.: Committee neural network potentials con- trol generalization errors and enable active learning. J. Chem. Phys.153, 104105 (2020)

work page 2020
[21]

npj Comput

Vandermause, J., Torrisi, S.B., Batzner, S., Xie, Y., Sun, L., Kolpak, A.M., Kozinsky, B.: On-the-fly active learning of interpretable Bayesian force fields for atomistic rare events. npj Comput. Mater.6, 20 (2020)

work page 2020
[22]

npj Comput

Bernstein, N., Cs´ anyi, G., Deringer, V.L.: De novo exploration and self-guided learning of potential-energy surfaces. npj Comput. Mater.5, 99 (2019)

work page 2019
[23]

Kulichenko, M., Barros, K., Lubbers, N., Li, Y.W., Messerly, R., Tretiak, S., Smith, J.S., Nebgen, B.: Uncertainty-driven dynamics for active learning of interatomic potentials. Nat. Comput. Sci.4, 29 (2024)

work page 2024
[24]

Podryabinkin, E.V., Shapeev, A.V.: Active learning of linearly parametrized interatomic potentials. Comput. Mater. Sci.140, 171 (2017)

work page 2017
[25]

Multiscale Model

Shapeev, A.V.: Moment tensor potentials: A class of systematically improvable interatomic potentials. Multiscale Model. Simul.14, 1153 (2016)

work page 2016
[26]

Zhu, L., Amsler, M., Fuhrer, T., Schaefer, B., Faraji, S., Rostami, S., Ghasemi, S.A., Sadeghi, A., Grauzinyte, M., Wolverton, C., Goedecker, S.: A fingerprint based metric for measuring similarities of crystalline structures. J. Chem. Phys. 144, 034203 (2016)

work page 2016
[27]

npj Comput

Subramanyam, A.P.A., Perez, D.: Information-maximization based active learn- ing of interatomic potentials. npj Comput. Mater.11, 218 (2025)

work page 2025
[28]

Barroso-Luque, L., Shuaibi, M., Fu, X., Wood, B.M., Dzamba, M., Gao, M., Rizvi, A., Zitnick, C.L., Ulissi, Z.W.: Open Materials 2024 (OMat24) inorganic 27 materials dataset and models (2024)

work page 2024
[29]

Bussi, G., Donadio, D., Parrinello, M.: Canonical sampling through velocity rescaling. J. Chem. Phys.126, 014101 (2007)

work page 2007
[30]

Hjorth Larsen, A., Jørgen Mortensen, J., Blomqvist, J., Castelli, I.E., Christensen, R., Du lak, M., Friis, J., Groves, M.N., Hammer, B., Hargus, C., Hermes, E.D., Jennings, P.C., Bjerre Jensen, P., Kermode, J., Kitchin, J.R., Leonhard Kolsbjerg, E., Kubal, J., Kaasbjerg, K., Lysgaard, S., Bergmann Maronsson, J., Maxson, T., Olsen, T., Pastewka, L., Peter...

work page 2017
[31]

Liu, D.C., Nocedal, J.: On the limited memory BFGS method for large scale optimization. Math. Program.45, 503 (1989)

work page 1989
[32]

Kresse, G., Furthm¨ uller, J.: Efficient iterative schemes forab initiototal-energy calculations using a plane-wave basis set. Phys. Rev. B54, 11169 (1996)

work page 1996
[33]

Kresse, G., Furthm¨ uller, J.: Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci.6, 15 (1996)

work page 1996
[34]

Bl¨ ochl, P.E.: Projector augmented-wave method. Phys. Rev. B50, 17953 (1994)

work page 1994
[35]

Kresse, G., Joubert, D.: From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B59, 1758 (1999)

work page 1999
[36]

Furness, J.W., Kaplan, A.D., Ning, J., Perdew, J.P., Sun, J.: Accurate and numer- ically efficient r2SCAN meta-generalized gradient approximation. J. Phys. Chem. Lett.11, 8208 (2020)

work page 2020
[37]

Sun, J., Ruzsinszky, A., Perdew, J.P.: Strongly constrained and appropriately normed semilocal density functional. Phys. Rev. Lett.115, 036402 (2015)

work page 2015
[38]

Sabatini, R., Gorni, T., Gironcoli, S.: Nonlocal van der Waals density functional made simple and efficient. Phys. Rev. B87, 041108 (2013)

work page 2013
[39]

Vydrov, O.A., Van Voorhis, T.: Nonlocal van der Waals density functional: The simpler the better. J. Chem. Phys.133, 244103 (2010)

work page 2010
[40]

Ning, J., Kothakonda, M., Furness, J.W., Kaplan, A.D., Ehlert, S., Brandenburg, J.G., Perdew, J.P., Sun, J.: Workhorse minimally empirical dispersion-corrected density functional with tests for weakly bound systems: r 2SCAN+rVV10. Phys. Rev. B106, 075422 (2022)

work page 2022
[41]

Peng, H., Yang, Z.-H., Perdew, J.P., Sun, J.: Versatile van der Waals density 28 functional based on a meta-generalized gradient approximation. Phys. Rev. X6, 041005 (2016)

work page 2016
[42]

Perdew, J.P., Burke, K., Ernzerhof, M.: Generalized gradient approximation made simple. Phys. Rev. Lett.77, 3865 (1996)

work page 1996
[43]

Published as a conference paper at ICLR 2015

Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. Published as a conference paper at ICLR 2015. (2014)

work page 2015
[44]

Huber, P.J.: Robust estimation of a location parameter. Ann. Math. Stat.35, 73 (1964)

work page 1964
[45]

APL Mater.1, 011002 (2013)

Jain, A., Ong, S.P., Hautier, G., Chen, W., Richards, W.D., Dacek, S., Cholia, S., Gunter, D., Skinner, D., Ceder, G., Persson, K.A.: Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Mater.1, 011002 (2013)

work page 2013
[46]

Nature214, 587 (1967)

Frondel, C., Marvin, U.B.: Lonsdaleite, a hexagonal polymorph of diamond. Nature214, 587 (1967)

work page 1967
[47]

Li, Q., Ma, Y., Oganov, A.R., Wang, H., Wang, H., Xu, Y., Cui, T., Mao, H.- K., Zou, G.: Superhard monoclinic polymorph of carbon. Phys. Rev. Lett.102, 175506 (2009)

work page 2009
[48]

Togo, A., Tanaka, I.: First principles phonon calculations in materials science. Scr. Mater.108, 1 (2015)

work page 2015
[49]

Togo, A., Chaput, L., Tadano, T., Tanaka, I.: Implementation strategies in phonopy and phono3py. J. Phys. Condens. Matter35, 353001 (2023)

work page 2023
[50]

Warren, J.L., Yarnell, J.L., Dolling, G., Cowley, R.A.: Lattice dynamics of diamond. Phys. Rev.158, 805 (1967)

work page 1967
[51]

Solin, S.A., Ramdas, A.K.: Raman spectrum of diamond. Phys. Rev. B1, 1687 (1970)

work page 1970
[52]

Leb` egue, S., Harl, J., Gould, T.,´Angy´ an, J.G., Kresse, G., Dobson, J.F.: Cohesive properties and asymptotics of the dispersion interaction in graphite by the random phase approximation. Phys. Rev. Lett.105, 196401 (2010)

work page 2010
[53]

Kotliar, G., Savrasov, S.Y., Haule, K., Oudovenko, V.S., Parcollet, O., Marianetti, C.A.: Electronic structure calculations with dynamical mean-field theory. Rev. Mod. Phys.78, 865 (2006) 29

work page 2006

[1] [1]

Behler, J.: Perspective: Machine learning potentials for atomistic simulations. J. Chem. Phys.145, 170201 (2016)

work page 2016

[2] [2]

Deringer, V.L., Caro, M.A., Cs´ anyi, G.: Machine learning interatomic potentials as emerging tools for materials science. Adv. Mater.31, 1902765 (2019)

work page 2019

[3] [3]

Unke, O.T., Chmiela, S., Sauceda, H.E., Gastegger, M., Poltavsky, I., Sch¨ utt, K.T., Tkatchenko, A., M¨ uller, K.-R.: Machine learning force fields. Chem. Rev. 121, 10142 (2021) 25

work page 2021

[4] [4]

Behler, J., Parrinello, M.: Generalized neural-network representation of high- dimensional potential-energy surfaces. Phys. Rev. Lett.98, 146401 (2007)

work page 2007

[5] [5]

Bart´ ok, A.P., Payne, M.C., Kondor, R., Cs´ anyi, G.: Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons. Phys. Rev. Lett.104, 136403 (2010)

work page 2010

[6] [6]

Bart´ ok, A.P., Kondor, R., Cs´ anyi, G.: On representing chemical environments. Phys. Rev. B87, 184115 (2013)

work page 2013

[7] [7]

Drautz, R.: Atomic cluster expansion for accurate and transferable interatomic potentials. Phys. Rev. B99, 014104 (2019)

work page 2019

[8] [8]

Sch¨ utt, K.T., Sauceda, H.E., Kindermans, P.-J., Tkatchenko, A., M¨ uller, K.-R.: SchNet: A deep learning architecture for molecules and materials. J. Chem. Phys. 148, 241722 (2018)

work page 2018

[9] [9]

Wang, H., Zhang, L., Han, J., E, W.: DeePMD-kit: A deep learning package for many-body potential energy representation and molecular dynamics. Comput. Phys. Commun.228, 178 (2018)

work page 2018

[10] [10]

Batzner, S., Musaelian, A., Sun, L., Geiger, M., Mailoa, J.P., Kornbluth, M., Molinari, N., Smidt, T.E., Kozinsky, B.: E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials. Nat. Commun.13, 2453 (2022)

work page 2022

[11] [11]

In: Advances in Neural Information Processing Systems, vol

Batatia, I., Kov´ acs, D.P., Simm, G.N.C., Ortner, C., Cs´ anyi, G.: MACE: Higher order equivariant message passing neural networks for fast and accurate force fields. In: Advances in Neural Information Processing Systems, vol. 35, p. 11423 (2022)

work page 2022

[12] [12]

Musaelian, A., Batzner, S., Johansson, A., Sun, L., Owen, C.J., Kornbluth, M., Kozinsky, B.: Learning local equivariant representations for large-scale atomistic dynamics. Nat. Commun.14, 579 (2023)

work page 2023

[13] [13]

Batatia, I., Benner, P., Chiang, Y., Elena, A.M., Kov´ acs, D.P., Riebesell, J., Advincula, X.R., Asta, M., Avaylon, M., Baldwin, W.J., et al.: A foundation model for atomistic materials chemistry (2024)

work page 2024

[14] [14]

Deng, B., Zhong, P., Jun, K., Riebesell, J., Han, K., Bartel, C.J., Ceder, G.: CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling. Nat. Mach. Intell.5, 1031 (2023)

work page 2023

[15] [15]

Yang, H., Hu, C., Zhou, Y., Liu, X., Shi, Y., Li, J., Li, G., Chen, Z., Chen, S., Zeni, C., Horton, M., Pinsler, R., Fowler, A., Z¨ ugner, D., Xie, T., Smith, J., Sun, L., Wang, Q., Kong, L., Liu, C., Hao, H., Lu, Z.: MatterSim: A deep learning atomistic model across elements, temperatures and pressures (2024) 26

work page 2024

[16] [16]

Nature624, 80 (2023)

Merchant, A., Batzner, S., Schoenholz, S.S., Aykol, M., Cheon, G., Cubuk, E.D.: Scaling deep learning for materials discovery. Nature624, 80 (2023)

work page 2023

[17] [17]

Releases the Allegro-OAM-L foundation potential; pretrained on OMat24 and fine-tuned on MPtrj+sAlex

Tan, C.W., Descoteaux, M.L., Kotak, M., Miranda Nascimento, G., Kavanagh, S.R., Zichi, L., Wang, M., Saluja, A., Hu, Y.R., Smidt, T., Johansson, A., Witt, W.C., Kozinsky, B., Musaelian, A.: High-performance training and inference for deep equivariant interatomic potentials. Releases the Allegro-OAM-L foundation potential; pretrained on OMat24 and fine-tun...

work page 2025

[18] [18]

Karabin, M., Perez, D.: An entropy-maximization approach to automated training set generation for interatomic potentials. J. Chem. Phys.153, 094110 (2020)

work page 2020

[19] [19]

Zhang, Y., Wang, H., Chen, W., Zeng, J., Zhang, L., Wang, H., E, W.: DP-GEN: A concurrent learning platform for the generation of reliable deep learning based potential energy models. Comput. Phys. Commun.253, 107206 (2020)

work page 2020

[20] [20]

Schran, C., Brezina, K., Marsalek, O.: Committee neural network potentials con- trol generalization errors and enable active learning. J. Chem. Phys.153, 104105 (2020)

work page 2020

[21] [21]

npj Comput

Vandermause, J., Torrisi, S.B., Batzner, S., Xie, Y., Sun, L., Kolpak, A.M., Kozinsky, B.: On-the-fly active learning of interpretable Bayesian force fields for atomistic rare events. npj Comput. Mater.6, 20 (2020)

work page 2020

[22] [22]

npj Comput

Bernstein, N., Cs´ anyi, G., Deringer, V.L.: De novo exploration and self-guided learning of potential-energy surfaces. npj Comput. Mater.5, 99 (2019)

work page 2019

[23] [23]

Kulichenko, M., Barros, K., Lubbers, N., Li, Y.W., Messerly, R., Tretiak, S., Smith, J.S., Nebgen, B.: Uncertainty-driven dynamics for active learning of interatomic potentials. Nat. Comput. Sci.4, 29 (2024)

work page 2024

[24] [24]

Podryabinkin, E.V., Shapeev, A.V.: Active learning of linearly parametrized interatomic potentials. Comput. Mater. Sci.140, 171 (2017)

work page 2017

[25] [25]

Multiscale Model

Shapeev, A.V.: Moment tensor potentials: A class of systematically improvable interatomic potentials. Multiscale Model. Simul.14, 1153 (2016)

work page 2016

[26] [26]

Zhu, L., Amsler, M., Fuhrer, T., Schaefer, B., Faraji, S., Rostami, S., Ghasemi, S.A., Sadeghi, A., Grauzinyte, M., Wolverton, C., Goedecker, S.: A fingerprint based metric for measuring similarities of crystalline structures. J. Chem. Phys. 144, 034203 (2016)

work page 2016

[27] [27]

npj Comput

Subramanyam, A.P.A., Perez, D.: Information-maximization based active learn- ing of interatomic potentials. npj Comput. Mater.11, 218 (2025)

work page 2025

[28] [28]

Barroso-Luque, L., Shuaibi, M., Fu, X., Wood, B.M., Dzamba, M., Gao, M., Rizvi, A., Zitnick, C.L., Ulissi, Z.W.: Open Materials 2024 (OMat24) inorganic 27 materials dataset and models (2024)

work page 2024

[29] [29]

Bussi, G., Donadio, D., Parrinello, M.: Canonical sampling through velocity rescaling. J. Chem. Phys.126, 014101 (2007)

work page 2007

[30] [30]

Hjorth Larsen, A., Jørgen Mortensen, J., Blomqvist, J., Castelli, I.E., Christensen, R., Du lak, M., Friis, J., Groves, M.N., Hammer, B., Hargus, C., Hermes, E.D., Jennings, P.C., Bjerre Jensen, P., Kermode, J., Kitchin, J.R., Leonhard Kolsbjerg, E., Kubal, J., Kaasbjerg, K., Lysgaard, S., Bergmann Maronsson, J., Maxson, T., Olsen, T., Pastewka, L., Peter...

work page 2017

[31] [31]

Liu, D.C., Nocedal, J.: On the limited memory BFGS method for large scale optimization. Math. Program.45, 503 (1989)

work page 1989

[32] [32]

Kresse, G., Furthm¨ uller, J.: Efficient iterative schemes forab initiototal-energy calculations using a plane-wave basis set. Phys. Rev. B54, 11169 (1996)

work page 1996

[33] [33]

Kresse, G., Furthm¨ uller, J.: Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set. Comput. Mater. Sci.6, 15 (1996)

work page 1996

[34] [34]

Bl¨ ochl, P.E.: Projector augmented-wave method. Phys. Rev. B50, 17953 (1994)

work page 1994

[35] [35]

Kresse, G., Joubert, D.: From ultrasoft pseudopotentials to the projector augmented-wave method. Phys. Rev. B59, 1758 (1999)

work page 1999

[36] [36]

Furness, J.W., Kaplan, A.D., Ning, J., Perdew, J.P., Sun, J.: Accurate and numer- ically efficient r2SCAN meta-generalized gradient approximation. J. Phys. Chem. Lett.11, 8208 (2020)

work page 2020

[37] [37]

Sun, J., Ruzsinszky, A., Perdew, J.P.: Strongly constrained and appropriately normed semilocal density functional. Phys. Rev. Lett.115, 036402 (2015)

work page 2015

[38] [38]

Sabatini, R., Gorni, T., Gironcoli, S.: Nonlocal van der Waals density functional made simple and efficient. Phys. Rev. B87, 041108 (2013)

work page 2013

[39] [39]

Vydrov, O.A., Van Voorhis, T.: Nonlocal van der Waals density functional: The simpler the better. J. Chem. Phys.133, 244103 (2010)

work page 2010

[40] [40]

Ning, J., Kothakonda, M., Furness, J.W., Kaplan, A.D., Ehlert, S., Brandenburg, J.G., Perdew, J.P., Sun, J.: Workhorse minimally empirical dispersion-corrected density functional with tests for weakly bound systems: r 2SCAN+rVV10. Phys. Rev. B106, 075422 (2022)

work page 2022

[41] [41]

Peng, H., Yang, Z.-H., Perdew, J.P., Sun, J.: Versatile van der Waals density 28 functional based on a meta-generalized gradient approximation. Phys. Rev. X6, 041005 (2016)

work page 2016

[42] [42]

Perdew, J.P., Burke, K., Ernzerhof, M.: Generalized gradient approximation made simple. Phys. Rev. Lett.77, 3865 (1996)

work page 1996

[43] [43]

Published as a conference paper at ICLR 2015

Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. Published as a conference paper at ICLR 2015. (2014)

work page 2015

[44] [44]

Huber, P.J.: Robust estimation of a location parameter. Ann. Math. Stat.35, 73 (1964)

work page 1964

[45] [45]

APL Mater.1, 011002 (2013)

Jain, A., Ong, S.P., Hautier, G., Chen, W., Richards, W.D., Dacek, S., Cholia, S., Gunter, D., Skinner, D., Ceder, G., Persson, K.A.: Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. APL Mater.1, 011002 (2013)

work page 2013

[46] [46]

Nature214, 587 (1967)

Frondel, C., Marvin, U.B.: Lonsdaleite, a hexagonal polymorph of diamond. Nature214, 587 (1967)

work page 1967

[47] [47]

Li, Q., Ma, Y., Oganov, A.R., Wang, H., Wang, H., Xu, Y., Cui, T., Mao, H.- K., Zou, G.: Superhard monoclinic polymorph of carbon. Phys. Rev. Lett.102, 175506 (2009)

work page 2009

[48] [48]

Togo, A., Tanaka, I.: First principles phonon calculations in materials science. Scr. Mater.108, 1 (2015)

work page 2015

[49] [49]

Togo, A., Chaput, L., Tadano, T., Tanaka, I.: Implementation strategies in phonopy and phono3py. J. Phys. Condens. Matter35, 353001 (2023)

work page 2023

[50] [50]

Warren, J.L., Yarnell, J.L., Dolling, G., Cowley, R.A.: Lattice dynamics of diamond. Phys. Rev.158, 805 (1967)

work page 1967

[51] [51]

Solin, S.A., Ramdas, A.K.: Raman spectrum of diamond. Phys. Rev. B1, 1687 (1970)

work page 1970

[52] [52]

Leb` egue, S., Harl, J., Gould, T.,´Angy´ an, J.G., Kresse, G., Dobson, J.F.: Cohesive properties and asymptotics of the dispersion interaction in graphite by the random phase approximation. Phys. Rev. Lett.105, 196401 (2010)

work page 2010

[53] [53]

Kotliar, G., Savrasov, S.Y., Haule, K., Oudovenko, V.S., Parcollet, O., Marianetti, C.A.: Electronic structure calculations with dynamical mean-field theory. Rev. Mod. Phys.78, 865 (2006) 29

work page 2006