pith. sign in

arxiv: 2503.10471 · v2 · submitted 2025-03-13 · ❄️ cond-mat.mtrl-sci · cs.AI

Siamese Foundation Models for Crystal Structure Prediction

Pith reviewed 2026-05-23 00:08 UTC · model grok-4.3

classification ❄️ cond-mat.mtrl-sci cs.AI
keywords crystal structure predictiondiffusion modelsSiamese foundation modelsmaterials discoveryenergy predictionsuperconductorsstructure generation
0
0 comments X

The pith

Pretrained Siamese foundation models generate crystal structures from composition that match experiments at 100 percent with 0.0012 atomic-position error while running over 2000 times faster than DFT.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents DAO as a pretrain-finetune framework that pairs a diffusion-based structure generator with an energy predictor, both built as Siamese foundation models. The generator is pretrained on large collections of stable and unstable structures, with the predictor used to relax unstable outputs during sampling. This combination improves results on standard benchmarks across multiple architectures and delivers exact experimental matches on three real superconductors. A reader would care because conventional DFT-based prediction is too slow for broad materials exploration, so a fast, generalizable alternative could expand the set of testable compositions.

Core claim

DAO integrates a diffusion-based structure generator and an energy predictor as Siamese foundation models. The generator is pretrained via a two-stage pipeline on a vast dataset of stable and unstable structures, with the predictor relaxing unstable configurations to guide generative sampling. Across benchmarks pretraining boosts performance on multiple backbones, and ablation studies confirm mutual benefit between the two models. On the real superconductor Cr6Os2 the method reaches 100 percent match with experimental references and 0.0012 atomic-position error under 20-shot generation, more than 2000 times faster per iteration than DFT-based predictors.

What carries the argument

The DAO framework of Siamese diffusion generator and energy predictor, where the predictor relaxes unstable structures to steer generative sampling.

If this is right

  • Pretraining on stable and unstable data improves prediction accuracy across multiple backbone architectures on standard benchmarks.
  • Ablation studies show the generator and predictor mutually reinforce each other.
  • The same models reach 100 percent experimental match rate and 0.0012 position error on Cr6Os2 and comparable results on two other superconductors.
  • Generation runs over 2000 times faster per iteration than DFT-based structure predictors.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The approach could be applied to screen far larger numbers of compositions for candidate materials before any DFT run.
  • If the predictor generalizes, it might replace some relaxation steps inside existing high-throughput workflows.
  • The same pretraining pattern could be tested on structure prediction tasks that involve temperature or external fields.
  • Failure on a held-out real material would indicate the need for larger or more diverse pretraining sets.

Load-bearing premise

Models pretrained on the collection of stable and unstable structures will generalize accurately to real-world materials outside the training distribution.

What would settle it

A new composition whose experimentally determined structure differs substantially from any structure generated by the pretrained DAO model under the reported sampling protocol.

Figures

Figures reproduced from arXiv: 2503.10471 by Fuchun Sun, Hao Sun, Jianxing Huang, Jirong Wen, Liming Wu, Liwei Liu, Rui Jiao, Wenbing Huang, Yang Liu, Yipeng Zhou, Yuxiang Ren.

Figure 1
Figure 1. Figure 1: A summary of our models: (a) offers an overview of the structure generator (DAO-G) and the energy predictor (DAO-P). (a.1) outlines the pretrain-finetune framework. DAO-G conducts a two-stage pretraining process on CrysDB and DAO-P is pretrained on the same dataset. DAO-P enhances DAO-G by dataset relaxation and energy guidance. (a.2) illustrates the pretraining of DAO-P, which involves the diffusion-based… view at source ↗
Figure 2
Figure 2. Figure 2: Statistics of the pretraining dataset CrysDB: (a) shows the global analyses of the dataset, including the number of entries from MP and OQMD, the statistics of the deduplicated version, and the propotion of stable structures. (b) reports the distributions of Ehull, volume and atom number. (c) presents the elements coverage. It is important to note that the statistics presented in (b) and (c) refer to the d… view at source ↗
Figure 3
Figure 3. Figure 3: In-depth analyses of our models on the CSP benchmarks: (a) compares the performance of DAO-G across various configurations. Here, “stage I, Stable” refers to pretraining on the stable-only subset of the deduplicated CrysDB, while “stage I” denotes the first-stage pretraining on the full deduplicated CrysDB. (b) gathers the polymorphs (with 2 to 4 conformations) from MP-20, and subsequently compares the gen… view at source ↗
Figure 4
Figure 4. Figure 4: The performance of DAO-P for crystal property prediction is evaluated on eight datasets. The compared baselines include models both with and without pretraining, with the results directly taken from their respective papers. For baselines where the corresponding experiments were not conducted in the original paper, the results are denoted as N/A. significant energy reduction of 86.8%, decreasing from 0.3198… view at source ↗
Figure 5
Figure 5. Figure 5: Experiments on superconductors: (a) depicts the finetuning process of DAO-P and DAO-G on the SuperCon3D dataset [11], in which 3D structures are known for a subset of materials. (b) presents the distributions of the critical temperature (Tc). (c) displays the Tc prediction error evaluated with the 5-fold cross-validation setting. (d) shows the results of the three recently discovered real-world superconduc… view at source ↗
Figure 6
Figure 6. Figure 6: An illustration of the structure relaxed by DFT and the structure generated by DAO-G, for the superconductor CsV3Sb5 [23]. fractional coordinates deviation. Only four out of ten runs of CsV3Sb5 [23] succeeds, with the best achieving an RMSE of 0.0637 compared to DAO-G’s 0.0085. The visualization results are depicted in [PITH_FULL_IMAGE:figures/full_fig_p030_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: The illustration of energy guidance process. The blue arrow represents standard denoising based on the data distribution, which, however, not lies within stable regions. The brown arrow indicates the influence of energy guidance, steering the generation towards the equilibrium distribution. The resulting energy-guided denoising is depicted by the green arrow. 2.5 Pretraining Dataset Deduplication When pret… view at source ↗
Figure 8
Figure 8. Figure 8: Visualization of the generated structures by DAO-G throughout the diffusion process. We show representative structures at timesteps 1000, 750, 500, 250, and 0. The structures at timestep 0 represent the final generated samples, which are well-aligned with the corresponding ground truth structures. To enhance visual clarity and facilitate comparison, a common atom within each group (represented by a row) ha… view at source ↗
read the original abstract

Predicting crystal structures from chemical compositions is a fundamental challenge in materials discovery, complicated by complex 3D geometries that distinguish it from fields like protein folding. Here, we present Diffusion-based Crystal Omni (DAO), a pretrain-finetune framework for crystal structure prediction integrating two Siamese foundation models: a structure generator and an energy predictor. The generator is pretrained via a two-stage pipeline on a vast dataset of stable and unstable structures, leveraging the predictor to relax unstable configurations and guide the generative sampling. Across two well-known benchmarks, pretraining significantly enhances performance across multiple backbone architectures. Ablation studies confirm that the synergy between the generator and predictor mutually benefits both components. We further validate DAO on three real-world superconductors ($\text{Cr}_6\text{Os}_2$, $\text{Zr}_{16}\text{Rh}_8\text{O}_4$, and $\text{Zr}_{16}\text{Pd}_8\text{O}_4$) typically inaccessible to conventional computation. For $\text{Cr}_6\text{Os}_2$, DAO achieves a 100\% match rate with experimental references and an atomic-position error of 0.0012 under 20-shot generation, performing over 2000$\times$ faster per iteration than DFT-based structure predictors. These compelling results collectively highlight the potential of our approach for advancing materials science research.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript presents Diffusion-based Crystal Omni (DAO), a pretrain-finetune framework that integrates two Siamese foundation models—a structure generator pretrained via a two-stage pipeline on stable and unstable structures and an energy predictor used to relax unstable configurations during generative sampling. The paper claims that pretraining enhances performance across multiple backbone architectures on two benchmarks, that ablation studies confirm mutual benefits between generator and predictor, and that the approach achieves 100% match rate with experimental references and 0.0012 atomic-position error for Cr6Os2 (plus results on two Zr-based superconductors) under 20-shot generation while being over 2000× faster per iteration than DFT-based predictors.

Significance. If the generalization claims hold, the work could meaningfully accelerate materials discovery for complex compositions by replacing expensive DFT relaxations with fast learned sampling and relaxation. The two-stage pretraining on both stable and unstable structures plus the Siamese predictor-generator coupling is a concrete technical idea whose value would be established by the reported benchmark gains and real-world matches.

major comments (2)
  1. [Results on real-world validation] Results section on real-world superconductors: the 100% match rate and 0.0012 position error for Cr6Os2 under 20-shot generation is offered as evidence that the framework works on materials “typically inaccessible to conventional computation,” yet no overlap statistics between the three test compositions and the pretraining distribution, no leave-one-family-out protocol, and no analysis of predictor behavior on out-of-manifold proposals are supplied; without these the numerical result cannot confirm the claimed extrapolation.
  2. [Methods on pretraining pipeline] Methods on the two-stage pretraining pipeline: the claim that the predictor “guides the generative sampling” by relaxing unstable configurations is central to the synergy argument, but the manuscript provides no quantitative characterization (e.g., predictor error distribution or success rate) of how the predictor behaves when the generator proposes structures far from the pretraining manifold.
minor comments (2)
  1. [Abstract] Abstract: the two “well-known benchmarks” are never named; this information should appear in the first paragraph of the results or methods.
  2. [Abstract] Abstract and methods: dataset sizes, exact model architectures, and training hyperparameters are omitted, which impedes immediate assessment of reproducibility even if the central claims are later supported.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below and indicate where revisions will be made.

read point-by-point responses
  1. Referee: [Results on real-world validation] Results section on real-world superconductors: the 100% match rate and 0.0012 position error for Cr6Os2 under 20-shot generation is offered as evidence that the framework works on materials “typically inaccessible to conventional computation,” yet no overlap statistics between the three test compositions and the pretraining distribution, no leave-one-family-out protocol, and no analysis of predictor behavior on out-of-manifold proposals are supplied; without these the numerical result cannot confirm the claimed extrapolation.

    Authors: We agree that overlap statistics would better contextualize the results. In the revised manuscript we will add compositional similarity metrics (e.g., element-frequency overlap and space-group distribution) between Cr6Os2, Zr16Rh8O4, Zr16Pd8O4 and the pretraining set. A full leave-one-family-out protocol is not part of the standard benchmarks used in the field and would require new large-scale experiments; we will instead expand the discussion of chemical-family membership. For predictor behavior on out-of-manifold proposals, the two-stage pretraining on unstable structures is intended to improve robustness; we will add a quantitative error-distribution analysis on the generated real-world samples. revision: partial

  2. Referee: [Methods on pretraining pipeline] Methods on the two-stage pretraining pipeline: the claim that the predictor “guides the generative sampling” by relaxing unstable configurations is central to the synergy argument, but the manuscript provides no quantitative characterization (e.g., predictor error distribution or success rate) of how the predictor behaves when the generator proposes structures far from the pretraining manifold.

    Authors: We accept that additional quantitative detail is needed. The revised methods section will report the predictor’s error distribution and relaxation success rate on structures proposed by the generator during sampling, including cases distant from the pretraining manifold, using statistics collected from the ablation experiments already performed. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical results on external benchmarks

full rationale

The paper describes a pretrain-finetune ML framework evaluated on standard benchmarks and three external experimental compositions (Cr6Os2 etc.). No equations, derivations, or first-principles steps are presented that reduce claimed performance metrics to fitted parameters or self-referential definitions by construction. Ablation studies and match-rate numbers are reported against held-out or real-world references, keeping the derivation chain independent of its own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. The framework implicitly relies on standard assumptions of diffusion models and energy predictors but these are not detailed.

pith-pipeline@v0.9.0 · 5798 in / 1161 out tokens · 38850 ms · 2026-05-23T00:08:03.020897+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

83 extracted references · 83 canonical work pages · 8 internal anchors

  1. [1]

    https://doi.org/10.48505/nims.3739, 2022

    Supercon database. https://doi.org/10.48505/nims.3739, 2022

  2. [2]

    Accurate structure prediction of biomolecular interactions with alphafold 3.Nature, 630(8016):493–500, 2024

    Josh Abramson, Jonas Adler, Jack Dunger, Richard Evans, Tim Green, Alexander Pritzel, Olaf Ronneberger, Lindsay Willmore, Andrew J Ballard, Joshua Bambrick, et al. Accurate structure prediction of biomolecular interactions with alphafold 3.Nature, 630(8016):493–500, 2024

  3. [3]

    GPT-4 Technical Report

    Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, et al. Gpt-4 technical report. arXiv preprint arXiv:2303.08774, 2023

  4. [4]

    Equivariant energy-guided sde for inverse molecular design

    Fan Bao, Min Zhao, Zhongkai Hao, Peiyao Li, Chongxuan Li, and Jun Zhu. Equivariant energy-guided sde for inverse molecular design. InThe eleventh international conference on learning representations, 2022

  5. [5]

    Microscopic theory of superconductivity

    John Bardeen, Leon N Cooper, and J Robert Schrieffer. Microscopic theory of superconductivity. Physical Review, 106(1):162, 1957

  6. [6]

    A foundation model for atomistic materials chemistry

    Ilyes Batatia, Philipp Benner, Yuan Chiang, Alin M Elena, Dávid P Kovács, Janosh Riebesell, Xavier R Advincula, Mark Asta, Matthew Avaylon, William J Baldwin, et al. A foundation model for atomistic materials chemistry.arXiv preprint arXiv:2401.00096, 2023

  7. [7]

    The inorganic crystal structure data base

    Guenter Bergerhoff, R Hundt, R Sievers, and ID Brown. The inorganic crystal structure data base. Journal of chemical information and computer sciences, 23(2):66–69, 1983. 19/25

  8. [8]

    On the Opportunities and Risks of Foundation Models

    Rishi Bommasani, Drew A Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, et al. On the opportunities and risks of foundation models.arXiv preprint arXiv:2108.07258, 2021

  9. [9]

    Language models are few-shot learners.Advances in neural information processing systems, 33:1877–1901, 2020

    Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners.Advances in neural information processing systems, 33:1877–1901, 2020

  10. [10]

    Graph networks as a universal machine learning framework for molecules and crystals.Chemistry of Materials, 31 (9):3564–3572, 2019

    Chi Chen, Weike Ye, Yunxing Zuo, Chen Zheng, and Shyue Ping Ong. Graph networks as a universal machine learning framework for molecules and crystals.Chemistry of Materials, 31 (9):3564–3572, 2019

  11. [11]

    Learning superconductivity from ordered and disordered material structures

    Pin Chen, Luoxuan Peng, Rui Jiao, Qing Mo, Zhen Wang, Wenbing Huang, Yang Liu, and Yutong Lu. Learning superconductivity from ordered and disordered material structures. Advances in Neural Information Processing Systems, 37:108902–108928, 2025

  12. [12]

    Atomistic line graph neural network for improved materials property predictions.npj Computational Materials, 7(1):185, 2021

    Kamal Choudhary and Brian DeCost. Atomistic line graph neural network for improved materials property predictions.npj Computational Materials, 7(1):185, 2021

  13. [13]

    High-throughput identification and characterization of two-dimensional materials using density functional theory

    Kamal Choudhary, Irina Kalish, Ryan Beams, and Francesca Tavazza. High-throughput identification and characterization of two-dimensional materials using density functional theory. Scientific reports, 7(1):5179, 2017

  14. [14]

    The joint automated repository for various integrated simulations (jarvis) for data-driven materials design.npj computational materials, 6(1):173, 2020

    Kamal Choudhary, Kevin F Garrity, Andrew CE Reid, Brian DeCost, Adam J Biacchi, Angela R Hight Walker, Zachary Trautt, Jason Hattrick-Simpers, A Gilad Kusne, Andrea Centrone, et al. The joint automated repository for various integrated simulations (jarvis) for data-driven materials design.npj computational materials, 6(1):173, 2020

  15. [15]

    Jarvis-leaderboard: a large scale benchmark of materials design methods.npj Computational Materials, 10(1):93, 2024

    Kamal Choudhary, Daniel Wines, Kangming Li, Kevin F Garrity, Vishu Gupta, Aldo H Romero, Jaron T Krogel, Kayahan Saritas, Addis Fuhr, Panchapakesan Ganesh, et al. Jarvis-leaderboard: a large scale benchmark of materials design methods.npj Computational Materials, 10(1):93, 2024

  16. [16]

    Crystal stability and the theory of ferroelectricity.Advances in Physics, 9(36): 387–423, 1960

    W Cochran. Crystal stability and the theory of ferroelectricity.Advances in Physics, 9(36): 387–423, 1960

  17. [17]

    3-d inorganic crystal structure generation and property prediction via representation learning.Journal of Chemical Information and Modeling, 60(10):4518–4535, 2020

    Callum J Court, Batuhan Yildirim, Apoorv Jain, and Jacqueline M Cole. 3-d inorganic crystal structure generation and property prediction via representation learning.Journal of Chemical Information and Modeling, 60(10):4518–4535, 2020

  18. [18]

    Crysgnn: Distilling pre-trained knowledge to enhance property prediction for crystalline materials

    Kishalay Das, Bidisha Samanta, Pawan Goyal, Seung-Cheol Lee, Satadeep Bhattacharjee, and Niloy Ganguly. Crysgnn: Distilling pre-trained knowledge to enhance property prediction for crystalline materials. InProceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 7323–7331, 2023

  19. [19]

    Riemannian score-based generative modelling.Advances in Neural Information Processing Systems, 35:2406–2422, 2022

    Valentin De Bortoli, Emile Mathieu, Michael Hutchinson, James Thornton, Yee Whye Teh, and Arnaud Doucet. Riemannian score-based generative modelling.Advances in Neural Information Processing Systems, 35:2406–2422, 2022. 20/25

  20. [20]

    Charting the complete elastic properties of inorganic crystalline compounds.Scientific data, 2 (1):1–13, 2015

    Maarten De Jong, Wei Chen, Thomas Angsten, Anubhav Jain, Randy Notestine, Anthony Gamst, Marcel Sluiter, Chaitanya Krishna Ande, Sybrand Van Der Zwaag, Jose J Plata, et al. Charting the complete elastic properties of inorganic crystalline compounds.Scientific data, 2 (1):1–13, 2015

  21. [21]

    Cryptic crystallography.Nature materials, 1(2):77–79, 2002

    Gautam R Desiraju. Cryptic crystallography.Nature materials, 1(2):77–79, 2002

  22. [22]

    Benchmarking materials property prediction methods: the matbench test set and automatminer reference algorithm

    Alexander Dunn, Qi Wang, Alex Ganose, Daniel Dopp, and Anubhav Jain. Benchmarking materials property prediction methods: the matbench test set and automatminer reference algorithm. npj Computational Materials, 6(1):138, 2020

  23. [23]

    Charge-4 e and charge-6 e flux quantization and higher charge superconductivity in kagome superconductor ring devices.Physical Review X, 14(2):021025, 2024

    Jun Ge, Pinyuan Wang, Ying Xing, Qiangwei Yin, Anqi Wang, Jie Shen, Hechang Lei, Ziqiang Wang, and Jian Wang. Charge-4 e and charge-6 e flux quantization and higher charge superconductivity in kagome superconductor ring devices.Physical Review X, 14(2):021025, 2024

  24. [24]

    Inverse design of 3d molecular structures with conditional generative neural networks

    Niklas WA Gebauer, Michael Gastegger, Stefaan SP Hessmann, Klaus-Robert Müller, and Kristof T Schütt. Inverse design of 3d molecular structures with conditional generative neural networks. Nature communications, 13(1):973, 2022

  25. [25]

    Quantum espresso: a modular and open-source software project for quantumsimulations of materials

    Paolo Giannozzi, Stefano Baroni, Nicola Bonini, Matteo Calandra, Roberto Car, Carlo Cavaz- zoni, Davide Ceresoli, Guido L Chiarotti, Matteo Cococcioni, Ismaila Dabo, et al. Quantum espresso: a modular and open-source software project for quantumsimulations of materials. Journal of physics: Condensed matter, 21(39):395502, 2009

  26. [26]

    Advanced capabilities for materials modelling with quantum espresso.Journal of physics: Condensed matter, 29(46):465901, 2017

    Paolo Giannozzi, Oliviero Andreussi, Thomas Brumme, Oana Bunau, M Buongiorno Nardelli, Matteo Calandra, Roberto Car, Carlo Cavazzoni, Davide Ceresoli, Matteo Cococcioni, et al. Advanced capabilities for materials modelling with quantum espresso.Journal of physics: Condensed matter, 29(46):465901, 2017

  27. [27]

    Deep residual learning for image recognition

    Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016

  28. [28]

    Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

    Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

  29. [29]

    Video diffusion models.Advances in Neural Information Processing Systems, 35:8633–8646, 2022

    Jonathan Ho, Tim Salimans, Alexey Gritsenko, William Chan, Mohammad Norouzi, and David J Fleet. Video diffusion models.Advances in Neural Information Processing Systems, 35:8633–8646, 2022

  30. [30]

    Equivariant diffusion for molecule generation in 3d

    Emiel Hoogeboom, Vıctor Garcia Satorras, Clément Vignac, and Max Welling. Equivariant diffusion for molecule generation in 3d. InInternational conference on machine learning, pages 8867–8887. PMLR, 2022

  31. [31]

    Distance matrix-based crystal structure prediction using evolutionary algorithms.The Journal of Physical Chemistry A, 124(51):10909–10919, 2020

    Jianjun Hu, Wenhui Yang, and Edirisuriya M Dilanga Siriwardane. Distance matrix-based crystal structure prediction using evolutionary algorithms.The Journal of Physical Chemistry A, 124(51):10909–10919, 2020. 21/25

  32. [32]

    Mdm: Molecular diffusion model for 3d molecule generation

    Lei Huang, Hengtong Zhang, Tingyang Xu, and Ka-Chun Wong. Mdm: Molecular diffusion model for 3d molecule generation. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 5105–5112, 2023

  33. [33]

    The materials project: a materials genome approach to accelerating materials innovation

    A Jain, SP Ong, G Hautier, W Chen, WD Richards, S Dacek, S Cholia, D Gunter, D Skinner, G Ceder, et al. The materials project: a materials genome approach to accelerating materials innovation. apl mater 1: 011002, 2013

  34. [34]

    Planning with diffusion for flexible behavior synthesis

    Michael Janner, Yilun Du, Joshua Tenenbaum, and Sergey Levine. Planning with diffusion for flexible behavior synthesis. InInternational Conference on Machine Learning, pages 9902–9915. PMLR, 2022

  35. [35]

    Crystal structure prediction by joint equivariant diffusion.Advances in Neural Information Processing Systems, 36, 2024

    Rui Jiao, Wenbing Huang, Peijia Lin, Jiaqi Han, Pin Chen, Yutong Lu, and Yang Liu. Crystal structure prediction by joint equivariant diffusion.Advances in Neural Information Processing Systems, 36, 2024

  36. [36]

    Space group constrained crystal generation

    Rui Jiao, Wenbing Huang, Yu Liu, Deli Zhao, and Yang Liu. Space group constrained crystal generation. In The Twelfth International Conference on Learning Representations, 2024

  37. [37]

    Highly accurate protein structure prediction with alphafold.nature, 596(7873):583–589, 2021

    John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ron- neberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, et al. Highly accurate protein structure prediction with alphafold.nature, 596(7873):583–589, 2021

  38. [38]

    The open quantum materials database (oqmd): assessing the accuracy of dft formation energies.npj Computational Materials, 1(1):1–15, 2015

    Scott Kirklin, James E Saal, Bryce Meredig, Alex Thompson, Jeff W Doak, Muratahan Aykol, Stephan Rühl, and Chris Wolverton. The open quantum materials database (oqmd): assessing the accuracy of dft formation energies.npj Computational Materials, 1(1):1–15, 2015

  39. [39]

    Self-consistent equations including exchange and correlation effects

    Walter Kohn and Lu Jeu Sham. Self-consistent equations including exchange and correlation effects. Physical review, 140(4A):A1133, 1965

  40. [40]

    Graph contrastive learning for materials.arXiv preprint arXiv:2211.13408, 2022

    Teddy Koker, Keegan Quigley, Will Spaeth, Nathan C Frey, and Lin Li. Graph contrastive learning for materials.arXiv preprint arXiv:2211.13408, 2022

  41. [41]

    Quantum computers.nature, 464(7285):45–53, 2010

    Thaddeus D Ladd, Fedor Jelezko, Raymond Laflamme, Yasunobu Nakamura, Christopher Monroe, and Jeremy Lloyd O’Brien. Quantum computers.nature, 464(7285):45–53, 2010

  42. [42]

    From fundamental studies of reactivity on single crystals to the design of catalysts.Surface Science Reports, 35(5-8):163–222, 1999

    Jane H Larsen and Ib Chorkendorff. From fundamental studies of reactivity on single crystals to the design of catalysts.Surface Science Reports, 35(5-8):163–222, 1999

  43. [43]

    Statistical physics, course of theoretical physics

    EM Lifshitz and LP Pitaevskii. Statistical physics, course of theoretical physics. InPart 2: Theory of the Condensed State, volume 9. Butterworth-Heinemann Pergamon, London, 1980

  44. [44]

    Equivariant diffusion for crystal structure prediction

    Peijia Lin, Pin Chen, Rui Jiao, Qing Mo, Cen Jianhuan, Wenbing Huang, Yang Liu, Dan Huang, and Yutong Lu. Equivariant diffusion for crystal structure prediction. InForty-first International Conference on Machine Learning, 2024

  45. [45]

    Efficient ap- proximations of complete interatomic potentials for crystal property prediction

    Yuchao Lin, Keqiang Yan, Youzhi Luo, Yi Liu, Xiaoning Qian, and Shuiwang Ji. Efficient ap- proximations of complete interatomic potentials for crystal property prediction. InInternational Conference on Machine Learning, pages 21260–21287. PMLR, 2023. 22/25

  46. [46]

    Flow Matching for Generative Modeling

    Yaron Lipman, Ricky TQ Chen, Heli Ben-Hamu, Maximilian Nickel, and Matt Le. Flow matching for generative modeling.arXiv preprint arXiv:2210.02747, 2022

  47. [47]

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, et al. Deepseek-v2: A strong, economical, and efficient mixture-of-experts language model.arXiv preprint arXiv:2405.04434, 2024

  48. [48]

    DeepSeek-V3 Technical Report

    Aixin Liu, Bei Feng, Bing Xue, Bingxuan Wang, Bochao Wu, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, et al. Deepseek-v3 technical report.arXiv preprint arXiv:2412.19437, 2024

  49. [49]

    On the limited memory bfgs method for large scale optimization

    Dong C Liu and Jorge Nocedal. On the limited memory bfgs method for large scale optimization. Mathematical programming, 45(1):503–528, 1989

  50. [50]

    Contrastive energy prediction for exact energy-guided diffusion sampling in offline reinforcement learning

    Cheng Lu, Huayu Chen, Jianfei Chen, Hang Su, Chongxuan Li, and Jun Zhu. Contrastive energy prediction for exact energy-guided diffusion sampling in offline reinforcement learning. In International Conference on Machine Learning, pages 22825–22855. PMLR, 2023

  51. [51]

    One transformer can understand both 2d & 3d molecular data

    Shengjie Luo, Tianlang Chen, Yixian Xu, Shuxin Zheng, Tie-Yan Liu, Liwei Wang, and Di He. One transformer can understand both 2d & 3d molecular data. InThe Eleventh International Conference on Learning Representations, 2022

  52. [52]

    Crystalflow: A flow-based generative model for crystalline materials.arXiv preprint arXiv:2412.11693, 2024

    Xiaoshan Luo, Zhenyu Wang, Jian Lv, Lei Wang, Yanchao Wang, and Yanming Ma. Crystalflow: A flow-based generative model for crystalline materials.arXiv preprint arXiv:2412.11693, 2024

  53. [53]

    New developments in evolutionary structure prediction algorithm uspex.Computer Physics Communications, 184 (4):1172–1182, 2013

    Andriy O Lyakhov, Artem R Oganov, Harold T Stokes, and Qiang Zhu. New developments in evolutionary structure prediction algorithm uspex.Computer Physics Communications, 184 (4):1172–1182, 2013

  54. [54]

    Crystal twins: self-supervised learning for crystalline material property prediction.npj Computational Materials, 8(1):231, 2022

    Rishikesh Magar, Yuyang Wang, and Amir Barati Farimani. Crystal twins: self-supervised learning for crystalline material property prediction.npj Computational Materials, 8(1):231, 2022

  55. [55]

    Scaling deep learning for materials discovery.Nature, 624(7990):80–85, 2023

    Amil Merchant, Simon Batzner, Samuel S Schoenholz, Muratahan Aykol, Gowoon Cheon, and Ekin Dogus Cubuk. Scaling deep learning for materials discovery.Nature, 624(7990):80–85, 2023

  56. [56]

    Benjamin Kurt Miller, Ricky T. Q. Chen, Anuroop Sriram, and Brandon M Wood. FlowMM: Generating materials with riemannian flow matching. InForty-first International Conference on Machine Learning, 2024

  57. [57]

    Self-supervised learning for crystal property prediction via denoising

    Alexander New, Nam Q Le, Michael Pekala, and Christopher D Stiles. Self-supervised learning for crystal property prediction via denoising. InICML 2024 AI for Science Workshop, 2021

  58. [58]

    CrystalGAN: Learning to Discover Crystallographic Structures with Generative Adversarial Networks

    Asma Nouira, Nataliya Sokolovska, and Jean-Claude Crivello. Crystalgan: learning to discover crystallographic structures with generative adversarial networks. arXiv preprint arXiv:1810.11203, 2018

  59. [59]

    Crystal structure prediction using ab initio evolutionary techniques: Principles and applications.The Journal of chemical physics, 124(24), 2006

    Artem R Oganov and Colin W Glass. Crystal structure prediction using ab initio evolutionary techniques: Principles and applications.The Journal of chemical physics, 124(24), 2006. 23/25

  60. [60]

    Python materials genomics (pymatgen): A robust, open-source python library for materials analysis

    Shyue Ping Ong, William Davidson Richards, Anubhav Jain, Geoffroy Hautier, Michael Kocher, Shreyas Cholia, Dan Gunter, Vincent L Chevrier, Kristin A Persson, and Gerbrand Ceder. Python materials genomics (pymatgen): A robust, open-source python library for materials analysis. Computational Materials Science, 68:314–319, 2013

  61. [61]

    Prediction of crystal structures from crystal chemistry rules by simulated annealing.Nature, 346(6282): 343–345, 1990

    Jean Pannetier, J Bassas-Alsina, Juan Rodriguez-Carvajal, and Vincent Caignaert. Prediction of crystal structures from crystal chemistry rules by simulated annealing.Nature, 346(6282): 343–345, 1990

  62. [62]

    Pytorch: An imperative style, high-performance deep learning library

    Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32, 2019

  63. [63]

    High-throughput screening of inorganic compounds for the discovery of novel dielectric and optical materials

    Ioannis Petousis, David Mrdjenovich, Eric Ballouz, Miao Liu, Donald Winston, Wei Chen, Tanja Graf, Thomas D Schladt, Kristin A Persson, and Fritz B Prinz. High-throughput screening of inorganic compounds for the discovery of novel dielectric and optical materials. Scientific data, 4(1):1–12, 2017

  64. [64]

    Ab initio random structure searching.Journal of Physics: Condensed Matter, 23(5):053201, 2011

    Chris J Pickard and RJ Needs. Ab initio random structure searching.Journal of Physics: Condensed Matter, 23(5):053201, 2011

  65. [65]

    Schnet: A continuous-filter convolutional neural network for modeling quantum interactions.Advances in neural information processing systems, 30, 2017

    Kristof Schütt, Pieter-Jan Kindermans, Huziel Enoc Sauceda Felix, Stefan Chmiela, Alexandre Tkatchenko, and Klaus-Robert Müller. Schnet: A continuous-filter convolutional neural network for modeling quantum interactions.Advances in neural information processing systems, 30, 2017

  66. [66]

    Improved protein structure prediction using potentials from deep learning.Nature, 577(7792):706–710, 2020

    Andrew W Senior, Richard Evans, John Jumper, James Kirkpatrick, Laurent Sifre, Tim Green, Chongli Qin, Augustin Žídek, Alexander WR Nelson, Alex Bridgland, et al. Improved protein structure prediction using potentials from deep learning.Nature, 577(7792):706–710, 2020

  67. [67]

    Maximum likelihood training of score-based diffusion models.Advances in neural information processing systems, 34:1415–1428, 2021

    Yang Song, Conor Durkan, Iain Murray, and Stefano Ermon. Maximum likelihood training of score-based diffusion models.Advances in neural information processing systems, 34:1415–1428, 2021

  68. [68]

    Attention is all you need.Advances in Neural Information Processing Systems, 2017

    A Vaswani. Attention is all you need.Advances in Neural Information Processing Systems, 2017

  69. [69]

    Energy landscapes and structure prediction using basin-hopping.Modern Methods of Crystal Structure Prediction, pages 29–54, 2010

    David J Wales. Energy landscapes and structure prediction using basin-hopping.Modern Methods of Crystal Structure Prediction, pages 29–54, 2010

  70. [70]

    Organic semiconductor crystals

    Chengliang Wang, Huanli Dong, Lang Jiang, and Wenping Hu. Organic semiconductor crystals. Chemical Society Reviews, 47(2):422–500, 2018

  71. [71]

    Protein conformation generation via force-guided se (3) diffusion models

    Lihao Wang, Yuning Shen, Yiqun Wang, Huizhuo Yuan, Yue Wu, Quanquan Gu, et al. Protein conformation generation via force-guided se (3) diffusion models. InForty-first International Conference on Machine Learning, 2024

  72. [72]

    Calypso: A method for crystal structure prediction

    Yanchao Wang, Jian Lv, Li Zhu, and Yanming Ma. Calypso: A method for crystal structure prediction. Computer Physics Communications, 183(10):2063–2070, 2012. 24/25

  73. [73]

    Observation of superconductivity and enhanced upper critical field ofη-carbide-type oxide zr4pd2o

    Yuto Watanabe, Akira Miura, Chikako Moriyoshi, Aichi Yamashita, and Yoshikazu Mizuguchi. Observation of superconductivity and enhanced upper critical field ofη-carbide-type oxide zr4pd2o. Scientific Reports, 13(1):22458, 2023

  74. [74]

    Cambridge university press, 1991

    David Williams.Probability with martingales. Cambridge university press, 1991

  75. [75]

    Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties.Physical review letters, 120(14):145301, 2018

    Tian Xie and Jeffrey C Grossman. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties.Physical review letters, 120(14):145301, 2018

  76. [76]

    Jaakkola

    Tian Xie, Xiang Fu, Octavian-Eugen Ganea, Regina Barzilay, and Tommi S. Jaakkola. Crystal diffusion variational autoencoder for periodic material generation. InInternational Conference on Learning Representations, 2022

  77. [77]

    On layer normalization in the transformer architecture

    Ruibin Xiong, Yunchang Yang, Di He, Kai Zheng, Shuxin Zheng, Chen Xing, Huishuai Zhang, Yanyan Lan, Liwei Wang, and Tieyan Liu. On layer normalization in the transformer architecture. InInternational Conference on Machine Learning, pages 10524–10533. PMLR, 2020

  78. [78]

    Periodic graph transformers for crystal material property prediction.Advances in Neural Information Processing Systems, 35:15066– 15080, 2022

    Keqiang Yan, Yi Liu, Yuchao Lin, and Shuiwang Ji. Periodic graph transformers for crystal material property prediction.Advances in Neural Information Processing Systems, 35:15066– 15080, 2022

  79. [79]

    MatterSim: A Deep Learning Atomistic Model Across Elements, Temperatures and Pressures

    Han Yang, Chenxi Hu, Yichi Zhou, Xixian Liu, Yu Shi, Jielan Li, Guanzhi Li, Zekun Chen, Shuizhou Chen, Claudio Zeni, et al. Mattersim: A deep learning atomistic model across elements, temperatures and pressures.arXiv preprint arXiv:2405.04967, 2024

  80. [80]

    A crystal-specific pre-training framework for crystal material property prediction.arXiv preprint arXiv:2306.05344, 2023

    Haomin Yu, Yanru Song, Jilin Hu, Chenjuan Guo, and Bin Yang. A crystal-specific pre-training framework for crystal material property prediction.arXiv preprint arXiv:2306.05344, 2023

Showing first 80 references.