pith. sign in

arxiv: 2604.01017 · v2 · submitted 2026-04-01 · ❄️ cond-mat.mtrl-sci · physics.comp-ph

Parameter-Efficient Fine-Tuning of Machine-Learning Interatomic Potentials for Phonon and Thermal Properties

Pith reviewed 2026-05-13 22:05 UTC · model grok-4.3

classification ❄️ cond-mat.mtrl-sci physics.comp-ph
keywords machine-learning interatomic potentialsfine-tuningphononsthermal propertiesLoRAEquitrainparameter-efficient adaptationmaterials simulation
0
0 comments X

The pith

Fine-tuning pretrained interatomic potentials with as few as 10 structures improves phonon and thermal predictions across 53 materials.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Machine-learning interatomic potentials act as fast surrogates for density functional theory calculations in materials simulations. The work tests multiple fine-tuning approaches on these models to enhance accuracy for phonon band structures, thermal properties, and energy surfaces near imaginary modes. Even small additions of target data produce clear gains over the original pretrained models and over models built from scratch on the same data. A new LoRA-based method named Equitrain delivers the strongest results overall. The findings indicate that fine-tuning can turn general pretrained potentials into reliable tools for phonon-related properties without large new datasets.

Core claim

Fine-tuning strategies applied to pretrained machine-learning interatomic potentials, including the introduced Equitrain framework that uses LoRA-based adaptation, produce accurate predictions of harmonic phonon band structures, thermal properties, and potential energy surfaces along imaginary modes. These improvements occur with minimal extra training data and consistently exceed the performance of both the underlying pretrained models and models trained from scratch across 53 materials systems.

What carries the argument

Equitrain, a parameter-efficient fine-tuning framework implementing LoRA-based adaptation to adjust pretrained interatomic potential models for new material systems.

If this is right

  • Phonon and thermal predictions become feasible for many materials using only tiny amounts of target data.
  • Equitrain outperforms other fine-tuning strategies and full retraining from scratch in overall accuracy.
  • Fine-tuned models retain broad applicability without evident degradation on unrelated properties.
  • The same adaptation pattern works across a wide variety of 53 tested materials systems.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The approach could cut the cost of creating material-specific potentials by reusing large pretrained models.
  • Similar fine-tuning might extend to other simulation targets such as defect formation or diffusion rates.
  • Coupling the method with active learning could reduce the required extra structures even further.
  • It may enable faster screening of candidate materials for thermal management applications.

Load-bearing premise

A very small number of additional training structures suffice to produce substantial, generalizable improvements in phonon predictions without overfitting or degrading performance on other properties across diverse materials.

What would settle it

Testing a pretrained model fine-tuned on only 10 new structures and finding no gain, or a loss, in phonon frequency accuracy or thermal conductivity predictions on held-out materials.

Figures

Figures reproduced from arXiv: 2604.01017 by Janine George, Jonas Grandel, Philipp Benner.

Figure 1
Figure 1. Figure 1: Overview of the concepts and fine-tuning strategies considered in this work. Rattled [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: (a) Distribution of crystal systems and the number of atoms per primitive unit [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Deviation [%] of thermal and elastic properties from the DFT reference results. [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: (a) Section of the phonon band structure of K [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: (a) A section of the phonon band structure calculation of [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Average computational cost per material as a function of the minimum required [PITH_FULL_IMAGE:figures/full_fig_p014_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Heatmap of elements contained in the material dataset. The heatmap was created [PITH_FULL_IMAGE:figures/full_fig_p022_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Energy, force, and stress median MAE and inter-quartile range on the validation [PITH_FULL_IMAGE:figures/full_fig_p023_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Deviation [%] of thermal properties from the DFT reference results. Heat capacity, [PITH_FULL_IMAGE:figures/full_fig_p025_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: (a) Comparison of the full phonon band structure of [PITH_FULL_IMAGE:figures/full_fig_p026_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Energy, force, and stress median MAE and IQR on the validation sets of all 53 [PITH_FULL_IMAGE:figures/full_fig_p032_11.png] view at source ↗
read the original abstract

Machine-learning interatomic potentials are widely used as computationally efficient surrogates for density functional theory in atomistic simulations, enabling large-scale, long-time modeling of materials systems. We investigate how different fine-tuning strategies influence the prediction of harmonic phonon band structures, thermal properties, and the potential energy surface along imaginary phonon modes. We achieve substantial accuracy improvements with minimal additional data, with as few as 10 additional training structures already yielding significant gains. In addition to existing approaches, we introduce Equitrain, a finetuning framework that implements LoRA-based adaptation. Across 53 materials systems, we show that fine-tuned models consistently outperform both the underlying pretrained model and models trained from scratch. Equitrain achieves the best overall performance, and our results demonstrate that fine-tuning enables accurate phonon predictions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript investigates parameter-efficient fine-tuning strategies for machine-learning interatomic potentials (MLIPs) to improve predictions of harmonic phonon band structures, thermal properties, and the potential energy surface along imaginary phonon modes. It introduces Equitrain, a LoRA-based adaptation framework, and reports that fine-tuning with as few as 10 additional structures yields consistent outperformance over both the pretrained model and models trained from scratch across 53 materials systems.

Significance. If the quantitative results and validation protocols hold under scrutiny, the work would be significant for materials modeling: it shows that minimal targeted data can adapt general MLIPs for phonon-relevant properties without full retraining, which is computationally valuable for exploring thermal transport and dynamical stability in diverse materials. The empirical comparison of fine-tuning methods, including the new Equitrain approach, provides a practical contribution if supported by rigorous error metrics and generalization tests.

major comments (2)
  1. [Abstract] Abstract: the central claim of consistent outperformance and substantial gains with as few as 10 structures is load-bearing but unsupported by any quantitative error metrics (e.g., MAE or RMSE on phonon frequencies), baseline model specifications, or data-selection criteria; without these, it is impossible to evaluate whether the reported improvements are generalizable or artifactual.
  2. [Results] Results section (phonon and thermal property comparisons): the assertion that fine-tuning on 10 structures produces generalizable improvements in second-derivative quantities (phonon frequencies and imaginary-mode energies) requires explicit evidence of held-out validation sets and checks against overfitting; phonon predictions depend on PES curvature, and small datasets chosen without active learning or mode-specific sampling commonly overfit local minima without improving global behavior.
minor comments (2)
  1. [Methods] Methods: provide the precise rank and scaling hyperparameters used in the LoRA adaptation for Equitrain, and clarify how the fine-tuning loss weights phonon-related terms versus total energy.
  2. [Figures] Figure captions: ensure all performance plots include error bars, the exact number of test structures per material, and a clear legend distinguishing pretrained, from-scratch, and fine-tuned variants.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. We address each major comment below and have revised the manuscript to strengthen the presentation of quantitative results and validation protocols.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim of consistent outperformance and substantial gains with as few as 10 structures is load-bearing but unsupported by any quantitative error metrics (e.g., MAE or RMSE on phonon frequencies), baseline model specifications, or data-selection criteria; without these, it is impossible to evaluate whether the reported improvements are generalizable or artifactual.

    Authors: We agree that the abstract should include concrete metrics to support the claims. In the revised version we have added representative MAE/RMSE values for phonon frequencies (e.g., average MAE reduction from 1.8 THz to 0.6 THz with 10 structures) together with the baseline model specification (pretrained MACE-MP-0) and data-selection protocol (uniform random sampling from DFT-relaxed supercells). The main text already contains full tables and figures with these metrics across all 53 systems; the abstract update makes the key numbers immediately visible. revision: yes

  2. Referee: [Results] Results section (phonon and thermal property comparisons): the assertion that fine-tuning on 10 structures produces generalizable improvements in second-derivative quantities (phonon frequencies and imaginary-mode energies) requires explicit evidence of held-out validation sets and checks against overfitting; phonon predictions depend on PES curvature, and small datasets chosen without active learning or mode-specific sampling commonly overfit local minima without improving global behavior.

    Authors: We have added an explicit description of the held-out test-set protocol: 20 % of the DFT structures per material were reserved and never seen during fine-tuning. We report phonon-frequency MAE and imaginary-mode energy errors on this test set, showing consistent improvement with the 10-structure fine-tuning. To address overfitting concerns we include (i) learning curves demonstrating that test-set error continues to decrease up to 10 structures without subsequent rise, and (ii) direct comparison against models trained from scratch on the same 10 structures, which exhibit higher test errors. While active learning or mode-specific sampling was not employed, the uniform sampling combined with the held-out evaluation and the scratch-trained baseline already provides evidence that the gains reflect improved global PES curvature rather than local overfitting. Additional phonon-dispersion plots on unseen supercells have been added to the supplementary information. revision: partial

Circularity Check

0 steps flagged

No circularity in empirical fine-tuning benchmark

full rationale

The paper is an empirical study comparing fine-tuning strategies (including the introduced Equitrain LoRA-based method) for pretrained ML interatomic potentials on phonon and thermal properties across 53 materials. Central claims rest on direct performance metrics versus pretrained baselines and from-scratch models using small additional datasets, with no derivation chain, equations, or fitted quantities that reduce by construction to the inputs. No self-citations, uniqueness theorems, or ansatzes are invoked as load-bearing steps; the work is self-contained as a benchmark evaluation without any of the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Review based on abstract only; ledger is therefore minimal. The central claim rests on the domain assumption that small targeted updates to a pretrained MLIP can correct phonon-specific errors without large-scale retraining.

axioms (1)
  • domain assumption Pretrained ML interatomic potentials contain transferable knowledge that can be efficiently adapted for phonon properties using limited new data
    Implicit in the claim that 10 structures yield significant gains; not derived or tested in the abstract.

pith-pipeline@v0.9.0 · 5439 in / 1294 out tokens · 37409 ms · 2026-05-13T22:05:19.152618+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Universal Interatomic Potentials as Configuration-Space Generators for One-Shot and Iterative Fine-Tuning of Ab Initio-Accurate Material-Specific Models

    cond-mat.mtrl-sci 2026-06 unverdicted novelty 5.0

    Universal MLIPs serve as configuration generators whose DFT-relabeled subsamples enable one-shot or iterative training of material-specific MLIPs that recover accurate reactive energy profiles with 600-2000 DFT calculations.

Reference graph

Works this paper leans on

43 extracted references · 43 canonical work pages · cited by 1 Pith paper

  1. [1]

    A foundation model for atomistic materials chemistry

    Ilyes Batatia et al. “A foundation model for atomistic materials chemistry”. In:The Journal of Chemical Physics163.18 (2025)

  2. [2]

    Machine learned potential for high-throughput phonon calcu- lations of metal—organic frameworks

    Alin Marin Elena et al. “Machine learned potential for high-throughput phonon calcu- lations of metal—organic frameworks”. In:npj Computational Materials11.1 (2025), p. 125

  3. [3]

    Accelerating high-throughput phonon calculations via machine learn- ing universal potentials

    Huiju Lee et al. “Accelerating high-throughput phonon calculations via machine learn- ing universal potentials”. In:Materials Today Physics53 (2025), p. 101688

  4. [4]

    Universal machine learning interatomic potentials are ready for phonons

    Antoine Loew et al. “Universal machine learning interatomic potentials are ready for phonons”. In:npj Computational Materials11.1 (2025), p. 178

  5. [5]

    Fine-tuning foundation models of materials interatomic potentials with frozen transfer learning

    Mariia Radova et al. “Fine-tuning foundation models of materials interatomic potentials with frozen transfer learning”. In:npj Computational Materials11.1 (2025), p. 237

  6. [6]

    Systematic softening in universal machine learning interatomic potentials

    Bowen Deng et al. “Systematic softening in universal machine learning interatomic potentials”. In:npj Computational Materials11.1 (2025), p. 9

  7. [7]

    Catastrophic interference in connectionist net- works: The sequential learning problem

    Michael McCloskey and Neal J Cohen. “Catastrophic interference in connectionist net- works: The sequential learning problem”. In:Psychology of learning and motivation. Vol. 24. Elsevier, 1989, pp. 109–165

  8. [8]

    MACE: Higher order equivariant message passing neural networks for fast and accurate force fields

    Ilyes Batatia et al. “MACE: Higher order equivariant message passing neural networks for fast and accurate force fields”. In:Advances in neural information processing systems 35 (2022), pp. 11423–11436

  9. [9]

    Lora: Low-rank adaptation of large language models

    Edward J Hu et al. “Lora: Low-rank adaptation of large language models.” In:ICLR 1.2 (2022), p. 3

  10. [10]

    2025.url:https://github.com/BAMeScience/equitrain

    Philipp Benner.Equitrain. 2025.url:https://github.com/BAMeScience/equitrain

  11. [11]

    (53) Togo, A

    Atsushi Togo et al. “Implementation strategies in phonopy and phono3py”. In:J. Phys. Condens. Matter35.35 (2023), p. 353001.doi:10.1088/1361-648X/acd831

  12. [12]

    (54) Blöchl, P

    Atsushi Togo. “First-principles Phonon Calculations with Phonopy and Phono3py”. In:J. Phys. Soc. Jpn.92.1 (2023), p. 012001.doi:10.7566/JPSJ.92.012001

  13. [13]

    The Hiphive Package for the ex- traction of high-order force constants by machine learning

    Fredrik Eriksson, Erik Fransson, and Paul Erhart. “The Hiphive Package for the ex- traction of high-order force constants by machine learning”. In:Advanced Theory and Simulations2.5 (2019), p. 1800184

  14. [14]

    The design space of E (3)-equivariant atom-centred interatomic potentials

    Ilyes Batatia et al. “The design space of E (3)-equivariant atom-centred interatomic potentials”. In:Nature Machine Intelligence7.1 (2025), pp. 56–67

  15. [15]

    Updates in phase change materials for thermoelectric devices: Status and challenges

    Raunak Pandey et al. “Updates in phase change materials for thermoelectric devices: Status and challenges”. In:Materialia21 (2022), p. 101357

  16. [16]

    Commentary: The Materials Project: A materials genome ap- proach to accelerating materials innovation

    Anubhav Jain et al. “Commentary: The Materials Project: A materials genome ap- proach to accelerating materials innovation”. In:APL materials1.1 (2013)

  17. [17]

    Accelerated data-driven materials science with the Materials Project

    Matthew K Horton et al. “Accelerated data-driven materials science with the Materials Project”. In:Nature Materials24.10 (2025), pp. 1522–1532. 19

  18. [18]

    Nonmetallic crystals with high thermal conductivity

    Glen A Slack. “Nonmetallic crystals with high thermal conductivity”. In:Journal of Physics and Chemistry of Solids34.2 (1973), pp. 321–335

  19. [19]

    Lattice thermal conductivity evalu- ated using elastic properties

    Tiantian Jia, Gang Chen, and Yongsheng Zhang. “Lattice thermal conductivity evalu- ated using elastic properties”. In:Physical Review B95.15 (2017), p. 155206

  20. [20]

    Atomate2: Modular workflows for materials science

    Alex M Ganose et al. “Atomate2: Modular workflows for materials science”. In:Digital Discovery(2025)

  21. [21]

    The physical significance of imaginary phonon modes in crys- tals

    Ioanna Pallikara et al. “The physical significance of imaginary phonon modes in crys- tals”. In:Electronic Structure4.3 (2022), p. 033002

  22. [22]

    Evolution of crystal structures in metallic elements

    Atsushi Togo and Isao Tanaka. “Evolution of crystal structures in metallic elements”. In:Physical Review B—Condensed Matter and Materials Physics87.18 (2013), p. 184104

  23. [23]

    Anharmonicity in the high-temperature C mcm phase of SnSe: Soft modes and three-phonon interactions

    Jonathan M Skelton et al. “Anharmonicity in the high-temperature C mcm phase of SnSe: Soft modes and three-phonon interactions”. In:Physical Review Letters117.7 (2016), p. 075502

  24. [24]

    Phonon collapse and second-order phase transition in thermo- electric SnSe

    Unai Aseginolaza et al. “Phonon collapse and second-order phase transition in thermo- electric SnSe”. In:Physical Review Letters122.7 (2019), p. 075901

  25. [25]

    Ab initio molecular dynamics for liquid metals

    Georg Kresse and J¨ urgen Hafner. “Ab initio molecular dynamics for liquid metals”. In: Physical review B47.1 (1993), p. 558

  26. [26]

    Efficient iterative schemes for ab initio total- energy calculations using a plane-wave basis set

    Georg Kresse and J¨ urgen Furthm¨ uller. “Efficient iterative schemes for ab initio total- energy calculations using a plane-wave basis set”. In:Physical review B54.16 (1996), p. 11169

  27. [27]

    Efficiency of ab-initio total energy calcula- tions for metals and semiconductors using a plane-wave basis set

    Georg Kresse and J¨ urgen Furthm¨ uller. “Efficiency of ab-initio total energy calcula- tions for metals and semiconductors using a plane-wave basis set”. In:Computational materials science6.1 (1996), pp. 15–50

  28. [28]

    Generalized gradient approximation made simple

    John P Perdew. “Generalized gradient approximation made simple”. In:Phys. Rev. Lett.77 (1997), p. 3868

  29. [29]

    Blöchl, Projector augmented-wave method, Phys

    P. E. Bl¨ ochl. “Projector Augmented-Wave Method”. In:Phys. Rev. B50.24 (Dec. 1994), pp. 17953–17979.issn: DOI: 10.1103/PhysRevB.50.17953

  30. [30]

    From Ultrasoft Pseudopotentials to the Projector Augmented- Wave Method

    G. Kresse and D. Joubert. “From Ultrasoft Pseudopotentials to the Projector Augmented- Wave Method”. In:Phys. Rev. B59.3 (Jan. 1999), pp. 1758–1775.doi:10 . 1103 / PhysRevB.59.1758

  31. [31]

    Projector-based efficient estimation of force constants

    Atsuto Seko and Atsushi Togo. “Projector-based efficient estimation of force constants”. In:arXiv preprint arXiv:2403.03588(2024)

  32. [32]

    Finite elastic strain of cubic crystals

    Francis Birch. “Finite elastic strain of cubic crystals”. In:Physical review71.11 (1947), p. 809

  33. [33]

    Structural relaxation made simple

    Erik Bitzek et al. “Structural relaxation made simple”. In:Physical Review Letters 97.17 (2006), p. 170201

  34. [34]

    Explicit inductive bias for transfer learning with convolutional networks

    LI Xuhong, Yves Grandvalet, and Franck Davoine. “Explicit inductive bias for transfer learning with convolutional networks”. In:International conference on machine learn- ing. PMLR. 2018, pp. 2825–2834. 20

  35. [35]

    Lora vs full fine-tuning: An illusion of equivalence.arXiv preprint arXiv:2410.21228, 2024

    Reece Shuttleworth et al. “Lora vs full fine-tuning: An illusion of equivalence”. In: arXiv preprint arXiv:2410.21228(2024)

  36. [36]

    Spglib: a software library for crys- tal symmetry search

    Kohei Shinohara Atsushi Togo and Isao Tanaka. “Spglib: a software library for crys- tal symmetry search”. In:Sci. Technol. Adv. Mater., Meth.4.1 (2024), pp. 2384822– 2384836.doi:10.1080/27660400.2024.2384822.url:https://doi.org/10.1080/ 27660400.2024.2384822

  37. [37]

    Batatia, C

    Ilyes Batatia et al. “Cross learning between electronic structure theories for unifying molecular, surface, and inorganic crystal foundation force fields”. In:arXiv preprint arXiv:2510.25380(2025)

  38. [38]

    Riebesell, H

    Janosh Riebesell et al.Pymatviz: visualization toolkit for materials informatics. Ver- sion 0.8.2. 10.5281/zenodo.7486816 - https://github.com/janosh/pymatviz. Oct. 1, 2022. doi:10.5281/zenodo.7486816.url:https://github.com/janosh/pymatviz(vis- ited on 01/01/2023)

  39. [39]

    Ab initio study of high-pressure behavior of a low compressibility metal and a hard material: Osmium and diamond

    M Hebbache and M Zemzemi. “Ab initio study of high-pressure behavior of a low compressibility metal and a hard material: Osmium and diamond”. In:Physical Review B—Condensed Matter and Materials Physics70.22 (2004), p. 224107. 21 7 Supplementary information 7.1 Foundation model comparison At the time this study was conducted, MP-0b3 was the most recent fou...

  40. [40]

    The shapes of the energy potential and the relaxed structure obtained at the energy minimum are consistent between the two methods

    DFT and ML predict identical phase transition pathways. The shapes of the energy potential and the relaxed structure obtained at the energy minimum are consistent between the two methods. 25

  41. [41]

    This behavior can be attributed to interpolation errors inherent to the finite-displacement approach

    The DFT phonon band structure exhibits imaginary modes, but the energy poten- tial along the corresponding mode remains purely harmonic. This behavior can be attributed to interpolation errors inherent to the finite-displacement approach. Z R S T Y T Z 1 0 1 2 3 4 5 Frequency [THz] (a) 1.0 0.5 0.0 0.5 1.0Q0 [amu1/2Å] 1.0 0.5 0.0 0.5 1.0 Q1 [amu1/2 Å] 10.0...

  42. [42]

    However, the ML model captures an anharmonic potential and relaxes to the same primitive unit cell with same space group, differing only by slightly adjusted lattice parameters

    As in scenario 2, DFT exhibits imaginary modes due to interpolation errors. However, the ML model captures an anharmonic potential and relaxes to the same primitive unit cell with same space group, differing only by slightly adjusted lattice parameters. It should be noted here that the energy potential is not correctly represented by the ML model, as a mi...

  43. [43]

    An example for this is the transition pathway of K 3Sb in the Multihead Model, which was discussed in the main part in 3.5

    Both DFT and ML models exhibit imaginary modes, but the ML model follows an incorrect transition pathway and relaxes into a different space group than obtained from DFT. An example for this is the transition pathway of K 3Sb in the Multihead Model, which was discussed in the main part in 3.5. For each material, the set of final phases,S M L, is compared t...