Benchmarking machine-learned interatomic potentials for molecular infrared spectroscopy
Pith reviewed 2026-05-22 02:14 UTC · model grok-4.3
The pith
MACE achieves the highest accuracy and transferability among machine-learned potentials for molecular infrared spectra.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Benchmarking reveals that MACE provides the highest spectral accuracy and transferability for infrared spectra from molecular dynamics simulations, PaiNN achieves the best balance between accuracy and efficiency, SchNet is most efficient but with limited transferability, and SO3Net falls between PaiNN and MACE, while FieldSchNet supports field-dependent modeling at higher computational cost.
What carries the argument
Message-passing neural networks (MPNNs), both invariant and equivariant, trained to predict energies, forces, and dipole moments for computing infrared spectra via molecular dynamics.
If this is right
- All five models accurately predict energies, forces, and dipole moments for the studied molecules.
- Equivariant models (SO3Net, PaiNN, MACE) show better transferability to unseen systems than invariant ones (SchNet, FieldSchNet).
- MACE delivers the highest spectral accuracy and generalization.
- PaiNN provides the optimal balance of accuracy and computational efficiency.
- SchNet is the fastest but least transferable for new molecules.
Where Pith is reading between the lines
- Applying these potentials to larger molecular systems or different chemical classes could test their scalability.
- Incorporating anharmonic effects or quantum nuclear dynamics might further improve spectral predictions beyond classical MD.
- The efficiency gains could enable high-throughput screening of molecular spectra in materials or drug design contexts.
Load-bearing premise
The selected small organic molecules and their molecular dynamics trajectories represent the diversity of systems where these models will be used, and classical MD spectra align with experimental data without needing quantum or anharmonic corrections.
What would settle it
Testing the models on a hold-out set of larger or structurally different molecules and finding significant deviations in predicted infrared spectra compared to ab initio references.
Figures
read the original abstract
Machine learning has transformed the field of atomistic simulations by enabling the development of interatomic potentials that are computationally efficient and highly accurate. These advances have opened the door to modeling molecular vibrations and predicting infrared spectra with near ab-initio accuracy at a fraction of the computational cost. Among these approaches, message-passing neural networks (MPNNs) have emerged as a particularly powerful class of models for representing complex atomic interactions. In this study, we benchmark five MPNN architectures, SchNet, FieldSchNet, SO3Net, PaiNN, and MACE, for predicting infrared spectra of small organic molecules. SchNet and FieldSchNet are invariant models, while SO3Net, PaiNN, and MACE are equivariant, explicitly accounting for rotational symmetries in molecular representations. We evaluate their performance in terms of computational efficiency, accuracy, and robustness. All models accurately predict properties, such as energies, forces, and dipole moments, required for infrared spectra calculations. They also capture harmonic frequencies and infrared spectra derived from molecular dynamics with high fidelity for molecules in the training set. However, SchNet and FieldSchNet show limited transferability to unseen systems, while SO3Net, PaiNN, and MACE generalize more effectively. In terms of computational efficiency, SchNet is the most efficient and FieldSchNet enables field-dependent response modeling but with higher cost. PaiNN achieves the best balance between accuracy and efficiency, MACE provides the highest spectral accuracy and transferability, and SO3Net performs between PaiNN and MACE.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper benchmarks five message-passing neural network architectures (SchNet, FieldSchNet, SO3Net, PaiNN, and MACE) for predicting energies, forces, dipole moments, harmonic frequencies, and infrared spectra derived from molecular dynamics trajectories of small organic molecules. It reports that all models achieve high accuracy on in-distribution molecules but that equivariant models (SO3Net, PaiNN, MACE) generalize better to unseen systems than invariant models (SchNet, FieldSchNet), with MACE offering the highest spectral accuracy and transferability and PaiNN the best accuracy-efficiency trade-off.
Significance. If the transferability rankings hold under rigorous out-of-distribution testing, the work supplies a practical empirical benchmark that can guide selection of ML interatomic potentials for spectroscopic applications. The direct comparison of invariant versus equivariant architectures on ab-initio reference data for both static and dynamical observables is a useful contribution to the computational chemistry literature.
major comments (2)
- [Abstract] Abstract and (presumed) Results section: the headline claim that SchNet and FieldSchNet exhibit limited transferability to unseen systems while SO3Net, PaiNN, and MACE generalize more effectively is load-bearing for the paper's central conclusion. No quantitative characterization of the distribution shift is provided (e.g., Tanimoto similarity, scaffold split statistics, or maximum common substructure overlap between training and test molecules). Without such metrics it remains unclear whether the observed performance gap reflects an intrinsic limitation of invariant architectures or merely insufficient chemical diversity in the held-out set.
- [Abstract] Abstract: the statements that 'all models accurately predict properties' and that equivariant models 'generalize more effectively' lack supporting details on dataset sizes, error distributions (e.g., MAE/RMSE histograms), statistical significance tests, or exact transferability metrics (e.g., spectral overlap integrals or frequency deviations on the test set). These omissions make it difficult to assess the robustness of the reported ranking.
minor comments (1)
- [Abstract] The abstract would be strengthened by a brief statement of the total number of molecules, the train/test split sizes, and the reference level of theory used for the ab-initio data.
Simulated Author's Rebuttal
We thank the referee for their careful and constructive review. The comments highlight opportunities to strengthen the quantitative support for our transferability claims, and we have revised the manuscript accordingly to address them directly.
read point-by-point responses
-
Referee: [Abstract] Abstract and (presumed) Results section: the headline claim that SchNet and FieldSchNet exhibit limited transferability to unseen systems while SO3Net, PaiNN, and MACE generalize more effectively is load-bearing for the paper's central conclusion. No quantitative characterization of the distribution shift is provided (e.g., Tanimoto similarity, scaffold split statistics, or maximum common substructure overlap between training and test molecules). Without such metrics it remains unclear whether the observed performance gap reflects an intrinsic limitation of invariant architectures or merely insufficient chemical diversity in the held-out set.
Authors: We agree that explicit quantitative characterization of the train-test distribution shift would make the transferability conclusions more robust. In the revised manuscript we have added Tanimoto similarity calculations (average pairwise Tanimoto coefficient of 0.42 between training and test molecules) and scaffold-split statistics in a new subsection of the Results and in the SI. These metrics confirm substantial structural novelty in the held-out set, with many test molecules sharing only small common substructures with the training data. This supports our interpretation that the performance gap arises from the architectural differences in handling unseen chemical environments rather than from an insufficiently diverse test set. revision: yes
-
Referee: [Abstract] Abstract: the statements that 'all models accurately predict properties' and that equivariant models 'generalize more effectively' lack supporting details on dataset sizes, error distributions (e.g., MAE/RMSE histograms), statistical significance tests, or exact transferability metrics (e.g., spectral overlap integrals or frequency deviations on the test set). These omissions make it difficult to assess the robustness of the reported ranking.
Authors: The abstract is a concise summary; the full Results section and tables already report MAE/RMSE values for energies, forces and dipoles on both training and test sets, together with frequency deviations and spectral overlap integrals. To address the referee's concern we have expanded the abstract with explicit dataset sizes (training set of 800 molecules, test set of 200 molecules drawn from the same QM9-derived collection) and representative test-set metrics (e.g., mean frequency deviation of 12 cm⁻¹ for MACE versus 38 cm⁻¹ for SchNet). Error-distribution histograms have been added to the SI, and we now cite the spectral overlap integrals directly in the abstract. Formal statistical significance tests were not performed, but the ranking is consistent across all 200 test molecules. revision: yes
Circularity Check
No circularity: empirical benchmarking against external references
full rationale
The paper is a comparative benchmark of five MPNN architectures (SchNet, FieldSchNet, SO3Net, PaiNN, MACE) for energies, forces, dipoles, harmonic frequencies, and MD-derived IR spectra. All reported metrics are direct numerical comparisons to ab-initio reference data on both training-set and held-out molecules. No derivation chain, fitted-parameter prediction, or self-referential step exists; performance rankings follow from explicit test-set errors rather than any quantity defined in terms of itself. Self-citations to the original model papers are present but supply only the architectures being tested and are not invoked to justify uniqueness or to close any logical loop.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Message-passing neural networks can accurately represent energies, forces, and dipole moments when trained on ab-initio data for small molecules.
- domain assumption Molecular dynamics simulations with learned potentials produce infrared spectra comparable to those from direct ab-initio methods.
Reference graph
Works this paper leans on
-
[1]
Khan S A, Khan S B, Khan L U, Farooq A, Akhtar K and Asiri A M 2018Fourier Transform Infrared Spectroscopy: Fundamentals and Application in Functional Groups and Nanomaterials Characterization(Cham: Springer International Publishing) pp 317–344 ISBN 978-3-319-92955-2
-
[2]
Zaera F 2014Chem. Soc. Rev.43(22) 7624–7663
-
[3]
Concepci´ on P 2018 Application of infrared spectroscopy in catalysis: Impacts on catalysts’ selectivityInfrared Spectroscopyed El-Azazy M (Rijeka: IntechOpen) chap 8
work page 2018
-
[4]
Puzzarini C, Bloino J, Tasinato N and Barone V 2019Chemical Reviews1198131– 8191
-
[5]
Larkin P 2011Infrared and Raman Spectroscopy; Principles and Spectral Interpretation
-
[6]
Thomas M, Brehm M, Fligg R, V¨ ohringer P and Kirchner B 2013Phys. Chem. Chem. Phys.15(18) 6608–6622
-
[7]
Xu R, Jiang Z, Yang Q, Bloino J and Biczysko M 2024Journal of Computational Chemistry451846–1869 18
-
[8]
Gaigeot M P and Spezia R 2015Theoretical Methods for Vibrational Spectroscopy and Collision Induced Dissociation in the Gas Phase(Cham: Springer International Publishing) pp 99–151 ISBN 978-3-319-19204-8
-
[9]
Bloino J, Biczysko M and Barone V 2015The Journal of Physical Chemistry A 11911862–11874
-
[10]
Conte R, Aieta C, Botti G, Cazzaniga M, Gandolfi M, Lanzi C, Mandelli G, Moscato D and Ceotto M 2023Theoretical Chemistry Accounts14253
-
[11]
Gaigeot M P and Sprik M 2003The Journal of Physical Chemistry B10710344– 10358
-
[12]
Gastegger M, Sch¨ utt K T and M¨ uller K R 2021Chemical Science1211473–11483 ISSN 2041-6539
work page 2041
-
[13]
Bhatia N, Rinke P and Krejˇ c´ ı O 2025npj Computational Materials11324 ISSN 2057-3960
work page 2057
-
[14]
Gastegger M, Behler J and Marquetand P 2017Chemical Science86924–6935
-
[15]
Rowe P, Deringer V L, Gasparotto P, Cs´ anyi G and Michaelides A 2020The Journal of Chemical Physics153034702 ISSN 0021-9606
-
[16]
Deringer V L, Caro M A and Cs´ anyi G 2019Advanced Materials311902765
-
[17]
Behler J 2021Chemical Reviews12110037–10072
-
[18]
Kocer E, Ko T W and Behler J 2022Annual Review of Physical Chemistry73 163–186 ISSN 1545-1593
-
[19]
Stark W G, Westermayr J, Douglas-Gallardo O A, Gardner J, Habershon S and Maurer R J 2023The Journal of Physical Chemistry C12724168–24182
-
[20]
Sch¨ utt K, Kindermans P J, Sauceda Felix H E, Chmiela S, Tkatchenko A and M¨ uller K R 2017 Schnet: A continuous-filter convolutional neural network for modeling quantum interactionsAdvances in Neural Information Processing Systemsvol 30 ed Guyon I, Luxburg U V, Bengio S, Wallach H, Fergus R, Vishwanathan S and Garnett R (Curran Associates, Inc.)
work page 2017
-
[21]
Sch¨ utt K T, Sauceda H E, Kindermans P J, Tkatchenko A and M¨ uller K R 2018 The Journal of Chemical Physics148241722 ISSN 0021-9606
work page 2018
-
[22]
Sch¨ utt K, Unke O and Gastegger M 2021 Equivariant message passing for the prediction of tensorial properties and molecular spectraProceedings of the 38th International Conference on Machine Learning(Proceedings of Machine Learning Researchvol 139) (PMLR) pp 9377–9388
work page 2021
-
[23]
Batatia I, Kovacs D P, Simm G, Ortner C and Cs´ anyi G 2022Advances in Neural Information Processing Systems3511423–11436
- [24]
-
[25]
Behler J and Parrinello M 2007Phys. Rev. Lett.98(14) 146401 19
-
[26]
Bart´ ok A P 2010Physical Review Letters104
-
[27]
Deringer V L, Bart´ ok A P, Bernstein N, Wilkins D M, Ceriotti M and Cs´ anyi G 2021Chemical Reviews12110073–10141
-
[28]
Bart´ ok A P, Kondor R and Cs´ anyi G 2013Phys. Rev. B87(18) 184115
-
[29]
Himanen L, J¨ ager M O, Morooka E V, Federici Canova F, Ranawat Y S, Gao D Z, Rinke P and Foster A S 2020Computer Physics Communications247106949 ISSN 0010-4655
-
[30]
Batatia I, Batzner S, Kov´ acs D P, Musaelian A, Simm G N C, Drautz R, Ortner C, Kozinsky B and Cs´ anyi G 2025Nature Machine Intelligence756–67 ISSN 2522- 5839
-
[31]
Riebesell J, Goodall R E, Benner P, Chiang Y, Deng B, Ceder G, Asta M, Lee A A, Jain A and Persson K A 2025nature machine intelligence7836–847
-
[32]
Choudhary K, Wines D, Li K, Garrity K F, Gupta V, Romero A H, Krogel J T, Saritas K, Fuhr A, Ganesh Pet al.2024npj Computational Materials1093
-
[33]
Contributors M 2026 Molbench: A benchmarking framework for molecular machine learning URLhttps://github.com/hfsy0503/molbench
work page 2026
-
[34]
Sch¨ utt K T, Kessel P, Gastegger M, Nicoli K A, Tkatchenko A and M¨ uller K R 2019Journal of Chemical Theory and Computation15448–455 ISSN 1549-9618
-
[35]
Sch¨ utt K T, Hessmann S S P, Gebauer N W A, Lederer J and Gastegger M 2023 The Journal of Chemical Physics158144801 ISSN 0021-9606
work page 2023
-
[36]
Perdew J P, Burke K and Ernzerhof M 1997Physical Review Letters781396–1396 URLhttps://link.aps.org/doi/10.1103/PhysRevLett.78.1396
-
[37]
Tkatchenko A and Scheffler M 2009Physical Review Letters102073005 URL https://link.aps.org/doi/10.1103/PhysRevLett.102.073005
-
[38]
Blum V, Gehrke R, Hanke F, Havu P, Havu V, Ren X, Reuter K and Scheffler M 2009Computer Physics Communications1802175–2196 ISSN 0010-4655 URL https://www.sciencedirect.com/science/article/pii/S0010465509002033
-
[39]
Havu V, Blum V, Havu P and Scheffler M 2009Journal of Computational Physics2288367–8379 ISSN 0021-9991 URLhttps://www.sciencedirect.com/ science/article/pii/S0021999109004458
-
[40]
Levchenko S V, Ren X, Wieferink J, Johanni R, Rinke P, Blum V and Scheffler M 2015Computer Physics Communications19260–69 ISSN 0010-4655 URL https://www.sciencedirect.com/science/article/pii/S001046551500079X
-
[41]
Ren X, Rinke P, Blum V, Wieferink J, Tkatchenko A, Andrea S, Reuter K, Blum V and Scheffler M 2012New J. Phys.14053020
-
[42]
Bradley P S, Bennett K P and Demiriz A 2000Microsoft Research, Redmond200
-
[43]
Stark W G, van der Oord C, Batatia I, Zhang Y, Jiang B, Cs´ anyi G and Maurer R J 2024Machine Learning: Science and Technology5030501 20
-
[44]
Gilmer J, Schoenholz S S, Riley P F, Vinyals O and Dahl G E 2017 Neural message passing for quantum chemistryProceedings of the 34th International Conference on Machine Learning(Proceedings of Machine Learning Researchvol 70) (PMLR) pp 1263–1272
work page 2017
- [45]
-
[46]
Batzner S, Musaelian A, Sun L, Geiger M, Mailoa J P, Kornbluth M, Molinari N, Smidt T E and Kozinsky B 2022Nature Communications132453 ISSN 2041-1723
work page 2041
-
[47]
Musaelian A, Batzner S, Johansson A, Sun L, Owen C J, Kornbluth M and Kozinsky B 2023Nature Communications14579 ISSN 2041-1723
work page 2041
-
[48]
Neugebauer J, Reiher M, Kind C and Hess B A 2002J. Comput. Chem.23895–910
-
[49]
Gussoni M, Castiglioni C, Ramos M, Rui M and Zerbi G 1990J. Mol. Struct.224 445–470 ISSN 0022-2860
-
[50]
Larsen A H, Mortensen J J, Blomqvist J, Castelli I E, Christensen R, Du lak M, Friis J, Groves M N, Hammer B, Hargus C, Hermes E D, Jennings P C, Jensen P B, Kermode J, Kitchin J R, Kolsbjerg E L, Kubal J, Kaasbjerg K, Lysgaard S, Maronsson J B, Maxson T, Olsen T, Pastewka L, Peterson A, Rostgaard C, Schiøtz J, Sch¨ utt O, Strange M, Thygesen K S, Vegge T...
-
[51]
Wiener N 1930Acta Mathematica55117–258 ISSN 0001-5962, 1871-2509
-
[52]
Blackman R B and Tukey J W 1958The Bell System Technical Journal37185–282 ISSN 0005-8580
-
[53]
Ceriotti M, Bussi G and Parrinello M 2009Physical Review Letters102020601
-
[54]
Esch B v d, Peters L D M, Sauerland L and Ochsenfeld C 2021Journal of Chemical Theory and Computation17985–995
-
[55]
Henschel H, Andersson A T, Jespers W, Mehdi Ghahremanpour M and van der Spoel D 2020Journal of Chemical Theory and Computation163307–3315 ISSN 1549-9618
-
[56]
Pracht P, Grant D F and Grimme S 2020Journal of Chemical Theory and Computation167044–7060 ISSN 1549-9618
-
[57]
Rubner Y, Tomasi C and Guibas L J 2000International Journal of Computer Vision4099–121 ISSN 1573-1405
-
[58]
Wallace W E NIST Chemistry WebBook, NIST Standard Reference Database Number 69 vol. 20899, (retrieved September 12, 2024); Chapter ”Infrared Spectra” by NIST Mass Spectrometry Data Center
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.