DPA4: Pushing the Accuracy-Cost Frontier of Interatomic Potentials with EMFA SO(2) Convolution

· 2026 · physics.chem-ph · arXiv 2606.02419

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Machine-learning interatomic potentials now approach quantum-mechanical accuracy on standard benchmarks, but the training cost of the most expressive equivariant architectures has become a serious bottleneck. We introduce DPA4, an SE(3)-equivariant interatomic-potential architecture with an EMFA (Edge-conditioned, Multi-Focus, Attention) SO(2)-equivariant convolution that combines a low-rank edge-node SO(2)-equivariant product, a multi-focus design for message nonlinearity, and envelope-gated attention for message aggregation. A Lebedev-grid projection further preserves SO(3)-equivariance in the nonlinearity to machine precision. A compiler-friendly conservative energy-gradient training path provides up to $\sim$3 times wall-clock speedup under torch compile. On the compliant Matbench Discovery benchmark, DPA4-Pro attains the best Combined Performance Score (CPS) on the leaderboard, while the 2.76M-parameter DPA4-Air exceeds the accuracy of the 30.1M-parameter eSEN-30M-MP baseline with 10.9$\times$ fewer parameters and 42.9$\times$ less training compute. On SPICE-MACE-OFF, the 5.4M-parameter DPA4-Plus lowers the aggregate molecular energy and force errors of the 6.5M-parameter eSEN baseline by 29% and 30%, while the 2.7M-parameter DPA4-Air still surpasses that baseline with $\sim$2.4$\times$ fewer parameters. Together these results place DPA4 on a new accuracy-cost Pareto frontier on Matbench Discovery and position it as a strong candidate backbone for future multi-task large atomistic model (LAM) pretraining.

representative citing papers

Enerzyme: A Framework for Efficient Training of Reactive Neural Network Potentials for Enzyme Catalysis with Application to Methyltransferases

physics.chem-ph · 2026-07-01 · unverdicted · novelty 6.0

Enerzyme framework trains electrostatics-aware NNPs on under 1,000 system-specific points to reproduce MTase reaction energetics and transition states for clusters up to 545 atoms.

Non-covalent Interactions at cm$^{-1}$ Accuracy: Data Efficient Physics-Informed Distillation for Machine Learning Interatomic Potentials

physics.chem-ph · 2026-06-03 · unverdicted · novelty 6.0

Physics-informed distillation from a universal MLIP plus limited CCSD(T) fine-tuning yields cm^{-1} accurate potentials for non-covalent interactions, with teacher choice strongly affecting accuracy on some systems.

citing papers explorer

Showing 2 of 2 citing papers.

Enerzyme: A Framework for Efficient Training of Reactive Neural Network Potentials for Enzyme Catalysis with Application to Methyltransferases physics.chem-ph · 2026-07-01 · unverdicted · none · ref 4 · internal anchor
Enerzyme framework trains electrostatics-aware NNPs on under 1,000 system-specific points to reproduce MTase reaction energetics and transition states for clusters up to 545 atoms.
Non-covalent Interactions at cm$^{-1}$ Accuracy: Data Efficient Physics-Informed Distillation for Machine Learning Interatomic Potentials physics.chem-ph · 2026-06-03 · unverdicted · none · ref 51 · internal anchor
Physics-informed distillation from a universal MLIP plus limited CCSD(T) fine-tuning yields cm^{-1} accurate potentials for non-covalent interactions, with teacher choice strongly affecting accuracy on some systems.

DPA4: Pushing the Accuracy-Cost Frontier of Interatomic Potentials with EMFA SO(2) Convolution

fields

years

verdicts

representative citing papers

citing papers explorer