Synthetic pre-training of graph-network models for predicting solid-state NMR parameters

Carlos Bornes; Chiheb Ben Mahmoud; Christopher J. Heard; Jonathan R. Yates; Luk\'a\v{s} Grajciar; Volker L. Deringer

arxiv: 2606.11038 · v1 · pith:BWN4K6T6new · submitted 2026-06-09 · ❄️ cond-mat.mtrl-sci · physics.comp-ph

Synthetic pre-training of graph-network models for predicting solid-state NMR parameters

Chiheb Ben Mahmoud , Carlos Bornes , Christopher J. Heard , Luk\'a\v{s} Grajciar , Jonathan R. Yates , Volker L. Deringer This is my paper

classification ❄️ cond-mat.mtrl-sci physics.comp-ph

keywords modelsdatasyntheticparameterspre-trainingsolid-statetensorialfine-tuning

0 comments

read the original abstract

Nuclear magnetic resonance (NMR) is a powerful probe of atomic structure, but accurate quantum-mechanical predictions of tensorial NMR parameters are computationally demanding. This creates a bottleneck both for direct quantum-mechanical studies and for collecting high-quality training data for machine-learning (ML) models. Here, we introduce a synthetic pre-training and fine-tuning protocol for graph-based ML models of solid-state NMR parameters. We first pre-train models on synthetic tensorial data, as obtained using an existing ML model, and subsequently fine-tune those models on new ground-truth data. We observe a pronounced improvement in data efficiency when pre-training and fine-tuning span the same compositional and configurational space, and we carry out initial experiments regarding chemical transferability. Our work outlines a route toward future data-efficient training workflows for tensorial ML models for solid-state NMR, combining inexpensive synthetic supervision with targeted first-principles refinement.

This paper has not been read by Pith yet.

Synthetic pre-training of graph-network models for predicting solid-state NMR parameters

discussion (0)