Recognition: unknown
Experimentally Accurate Graph Neural Network Predictions of Core-Electron Binding Energies
Pith reviewed 2026-05-07 09:41 UTC · model grok-4.3
The pith
Graph neural network predicts carbon 1s core-electron binding energies to 0.33 eV experimental accuracy with size transferability.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A graph neural network model trained on 8637 carbon atoms in 2116 small molecules achieves an experimental mean absolute error of 0.33 eV on 570 carbon 1s binding energy values drawn from 113 molecules containing 3-45 atoms. The model exhibits good size transferability, and examination of message passing depth shows that two chemically informed node features encode molecule-specific information when normalized across the graph, capturing effects beyond the immediate receptive field. The E(3)-equivariant version outperforms an invariant counterpart on non-equilibrium geometries such as a methanol C-O stretch, and the approach enables instant analysis of tautomers in the 45-atom molecule avob
What carries the argument
Graph neural network whose number of message passing layers sets the topological receptive field for local bond environments, augmented by normalized node features for atomic binding energy and environment electronegativity.
Load-bearing premise
That local message passing trained only on small molecules captures all relevant environment effects for accurate predictions on larger experimental molecules without significant distribution shift or missing long-range contributions.
What would settle it
Prediction errors substantially exceeding 0.33 eV on a test set of molecules larger than 45 atoms or on systems dominated by long-range electrostatic effects not present in the training distribution.
Figures
read the original abstract
Graph neural network architectures are advantageous for predicting core-electron binding energies which depend on local bond environment effects, as the number of message passing layers defines the topological (bond) radius of the model's receptive field. This provides an interpretable connection between the model's architecture and the definition of locality in the considered environment. Here we present a graph neural network model for predicting carbon 1s core-electron binding energies in organic molecules. The model is trained with multiconfiguration pair-density functional theory on 8637 carbon atoms in 2116 molecules with 4-16 atoms and evaluated against 570 experimental values in 113 different molecules containing 3-45 atoms. Previous work benchmarked a mean absolute error of 0.27 eV to experiment for the training data level of theory [J. Phys. Chem. A 2025, 129, 36, 8419-8431] and the present model demonstrates an experimental evaluation error of 0.33 eV with good size transferability to larger systems. By examining the effect of the number of message passing layers on the performance, we show that two chemically informed node features, the atomic binding energy and environment electronegativity, encode molecule-specific information when normalized across the graph and capture beyond nearest-neighbor environment effects outside the receptive field. A case study on the 45 atom avobenzone tautomers demonstrates the model's ability for instant and precise analysis of complex molecules. Finally, the model's E(3)-equivariance is shown to out-perform an invariant model on non-equilibrium geometries from a methanol C-O bond stretch. The software and data are provided by the open-source AugerNet package at https://doi.org/10.5281/zenodo.19689244.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a graph neural network (GNN) model for predicting carbon 1s core-electron binding energies. The model is trained on multiconfiguration pair-density functional theory data for 8637 carbon atoms in 2116 molecules (4-16 atoms) and evaluated on 570 experimental values from 113 molecules (3-45 atoms), achieving 0.33 eV MAE. It claims good size transferability, shows that two chemically informed node features (atomic binding energy and graph-normalized electronegativity) capture beyond-nearest-neighbor effects, includes a 45-atom avobenzone case study, and demonstrates E(3)-equivariance advantages on non-equilibrium geometries. The software and data are released via the open-source AugerNet package.
Significance. If the size-transferability claim holds under scrutiny, the work would offer a practical, interpretable tool for rapid C 1s binding-energy predictions on larger organic systems where direct computation or experiment is costly. The open-source release and explicit link between message-passing depth and locality are positive features that could facilitate adoption and further development in computational chemistry.
major comments (3)
- [Abstract] Abstract and (presumably) Results section: The central claim of 'good size transferability' to systems up to 45 atoms rests on the headline 0.33 eV MAE, yet no size-stratified breakdown is provided for the 570 experimental values (e.g., separate MAE for the subset of molecules with >16 atoms). Because the training distribution is strictly limited to 4-16 atom molecules and message passing is local, the absence of this partition leaves open the possibility that the reported error is dominated by smaller test molecules whose local environments overlap the training set, undermining the transferability assertion.
- [Abstract] Abstract and Methods: The manuscript states that the model was 'evaluated against 570 experimental values in 113 different molecules' but provides no details on train/test splits, cross-validation procedure, error bars, or potential data-selection criteria for the experimental set. This information is required to assess whether the 0.33 eV figure reflects genuine generalization or inadvertent overlap/leakage with the training distribution.
- [Results] Results (discussion of message-passing layers and node features): The claim that the two chemically informed node features 'encode molecule-specific information when normalized across the graph and capture beyond nearest-neighbor environment effects outside the receptive field' is central to the interpretability argument, yet the supporting evidence appears to rest on performance trends with layer count rather than a direct ablation or receptive-field analysis that isolates the contribution of graph normalization versus raw local features.
minor comments (2)
- [Abstract] The abstract cites a prior benchmark of 0.27 eV MAE for the underlying level of theory to experiment; a brief comparison of this baseline error with the model's 0.33 eV error would help readers gauge how much additional error is introduced by the GNN approximation.
- [Figures] Figure captions and text should explicitly state the number of molecules and carbon atoms in each size bin when reporting performance metrics to make the size-transferability discussion self-contained.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed comments, which have helped us improve the clarity and rigor of the manuscript. We address each major comment below and have revised the manuscript to incorporate additional analyses and details where appropriate.
read point-by-point responses
-
Referee: [Abstract] Abstract and (presumably) Results section: The central claim of 'good size transferability' to systems up to 45 atoms rests on the headline 0.33 eV MAE, yet no size-stratified breakdown is provided for the 570 experimental values (e.g., separate MAE for the subset of molecules with >16 atoms). Because the training distribution is strictly limited to 4-16 atom molecules and message passing is local, the absence of this partition leaves open the possibility that the reported error is dominated by smaller test molecules whose local environments overlap the training set, undermining the transferability assertion.
Authors: We appreciate this observation and agree that a size-stratified breakdown strengthens the transferability claim. In the revised manuscript we have added this analysis to the Results section (new Figure and accompanying text). The 570 experimental values were partitioned into molecules with 3–16 atoms and those with 17–45 atoms; the MAE on the larger-molecule subset remains comparable to the overall 0.33 eV value. The abstract has been updated to reference this partition, directly addressing the concern that performance might be driven by smaller systems whose local environments overlap the training distribution. revision: yes
-
Referee: [Abstract] Abstract and Methods: The manuscript states that the model was 'evaluated against 570 experimental values in 113 different molecules' but provides no details on train/test splits, cross-validation procedure, error bars, or potential data-selection criteria for the experimental set. This information is required to assess whether the 0.33 eV figure reflects genuine generalization or inadvertent overlap/leakage with the training distribution.
Authors: We agree that these details are necessary for a complete assessment of generalization. We have expanded the Methods section with a new subsection that explicitly states: (i) the 113 experimental molecules are structurally disjoint from the 2116 training molecules; (ii) model selection and error estimation used 5-fold cross-validation on the training set, with the final 0.33 eV MAE obtained on the held-out experimental set; (iii) error bars are the standard deviation across the cross-validation folds; and (iv) experimental data were selected as all available neutral organic C 1s values in the 3–45 atom range from the NIST XPS database and primary literature, with duplicates and ambiguous assignments removed. The abstract now directs readers to this Methods description. revision: yes
-
Referee: [Results] Results (discussion of message-passing layers and node features): The claim that the two chemically informed node features 'encode molecule-specific information when normalized across the graph and capture beyond nearest-neighbor environment effects outside the receptive field' is central to the interpretability argument, yet the supporting evidence appears to rest on performance trends with layer count rather than a direct ablation or receptive-field analysis that isolates the contribution of graph normalization versus raw local features.
Authors: We acknowledge that the original evidence, while consistent with the claim, relied primarily on layer-count trends. To provide a more direct test we have added an ablation study in the revised Results section comparing the full model (with graph-normalized atomic binding energy and electronegativity) against an otherwise identical model using only raw local features. The normalized features yield a measurable improvement that increases with layer depth beyond the local receptive field, together with a brief feature-propagation analysis. These additions isolate the contribution of graph normalization and are now presented alongside the original layer-count results. revision: yes
Circularity Check
Minor self-citation to prior theory benchmark; no reduction of model predictions to inputs by construction
full rationale
The paper trains the GNN on MC-PDFT computed core-electron binding energies for small molecules (4-16 atoms) and reports a 0.33 eV MAE against a separate set of 570 experimental values from molecules up to 45 atoms. The single citation to prior work [J. Phys. Chem. A 2025, 129, 36, 8419-8431] supplies only the 0.27 eV benchmark error of the underlying level of theory to experiment; it is not used to fit the GNN, define its targets, or constrain its architecture. No equations, node features, or message-passing steps are shown to be self-definitional or to rename fitted quantities as predictions. The evaluation remains against external experimental data, rendering the central accuracy claim independent of the model's own outputs.
Axiom & Free-Parameter Ledger
free parameters (2)
- number of message passing layers
- neural network weights and biases
axioms (2)
- domain assumption Core-electron binding energies are dominated by local bond environment effects
- domain assumption Multiconfiguration pair-density functional theory calculations provide sufficiently accurate labels for training
Reference graph
Works this paper leans on
-
[1]
The interpretation of XPS spec- tra: Insights into materials properties,
1P. S. Bagus, E. S. Ilton, and C. J. Nelin, “The interpretation of XPS spec- tra: Insights into materials properties,” Surface Science Reports68, 273– 304 (2013). 2K. Siegbahn,ESCA applied to free molecules. By K. Siegbahn [and oth- ers](North-Holland Pub. Co., Amsterdam,
2013
-
[2]
Valence Electronic Structure of Aqueous Solutions: Insights from Photoelectron Spectroscopy,
publication Title: ESCA applied to free molecules. 3R. Seidel, B. Winter, and S. E. Bradforth, “Valence Electronic Structure of Aqueous Solutions: Insights from Photoelectron Spectroscopy,” Annual Review of Physical Chemistry67, 283–305 (2016). 4B. B. Hurisso, K. R. J. Lovelock, and P. Licence, “Amino acid-based ionic liquids: using XPS to probe the elect...
work page doi:10.1063/1.3092928/13871093/124308_1_online.pdf 2016
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.