Prospects for Deep-Learning-Based Mass Reconstruction of Ultra-High-Energy Cosmic Rays using Simulated Air-Shower Profiles
Pith reviewed 2026-05-18 22:19 UTC · model grok-4.3
The pith
A convolutional neural network predicts the mass of ultra-high-energy cosmic rays from their full longitudinal air-shower profiles.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We present the first study of a deep-learning neural-network approach to predict a primary's mass (ln A) directly from the longitudinal energy-deposit profile of simulated extensive air showers. After rescaling, our network achieves a maximum bias better than 0.4 in ln A with a resolution between 1.5 for protons and 1 for iron, corresponding to a proton-iron Merit Factor of 2.19 (AUC = 0.976). The CNN outperforms simpler ML models trained on profile-shape parameters (X_max, E_cal, R, and L) extracted from the same data and shows only mild degradation when cross-predicting on simulations made with the Sibyll-2.3d hadronic interaction model.
What carries the argument
A convolutional neural network that ingests the full longitudinal energy-deposit profile of a simulated air shower and regresses the primary mass ln A.
If this is right
- The full longitudinal profile supplies composition-sensitive structure beyond what the Gaisser-Hillas parameterization captures.
- Even simple machine-learning models operating on standard profile parameters already surpass several published benchmarks for mass sensitivity.
- Performance remains usable when the network is tested on showers generated with a different hadronic interaction model.
- The approach continues to function across a wide range of added noise levels representative of detector conditions.
Where Pith is reading between the lines
- The method could be retrained or fine-tuned on real fluorescence data once sufficient labeled events become available.
- Improved mass resolution per event would tighten constraints on the distribution of cosmic-ray sources and their acceleration spectra.
- The same network architecture might be extended to incorporate additional observables such as arrival direction or muon content.
Load-bearing premise
The simulated longitudinal profiles generated with CONEX and EPOS-LHC faithfully reproduce the composition-sensitive features present in real ultra-high-energy cosmic ray air showers observed by fluorescence telescopes.
What would settle it
Apply the trained network to real fluorescence-telescope profiles recorded by an observatory and compare the resulting mass distribution against independent composition constraints obtained from other detectors or analysis techniques.
read the original abstract
Knowledge of the mass composition of ultra-high-energy cosmic rays is crucial to understanding their origins; however, current approaches have limited event-by-event resolution. With fluorescence telescope measurements of the longitudinal shower profile, there are opportunities to improve this situation by applying Machine Learning (ML) to leverage more information beyond $X_{max}$ alone. To our knowledge, we present the first study of a deep-learning neural-network approach to predict a primary's mass ($\ln{A}$) directly from the longitudinal energy-deposit profile of simulated extensive air showers. We train and validate our model on simulated showers, generated with CONEX and EPOS-LHC, covering nuclei from $A = 1$ to 61, sampled uniformly in $\ln{A}$. After rescaling, our network achieves a maximum bias better than 0.4 in $\ln{A}$ with a resolution between 1.5 for protons and 1 for iron, corresponding to a proton-iron Merit Factor of 2.19 (AUC = 0.976). We benchmark this against simpler ML models trained on profile-shape parameters ($X_{ max}$, $E_{cal}$, $R$, and $L$) extracted from the same data. We find that even simple models can substantially exceed published benchmarks for combinations of these observables, demonstrating that ML methods applied even to standard profile-shape parameters can significantly improve available mass sensitivity. The CNN outperforms this strong baseline, and this performance is only mildly degraded when cross-predicting on simulations made with the Sibyll-2.3d hadronic interaction model, showing robustness against model choice. The network also maintains its performance across a wide range of noise conditions. An ablation study further demonstrates that the full profile contains composition-sensitive structure not captured by the GH parameterization, while the strong performance of the CNN suggests this information should be resolvable in real events.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents the first deep-learning study using a CNN to predict primary mass (ln A) directly from simulated longitudinal energy-deposit profiles of ultra-high-energy cosmic ray air showers. Trained and validated on CONEX+EPOS-LHC simulations spanning A=1 to 61 (uniform in ln A), the network achieves, after rescaling, a maximum bias better than 0.4 in ln A, resolutions of 1.5 (protons) to 1 (iron), a proton-iron merit factor of 2.19 (AUC=0.976), and outperforms a strong baseline using Gaisser-Hillas parameters (X_max, E_cal, R, L). Performance degrades only mildly on cross-prediction with Sibyll-2.3d profiles and remains stable under varied noise; an ablation study indicates additional composition-sensitive structure beyond the GH parameterization.
Significance. If transferable to real fluorescence-telescope data, the approach could meaningfully advance event-by-event mass composition resolution beyond current X_max-limited methods. The work is strengthened by clear quantitative metrics, direct comparison to a strong baseline on identical simulated data, explicit robustness tests against a second hadronic model and added noise, and the demonstration that even simple ML on standard profile parameters already exceeds published benchmarks. These elements provide a solid foundation for the stated prospects.
major comments (2)
- [Abstract] Abstract: The forward-looking claim that 'the strong performance of the CNN suggests this information should be resolvable in real events' rests on the untested assumption that composition-sensitive sub-structure in the simulated profiles (beyond X_max, R, L) matches that present in actual air showers. Only two hadronic models are compared; systematic differences in multiplicity, elasticity, or electromagnetic development at the 10-20% level between models and nature could shift the learned features and erase the reported gain over the GH baseline.
- [Results (cross-prediction subsection)] Cross-prediction results: While mild degradation is reported when applying the EPOS-LHC-trained network to Sibyll-2.3d profiles, the manuscript does not quantify the profile-level differences (e.g., in longitudinal development or fluctuation statistics) between the two models. This quantification is needed to evaluate whether the CNN is learning robust, model-independent features or model-specific artifacts.
minor comments (2)
- [Methods] The rescaling procedure applied to the network output should be described in full (including how the scaling parameters are chosen and whether they are fixed or data-dependent) so that the reported bias and resolution values can be reproduced and compared unambiguously to the baseline.
- [Figures] Figure captions and axis labels should explicitly state the units and normalization conventions used for the energy-deposit profiles to improve clarity for readers unfamiliar with CONEX output.
Simulated Author's Rebuttal
We thank the referee for the careful review, positive assessment of the work, and recommendation for minor revision. We address each major comment below and have incorporated revisions to strengthen the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract: The forward-looking claim that 'the strong performance of the CNN suggests this information should be resolvable in real events' rests on the untested assumption that composition-sensitive sub-structure in the simulated profiles (beyond X_max, R, L) matches that present in actual air showers. Only two hadronic models are compared; systematic differences in multiplicity, elasticity, or electromagnetic development at the 10-20% level between models and nature could shift the learned features and erase the reported gain over the GH baseline.
Authors: We agree that the original wording in the abstract was overly forward-looking given the simulation-only nature of the study. We have revised the abstract to remove the specific claim and instead state that the results 'demonstrate the potential of deep learning to extract additional composition information from profiles beyond standard parameterizations, motivating further studies with experimental data.' We have also added a dedicated paragraph in the conclusions section discussing the limitations of hadronic model comparisons and the possibility that unmodeled differences with nature could affect performance. This provides a more balanced presentation while retaining the core findings from the simulations. revision: yes
-
Referee: [Results (cross-prediction subsection)] Cross-prediction results: While mild degradation is reported when applying the EPOS-LHC-trained network to Sibyll-2.3d profiles, the manuscript does not quantify the profile-level differences (e.g., in longitudinal development or fluctuation statistics) between the two models. This quantification is needed to evaluate whether the CNN is learning robust, model-independent features or model-specific artifacts.
Authors: We thank the referee for highlighting this gap. In the revised manuscript we have added a new figure and accompanying text in the cross-prediction subsection that directly quantifies the differences between EPOS-LHC and Sibyll-2.3d. This includes comparisons of mean longitudinal profiles, X_max distributions, and profile fluctuation widths for proton and iron primaries. The observed differences are modest (a few g cm^{-2} in X_max and comparable in other moments), consistent with known variations between the models. The mild performance degradation of the CNN despite these differences supports that it captures at least partially robust features rather than purely model-specific artifacts. revision: yes
Circularity Check
No circularity: performance metrics derived from held-out simulation test set
full rationale
The paper trains a CNN on CONEX+EPOS-LHC simulated longitudinal profiles to predict ln A, then reports bias, resolution, merit factor (2.19), and AUC (0.976) on held-out simulated events from the same generator. These quantities are computed directly from network outputs versus true labels and do not reduce to any fitted parameters, self-citations, or ansatzes by construction. The benchmark against GH-parameter models and the cross-test on Sibyll-2.3d are independent evaluations. The derivation is a standard supervised ML pipeline with no load-bearing self-referential steps.
Axiom & Free-Parameter Ledger
free parameters (2)
- Rescaling applied to network output
- CNN architecture and training hyperparameters
axioms (1)
- domain assumption Simulations generated with CONEX and EPOS-LHC (and cross-checked with Sibyll-2.3d) accurately capture the mass-dependent features of real air-shower profiles.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
we present the first study of a deep-learning neural-network approach to directly predict a primary’s mass (ln A) from the full longitudinal energy-deposit profile
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
merit factor … X_max … N_μ … GH parameterization
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
A. Aab, et al., Depth of Maximum of Air-Shower Profiles at the Pierre Auger Observatory: Measurements at Energies above 10 17.8 eV, Phys. Rev. D 90 (12) (2014) 122005. arXiv:1409.4809,doi:10.1103/PhysRevD.90.122005
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1103/physrevd.90.122005 2014
-
[2]
A. Aab, et al., Features of the Energy Spectrum of Cosmic Rays above 2.5×10 18 eV Using the Pierre Auger Observatory, Phys. Rev. Lett. 125 (12) (2020) 121106.arXiv:2008.06488, doi:10.1103/PhysRevLett.125.121106
-
[3]
T. Pierog, I. Karpenko, J. M. Katzy, E. Yatsenko, K. Werner, Epos lhc: Test of collective hadronization with data measured at the cern large hadron collider, Phys. Rev. C 92 (2015) 034906.doi:10.1103/PhysRevC.92.034906. – 17 –
-
[4]
F. Riehn, R. Engel, A. Fedynitch, T. K. Gaisser, T. Stanev, Hadronic interaction model sibyll 2.3d and extensive air showers, Phys. Rev. D 102 (2020) 063002. doi:10.1103/PhysRevD.102.063002
-
[5]
A. Coleman, J. Eser, E. Mayotte, F. Sarazin, F. Schr¨ oder, D. Soldin, T. Venters, et al., Ultra high energy cosmic rays The intersection of the Cosmic and Energy Frontiers, Astropart. Phys. 149 (2023) 102819.arXiv:2205.05845,doi:10.1016/j.astropartphys.2023.102819
-
[6]
Measurements of the Cosmic Ray Composition with Air Shower Experiments
K.-H. Kampert, M. Unger, Measurements of the Cosmic Ray Composition with Air Shower Experiments, Astropart. Phys. 35 (2012) 660–678.arXiv:1201.0018, doi:10.1016/j.astropartphys.2012.02.004
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1016/j.astropartphys.2012.02.004 2012
-
[7]
A. V. Olinto, et al., The POEMMA (Probe of Extreme Multi-Messenger Astrophysics) observatory, JCAP 06 (2021) 007.arXiv:2012.07945, doi:10.1088/1475-7516/2021/06/007
-
[8]
M. Ambrosio, C. Aramo, C. Donalek, D. D’Urso, A. D. Erlykin, F. Guarino, A. Insoiia, G. Longo, Comparison between methods for the determination of the primary cosmic ray mass composition from the longitudinal profile of atmospheric cascades, Astropart. Phys. 24 (2005) 355–371.arXiv:astro-ph/0507543,doi:10.1016/j.astropartphys.2005.07.008
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1016/j.astropartphys.2005.07.008 2005
-
[9]
S. Riggi, R. Caruso, A. Insolia, M. Scuderi, A neural network approach to event-by-event cosmic ray primary mass idenfication, PoS ACAT (2007) 035.doi:10.22323/1.050.0035
-
[10]
S. Andringa, R. Conceicao, M. Pimenta, Mass composition and cross-section from the shape of cosmic ray shower longitudinal profiles, Astropart. Phys. 34 (2011) 360–367. doi:10.1016/j.astropartphys.2010.10.002
-
[11]
A. Aab, et al., Measurement of the average shape of longitudinal profiles of cosmic-ray air showers at the Pierre Auger Observatory, JCAP 03 (2019) 018.arXiv:1811.04660, doi:10.1088/1475-7516/2019/03/018
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1088/1475-7516/2019/03/018 2019
-
[12]
A. Abdul Halim, et al., Inference of the Mass Composition of Cosmic Rays with Energies from 1018.5 to 1020 eV Using the Pierre Auger Observatory and Deep Learning, Phys. Rev. Lett. 134 (2) (2025) 021001.arXiv:2406.06315,doi:10.1103/PhysRevLett.134.021001
-
[13]
One-dimensional Hybrid Approach to Extensive Air Shower Simulation
T. Bergmann, et al., One-dimensional hybrid approach to extensive air shower simulation, Astropart. Phys. 26 (2007) 420–432.arXiv:astro-ph/0606564
work page internal anchor Pith review Pith/arXiv arXiv 2007
-
[14]
First Results of Fast One-dimensional Hybrid Simulation of EAS Using CONEX
T. Pierog, et al., First results of fast one-dimensional hybrid simulation of eas using conex, Nucl. Phys. Proc. Suppl. 151 (2006) 159–162.arXiv:astro-ph/0411260
work page internal anchor Pith review Pith/arXiv arXiv 2006
-
[15]
A. Aab, et al., Inferences on mass composition and tests of hadronic interactions from 0.3 to 100 EeV using the water-Cherenkov detectors of the Pierre Auger Observatory, Phys. Rev. D 96 (12) (2017) 122003.arXiv:1710.07249,doi:10.1103/PhysRevD.96.122003
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1103/physrevd.96.122003 2017
-
[16]
Very Deep Convolutional Networks for Large-Scale Image Recognition
K. Simonyan, A. Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition (9 2014).arXiv:1409.1556
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[17]
K. He, X. Zhang, S. Ren, J. Sun, Deep Residual Learning for Image Recognition (12 2015). arXiv:1512.03385,doi:10.1109/CVPR.2016.90
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1109/cvpr.2016.90 2015
-
[18]
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)
D.-A. Clevert, T. Unterthiner, S. Hochreiter, Fast and accurate deep network learning by exponential linear units (elus) (2016).arXiv:1511.07289. URLhttps://arxiv.org/abs/1511.07289
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[19]
URLhttps://doi.org/10.5281/zenodo.10488964
The Pierre Auger Collaboration, Pierre auger observatory open data (v3) (2024). URLhttps://doi.org/10.5281/zenodo.10488964
-
[20]
B. Flaggs, A. Coleman, F. G. Schr¨ oder, Studying the mass sensitivity of air-shower observables using simulated cosmic rays, Phys. Rev. D 109 (4) (2024) 042002.arXiv:2306.13246, doi:10.1103/PhysRevD.109.042002. – 18 –
-
[21]
E. Mayotte, Measurement and Interpretation of UHECR Mass Composition at the Pierre Auger Observatory, in: 39th International Cosmic Ray Conference, 2025.arXiv:2507.10292
-
[22]
The Fluorescence Detector of the Pierre Auger Observatory
J. Abraham, et al., The Fluorescence Detector of the Pierre Auger Observatory, Nucl. Instrum. Meth. A 620 (2010) 227–251.arXiv:0907.4282,doi:10.1016/j.nima.2010.04.023
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1016/j.nima.2010.04.023 2010
-
[23]
Kawai, et al., Telescope array experiment, Nucl
H. Kawai, et al., Telescope array experiment, Nucl. Phys. B Proc. Suppl. 175-176 (2008) 221–226.doi:10.1016/j.nuclphysbps.2007.11.002
-
[24]
T. Fujii, The Global Cosmic Ray Observatory – Challenging next-generation multi-messenger astronomy with interdisciplinary research, in: 39th International Cosmic Ray Conference, 2025. arXiv:2507.04588. – 19 –
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.