Recognition: no theorem link
Open LHC Monte Carlo Event Generation
Pith reviewed 2026-05-13 04:38 UTC · model grok-4.3
The pith
Sharing pre-generated LHC Monte Carlo events as open data reduces duplicated computing effort and resource costs across high energy physics.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Making LHC Monte Carlo events openly available allows the high energy physics community to reuse existing simulations rather than recompute them, directly lowering duplicated effort and resource consumption while maintaining scientific utility.
What carries the argument
Open Event Generation, the practice of sharing completed Monte Carlo simulation datasets so others can analyze them without rerunning the generation step.
If this is right
- Groups can complete analyses faster by starting from existing events rather than waiting for new simulations.
- Overall electricity use and hardware wear from particle physics computing decreases as redundant runs are avoided.
- Smaller research teams gain access to high-statistics samples they could not afford to generate themselves.
- Standard formats and repositories for these shared events become more valuable community resources.
Where Pith is reading between the lines
- Wider adoption could shift funding priorities from raw simulation production toward curation and validation of shared datasets.
- Combining open events with public analysis frameworks might speed up the path from raw data to published results.
- Environmental accounting of LHC computing would improve if reuse rates were tracked systematically.
Load-bearing premise
That sharing the events through new infrastructure can happen without creating major extra work or problems with data accuracy and versioning.
What would settle it
A documented case where reuse of shared events produced measurably worse physics results or higher total computing costs than independent generation would disprove the benefit.
Figures
read the original abstract
The LHC physics programme involves a vast amount of Monte Carlo event simulation. This paper reviews current efforts towards sharing the generated events as Open Data. Open Event Generation helps reduce duplication of effort and resource consumption, and benefits the whole High Energy Physics community. We give examples of use cases and user experiences, discuss financial and environmental savings, and suggest future directions.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript reviews current efforts to share LHC Monte Carlo event simulations as Open Data. It claims that Open Event Generation reduces duplication of effort and resource consumption while benefiting the High Energy Physics community overall. The paper summarizes existing projects, presents use cases and user experiences, discusses qualitative financial and environmental savings, and outlines future directions.
Significance. The review provides a useful overview of ongoing open-data initiatives in LHC Monte Carlo generation and explicitly credits collaborative projects aimed at reducing redundant computations. If the advocated sharing practices are widely adopted, they could yield meaningful reductions in computing costs and environmental impact for the LHC program while improving accessibility across the HEP community.
minor comments (2)
- The discussion of financial and environmental savings would be strengthened by adding at least one concrete quantitative estimate or reference to an external study, even if only as an illustrative example.
- A summary table listing the key open event generation projects, their current status, and main features would improve readability and allow readers to quickly compare the initiatives described.
Simulated Author's Rebuttal
We thank the referee for their positive assessment of our manuscript on Open LHC Monte Carlo Event Generation. The review correctly identifies the value of open data initiatives in reducing redundant simulations, computational costs, and environmental impact within the HEP community. We note the recommendation for minor revision but observe that no specific major comments were provided in the report.
Circularity Check
No significant circularity; review paper with no derivations
full rationale
This is a review paper summarizing existing open data efforts for LHC Monte Carlo events, presenting use cases, qualitative savings, and future directions. It contains no equations, derivations, fitted parameters, or quantitative predictions that could reduce to inputs by construction. Central claims rest on descriptive examples from external efforts rather than any self-referential chain or ansatz. The structure is self-contained against external benchmarks with no load-bearing self-citations or uniqueness theorems invoked.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
J. Butterworth, S. Kraml, H. Prosper et al., Reinterpretation and preservation of data and analyses in HEP (2025), 2504.00256
-
[2]
H. Simma, Fair Data and ILDG , Presented at Lattice Practices 2023, DESY, Hamburg, Germany , https://indico.desy.de/event/40590/contributions/149745/ (13 Oct. 2023)
work page 2023
-
[3]
Scientific data3(1), 1–9 (2016)
M. Wilkinson et al., The FAIR Guiding Principles for scientific data management and stewardship , Sci Data 3, 160018 (2016), doi:10.1038/sdata.2016.18
-
[4]
F. Karsch, H. Simma and T. Yoshie, The International Lattice Data Grid -- towards FAIR data , PoS LATTICE2022, 244 (2023), doi:10.22323/1.430.0244, 2212.08392
-
[5]
Virgo Consortium , VirgoDB , https://virgodb.dur.ac.uk/
-
[6]
CERN Open Data Portal , https://opendata.cern.ch/ (2026)
work page 2026
-
[7]
Šimko et al., Reana: A system for reusable research data analyses, EPJ Web Conf
A. Rizzi, G. Petrucciani and M. Peruzzi, A further reduction in CMS event data for analysis: the NANOAOD format , EPJ Web Conf. 214, 06021 (2019), doi:10.1051/epjconf/201921406021
-
[9]
M. Dobbs and J. B. Hansen, The HepMC C++ Monte Carlo Event Record for High Energy Physics , Tech. Rep. ATL-SOFT-2000-001 https://cds.cern.ch/record/684090, CERN, Geneva (2000)
work page 2000
-
[10]
cernopendata-client, https://cernopendata-client.readthedocs.io/en/latest/index.html (2026)
work page 2026
-
[11]
Creative Commons , CC0 1.0 UNIVERSAL , https://creativecommons.org/publicdomain/zero/1.0/
-
[12]
The ATLAS Open Data team , Citing ATLAS , https://opendata.atlas.cern/docs/documentation/ethical_legal/citation_policy (2025)
work page 2025
-
[13]
CMS Data Preservation and Open Access Group , About CMS , https://opendata.cern.ch/docs/about-cms (2026)
work page 2026
-
[14]
E. Maguire, L. Heinrich and G. Watt, HEPData: a repository for high energy physics data , J. Phys. Conf. Ser. 898, 102006 (2017), doi:10.1088/1742-6596/898/10/102006, 1704.05473
-
[15]
S. Bailey et al., Data and Analysis Preservation, Recasting, and Reinterpretation (2022), Contribution to Snowmass 2021, 2203.10057
-
[16]
CERN Open Data , Discussion forum , https://opendata-forum.cern.ch/
-
[17]
ATLAS Collaboration , DAOD\_PHYSLITE format 2015-2016 Open Data for Research from the ATLAS experiment , doi:10.7483/OPENDATA.ATLAS.9HK7.P5SI
-
[18]
ATLAS collaboration , HEPMC format 13 TeV proton-proton Open Data from the ATLAS experiment , 10.7483/OPENDATA.ATLAS.XPJL.4L42 (2025)
-
[19]
Bothmann et al., Event Generation with Sherpa 2.2 , SciPost Phys
E. Bothmann et al., Event Generation with Sherpa 2.2 , SciPost Phys. 7(3), 034 (2019), doi:10.21468/SciPostPhys.7.3.034, 1905.09127
-
[20]
E. Bothmann et al., Event generation with Sherpa 3 , JHEP 12, 156 (2024), doi:10.1007/JHEP12(2024)156, 2410.22148
-
[21]
A. Buckley, P. Ilten, D. Konstantinov, L. L\"onnblad, J. Monk, W. Pokorski, T. Przedzinski and A. Verbytskyi, The HepMC3 event record library for Monte Carlo event generators , Comput. Phys. Commun. 260, 107310 (2021), doi:10.1016/j.cpc.2020.107310, 1912.08005
-
[22]
M. Barisits, T. Beermann, F. Berghaus, B. Bockelman, J. Bogado, D. Cameron, D. Christidis, D. Ciangottini, G. Dimitrov, M. Elsing, V. Garonne, A. di Girolamo et al., Rucio: Scientific data management, Computing and Software for Big Science 3(1) (2019), doi:10.1007/s41781-019-0026-3
-
[23]
H. Nielsen, R. T. Fielding and T. Berners-Lee, Hypertext Transfer Protocol -- HTTP/1.0 , RFC 1945, doi:10.17487/RFC1945 (1996)
- [24]
- [25]
-
[26]
Šimko et al., Reana: A system for reusable research data analyses, EPJ Web Conf
E. Bocchi, D. Castro, H. Gonzalez, M. Lamanna, P. Mato, J. Moscicki, D. Piparo and E. Tejedor, Facilitating collaborative analysis in swan, EPJ Web Conf. 214, 07022 (2019), doi:10.1051/epjconf/201921407022
-
[27]
Giordano et al., HEPScore: A new CPU benchmark for the WLCG , EPJ Web of Conf
Gazzarrini, Elena , Garcia Garcia, Enrique , Gosein, Domenic and Espinal, Xavier , The virtual research environment: A multi-science analysis platform, EPJ Web of Conf. 295, 08023 (2024), doi:10.1051/epjconf/202429508023
-
[28]
The ATLAS Open Data team , atlasopenmagic, https://github.com/atlas-outreach-data-tools/atlasopenmagic (2025)
work page 2025
-
[29]
ATLAS Collaboration , Using the open event generation data, https://opendata.atlas.cern/docs/tutresearch/openevgentut (2025)
work page 2025
-
[30]
DELPHES 3, A modular framework for fast simulation of a generic collider experiment
J. de Favereau, C. Delaere, P. Demin, A. Giammanco, V. Lema \^ tre, A. Mertens and M. Selvaggi, DELPHES 3, A modular framework for fast simulation of a generic collider experiment , JHEP 02, 057 (2014), doi:10.1007/JHEP02(2014)057, 1307.6346
work page internal anchor Pith review doi:10.1007/jhep02(2014)057 2014
-
[31]
CMS Collaboration , CMS Open Data Guide , https://cms-opendata-guide.web.cern.ch (2026)
work page 2026
-
[32]
CMS Simulated Dataset Names , https://opendata.cern.ch/docs/cms-simulated-dataset-names (2026)
work page 2026
-
[33]
CMS Collaboration , Simulated dataset TTToSemiLeptonic\_TuneCP5\_13TeV-powheg-pythia8 in NANOAODSIM format for 2016 collision data , doi:10.7483/OPENDATA.CMS.4J3Y.1CME, The link under Dataset Semantics directs to the variable listing. (2024)
-
[34]
C. Bierlich, L. L \"o nnblad and T. Sj \"o strand, The PYTHIA Facility (2026), 2603.01744
- [35]
-
[36]
A. Dainese, M. Mangano, A. B. Meyer, A. Nisati, G. Salam and M. A. Vesterinen, Report on the Physics at the HL-LHC, and Perspectives for the HE-LHC , Tech. Rep. CERN-2019-007 https://cds.cern.ch/record/2703572, Geneva, Switzerland, doi:10.23731/CYRM-2019-007 (2019)
- [37]
-
[38]
M. Drees, H. Dreiner, D. Schmeier, J. Tattersall and J. S. Kim, CheckMATE: Confronting your Favourite New Physics Model with LHC Data , Comput. Phys. Commun. 187, 227 (2015), doi:10.1016/j.cpc.2014.10.018, 1312.2591
-
[39]
D. Dercks, N. Desai, J. S. Kim, K. Rolbiecki, J. Tattersall and T. Weber, CheckMATE 2: From the model to the limit , Comput. Phys. Commun. 221, 383 (2017), doi:10.1016/j.cpc.2017.08.021, 1611.09856
-
[40]
E. Conte, B. Fuks and G. Serret, MadAnalysis 5, A User-Friendly Framework for Collider Phenomenology , Comput. Phys. Commun. 184, 222 (2013), doi:10.1016/j.cpc.2012.09.009, 1206.1599
-
[41]
E. Conte and B. Fuks, Confronting new physics theories to LHC data with MADANALYSIS 5 , Int. J. Mod. Phys. A 33(28), 1830027 (2018), doi:10.1142/S0217751X18300272, 1808.00480
-
[42]
C. Bierlich, A. Buckley, J. M. Butterworth, C. Gutschow, L. Lonnblad, T. Procter, P. Richardson and Y. Yeh, Robust independent validation of experiment and theory: Rivet version 4 release note , SciPost Phys. Codeb. 36, 1 (2024), doi:10.21468/SciPostPhysCodeb.36, 2404.15984
-
[43]
A. Buckley, J. Butterworth, D. Grellscheid, H. Hoeth, L. Lonnblad, J. Monk, H. Schulz and F. Siegert, Rivet user manual , Comput. Phys. Commun. 184, 2803 (2013), doi:10.1016/j.cpc.2013.05.021, 1003.0694
- [44]
- [45]
-
[46]
Matching NLO QCD computations with Parton Shower simulations: the POWHEG method
S. Frixione, P. Nason and C. Oleari, Matching NLO QCD computations with Parton Shower simulations: the POWHEG method , JHEP 11, 070 (2007), doi:10.1088/1126-6708/2007/11/070, 0709.2092
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1088/1126-6708/2007/11/070 2007
-
[47]
A general framework for implementing NLO calculations in shower Monte Carlo programs: the POWHEG BOX
S. Alioli, P. Nason, C. Oleari and E. Re, A general framework for implementing NLO calculations in shower Monte Carlo programs: the POWHEG BOX , JHEP 06, 043 (2010), doi:10.1007/JHEP06(2010)043, 1002.2581
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1007/jhep06(2010)043 2010
-
[48]
A. Buckley et al., Testing new physics models with global comparisons to collider measurements: the Contur toolkit , SciPost Phys. Core 4, 013 (2021), doi:10.21468/SciPostPhysCore.4.2.013, 2102.04377
-
[49]
J. M. Butterworth, D. Grellscheid, M. Kr \"a mer, B. Sarrazin and D. Yallup, Constraining new physics with collider measurements of Standard Model signatures , JHEP 03, 078 (2017), doi:10.1007/JHEP03(2017)078, 1606.05296
-
[50]
A. M. Sirunyan et al., Measurement of the differential Drell-Yan cross section in proton-proton collisions at s = 13 TeV , JHEP 12, 059 (2019), doi:10.1007/JHEP12(2019)059, 1812.10529
-
[51]
E. Erdogan and J. Butterworth, Hepmc access repository, https://gitlab.cern.ch/mcnet/hepmc-access
-
[52]
HTCondor , Htcondor documentation, https://htcondor.readthedocs.io/en/latest/
-
[53]
YODA Collaboration , Yoda: Yet more objects for data analysis, https://yoda.hepforge.org/
-
[54]
E. Bothmann, A. Buckley, C. G\"utschow, S. Prestel, M. Sch\"onherr et al., A standard convention for particle-level monte carlo event-variation weights, SciPost Phys. Core 6, 007 (2023), doi:10.21468/SciPostPhysCore.6.1.007, 2203.08230
-
[55]
ATLAS Collaboration , Generator event weights, https://opendata.atlas.cern/docs/data/for_research/evgen_weights (2026)
work page 2026
-
[56]
J. Y. Araz, A. Buckley, B. Fuks, H. Reyes-Gonzalez, W. Waltenberger, S. L. Williamson and J. Yellen, Strength in numbers: Optimal and scalable combination of LHC new-physics searches , SciPost Phys. 14(4), 077 (2023), doi:10.21468/SciPostPhys.14.4.077, 2209.00025
-
[57]
Yellen, Selection techniques for optimal meta-analysis of beyond standard model physics , Ph.D
J. Yellen, Selection techniques for optimal meta-analysis of beyond standard model physics , Ph.D. thesis, Glasgow U., doi:10.5525/gla.thesis.85009 (2025)
-
[58]
M. Aaboud et al., Measurements of top-quark pair differential cross-sections in the lepton+jets channel in pp collisions at s =13 TeV using the ATLAS detector , JHEP 11, 191 (2017), doi:10.1007/JHEP11(2017)191, 1708.00727
-
[59]
M. Aaboud et al., Measurements of differential cross sections of top quark pair production in association with jets in pp collisions at s =13 TeV using the ATLAS detector , JHEP 10, 159 (2018), doi:10.1007/JHEP10(2018)159, 1802.06572
-
[60]
M. Aaboud et al., Measurements of inclusive and differential fiducial cross-sections of t t production with additional heavy-flavour jets in proton-proton collisions at s = 13 TeV with the ATLAS detector , JHEP 04, 046 (2019), doi:10.1007/JHEP04(2019)046, 1811.12113
-
[61]
G. Aad et al., Measurements of top-quark pair differential and double-differential cross-sections in the +jets channel with pp collisions at s =13 TeV using the ATLAS detector , Eur. Phys. J. C 79(12), 1028 (2019), doi:10.1140/epjc/s10052-019-7525-6, [Erratum: Eur.Phys.J.C 80, 1092 (2020)], 1908.07305
-
[62]
M. Aaboud et al., Measurement of the cross section for isolated-photon plus jet production in pp collisions at s=13 TeV using the ATLAS detector , Phys. Lett. B 780, 578 (2018), doi:10.1016/j.physletb.2018.03.035, 1801.00112
-
[63]
G. Aad et al., Measurement of isolated-photon plus two-jet production in pp collisions at s=13 TeV with the ATLAS detector , JHEP 03, 179 (2020), doi:10.1007/JHEP03(2020)179, 1912.09866
-
[64]
M. Aaboud et al., Measurements of inclusive and differential fiducial cross-sections of t t production in leptonic final states at s =13 TeV in ATLAS , Eur. Phys. J. C 79(5), 382 (2019), doi:10.1140/epjc/s10052-019-6849-6, 1812.01697
-
[65]
G. Aad et al., Measurements of inclusive and differential cross-sections of t t production in pp collisions at s = 13 TeV with the ATLAS detector , JHEP 10, 191 (2024), doi:10.1007/JHEP10(2024)191, 2403.09452
-
[66]
M. Aaboud et al., Searches for scalar leptoquarks and differential cross-section measurements in dilepton-dijet events in proton-proton collisions at a centre-of-mass energy of s = 13 TeV with the ATLAS experiment , Eur. Phys. J. C 79(9), 733 (2019), doi:10.1140/epjc/s10052-019-7181-x, 1902.00377
-
[67]
G. Aad et al., Measurement of the t t production cross-section and lepton differential distributions in e dilepton events from pp collisions at s =13\, TeV with the ATLAS detector , Eur. Phys. J. C 80(6), 528 (2020), doi:10.1140/epjc/s10052-020-7907-9, 1910.08819
-
[68]
G. Aad et al., Measurements of W^+W^-+ 1 jet production cross-sections in pp collisions at s =13 TeV with the ATLAS detector , JHEP 06, 003 (2021), doi:10.1007/JHEP06(2021)003, 2103.10319
-
[69]
G. Aad et al., Inclusive and differential cross-sections for dilepton t t production measured in s = 13 TeV pp collisions with the ATLAS detector , JHEP 07, 141 (2023), doi:10.1007/JHEP07(2023)141, 2303.15340
- [70]
-
[71]
J. Alwall, M. Herquet, F. Maltoni, O. Mattelaer and T. Stelzer, MadGraph 5 : Going Beyond , JHEP 06, 128 (2011), doi:10.1007/JHEP06(2011)128, 1106.0522
-
[72]
J. Alwall, R. Frederix, S. Frixione, V. Hirschi, F. Maltoni, O. Mattelaer, H. S. Shao, T. Stelzer, P. Torrielli and M. Zaro, The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations , JHEP 07, 079 (2014), doi:10.1007/JHEP07(2014)079, 1405.0301
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1007/jhep07(2014)079 2014
-
[73]
G. Aad et al., Tools for estimating fake/non-prompt lepton backgrounds with the ATLAS detector at the LHC , Journal of Instrumentation 18(11), T11004 (2023), doi:10.1088/1748-0221/18/11/T11004
-
[74]
ATLAS Collaboration , Search for top-squark pair production in final states with one lepton, jets, and missing transverse momentum using \(36\, fb ^ -1 \) of \( s = 13\, TeV \) \(pp\) collision data with the ATLAS detector , JHEP 06, 108 (2018), doi:10.1007/JHEP06(2018)108, HEPData entry https://www.hepdata.net/record/resource/406718?landing_page=true, 1711.11520
-
[76]
A. Buckley, L. Corpe, M. Habedank and T. Procter, Enabling stable preservation of ML algorithms in high-energy physics with petrifyML (2025), doi:10.48550/arXiv.2509.11830, 2509.11830
-
[77]
P. Baro n , M. H. Seymour and A. Si \'o dmok, Novel approach to measure quark/gluon jets at the LHC , Eur. Phys. J. C 84(1), 28 (2024), doi:10.1140/epjc/s10052-023-12363-4, 2307.15378
-
[78]
Bahr et al., Herwig++ Physics and Manual , Eur
M. Bahr et al., Herwig++ Physics and Manual , Eur. Phys. J. C 58, 639 (2008), doi:10.1140/epjc/s10052-008-0798-9, 0803.0883
-
[79]
Bellm et al., Herwig 7.0/Herwig++ 3.0 release note , Eur
J. Bellm et al., Herwig 7.0/Herwig++ 3.0 release note , Eur. Phys. J. C 76(4), 196 (2016), doi:10.1140/epjc/s10052-016-4018-8, 1512.01178
-
[80]
Bewick et al., Herwig 7.3 release note , Eur
G. Bewick et al., Herwig 7.3 release note , Eur. Phys. J. C 84(10), 1053 (2024), doi:10.1140/epjc/s10052-024-13211-9, 2312.05175
-
[81]
Bellm, et al., arXiv:2512.16645 (2025)
J. Bellm et al., The Physics of Herwig 7 (2025), 2512.16645
-
[82]
CMS Collaboration , Jet primary dataset in AOD format from RunB of 2011 (/Jet/Run2011B-12Oct2013-v1/AOD) , doi:10.7483/OPENDATA.CMS.5JYA.HKSG (2022)
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.