pith. sign in

arxiv: 2507.16913 · v2 · submitted 2025-07-22 · ✦ hep-ph · hep-ex

A linear PDF model for Bayesian inference

Pith reviewed 2026-05-19 03:10 UTC · model grok-4.3

classification ✦ hep-ph hep-ex
keywords parton distribution functionsBayesian inferencelinear modelsneural network reductiondeep inelastic scatteringglobal fitsuncertainty quantificationmodel selection
0
0 comments X

The pith

A linear model using reduced neural-network bases supports fast Bayesian inference for parton distribution functions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a linear representation for PDFs to make Bayesian methods feasible for global fits. PDFs are expressed as vectors in a functional space spanned by bases obtained through dimensional reduction of a neural network functional space, yielding a compact yet flexible form. Low dimensionality speeds up sampling while the adjustable basis size lets users balance under- and over-fitting and perform principled model selection inside Bayesian workflows. The approach is tested on synthetic deep-inelastic-scattering data through multi-closure tests to establish readiness for full global analyses.

Core claim

PDFs are represented as linear combinations of specially chosen basis functions derived from dimensional reduction of a neural-network functional space, producing low-dimensional models that enable computationally efficient Bayesian inference with transparent control over model complexity.

What carries the argument

Linear PDF model whose functional space is spanned by bases obtained from dimensional reduction of a neural network functional space, which carries the representation for fast sampling and adjustable expressivity.

If this is right

  • Inference becomes fast enough for routine use because the preferred models remain low-dimensional.
  • Basis size can be increased or decreased systematically to trade off underfitting against overfitting.
  • Bayesian model selection becomes straightforward once the basis dimension is treated as a discrete hyperparameter.
  • The same framework can be scaled to global PDF fits once validated on synthetic data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The reduced bases may let analysts compare the impact of different prior choices more directly than with full neural-network models.
  • Low-dimensional linear models could be combined with existing PDF codes to produce uncertainty bands that are both faster and easier to interpret.
  • If the bases prove stable under addition of new data sets, the method would lower the barrier to frequent re-fits during the High-Luminosity LHC era.

Load-bearing premise

Bases obtained by reducing a neural-network functional space still span every feature of PDFs that matters for accurate global fits.

What would settle it

A multi-closure test in which the linear model recovers input PDFs from synthetic data with systematically larger uncertainties or biased central values compared with a full neural-network parametrization in the same kinematic range.

read the original abstract

A robust uncertainty estimate in global analyses of Parton Distribution Functions (PDFs) is essential at the Large Hadron Collider (LHC), especially in view of the high-precision data anticipated by experimentalists in the High-Luminosity phase of the LHC. A Bayesian framework to determine PDFs provides a rigorous treatment of uncertainties and full control on the prior, though its practical implementation can be computationally demanding. To address these challenges, we introduce a novel approach to PDF determination tailored for Bayesian inference, based on the use of linear models. Unlike traditional parametrisations, our method represents PDFs as vectors in a functional space spanned by specially chosen bases, derived from the dimensional reduction of a neural network functional space, providing a compact yet versatile representation of PDFs. The low-dimensionality of the preferred models allows for particularly fast inference. The size of the bases can be systematically adjusted, allowing for transparent control over underfitting and overfitting, and facilitating principled model selection through Bayesian workflows. In this work, the methodology is applied to a fit of Deep Inelastic Scattering synthetic data, and thoroughly tested via multi-closure tests, thus paving the way to its application to global PDF fits.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper introduces a linear parametrization of PDFs as vectors in a low-dimensional functional space obtained via dimensional reduction of a neural network ensemble. This basis is intended to enable computationally efficient Bayesian inference while allowing systematic adjustment of basis size to control under- and overfitting. The approach is demonstrated on synthetic DIS data with multi-closure tests.

Significance. If the retained basis truly spans all relevant PDF shapes without material loss of expressivity, the method would offer a practical route to Bayesian global PDF fits with transparent model selection and reduced computational cost. The adjustable dimensionality and emphasis on Bayesian workflows are positive features that could complement existing NN-based approaches.

major comments (3)
  1. [Methodology / basis construction] The central claim that the NN-derived linear basis provides a compact yet versatile representation spanning all relevant PDF features (abstract and methodology) rests on the untested assumption that variations orthogonal to the retained principal directions are negligible. No explicit reproduction of independent parametrizations (NNPDF or CT18 replicas) is reported, nor is preservation of sum rules and positivity quantified beyond the DIS training ensemble.
  2. [Results / multi-closure tests] Multi-closure tests are performed exclusively on synthetic DIS data (results section). This setup cannot detect systematic bias when the same fixed basis is applied to global fits that include Drell-Yan, jet, or top data, because any PDF feature outside the span of the original NN ensemble is projected out by construction.
  3. [Results] The paper does not provide quantitative metrics (e.g., bias, variance, or uncertainty coverage) from the closure tests that would allow assessment of whether the linear model recovers the input PDFs to within the target precision while remaining parameter-free in the sense claimed.
minor comments (2)
  1. [Methodology] Notation for the basis vectors and the projection operator should be defined explicitly with an equation number to avoid ambiguity when discussing linear combinations.
  2. [Figures] Figure captions for the closure-test results should include the number of replicas, the chosen basis dimension, and a direct comparison to a standard NN fit on the same data.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive and detailed report. We address each major comment below, indicating where revisions will be made to strengthen the manuscript while clarifying the scope of the current work, which focuses on a proof-of-concept application to synthetic DIS data.

read point-by-point responses
  1. Referee: [Methodology / basis construction] The central claim that the NN-derived linear basis provides a compact yet versatile representation spanning all relevant PDF features (abstract and methodology) rests on the untested assumption that variations orthogonal to the retained principal directions are negligible. No explicit reproduction of independent parametrizations (NNPDF or CT18 replicas) is reported, nor is preservation of sum rules and positivity quantified beyond the DIS training ensemble.

    Authors: We agree that explicit quantification of the retained variance and constraints would strengthen the presentation. In the revised manuscript we will add a supplementary figure showing the cumulative explained variance ratio from the principal component analysis of the NN ensemble, confirming that the chosen basis dimensionality captures the large majority of functional variation. We will also report numerical measures of sum-rule violation and the incidence of positivity violations across the closure-test replicas. Direct reproduction of NNPDF or CT18 replicas lies outside the present scope, which is restricted to synthetic DIS data; we will add a clarifying sentence in the methodology section noting that the basis construction procedure is general and can be applied to broader ensembles in future global analyses. revision: partial

  2. Referee: [Results / multi-closure tests] Multi-closure tests are performed exclusively on synthetic DIS data (results section). This setup cannot detect systematic bias when the same fixed basis is applied to global fits that include Drell-Yan, jet, or top data, because any PDF feature outside the span of the original NN ensemble is projected out by construction.

    Authors: The referee correctly identifies the controlled nature of the test. The manuscript explicitly frames the study as a validation of the linear-model methodology on synthetic DIS data, with the explicit goal of paving the way for global fits. We will revise the discussion and conclusions to state this limitation more prominently and to outline how a future basis can be constructed from an NN ensemble trained on a wider set of processes. Within the DIS setting the multi-closure tests already demonstrate faithful recovery, and the adjustable dimensionality of the linear model provides a direct mechanism to enlarge the span when additional data are included. revision: partial

  3. Referee: [Results] The paper does not provide quantitative metrics (e.g., bias, variance, or uncertainty coverage) from the closure tests that would allow assessment of whether the linear model recovers the input PDFs to within the target precision while remaining parameter-free in the sense claimed.

    Authors: We thank the referee for this observation. Although the results section presents visual comparisons and qualitative statements of recovery, we agree that explicit numerical metrics would improve clarity. In the revised version we will add a table (or inline values) reporting the mean bias, root-mean-square deviation, and the fraction of input PDF points lying inside the 68 % and 95 % credible intervals, averaged over the full set of closure-test replicas. These quantities will be computed both for the central values and for the uncertainty bands, directly addressing the assessment of precision and coverage. revision: yes

Circularity Check

0 steps flagged

Basis construction presented as independent upstream step; no reduction to fitted inputs by construction

full rationale

The paper describes representing PDFs via linear combinations of bases obtained from dimensional reduction of a neural network functional space, then performing Bayesian inference on the resulting low-dimensional coefficients. This basis choice is framed as a prior methodological step separate from the subsequent fit to synthetic DIS data. No equations or claims in the abstract equate a derived quantity (such as a predicted PDF shape or uncertainty) directly to a fitted parameter or self-citation by construction. Multi-closure tests are described as validation on held-out synthetic data rather than tautological recovery of the same inputs. The derivation chain therefore remains self-contained against external benchmarks and does not trigger any of the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

Abstract-only information limits the ledger to the core modeling assumption stated in the text; no explicit free parameters or invented entities are quantified.

axioms (1)
  • domain assumption The functional space of PDFs can be effectively spanned by a reduced basis obtained from dimensional reduction of a neural network functional space.
    This premise underpins the entire linear model construction and is invoked to justify compactness and versatility.
invented entities (1)
  • Linear PDF model in reduced NN-derived basis no independent evidence
    purpose: To enable fast Bayesian inference while retaining control over underfitting and overfitting
    New representation introduced to address computational demands of traditional Bayesian PDF fits.

pith-pipeline@v0.9.0 · 5736 in / 1301 out tokens · 35454 ms · 2026-05-19T03:10:23.743367+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Proton Structure from Neural Simulation-Based Inference at the LHC

    hep-ph 2026-04 unverdicted novelty 8.0

    Neural simulation-based inference on unbinned top-quark pair data at 13 TeV yields improved gluon PDF precision over traditional binned analyses while incorporating experimental and theoretical uncertainties.

  2. Precision QCD with the Electron-Ion Collider

    hep-ph 2026-04 unverdicted novelty 1.0

    A workshop summary report outlines discussion topics in perturbative QCD, nuclear structure, and related techniques for the upcoming Electron-Ion Collider.

Reference graph

Works this paper leans on

68 extracted references · 68 canonical work pages · cited by 2 Pith papers · 20 internal anchors

  1. [1]

    Amorosoet al., Snowmass 2021 Whitepaper: Proton Structure at the Precision Frontier, Acta Phys

    S. Amoroso et al., “Snowmass 2021 Whitepaper: Proton Structure at the Precision Frontier,” Acta Phys. Polon. B 53 no. 12, (2022) 12–A1, arXiv:2203.13923 [hep-ph]

  2. [2]

    Parton Distribution Functions and Their Impact on Precision of the Current Theory Calculations,

    M. Ubiali, “Parton Distribution Functions and Their Impact on Precision of the Current Theory Calculations,” 4, 2024. arXiv:2404.08508 [hep-ph]

  3. [3]

    Towards Ultimate Parton Distributions at the High-Luminosity LHC

    R. Abdul Khalek, S. Bailey, J. Gao, L. Harland-Lang, and J. Rojo, “Towards Ultimate Parton Distributions at the High-Luminosity LHC,” Eur. Phys. J. C 78 no. 11, (2018) 962, arXiv:1810.03639 [hep-ph]

  4. [4]

    Nuclear Uncertainties in the Determination of Proton PDFs

    NNPDF Collaboration, R. D. Ball, E. R. Nocera, and R. L. Pearson, “Nuclear Uncertainties in the Determination of Proton PDFs,” Eur. Phys. J. C 79 no. 3, (2019) 282, arXiv:1812.09074 [hep-ph]

  5. [5]

    Determination of the theory uncertainties from missing higher orders on NNLO parton distributions with percent accuracy,

    NNPDF Collaboration, R. D. Ball et al., “Determination of the theory uncertainties from missing higher orders on NNLO parton distributions with percent accuracy,” Eur. Phys. J. C 84 no. 5, (2024) 517, arXiv:2401.10319 [hep-ph]

  6. [6]

    Evaluating the faithfulness of PDF uncertainties in the presence of inconsistent data,

    A. Barontini, M. N. Costantini, G. De Crescenzo, S. Forte, and M. Ubiali, “Evaluating the faithfulness of PDF uncertainties in the presence of inconsistent data,” arXiv:2503.17447 [hep-ph]

  7. [7]

    Parton Distribution Functions, $\alpha_s$ and Heavy-Quark Masses for LHC Run II

    S. Alekhin, J. Bl¨ umlein, S. Moch, and R. Placakyte, “Parton distribution functions, αs, and heavy-quark masses for LHC Run II,” Phys. Rev. D 96 no. 1, (2017) 014011, arXiv:1701.05838 [hep-ph]

  8. [8]

    New CTEQ global analysis of quantum chromodynamics with high-precision data from the LHC

    T.-J. Hou et al., “New CTEQ global analysis of quantum chromodynamics with high-precision data from the LHC,” Phys. Rev. D 103 no. 1, (2021) 014013, arXiv:1912.10053 [hep-ph]

  9. [9]

    Bailey, T

    S. Bailey, T. Cridge, L. A. Harland-Lang, A. D. Martin, and R. S. Thorne, “Parton distributions from LHC, HERA, Tevatron and fixed target data: MSHT20 PDFs,” Eur. Phys. J. C 81 no. 4, (2021) 341, arXiv:2012.04684 [hep-ph]

  10. [10]

    Cridge, L

    T. Cridge, L. A. Harland-Lang, and R. S. Thorne, “Combining QED and approximate N 3LO QCD corrections in a global PDF fit: MSHT20qed an3lo PDFs,” SciPost Phys. 17 no. 1, (2024) 026, arXiv:2312.07665 [hep-ph] . – 32 –

  11. [11]

    Ball et al., The path to proton structure at 1% accuracy , Eur

    NNPDF Collaboration, R. D. Ball et al., “The path to proton structure at 1% accuracy,” Eur. Phys. J. C 82 no. 5, (2022) 428, arXiv:2109.02653 [hep-ph]

  12. [12]

    The path to N 3LO parton distributions,

    NNPDF Collaboration, R. D. Ball et al., “The path to N 3LO parton distributions,” Eur. Phys. J. C 84 no. 7, (2024) 659, arXiv:2402.18635 [hep-ph]

  13. [13]

    Parton distributions confront LHC Run II data: a quantitative appraisal,

    A. Chiefa, M. N. Costantini, J. Cruz-Martinez, E. R. Nocera, T. R. Rabemananjara, J. Rojo, T. Sharma, R. Stegeman, and M. Ubiali, “Parton distributions confront LHC Run II data: a quantitative appraisal,” arXiv:2501.10359 [hep-ph]

  14. [14]

    PDF4LHC recommendations for LHC Run II

    J. Butterworth et al., “PDF4LHC recommendations for LHC Run II,” J. Phys. G 43 (2016) 023001, arXiv:1510.03865 [hep-ph]

  15. [15]

    High-energy physics event generation with PYTHIA 5.7 and JETSET 7.4,

    PDF4LHC Working Group Collaboration, R. D. Ball et al., “The PDF4LHC21 combination of global PDF fits for the LHC Run III,” J. Phys. G 49 no. 8, (2022) 080501, arXiv:2203.05506 [hep-ph]

  16. [16]

    Bayesian approach to inverse problems: an application to NNPDF closure testing,

    L. Del Debbio, T. Giani, and M. Wilson, “Bayesian approach to inverse problems: an application to NNPDF closure testing,” Eur. Phys. J. C 82 no. 4, (2022) 330, arXiv:2111.05787 [hep-ph]

  17. [17]

    A stress test of global PDF fits: closure testing the MSHT PDFs and a first direct comparison to the neural net approach,

    L. A. Harland-Lang, T. Cridge, and R. S. Thorne, “A stress test of global PDF fits: closure testing the MSHT PDFs and a first direct comparison to the neural net approach,” Eur. Phys. J. C 85 no. 3, (2025) 316, arXiv:2407.07944 [hep-ph]

  18. [18]

    ATLAS Collaboration, G. Aad et al., “A precise measurement of the Z-boson double-differential transverse momentum and rapidity distributions in the full phase space of the decay leptons with the ATLAS experiment at √s = 8 TeV,” Eur. Phys. J. C 84 no. 3, (2024) 315, arXiv:2309.09318 [hep-ex]

  19. [19]

    Submitted to Physics Letters B

    CMS Collaboration, A. Hayrapetyan et al., “Measurement of the Drell–Yan forward-backward asymmetry and of the effective leptonic weak mixing angle in proton-proton collisions at √s = 13 TeV,” arXiv:2408.07622 [hep-ex]

  20. [20]

    Measurement of the W-boson mass and width with the ATLAS detector using proton-proton collisions at √s = 7 TeV

    ATLAS Collaboration, G. Aad et al., “Measurement of the W-boson mass and width with the ATLAS detector using proton–proton collisions at √s = 7 TeV,” Eur. Phys. J. C 84 no. 12, (2024) 1309, arXiv:2403.15085 [hep-ex]

  21. [21]

    High-precision measurement of the W boson mass with the CMS experiment

    CMS Collaboration, V. Chekhovsky et al., “High-precision measurement of the W boson mass with the CMS experiment at the LHC,” arXiv:2412.13872 [hep-ex]

  22. [22]

    A critical study of the Monte Carlo replica method,

    M. N. Costantini, M. Madigan, L. Mantani, and J. M. Moore, “A critical study of the Monte Carlo replica method,” arXiv:2404.10056 [hep-ph]

  23. [23]

    SIMUnet: an open-source tool for simultaneous global fits of EFT Wilson coefficients and PDFs,

    PBSP Collaboration, M. N. Costantini, E. Hammou, Z. Kassabov, M. Madigan, L. Mantani, M. Morales Alvarado, J. M. Moore, and M. Ubiali, “SIMUnet: an open-source tool for simultaneous global fits of EFT Wilson coefficients and PDFs,” Eur. Phys. J. C 84 no. 8, (2024) 805, arXiv:2402.03308 [hep-ph]

  24. [24]

    The top quark legacy of the LHC Run II for PDF and SMEFT analyses,

    Z. Kassabov, M. Madigan, L. Mantani, J. Moore, M. Morales Alvarado, J. Rojo, and M. Ubiali, “The top quark legacy of the LHC Run II for PDF and SMEFT analyses,” JHEP 05 (2023) 205, arXiv:2303.06159 [hep-ph]

  25. [25]

    A new generation of simultaneous fits to LHC data using deep learning,

    S. Iranipour and M. Ubiali, “A new generation of simultaneous fits to LHC data using deep learning,” JHEP 05 (2022) 032, arXiv:2201.07240 [hep-ph]

  26. [26]

    Markov Chain Monte Carlo technics applied to Parton Distribution Functions determination: proof of concept

    Y. G. Gbedo and M. Mangin-Brinet, “Markov chain Monte Carlo techniques applied to – 33 – parton distribution functions determination: Proof of concept,” Phys. Rev. D 96 no. 1, (2017) 014015, arXiv:1701.07678 [hep-ph]

  27. [27]

    Constraints on the Up-Quark Valence Distribution in the Proton,

    R. Aggarwal, M. Botje, A. Caldwell, F. Capel, and O. Schulz, “Constraints on the Up-Quark Valence Distribution in the Proton,” Phys. Rev. Lett. 130 no. 14, (2023) 141901, arXiv:2209.06571 [hep-ph]

  28. [28]

    A comparison of Bayesian sampling algorithms for high-dimensional particle physics and cosmology applications,

    J. Albert, C. Balazs, A. Fowlie, W. Handley, N. Hunt-Smith, R. R. de Austri, and M. White, “A comparison of Bayesian sampling algorithms for high-dimensional particle physics and cosmology applications,” arXiv:2409.18464 [hep-ph]

  29. [29]

    Novel parton density determination code,

    F. Capel, R. Aggarwal, M. Botje, A. Caldwell, O. Schulz, and A. Verbytskyi, “Novel parton density determination code,” Phys. Rev. D 110 no. 1, (2024) 014024, arXiv:2401.17729 [hep-ph]

  30. [30]

    Candido, L

    A. Candido, L. Del Debbio, T. Giani, and G. Petrillo, “Bayesian inference with Gaussian processes for the determination of parton distribution functions,” Eur. Phys. J. C 84 no. 7, (2024) 716, arXiv:2404.07573 [hep-ph]

  31. [31]

    Colibri: an open-source tool for Bayesian PDF fits,

    M. N. Costantini, L. Mantani, J. M. Moore, V. Sch¨ utze S´ anchez, and M. Ubiali, “Colibri: an open-source tool for Bayesian PDF fits,” arXiv:240x.xxxx [hep-ph]

  32. [32]

    The Intrinsic Charm of the Proton,

    S. J. Brodsky, P. Hoyer, C. Peterson, and N. Sakai, “The Intrinsic Charm of the Proton,” Phys. Lett. B 93 (1980) 451–455

  33. [33]

    A review of the intrinsic heavy quark content of the nucleon

    S. J. Brodsky, A. Kusina, F. Lyonnet, I. Schienbein, H. Spiesberger, and R. Vogt, “A review of the intrinsic heavy quark content of the nucleon,” Adv. High Energy Phys. 2015 (2015) 231547, arXiv:1504.06287 [hep-ph]

  34. [34]

    The Intrinsic Charm of the Proton,

    NNPDF Collaboration, R. D. Ball, A. Candido, J. Cruz-Martinez, S. Forte, T. Giani, F. Hekhorn, K. Kudashkin, G. Magni, and J. Rojo, “Evidence for intrinsic charm quarks in the proton,” Nature 608 no. 7923, (2022) 483–487, arXiv:2208.08372 [hep-ph]

  35. [35]

    A Determination of the Charm Content of the Proton

    NNPDF Collaboration, R. D. Ball, V. Bertone, M. Bonvini, S. Carrazza, S. Forte, A. Guffanti, N. P. Hartland, J. Rojo, and L. Rottoli, “A Determination of the Charm Content of the Proton,” Eur. Phys. J. C 76 no. 11, (2016) 647, arXiv:1605.06515 [hep-ph]

  36. [36]

    Intrinsic charm quark valence distribution of the proton,

    NNPDF Collaboration, R. D. Ball, A. Candido, J. Cruz-Martinez, S. Forte, T. Giani, F. Hekhorn, G. Magni, E. R. Nocera, J. Rojo, and R. Stegeman, “Intrinsic charm quark valence distribution of the proton,” Phys. Rev. D 109 no. 9, (2024) L091501, arXiv:2311.00743 [hep-ph]

  37. [37]

    Extended Parameterisations for MSTW PDFs and their effect on Lepton Charge Asymmetry from W Decays

    A. D. Martin, A. J. T. M. Mathijssen, W. J. Stirling, R. S. Thorne, B. J. A. Watt, and G. Watt, “Extended Parameterisations for MSTW PDFs and their effect on Lepton Charge Asymmetry from W Decays,” Eur. Phys. J. C 73 no. 2, (2013) 2318, arXiv:1211.1215 [hep-ph]

  38. [38]

    Holmes, J

    P. Holmes, J. L. Lumley, G. Berkooz, and C. W. Rowley, Proper orthogonal decomposition, p. 68–105. Cambridge Monographs on Mechanics. Cambridge University Press, 2012

  39. [39]

    LHAPDF6: parton density access in the LHC precision era

    A. Buckley, J. Ferrando, S. Lloyd, K. Nordstr¨ om, B. Page, M. R¨ ufenacht, M. Sch¨ onherr, and G. Watt, “LHAPDF6: parton density access in the LHC precision era,” Eur. Phys. J. C 75 (2015) 132, arXiv:1412.7420 [hep-ph]

  40. [40]

    A determination of parton distributions with faithful uncertainty estimation

    NNPDF Collaboration, R. D. Ball, L. Del Debbio, S. Forte, A. Guffanti, J. I. Latorre, A. Piccione, J. Rojo, and M. Ubiali, “A Determination of parton distributions with faithful – 34 – uncertainty estimation,” Nucl. Phys. B 809 (2009) 1–63, arXiv:0808.1231 [hep-ph] . [Erratum: Nucl.Phys.B 816, 293 (2009)]

  41. [41]

    A first unbiased global NLO determination of parton distributions and their uncertainties

    R. D. Ball, L. Del Debbio, S. Forte, A. Guffanti, J. I. Latorre, J. Rojo, and M. Ubiali, “A first unbiased global NLO determination of parton distributions and their uncertainties,” Nucl. Phys. B 838 (2010) 136–206, arXiv:1002.4407 [hep-ph]

  42. [42]

    Carrazza and J.M

    S. Carrazza and J. Cruz-Martinez, “Towards a new generation of parton densities with deep learning models,” Eur. Phys. J. C 79 no. 8, (2019) 676, arXiv:1907.05075 [hep-ph]

  43. [43]

    Hyperparameter optimisation in deep learning from ensemble methods: applications to proton structure,

    J. Cruz-Martinez, A. Jansen, G. van Oord, T. R. Rabemananjara, C. M. R. Rocha, J. Rojo, and R. Stegeman, “Hyperparameter optimisation in deep learning from ensemble methods: applications to proton structure,” Mach. Learn. Sci. Tech. 6 no. 2, (2025) 025027, arXiv:2410.16248 [hep-ph]

  44. [44]

    Understanding the difficulty of training deep feedforward neural networks,

    X. Glorot and Y. Bengio, “Understanding the difficulty of training deep feedforward neural networks,” in Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Y. W. Teh and M. Titterington, eds., vol. 9 of Proceedings of Machine Learning Research, pp. 249–256. PMLR, Chia Laguna Resort, Sardinia, Italy, 13–15 may...

  45. [45]

    EKO: evolution kernel operators,

    A. Candido, F. Hekhorn, and G. Magni, “EKO: evolution kernel operators,” Eur. Phys. J. C 82 no. 10, (2022) 976, arXiv:2202.02338 [hep-ph]

  46. [46]

    Fitting Parton Distribution Data with Multiplicative Normalization Uncertainties

    NNPDF Collaboration, R. D. Ball, L. Del Debbio, S. Forte, A. Guffanti, J. I. Latorre, J. Rojo, and M. Ubiali, “Fitting Parton Distribution Data with Multiplicative Normalization Uncertainties,” JHEP 05 (2010) 075, arXiv:0912.2276 [hep-ph]

  47. [47]

    A first determination of parton distributions with theoretical uncertainties,

    NNPDF Collaboration, R. Abdul Khalek et al., “A first determination of parton distributions with theoretical uncertainties,” Eur. Phys. J. C (2019) 79:838, arXiv:1905.04311 [hep-ph]

  48. [48]

    Abdul Khalek et al., Parton Distributions with Theory Uncertainties: General Formalism and First Phenomenological Studies , Eur

    NNPDF Collaboration, R. Abdul Khalek et al., “Parton Distributions with Theory Uncertainties: General Formalism and First Phenomenological Studies,” Eur. Phys. J. C 79 no. 11, (2019) 931, arXiv:1906.10698 [hep-ph]

  49. [49]

    Candido, S

    A. Candido, S. Forte, and F. Hekhorn, “Can MS parton distributions be negative?,” JHEP 11 (2020) 129, arXiv:2006.07377 [hep-ph]

  50. [50]

    On the positivity of MSbar parton distributions

    A. Candido, S. Forte, T. Giani, and F. Hekhorn, “On the positivity of MS parton distributions,” Eur. Phys. J. C 84 no. 3, (2024) 335, arXiv:2308.00025 [hep-ph]

  51. [51]

    Collaborative Nested Sampling: Big Data versus Complex Physical Models,

    J. Buchner, “Collaborative Nested Sampling: Big Data versus Complex Physical Models,” pasp 131 no. 1004, (Oct., 2019) 108005, arXiv:1707.04476 [stat.CO]

  52. [52]

    2021, UltraNest – a robust, general purpose Bayesian inference engine

    J. Buchner, “UltraNest - a robust, general purpose Bayesian inference engine,” The Journal of Open Source Software 6 no. 60, (Apr., 2021) 3001, arXiv:2101.09604 [stat.CO]

  53. [53]

    D. J. C. MacKay, Information Theory, Inference & Learning Algorithms. Cambridge University Press, USA, 2002

  54. [54]

    Deuteron Uncertainties in the Determination of Proton PDFs,

    R. D. Ball, E. R. Nocera, and R. L. Pearson, “Deuteron Uncertainties in the Determination of Proton PDFs,” Eur. Phys. J. C 81 no. 1, (2021) 37, arXiv:2011.00009 [hep-ph]

  55. [55]

    Accurate Measurement of F2d/F2p and Rd-Rp

    New Muon Collaboration, M. Arneodo et al., “Accurate measurement of F d 2 /F p 2 and Rd − Rp,” Nucl. Phys. B487 (1997) 3–26, arXiv:hep-ex/9611022. – 35 –

  56. [56]

    Precise measurements of the proton and deuteron structure functions from a global analysis of the SLAC deep inelastic electron scattering cross-sections,

    L. W. Whitlow, E. M. Riordan, S. Dasu, S. Rock, and A. Bodek, “Precise measurements of the proton and deuteron structure functions from a global analysis of the SLAC deep inelastic electron scattering cross-sections,” Phys. Lett. B 282 (1992) 475–482

  57. [57]

    A High Statistics Measurement of the Proton Structure Functions F2(x, Q2) and R from Deep Inelastic Muon Scattering at High Q2,

    BCDMS Collaboration, A. C. Benvenuti et al., “A High Statistics Measurement of the Proton Structure Functions F2(x, Q2) and R from Deep Inelastic Muon Scattering at High Q2,” Phys. Lett. B223 (1989) 485

  58. [58]

    Measurement of the proton and deuteron structure functions, F2p and F2d, and of the ratio sigma(L)/sigma(T)

    New Muon Collaboration, M. Arneodo et al., “Measurement of the proton and deuteron structure functions, F p 2 and F d 2 , and of the ratio σL/σT ,” Nucl. Phys. B483 (1997) 3–43, arXiv:hep-ph/9610231

  59. [59]

    Measurement of nucleon structure functions in neutrino scattering,

    CHORUS Collaboration, G. Onengut et al., “Measurement of nucleon structure functions in neutrino scattering,” Phys. Lett. B632 (2006) 65–75

  60. [60]

    Precise Measurement of Dimuon Production Cross-Sections in muon neutrino Fe and muon antineutrino Fe Deep Inelastic Scattering at the Tevatron

    NuTeV Collaboration, M. Goncharov et al., “Precise measurement of dimuon production cross-sections in νµFe and ¯νµFe deep inelastic scattering at the Tevatron,” Phys. Rev. D64 (2001) 112006, arXiv:hep-ex/0102049

  61. [62]

    Combination and QCD analysis of charm and beauty production cross-section measurements in deep inelastic $ep$ scattering at HERA

    H1, ZEUS Collaboration, H. Abramowicz et al., “Combination and QCD analysis of charm and beauty production cross-section measurements in deep inelastic ep scattering at HERA,” Eur. Phys. J. C78 no. 6, (2018) 473, arXiv:1804.01019 [hep-ex]

  62. [63]

    The FAIR Guiding Principles for scientific data management and stewardship,

    M. D. Wilkinson et al., “The FAIR Guiding Principles for scientific data management and stewardship,” Sci. data 160018 (201)

  63. [64]

    Nnpdf/reportengine: Dask scheduler,

    Zaharid, M. N. Costantini, wilsonmr, J. M. Cruz-Martinez, R. Stegeman, C. Voisey, A. Candido, S. Carrazza, and T. Kluyver, “Nnpdf/reportengine: Dask scheduler,” Feb.,

  64. [65]

    https://doi.org/10.5281/zenodo.14803041

  65. [66]

    An open-source machine learning framework for global analyses of parton distributions,

    NNPDF Collaboration, R. D. Ball et al., “An open-source machine learning framework for global analyses of parton distributions,” Eur. Phys. J. C 81 no. 10, (2021) 958, arXiv:2109.02671 [hep-ph]

  66. [67]

    Nnpdf/eko: v0.13.5,

    A. Candido, F. Hekhorn, and G. Magni, “Nnpdf/eko: v0.13.5,” June, 2025. https://doi.org/10.5281/zenodo.15655649

  67. [68]

    A High Statistics Measurement of the Proton Structure Functions F(2) (x, Q**2) and R from Deep Inelastic Muon Scattering at High Q**2,

    BCDMS Collaboration, A. C. Benvenuti et al., “A High Statistics Measurement of the Proton Structure Functions F(2) (x, Q**2) and R from Deep Inelastic Muon Scattering at High Q**2,” Phys. Lett. B 223 (1989) 485–489

  68. [69]

    Combination of Measurements of Inclusive Deep Inelastic $e^{\pm}p$ Scattering Cross Sections and QCD Analysis of HERA Data

    H1, ZEUS Collaboration, H. Abramowicz et al., “Combination of measurements of inclusive deep inelastic e±p scattering cross sections and QCD analysis of HERA data,” Eur. Phys. J. C 75 no. 12, (2015) 580, arXiv:1506.06042 [hep-ex] . – 36 –