pith. sign in

arxiv: 2606.09950 · v1 · pith:MDNBLAUZnew · submitted 2026-06-08 · 💻 cs.LG · nucl-th· physics.comp-ph· physics.data-an

Integrating Out, Twice:The Open-System Case That Neural-Network Ensemble Theory Is Missing

Pith reviewed 2026-06-27 17:11 UTC · model grok-4.3

classification 💻 cs.LG nucl-thphysics.comp-phphysics.data-an
keywords neural network ensemblesopen systemsSchur complementnon-Hermitian operatorsensemble theoryattention mechanismsexpert routingGaussian marginalization
0
0 comments X

The pith

Neural-network ensemble averaging produces only closed-system covariances and misses the open case that conserves flux into a continuous spectrum.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Averaging a network over random parameters equals marginalizing a Gaussian sector via the Schur complement. When the eliminated sector is closed, the result is a covariance and its inverse—the full output of current ensemble methods. The open case, drawn from scattering theory, projects onto channels while the rest carries probability irreversibly into a continuum, yielding a non-Hermitian generator that tracks exactly what is lost. Tests on a truncated attention map, token-level transfer operator, and sparse expert router find this distinctive open content absent. The structural reason is that mainstream neural objects supply only finite or dissipative sectors, not the required continuous-spectrum wave dynamics.

Core claim

The closed case of neural ensemble averaging is the Schur complement of a Gaussian block that returns a covariance and its inverse; this maps the neural tangent kernel to the Fisher sensitivity kernel, the infinite-width limit to the Gaussian-process emulator, and the lazy-to-feature transition to the validity boundary of a reduced-basis emulator. The open case requires an eliminated sector with continuous spectrum and wave-like dynamics to produce a non-Hermitian effective generator that itemizes conserved flux, as in the nuclear optical model. The three tested partitions lack this sector, so the open ledger is either absent, an artifact of the partition, or pinned near a floor by the train

What carries the argument

The Schur complement of the eliminated block, which returns covariance and inverse when the sector is closed and a non-Hermitian generator that conserves lost probability when the sector is open with continuous spectrum.

If this is right

  • The neural tangent kernel is the Fisher sensitivity kernel under the closed-case dictionary.
  • The infinite-width Gaussian limit is the Gaussian-process emulator.
  • The lazy-to-feature transition marks the validity boundary of a reduced-basis emulator.
  • The conserved flux ledger appears only where openness is genuinely present with a continuous-spectrum sector.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Architectures whose internal partitions naturally eliminate a continuum sector could make the open ledger usable for uncertainty that current ensembles treat as epistemic.
  • Training objectives that penalize flux loss may systematically suppress the open-sector signature even when the partition geometry allows it.
  • Replacing relaxational layers with wave-propagating ones in selected blocks would provide a direct test of whether the missing dynamics can be engineered inside existing networks.

Load-bearing premise

That the three tested neural objects constitute representative instances of genuine openness with an eliminated continuous-spectrum sector.

What would settle it

Observation of a neural-network partition whose eliminated sector exhibits continuous spectrum and produces wave-like rather than relaxational dynamics, yielding a non-Hermitian generator whose flux ledger matches the open-system predictions.

read the original abstract

Averaging a neural network over its random parameters and marginalizing a Gaussian sector are the same operation, the Schur complement of the eliminated block, and when that block is closed it returns a covariance and its inverse. That is all a network ensemble produces, the closed case. The open case is missing, and nuclear reaction theory has it worked out. Projecting a scattering problem onto a chosen set of channels, with the rest carrying probability irreversibly to a continuum, leaves a non-Hermitian effective generator that conserves and itemizes exactly what it loses: the nuclear optical model and its generalized optical theorem. I set the two cases side by side using only the moments of a distribution, the algebra of Gaussians, and block inversion, no field theory, and give the closed-case dictionary in full: the neural tangent kernel is the Fisher sensitivity kernel, the infinite-width Gaussian limit is the Gaussian-process emulator, and the lazy-to-feature transition is the validity boundary of a reduced-basis emulator. I then test the open export on a truncated attention map, a token-level transfer operator, and a sparse expert router, and report a mostly negative result. The conserved flux ledger ports wherever openness is genuinely present, but its distinctive content is absent, an artifact of the chosen partition, or pinned near a floor by the training objective, and the operationally useful uncertainty turns out to be epistemic, living in the closed half of the correspondence, not the open one. The negative has a structural reason this note makes precise: the open case needs an eliminated sector with a continuous spectrum and wave-like, not relaxational, dynamics, which mainstream learning's finite or dissipative objects do not supply. This is a note, not a result; its main finding is that negative one, and its value is the map that locates it.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript claims that neural-network ensemble theory realizes only the closed case of marginalization over random parameters, which via the Schur complement on a Gaussian sector yields a covariance and its inverse; the open case from nuclear reaction theory—producing a non-Hermitian effective generator that conserves and itemizes flux to a continuous-spectrum eliminated sector—is absent. It supplies a closed-case dictionary (NTK as Fisher sensitivity kernel, infinite-width limit as GP emulator, lazy-to-feature transition as reduced-basis validity boundary) using only moments and block inversion, then tests three NN objects (truncated attention map, token-level transfer operator, sparse expert router) and reports mostly negative results for open-case signatures such as the conserved flux ledger. The structural negative is attributed to the lack of an eliminated sector with continuous spectrum and wave-like (non-relaxational) dynamics in mainstream finite or dissipative NN objects.

Significance. If the structural diagnosis is correct, the work supplies a precise diagnostic map locating why open-system features have not appeared in ensemble theory and unifies several existing NN equivalences in a parameter-free manner via moments and block inversion. The closed-case dictionary is a clear contribution; the negative result on the three objects, while preliminary, identifies a concrete obstruction that future constructions would need to overcome.

major comments (2)
  1. [Testing section (three NN objects)] Testing section on the three NN objects: the partitions applied to the truncated attention map, token-level transfer operator, and sparse expert router must be shown explicitly to eliminate a sector whose spectrum is continuous and whose dynamics are wave-like rather than relaxational; absent such verification, the reported absence of conserved-flux signatures follows by construction from the Schur-complement algebra and does not establish that no NN ensemble can realize the required open sector.
  2. [Abstract and conclusion] Abstract and concluding paragraph: the claim that 'mainstream learning's finite or dissipative objects do not supply' the eliminated continuous-spectrum sector is load-bearing for the central negative; the three chosen examples are treated as representative, yet the manuscript provides no general argument that every possible partition of an NN object must eliminate only finite or dissipative sectors.
minor comments (1)
  1. The manuscript would benefit from an explicit side-by-side equation block comparing the closed-case Schur complement (covariance) with the open-case non-Hermitian generator and its optical theorem, to make the dictionary immediately usable.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the careful and constructive report. The comments correctly identify points where additional clarification would strengthen the manuscript. We respond to each major comment below.

read point-by-point responses
  1. Referee: Testing section (three NN objects): the partitions applied to the truncated attention map, token-level transfer operator, and sparse expert router must be shown explicitly to eliminate a sector whose spectrum is continuous and whose dynamics are wave-like rather than relaxational; absent such verification, the reported absence of conserved-flux signatures follows by construction from the Schur-complement algebra and does not establish that no NN ensemble can realize the required open sector.

    Authors: We agree that the partitions should be characterized explicitly to confirm the nature of the eliminated sectors. In the revised manuscript we will add explicit descriptions of the partitions for the truncated attention map, token-level transfer operator, and sparse expert router, together with arguments based on their finite dimensionality or dissipative character showing why the eliminated sectors lack continuous spectra and wave-like dynamics. This will make clear that the reported absence of open-case signatures follows from the structure of the tested objects rather than from the algebra alone. revision: yes

  2. Referee: Abstract and conclusion: the claim that 'mainstream learning's finite or dissipative objects do not supply' the eliminated continuous-spectrum sector is load-bearing for the central negative; the three chosen examples are treated as representative, yet the manuscript provides no general argument that every possible partition of an NN object must eliminate only finite or dissipative sectors.

    Authors: The three examples are presented as representative of mainstream finite or dissipative NN objects. We acknowledge that the manuscript supplies no general proof that every conceivable partition of every NN object must eliminate only finite or dissipative sectors; such a proof would require a classification of all possible architectures and partitions and lies outside the scope of this note. We will revise the abstract and conclusion to qualify the claim accordingly while preserving the structural diagnosis and the negative result for the tested cases. revision: partial

standing simulated objections not resolved
  • A general argument establishing that no possible partition of any neural-network object can eliminate a continuous-spectrum sector with wave-like dynamics.

Circularity Check

0 steps flagged

Derivation self-contained; no load-bearing reductions to inputs or self-citations

full rationale

Closed-case results follow directly from moments, Gaussian algebra, and block inversion (Schur complement) with no fitted parameters or self-citations invoked as justification. Open-case negative result is anchored in external nuclear-reaction-theory results whose authors do not overlap with the present paper, so the central claim does not reduce to any quantity defined inside the paper's own objects or fits. The three tested partitions are presented as representative instances rather than tautological choices that force the negative outcome by construction. No step matches any enumerated circularity pattern.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The paper relies on standard linear-algebra facts (Schur complement, block inversion) and the algebraic properties of Gaussians; no free parameters are introduced, no new entities are postulated, and the open-system construction is imported from existing nuclear-reaction literature rather than invented here.

axioms (2)
  • standard math Block inversion and Schur complement yield the marginal covariance and its inverse for a partitioned Gaussian
    Invoked to equate parameter averaging with marginalization; stated in the abstract as the common operation for both closed cases.
  • domain assumption An eliminated sector with continuous spectrum produces a non-Hermitian effective generator obeying a generalized optical theorem
    Taken from nuclear reaction theory; used to define the missing open case that neural objects are claimed to lack.

pith-pipeline@v0.9.1-grok · 5869 in / 1624 out tokens · 19850 ms · 2026-06-27T17:11:51.741838+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

39 extracted references · 25 canonical work pages

  1. [1]

    Reduced basis emulator for elastic scattering in continuum-discretized coupled- channels calculations.Phys

    Jin Lei. Reduced basis emulator for elastic scattering in continuum-discretized coupled- channels calculations.Phys. Rev. C, 113:044610, 2026. doi: 10.1103/n24x-d9gm. URL https://doi.org/10.1103/n24x-d9gm

  2. [2]

    Bidirectional neural networks for global nucleon-nucleus optical model calculations

    Jin Lei. Bidirectional neural networks for global nucleon-nucleus optical model calculations. arXiv preprint, 2025. URLhttps://arxiv.org/abs/2512.22500. under review, Phys. Rev. C

  3. [3]

    Unified theory of nuclear reactions.Ann

    Herman Feshbach. Unified theory of nuclear reactions.Ann. Phys. (N.Y.), 5:357–390,

  4. [4]

    URLhttps://doi.org/10.1016/0003-4916(58) 90007-1

    doi: 10.1016/0003-4916(58)90007-1. URLhttps://doi.org/10.1016/0003-4916(58) 90007-1

  5. [5]

    A unified theory of nuclear reactions

    Herman Feshbach. A unified theory of nuclear reactions. II.Ann. Phys. (N.Y.), 19:287–313,

  6. [6]

    URLhttps://doi.org/10.1016/0003-4916(62) 90221-X

    doi: 10.1016/0003-4916(62)90221-X. URLhttps://doi.org/10.1016/0003-4916(62) 90221-X

  7. [7]

    R. J. Furnstahl, A. J. Garcia, P. J. Millican, and Xilin Zhang. Efficient emulators for scattering using eigenvector continuation.Phys. Lett. B, 809:135719, 2020. doi: 10.1016/j.physletb.2020. 135719. URLhttps://doi.org/10.1016/j.physletb.2020.135719

  8. [8]

    Drischler, J

    C. Drischler, J. A. Melendez, R. J. Furnstahl, A. J. Garcia, and Xilin Zhang. BUQEYE guide to projection-based emulators in nuclear physics.Front. Phys., 10:1092931, 2023. doi: 10.3389/fphy.2022.1092931. URLhttps://doi.org/10.3389/fphy.2022.1092931

  9. [9]

    Neal.Bayesian Learning for Neural Networks, volume 118 ofLecture Notes in Statistics

    Radford M. Neal.Bayesian Learning for Neural Networks, volume 118 ofLecture Notes in Statistics. Springer, New York, 1996. ISBN 978-0-387-94724-2. doi: 10.1007/ 978-1-4612-0745-0. URLhttps://doi.org/10.1007/978-1-4612-0745-0

  10. [10]

    Roberts, Sho Yaida, and Boris Hanin.The Principles of Deep Learning Theory

    Daniel A. Roberts, Sho Yaida, and Boris Hanin.The Principles of Deep Learning Theory. Cambridge University Press, 2022. URLhttps://arxiv.org/abs/2106.10165

  11. [11]

    Austern and C

    N. Austern and C. M. Vincent. Inclusive breakup reactions.Phys. Rev. C, 23:1847, 1981. doi: 10.1103/PhysRevC.23.1847. URLhttps://doi.org/10.1103/PhysRevC.23.1847

  12. [12]

    Ichimura, N

    M. Ichimura, N. Austern, and C. M. Vincent. Equivalence of post and prior sum rules for inclusive breakup reactions.Phys. Rev. C, 32:431, 1985. doi: 10.1103/PhysRevC.32.431. URL https://doi.org/10.1103/PhysRevC.32.431

  13. [13]

    Exact treatment of continuum couplings in nuclear optical potentials via feshbach theory.arXiv preprint, 2025

    Hao Liu, Jin Lei, and Zhongzhou Ren. Exact treatment of continuum couplings in nuclear optical potentials via feshbach theory.arXiv preprint, 2025. URLhttps://arxiv.org/abs/ 2508.07584. 22

  14. [14]

    Navarro P´ erez and Jin Lei

    R. Navarro P´ erez and Jin Lei. Is the unusual near-threshold potential behavior in elastic scattering of weakly-bound nuclei a precision error?Phys. Lett. B, 795:200, 2019. doi: 10. 1016/j.physletb.2019.06.005. URLhttps://doi.org/10.1016/j.physletb.2019.06.005

  15. [15]

    Neural tangent kernel: Convergence and generalization in neural networks

    Arthur Jacot, Franck Gabriel, and Cl´ ement Hongler. Neural tangent kernel: Convergence and generalization in neural networks. InAdvances in Neural Information Processing Systems, volume 31, 2018. URLhttps://arxiv.org/abs/1806.07572

  16. [16]

    Ekstr¨ om and G

    A. Ekstr¨ om and G. Hagen. Global sensitivity analysis of bulk properties of an atomic nucleus. Phys. Rev. Lett., 123:252501, 2019. doi: 10.1103/PhysRevLett.123.252501. URLhttps: //doi.org/10.1103/PhysRevLett.123.252501

  17. [17]

    Frame, R

    D. Frame, R. He, I. Ipsen, D. Lee, D. Lee, and E. Rrapaj. Eigenvector continuation with subspace learning.Phys. Rev. Lett., 121:032501, 2018. doi: 10.1103/PhysRevLett.121.032501. URLhttps://doi.org/10.1103/PhysRevLett.121.032501

  18. [18]

    Direct boundary matching: A bound-state technique for nuclear scattering with lagrange-legendre functions.Phys

    Jin Lei. Direct boundary matching: A bound-state technique for nuclear scattering with lagrange-legendre functions.Phys. Rev. C, 113:024614, 2026. doi: 10.1103/ddcx-cslb. URL https://doi.org/10.1103/ddcx-cslb

  19. [19]

    COLOSS: Complex-scaled optical and coulomb scattering solver.Comput

    Junzhe Liu, Jin Lei, and Zhongzhou Ren. COLOSS: Complex-scaled optical and coulomb scattering solver.Comput. Phys. Commun., 311:109568, 2025. doi: 10.1016/j.cpc.2025.109568. URLhttps://doi.org/10.1016/j.cpc.2025.109568

  20. [20]

    On the Generators of Quantum Dynamical Semigroups.Communications in Mathematical Physics 1976,48, 119–130

    G. Lindblad. On the generators of quantum dynamical semigroups.Commun. Math. Phys., 48:119–130, 1976. doi: 10.1007/BF01608499. URLhttps://doi.org/10.1007/BF01608499

  21. [21]

    A baseline for detecting misclassified and out-of- distribution examples in neural networks

    Dan Hendrycks and Kevin Gimpel. A baseline for detecting misclassified and out-of- distribution examples in neural networks. InInternational Conference on Learning Repre- sentations (ICLR), 2017. URLhttps://arxiv.org/abs/1610.02136

  22. [22]

    Selective classification for deep neural networks

    Yonatan Geifman and Ran El-Yaniv. Selective classification for deep neural networks. In Advances in Neural Information Processing Systems (NeurIPS), 2017. URLhttps://arxiv. org/abs/1705.08500

  23. [23]

    (2015, April 1)

    Naftali Tishby and Noga Zaslavsky. Deep learning and the information bottleneck principle. In2015 IEEE Information Theory Workshop (ITW), 2015. doi: 10.1109/ITW.2015.7133169. URLhttps://doi.org/10.1109/ITW.2015.7133169

  24. [24]

    Pritzel, and C

    Balaji Lakshminarayanan, A. Pritzel, and C. Blundell. Simple and scalable predictive uncer- tainty estimation using deep ensembles. InAdvances in Neural Information Processing Systems (NeurIPS), 2017. URLhttps://arxiv.org/abs/1612.01474

  25. [25]

    Ian Davidson, Michael Livanos, Antoine Gourru, Peter Walker, Julien Velcin, and S

    Sebastian Farquhar, Jannik Kossen, Lorenz Kuhn, and Yarin Gal. Detecting halluci- nations in large language models using semantic entropy.Nature, 630:625, 2024. doi: 10.1038/s41586-024-07421-0. URLhttps://doi.org/10.1038/s41586-024-07421-0

  26. [26]

    Udagawa and T

    T. Udagawa and T. Tamura. Derivation of breakup-fusion cross sections from the optical theorem.Phys. Rev. C, 24:1348, 1981. doi: 10.1103/PhysRevC.24.1348. URLhttps://doi. org/10.1103/PhysRevC.24.1348. 23

  27. [27]

    S. R. Cotanch. Coupled channels optical theorem and non-elastic cross section sum rule.Nucl. Phys. A, 842:48–58, 2010. doi: 10.1016/j.nuclphysa.2010.04.011. URLhttps://doi.org/10. 1016/j.nuclphysa.2010.04.011

  28. [28]

    Coherent absorption dynamics: The dual role of off- diagonal couplings in weakly bound nuclei.Phys

    Hao Liu, Jin Lei, and Zhongzhou Ren. Coherent absorption dynamics: The dual role of off- diagonal couplings in weakly bound nuclei.Phys. Rev. C, 113:054601, 2026. doi: 10.1103/ bgwc-x5wj. URLhttps://doi.org/10.1103/bgwc-x5wj

  29. [29]

    Channel couplings redirect absorbed flux from periph- eral loss to fusion in weakly bound nuclear reactions.Phys

    Hao Liu, Jin Lei, and Zhongzhou Ren. Channel couplings redirect absorbed flux from periph- eral loss to fusion in weakly bound nuclear reactions.Phys. Lett. B, 877:140479, 2026. doi: 10. 1016/j.physletb.2026.140479. URLhttps://doi.org/10.1016/j.physletb.2026.140479

  30. [30]

    Dynamical origin of spectroscopic quenching in knockout reactions.arXiv preprint,

    Jin Lei. Dynamical origin of spectroscopic quenching in knockout reactions.arXiv preprint,

  31. [31]

    submitted to Phys

    URLhttps://arxiv.org/abs/2602.12690. submitted to Phys. Rev. Lett

  32. [32]

    Perey and B

    F. Perey and B. Buck. A non-local potential model for the scattering of neutrons by nuclei. Nucl. Phys., 32:353–380, 1962. doi: 10.1016/0029-5582(62)90345-0. URLhttps://doi.org/ 10.1016/0029-5582(62)90345-0

  33. [33]

    Mahaux and R

    C. Mahaux and R. Sartor. Single-particle motion in nuclei. In J. W. Negele and E. Vogt, editors,Advances in Nuclear Physics, volume 20, pages 1–223. Springer, Boston, MA, 1991. doi: 10.1007/978-1-4613-9910-0 1. URLhttps://doi.org/10.1007/978-1-4613-9910-0_1

  34. [34]

    Exact construction and uniqueness of the coupled- channel green’s function.arXiv preprint, 2026

    Hao Liu, Jin Lei, and Zhongzhou Ren. Exact construction and uniqueness of the coupled- channel green’s function.arXiv preprint, 2026. URLhttps://arxiv.org/abs/2604.00471

  35. [35]

    Assessing continuum channel importance in continuum-discretized coupled-channels via dynamic polarization potential decomposition

    Jin Lei and Hao Liu. Assessing continuum channel importance in continuum-discretized coupled-channels via dynamic polarization potential decomposition. submitted to Phys. Rev. C (Letter), manuscript CRR1074, 2026

  36. [36]

    W. D. Heiss. The physics of exceptional points.J. Phys. A: Math. Theor., 45:444016, 2012. doi: 10.1088/1751-8113/45/44/444016. URLhttps://doi.org/10.1088/1751-8113/45/44/ 444016

  37. [37]

    Bender and Stefan Boettcher

    Carl M. Bender and Stefan Boettcher. Real spectra in non-hermitian hamiltonians having PT symmetry.Phys. Rev. Lett., 80:5243–5246, 1998. doi: 10.1103/PhysRevLett.80.5243. URL https://doi.org/10.1103/PhysRevLett.80.5243

  38. [38]

    J. A. Melendez, C. Drischler, R. J. Furnstahl, A. J. Garcia, and X. Zhang. Model reduction methods for nuclear emulators.J. Phys. G: Nucl. Part. Phys., 49:102001, 2022. doi: 10.1088/ 1361-6471/ac83dd. URLhttps://doi.org/10.1088/1361-6471/ac83dd

  39. [39]

    export target

    Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. InAdvances in Neural Infor- mation Processing Systems, volume 30, 2017. URLhttps://arxiv.org/abs/1706.03762. 24 Neural network ensemble Nuclear reaction theory Status Closed, statistical correspondence (b...