pith. sign in

arxiv: 2605.14783 · v1 · pith:5Z6LCVODnew · submitted 2026-05-14 · ✦ hep-ph

AI-Driven Discovery of Information-Efficient Collider Observables for Interference Measurements

Pith reviewed 2026-06-30 20:25 UTC · model grok-4.3

classification ✦ hep-ph
keywords optimal observablessymbolic regressionFisher informationCP violationHiggs bosoncollider observablesinterference measurementsmatrix element reweighting
0
0 comments X

The pith

Symbolic evolution guided by matrix-element scores discovers compact analytic observables that capture substantially more local Fisher information about CP-violating interactions than standard angular baselines.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to establish that AI-driven symbolic evolution, using scores from matrix-element reweighting as the target, can identify compact analytic event-level observables for interference measurements. These observables are tested on the CP-sensitive HZ_{\mu\nu}\tilde Z^{\mu\nu} coupling in two channels: associated production e^+e^- \to Z(\to \mu^-\mu^+)H and the decay pp \to H \to ZZ^* \to e^-e^+\mu^-\mu^+. A sympathetic reader would care because the expressions remain simple enough for direct use while extracting more statistical information than conventional angular variables. The work shows that the learned forms recover helicity-interference harmonics, supplemented in one channel by laboratory-frame mappings and in the other by an angular kernel with a mass-ratio prefactor.

Core claim

By applying symbolic evolution to matrix-element reweighting scores, the method identifies observables for the HZ_{\mu\nu}\tilde Z^{\mu\nu} interaction that retain substantially more local Fisher information than angular baselines while remaining compact analytic functions in both associated production and four-lepton decay channels.

What carries the argument

The symbolic evolution procedure that treats the matrix-element score from reweighting as the statistical fitness target to evolve compact analytic expressions.

If this is right

  • In associated production the expressions supplement helicity-interference harmonics with laboratory-frame asymmetry mappings.
  • In four-lepton decay the angular kernel is the robust component while the mass-ratio factor acts as a bounded prefactor.
  • Optimal-observable design is recast as a symbolic discovery problem that yields transparent, interpretable probes.
  • The approach supplies a route to information-efficient observables for collider interference studies.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same guided symbolic search could be applied to other couplings where closed-form optimal observables are unavailable.
  • One could test retention of the information advantage by injecting the discovered expressions into existing analysis frameworks with realistic smearing.
  • The recovered harmonic structures might suggest new ways to classify interference terms across different processes.

Load-bearing premise

The symbolic evolution procedure guided only by the matrix-element score will converge to expressions that remain information-efficient once detector resolution, backgrounds, and higher-order corrections are included.

What would settle it

A direct numerical comparison, in full detector-level simulation including backgrounds and higher-order corrections, of the local Fisher information obtained from the learned observables versus standard angular baselines; reversal of the reported advantage would falsify the central claim.

Figures

Figures reproduced from arXiv: 2605.14783 by Jiahui Lin, Yandong Liu.

Figure 1
Figure 1. Figure 1: FIG. 1. Schematic overview of the AI-driven symbolic-observable construction. A local score target derived from matrix [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2. Laboratory-frame energy asymmetry [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: shows this improvement directly: compared with O2 = sin(2ϕ ∗ ), the learned observable fext follows the score much more monotonically [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4. Standard Model and signed interference distributions for [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5. Standard Model and signed interference distributions [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗
read the original abstract

Optimal observables provide statistically powerful probes of small deformations from a reference theory, but in realistic collider measurements they are rarely available in compact analytic form. We show that interpretable event-level observables can be discovered by AI-driven symbolic evolution using score information from matrix-element reweighting as the statistical target. Focusing on the CP-sensitive interaction $HZ_{\mu\nu}\tilde Z^{\mu\nu}$, we study two complementary realizations of the same coupling structure: associated production $e^+e^-\to Z(\to \mu^-\mu^+)H$ and the decay channel $pp\to H\to ZZ^*\to e^-e^+\mu^-\mu^+$. The learned observables retain substantially more local Fisher information than standard angular baselines while remaining compact analytic functions. In both cases, the discovered expressions recover characteristic helicity-interference harmonics. In associated production these harmonics are supplemented by laboratory-frame asymmetry mappings, while in four-lepton decay the robust component is the angular kernel, with the mass-ratio factor serving as a bounded representative prefactor. These results recast optimal-observable design as a symbolic discovery problem and provide a transparent route to information-efficient, interpretable probes of collider interference.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper claims that AI-driven symbolic evolution, guided by matrix-element reweighting scores for the CP-sensitive HZ_{\mu\nu}\tilde Z^{\mu\nu} coupling, can discover compact analytic observables for e^+e^- \to ZH(\to \mu\mu) and H \to 4\ell that retain substantially more local Fisher information than standard angular baselines, recover helicity-interference harmonics, and remain interpretable; the method is demonstrated in associated production (with lab-frame asymmetries) and four-lepton decay (with angular kernels plus bounded mass-ratio prefactors).

Significance. If the information gains hold under realistic conditions, the work recasts optimal-observable construction as an automated symbolic search problem and supplies a transparent route to compact, high-information probes of interference effects; the explicit recovery of known harmonics plus supplementary structures is a concrete strength.

major comments (2)
  1. [Results (Fisher information comparisons)] The central claim that the discovered observables are substantially more information-efficient for collider measurements rests on Fisher comparisons performed exclusively at parton/truth level (see results sections on associated production and four-lepton decay). No folding with detector resolution, acceptance, backgrounds, or NLO corrections is described, so it remains untested whether the reported gains survive the conditions under which the observables would actually be used.
  2. [Methods (symbolic evolution procedure)] The symbolic evolution is optimized directly against the matrix-element score; while this avoids circularity in the reported numbers, the manuscript does not demonstrate that the resulting expressions remain optimal or even stable once the score is replaced by a realistic, detector-level likelihood or when higher-order corrections alter the interference structure.
minor comments (2)
  1. [Associated production results] Notation for the laboratory-frame mappings in the e^+e^- channel could be clarified with an explicit definition of the coordinate system used for the asymmetry terms.
  2. [Summary of discovered observables] The abstract states that the observables 'remain compact analytic functions,' but the manuscript would benefit from a short table listing the exact discovered expressions alongside their Fisher values for direct comparison with the angular baselines.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive feedback on the manuscript. We address the two major comments point by point below.

read point-by-point responses
  1. Referee: [Results (Fisher information comparisons)] The central claim that the discovered observables are substantially more information-efficient for collider measurements rests on Fisher comparisons performed exclusively at parton/truth level (see results sections on associated production and four-lepton decay). No folding with detector resolution, acceptance, backgrounds, or NLO corrections is described, so it remains untested whether the reported gains survive the conditions under which the observables would actually be used.

    Authors: We agree that the Fisher information comparisons are performed exclusively at parton level. This is a deliberate scope choice to isolate the information efficiency of the symbolically discovered observables using the matrix-element score as target, without confounding factors from experimental effects. The manuscript's core contribution is the automated discovery of compact analytic forms that outperform angular baselines at this level and recover known interference structures. We acknowledge that survival of the gains under detector resolution, acceptance, backgrounds, and NLO corrections remains untested. In the revised version we will add a paragraph in the Conclusions explicitly stating this limitation and identifying full detector-level validation as an important direction for follow-up work. revision: partial

  2. Referee: [Methods (symbolic evolution procedure)] The symbolic evolution is optimized directly against the matrix-element score; while this avoids circularity in the reported numbers, the manuscript does not demonstrate that the resulting expressions remain optimal or even stable once the score is replaced by a realistic, detector-level likelihood or when higher-order corrections alter the interference structure.

    Authors: The direct optimization against the matrix-element reweighting score is the methodological point of the work: it supplies an unambiguous, circularity-free target for information efficiency at truth level. The resulting expressions are therefore optimal by construction within that well-defined setting. Demonstrating stability or re-optimality under a detector-level likelihood or NLO-altered interference would require an entirely separate campaign of full simulation and re-running the symbolic search, which lies outside the present scope. The paper presents a proof-of-principle that the symbolic-evolution approach can recover high-information, interpretable observables; we do not claim broader robustness at this stage. revision: no

Circularity Check

0 steps flagged

No circularity in derivation chain; symbolic discovery uses external score target with independent Fisher evaluation

full rationale

The paper's core procedure applies symbolic evolution to discover compact observables guided by an external matrix-element reweighting score for the CP parameter, then performs separate Fisher information comparisons against angular baselines at parton level. This does not reduce the reported information gains to a fitted parameter inside the evolved expressions, a self-citation chain, or a self-definitional loop; the optimization target and the subsequent Fisher metric are distinct, and no load-bearing uniqueness theorem or ansatz is imported from prior author work. The derivation remains self-contained against the stated external benchmark.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach rests on the assumption that matrix-element reweighting supplies an accurate local score for Fisher information and that the symbolic search space is rich enough to contain near-optimal compact expressions.

axioms (1)
  • domain assumption Matrix-element reweighting yields a reliable statistical target for local Fisher information
    Used as the fitness function for symbolic evolution

pith-pipeline@v0.9.1-grok · 5734 in / 1120 out tokens · 21202 ms · 2026-06-30T20:25:25.820106+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

50 extracted references · 33 canonical work pages · 17 internal anchors

  1. [1]

    1.9×10 −2 CP-sensitive angular interference kernel; smooth bounded representative prefactor 1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00 cos 0.8 0.6 0.4 0.2 0.0 0.2 0.4 0.6 0.8 AE = (E + E )/(E + + E ) SM 1.00 0.75 0.50 0.25 0.00 0.25 0.50 0.75 1.00 cos AE = (E + E )/(E + + E ) Int 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 FIG. 2. Laboratory-frame e...

  2. [2]

    The angle and transverse-axis conventions for (θ∗ i , ϕ∗ i ) are defined in Appendix C

    sin(ϕ∗ 1 +ϕ ∗ 2),(9) wherer Z ≡m Z2 /(mZ1 +m Z2), with the fixed-flavor conventionZ 1 ≡e −e+ andZ 2 ≡µ −µ+. The angle and transverse-axis conventions for (θ∗ i , ϕ∗ i ) are defined in Appendix C. Again, the angular dependence reconstructs the expected CP-sensitive interference harmonic, now appropriate to theH→ZZ ∗ →4ℓdecay geometry [29– 31, 34, 38–40]. W...

  3. [3]

    The accompanying invariant-mass factorr Z(1−r Z) is a smooth, bounded representative prefactor selected by the symbolic search

    sin(2θ∗ 2) isolates the relevant longitudinal–transverse interference. The accompanying invariant-mass factorr Z(1−r Z) is a smooth, bounded representative prefactor selected by the symbolic search. Direct ablations indicate that the Fisher efficiency is dominated by the angular kernel, so the physical interpretation below focuses on the CP- sensitive ang...

  4. [4]

    carries only a tiny fraction of the available information, withϵ≃ 2.3×10 −4, whereas the symbolic observable reaches 1.0 0.5 0.0 0.5 1.0 O1 0.0 0.2 0.4 0.6 0.8 1.0 1.2(1/ SM) d SM/dOi O1 = sin( 1 + 2 ) 0.2 0.1 0.0 0.1 0.2 O4 0 2 4 6 8 10 12 14(1/ SM) d SM/dOi O4 1.0 0.5 0.0 0.5 1.0 O1 2 1 0 1 2 (1/| int)| d int/dOi O1 = sin( 1 + 2 ) 0.2 0.1 0.0 0.1 0.2 O4...

  5. [5]

    The symbolic observable separates opposite- sign interference regions much more clearly than the angular baseline

    andO 4ℓ inpp→H→ZZ ∗ → e−e+µ−µ+. The symbolic observable separates opposite- sign interference regions much more clearly than the angular baseline. The Fisher-information efficiency improves from ϵ≃2.3×10 −4 toϵ≃1.9×10 −2. ϵ≃1.9×10 −2. Figure 5 shows this improvement directly: the learned observable reorganizes the events so that opposite-sign interference...

  6. [6]

    Davier, L

    M. Davier, L. Duflot, F. Le Diberder, and A. Rouge, Phys. Lett. B306, 411 (1993)

  7. [7]

    Diehl and O

    M. Diehl and O. Nachtmann, Z. Phys. C62, 397 (1994)

  8. [8]

    Brehmer, F

    J. Brehmer, F. Kling, I. Espejo, and K. Cranmer, Comput. Softw. Big Sci.4, 3 (2020), 1907.10621

  9. [9]

    Guest, K

    D. Guest, K. Cranmer, and D. Whiteson, Annual Review of Nuclear and Particle Science68, 161 (2018), ISSN 1545-4134, URLhttp://dx.doi.org/10.1146/ annurev-nucl-101917-021019

  10. [10]

    X. C. Cid Vidal, L. D. Dieste Maro˜ nas, and ´A. D. Dosil Su´ arez, Applied Sciences11, 11076 (2021), ISSN 2076-3417, URLhttp://dx.doi.org/10.3390/ app112211076

  11. [11]

    A. J. Larkoski, I. Moult, and B. Nachman, Phys. Rept. 841, 1 (2020), 1709.04464

  12. [12]

    Radovic, M

    A. Radovic, M. Williams, D. Rousseau, M. Kagan, D. Bonacorsi, A. Himmel, A. Aurisano, K. Terao, and T. Wongjirad, Nature560, 41 (2018)

  13. [13]

    J. H. Collins, K. Howe, and B. Nachman, Phys. Rev. D 99, 014038 (2019), 1902.02634

  14. [14]

    J. Ren, L. Wu, J. M. Yang, and J. Zhao, Nucl. Phys. B 943, 114613 (2019), 1708.06615

  15. [15]

    Abdughani, D

    M. Abdughani, D. Wang, L. Wu, J. M. Yang, and J. Zhao, Phys. Rev. D104, 056003 (2021), 2005.11086

  16. [16]

    J. Ren, D. Wang, L. Wu, J. M. Yang, and M. Zhang, JHEP11, 138 (2021), 2106.07018

  17. [17]

    C.-T. Lu, H. Lv, W. Shen, L. Wu, and J. Zhang, JHEP 08, 187 (2023), 2304.03237

  18. [18]

    S. S. Cruz, M. Kolosova, C. Ram´ on ´Alvarez, G. Petrucciani, and P. Vischia, Phys. Rev. D110, 096023 (2024), 2405.13524

  19. [19]

    Barru´ e, P

    R. Barru´ e, P. Conde-Mu´ ı˜ no, V. Dao, and R. Santos, JHEP04, 014 (2024), 2308.02882

  20. [20]

    Badger et al., SciPost Phys.14, 079 (2023), 2203.07460

    S. Badger et al., SciPost Phys.14, 079 (2023), 2203.07460

  21. [21]

    Butter, T

    A. Butter, T. Plehn, N. Soybelman, and J. Brehmer, SciPost Phys.16, 037 (2024), 2109.10414. 6

  22. [22]

    H. Bahl, E. Fuchs, M. Menen, and T. Plehn, SciPost Phys.20, 040 (2026), 2507.05858

  23. [23]

    Pawan Kumar, Emilien Dupont, Francisco J

    B. Romera-Paredes, M. Barekatain, A. Novikov, M. Balog, M. P. Kumar, E. Dupont, F. J. R. Ruiz, J. S. Ellenberg, P. Wang, O. Fawzi, et al., Nature625(2024), URLhttps://doi.org/10.1038/s41586-023-06924-6

  24. [24]

    AlphaEvolve: A coding agent for scientific and algorithmic discovery

    A. Novikov, N. V˜ u, M. Eisenberger, E. Dupont, P.-S. Huang, A. Z. Wagner, S. Shirobokov, B. Kozlovskii, F. J. R. Ruiz, A. Mehrabian, et al. (2025), 2506.13131, URLhttps://arxiv.org/abs/2506.13131

  25. [25]

    Song, T.-Z

    Z.-Y. Song, T.-Z. Yang, Q.-H. Cao, M.-x. Luo, and H. X. Zhu (2025), 2502.09544

  26. [26]

    Cao, Z.-Y

    Q.-H. Cao, Z.-Y. Hou, Y.-Y. Li, X. Liu, Z.-Y. Song, L.-Q. Zhang, S. Zhang, and K. Zhao (2025), 2505.06347

  27. [27]

    Automation of the matrix element reweighting method

    P. Artoisenet, V. Lemaitre, F. Maltoni, and O. Mattelaer, JHEP12, 068 (2010), 1007.3300

  28. [28]

    Cranmer, J

    K. Cranmer, J. Brehmer, and G. Louppe, Proc. Nat. Acad. Sci.117, 30055 (2020), 1911.01429

  29. [29]

    G. J. Gounaris, F. M. Renard, and N. D. Vlachos, Nucl. Phys. B459, 51 (1996)

  30. [30]

    Deconstructing Angular Correlations in ZH, ZZ, and WW Production at LEP2

    G. Mahlon and S. J. Parke, Phys. Rev. D58, 054015 (1998), hep-ph/9803410

  31. [31]

    Anomalous Higgs couplings in angular asymmetries of H --> Zl+l- and e+e- --> HZ

    M. Beneke, D. Boito, and Y.-M. Wang, JHEP11, 028 (2014), 1406.1361

  32. [32]

    Beyond Higgs Couplings: Probing the Higgs with Angular Observables at Future $e^+ e^-$ Colliders

    N. Craig, J. Gu, Z. Liu, and K. Wang, JHEP03, 050 (2016), 1512.06877

  33. [33]

    Q. Sha, A. Fadol, F. Guo, G. Li, Y. Fang, J. Gu, and X. Lou, Eur. Phys. J. C82, 981 (2022), 2203.11707

  34. [34]

    Y. Gao, A. V. Gritsan, Z. Guo, K. Melnikov, M. Schulze, and N. V. Tran, Phys. Rev. D81, 075022 (2010), 1001.3396

  35. [35]

    On the spin and parity of a single-produced resonance at the LHC

    S. Bolognesi, Y. Gao, A. V. Gritsan, K. Melnikov, M. Schulze, N. V. Tran, and A. Whitbeck, Phys. Rev. D86, 095031 (2012), 1208.4018

  36. [36]

    Constraining anomalous HVV interactions at proton and lepton colliders

    I. Anderson, S. Bolognesi, F. Caola, Y. Gao, A. V. Gritsan, C. B. Martin, K. Melnikov, M. Schulze, N. V. Tran, A. Whitbeck, et al., Phys. Rev. D89, 035007 (2014), 1309.4819

  37. [37]

    R. M. Godbole, D. J. Miller, and M. M. Muhlleitner, JHEP12, 031 (2007), 0708.0458

  38. [38]

    Extracting Effective Higgs Couplings in the Golden Channel

    Y. Chen and R. Vega-Morales, JHEP04, 057 (2014), 1310.2893

  39. [39]

    Y. Chen, E. Di Marco, J. Lykken, M. Spiropulu, R. Vega- Morales, and S. Xie, JHEP01, 125 (2015), 1401.2077

  40. [40]

    Atwood and A

    D. Atwood and A. Soni, Phys. Rev. D45, 2405 (1992)

  41. [41]

    A. A. Babich, P. Osland, A. A. Pankov, and N. Paver, Phys. Lett. B476, 95 (2000), hep-ph/9910403

  42. [42]

    Bhattacharya, A

    S. Bhattacharya, A. Subba, and A. Sarkar, Phys. Rev. D 112, 115018 (2025), 2508.08893

  43. [43]

    S. Y. Choi, D. J. Miller, M. M. M¨ uhlleitner, and P. M. Zerwas, Phys. Lett. B553, 61 (2003), hep-ph/0210077

  44. [44]

    Angular asymmetries as a probe for anomalous contributions to HZZ vertex at the LHC

    N. Belyaev, R. Konoplich, L. E. Pedersen, and K. Prokofiev, Phys. Rev. D91, 115014 (2015), 1502.03045

  45. [45]

    Precision studies of the Higgs boson decay channel H -> ZZ -> 4l with MEKD

    P. Avery et al., Phys. Rev. D87, 055006 (2013), 1210.0896. 7 Appendix A: Score Estimation and Fisher-Information Projection This appendix is organized around the points needed to reproduce and interpret the main-text results. Sec. A defines the score and projected Fisher-information estimator. Sec. B collects the event samples, benchmark efficiencies, and...

  46. [46]

    baseline 2.3×10 −4 4ℓsymbolicO 4ℓ 1.9×10 −2 The two symbolicZHrepresentatives are fmin ∼ sin2 θZ sin2 θ∗ sin(2ϕ∗)q 0.5 + sin2 θZ sin2 θ∗ 2 (1.5 + cos 2ϕ∗) ,(B1) fext = 0.706 sin(2ϕ∗) sin2 θ∗ sinθ Z + sinϕ ∗ sinθ ∗ cosθ Z(2AE +A pT ) (pZ T )2 (pZ T )2 +p 2 0 .(B2) For the four-lepton channel, O4ℓ =r Z(1−r Z) sin(2θ∗

  47. [47]

    sin(ϕ∗ 1 +ϕ ∗ 2).(B3) All kinematic variables entering these expressions are defined in Sec. C. The numerical coefficients inf ext, including the relative coefficient multiplyingA E and the scalep 0 = 46 GeV, are selected by the search. Appendix C: Kinematic Conventions a. Associated production.Fore +e− →Z(→ℓ −ℓ+)H, all angles are defined in the laborator...

  48. [48]

    sin(ϕ∗ 1 +ϕ ∗ 2),(D12) with the angle convention of Sec. C. This is the angular kernel appearing inO 4ℓ in the main text; the role of the additional mass-ratio envelope is tested by the ablations in Sec. E. 11 Appendix E: Symbolic Search and Robustness Checks The symbolic search evolves compact analytic expressions of the available event-level variables. ...

  49. [49]

    In the selected compact representative the prefactor is the symmetric mass-ratio enveloper Z(1−r Z)

    and closely related polar-angle structures, while the smooth kinematic prefactor is less unique. In the selected compact representative the prefactor is the symmetric mass-ratio enveloper Z(1−r Z). Direct ablations confirm this interpretation. With the nominal 30-bin estimator,r Z(1−r Z) alone givesϵ≃3.4×10 −5, while the angular kernel sin(2θ∗

  50. [50]

    sin(ϕ∗ 1 +ϕ ∗ 2) givesϵ≃1.9×10 −2, essentially the same as the full displayed observable. This supports the interpretation that the stable output of the search is the helicity-interference angular pattern rather than a unique algebraic representative or a uniquely informative mass envelope. Appendix F: Asymmetry Selection in the F our-Lepton Channel For t...