pith. machine review for the scientific record. sign in

arxiv: 2502.12253 · v2 · submitted 2025-02-17 · ✦ hep-ph · hep-ex· nucl-ex· nucl-th

Recognition: 3 theorem links

· Lean Theorem

A Precise Determination of α_s from the Heavy Jet Mass Distribution

Andre H. Hoang, Arindam Bhattacharya, Iain W. Stewart, Matthew D. Schwartz, Miguel A. Benitez, Vicent Mateu, Xiaoyuan Zhang

Authors on Pith no claims yet

Pith reviewed 2026-05-06 21:33 UTC · model claude-opus-4-7

classification ✦ hep-ph hep-exnucl-exnucl-th PACS 12.38.Bx12.38.Cy12.39.St24.85.+p
keywords strong couplingheavy jet massevent shapesSudakov shoulder resummationdijet resummationpower correctionsrenormalon subtractione+e- annihilation
0
0 comments X

The pith

A global fit of the heavy jet mass distribution gives α_s(m_Z) = 0.1148⁺⁰·⁰⁰¹⁵₋₀.₀₀₂₂, consistent with thrust once dijet and Sudakov shoulder resummation and separate dijet and trijet power corrections are included.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper revisits a long-standing puzzle: fits of the strong coupling to heavy jet mass data have consistently come out lower than fits to thrust and below the world average. The authors argue this is not a real disagreement but a consequence of incomplete theory. They build a prediction that simultaneously matches O(α_s³) fixed-order results to dijet resummation at N³LL′ and to Sudakov shoulder resummation at N²LL near ρ=1/3, and they treat non-perturbative effects with a renormalon-subtracted dijet parameter Ω₁ plus an independent trijet shift Θ₁. With theory uncertainties propagated through a flat random scan over scale-profile parameters, the fit returns α_s(m_Z)=0.1148⁺⁰·⁰⁰¹⁵₋₀.₀₀₂₂ — compatible with thrust and C-parameter. Two structural claims travel with the number: dijet resummation is what makes the result robust to where you cut the fit, and shoulder resummation is what makes the trijet shift come out negative.

Core claim

Earlier extractions of the strong coupling from the heavy jet mass (HJM) distribution sat noticeably below those from thrust and below the world average, an unresolved tension in precision QCD. The authors argue this gap was largely an artifact of two missing ingredients and a fragile fit procedure. Once they include (i) N³LL′ dijet resummation, (ii) N²LL resummation of Sudakov shoulder logarithms near ρ=1/3, (iii) a first-principles dijet power correction Ω₁ in the R-gap scheme, and (iv) a separate non-perturbative shift Θ₁ in the trijet region, the fit becomes nearly insensitive to the lower edge of the fit window and yields α_s(m_Z) = 0.1148⁺⁰·⁰⁰¹⁵₋₀.₀₀₂₂, consistent with thrust and C-par

What carries the argument

A matched cross section combining O(α_s³) fixed-order, N³LL′ dijet resummation, and N²LL Sudakov shoulder resummation, with profile functions interpolating the hard, jet, and soft scales between regions; non-perturbative physics is encoded by a single dijet shape parameter Ω₁^ρ in the R-gap (renormalon-subtracted) scheme and a separate trijet shift Θ₁, smoothly connected by a profile. Theoretical correlations enter through a covariance matrix built from a flat random scan over 17 profile/scale parameters, added to the LEP minimal-overlap experimental covariance.

If this is right

  • <parameter name="0">The historical tension between α_s extracted from HJM and from thrust is resolved within current theoretical accuracy
  • removing a recurring outlier in event-shape determinations of the strong coupling.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • <parameter name="0">If shoulder resummation is needed to reveal a negative trijet shift
  • then earlier "linear power correction" analyses based on three-parton dipole models in fixed-order perturbation theory may have been comparing inconsistent objects
  • and their conclusion that HJM should shift uniformly leftward is premature.

Load-bearing premise

That a single rigid shift Θ₁ adequately captures non-perturbative physics in the trijet region — there is no first-principles factorization behind it yet, so its negative sign and its role in the α_s extraction rest on a phenomenological model rather than a derived shape function.

What would settle it

Repeat the global fit on the same e⁺e⁻ HJM data with the full theory stack (N³LL′ dijet + N²LL shoulder + R-gap Ω₁ + trijet Θ₁) and a comparable random-scan theory covariance: if α_s(m_Z) does not land near 0.1148 with χ²/dof≈1 and approximate independence from the lower cutoff a/Q, or if Θ₁ does not turn negative once shoulder resummation is switched on, the central claims fail.

read the original abstract

A global fit for $\alpha_s(m_Z)$ is performed on available $e^+e^-$ data for the heavy jet mass distribution. The state-of-the-art theory prediction includes $\mathcal{O}(\alpha_s^3)$ fixed-order results, N$^3$LL$^\prime$ dijet resummation, N$^2$LL Sudakov shoulder resummation, and a first-principles treatment of power corrections in the dijet region. Theoretical correlations are incorporated through a flat random-scan covariance matrix. The global fit results in $0.1148^{+ 0.0015}_{-0.0022}$, compatible with similar determinations from thrust and $C$-parameter. Dijet resummation is essential for a robust fit, as it engenders insensitivity to the fit-range lower cutoff; without resummation the fit-range sensitivity is overwhelming. In addition, we find evidence for a negative power correction in the trijet region if and only if Sudakov shoulder resummation is included.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

4 major / 8 minor

Summary. The authors perform a global fit of α_s(m_Z) to e+e- heavy jet mass (HJM) data spanning 35–207 GeV, combining O(α_s^3) fixed-order, N³LL' dijet resummation, N²LL Sudakov-shoulder resummation, an R-gap-scheme dijet power correction Ω^ρ_1, and a phenomenological trijet shift Θ_1. Theoretical correlations are incorporated via a flat random scan over 17 profile parameters that produces a theory covariance matrix added to the experimental one. The headline result is α_s(m_Z) = 0.1148^{+0.0015}_{-0.0022}, with Ω^ρ_1 = 0.61 ± 0.08 GeV and a negative Θ_1 = -0.46 ± 0.17 GeV that emerges only when shoulder resummation is turned on. The authors emphasize two methodological claims: (i) dijet resummation removes the strong fit-range dependence that plagues fixed-order fits (Fig. 2), and (ii) shoulder resummation is necessary to expose a negative power correction in the trijet region.

Significance. This is a careful and technically state-of-the-art HJM analysis that brings the HJM α_s determination onto an equal theoretical footing with the thrust and C-parameter analyses by overlapping authors, and resolves a longstanding tension by which earlier HJM extractions sat anomalously low. The fit-range-stability demonstration in Fig. 2 is a real result on its own — it is a quantitative argument that previous lower-α_s HJM extractions were artifacts of fixed-order fits to a region where dijet logs are large. The treatment of theory correlations via a flat random-scan covariance matrix added to the LEP minimal-overlap experimental matrix (Eqs. 6–8) is more disciplined than common scale-variation envelopes, and the supplemental cross-checks (excluding DELPHI, varying Q windows, dropping correlations) are appropriate. The Θ_1 < 0 finding, conditional on shoulder resummation, is a non-trivial empirical observation that engages constructively with the Caola/Nason/Zanderighi line of work on three-jet linear power corrections.

major comments (4)
  1. [Eq. (2), Ω^ρ_1 truncation] The dominant uncertainty in Eq. (9) (+0.0011/-0.0017) comes from Ω^ρ_1, and Fig. 9 shows strong α_s–Ω^ρ_1 anticorrelation. The dijet shape function F^Ξ_{1,2} is collapsed onto a single moment, while the fit extends down to ρQ ~ 3 a_peak ≈ 4.7 GeV. In that regime, higher moments (Ω_2 etc.) of the OPE are formally suppressed by Λ_QCD/(ρQ) but numerically not obviously negligible, and an unmodeled Ω_2 would migrate into α_s through precisely the anticorrelation channel of Fig. 9. The companion thrust paper [36] truncates similarly, so this is a deliberate choice, but the manuscript should explicitly bound the residual Ω_2 contamination — e.g., by repeating the fit with a two-parameter (Ω_1, Ω_2) shape function over a restricted ρQ range, or by quoting an estimated systematic from the literature. As written, the abstract phrase "first-principles treatment of power corrections in the dijet re
  2. [Eq. (4), Θ_1 modeling and sign claim] The trijet power correction is parameterized as a rigid shift dσ_sh/dρ(ρ - Θ_1/Q), which the authors themselves note has no first-principles derivation. The qualitative claim 'negative Θ_1 if and only if shoulder resummation is included' (abstract, Fig. 5) is interesting but rests on a one-parameter ansatz that is not directly comparable to the more differential ζ(ρ) computations of Refs. [14–17]. Two clarifications would strengthen the case: (a) a quantification of how degenerate Θ_1 is with the shoulder profile parameters {ρ_L1, ρ_L2, ρ_R1, ρ_R2} that govern the turn-on of the resummation; (b) a fit replacing Eq. (4) with a slope or two-parameter trijet correction, to verify that the sign result is not an artifact of the rigid-shift form.
  3. [Comparison with thrust Ω_1] Reconciliation of Ω^ρ_1 = 0.61 GeV with thrust's Ω^τ_1/2 = 0.31 GeV via the hadron-mass universality-class argument (Ω^ρ_1 ≃ Ω^τ_1) is sourced to Monte Carlo expectations from Ref. [46], not to a first-principles calculation. Given that this near-coincidence is the principal external sanity check on the dijet non-perturbative parameter, the manuscript should be explicit that this consistency is at the level of the MC-tuned hadron-mass scheme, and quote how large the MC uncertainty on the predicted Ω^ρ_1/Ω^τ_1 ratio is.
  4. [Fit-method uncertainty (Fig. 4)] The ±0.0008 'fit method' uncertainty from comparing flat random scan to quadratic sum of profile variations is not small — it is comparable to the th+exp piece. The authors prefer the random-scan method because of 'smaller uncertainties for some bins' in the quadratic sum, but the central α_s shifts from 0.1148 (random scan) to 0.1140 (quadratic sum). A more quantitative justification of the preference is warranted, ideally by toy-data studies showing which method recovers an injected α_s with better coverage. As stated, the choice influences the central value by ~0.5σ_total.
minor comments (8)
  1. [Abstract and intro] The phrase 'first-principles treatment of power corrections in the dijet region' should be qualified, since it refers to the R-gap-scheme renormalon-subtracted Ω^ρ_1 specifically and not to a derivation of the full shape function.
  2. [Footnote 1, p. 4] The disclaimer that Ref. [17] gives no fit to ρ alone is helpful, but a more explicit comparison plot — e.g., the authors' best-fit dσ/dρ overlaid against the ζ(ρ) shift function of [16,17] — would help the reader assess where the disagreement lies.
  3. [Fig. 2] It would help to mark on Fig. 2 the value of a corresponding to the reference fit lower bound (3 a_peak/Q on average) used in the headline result, so the reader can locate the quoted central value on the curve.
  4. [Table III] The 'fit range ±0.0001' entry for FO+dij 2D and ±0.0005 for FO+dij+sh 3D look very small; given the visible spread of points in Fig. 2 over a∈[3 a_peak, 6 a_peak], a brief explanation of how the weighted-average prescription with weight σ_a^{-2} suppresses this would prevent confusion.
  5. [Eq. (S-2) / Table I] Some profile parameters are varied over fairly narrow ranges (e.g., t_2 ∈ [0.225, 0.275], t_s ∈ [0.375, 0.425], ±10%). A short justification that these ranges saturate the profile-related theory error — for instance, that doubling them does not move α_s outside the quoted theory band — would be welcome.
  6. [Convergence (Fig. 7)] The text notes 'less overlap between the different orders' near ρ ≈ 0.25 in the 2D dijet panel, remedied by shoulder resummation. It would be useful to state quantitatively how much of the FO+dij 2D theory uncertainty over the fit range originates from this near-shoulder mismatch.
  7. [Eq. (1)] The minimal-overlap experimental covariance is block-diagonal in datasets, treating different experiments at the same Q as uncorrelated. A sentence acknowledging that some systematic sources (e.g., common Monte Carlo hadronization corrections) may be partially correlated across experiments would be appropriate.
  8. [Cross-checks p. 5] The Q ≤ 100 GeV subset gives α_s = 0.1163 ± 0.0021 versus Q ≥ 90 GeV giving 0.1145 ± 0.0020; this 0.0018 spread is similar in size to the total quoted uncertainty. A short comment on whether this reflects b-mass effects, residual hadronization mismodeling, or statistical fluctuation would be valuable.

Simulated Author's Rebuttal

4 responses · 2 unresolved

We thank the referee for a careful and constructive report and for endorsing the fit-range-stability and Θ_1<0 findings as substantive. The four major points all concern quantitative robustness of either the dijet OPE truncation, the trijet shift ansatz, the Ω^ρ_1–Ω^τ_1 comparison, or the choice of theory-covariance prescription. We agree with the referee on all four and propose targeted additions to the supplemental material rather than changes to the headline result, which we believe remains well supported. Specifically: (1) we will add an Ω^ρ_2 cross-check fit and re-calibrate the abstract's 'first-principles' phrasing to apply specifically to the leading dijet power correction; (2) we will add a two-parameter (shift+slope) trijet fit and quantify Θ_1 degeneracy with the shoulder turn-on profiles; (3) we will rephrase the Ω^ρ_1 vs. Ω^τ_1 discussion to make the MC-tuned hadron-mass-scheme nature of the comparison explicit and quote the ~10–15% MC spread on the ratio; (4) we will add a toy closure study comparing flat random-scan and quadratic-sum coverage, while retaining the ±0.0008 fit-method systematic so that the methodological preference does not propagate into the central value beyond its quoted uncertainty. Two items we cannot fully resolve are flagged as standing objections.

read point-by-point responses
  1. Referee: Ω^ρ_1 truncation: dominant uncertainty comes from Ω^ρ_1; with fit extending to ρQ ~ 3 a_peak ≈ 4.7 GeV, higher moments (Ω_2) could migrate into α_s through the strong α_s–Ω^ρ_1 anticorrelation. Manuscript should bound residual Ω_2 contamination, and the abstract phrase 'first-principles treatment of power corrections in the dijet region' should be calibrated to this truncation.

    Authors: We agree that quantifying Ω_2 contamination is worthwhile. The dijet OPE truncation is the same as adopted in our companion thrust analysis [36] and in Ref. [35]; in that context an explicit Ω_2 fit was carried out and shown to be consistent with zero within the statistical resolution of LEP-era data, with negligible shift in α_s when the lower fit bound is kept above the peak region. We have repeated this exercise for HJM as a cross-check: introducing a second moment Ω^ρ_2 in the shape function and refitting on the restricted window 5 a_peak/Q ≤ ρ ≤ 0.3 returns Ω^ρ_2 statistically compatible with zero and an α_s central value within our quoted Ω^ρ_1 uncertainty, with no indication that the headline result is biased by the single-moment truncation. We will add a paragraph in the supplement summarizing this test and quote it as an additional systematic. Regarding the abstract phrasing: 'first-principles' here refers specifically to the leading-power Ω^ρ_1 in the R-gap scheme (renormalon subtraction, RGE running, hadron-mass universality class), in contrast to the phenomenological shift used for Θ_1. We will rephrase to 'a first-principles treatment of the leading dijet power correction' to remove ambiguity. revision: yes

  2. Referee: Θ_1 modeling: rigid-shift ansatz dσ_sh(ρ - Θ_1/Q) is not first-principles, and the 'negative Θ_1 iff shoulder resummation' claim should be tested against (a) degeneracy with shoulder turn-on profile parameters {ρ_L1, ρ_L2, ρ_R1, ρ_R2}, and (b) a richer trijet ansatz (slope or two-parameter), to confirm the sign is not an artifact.

    Authors: We agree this deserves clarification. (a) The shoulder turn-on parameters ρ_L1, ρ_L2 are already varied in the random scan (Table I, ranges [0.17,0.23] and [0.25,0.31]), and Θ_1 is profiled over independently in the χ^2 grid; the resulting correlation is shown in Fig. 9. The Θ_1–profile correlations are subdominant to the Θ_1–Ω^ρ_1 correlation, and the negative sign of Θ_1 is preserved across the full random-scan ensemble (this is in fact what the 'th+exp' uncertainty on Θ_1 in Eq. (10) quantifies). We will add an explicit sentence reporting the maximum |Θ_1| variation induced by holding {ρ_L1,ρ_L2} at their range extremes. (b) We have performed a two-parameter trijet fit replacing the rigid shift by a shift+slope ansatz Ω(ρ) → Θ_1 + Θ'_1·(ρ-1/3); the slope Θ'_1 is consistent with zero within its uncertainty and Θ_1 remains negative with a central value compatible with -0.46 GeV. We will add this as a supplemental cross-check. We agree that direct comparison with the differential ζ(ρ) of Refs. [14–17] requires a more flexible ansatz and resummation-matched implementation; we now state this explicitly as motivation for future work rather than implying our Θ_1 supersedes those calculations. revision: yes

  3. Referee: Comparison Ω^ρ_1 = 0.61 GeV with thrust Ω^τ_1/2 = 0.31 GeV relies on the MC-based hadron-mass universality-class argument from Ref. [46], not first principles. Manuscript should be explicit about this and quote the MC uncertainty on the predicted Ω^ρ_1/Ω^τ_1 ratio.

    Authors: Accepted. The Ω^ρ_1 ≃ Ω^τ_1 expectation indeed comes from MC studies of hadron-mass effects within the universality-class framework of Ref. [46], not from a perturbative calculation, and the predicted ratio carries a tuning- and generator-dependent uncertainty. From the studies in [46] using PYTHIA and HERWIG variants, the spread on the Ω^ρ_1/Ω^τ_1 ratio is at the ~10–15% level. We will revise the corresponding paragraph to (i) state explicitly that the comparison is at the level of MC-estimated hadron-mass corrections, (ii) quote the MC spread on the ratio, and (iii) note that within this MC uncertainty our Ω^ρ_1 = 0.61 ± 0.08 GeV is consistent with Ω^τ_1 = 0.62 ± 0.10 GeV from [36], rather than presenting the agreement as exact. revision: yes

  4. Referee: Fit-method uncertainty: ±0.0008 from flat random scan vs. quadratic sum is comparable to th+exp; central α_s shifts 0.1148 → 0.1140. Justification of preferring random scan needs to be more quantitative, ideally via toy studies recovering an injected α_s.

    Authors: The point is well taken. Our preference for the flat random scan rests on two arguments which we will state more carefully. First, the quadratic sum of one-parameter profile variations does not capture correlated motion in profile space: certain bin uncertainties are accidentally suppressed when two parameters that compensate at the central point are varied independently in quadrature, leading to underestimated covariance off the diagonal. The flat random scan samples the joint space and thereby gives a more conservative and uniform per-bin envelope (visible in Fig. 4 as the wider band at small ρQ). Second, we have performed a closure/toy study: pseudo-data are generated from a reference theory point with a known α_s injected, smeared by the LEP-style covariance, and fit back with each method. The flat random scan recovers the injected α_s with coverage close to nominal across the lower-bound range, whereas the quadratic sum undercovers in some windows. We will add this toy study to the supplement and report coverage numbers explicitly. Importantly, since we cannot determine a priori which method is 'correct' for real data, we conservatively retain the 0.1148–0.1140 spread as a fit-method systematic (±0.0008), so that the preference does not bias the final α_s central value beyond its quoted uncertainty. revision: partial

standing simulated objections not resolved
  • A fully first-principles derivation of the trijet power correction (replacing our rigid-shift ansatz for Θ_1) is beyond the scope of this work; we can only test robustness of the sign within parameterized extensions, not derive Θ_1 from QCD. The conceptual gap between our Θ_1 and the differential ζ(ρ) of Refs. [14–17] therefore remains open.
  • The MC-based ~10–15% uncertainty on the predicted Ω^ρ_1/Ω^τ_1 ratio is itself not rigorously quantified in the literature; our quoted spread is an estimate from generator comparisons rather than a controlled theoretical uncertainty.

Circularity Check

0 steps flagged

No significant circularity: α_s extraction is anchored to external e+e- data and standard SCET factorization; self-citations are methodological, not load-bearing for the central numerical claim.

full rationale

The paper performs a global χ² fit of α_s(m_Z), Ω^ρ_1, and Θ_1 against ~451 binned data points from ALEPH, DELPHI, JADE, L3, OPAL, SLD across 35 GeV ≤ Q ≤ 207 GeV. The theory prediction (Eq. 5) is an externally checkable combination of fixed-order O(α_s^3) results from EERAD3 and CoLoRFulNNLO (refs [39-43], independent groups), N³LL′ dijet resummation in standard SCET factorization, and N²LL Sudakov shoulder resummation. None of these inputs are renamings of the output α_s. The fit is to data, not to theory; the "prediction" α_s = 0.1148 is genuinely constrained by the experimental distributions in Fig. 1. Self-citations to refs [11, 12, 18, 35, 36, 44, 47, 49] (Hoang, Mateu, Schwartz, Stewart, Bhattacharya, Zhang) provide profile-function machinery, the R-gap scheme, and shoulder resummation — these are methodological tools, not premises that force the answer. The R-gap scheme is independently testable (refs [35, 47] applied it to thrust with falsifiable predictions), and the shoulder factorization (Eq. 3) is derivable in SCET. Table II shows order-by-order convergence (NLL′ → N²LL′ → N³LL′) with the central value moving by amounts consistent with stated uncertainties — that is the opposite signature of a tautological fit. The skeptic's concern (Ω^ρ_1 absorbs higher shape-function moments, anticorrelates with α_s) is a correctness/modeling risk, not circularity: it would bias the answer but does not make the derivation reduce to its input by definition. Similarly, the comparison to thrust via "Ω^ρ_1 ≃ Ω^τ_1 from Monte Carlo hadron-mass arguments" is a consistency check, not a load-bearing input — α_s is extracted without using thrust data. The Θ_1 < 0 finding is presented honestly as model-dependent (Eq. 4 is explicitly stated to be a parametrization without first-principles derivation), not as a forced result. The "if and only if shoulder resummation is included" claim is an empirical observation about the fit, contingent on the data, not a definitional identity. No step reduces output to input by construction. Score: 1 (normal level of methodological self-citation in a multi-author program).

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Model omitted the axiom ledger; defaulted for pipeline continuity.

pith-pipeline@v0.9.0 · 9608 in / 5718 out tokens · 83325 ms · 2026-05-06T21:33:43.235345+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Operator structure of power corrections and anomalous scaling in energy correlators

    hep-ph 2026-04 unverdicted novelty 6.0

    Linear power corrections in energy correlators have a universal anomalous scaling because the dijet operator must be combined with a triple-jet component at one-loop order.