pith. machine review for the scientific record. sign in

arxiv: 2605.10935 · v1 · submitted 2026-05-11 · 🌌 astro-ph.CO

Recognition: 2 theorem links

· Lean Theorem

Demonstrating the Use of the Spherical Fourier Bessel Basis for Large Scale Clustering Systematics Discovery and Mitigation with eBOSS

Henry S. Grasshorn Gebhardt, James R. Cheshire IV, Olivier Dor\'e, Robin Y. Wen, Sean Bruton

Pith reviewed 2026-05-12 03:16 UTC · model grok-4.3

classification 🌌 astro-ph.CO
keywords spherical fourier-bessel basisclustering systematicsstellar contaminationlarge scale structurefNLquasar clusteringgalaxy clusteringmode selection
0
0 comments X

The pith

The Spherical Fourier-Bessel basis identifies a large-scale stellar contamination systematic in quasar clustering and an unknown systematic at plate scales.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that the Spherical Fourier-Bessel basis separates angular and radial modes of the clustering power spectrum, allowing targeted identification of contaminated modes while keeping more cosmological signal than standard bases. Applied to the LRG and QSO samples, the method checks for inconsistencies in the inferred value of fNL when different sets of modes are included or excluded. This reveals evidence for a systematic affecting large physical scales in the QSO sample that matches residual stellar contamination, plus an unidentified effect operating at the angular scale of the survey plates in both samples. A reader would care because such mode-by-mode diagnostics can improve the reliability of primordial non-Gaussianity measurements in large-scale structure data.

Core claim

The Spherical Fourier-Bessel basis, by separating angular and radial modes, permits targeted identification and mitigation of systematics in clustering surveys while retaining more cosmological signal than traditional bases. Varying the selection of angular and radial modes in the eBOSS DR16 LRG and QSO samples produces inconsistencies in the inferred fNL value. In the QSO sample this yields evidence (p less than 0.005 compared to the same cuts on mocks) of a systematic afflicting large physical scales, consistent with residual stellar contamination; the analysis also finds evidence (p less than 0.05) for an unknown systematic in both the QSO and LRG samples at the approximate angular plate,

What carries the argument

The Spherical Fourier-Bessel (SFB) basis, which decomposes the density field into independent spherical-harmonic angular modes and spherical-Bessel radial modes so that contaminated scales can be isolated and removed.

If this is right

  • Residual stellar contamination affects the large-scale clustering signal measured in the QSO sample.
  • An unidentified systematic impacts both QSO and LRG clustering at the angular scale of the survey plates and imaging.
  • The SFB basis allows removal of contaminated modes while preserving more cosmological signal than conventional bases.
  • Inconsistencies in fNL across mode selections serve as a diagnostic for scale-dependent systematics in clustering data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same mode-selection test could be applied to future surveys to clean large-scale modes before measuring primordial non-Gaussianity.
  • If imaging artifacts or stellar residuals are present at similar levels in other datasets, comparable fNL inconsistencies would be expected.
  • Extending the test to a larger suite of mock realizations would help confirm whether the observed discrepancies arise from data features rather than simulation limitations.

Load-bearing premise

That inconsistencies in the inferred fNL value when different angular and radial modes are selected reliably indicate underlying systematics rather than statistical fluctuations or mismatches with the mocks.

What would settle it

Applying the same mode cuts to the mocks and finding p-values for fNL inconsistencies that match those seen in the data would falsify the interpretation that the inconsistencies signal real systematics in the observations.

Figures

Figures reproduced from arXiv: 2605.10935 by Henry S. Grasshorn Gebhardt, James R. Cheshire IV, Olivier Dor\'e, Robin Y. Wen, Sean Bruton.

Figure 1
Figure 1. Figure 1: FIG. 1. The fit QSO bias for 10 EZMocks (colored lines, 16th [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2. The inferred median [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗
Figure 4
Figure 4. Figure 4: FIG. 4. Various cuts on the average measured SFB power spectrum for 1,000 QSO EZmocks. Top left: the entire SFB power [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5. Similar to Figure [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6. Inferred [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7. Inferred [PITH_FULL_IMAGE:figures/full_fig_p011_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: shows the derived posteriors on fNL for the LRG and QSO samples with and without the off diago￾nals included. We see that, when removing the off-diagonals (∆n = 0), the quasar’s lowest ℓ bin comes into agreement with the LRG data points, indicating that the systematic in this bin, which we hypothesized in Section IV C was stel￾lar contamination, is specifically contaminating the off￾diagonal modes of the S… view at source ↗
Figure 9
Figure 9. Figure 9: FIG. 9. Inferred values of [PITH_FULL_IMAGE:figures/full_fig_p016_9.png] view at source ↗
Figure 11
Figure 11. Figure 11: FIG. 11. Corner plots for inferred biases, [PITH_FULL_IMAGE:figures/full_fig_p017_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: FIG. 12. Similar to Figure [PITH_FULL_IMAGE:figures/full_fig_p017_12.png] view at source ↗
read the original abstract

The Spherical Fourier-Bessel (SFB) basis, in separating the angular and radial modes of the power spectrum, permits a targeted identification and mitigation of systematics in clustering surveys while retaining more cosmological signal than traditional bases. We demonstrate this principle on the eBOSS DR16 LRG and QSO samples, identifying modes which may be contaminated by systematics. Our initial inference on the LRG sample yields an fNL value consistent with zero, while the QSO value is in slight tension with zero. Using the SFB basis, we vary the selection of angular and radial modes to search for inconsistencies in the inferred value of fNL, an indicator of underlying systematics. In the QSO sample, we find evidence (p < 0.005 compared to the same cuts on EZMocks) of a systematic afflicting large physical scales, which is consistent with residual stellar contamination; we also find evidence (p < 0.05) for an unknown systematic in the QSO and LRG samples at the approximate angular plate and imaging scale of eBOSS.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes the use of the Spherical Fourier-Bessel (SFB) basis for identifying and mitigating systematics in large-scale galaxy clustering measurements, as it separates angular and radial modes while potentially retaining more cosmological information than standard bases. Applied to the eBOSS DR16 LRG and QSO samples, the authors vary selections of angular and radial modes and use inconsistencies in the inferred local primordial non-Gaussianity parameter f_NL as a diagnostic for systematics. They report statistically significant evidence (p < 0.005 relative to EZMocks) for a large-scale systematic in the QSO sample consistent with residual stellar contamination, and evidence (p < 0.05) for an unknown systematic affecting both samples at the scale of eBOSS angular plates and imaging.

Significance. If the central claims hold after detailed validation, the SFB-based mode-variation approach could provide a useful diagnostic tool for systematics in future surveys (e.g., DESI, Euclid) where large-scale modes inform primordial non-Gaussianity and other cosmological parameters. The choice of f_NL as an independent diagnostic is a methodological strength, as is the explicit comparison to EZMocks. However, the overall significance remains provisional given the absence of full methodological details.

major comments (2)
  1. [Abstract] Abstract: The reported p < 0.005 (QSO large-scale) and p < 0.05 (plate-scale) evidence for systematics is load-bearing for the central claim, yet the abstract provides no information on the precise angular/radial mode cuts applied, the exact statistical procedure for the p-value calculation, the number of tests performed, or any multiple-comparison correction. Without these, it is impossible to assess whether the significances are robust or inflated.
  2. [Abstract] Abstract: The interpretation that f_NL inconsistencies under SFB mode selections reliably flag systematics (stellar contamination or unknown plate-scale effects) rather than statistical fluctuations, EZMocks mismatches, or post-selection biases is a key assumption. This requires explicit validation in the methods, including quantitative checks that SFB retains more signal than traditional bases and that the reported p-values survive verification of post-selection effects.
minor comments (1)
  1. [Abstract] Abstract: Adding brief context on sample sizes, redshift ranges, and the exact definition of the 'large physical scales' and 'angular plate scale' would improve clarity for readers unfamiliar with eBOSS specifics.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback, which has helped us improve the clarity and robustness of our presentation. We address each major comment below and have revised the manuscript to incorporate additional details and validations where appropriate.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The reported p < 0.005 (QSO large-scale) and p < 0.05 (plate-scale) evidence for systematics is load-bearing for the central claim, yet the abstract provides no information on the precise angular/radial mode cuts applied, the exact statistical procedure for the p-value calculation, the number of tests performed, or any multiple-comparison correction. Without these, it is impossible to assess whether the significances are robust or inflated.

    Authors: We agree that the abstract should provide sufficient information for readers to evaluate the statistical claims. In the revised manuscript, we have expanded the abstract to specify the angular and radial mode cuts (large-scale radial modes with k < 0.01 h/Mpc and angular modes corresponding to eBOSS plate scales), the p-value procedure (fraction of EZMocks yielding more extreme f_NL shifts under identical cuts), and clarified that the tests are a small number of targeted, pre-specified selections rather than an exhaustive search requiring multiple-comparison correction. The full procedural details remain in the methods section. revision: yes

  2. Referee: [Abstract] Abstract: The interpretation that f_NL inconsistencies under SFB mode selections reliably flag systematics (stellar contamination or unknown plate-scale effects) rather than statistical fluctuations, EZMocks mismatches, or post-selection biases is a key assumption. This requires explicit validation in the methods, including quantitative checks that SFB retains more signal than traditional bases and that the reported p-values survive verification of post-selection effects.

    Authors: We acknowledge that explicit validation strengthens the central claim. The manuscript already demonstrates that SFB retains additional cosmological information relative to traditional bases through direct comparison of f_NL constraints. We have added a new methods subsection with quantitative checks: (i) signal retention is quantified by comparing the variance of f_NL posteriors in SFB versus standard bases on the same data, and (ii) post-selection effects are tested by repeating the exact mode-variation procedure on EZMocks, confirming that the observed p-values do not arise from the selection process itself. These additions address the concern without altering the reported results. revision: partial

Circularity Check

0 steps flagged

No significant circularity

full rationale

The abstract presents the SFB basis as a tool for mode separation to detect systematics via inconsistencies in fNL inferences across angular/radial cuts, benchmarked against EZMocks with reported p-values. No derivation chain, fitted parameter, or self-citation is described that reduces the claimed evidence (stellar contamination at large scales, unknown effect at plate scale) to an input by construction. The diagnostic use of fNL is independent and externally compared, making the approach self-contained against the provided text.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters or invented entities. The claims rest on domain assumptions about SFB mode separation and the interpretation of fNL variations as systematics indicators.

axioms (2)
  • domain assumption The Spherical Fourier-Bessel basis separates angular and radial modes more effectively than traditional bases, permitting targeted systematics identification while retaining more cosmological signal.
    Directly stated in the opening sentence of the abstract as the enabling principle.
  • domain assumption Inconsistencies in inferred fNL across different selections of angular and radial modes indicate the presence of underlying systematics in the data.
    Used as the core diagnostic method for identifying contaminated modes in both samples.

pith-pipeline@v0.9.0 · 5483 in / 1511 out tokens · 88834 ms · 2026-05-12T03:16:34.389652+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

Reference graph

Works this paper leans on

13 extracted references · 13 canonical work pages · 1 internal anchor

  1. [1]

    Demonstrating the Use of the Spherical Fourier Bessel Basis for Large Scale Clustering Systematics Discovery and Mitigation with eBOSS

    are or will soon be measuring the 3-dimensional large-scale structure (LSS) of the Universe with unprece- dented statistical precision. Advances in technology, infrastructure, and in observational and computational techniques mean that these surveys will measureO(109) galaxies over an unparalleled cosmological volume. With such advances in statistical pre...

  2. [2]

    close- pair

    Chebyshev expansion The linear galaxy biasb1 is redshift-dependent for se- lected galaxy samples, and here we use the Chebyshev polynomialsT n up to theNth order to model its redshift evolution b1(z) = PN n=0 b1,nTn(˜x(z)) D(z) .(14) We performed the Chebyshev expansion over the nor- malized comoving distance˜x(x(z)) = (x(z)−x mid)/∆x wherex mid = 1 2(xma...

  3. [3]

    complete

    were constructed using the Effective Zel’dovich ap- proximation and a standard flatΛCDM cosmology with Ωm = 0.31. Both “complete” and “realistic” catalogs are provided, the former representing an ideal galaxy cat- alog observed perfectly, free from any observational ef- fects, and the latter representing catalogs which have had all known observational and...

  4. [4]

    A givennmode expresses as a continuous “trace” in the SFB power spectrum withℓincreasing as effectivekincreases (the upper right panel shows the n=0 mode and its first two off-diagonals),

  5. [5]

    lowℓmodes tend to turn over and rapidly approach zero power acrossnmodes (compare the modes re- moved between the upper- and lower-left panels), a result of the integral constraint, and

  6. [6]

    off diagonals (n̸=n ′) tend to cluster near zero, a reflection of cosmological homogeneity (lower right panel) (Kheket al.2024). A. Radial Cuts We take one LRG EZMock and successively increase the lowestnkept in the data vector,nmin and inferfNL. Note that all of the cuts also include anℓmin = 8cut to remove modes which are too large to be probed by the s...

  7. [7]

    the lowℓrange is biased high and this bias is worse for the QSO sample and 2) removing then= 0mode from the QSO data vector removes a significant amount of this bias. Recalling the Wenet al.(2025) result that stellar con- tamination largely resides in the(n= 0, ℓ <∼ 50)modes, one possible source of this systematic may be residual stellar contamination in ...

  8. [8]

    cutting the off-diagonals from the data vector,

  9. [9]

    changing the assumedpvalue, and

  10. [10]

    allowing a more flexible FoG model and fitting its parameters

  11. [11]

    Such radial inhomogeneities, on the largest scales, could arise from , e.g., systematics or FoG model misspecification

    Off-Diagonals Unaccounted for radial inhomogeneities in the data would manifest as excess power in the off-diagonals of the SFB power spectrum. Such radial inhomogeneities, on the largest scales, could arise from , e.g., systematics or FoG model misspecification. As such, it is reasonable to try cutting the off-diagonals, which may be where prob- lematic ...

  12. [12]

    QSOs, inpre- vious studies, are typically prescribed a value ofp= 1.6, adivergencefromtheuniversalityrelation, reflectingthat quasars, as a population average, have a recent merger

    Changing the value ofp The value ofp, which, as described in Section IIB, characterizes the merger history of the tracer, controls the response of a tracer tofNL throughb ϕ, which is in turnfullydegeneratewiththevalueoff NL. QSOs, inpre- vious studies, are typically prescribed a value ofp= 1.6, adivergencefromtheuniversalityrelation, reflectingthat quasar...

  13. [13]

    Totestthis, weallowmoreflexibilityintheFoGmodel, letting it evolve in redshift in the same manner that our bias model evolves (with Chebyshev polynomials, see Eq

    FoG Modeling Finally, we consider that the FoG model has insuffi- cient complexity to describe the real behavior, which we hypothesize could lead to a bias in the inferredfNL. Totestthis, weallowmoreflexibilityintheFoGmodel, letting it evolve in redshift in the same manner that our bias model evolves (with Chebyshev polynomials, see Eq. (16)). With this n...