Augmented Correlation Functions for Spectroscopic Galaxy Surveys
Pith reviewed 2026-06-29 05:35 UTC · model grok-4.3
The pith
The augmented correlation function extracts more cosmological information from galaxy redshift surveys than standard two-point statistics by incorporating latent dimensions from field transformations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The augmented correlation function is a general framework in which an arbitrary transformation of the galaxy field defines additional latent dimensions that extend the standard two-point correlation function and isolate clustering properties averaged out in conventional analyses. In a proof-of-concept application using the pairwise gradient of the inverse Laplacian of the galaxy density field, this statistic distinguishes clustering regimes associated with infalling and outflowing pairs. Fisher forecasts based on z=1 halo catalogues from the Quijote simulations within νΛCDM cosmology show that the augmented correlation systematically yields tighter constraints on all cosmological parameters
What carries the argument
The augmented correlation function, which extends the two-point correlation function by adding latent dimensions from transformations of the galaxy field to isolate otherwise averaged clustering properties.
If this is right
- The augmented correlation yields tighter constraints on cosmological parameters in νΛCDM models.
- The specific implementation distinguishes infalling and outflowing pair clustering regimes.
- It serves as a flexible and computationally efficient alternative to standard two-point statistics for spectroscopic surveys.
Where Pith is reading between the lines
- Applying the framework to real survey data could reveal whether the forecasted gains translate to actual observations.
- Other transformations of the density field might isolate different physical effects like bias or redshift-space distortions.
- Combining multiple latent variables could further increase the information content beyond the single example tested.
Load-bearing premise
Fisher forecasts applied to halo catalogues from the Quijote simulations accurately reflect the true information gain from the augmented correlation function.
What would settle it
Running the same analysis on mock catalogs that include additional effects like survey geometry, incompleteness, or more realistic galaxy formation models and checking if the constraint improvements persist would test the claim.
read the original abstract
Galaxy redshift surveys encode a wealth of information generated by nonlinear gravitational evolution, galaxy bias, and redshift-space distortions, only part of which is accessible through standard two-point statistics. Motivated by the need for flexible and computationally efficient alternatives, we introduce the augmented correlation function, a general framework in which an arbitrary transformation of the galaxy field defines additional ``latent'' dimensions that extend the standard two-point correlation function and isolate clustering properties averaged out in conventional analyses. As a proof of concept, we study a latent variable constructed from the pairwise gradient of the inverse Laplacian of the galaxy density field, showing that the resulting statistics naturally distinguish clustering regimes associated with infalling and outflowing pairs. Using Fisher forecasts based on $z=1$ halo catalogues from the Quijote simulations within $\nu\Lambda\mathrm{CDM}$ cosmology, we find that the augmented correlation systematically yields tighter constraints on all cosmological parameters considered. Although these improvements should be regarded as indicative given the exploratory nature of the analysis and the limitations of Fisher forecasts and simulations, our results demonstrate the potential of augmented correlations as a flexible framework for extracting additional information from spectroscopic galaxy surveys.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces the augmented correlation function as a framework that extends the standard two-point correlation function by incorporating arbitrary transformations of the galaxy field as additional 'latent' dimensions. As a proof of concept, it constructs a latent variable from the pairwise gradient of the inverse Laplacian of the galaxy density field and applies Fisher forecasts to z=1 halo catalogs from the Quijote simulations in νΛCDM cosmology, reporting systematically tighter constraints on all cosmological parameters considered. The abstract qualifies these improvements as indicative due to the exploratory nature of the analysis.
Significance. If validated beyond Fisher forecasts, the framework could offer a computationally efficient way to isolate additional clustering information from spectroscopic surveys that is averaged out in conventional analyses. The current results, however, rest entirely on unvalidated Fisher matrices for a higher-dimensional statistic, limiting their immediate impact.
major comments (1)
- [Fisher analysis / abstract] Fisher forecast section (implicit in abstract and methods): The headline claim of tighter constraints on all parameters relies on Fisher matrices for the augmented data vector, but the manuscript does not verify the Gaussian likelihood assumption for this higher-dimensional statistic (including the latent gradient field) nor demonstrate that the covariance estimated from the finite Quijote suite remains invertible and unbiased. This directly undermines the reported information gain, as noted by the paper's own caveats on the exploratory character of both Fisher methods and simulations.
minor comments (1)
- [Methods] Clarify the precise definition and normalization of the latent variable (pairwise gradient of inverse Laplacian) in the methods section to ensure reproducibility.
Simulated Author's Rebuttal
We thank the referee for their constructive comments. We address the single major comment below, agreeing where the observation is accurate while noting the manuscript's existing qualifications.
read point-by-point responses
-
Referee: [Fisher analysis / abstract] Fisher forecast section (implicit in abstract and methods): The headline claim of tighter constraints on all parameters relies on Fisher matrices for the augmented data vector, but the manuscript does not verify the Gaussian likelihood assumption for this higher-dimensional statistic (including the latent gradient field) nor demonstrate that the covariance estimated from the finite Quijote suite remains invertible and unbiased. This directly undermines the reported information gain, as noted by the paper's own caveats on the exploratory character of both Fisher methods and simulations.
Authors: We agree that the manuscript does not provide explicit verification of the Gaussian likelihood assumption for the augmented statistic or additional checks on covariance invertibility and bias beyond the standard Quijote analysis pipeline. The analysis is presented as exploratory, and the abstract already qualifies the reported improvements as indicative owing to the limitations of Fisher forecasts and simulations. We do not claim the results are validated but rather illustrate the framework's potential. To address the point, we will add a short paragraph in the methods section discussing the standard assumptions of the Fisher approach in this context and reiterating the exploratory nature. revision: partial
Circularity Check
No circularity: new statistic defined independently and validated via external simulations
full rationale
The derivation introduces the augmented correlation function via an explicit transformation (pairwise gradient of inverse Laplacian) applied to the galaxy density field; this definition does not reference the target cosmological parameters or the Fisher results. The claim of tighter constraints is obtained by applying standard Fisher forecasts to external Quijote halo catalogues, which constitute an independent numerical experiment rather than a fit or self-referential prediction. No self-citations, ansatzes smuggled via prior work, or renamings of known results appear as load-bearing steps. The chain remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Fisher matrix forecasts accurately indicate the improvement in cosmological parameter constraints from the new statistic
invented entities (1)
-
latent variable from pairwise gradient of the inverse Laplacian of the galaxy density field
no independent evidence
Reference graph
Works this paper leans on
-
[1]
The DESI Experiment Part I: Science,Targeting, and Survey Design
DESI Collaboration, A. Aghamousa, J. Aguilar, S. Ahlen, S. Alam, L.E. Allen et al.,The DESI Experiment Part I: Science,Targeting, and Survey Design,arXiv e-prints(2016) arXiv:1611.00036 [1611.00036]. – 24 –
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[2]
The DESI Experiment Part II: Instrument Design
DESI Collaboration, A. Aghamousa, J. Aguilar, S. Ahlen, S. Alam, L.E. Allen et al.,The DESI Experiment Part II: Instrument Design,arXiv e-prints(2016) arXiv:1611.00037 [1611.00037]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[3]
Euclid Collaboration, Y. Mellier, Abdurro’uf, J.A. Acevedo Barroso, A. Ach´ ucarro, J. Adamek et al.,Euclid: I. Overview of the Euclid mission, A&A697(2025) A1 [2405.13491]
-
[4]
H. Gil-Mar´ ın, W.J. Percival, L. Verde, J.R. Brownstein, C.-H. Chuang, F.-S. Kitaura et al., The clustering of galaxies in the SDSS-III Baryon Oscillation Spectroscopic Survey: RSD measurement from the power spectrum and bispectrum of the DR12 BOSS galaxies, MNRAS 465(2017) 1757 [1606.00439]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[5]
Z. Slepian, D.J. Eisenstein, J.R. Brownstein, C.-H. Chuang, H. Gil-Mar´ ın, S. Ho et al., Detection of baryon acoustic oscillation features in the large-scale three-point correlation function of SDSS BOSS DR12 CMASS galaxies, MNRAS469(2017) 1738 [1607.06097]
work page internal anchor Pith review Pith/arXiv arXiv 2017
- [6]
-
[7]
M.M. Ivanov, O.H.E. Philcox, G. Cabass, T. Nishimichi, M. Simonovi´ c and M. Zaldarriaga, Cosmology with the galaxy bispectrum multipoles: Optimal estimation and application to BOSS data, Phys. Rev. D107(2023) 083515 [2302.04414]
-
[8]
G. D’Amico, Y. Donath, M. Lewandowski, L. Senatore and P. Zhang,The BOSS bispectrum analysis at one loop from the Effective Field Theory of Large-Scale Structure, J. Cosmology Astropart. Phys.2024(2024) 059 [2206.08327]
-
[9]
S. Novell-Masot, H. Gil-Mar´ ın, L. Verde, J. Aguilar, S. Ahlen, S. Bailey et al.,Full-Shape analysis of the power spectrum and bispectrum of DESI DR1 LRG and QSO samples, J. Cosmology Astropart. Phys.2025(2025) 005 [2503.09714]
-
[10]
A. Chudaykin, M.M. Ivanov and O.H.E. Philcox,Reanalyzing DESI DR1: 1. LambdaCDM Constraints from the Power Spectrum and Bispectrum,arXiv e-prints(2025) arXiv:2507.13433 [2507.13433]
-
[11]
T. Nishimichi, M. Takada, R. Takahashi, K. Osato, M. Shirasaki, T. Oogi et al.,Dark Quest. I. Fast and Accurate Emulation of Halo Clustering Statistics and Its Application to Galaxy Clustering, ApJ884(2019) 29 [1811.09504]
-
[12]
C. Hahn, M. Eickenberg, S. Ho, J. Hou, P. Lemos, E. Massara et al.,A forward modeling approach to analyzing galaxy clustering with SIMBIG,Proceedings of the National Academy of Science120(2023) e2218810120
2023
-
[13]
C. Cuesta-Lazaro, E. Paillas, S. Yuan, Y.-C. Cai, S. Nadathur, W.J. Percival et al.,SUNBIRD: a simulation-based model for full-shape density-split clustering, MNRAS531(2024) 3336 [2309.16539]
-
[14]
I. S´ aez-Casares, Y. Rasera and B. Li,The e-MANTIS emulator: fast predictions of the non-linear matter power spectrum in f(R)CDM cosmology, MNRAS527(2024) 7242 [2303.08899]
-
[15]
S. Mallat,Group invariant scattering,Communications on Pure and Applied Mathematics65 (2012) 1331 [https://onlinelibrary.wiley.com/doi/pdf/10.1002/cpa.21413]
-
[16]
G. Valogiannis and C. Dvorkin,Towards an optimal estimation of cosmological parameters with the wavelet scattering transform, Phys. Rev. D105(2022) 103534 [2108.07821]
-
[17]
G. Valogiannis, S. Yuan and C. Dvorkin,Precise cosmological constraints from BOSS galaxy clustering with a simulation-based emulator of the wavelet scattering transform, Phys. Rev. D 109(2024) 103503 [2310.16116]. – 25 –
-
[18]
B. R´ egaldo-Saint Blancard, C. Hahn, S. Ho, J. Hou, P. Lemos, E. Massara et al.,Galaxy clustering analysis with SimBIG and the wavelet scattering transform, Phys. Rev. D109(2024) 083535 [2310.15250]
-
[19]
Robust Morphological Measures for Large-Scale Structure in the Universe
K.R. Mecke, T. Buchert and H. Wagner,Robust morphological measures for large-scale structure in the Universe, A&A288(1994) 697 [astro-ph/9312028]
work page internal anchor Pith review Pith/arXiv arXiv 1994
-
[20]
W. Fang, B. Li and G.-B. Zhao,New Probe of Departures from General Relativity Using Minkowski Functionals, Phys. Rev. Lett.118(2017) 181301 [1704.02325]
work page internal anchor Pith review Pith/arXiv arXiv 2017
- [21]
- [22]
-
[23]
A. Banerjee and T. Abel,Nearest neighbour distributions: New statistical measures for cosmological clustering, MNRAS500(2021) 5479 [2007.13342]
- [24]
-
[25]
Barrow, S.P
J.D. Barrow, S.P. Bhavsar and D.H. Sonoda,Minimal spanning trees, filaments and galaxy clustering, MNRAS216(1985) 17
1985
-
[26]
M. Alpaslan, A.S.G. Robotham, S. Driver, P. Norberg, I. Baldry, A.E. Bauer et al.,Galaxy And Mass Assembly (GAMA): the large-scale structure of galaxies and comparison to mock universes, MNRAS438(2014) 177 [1311.1211]
work page internal anchor Pith review Pith/arXiv arXiv 2014
- [27]
- [28]
-
[29]
Kaiser,Clustering in real space and in redshift space, MNRAS227(1987) 1
N. Kaiser,Clustering in real space and in redshift space, MNRAS227(1987) 1
1987
-
[30]
Alcock and B
C. Alcock and B. Paczynski,An evolution free test for non-zero cosmological constant, Nature 281(1979) 358
1979
-
[31]
Precision cosmography with stacked voids
G. Lavaux and B.D. Wandelt,Precision Cosmography with Stacked Voids, ApJ754(2012) 109 [1110.0345]
work page internal anchor Pith review Pith/arXiv arXiv 2012
-
[32]
Cosmology with Void-Galaxy Correlations
N. Hamaus, B.D. Wandelt, P.M. Sutter, G. Lavaux and M.S. Warren,Cosmology with Void-Galaxy Correlations, Phys. Rev. Lett.112(2014) 041304 [1307.2571]
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[33]
Y.-C. Cai, A. Taylor, J.A. Peacock and N. Padilla,Redshift-space distortions around voids, MNRAS462(2016) 2465 [1603.05184]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[34]
S. Nadathur, P.M. Carter, W.J. Percival, H.A. Winther and J.E. Bautista,Beyond BAO: Improving cosmological constraints from BOSS data with measurement of the void-galaxy cross-correlation, Phys. Rev. D100(2019) 023504 [1904.01030]
work page internal anchor Pith review Pith/arXiv arXiv 2019
- [35]
-
[36]
The environmental dependence of galaxy clustering in the Sloan Digital Sky Survey
U. Abbas and R.K. Sheth,The environmental dependence of galaxy clustering in the Sloan Digital Sky Survey, MNRAS372(2006) 1749 [astro-ph/0601407]
work page internal anchor Pith review Pith/arXiv arXiv 2006
-
[37]
E. Paillas, Y.-C. Cai, N. Padilla and A.G. S´ anchez,Redshift-space distortions with split densities, MNRAS505(2021) 5731 [2101.09854]. – 26 –
-
[38]
E. Paillas, C. Cuesta-Lazaro, P. Zarrouk, Y.-C. Cai, W.J. Percival, S. Nadathur et al., ConstrainingνΛCDM with density-split clustering, MNRAS522(2023) 606 [2209.04310]
-
[39]
E. Paillas, C. Cuesta-Lazaro, W.J. Percival, S. Nadathur, Y.-C. Cai, S. Yuan et al., Cosmological constraints from density-split clustering in the BOSS CMASS galaxy sample, MNRAS531(2024) 898 [2309.16541]
-
[40]
A marked correlation function for constraining modified gravity models
M. White,A marked correlation function for constraining modified gravity models, J. Cosmology Astropart. Phys.2016(2016) 057 [1609.08632]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[41]
Testing modified gravity using a marked correlation function
J. Armijo, Y.-C. Cai, N. Padilla, B. Li and J.A. Peacock,Testing modified gravity using a marked correlation function, MNRAS478(2018) 3627 [1801.08975]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[42]
Marked clustering statistics in $f(R)$ gravity cosmologies
C. Hern´ andez-Aguayo, C.M. Baugh and B. Li,Marked clustering statistics in f(R) gravity cosmologies, MNRAS479(2018) 4824 [1801.08880]
work page internal anchor Pith review Pith/arXiv arXiv 2018
- [43]
-
[44]
E. Massara, F. Villaescusa-Navarro, S. Ho, N. Dalal and D.N. Spergel,Using the Marked Power Spectrum to Detect the Signature of Neutrinos in Large-Scale Structure, Phys. Rev. Lett.126(2021) 011301 [2001.11024]
-
[45]
M. K¨ archer, J. Bel and S. de la Torre,Towards an optimal marked correlation function analysis for the detection of modified gravity, A&A694(2025) A253 [2406.02504]
-
[46]
Improving Cosmological Distance Measurements by Reconstruction of the Baryon Acoustic Peak
D.J. Eisenstein, H.-J. Seo, E. Sirko and D.N. Spergel,Improving Cosmological Distance Measurements by Reconstruction of the Baryon Acoustic Peak, ApJ664(2007) 675 [astro-ph/0604362]
work page internal anchor Pith review Pith/arXiv arXiv 2007
-
[47]
Y. Wang, G.-B. Zhao, K. Koyama, W.J. Percival, R. Takahashi, C. Hikage et al.,Extracting high-order cosmological information in galaxy surveys with power spectra,Communications Physics7(2024) 130 [2202.05248]
- [48]
-
[49]
Landy and A.S
S.D. Landy and A.S. Szalay,Bias and Variance of Angular Correlation Functions, ApJ412 (1993) 64
1993
-
[50]
On the Prediction of Velocity Fields from Redshift Space Galaxy Samples
A. Nusser and M. Davis,On the Prediction of Velocity Fields from Redshift Space Galaxy Samples, ApJ421(1994) L1 [astro-ph/9309009]
work page internal anchor Pith review Pith/arXiv arXiv 1994
-
[51]
N. Padmanabhan, X. Xu, D.J. Eisenstein, R. Scalzo, A.J. Cuesta, K.T. Mehta et al.,A 2 per cent distance to z = 0.35 by reconstructing baryon acoustic oscillations - I. Methods and application to the Sloan Digital Sky Survey, MNRAS427(2012) 2132 [1202.0090]
work page internal anchor Pith review Pith/arXiv arXiv 2012
- [52]
-
[53]
C. Saulder, C. Howlett, K.A. Douglass, K. Said, S. BenZvi, S. Ahlen et al.,Target selection for the DESI Peculiar Velocity Survey, MNRAS525(2023) 1106 [2302.13760]
-
[54]
Fisher,The logic of inductive inference,Journal of the Royal Statistical Society98(1935) 39
R.A. Fisher,The logic of inductive inference,Journal of the Royal Statistical Society98(1935) 39
1935
-
[55]
Measuring cosmological parameters with galaxy surveys
M. Tegmark,Measuring Cosmological Parameters with Galaxy Surveys, Phys. Rev. Lett.79 (1997) 3806 [astro-ph/9706198]
work page internal anchor Pith review Pith/arXiv arXiv 1997
-
[56]
Karhunen-Loeve eigenvalue problems in cosmology: how should we tackle large data sets?
M. Tegmark, A.N. Taylor and A.F. Heavens,Karhunen-Lo` eve Eigenvalue Problems in Cosmology: How Should We Tackle Large Data Sets?, ApJ480(1997) 22 [astro-ph/9603021]
work page internal anchor Pith review Pith/arXiv arXiv 1997
-
[57]
J. Carron,On the assumption of Gaussianity for cosmological two-point statistics and parameter dependent covariance matrices, A&A551(2013) A88 [1204.4724]. – 27 –
work page internal anchor Pith review Pith/arXiv arXiv 2013
-
[58]
A.A. Berlind and D.H. Weinberg,The Halo Occupation Distribution: Toward an Empirical Determination of the Relation between Galaxies and Mass, ApJ575(2002) 587 [astro-ph/0109001]
work page internal anchor Pith review Pith/arXiv arXiv 2002
-
[59]
F. Villaescusa-Navarro, C. Hahn, E. Massara, A. Banerjee, A.M. Delgado, D.K. Ramanah et al.,The Quijote Simulations, ApJS250(2020) 2 [1909.05273]
-
[60]
The cosmological simulation code GADGET-2
V. Springel,The cosmological simulation code GADGET-2, MNRAS364(2005) 1105 [astro-ph/0505010]
work page internal anchor Pith review Pith/arXiv arXiv 2005
-
[61]
Davis, G
M. Davis, G. Efstathiou, C.S. Frenk and S.D.M. White,The evolution of large-scale structure in a universe dominated by cold dark matter, ApJ292(1985) 371
1985
-
[62]
Accurate Estimators of Correlation Functions in Fourier Space
E. Sefusatti, M. Crocce, R. Scoccimarro and H.M.P. Couchman,Accurate estimators of correlation functions in Fourier space, MNRAS460(2016) 3624 [1512.07295]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[63]
Correcting for the alias effect when measuring the power spectrum using FFT
Y.P. Jing,Correcting for the Alias Effect When Measuring the Power Spectrum Using a Fast Fourier Transform, ApJ620(2005) 559 [astro-ph/0409240]
work page internal anchor Pith review Pith/arXiv arXiv 2005
-
[64]
Redshift-Space Distortions, Pairwise Velocities and Nonlinearities
R. Scoccimarro,Redshift-space distortions, pairwise velocities, and nonlinearities, Phys. Rev. D 70(2004) 083007 [astro-ph/0407214]
work page internal anchor Pith review Pith/arXiv arXiv 2004
-
[65]
D. Bianchi, M. Chiesa and L. Guzzo,Improving the modelling of redshift-space distortions - I. A bivariate Gaussian description for the galaxy pairwise velocity distributions, MNRAS446 (2015) 75 [1407.4753]
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[66]
D. Bianchi, W.J. Percival and J. Bel,Improving the modelling of redshift-space distortions- II. A pairwise velocity model covering large and small scales, MNRAS463(2016) 3783 [1602.02780]
work page internal anchor Pith review Pith/arXiv arXiv 2016
-
[67]
Density-dependent clustering: I. Pulling back the curtains on motions of the BAO peak
M.C. Neyrinck, I. Szapudi, N. McCullagh, A.S. Szalay, B. Falck and J. Wang, Density-dependent clustering - I. Pullingback the curtains on motions of the BAO peak, MNRAS478(2018) 2495 [1610.06215]
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[68]
Eulerian BAO Reconstructions and N-Point Statistics
M. Schmittfull, Y. Feng, F. Beutler, B. Sherwin and M.Y. Chu,Eulerian BAO reconstructions and N -point statistics, Phys. Rev. D92(2015) 123522 [1508.06972]
work page internal anchor Pith review Pith/arXiv arXiv 2015
- [69]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.