pith. machine review for the scientific record. sign in

arxiv: 1707.06529 · v2 · submitted 2017-07-20 · 🌌 astro-ph.CO · astro-ph.IM· stat.ME

Recognition: unknown

Massive data compression for parameter-dependent covariance matrices

Authors on Pith no claims yet
classification 🌌 astro-ph.CO astro-ph.IMstat.ME
keywords covariancematrixdatanumbercompressionanalysisrequiredsimulations
0
0 comments X
read the original abstract

We show how the massive data compression algorithm MOPED can be used to reduce, by orders of magnitude, the number of simulated datasets that are required to estimate the covariance matrix required for the analysis of gaussian-distributed data. This is relevant when the covariance matrix cannot be calculated directly. The compression is especially valuable when the covariance matrix varies with the model parameters. In this case, it may be prohibitively expensive to run enough simulations to estimate the full covariance matrix throughout the parameter space. This compression may be particularly valuable for the next-generation of weak lensing surveys, such as proposed for Euclid and LSST, for which the number of summary data (such as band power or shear correlation estimates) is very large, $\sim 10^4$, due to the large number of tomographic redshift bins that the data will be divided into. In the pessimistic case where the covariance matrix is estimated separately for all points in an MCMC analysis, this may require an unfeasible $10^9$ simulations. We show here that MOPED can reduce this number by a factor of 1000, or a factor of $\sim 10^6$ if some regularity in the covariance matrix is assumed, reducing the number of simulations required to a manageable $10^3$, making an otherwise intractable analysis feasible.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. KiDS+VIKING-450 cosmology with Bayesian hierarchical model redshift distributions

    astro-ph.CO 2026-05 conditional novelty 4.0

    Bayesian hierarchical modeling of photometric redshifts in KiDS+VIKING-450 raises S8 to 0.756 ± 0.039 and reduces Planck tension to 1.9σ.

  2. Machine-learning applications for weak-lensing cosmology

    astro-ph.CO 2026-05 unverdicted novelty 2.0

    Machine learning techniques can mitigate limitations in traditional weak-lensing analyses and enhance extraction of cosmological information from galaxy imaging surveys.