Nonparametric Riemannian Empirical Bayes, and Denoising Measurements on Manifolds
Pith reviewed 2026-06-27 11:01 UTC · model grok-4.3
The pith
A tangential Bayes denoiser derived from a Tweedie-Eddington formula nearly achieves the Bayes risk for manifold-valued measurements in low noise.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The tangential Bayes denoiser, identified by a novel Tweedie-Eddington formula for Riemannian Gaussian mixture models, achieves nearly the Bayes risk in a low-noise regime. A spectral approximation to this denoiser converges at finite-sample rates, which on the circle are shown to be minimax optimal and nonparametric due to density singularities at the cut locus.
What carries the argument
The tangential Bayes denoiser: a first-order approximation to the posterior Fréchet mean obtained from the marginal distribution of measurements via the Tweedie-Eddington formula.
If this is right
- The tangential Bayes denoiser nearly achieves the Bayes risk in low-noise regimes on manifolds.
- Finite-sample rates for the spectral approximation are derived, slower than Euclidean due to cut locus singularities.
- The denoiser is minimax-optimal on the circle with a nonparametric convergence rate.
- The methodology applies to denoising sphere-valued gamma ray burst locations and torus-valued protein torsion angles.
Where Pith is reading between the lines
- Geometric features like the cut locus impose limits on denoising performance that are absent in Euclidean settings.
- The tangential approximation approach could extend to other manifold-valued statistical problems.
- Practical implementation in scientific applications demonstrates utility beyond theoretical rates.
Load-bearing premise
The first-order tangential approximation to the posterior Fréchet mean remains accurate enough for near-Bayes performance in the low-noise regime.
What would settle it
Numerical experiments on the circle where the excess risk of the tangential denoiser over the Bayes risk fails to approach zero as noise decreases would disprove the near-Bayes claim.
read the original abstract
We initiate the study of nonparametric empirical Bayes denoising methods in the setting where both the latent variables and their measurements lie on a compact Riemannian manifold, and where the likelihood is a Riemannian Gaussian distribution. Our starting point is a novel Tweedie-Eddington formula for Riemannian Gaussian mixture models which identifies a certain surrogate oracle denoiser in terms of the marginal distribution of the measurements; it avoids the explicit computation of the posterior Fr\'echet mean (as required by the Bayes denoiser) via a first-order approximation, hence we refer to it as the "tangential" Bayes denoiser. We show that this surrogate oracle achieves nearly the Bayes risk in a low-noise regime, we construct a fully data-driven approximation of it using the spectral theory of the Laplace-Beltrami operator, and we establish finite-sample rates of convergence for the distance between the the surrogate oracle and its approximation. Contrasting the nearly-parametric rates from the Euclidean setting, the rates in the Riemannian setting are slower due to the singularities of the Riemannian Gaussian density at the cut locus of its Fr\'echet mean; in the special case of the circle we establish matching lower bounds which show that our proposed denoiser is minimax-optimal, and that the denoising problem exhibits a genuinely nonparametric rate of convergence. Lastly, we implement our methodology in two scientific applications: in astronomy, the sphere-valued problem of denoising the locations of gamma ray bursts; in structural biology, the torus-valued problem of denoising pairs of torsion angles of adjacent amino acids in a protein (i.e., the Ramachandran plot).
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops nonparametric empirical Bayes denoising for data on compact Riemannian manifolds with Riemannian Gaussian noise. It introduces a Tweedie-Eddington formula leading to a tangential Bayes denoiser that approximates the posterior Fréchet mean, proves it nearly achieves the Bayes risk in low-noise regimes, constructs a spectral Laplace-Beltrami based estimator with finite-sample convergence rates, establishes matching lower bounds on the circle showing minimax optimality and nonparametric rates due to cut-locus singularities, and demonstrates the method on gamma-ray burst locations and protein torsion angles.
Significance. If the central approximation error bounds hold, this provides a novel extension of empirical Bayes methods to manifold-valued data, with the geometric singularities leading to slower rates than in Euclidean space. The matching lower bounds on the circle and the data-driven spectral estimator are notable strengths. The applications suggest practical relevance in astronomy and structural biology.
major comments (2)
- [§4 (Tweedie-Eddington formula and tangential denoiser)] §4 (Tweedie-Eddington formula and tangential denoiser): The claim that the first-order tangential approximation achieves risk(surrogate) = risk(Bayes) + o(rate), where rate is the nonparametric rate induced by cut-locus singularities, requires an explicit quantitative bound separating the remainder (involving second derivatives of the log-density and curvature of the exponential map) from the statistical error. The non-smoothness of the Riemannian Gaussian density at the cut locus may make this remainder O(σ) rather than o(σ) in the low-noise regime, which would be comparable to the claimed convergence rate.
- [§5 (Finite-sample rates for spectral estimator)] §5 (Finite-sample rates for spectral estimator): The stated finite-sample rates for the distance between the surrogate oracle and its spectral approximation must be shown to remain valid after accounting for the tangential approximation error; if the approximation remainder is not o of the rate, the near-Bayes performance and minimax optimality claims on the circle are affected.
minor comments (2)
- [Abstract] Abstract: briefly state the explicit form of the finite-sample rates (e.g., the order in n and σ) to clarify the nonparametric nature of the convergence.
- [Notation and definitions] Notation: ensure the definition of the tangential projection and the surrogate oracle is introduced with a single consistent symbol before its use in the risk bounds.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive report. The comments on the quantitative control of the tangential approximation error are well-taken and point to a place where the manuscript can be strengthened. We address each major comment below.
read point-by-point responses
-
Referee: [§4 (Tweedie-Eddington formula and tangential denoiser)] §4 (Tweedie-Eddington formula and tangential denoiser): The claim that the first-order tangential approximation achieves risk(surrogate) = risk(Bayes) + o(rate), where rate is the nonparametric rate induced by cut-locus singularities, requires an explicit quantitative bound separating the remainder (involving second derivatives of the log-density and curvature of the exponential map) from the statistical error. The non-smoothness of the Riemannian Gaussian density at the cut locus may make this remainder O(σ) rather than o(σ) in the low-noise regime, which would be comparable to the claimed convergence rate.
Authors: We agree that an explicit quantitative bound on the remainder would make the argument clearer. In the revision we will add a lemma that expands the difference between the tangential surrogate and the true posterior Fréchet mean to second order, isolating the terms that involve the Hessian of the log-density and the curvature of the exponential map. We will then show that, under the low-noise regime and the compactness of the manifold, these remainder terms are o(σ) uniformly outside a neighborhood of the cut locus whose measure vanishes fast enough that their contribution remains smaller than the nonparametric rate. The non-smoothness at the cut locus is handled by a separate integral estimate that exploits the fact that the Riemannian Gaussian density is bounded and the cut locus has codimension at least one. revision: yes
-
Referee: [§5 (Finite-sample rates for spectral estimator)] §5 (Finite-sample rates for spectral estimator): The stated finite-sample rates for the distance between the surrogate oracle and its spectral approximation must be shown to remain valid after accounting for the tangential approximation error; if the approximation remainder is not o of the rate, the near-Bayes performance and minimax optimality claims on the circle are affected.
Authors: We will revise §5 to include a triangle inequality that explicitly adds the tangential approximation error to the statistical error of the spectral estimator. Because the new bound in §4 establishes that the tangential remainder is o of the nonparametric rate (including on the circle), the overall rate between the data-driven estimator and the Bayes denoiser remains unchanged. The matching lower bound on the circle is unaffected because it applies directly to any estimator and does not rely on the tangential approximation. revision: yes
Circularity Check
No circularity: new Tweedie-Eddington formula and spectral rates derived independently
full rationale
The paper introduces a novel Tweedie-Eddington identity for Riemannian Gaussians, defines the tangential Bayes denoiser via first-order approximation to the posterior Fréchet mean, and derives finite-sample rates plus matching lower bounds on the circle from the Laplace-Beltrami spectral theory and cut-locus singularities. These steps rely on external manifold geometry and operator theory rather than self-citation chains or re-labeling of fitted quantities as predictions. The central claims rest on explicit constructions and minimax arguments that do not reduce to the paper's own inputs by definition.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Both latent variables and measurements lie on a compact Riemannian manifold
- domain assumption The likelihood is a Riemannian Gaussian distribution
invented entities (1)
-
tangential Bayes denoiser
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Helgason, Sigurdur , TITLE =. 2000 , PAGES =. doi:10.1090/surv/083 , URL =
-
[2]
All-sky maps , booktitle=
Tirion, Wil , year=. All-sky maps , booktitle=
-
[3]
Taylor, Michael E. , TITLE =. 1996 , PAGES =. doi:10.1007/978-1-4684-9320-7 , URL =
-
[4]
Park , doi =
Ho Yun and Byeong U. Park , doi =. Bernoulli , keywords =. 2023 , bdsk-url-1 =
2023
-
[5]
Christof Sch. 2025 , bdsk-url-1 =. doi:10.1214/25-EJP1273 , journal =
-
[6]
Christof Sch. 2019 , bdsk-url-1 =. doi:10.1214/19-EJS1618 , journal =
-
[7]
Proceedings of The 35th International Conference on Algorithmic Learning Theory , pages =
Concentration of empirical barycenters in metric spaces , author =. Proceedings of The 35th International Conference on Algorithmic Learning Theory , pages =. 2024 , editor =
2024
-
[8]
Journal of Machine Learning Research , year =
Paul Escande , title =. Journal of Machine Learning Research , year =
-
[9]
Empirical Bayes: Some Tools, Rules, and Duals , publisher=
Koenker, Roger and Gu, Jiaying , year=. Empirical Bayes: Some Tools, Rules, and Duals , publisher=
-
[10]
Information Theory: From Coding to Learning , publisher=
Polyanskiy, Yury and Wu, Yihong , year=. Information Theory: From Coding to Learning , publisher=
-
[11]
Narcowich and P
F. Narcowich and P. Petrushev and J. Ward , doi =. Decomposition of Besov and Triebel--Lizorkin spaces on the sphere , url =. Journal of Functional Analysis , keywords =. 2006 , bdsk-url-1 =
2006
-
[12]
Baldi and G
P. Baldi and G. Kerkyacharian and D. Marinucci and D. Picard , doi =. The Annals of Statistics , keywords =. 2009 , bdsk-url-1 =
2009
-
[13]
and Shen, Xiaoping , TITLE =
Walter, Gilbert G. and Shen, Xiaoping , TITLE =. 2001 , PAGES =
2001
-
[14]
Riemannian geometric statistics in medical image analysis , PAGES =
Sommer, Stefan and Fletcher, Tom and Pennec, Xavier , TITLE =. Riemannian geometric statistics in medical image analysis , PAGES =. 2020 , ISBN =. doi:10.1016/B978-0-12-814725-2.00008-X , URL =
-
[15]
2023 , PAGES =
Boumal, Nicolas , TITLE =. 2023 , PAGES =
2023
-
[16]
Petersen, Peter , TITLE =. 2016 , PAGES =. doi:10.1007/978-3-319-26654-1 , URL =
-
[17]
, author=
Estimation of non-normalized statistical models by score matching. , author=. J. Mach. Learn. Res. , volume=
-
[18]
Neural Comput
A connection between score matching and denoising autoencoders , author=. Neural Comput. , volume=. 2011 , publisher=
2011
-
[19]
Huckemann , doi =
Benjamin Eltzner and Stephan F. Huckemann , doi =. Ann. Statist. , keywords =. 2019 , bdsk-url-1 =
2019
-
[20]
Bernoulli , keywords =
Benjamin Eltzner , doi =. Bernoulli , keywords =. 2022 , bdsk-url-1 =
2022
-
[21]
Hotz, T. and Huckemann, S. , date =. Intrinsic means on the circle: uniqueness, locus and asymptotics , url =. Ann. Inst. Stat. Math. , number =. 2015 , bdsk-url-1 =. doi:10.1007/s10463-013-0444-7 , isbn =
-
[22]
Pennec, Xavier , TITLE =. Ann. Statist. , FJOURNAL =. 2018 , NUMBER =
2018
-
[23]
Evans, Steven N. and Jaffe, Adam Q. , TITLE =. Bernoulli , FJOURNAL =. 2024 , NUMBER =. doi:10.3150/23-bej1603 , URL =
-
[24]
Sch\"otz, Christof , TITLE =. Statistics , FJOURNAL =. 2022 , NUMBER =. doi:10.1080/02331888.2022.2032063 , URL =
-
[25]
and Guntuboyina, Adityanand and Sen, Bodhisattva , TITLE =
Soloff, Jake A. and Guntuboyina, Adityanand and Sen, Bodhisattva , TITLE =. J. R. Stat. Soc. Ser. B. Stat. Methodol. , FJOURNAL =. 2025 , NUMBER =. doi:10.1093/jrsssb/qkae040 , URL =
-
[26]
Jiang, Wenhua and Zhang, Cun-Hui , TITLE =. Ann. Statist. , FJOURNAL =. 2009 , NUMBER =. doi:10.1214/08-AOS638 , URL =
-
[27]
Saha, Sujayam and Guntuboyina, Adityanand , TITLE =. Ann. Statist. , FJOURNAL =. 2020 , NUMBER =. doi:10.1214/19-AOS1817 , URL =
-
[28]
Pennec, Xavier , TITLE =. J. Math. Imaging Vision , FJOURNAL =. 2006 , NUMBER =. doi:10.1007/s10851-006-6228-4 , URL =
-
[29]
Transactions of the
Ziezold, Herbert , TITLE =. Transactions of the. 1977 , ISBN =
1977
-
[30]
Sverdrup-Thygeson, Harald , TITLE =. Ann. Statist. , FJOURNAL =. 1981 , NUMBER =
1981
-
[31]
Afsari, Bijan , TITLE =. Proc. Amer. Math. Soc. , FJOURNAL =. 2011 , NUMBER =. doi:10.1090/S0002-9939-2010-10541-5 , URL =
-
[32]
Bhattacharya, Rabi and Patrangenaru, Vic , TITLE =. Ann. Statist. , FJOURNAL =. 2003 , NUMBER =. doi:10.1214/aos/1046294456 , URL =
-
[33]
Bhattacharya, Rabi and Patrangenaru, Vic , TITLE =. Ann. Statist. , FJOURNAL =. 2005 , NUMBER =. doi:10.1214/009053605000000093 , URL =
-
[34]
Lee, John M. , TITLE =. 2011 , PAGES =. doi:10.1007/978-1-4419-7940-7 , URL =
-
[35]
, TITLE =
Lee, John M. , TITLE =. 2013 , PAGES =
2013
-
[36]
, TITLE =
Lee, John M. , TITLE =. 2018 , PAGES =
2018
-
[37]
Sturm, Karl-Theodor , TITLE =. Heat kernels and analysis on manifolds, graphs, and metric spaces (. 2003 , MRCLASS =. doi:10.1090/conm/338/06080 , URL =
-
[38]
McCormack, Andrew and Hoff, Peter , TITLE =. Ann. Statist. , FJOURNAL =. 2022 , NUMBER =. doi:10.1214/22-aos2245 , URL =
-
[39]
McCormack, A. and Hoff, P. D. , TITLE =. Biometrika , FJOURNAL =. 2023 , NUMBER =. doi:10.1093/biomet/asad014 , URL =
-
[40]
, TITLE =
Miolane, Nina and Guigui, Nicolas and Le Brigant, Alice and et al. , TITLE =. J. Mach. Learn. Res. , FJOURNAL =. 2020 , ISSN =
2020
-
[41]
Eltzner, Benjamin and Huckemann, Stephan F. , TITLE =. Ann. Statist. , FJOURNAL =. 2019 , NUMBER =. doi:10.1214/18-AOS1781 , URL =
-
[42]
Before and below ‘theory of mind’: embodied simulation and the neural correlates of social cognition
Healy, Jr., Dennis M. and Kim, Peter T. , TITLE =. Ann. Statist. , FJOURNAL =. 1996 , NUMBER =. doi:10.1214/aos/1033066208 , URL =
-
[43]
Kim, Peter T. , TITLE =. Ann. Statist. , FJOURNAL =. 1998 , NUMBER =. doi:10.1214/aos/1024691089 , URL =
-
[44]
Pigoli, D. and Aston, J. A. D. and Dryden, I. L. and Secchi, P. , TITLE =. Biometrika , FJOURNAL =. 2014 , NUMBER =. doi:10.1093/biomet/asu008 , URL =
-
[45]
Dryden, I. L. and Koloydenko, A. and Zhou, D. , TITLE =. Ann. Appl. Stat. , FJOURNAL =. 2009 , NUMBER =. doi:10.1214/09-AOAS249 , URL =
-
[46]
Thanwerdas, Yann and Pennec, Xavier , TITLE =. SIAM J. Matrix Anal. Appl. , FJOURNAL =. 2022 , NUMBER =. doi:10.1137/22M1471729 , URL =
-
[47]
Zhang, Cun-Hui , TITLE =. Statist. Sinica , FJOURNAL =. 1997 , NUMBER =
1997
-
[48]
Hendriks, Harrie , TITLE =. Ann. Statist. , FJOURNAL =. 1990 , NUMBER =. doi:10.1214/aos/1176347628 , URL =
-
[49]
Chevallier, Emmanuel and Li, Didong and Lu, Yulong and Dunson, David , TITLE =. SIAM J. Math. Data Sci. , FJOURNAL =. 2022 , NUMBER =. doi:10.1137/21M1461551 , URL =
-
[50]
Kim, Peter T. and Koo, Ja-Yong , TITLE =. J. Fourier Anal. Appl. , FJOURNAL =. 2005 , NUMBER =. doi:10.1007/s00041-005-3041-1 , URL =
-
[51]
Chevallier, Emmanuel and Kalunga, Emmanuel and Angulo, Jes\'us , TITLE =. SIAM J. Imaging Sci. , FJOURNAL =. 2017 , NUMBER =. doi:10.1137/15M1053566 , URL =
-
[52]
Pelletier, Bruno , TITLE =. Statist. Probab. Lett. , FJOURNAL =. 2005 , NUMBER =. doi:10.1016/j.spl.2005.04.004 , URL =
-
[53]
Henry, Guillermo and Rodriguez, Daniela , TITLE =. J. Math. Imaging Vision , FJOURNAL =. 2009 , NUMBER =. doi:10.1007/s10851-009-0145-2 , URL =
-
[54]
Berry, Tyrus and Sauer, Timothy , TITLE =. Comput. Statist. Data Anal. , FJOURNAL =. 2017 , PAGES =. doi:10.1016/j.csda.2016.09.011 , URL =
-
[55]
Huckemann, Stephan F. and Kim, Peter T. and Koo, Ja-Yong and Munk, Axel , TITLE =. Ann. Statist. , FJOURNAL =. 2010 , NUMBER =. doi:10.1214/09-AOS783 , URL =
-
[56]
Efron, Bradley , TITLE =. Statist. Sci. , FJOURNAL =. 2019 , NUMBER =. doi:10.1214/18-STS674 , URL =
-
[57]
Hsu, Elton P. , TITLE =. 2002 , PAGES =. doi:10.1090/gsm/038 , URL =
-
[58]
Springer Science & Business Media, 2008
Villani, C\'edric , TITLE =. 2009 , PAGES =. doi:10.1007/978-3-540-71050-9 , URL =
-
[59]
1984 , PAGES =
Chavel, Isaac , TITLE =. 1984 , PAGES =
1984
-
[60]
Aubin, Thierry , TITLE =. 1998 , PAGES =. doi:10.1007/978-3-662-13006-3 , URL =
-
[61]
Jana, Soham and Polyanskiy, Yury and Wu, Yihong , TITLE =. Inf. Inference , FJOURNAL =. 2025 , NUMBER =. doi:10.1093/imaiai/iaaf027 , URL =
-
[62]
and Greenshtein, Eitan , TITLE =
Brown, Lawrence D. and Greenshtein, Eitan , TITLE =. Ann. Statist. , FJOURNAL =. 2009 , NUMBER =. doi:10.1214/08-AOS630 , URL =
-
[63]
Fan, Jianqing , TITLE =. Ann. Statist. , FJOURNAL =. 1991 , NUMBER =. doi:10.1214/aos/1176348248 , URL =
-
[64]
and Hall, Peter , TITLE =
Carroll, Raymond J. and Hall, Peter , TITLE =. J. Amer. Statist. Assoc. , FJOURNAL =. 1988 , NUMBER =
1988
-
[65]
Zhang, Cun-Hui , TITLE =. Ann. Statist. , FJOURNAL =. 1990 , NUMBER =. doi:10.1214/aos/1176347627 , URL =
-
[66]
Masarotto, V. and Panaretos, V. M. and Zemel, Y. , TITLE =. Sankhya A , FJOURNAL =. 2019 , NUMBER =. doi:10.1007/s13171-018-0130-1 , URL =
-
[67]
Santoro, Leonardo V. and Panaretos, Victor M. , TITLE =. Ann. Appl. Probab. , FJOURNAL =. 2025 , NUMBER =. doi:10.1214/25-aap2192 , URL =
-
[68]
Le, Huiling and Kume, Alfred , TITLE =. Adv. in Appl. Probab. , FJOURNAL =. 2000 , NUMBER =. doi:10.1239/aap/1013540025 , URL =
-
[69]
and Lin, Lizhen and Rosenberg, Steven and Walters, Jackson and Xu, Jie , TITLE =
Kolaczyk, Eric D. and Lin, Lizhen and Rosenberg, Steven and Walters, Jackson and Xu, Jie , TITLE =. Ann. Statist. , FJOURNAL =. 2020 , NUMBER =
2020
-
[70]
Ferguson, Daniel and Meyer, Fran cois G. , TITLE =. Inf. Inference , FJOURNAL =. 2023 , NUMBER =. doi:10.1093/imaiai/iaad002 , URL =
-
[71]
Cazals, Fr\'ed\'eric and Delmas, Bernard and O'Donnell, Timothee , TITLE =. 19th. 2021 , ISBN =
2021
-
[72]
Eichfelder, Gabriele and Hotz, Thomas and Wieditz, Johannes , TITLE =. Optim. Lett. , FJOURNAL =. 2019 , NUMBER =. doi:10.1007/s11590-019-01415-y , URL =
-
[73]
Electron
Shayan Hundrieser and Benjamin Eltzner and Stephan Huckemann , doi =. Electron. J. Stat. , keywords =. 2024 , bdsk-url-1 =
2024
-
[74]
, TITLE =
Chakraborty, Rudrasis and Vemuri, Baba C. , TITLE =. Ann. Statist. , FJOURNAL =. 2019 , NUMBER =
2019
-
[75]
and Holmes, Susan P
Billera, Louis J. and Holmes, Susan P. and Vogtmann, Karen , TITLE =. Adv. in Appl. Math. , FJOURNAL =. 2001 , NUMBER =
2001
-
[76]
Bhattacharya, Rabi and Oliver, Rachel , TITLE =. Sankhya A , FJOURNAL =. 2019 , NUMBER =. doi:10.1007/s13171-018-0127-9 , URL =
-
[77]
Bhattacharya, Abhishek and Dunson, David B. , TITLE =. Biometrika , FJOURNAL =. 2010 , NUMBER =. doi:10.1093/biomet/asq044 , URL =
-
[78]
Krajsek, Kai and Menzel, Marion I. and Scharr, Hanno , TITLE =. Int. J. Comput. Vis. , FJOURNAL =. 2016 , NUMBER =. doi:10.1007/s11263-016-0909-2 , URL =
-
[79]
Tony and Li, Hongzhe , title =
Yang, Chun-Hao and Doss, Hani and Vemuri, Baba C. , TITLE =. J. Amer. Statist. Assoc. , FJOURNAL =. 2024 , NUMBER =. doi:10.1080/01621459.2022.2110877 , URL =
-
[80]
Geneyro, Emiliano and N\'u\ nez-Antonio, Gabriel , TITLE =. J. Appl. Stat. , FJOURNAL =. 2024 , NUMBER =. doi:10.1080/02664763.2022.2156485 , URL =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.