pith. sign in

arxiv: 2407.08094 · v3 · pith:CBIGPAT7new · submitted 2024-07-10 · 📊 stat.ML · cs.LG· physics.chem-ph· physics.data-an

Density Estimation via Binless Multidimensional Integration

Pith reviewed 2026-05-23 22:51 UTC · model grok-4.3

classification 📊 stat.ML cs.LGphysics.chem-phphysics.data-an
keywords density estimationnonparametric methodsthermodynamic integrationmanifold hypothesishigh-dimensional dataneighborhood graphbinless estimation
0
0 comments X

The pith

BMTI estimates the logarithm of the density by integrating log-density differences between neighboring points using maximum likelihood.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents BMTI as a nonparametric density estimation technique that works by calculating log-density differences between nearby data points and then integrating those differences in a maximum-likelihood way weighted by uncertainties. This is positioned as a multidimensional version of thermodynamic integration from physics. A reader might care because it promises to handle high-dimensional data better than traditional methods that rely on bins or partitions, which suffer in high dimensions. The approach relies on building a neighborhood graph with adaptive bandwidths instead of binning or explicit coordinate maps.

Core claim

BMTI estimates the logarithm of the density by initially computing log-density differences between neighbouring data points. Subsequently, such differences are integrated, weighted by their associated uncertainties, using a maximum-likelihood formulation. This procedure can be seen as an extension to a multidimensional setting of the thermodynamic integration, a technique developed in statistical physics. The method leverages the manifold hypothesis, estimating quantities within the intrinsic data manifold without defining an explicit coordinate map. It does not rely on any binning or space partitioning, but rather on the construction of a neighbourhood graph based on an adaptive bandwidth.

What carries the argument

Binless multidimensional thermodynamic integration on an adaptive bandwidth neighborhood graph, which integrates local log-density differences weighted by uncertainties via maximum likelihood.

If this is right

  • Reconstructs smooth density profiles even in high-dimensional embedding spaces.
  • Outperforms traditional estimators on complex synthetic high-dimensional datasets.
  • Applies successfully to realistic datasets from chemical physics without binning.
  • Mitigates limitations of nonparametric density estimators that rely on space partitioning.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The integration step might allow density estimation with smaller sample sizes than histogram methods in high dimensions.
  • The approach could extend to other manifold-based inference tasks where explicit coordinates are unavailable.
  • Hybrid use with physics simulation techniques might improve robustness in molecular modeling applications.

Load-bearing premise

The adaptive bandwidth neighborhood graph accurately captures local density differences without introducing systematic bias.

What would settle it

Running BMTI on a synthetic dataset with a known ground-truth density and observing that the recovered log-density deviates from the true values by more than the reported uncertainties.

Figures

Figures reproduced from arXiv: 2407.08094 by Aldo Glielmo, Alessandro Laio, Alex Rodriguez, Matteo Carli.

Figure 1
Figure 1. Figure 1: The BMTI method Panels A to D illustrate of the 4 steps, described in Sec. 3.2, needed to construct the BMTI log-likelihood: estimating the intrinsic dimension d, adaptive neighbourhoods selection and the neighbourhood graph, NLD gradients ˆgi , and finally NLD differences ˆδF estimation. Panel E illustrates the reconstruction of the NLD starting from measurements of NLD differences as described in Sec. 3.… view at source ↗
Figure 2
Figure 2. Figure 2: Accuracy in the estimation of δFˆ and its error. Density scatter plots of true vs estimated δF’s for 6 test datasets. The insets show the distribution of the standardised variables ( ˆ δFij − δFij )/εij in blue, and a standard normal PDF in red; the agreement between the two demonstrate the accuracy of error estimates. Notice that within our framework other radially￾symmetric kernels can be employed [95, 6… view at source ↗
Figure 3
Figure 3. Figure 3: BMTI performance on various datasets. Top: scatter plots of estimated vs GT negative log￾densities for BMTI and GKDE on 4 datasets of increasing intrinsic dimensionality. Bottom: Running averages of the absolute error of Fˆ as a function of the GT value of F for BMTI and other baseline methods; the insets show zoomed-out versions when the error is too large to be visualised in a single graph. have only a s… view at source ↗
Figure 4
Figure 4. Figure 4: A: BMTI smoothness and accuracy Fˆ along the minimum energy path connecting the two main minima of a 2d Mueller-Brown potential for various methods. The inset depicts the dataset used in the analysis and, as a red curve, the minimum energy path. B: BMTI data-efficiency Mean absolute error of various nonparametric methods as a function of the number of training points for the 6-dimensional dataset. Points i… view at source ↗
Figure 5
Figure 5. Figure 5: Time scaling: single CPU training times measured in seconds as a function of sample size for the 6-dimensional dataset in the case of uncorrelated δF’s illustrated in Sec. C.2.2 of the SM. σˆy by looking at the distribution of the standardised scores (ˆy−y)/σˆy, also called the pull distribution [105], which is expected to be a standard Gaussian N (0, 1). 4.1 Performance assessment and discussion The perfo… view at source ↗
Figure 6
Figure 6. Figure 6: Performance of various NLD estima￾tors on a dataset with disconnected NG. The dataset considered is obtained from the Mueller-Brown potential presented Sec. D.2.1 and tested in Fig. 4A, but with a scaling factor double as the one used to obtain that sample. Again, 5.000 points are sam￾pled from the corresponding distribution. Row A: scatter plots of estimated vs GT NLDs for the PAk (A1), BMTI (A2) and PAk-… view at source ↗
Figure 7
Figure 7. Figure 7: NLD gradient components estimator performance tested on various bivariate Gaussian datasets. All four datasets considered, one for each column, have a bivariate normal PDF centred at the origin of the Cartesian plane (see Sec. D.1.1 of the SM) sampled 10.000 times. The entries of each dataset’s covariance matrix are indicated in the column header. Top row: correlation plots of estimated x gradient componen… view at source ↗
Figure 9
Figure 9. Figure 9: In the top two rows we can see the correlation plots of the two estimated gradient components along [PITH_FULL_IMAGE:figures/full_fig_p023_9.png] view at source ↗
Figure 8
Figure 8. Figure 8: NLD gradient estimator performance tested on various datasets. The four datasets, one for each column, are indicated in the column header; they are all described in Sec. D of the SM; their dimensionality goes from 2 to 9. For all of them, the analytic expression of the NLD gradient is known. In the fourth and last column, the nine-dimensional case, 80.000 sample points are considered; for all other dataset… view at source ↗
Figure 9
Figure 9. Figure 9: Bivariate potentials from four Gaussian distributions used as test datasets. Each column represents a different dataset. All Gaussians are centred at the origin. The parameters of each dataset’s covariance matrix are indicated in the header of each column. Top: contour plots of the potential surfaces. Bottom: four samples of 10.000 points from the above potentials. D.1.1 2-dimensional Gaussian distibutions… view at source ↗
Figure 10
Figure 10. Figure 10: Bivariate potential U2d used to define the first two directions in the 6-dimensional potential. A: contour plots of the potential surface. B: 10.000 points sampled from the above potentials. D.2 Synthetic distributions with realistic features D.2.1 2-dimensional Mueller-Brown potential The dataset is a sample of 5.000 points sampled from a PDF whose negative logarithm is proportional to the classical biva… view at source ↗
Figure 4
Figure 4. Figure 4: D.2.2 2-dimensional multimodal potential on a glassy background This is a synthetic potential which was designed in order to challenge density estimators despite being defined in a low-dimensional space (D = 2). The dataset contains 10.000 points sampled from the corresponding PDF [PITH_FULL_IMAGE:figures/full_fig_p033_4.png] view at source ↗
Figure 11
Figure 11. Figure 11: Illustration of the Mueller-Brown potential used as test system. A Contour plot of the Mueller-Brown potential in Eq. (S.67). For the reader’s convenience, the minimum of the potential has been shifted to 0. Also, for better readability, the colour map has been cut to 230, otherwise it would be saturated by the diverging behaviour in the top right corner. The black dashed curve represents the MEP connecti… view at source ↗
Figure 12
Figure 12. Figure 12: 2-dimensional multimodal potential on a glassy background.. A Contour plot of the negative logarithm of the PDF defined in Sec. D.2.2 of the SM. B Scatter plots of 5.000 points sampled from the potential. run a Replica Exchange molecular dynamics [118] simulation with 16 replicas using equally spaced temperatures from 340K to 470K as done previously in reference [119]. We choose as feature space the 9-dim… view at source ↗
read the original abstract

We introduce the Binless Multidimensional Thermodynamic Integration (BMTI) method for nonparametric, robust, and data-efficient density estimation. BMTI estimates the logarithm of the density by initially computing log-density differences between neighbouring data points. Subsequently, such differences are integrated, weighted by their associated uncertainties, using a maximum-likelihood formulation. This procedure can be seen as an extension to a multidimensional setting of the thermodynamic integration, a technique developed in statistical physics. The method leverages the manifold hypothesis, estimating quantities within the intrinsic data manifold without defining an explicit coordinate map. It does not rely on any binning or space partitioning, but rather on the construction of a neighbourhood graph based on an adaptive bandwidth selection procedure. BMTI mitigates the limitations commonly associated with traditional nonparametric density estimators, effectively reconstructing smooth profiles even in high-dimensional embedding spaces. The method is tested on a variety of complex synthetic high-dimensional datasets, where it is shown to outperform traditional estimators, and is benchmarked on realistic datasets from the chemical physics literature.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Binless Multidimensional Thermodynamic Integration (BMTI) for nonparametric density estimation. It computes log-density differences between neighboring points on an adaptive-bandwidth neighborhood graph, then integrates these differences via a maximum-likelihood formulation weighted by their uncertainties. The approach is presented as a multidimensional extension of thermodynamic integration that operates without binning or explicit coordinate maps, leveraging the manifold hypothesis to estimate quantities on the intrinsic data manifold. Experiments on synthetic high-dimensional datasets and chemical-physics benchmarks are reported to show outperformance over traditional estimators.

Significance. If the central integration step recovers unbiased log-densities, the binless graph-based formulation would offer a useful alternative to histogram or kernel methods in high-dimensional settings where binning becomes impractical. The explicit connection to thermodynamic integration and the use of uncertainty-weighted maximum likelihood are strengths that could support reproducible implementations. The reported tests on both synthetic manifolds and realistic chemical-physics data provide a concrete basis for assessing practical utility.

major comments (2)
  1. [Method description (adaptive bandwidth paragraph)] The adaptive-bandwidth neighborhood-graph construction (described in the paragraph beginning 'It does not rely on any binning...') supplies the input log-density differences that are subsequently integrated. No derivation or numerical check is supplied showing that these differences remain unbiased when local point spacing (which determines the bandwidth) correlates with the density gradient itself; because the maximum-likelihood step solves a weighted least-squares problem on the supplied deltas, any systematic bias introduced at the graph stage propagates directly into the estimated log-density.
  2. [Section 4 and benchmark results] The claim that BMTI 'mitigates the limitations commonly associated with traditional nonparametric density estimators' and 'effectively reconstructing smooth profiles even in high-dimensional embedding spaces' rests on the integration step being able to correct for local errors. Section 4 (synthetic datasets) and the chemical-physics benchmarks report outperformance, but without an ablation that isolates the graph-construction step or a consistency proof under controlled curvature/density-gradient conditions, it is unclear whether the reported gains survive when the weakest assumption is violated.
minor comments (2)
  1. [Method] Notation for the uncertainty weights in the maximum-likelihood objective is introduced without an explicit equation number; adding a numbered display equation would improve traceability from the graph step to the integration step.
  2. [Figures in Section 4] Figure captions for the synthetic-dataset results do not state the embedding dimension or the number of points used; these details are needed to assess whether the reported advantage scales with the regime where binning fails.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their careful reading and constructive comments. We address each major comment below and indicate the revisions planned for the manuscript.

read point-by-point responses
  1. Referee: [Method description (adaptive bandwidth paragraph)] The adaptive-bandwidth neighborhood-graph construction (described in the paragraph beginning 'It does not rely on any binning...') supplies the input log-density differences that are subsequently integrated. No derivation or numerical check is supplied showing that these differences remain unbiased when local point spacing (which determines the bandwidth) correlates with the density gradient itself; because the maximum-likelihood step solves a weighted least-squares problem on the supplied deltas, any systematic bias introduced at the graph stage propagates directly into the estimated log-density.

    Authors: We agree that the manuscript currently lacks an explicit derivation or numerical validation demonstrating that the neighbor log-density differences remain unbiased when local spacing correlates with the density gradient. Because the integration step is a weighted least-squares procedure, any such bias would propagate. In the revision we will add both a theoretical analysis of bias in the adaptive-bandwidth difference estimator and controlled numerical experiments on synthetic manifolds where spacing and gradient are deliberately correlated. revision: yes

  2. Referee: [Section 4 and benchmark results] The claim that BMTI 'mitigates the limitations commonly associated with traditional nonparametric density estimators' and 'effectively reconstructing smooth profiles even in high-dimensional embedding spaces' rests on the integration step being able to correct for local errors. Section 4 (synthetic datasets) and the chemical-physics benchmarks report outperformance, but without an ablation that isolates the graph-construction step or a consistency proof under controlled curvature/density-gradient conditions, it is unclear whether the reported gains survive when the weakest assumption is violated.

    Authors: We acknowledge that the present experiments do not include an ablation isolating the graph-construction stage nor a formal consistency analysis under controlled curvature and gradient conditions. While the reported benchmarks already span a range of synthetic manifolds and chemical-physics data, we agree that stronger evidence is needed. The revised manuscript will therefore contain an ablation study separating graph construction from integration and additional consistency checks on synthetic data with systematically varied curvature and density gradients. revision: yes

Circularity Check

0 steps flagged

No circularity: neighbor differences computed independently before ML integration

full rationale

The central chain computes log-density differences on the neighborhood graph from raw point spacings, then feeds those as fixed inputs into a standard weighted maximum-likelihood integration. No equation defines a quantity in terms of its own output, no fitted parameter is relabeled as a prediction, and the thermodynamic-integration reference is to external statistical-physics literature rather than a self-citation chain. The adaptive-bandwidth step is a preprocessing choice whose bias risk is a correctness issue, not a definitional loop. The derivation therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based on abstract only; the central claim rests on the manifold hypothesis and the validity of the adaptive neighborhood graph for capturing local log-density differences. No free parameters or invented entities are explicitly named.

axioms (1)
  • domain assumption Manifold hypothesis: data lie on a lower-dimensional intrinsic manifold that can be estimated via neighborhood graph without explicit coordinate map
    Invoked in the abstract as the basis for operating directly on the data manifold.

pith-pipeline@v0.9.0 · 5715 in / 1239 out tokens · 17869 ms · 2026-05-23T22:51:44.141823+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

118 extracted references · 118 canonical work pages · 3 internal anchors

  1. [1]

    Density estimation for statistics and data analysis , volume 26

    Bernard W Silverman. Density estimation for statistics and data analysis , volume 26. CRC press, 1986

  2. [2]

    David W. Scott. Multivariate Density Estima- tion: Theory, Practice, and Visualization, Sec- ond Edition. John Wiley & Sons, Inc., Hoboken, New Jersey, USA, 2 edition, 2015

  3. [3]

    Husic, Alex Rodriguez, Cecilia Clementi, Frank No´ e, and Alessan- dro Laio

    Aldo Glielmo, Brooke E. Husic, Alex Rodriguez, Cecilia Clementi, Frank No´ e, and Alessan- dro Laio. Unsupervised Learning Methods for Molecular Simulation Data. Chem. Rev. , 121(16):9722–9758, 2021

  4. [4]

    The Elements of Statistical Learn- ing Data Mining, Inference, and Prediction

    Trevor Hastie, Robert Tibshirani, and Jerome Friedman. The Elements of Statistical Learn- ing Data Mining, Inference, and Prediction . Springer, New York, 2 edition, 2009

  5. [5]

    Maximum likelihood from incom- plete data via the em algorithm

    Arthur P Dempster, Nan M Laird, and Don- ald B Rubin. Maximum likelihood from incom- plete data via the em algorithm. J. R. Stat. Soc. B, 39(1):1–22, 1977

  6. [6]

    Representation Learning: A Review and New Perspectives

    Yoshua Bengio, Aaron Courville, and Pascal Vincent. Representation Learning: A Review and New Perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence , 35(8):1798 – 1828, 2013. Density estimation via binless multidimensional integration

  7. [7]

    Deep learning

    Yann Lecun, Yoshua Bengio, and Geoffrey Hin- ton. Deep learning. Nature, 521(7553):436–444, 2015

  8. [8]

    Deep Learning in neural networks: An overview

    J¨ urgen Schmidhuber. Deep Learning in neural networks: An overview. Neural Networks, 61:85– 117, 2015

  9. [9]

    Christopher M. Bishop. Pattern Recognition and Machine Learning. Springer, New York, 1 edi- tion, 2006

  10. [10]

    Recent Developments in Nonparametric Density Estimation

    Alan Julian Izenman. Recent Developments in Nonparametric Density Estimation. J. Am. Stat. Assoc., 86(413):205, 1991

  11. [11]

    Modern multivariate statistical techniques, volume 1

    Alan J Izenman. Modern multivariate statistical techniques, volume 1. Springer, 2008

  12. [12]

    On the Estimation of Probability Den- sity Functions and Mode

    E Parzen. On the Estimation of Probability Den- sity Functions and Mode. Ann. Math. Statist , 33:1065–1076, 1962

  13. [13]

    Discriminatory Analy- sis

    E Fix and JL Hodges. Discriminatory Analy- sis. Nonparametric Discrimination: Consistency Properties. USAF School of Aviation Medicine, Randolph Field, Texas , Report 4(Project Num- ber 21-49-004), 1951

  14. [14]

    E. S. Page and Richard Bellman. Adaptive Con- trol Processes: A Guided Tour. Princeton Uni- versity Press, Princeton, NJ, 1961

  15. [15]

    Friedman

    Jerome H. Friedman. On bias, variance, 0/1- loss, and the curse-of-dimensionality. Data Min. Knowl. Discov., 1(1):55–77, 1997

  16. [16]

    Charles J. Stone. An Asymptotically Optimal Window Selection Rule for Kernel Density Es- timates. The Annals of Statistics , 12(4):1285 – 1297, 1984

  17. [17]

    Bandwidth selection in ker- nel density estimation: A review

    Berwin A Turlach. Bandwidth selection in ker- nel density estimation: A review. In CORE and Institut de Statistique , 1993

  18. [18]

    Bandwidth selection for kernel density estimation: a review of fully automatic selectors

    Nils-Bastian Heidenreich, Anja Schindler, and Stefan Sperlich. Bandwidth selection for kernel density estimation: a review of fully automatic selectors. AStA Adv. Stat. Anal., 97(4):403–433, 2013

  19. [19]

    Hostetler

    Keinosuke Fukunaga and Larry D. Hostetler. The Estimation of the Gradient of a Density Function, with Applications in Pattern Recog- nition. IEEE Trans. Inf. Theory , 21(1):32–40, 1975

  20. [20]

    Dadapy: Distance-based analysis of data-manifolds in python

    Aldo Glielmo, Iuri Macocco, Diego Doimo, Matteo Carli, Claudio Zeni, Romina Wild, Maria d’Errico, Alex Rodriguez, and Alessan- dro Laio. Dadapy: Distance-based analysis of data-manifolds in python. Patterns, 3, 2022

  21. [22]

    Korn, B.-U

    F. Korn, B.-U. Pagel, and C. Faloutsos. On the ”dimensionality curse” and the ”self-similarity blessing”. IEEE Transactions on Knowledge and Data Engineering, 13(1):96–111, 2001

  22. [23]

    Cover trees for nearest neighbor

    Alina Beygelzimer, Sham Kakade, and John Langford. Cover trees for nearest neighbor. In Proceedings of the 23rd International Conference on Machine Learning , ICML ’06, page 97–104, New York, NY, USA, 2006. Association for Com- puting Machinery

  23. [24]

    Intrinsic dimension estimation for discrete metrics

    Iuri Macocco, Aldo Glielmo, Jacopo Grilli, and Alessandro Laio. Intrinsic dimension estimation for discrete metrics. Physical Review Letters , 130(6):067401, 2023

  24. [25]

    The intrinsic dimension of protein sequence evolution

    Elena Facco, Andrea Pagnani, Elena Tea Russo, and Alessandro Laio. The intrinsic dimension of protein sequence evolution. PLoS computational biology, 15(4):e1006767, 2019

  25. [26]

    The intrinsic manifolds of radiological images and their role in deep learning

    Nicholas Konz, Hanxue Gu, Haoyu Dong, and Maciej A Mazurowski. The intrinsic manifolds of radiological images and their role in deep learning. In International Conference on Medical Image Computing and Computer-Assisted Inter- vention, pages 684–694. Springer, 2022

  26. [27]

    On the intrinsic dimensionality of covid-19 data: a global perspective

    Abhishek Varghese, Edgar Santos-Fernandez, Francesco Denti, Antonietta Mira, and Ker- rie Mengersen. On the intrinsic dimensionality of covid-19 data: a global perspective. arXiv preprint arXiv:2203.04165, 2022

  27. [28]

    Intrinsic dimen- sion of data representations in deep neural net- works

    Alessio Ansuini, Alessandro Laio, Jakob H Macke, and Davide Zoccolan. Intrinsic dimen- sion of data representations in deep neural net- works. Advances in Neural Information Process- ing Systems, 32, 2019

  28. [29]

    The in- trinsic dimension of images and its impact on learning

    Phillip Pope, Chen Zhu, Ahmed Abdelkader, Micah Goldblum, and Tom Goldstein. The in- trinsic dimension of images and its impact on learning. arXiv preprint arXiv:2104.08894, 2021

  29. [30]

    In- trinsic dimension estimation: Advances and open problems

    Francesco Camastra and Antonino Staiano. In- trinsic dimension estimation: Advances and open problems. Inf. Sci., 328:26–41, 2016. Matteo Carli ∗, Alex Rodriguez, Alessandro Laio ∗, Aldo Glielmo ∗

  30. [31]

    Estimating the intrinsic di- mension of datasets by a minimal neighborhood information

    Elena Facco, Maria D’Errico, Alex Rodriguez, and Alessandro Laio. Estimating the intrinsic di- mension of datasets by a minimal neighborhood information. Sci. Rep., 7(1):1–11, 2017

  31. [32]

    Intrinsic dimension estimation for locally undersampled data

    Vittorio Erba, Marco Gherardi, and Pietro Ro- tondo. Intrinsic dimension estimation for locally undersampled data. Sci. Rep., 9(1):1–9, 2019

  32. [33]

    The generalized ratios in- trinsic dimension estimator

    Francesco Denti, Diego Doimo, Alessandro Laio, and Antonietta Mira. The generalized ratios in- trinsic dimension estimator. Scientific Reports, 12(1):20005, 2022

  33. [34]

    Scikit-dimension: a python package for intrin- sic dimension estimation

    Jonathan Bac, Evgeny M Mirkes, Alexander N Gorban, Ivan Tyukin, and Andrei Zinovyev. Scikit-dimension: a python package for intrin- sic dimension estimation. Entropy, 23(10):1368, 2021

  34. [35]

    Submani- fold density estimation

    Arkadas Ozakin and Alexander Gray. Submani- fold density estimation. In Y. Bengio, D. Schuur- mans, J. Lafferty, C. Williams, and A. Culotta, editors, Advances in Neural Information Pro- cessing Systems, volume 22. Curran Associates, Inc., 2009

  35. [36]

    Computing the Free En- ergy without Collective Variables

    Alex Rodriguez, Maria D’Errico, Elena Facco, and Alessandro Laio. Computing the Free En- ergy without Collective Variables. J. Chem. Theory Comput., 14(3):1206–1215, 2018

  36. [37]

    Density estimation using deep genera- tive neural networks

    Qiao Liu, Jiaze Xu, Rui Jiang, and Wing Hung Wong. Density estimation using deep genera- tive neural networks. Proceedings of the National Academy of Sciences , 118(15):e2101344118, 2021

  37. [38]

    Den- sity estimation on low-dimensional manifolds: an inflation-deflation approach

    Christian Horvat and Jean-Pascal Pfister. Den- sity estimation on low-dimensional manifolds: an inflation-deflation approach. J. Mach. Learn. Res., 24:61–1, 2023

  38. [39]

    Normalizing flows: An introduction and review of current methods

    Ivan Kobyzev, Simon JD Prince, and Marcus A Brubaker. Normalizing flows: An introduction and review of current methods. IEEE trans- actions on pattern analysis and machine intel- ligence, 43(11):3964–3979, 2020

  39. [40]

    Introduction to Statisti- cal Pattern Recognition

    Keinosuke Fukunaga. Introduction to Statisti- cal Pattern Recognition . Academic Press, San Diego, CA, United States, 1990

  40. [41]

    Park and J

    Byeong U. Park and J. S. Marron. Comparison of data-driven bandwidth selectors. Journal of the American Statistical Association, 85(409):66–72, 1990

  41. [42]

    Contributions to the mathemat- ical theory of evolution

    Karl Pearson. Contributions to the mathemat- ical theory of evolution. Philosophical Transac- tions of the Royal Society of London. A , 185:71– 110, 1894

  42. [43]

    Silverman and M

    Bernard W. Silverman and M. Chris Jones. E. fix and j.l. hodges (1951): An important con- tribution to nonparametric discriminant analysis and density estimation: Commentary on fix and hodges (1951). International Statistical Review , 57:233, 1989

  43. [44]

    Remarks on Some Nonpara- metric Estimates of a Density Function.The An- nals of Mathematical Statistics , 27(3):832 – 837, 1956

    Murray Rosenblatt. Remarks on Some Nonpara- metric Estimates of a Density Function.The An- nals of Mathematical Statistics , 27(3):832 – 837, 1956

  44. [45]

    Estimation of a Multivari- ate Density

    Theophilos Cacoullos. Estimation of a Multivari- ate Density. In Tech. report; No. 40. University of Minnesota, Department of Statistics, 1964

  45. [46]

    Variable kernel estimates of multivariate densities

    Leo Breiman, William Meisel, and Edward Pur- cell. Variable kernel estimates of multivariate densities. Technometrics, 19(2):135–144, 1977

  46. [47]

    A brief survey of bandwidth selection for density estimation

    M Chris Jones, James S Marron, and Simon J Sheather. A brief survey of bandwidth selection for density estimation. Journal of the American statistical association, 91(433):401–407, 1996

  47. [48]

    Analysis of knn density estimation

    Puning Zhao and Lifeng Lai. Analysis of knn density estimation. IEEE Transactions on In- formation Theory, 68(12):7971–7995, 2022

  48. [49]

    On bandwidth variation in ker- nel estimates-a square root law

    Ian S Abramson. On bandwidth variation in ker- nel estimates-a square root law. The annals of Statistics, pages 1217–1223, 1982

  49. [50]

    Arbitrariness of the pilot es- timator in adaptive kernel methods

    Ian S Abramson. Arbitrariness of the pilot es- timator in adaptive kernel methods. Journal of Multivariate analysis, 12(4):562–567, 1982

  50. [51]

    A parametrically-defined nearest neighbor distance measure

    Keinosuke Fukunaga and Thomas E Flick. A parametrically-defined nearest neighbor distance measure. Pattern Recognition Letters, 1(1):3–5, 1982

  51. [52]

    Peter Hall and J. S. Marron. Choice of Ker- nel Order in Density Estimation. The Annals of Statistics, 16(1):161 – 173, 1988

  52. [53]

    The multi-class metric problem in nearest neighbour discrimi- nation rules

    JP Myles and David J Hand. The multi-class metric problem in nearest neighbour discrimi- nation rules. Pattern Recognition, 23(11):1291– 1297, 1990

  53. [54]

    How many trees in a forest

    JK Ord. How many trees in a forest. Mathemat- ical Scientist, 3:23–33, 1978. Density estimation via binless multidimensional integration

  54. [55]

    Pokorny, and Danica Kragic

    Vladislav Polianskii, Giovanni Luca Marchetti, Alexander Kravberg, Anastasiia Varava, Flo- rian T. Pokorny, and Danica Kragic. Voronoi density estimator for high-dimensional data: Computation, compactification and conver- gence. In James Cussens and Kun Zhang, ed- itors, Proceedings of the Thirty-Eighth Confer- ence on Uncertainty in Artificial Intelligen...

  55. [56]

    An efficient and continuous voronoi density estimator

    Giovanni Luca Marchetti, Vladislav Polianskii, Anastasiia Varava, Florian T Pokorny, and Dan- ica Kragic. An efficient and continuous voronoi density estimator. In International Conference on Artificial Intelligence and Statistics , pages 4732–4744. PMLR, 2023

  56. [57]

    Auto-Encoding Variational Bayes

    Diederik P Kingma and Max Welling. Auto- encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013

  57. [58]

    Generative adversarial nets

    Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. Advances in neu- ral information processing systems, 27, 2014

  58. [59]

    Density estimation using Real NVP

    Laurent Dinh, Jascha Sohl-Dickstein, and Samy Bengio. Density estimation using real nvp. arXiv preprint arXiv:1605.08803, 2016

  59. [60]

    Masked autoregressive flow for density estimation

    George Papamakarios, Theo Pavlakou, and Iain Murray. Masked autoregressive flow for density estimation. Advances in neural information pro- cessing systems, 30, 2017

  60. [61]

    Density estimation using deep genera- tive neural networks

    Qiao Liu, Jiaze Xu, Rui Jiang, and Wing Hong Wong. Density estimation using deep genera- tive neural networks. Proceedings of the Na- tional Academy of Sciences of the United States of America, 118, 2021

  61. [62]

    Estimation of non-normalized statistical models by score matching

    Aapo Hyv¨ arinen. Estimation of non-normalized statistical models by score matching. Journal of Machine Learning Research , 6(24):695–709, 2005

  62. [63]

    A unified energy- based framework for unsupervised learning

    Marc’Aurelio Ranzato, Y-Lan Boureau, Sumit Chopra, and Yann LeCun. A unified energy- based framework for unsupervised learning. In Marina Meila and Xiaotong Shen, editors, Pro- ceedings of the Eleventh International Confer- ence on Artificial Intelligence and Statistics, vol- ume 2 of Proceedings of Machine Learning Re- search, pages 371–379, San Juan, Pu...

  63. [64]

    Strategies for the exploration of free energy landscapes: Unity in diversity and challenges ahead

    Fabio Pietrucci. Strategies for the exploration of free energy landscapes: Unity in diversity and challenges ahead. Reviews in Physics , 2:32–45, 2017

  64. [65]

    A density-based algorithm for discovering clusters in large spatial databases with noise

    Martin Ester, Hans-Peter Kriegel, J¨ org Sander, Xiaowei Xu, et al. A density-based algorithm for discovering clusters in large spatial databases with noise. In kdd, volume 96, pages 226–231, 1996

  65. [66]

    Mean shift analysis and applications

    Dorin Comaniciu and Peter Meer. Mean shift analysis and applications. In Proceedings of the seventh IEEE international conference on com- puter vision, volume 2, pages 1197–1203. IEEE, 1999

  66. [67]

    Mean-shift anal- ysis using quasinewton methods

    Changjiang Yang, Ramani Duraiswami, Daniel DeMenthon, and Larry Davis. Mean-shift anal- ysis using quasinewton methods. In Proceedings 2003 International Conference on Image Pro- cessing (Cat. No. 03CH37429) , volume 2, pages II–447. IEEE, 2003

  67. [68]

    A tutorial on energy-based learning

    Yann LeCun, Sumit Chopra, Raia Hadsell, Au- relio Ranzato, and Fu Jie Huang. A tutorial on energy-based learning. 2006

  68. [69]

    A complete gradient clustering algorithm formed with kernel estimators

    Piotr Kulczycki and Ma lgorzata Charytanowicz. A complete gradient clustering algorithm formed with kernel estimators. International Journal of Applied Mathematics and Computer Science , 20(1):123–134, 2010

  69. [70]

    Clustering by fast search and find of density peaks

    Alex Rodriguez and Alessandro Laio. Clustering by fast search and find of density peaks. Science, 344(6191):1492–1496, 2014

  70. [71]

    Automatic topography of high-dimensional data sets by non-parametric density peak clustering

    Maria d’Errico, Elena Facco, Alessandro Laio, and Alex Rodriguez. Automatic topography of high-dimensional data sets by non-parametric density peak clustering. Information Sciences , 560:476–492, 2021

  71. [72]

    Statistical mechanics: the- ory and molecular simulation

    Mark Tuckerman. Statistical mechanics: the- ory and molecular simulation. Oxford university press, 2010

  72. [73]

    Escap- ing free-energy minima

    Alessandro Laio and Michele Parrinello. Escap- ing free-energy minima. Proc. Natl. Acad. Sci. , 99(20):12562–12566, 2002

  73. [74]

    Metastabil- ity, conformation dynamics, and transition path- ways in complex systems

    E Weinan and Eric Vanden-Eijnden. Metastabil- ity, conformation dynamics, and transition path- ways in complex systems. In Multiscale mod- elling and simulation , pages 35–68. Springer, 2004. Matteo Carli ∗, Alex Rodriguez, Alessandro Laio ∗, Aldo Glielmo ∗

  74. [75]

    Free Energy Calculations Theory and Applications in Chemistry and Biology

    Christophe Chipot and Andrew Pohorille. Free Energy Calculations Theory and Applications in Chemistry and Biology. Springer-Verlag, Berlin, Heidelberg, 1 edition, 2007

  75. [76]

    Deep Energy Estimator Networks

    Saeed Saremi, Arash Mehrjou, Bernhard Scholkopf, and Aapo Hyv¨ arinen. Deep energy es- timator networks. ArXiv, abs/1805.08306, 2018

  76. [77]

    Energy-based models for sparse overcomplete representations

    Yee Whye Teh, Max Welling, Simon Osindero, and Geoffrey E Hinton. Energy-based models for sparse overcomplete representations. Journal of Machine Learning Research, 4(Dec):1235–1260, 2003

  77. [78]

    Learning methods for generic object recognition with invariance to pose and lighting

    Yann LeCun, Fu Jie Huang, and Leon Bottou. Learning methods for generic object recognition with invariance to pose and lighting. In Proceed- ings of the 2004 IEEE Computer Society Confer- ence on Computer Vision and Pattern Recogni- tion, 2004. CVPR 2004., volume 2, pages II–104. IEEE, 2004

  78. [79]

    Kirkwood

    John G. Kirkwood. Statistical mechanics of fluid mixtures. J. Chem. Phys. , 3(5):300–313, 1935

  79. [80]

    Energetics of ion transport in a peptide nanotube

    Fran¸ cois Dehez, Mounir Tarek, and Christophe Chipot. Energetics of ion transport in a peptide nanotube. The Journal of Physical Chemistry B, 111(36):10633–10635, 2007. PMID: 17705530

  80. [81]

    Blue Moon sampling, vec- torial reaction coordinates, and unbiased con- strained dynamics

    Giovanni Ciccotti, Raymond Kapral, and Eric Vanden-Eijnden. Blue Moon sampling, vec- torial reaction coordinates, and unbiased con- strained dynamics. ChemPhysChem, 6(9):1809– 1814, 2005

Showing first 80 references.