The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo
read the original abstract
Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo (MCMC) algorithm that avoids the random walk behavior and sensitivity to correlated parameters that plague many MCMC methods by taking a series of steps informed by first-order gradient information. These features allow it to converge to high-dimensional target distributions much more quickly than simpler methods such as random walk Metropolis or Gibbs sampling. However, HMC's performance is highly sensitive to two user-specified parameters: a step size {\epsilon} and a desired number of steps L. In particular, if L is too small then the algorithm exhibits undesirable random walk behavior, while if L is too large the algorithm wastes computation. We introduce the No-U-Turn Sampler (NUTS), an extension to HMC that eliminates the need to set a number of steps L. NUTS uses a recursive algorithm to build a set of likely candidate points that spans a wide swath of the target distribution, stopping automatically when it starts to double back and retrace its steps. Empirically, NUTS perform at least as efficiently as and sometimes more efficiently than a well tuned standard HMC method, without requiring user intervention or costly tuning runs. We also derive a method for adapting the step size parameter {\epsilon} on the fly based on primal-dual averaging. NUTS can thus be used with no hand-tuning at all. NUTS is also suitable for applications such as BUGS-style automatic inference engines that require efficient "turnkey" sampling algorithms.
This paper has not been read by Pith yet.
Forward citations
Cited by 21 Pith papers
-
Bayesian Doppler Imaging: Simultaneous Inference of Surface Maps and Geometric Parameters
A fully Bayesian pixel-based Doppler imaging framework uses Gaussian Process priors and Hamiltonian Monte Carlo to simultaneously infer surface maps and geometric parameters from spectral data.
-
High-dimensional inference for the $\gamma$-ray sky with differentiable programming
A differentiable forward model and likelihood enable probabilistic inference over many spatial morphologies for the Galactic Center gamma-ray Excess using variational methods on GPUs.
-
People readily follow personal advice from AI but it does not improve their well-being
Large longitudinal RCT finds high rates of following AI personal advice but no sustained well-being gains versus a hobbies control condition.
-
Differentiable Fuzzy Cosmic-Web for Field Level Inference
Introduces HICOBIAN, a differentiable fuzzy hierarchical cosmic-web bias model using sigmoid gradients for smooth region transitions, enabling accurate Bayesian field-level reconstruction of primordial density fields ...
-
A Strongly Parametrized Mass Ratio Model for the Stable Mass Transfer Channel: a Case Study of the $10 \, \rm{M}_{\odot}$ Peak
A parametrized analytical model for BBH mass ratios from the stable mass transfer channel is derived and applied to the 10 solar-mass peak in GWTC-4, favoring little mass-ratio reversal.
-
A renormalization-group inspired lattice-based framework for piecewise generalized linear models
RG-inspired lattice models for piecewise GLMs provide explicit interpretable partitions and a replica-analysis-derived scaling law for regularization that allows increasing complexity without expected rise in generali...
-
Tokenised Flow Matching for Hierarchical Simulation Based Inference
TFMPE combines likelihood factorisation with tokenised flow matching to enable efficient hierarchical SBI from single-site simulations, producing well-calibrated posteriors at lower computational cost on a new benchma...
-
A unified harmonic framework for dark siren cosmology
The GW-galaxy cross-correlation method, unified with spectral sirens in a harmonic framework, can measure H0 to 1% and Omega_m to 5% precision with 2 years of data from next-generation detectors like Einstein Telescop...
-
Stochastic gravitational-wave background search using data from five pulsar timing arrays
Combined five-PTA dataset yields posterior on SGWB power-law amplitude and index consistent with nonzero signal but below 5-sigma significance, with reconstructed angular correlations matching the Hellings-Downs prediction.
-
Conversational AI increases political knowledge as effectively as self-directed internet search
Conversational AI matches self-directed internet search in increasing belief in true political information and decreasing belief in misinformation.
-
Sifting for a Stream: The Morphology of the $300S$ Stellar Stream
300S stellar stream exhibits three density peaks, smooth width variations, a possible 4.7 degree gap, and a kink modeled as resulting from Large Magellanic Cloud interaction across its full known footprint.
-
DESI 2024 V: Full-Shape Galaxy Clustering from Galaxies and Quasars
DESI DR1 full-shape galaxy clustering constrains Omega_m = 0.296 ± 0.010, H0 = 68.63 ± 0.79 km/s/Mpc, and sigma_8 = 0.841 ± 0.034, consistent with LambdaCDM and Planck.
-
dynesty: A Dynamic Nested Sampling Package for Estimating Bayesian Posteriors and Evidences
dynesty is an open-source Python package for dynamic nested sampling that improves efficiency in Bayesian posterior and evidence estimation compared to MCMC on certain problems.
-
StanBKT: Rethinking Parameter Estimation in Bayesian Knowledge Tracing
StanBKT provides a unified Bayesian inference framework for BKT models supporting HMC, variational inference, and hierarchical variants, evaluated on ASSISTments and intervention datasets.
-
Gravitational-wave constraints on $H_0$ are robust to (putative) redshift evolution in the binary black hole mass spectrum at current sensitivity
Spectral-siren H0 constraints from GWTC-4.0 binary black holes remain robust when the mass spectrum is permitted to evolve with redshift at current detector sensitivity.
-
QCD-factorization amplitudes from flavour symmetries: beyond the $SU(3)$ symmetric case
A data-driven SU(3)-breaking analysis of B to PP decays yields QCD-factorization amplitudes that resemble dynamical predictions and require no enhanced annihilation terms.
-
Bathymetry Reconstruction by Bayesian Inference
Bayesian inference reconstructs bathymetry from point water height measurements, improving NRMSE over adjoint optimization on real wave flume data while quantifying uncertainty.
-
Determining the Host Stars of Planets in Binary Star Systems with Asterodensity Profiling: Investigating the Canonical Radius Gap
Probabilistic host-star assignments via asterodensity profiling suggest the exoplanet radius gap is less empty in binary systems once possible circumsecondary planets are included.
-
Discovery of a compact hierarchical triple main-sequence star system while searching for binary stars with compact objects
A new compact hierarchical triple main-sequence star system G1010 was discovered through combined low- and high-SNR spectroscopy, Gaia DR3 data, and TESS light curve analysis, showing an inner eclipsing binary rather ...
-
Symbolic Emulators for Cosmology: Accelerating Cosmological Analyses Without Sacrificing Precision
Symbolic emulators approximate key Lambda CDM functions to 0.001-0.05% accuracy across relevant redshifts and Omega_m values, enabling faster 3x2pt inference with consistent results.
-
Deployable probabilistic programming
Design guidelines and a Go library (Infergo) for deploying probabilistic programming in production systems, with benchmark comparisons.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.