Recognition: unknown
Data analysis recipes: Fitting a model to data
read the original abstract
We go through the many considerations involved in fitting a model to data, using as an example the fit of a straight line to a set of points in a two-dimensional plane. Standard weighted least-squares fitting is only appropriate when there is a dimension along which the data points have negligible uncertainties, and another along which all the uncertainties can be described by Gaussians of known variance; these conditions are rarely met in practice. We consider cases of general, heterogeneous, and arbitrarily covariant two-dimensional uncertainties, and situations in which there are bad data (large outliers), unknown uncertainties, and unknown but expected intrinsic scatter in the linear relationship being fit. Above all we emphasize the importance of having a "generative model" for the data, even an approximate one. Once there is a generative model, the subsequent fitting is non-arbitrary because the model permits direct computation of the likelihood of the parameters or the posterior probability distribution. Construction of a posterior probability distribution is indispensible if there are "nuisance parameters" to marginalize away.
This paper has not been read by Pith yet.
Forward citations
Cited by 6 Pith papers
-
A Census of Na D-traced neutral ISM and outflows at $0.6<z<4$
A JWST census detects neutral ISM absorption in 76 of 309 galaxies at 0.6<z<4 and outflows in 26, indicating AGN-driven neutral outflows dominate in quiescent systems at cosmic noon.
-
emcee: The MCMC Hammer
emcee delivers a stable Python implementation of the affine-invariant ensemble MCMC algorithm that requires minimal hand-tuning and supports easy parallelization.
-
Probabilistic Spectral Reconstruction of Trans-Neptunian Objects from Sparse Photometry: A Framework for Taxonomy, Survey Optimization, and Outlier Detection
A PCA-based latent space model with Bayesian reconstruction achieves 95% credible interval coverage for TNO spectra from photometry using 4-10 components.
-
Inferring the star-formation histories of massive quiescent galaxies with BAGPIPES: Evidence for multiple quenching mechanisms
BAGPIPES fitting of 9289 massive quiescent galaxies shows most SFHs rise gradually then quench in 1-2 Gyr, with faster quenching at z>1 and slower at z<1, interpreted as multiple AGN feedback and gas-supply mechanisms.
-
Constraining the Molecular Kennicutt-Schmidt Relation with Multi-Transition CO Observations of Nearby Galaxies
Multi-transition CO observations reveal that the star formation-molecular gas relation becomes more linear for denser gas tracers, implying a volume density power-law index of approximately 1.5.
-
Stellar Population Inference with Prospector
Prospector is a flexible code for Bayesian inference of stellar population parameters from multi-wavelength photometry and spectroscopy via forward modeling and posterior sampling.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.