Conditional Density Estimation with Bayesian Normalising Flows
read the original abstract
Modeling complex conditional distributions is critical in a variety of settings. Despite a long tradition of research into conditional density estimation, current methods employ either simple parametric forms or are difficult to learn in practice. This paper employs normalising flows as a flexible likelihood model and presents an efficient method for fitting them to complex densities. These estimators must trade-off between modeling distributional complexity, functional complexity and heteroscedasticity without overfitting. We recognize these trade-offs as modeling decisions and develop a Bayesian framework for placing priors over these conditional density estimators using variational Bayesian neural networks. We evaluate this method on several small benchmark regression datasets, on some of which it obtains state of the art performance. Finally, we apply the method to two spatial density modeling tasks with over 1 million datapoints using the New York City yellow taxi dataset and the Chicago crime dataset.
This paper has not been read by Pith yet.
Forward citations
Cited by 4 Pith papers
-
Data-Driven Predictions for Dark Photon and Millicharged Particle Production
A data-driven framework using normalizing flows predicts the rate and kinematic distributions of dark photon and millicharged particle production directly from measured dilepton events.
-
Inherited or produced? Inferring protein production kinetics when protein counts are shaped by a cell's division history
Conditional normalizing flows approximate intractable likelihoods arising from cell division history to conclude that glc3 is mostly inactive under nutrient stress in yeast, with brief transient expression.
-
Addressing prior dependence in hierarchical Bayesian modeling for PTA data analysis II: Noise and SGWB inference through parameter decorrelation
A reparametrized hierarchical Bayesian approach using normalizing flows and orthogonal projection of hyperparameters yields tighter noise constraints and partially breaks the red-noise-SGWB degeneracy in a minimal 3-p...
-
Decomposing Ensemble Spread in Lorenz '96 With Learned Stochastic Parameterizations
In the Lorenz '96 testbed, ensemble perturbations regulate decorrelation rates without raising long-term variance, while temporally persistent stochastic parameterizations boost early spread growth and spread-error co...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.