Deep Gaussian Processes with Decoupled Inducing Inputs
read the original abstract
Deep Gaussian Processes (DGP) are hierarchical generalizations of Gaussian Processes (GP) that have proven to work effectively on a multiple supervised regression tasks. They combine the well calibrated uncertainty estimates of GPs with the great flexibility of multilayer models. In DGPs, given the inputs, the outputs of the layers are Gaussian distributions parameterized by their means and covariances. These layers are realized as Sparse GPs where the training data is approximated using a small set of pseudo points. In this work, we show that the computational cost of DGPs can be reduced with no loss in performance by using a separate, smaller set of pseudo points when calculating the layerwise variance while using a larger set of pseudo points when calculating the layerwise mean. This enabled us to train larger models that have lower cost and better predictive performance.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
An Analysis of Posterior Collapse, Parameterization and Initialization in Variational Deep Gaussian Processes
Analysis of posterior collapse in variational DGPs shows the linear prior mean benefit comes from better optimization conditioning at initialization; a new zero-mean initialization prevents collapse and matches linear...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.