pith. machine review for the scientific record. sign in

arxiv: 1703.03664 · v1 · submitted 2017-03-10 · 💻 cs.CV · cs.NE

Recognition: unknown

Parallel Multiscale Autoregressive Density Estimation

A\"aron van den Oord, Dan Belov, Nal Kalchbrenner, Nando de Freitas, Scott Reed, Sergio G\'omez Colmenarejo, Ziyu Wang

Authors on Pith no claims yet
classification 💻 cs.CV cs.NE
keywords densityachievesestimationgenerationmodelpixelpixelcnnefficient
0
0 comments X
read the original abstract

PixelCNN achieves state-of-the-art results in density estimation for natural images. Although training is fast, inference is costly, requiring one network evaluation per pixel; O(N) for N pixels. This can be sped up by caching activations, but still involves generating each pixel sequentially. In this work, we propose a parallelized PixelCNN that allows more efficient inference by modeling certain pixel groups as conditionally independent. Our new PixelCNN model achieves competitive density estimation and orders of magnitude speedup - O(log N) sampling instead of O(N) - enabling the practical generation of 512x512 images. We evaluate the model on class-conditional image generation, text-to-image synthesis, and action-conditional video generation, showing that our model achieves the best results among non-pixel-autoregressive density models that allow efficient sampling.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Generating Long Sequences with Sparse Transformers

    cs.LG 2019-04 unverdicted novelty 7.0

    Sparse Transformers factorize attention to handle sequences tens of thousands long, achieving new SOTA density modeling on Enwik8, CIFAR-10, and ImageNet-64.