ML LGJul 16, 2019

The continuous Bernoulli: fixing a pervasive error in variational autoencoders

Gabriel Loaiza-Ganem, John P. Cunningham

arXiv:1907.06845v521.098 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses a fundamental modeling error in VAEs that affects researchers and practitioners using them for continuous data, though it is an incremental fix rather than a new paradigm.

The paper identifies and fixes a pervasive error in variational autoencoders (VAEs) where Bernoulli likelihoods are incorrectly applied to [0,1]-valued data like MNIST pixels, introducing the continuous Bernoulli distribution to correct this. This leads to meaningful performance improvements, including sharper image samples and better metrics across datasets.

Variational autoencoders (VAE) have quickly become a central tool in machine learning, applicable to a broad range of data types and latent variable models. By far the most common first step, taken by seminal papers and by core software libraries alike, is to model MNIST data using a deep network parameterizing a Bernoulli likelihood. This practice contains what appears to be and what is often set aside as a minor inconvenience: the pixel data is [0,1] valued, not {0,1} as supported by the Bernoulli likelihood. Here we show that, far from being a triviality or nuisance that is convenient to ignore, this error has profound importance to VAE, both qualitative and quantitative. We introduce and fully characterize a new [0,1]-supported, single parameter distribution: the continuous Bernoulli, which patches this pervasive bug in VAE. This distribution is not nitpicking; it produces meaningful performance improvements across a range of metrics and datasets, including sharper image samples, and suggests a broader class of performant VAE.

View on arXiv PDF Code

Similar