MLLGJul 16, 2019

The continuous Bernoulli: fixing a pervasive error in variational autoencoders

arXiv:1907.06845v598 citations
AI Analysis

This addresses a fundamental modeling error in VAEs that affects researchers and practitioners using them for continuous data, though it is an incremental fix rather than a new paradigm.

The paper identifies and fixes a pervasive error in variational autoencoders (VAEs) where Bernoulli likelihoods are incorrectly applied to [0,1]-valued data like MNIST pixels, introducing the continuous Bernoulli distribution to correct this. This leads to meaningful performance improvements, including sharper image samples and better metrics across datasets.

Variational autoencoders (VAE) have quickly become a central tool in machine learning, applicable to a broad range of data types and latent variable models. By far the most common first step, taken by seminal papers and by core software libraries alike, is to model MNIST data using a deep network parameterizing a Bernoulli likelihood. This practice contains what appears to be and what is often set aside as a minor inconvenience: the pixel data is [0,1] valued, not {0,1} as supported by the Bernoulli likelihood. Here we show that, far from being a triviality or nuisance that is convenient to ignore, this error has profound importance to VAE, both qualitative and quantitative. We introduce and fully characterize a new [0,1]-supported, single parameter distribution: the continuous Bernoulli, which patches this pervasive bug in VAE. This distribution is not nitpicking; it produces meaningful performance improvements across a range of metrics and datasets, including sharper image samples, and suggests a broader class of performant VAE.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes