Stefanie Schwaar

LGJun 11, 2020

A Generalised Linear Model Framework for $β$-Variational Autoencoders based on Exponential Dispersion Families

Robert Sicks, Ralf Korn, Stefanie Schwaar

Although variational autoencoders (VAE) are successfully used to obtain meaningful low-dimensional representations for high-dimensional data, the characterization of critical points of the loss function for general observation models is not fully understood. We introduce a theoretical framework that is based on a connection between $β$-VAE and generalized linear models (GLM). The equality between the activation function of a $β$-VAE and the inverse of the link function of a GLM enables us to provide a systematic generalization of the loss analysis for $β$-VAE based on the assumption that the observation model distribution belongs to an exponential dispersion family (EDF). As a result, we can initialize $β$-VAE nets by maximum likelihood estimates (MLE) that enhance the training performance on both synthetic and real world data sets. As a further consequence, we analytically describe the auto-pruning property inherent in the $β$-VAE objective and reason for posterior collapse.

LGMar 26, 2020

A lower bound for the ELBO of the Bernoulli Variational Autoencoder

Robert Sicks, Ralf Korn, Stefanie Schwaar

We consider a variational autoencoder (VAE) for binary data. Our main innovations are an interpretable lower bound for its training objective, a modified initialization and architecture of such a VAE that leads to faster training, and a decision support for finding the appropriate dimension of the latent space via using a PCA. Numerical examples illustrate our theoretical result and the performance of the new architecture.

Stefanie Schwaar

2 Papers