LGMLJun 28, 2020

Reliable Categorical Variational Inference with Mixture of Discrete Normalizing Flows

arXiv:2006.15568v22 citations
AI Analysis

This addresses a key bottleneck in variational inference for researchers and practitioners working with models that have discrete latent variables, offering a more reliable alternative to existing continuous relaxations like Gumbel-Softmax.

The paper tackles the challenge of gradient-based optimization for discrete latent variables in variational inference by introducing a mixture of discrete normalizing flows as a differentiable reparameterization for categorical distributions, which defines a proper discrete distribution and reduces sensitivity to hyperparameters.

Variational approximations are increasingly based on gradient-based optimization of expectations estimated by sampling. Handling discrete latent variables is then challenging because the sampling process is not differentiable. Continuous relaxations, such as the Gumbel-Softmax for categorical distribution, enable gradient-based optimization, but do not define a valid probability mass for discrete observations. In practice, selecting the amount of relaxation is difficult and one needs to optimize an objective that does not align with the desired one, causing problems especially with models having strong meaningful priors. We provide an alternative differentiable reparameterization for categorical distribution by composing it as a mixture of discrete normalizing flows. It defines a proper discrete distribution, allows directly optimizing the evidence lower bound, and is less sensitive to the hyperparameter controlling relaxation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes