ML LG NE STJun 26, 2020

Deep Involutive Generative Models for Neural MCMC

Span Spanbauer, Cameron Freer, Vikash Mansinghka

arXiv:2006.15167v211.412 citations

Originality Highly original

AI Analysis

This work addresses the challenge of slow mixing in MCMC for complex distributions, offering a novel approach that could benefit practitioners in Bayesian inference and machine learning.

The paper tackles the problem of designing efficient neural MCMC methods by introducing deep involutive generative models, which enable fast exploration of multi-modal distributions and achieve faster convergence than existing techniques like A-NICE-MC.

We introduce deep involutive generative models, a new architecture for deep generative modeling, and use them to define Involutive Neural MCMC, a new approach to fast neural MCMC. An involutive generative model represents a probability kernel $G(φ\mapsto φ')$ as an involutive (i.e., self-inverting) deterministic function $f(φ, π)$ on an enlarged state space containing auxiliary variables $π$. We show how to make these models volume preserving, and how to use deep volume-preserving involutive generative models to make valid Metropolis-Hastings updates based on an auxiliary variable scheme with an easy-to-calculate acceptance ratio. We prove that deep involutive generative models and their volume-preserving special case are universal approximators for probability kernels. This result implies that with enough network capacity and training time, they can be used to learn arbitrarily complex MCMC updates. We define a loss function and optimization algorithm for training parameters given simulated data. We also provide initial experiments showing that Involutive Neural MCMC can efficiently explore multi-modal distributions that are intractable for Hybrid Monte Carlo, and can converge faster than A-NICE-MC, a recently introduced neural MCMC technique.

View on arXiv PDF

Similar