LGMLJun 17, 2020

Analytical Probability Distributions and EM-Learning for Deep Generative Networks

arXiv:2006.10023v11 citations
Originality Highly original
AI Analysis

This work addresses a bottleneck in probabilistic modeling for DGNs, offering a novel method that could improve training efficiency and accuracy for researchers in machine learning.

The authors tackled the problem of training Deep Generative Networks (DGNs) by deriving analytical probability distributions for the posterior and marginal, enabling a gradient-free Expectation-Maximization (EM) algorithm. They demonstrated empirically that EM training produces greater likelihood than Variational Autoencoder (VAE) training.

Deep Generative Networks (DGNs) with probabilistic modeling of their output and latent space are currently trained via Variational Autoencoders (VAEs). In the absence of a known analytical form for the posterior and likelihood expectation, VAEs resort to approximations, including (Amortized) Variational Inference (AVI) and Monte-Carlo (MC) sampling. We exploit the Continuous Piecewise Affine (CPA) property of modern DGNs to derive their posterior and marginal distributions as well as the latter's first moments. These findings enable us to derive an analytical Expectation-Maximization (EM) algorithm that enables gradient-free DGN learning. We demonstrate empirically that EM training of DGNs produces greater likelihood than VAE training. Our findings will guide the design of new VAE AVI that better approximate the true posterior and open avenues to apply standard statistical tools for model comparison, anomaly detection, and missing data imputation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes