LGAIMLOct 5, 2023

Learning Energy-Based Prior Model with Diffusion-Amortized MCMC

arXiv:2310.03218v117 citationsh-index: 22
AI Analysis

This addresses a key bottleneck in generative modeling for researchers and practitioners by improving sampling efficiency and stability in latent space EBMs.

The paper tackles the problem of degraded generation quality and training instability in latent space Energy-Based Models (EBMs) caused by non-convergent short-run MCMC sampling, and introduces a diffusion-based amortization method for long-run MCMC that demonstrates superior performance on image modeling benchmarks.

Latent space Energy-Based Models (EBMs), also known as energy-based priors, have drawn growing interests in the field of generative modeling due to its flexibility in the formulation and strong modeling power of the latent space. However, the common practice of learning latent space EBMs with non-convergent short-run MCMC for prior and posterior sampling is hindering the model from further progress; the degenerate MCMC sampling quality in practice often leads to degraded generation quality and instability in training, especially with highly multi-modal and/or high-dimensional target distributions. To remedy this sampling issue, in this paper we introduce a simple but effective diffusion-based amortization method for long-run MCMC sampling and develop a novel learning algorithm for the latent space EBM based on it. We provide theoretical evidence that the learned amortization of MCMC is a valid long-run MCMC sampler. Experiments on several image modeling benchmark datasets demonstrate the superior performance of our method compared with strong counterparts

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes