ML IT LG PR STDec 25, 2024

Generative Models with ELBOs Converging to Entropy Sums

Jan Warnken, Dmytro Velychko, Simon Damm, Asja Fischer, Jörg Lücke

arXiv:2501.09022v14.5h-index: 32

Originality Synthesis-oriented

AI Analysis

This provides a theoretical foundation for understanding ELBO behavior in unsupervised learning, but it is incremental as it extends existing convergence results to more models.

The paper proves that the evidence lower bound (ELBO) for various generative models, including probabilistic PCA and Gaussian mixture models, converges to entropy sums at all stationary points under realistic conditions, such as finite data and model mismatches.

The evidence lower bound (ELBO) is one of the most central objectives for probabilistic unsupervised learning. For the ELBOs of several generative models and model classes, we here prove convergence to entropy sums. As one result, we provide a list of generative models for which entropy convergence has been shown, so far, along with the corresponding expressions for entropy sums. Our considerations include very prominent generative models such as probabilistic PCA, sigmoid belief nets or Gaussian mixture models. However, we treat more models and entire model classes such as general mixtures of exponential family distributions. Our main contributions are the proofs for the individual models. For each given model we show that the conditions stated in Theorem 1 or Theorem 2 of [arXiv:2209.03077] are fulfilled such that by virtue of the theorems the given model's ELBO is equal to an entropy sum at all stationary points. The equality of the ELBO at stationary points applies under realistic conditions: for finite numbers of data points, for model/data mismatches, at any stationary point including saddle points etc, and it applies for any well behaved family of variational distributions.

View on arXiv PDF

Similar