LGDSSTOct 3, 2023

Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization

arXiv:2310.01762v111 citationsh-index: 15
Originality Incremental advance
AI Analysis

This provides a theoretical solution for sampling multimodal distributions in statistical modeling, addressing a known limitation of vanilla score matching.

The paper tackles the challenge of sampling from multimodal distributions using vanilla score matching, proving that Langevin diffusion with early stopping and data-based initialization successfully generates mixtures of log-concave distributions.

There is a long history, as well as a recent explosion of interest, in statistical and generative modeling approaches based on score functions -- derivatives of the log-likelihood of a distribution. In seminal works, Hyvärinen proposed vanilla score matching as a way to learn distributions from data by computing an estimate of the score function of the underlying ground truth, and established connections between this method and established techniques like Contrastive Divergence and Pseudolikelihood estimation. It is by now well-known that vanilla score matching has significant difficulties learning multimodal distributions. Although there are various ways to overcome this difficulty, the following question has remained unanswered -- is there a natural way to sample multimodal distributions using just the vanilla score? Inspired by a long line of related experimental works, we prove that the Langevin diffusion with early stopping, initialized at the empirical distribution, and run on a score function estimated from data successfully generates natural multimodal distributions (mixtures of log-concave distributions).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes