LGCVMLJul 13, 2020

Bridging Maximum Likelihood and Adversarial Learning via $α$-Divergence

arXiv:2007.06178v112 citations
AI Analysis

This work addresses the trade-off between mode coverage and sample quality in generative modeling for machine learning researchers, offering a theoretical framework that bridges two dominant training paradigms.

The paper tackles the complementary limitations of maximum likelihood and adversarial learning in generative models by proposing an α-Bridge that unifies them via α-divergence, enabling smooth transfer between approaches and providing insights into prior regularization methods.

Maximum likelihood (ML) and adversarial learning are two popular approaches for training generative models, and from many perspectives these techniques are complementary. ML learning encourages the capture of all data modes, and it is typically characterized by stable training. However, ML learning tends to distribute probability mass diffusely over the data space, $e.g.$, yielding blurry synthetic images. Adversarial learning is well known to synthesize highly realistic natural images, despite practical challenges like mode dropping and delicate training. We propose an $α$-Bridge to unify the advantages of ML and adversarial learning, enabling the smooth transfer from one to the other via the $α$-divergence. We reveal that generalizations of the $α$-Bridge are closely related to approaches developed recently to regularize adversarial learning, providing insights into that prior work, and further understanding of why the $α$-Bridge performs well in practice.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes