ML LGMay 22, 2024

Robust Generative Learning with Lipschitz-Regularized $α$-Divergences Allows Minimal Assumptions on Target Distributions

Ziyu Chen, Hyemin Gu, Markos A. Katsoulakis, Luc Rey-Bellet, Wei Zhu

arXiv:2405.13962v37.52 citationsh-index: 32Inf Inference J IMA

Originality Incremental advance

AI Analysis

This addresses robustness issues in generative modeling for practitioners dealing with complex real-world data distributions, though it appears incremental as it builds on existing divergence frameworks.

The paper tackles the problem of stable generative modeling across diverse target distributions by demonstrating that Lipschitz-regularized α-divergences remain finite under minimal assumptions (finite first moment of source distribution) and provide necessary/sufficient conditions for heavy-tailed targets, with numerical experiments showing stable learning in challenging scenarios like heavy tails or fractal support.

This paper demonstrates the robustness of Lipschitz-regularized $α$-divergences as objective functionals in generative modeling, showing they enable stable learning across a wide range of target distributions with minimal assumptions. We establish that these divergences remain finite under a mild condition-that the source distribution has a finite first moment-regardless of the properties of the target distribution, making them adaptable to the structure of target distributions. Furthermore, we prove the existence and finiteness of their variational derivatives, which are essential for stable training of generative models such as GANs and gradient flows. For heavy-tailed targets, we derive necessary and sufficient conditions that connect data dimension, $α$, and tail behavior to divergence finiteness, that also provide insights into the selection of suitable $α$'s. We also provide the first sample complexity bounds for empirical estimations of these divergences on unbounded domains. As a byproduct, we obtain the first sample complexity bounds for empirical estimations of these divergences and the Wasserstein-1 metric with group symmetry on unbounded domains. Numerical experiments confirm that generative models leveraging Lipschitz-regularized $α$-divergences can stably learn distributions in various challenging scenarios, including those with heavy tails or complex, low-dimensional, or fractal support, all without any prior knowledge of the structure of target distributions.

View on arXiv PDF

Similar