LGOCDec 10, 2020

Convex Potential Flows: Universal Probability Distributions with Optimal Transport and Convex Optimization

arXiv:2012.05942v2109 citations
AI Analysis

This work provides a new, theoretically grounded approach for designing invertible probabilistic models for researchers and practitioners in machine learning, offering an alternative to existing flow-based models.

This paper introduces Convex Potential Flows (CP-Flow), a new parameterization of invertible models inspired by optimal transport theory, which are the gradient map of a strongly convex neural potential function. The method achieves competitive performance on standard benchmarks for density estimation and variational inference.

Flow-based models are powerful tools for designing probabilistic models with tractable density. This paper introduces Convex Potential Flows (CP-Flow), a natural and efficient parameterization of invertible models inspired by the optimal transport (OT) theory. CP-Flows are the gradient map of a strongly convex neural potential function. The convexity implies invertibility and allows us to resort to convex optimization to solve the convex conjugate for efficient inversion. To enable maximum likelihood training, we derive a new gradient estimator of the log-determinant of the Jacobian, which involves solving an inverse-Hessian vector product using the conjugate gradient method. The gradient estimator has constant-memory cost, and can be made effectively unbiased by reducing the error tolerance level of the convex optimization routine. Theoretically, we prove that CP-Flows are universal density approximators and are optimal in the OT sense. Our empirical results show that CP-Flow performs competitively on standard benchmarks of density estimation and variational inference.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes