ML LG FA CO MESep 22, 2020

On the representation and learning of monotone triangular transport maps

Ricardo Baptista, Youssef Marzouk, Olivier Zahm

arXiv:2009.10303v319.671 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of efficient and robust density estimation and inference in machine learning, offering a general framework with applications in generative modeling and Bayesian inference, though it appears incremental in building upon existing transport map methods.

The paper tackles the problem of representing and learning monotone triangular transport maps for modeling complex probability distributions, establishing conditions for global optimization and proposing an adaptive algorithm that achieves stable generalization across various sample sizes.

Transportation of measure provides a versatile approach for modeling complex probability distributions, with applications in density estimation, Bayesian inference, generative modeling, and beyond. Monotone triangular transport maps$\unicode{x2014}$approximations of the Knothe$\unicode{x2013}$Rosenblatt (KR) rearrangement$\unicode{x2014}$are a canonical choice for these tasks. Yet the representation and parameterization of such maps have a significant impact on their generality and expressiveness, and on properties of the optimization problem that arises in learning a map from data (e.g., via maximum likelihood estimation). We present a general framework for representing monotone triangular maps via invertible transformations of smooth functions. We establish conditions on the transformation such that the associated infinite-dimensional minimization problem has no spurious local minima, i.e., all local minima are global minima; and we show for target distributions satisfying certain tail conditions that the unique global minimizer corresponds to the KR map. Given a sample from the target, we then propose an adaptive algorithm that estimates a sparse semi-parametric approximation of the underlying KR map. We demonstrate how this framework can be applied to joint and conditional density estimation, likelihood-free inference, and structure learning of directed graphical models, with stable generalization performance across a range of sample sizes.

View on arXiv PDF Code

Similar