ML LG PRNov 15, 2020

Entropic regularization of Wasserstein distance between infinite-dimensional Gaussian measures and Gaussian processes

arXiv:2011.07489v39.016 citations

Originality Incremental advance

AI Analysis

It addresses theoretical challenges in optimal transport for machine learning and statistics, offering tools for applications like kernel methods, but is incremental as it extends finite-dimensional results to infinite dimensions.

This work tackles the problem of computing entropic regularized Wasserstein distances for Gaussian measures in infinite-dimensional Hilbert spaces, providing closed-form formulas for optimal transport plans, distances, and Sinkhorn divergences, with results including Fréchet differentiability and unique barycenter solutions.

This work studies the entropic regularization formulation of the 2-Wasserstein distance on an infinite-dimensional Hilbert space, in particular for the Gaussian setting. We first present the Minimum Mutual Information property, namely the joint measures of two Gaussian measures on Hilbert space with the smallest mutual information are joint Gaussian measures. This is the infinite-dimensional generalization of the Maximum Entropy property of Gaussian densities on Euclidean space. We then give closed form formulas for the optimal entropic transport plan, entropic 2-Wasserstein distance, and Sinkhorn divergence between two Gaussian measures on a Hilbert space, along with the fixed point equations for the barycenter of a set of Gaussian measures. Our formulations fully exploit the regularization aspect of the entropic formulation and are valid both in singular and nonsingular settings. In the infinite-dimensional setting, both the entropic 2-Wasserstein distance and Sinkhorn divergence are Fréchet differentiable, in contrast to the exact 2-Wasserstein distance, which is not differentiable. Our Sinkhorn barycenter equation is new and always has a unique solution. In contrast, the finite-dimensional barycenter equation for the entropic 2-Wasserstein distance fails to generalize to the Hilbert space setting. In the setting of reproducing kernel Hilbert spaces (RKHS), our distance formulas are given explicitly in terms of the corresponding kernel Gram matrices, providing an interpolation between the kernel Maximum Mean Discrepancy (MMD) and the kernel 2-Wasserstein distance.

View on arXiv PDF

Similar