LGSep 15, 2016

Tsallis Regularized Optimal Transport and Ecological Inference

arXiv:1609.04495v161 citations
AI Analysis

This work addresses the need for flexible optimal transport frameworks in social sciences, particularly for ecological inference, though it appears incremental by generalizing existing methods.

The authors tackled the problem of unifying and extending optimal transport methods by introducing Tsallis regularized optimal transport (TROT), which interpolates between various divergences like Wasserstein and Kullback-Leibler, and they applied it to ecological inference, achieving faithful reconstruction of joint distributions in US election data.

Optimal transport is a powerful framework for computing distances between probability distributions. We unify the two main approaches to optimal transport, namely Monge-Kantorovitch and Sinkhorn-Cuturi, into what we define as Tsallis regularized optimal transport (\trot). \trot~interpolates a rich family of distortions from Wasserstein to Kullback-Leibler, encompassing as well Pearson, Neyman and Hellinger divergences, to name a few. We show that metric properties known for Sinkhorn-Cuturi generalize to \trot, and provide efficient algorithms for finding the optimal transportation plan with formal convergence proofs. We also present the first application of optimal transport to the problem of ecological inference, that is, the reconstruction of joint distributions from their marginals, a problem of large interest in the social sciences. \trot~provides a convenient framework for ecological inference by allowing to compute the joint distribution --- that is, the optimal transportation plan itself --- when side information is available, which is \textit{e.g.} typically what census represents in political science. Experiments on data from the 2012 US presidential elections display the potential of \trot~in delivering a faithful reconstruction of the joint distribution of ethnic groups and voter preferences.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes