MLLGCOMEJun 30, 2020

Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport

arXiv:2006.16548v113 citations
Originality Incremental advance
AI Analysis

This work addresses the issue of local optima in expectation-maximization algorithms for mixture models, offering a more robust alternative for tasks like biological data inference, though it is incremental as it builds on existing EM and optimal transport frameworks.

The authors tackled the problem of EM algorithms getting stuck in local optima by proposing Sinkhorn EM, a variant based on entropic optimal transport for mixture models, which demonstrated better global convergence and significantly improved cell label learning in C. elegans neurons compared to other methods.

We study Sinkhorn EM (sEM), a variant of the expectation maximization (EM) algorithm for mixtures based on entropic optimal transport. sEM differs from the classic EM algorithm in the way responsibilities are computed during the expectation step: rather than assign data points to clusters independently, sEM uses optimal transport to compute responsibilities by incorporating prior information about mixing weights. Like EM, sEM has a natural interpretation as a coordinate ascent procedure, which iteratively constructs and optimizes a lower bound on the log-likelihood. However, we show theoretically and empirically that sEM has better behavior than EM: it possesses better global convergence guarantees and is less prone to getting stuck in bad local optima. We complement these findings with experiments on simulated data as well as in an inference task involving C. elegans neurons and show that sEM learns cell labels significantly better than other approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes