MLLGJun 10, 2017

An Alternative to EM for Gaussian Mixture Models: Batch and Stochastic Riemannian Optimization

arXiv:1706.03267v171 citations
Originality Highly original
AI Analysis

This work addresses a fundamental problem in statistical estimation for machine learning practitioners, offering a novel optimization method with demonstrated performance gains.

The authors tackled maximum likelihood estimation for Gaussian Mixture Models by proposing a Riemannian optimization approach as an alternative to the EM algorithm, developing batch and stochastic gradient methods that often substantially outperform EM in empirical results.

We consider maximum likelihood estimation for Gaussian Mixture Models (Gmms). This task is almost invariably solved (in theory and practice) via the Expectation Maximization (EM) algorithm. EM owes its success to various factors, of which is its ability to fulfill positive definiteness constraints in closed form is of key importance. We propose an alternative to EM by appealing to the rich Riemannian geometry of positive definite matrices, using which we cast Gmm parameter estimation as a Riemannian optimization problem. Surprisingly, such an out-of-the-box Riemannian formulation completely fails and proves much inferior to EM. This motivates us to take a closer look at the problem geometry, and derive a better formulation that is much more amenable to Riemannian optimization. We then develop (Riemannian) batch and stochastic gradient algorithms that outperform EM, often substantially. We provide a non-asymptotic convergence analysis for our stochastic method, which is also the first (to our knowledge) such global analysis for Riemannian stochastic gradient. Numerous empirical results are included to demonstrate the effectiveness of our methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes