LGMLOct 18, 2013

A Theoretical and Experimental Comparison of the EM and SEM Algorithm

arXiv:1310.5034v24 citations
Originality Synthesis-oriented
AI Analysis

This provides theoretical and experimental insights into stochastic EM variants for mixture models, but appears incremental as it builds on existing algorithms.

The paper analyzes the SEM algorithm for mixture distributions, showing that for Gaussian mixture models with sufficiently large input sets, the update equations of EM and SEM are almost identical with high probability, and experiments confirm SEM runs nearly twice as fast.

In this paper we provide a new analysis of the SEM algorithm. Unlike previous work, we focus on the analysis of a single run of the algorithm. First, we discuss the algorithm for general mixture distributions. Second, we consider Gaussian mixture models and show that with high probability the update equations of the EM algorithm and its stochastic variant are almost the same, given that the input set is sufficiently large. Our experiments confirm that this still holds for a large number of successive update steps. In particular, for Gaussian mixture models, we show that the stochastic variant runs nearly twice as fast.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes