STMLMar 23, 2020

Deterministic Approximate EM Algorithm; Application to the Riemann Approximation EM and the Tempered EM

arXiv:2003.10126v38 citations
Originality Incremental advance
AI Analysis

This work addresses a gap in optimization methods for non-convex likelihoods with latent variables, offering deterministic alternatives that are incremental improvements over existing stochastic approximations.

The authors tackled the lack of deterministic approximations in the EM algorithm by introducing a theoretical framework with convergence guarantees, and they developed methods like Riemann approximation EM and tempered EM, showing that new temperature profiles can more successfully escape adversarial initializations.

The Expectation Maximisation (EM) algorithm is widely used to optimise non-convex likelihood functions with latent variables. Many authors modified its simple design to fit more specific situations. For instance, the Expectation (E) step has been replaced by Monte Carlo (MC), Markov Chain Monte Carlo or tempered approximations, etc. Most of the well-studied approximations belong to the stochastic class. By comparison, the literature is lacking when it comes to deterministic approximations. In this paper, we introduce a theoretical framework, with state-of-the-art convergence guarantees, for any deterministic approximation of the E step. We analyse theoretically and empirically several approximations that fit into this framework. First, for intractable E-steps, we introduce a deterministic version of MC-EM using Riemann sums. A straightforward method, not requiring any hyper-parameter fine-tuning, useful when the low dimensionality does not warrant a MC-EM. Then, we consider the tempered approximation, borrowed from the Simulated Annealing literature and used to escape local extrema. We prove that the tempered EM verifies the convergence guarantees for a wider range of temperature profiles than previously considered. We showcase empirically how new non-trivial profiles can more successfully escape adversarial initialisations. Finally, we combine the Riemann and tempered approximations into a method that accomplishes both their purposes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes