LGDec 11, 2022

Stochastic First-Order Learning for Large-Scale Flexibly Tied Gaussian Mixture Model

arXiv:2212.05402v33 citationsh-index: 38
Originality Incremental advance
AI Analysis

This addresses scalability issues in GMMs for applications with streaming or large-dimensional data, representing an incremental improvement.

The paper tackled the challenge of fitting flexibly-tied Gaussian Mixture Models (GMMs) for high-dimensional data by proposing a stochastic optimization algorithm on orthogonal matrices, resulting in better likelihood, faster convergence with fewer epochs, and reduced time per epoch compared to expectation-maximization.

Gaussian Mixture Models (GMMs) are one of the most potent parametric density models used extensively in many applications. Flexibly-tied factorization of the covariance matrices in GMMs is a powerful approach for coping with the challenges of common GMMs when faced with high-dimensional data and complex densities which often demand a large number of Gaussian components. However, the expectation-maximization algorithm for fitting flexibly-tied GMMs still encounters difficulties with streaming and very large dimensional data. To overcome these challenges, this paper suggests the use of first-order stochastic optimization algorithms. Specifically, we propose a new stochastic optimization algorithm on the manifold of orthogonal matrices. Through numerous empirical results on both synthetic and real datasets, we observe that stochastic optimization methods can outperform the expectation-maximization algorithm in terms of attaining better likelihood, needing fewer epochs for convergence, and consuming less time per each epoch.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes