LGDec 20, 2013

Adaptive Seeding for Gaussian Mixture Models

arXiv:1312.5946v322 citations
Originality Synthesis-oriented
AI Analysis

This work addresses initialization challenges in Gaussian mixture modeling for data analysis, but it is incremental as it builds on existing algorithms.

The authors tackled the problem of initializing expectation-maximization for Gaussian mixture models by adapting K-means++ and Gonzalez algorithms, resulting in methods that outperform common techniques on artificial and real-world datasets.

We present new initialization methods for the expectation-maximization algorithm for multivariate Gaussian mixture models. Our methods are adaptions of the well-known $K$-means++ initialization and the Gonzalez algorithm. Thereby we aim to close the gap between simple random, e.g. uniform, and complex methods, that crucially depend on the right choice of hyperparameters. Our extensive experiments indicate the usefulness of our methods compared to common techniques and methods, which e.g. apply the original $K$-means++ and Gonzalez directly, with respect to artificial as well as real-world data sets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes