LGMLJan 16, 2013

Minimum Message Length Clustering Using Gibbs Sampling

arXiv:1301.3851v111 citations
Originality Incremental advance
AI Analysis

This provides a more robust clustering tool for data scientists by addressing key issues in existing methods, though it appears incremental as it builds on established principles.

The paper tackles the limitations of K-Means and EM algorithms in clustering, such as local optima and requiring a priori specification of cluster numbers, by developing a Bayesian mixture modeling tool using Minimum Message Length (MML) and Gibbs sampling, resulting in a method that samples models according to their posterior probability to avoid local optima.

The K-Mean and EM algorithms are popular in clustering and mixture modeling, due to their simplicity and ease of implementation. However, they have several significant limitations. Both coverage to a local optimum of their respective objective functions (ignoring the uncertainty in the model space), require the apriori specification of the number of classes/clsuters, and are inconsistent. In this work we overcome these limitations by using the Minimum Message Length (MML) principle and a variation to the K-Means/EM observation assignment and parameter calculation scheme. We maintain the simplicity of these approaches while constructing a Bayesian mixture modeling tool that samples/searches the model space using a Markov Chain Monte Carlo (MCMC) sampler known as a Gibbs sampler. Gibbs sampling allows us to visit each model according to its posterior probability. Therefore, if the model space is multi-modal we will visit all models and not get stuck in local optima. We call our approach multiple chains at equilibrium (MCE) MML sampling.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes