LGCVMLMar 2, 2015

A review of mean-shift algorithms for clustering

arXiv:1503.00687v1129 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental review paper summarizing existing methods for clustering and related tasks.

The paper reviews mean-shift algorithms for clustering, which use kernel density estimates to identify high-density regions in datasets, covering theory, variants, extensions, and applications.

A natural way to characterize the cluster structure of a dataset is by finding regions containing a high density of data. This can be done in a nonparametric way with a kernel density estimate, whose modes and hence clusters can be found using mean-shift algorithms. We describe the theory and practice behind clustering based on kernel density estimates and mean-shift algorithms. We discuss the blurring and non-blurring versions of mean-shift; theoretical results about mean-shift algorithms and Gaussian mixtures; relations with scale-space theory, spectral clustering and other algorithms; extensions to tracking, to manifold and graph data, and to manifold denoising; K-modes and Laplacian K-modes algorithms; acceleration strategies for large datasets; and applications to image segmentation, manifold denoising and multivalued regression.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes