LGMar 7, 2015

Estimating the Mean Number of K-Means Clusters to Form

arXiv:1503.03488v25 citations

Originality Synthesis-oriented

AI Analysis

This addresses a fundamental parameter selection issue in unsupervised learning for data analysts, though it appears incremental as it builds on existing random cluster models.

The paper tackles the problem of determining how many clusters to form in K-Means classification by estimating the mean number of clusters using the dataset's sample size and a random cluster model, but no concrete results or numbers are provided.

Utilizing the sample size of a dataset, the random cluster model is employed in order to derive an estimate of the mean number of K-Means clusters to form during classification of a dataset.

View on arXiv PDF

Similar