LGMar 7, 2015
Estimating the Mean Number of K-Means Clusters to Form
arXiv:1503.03488v25 citations
Originality Synthesis-oriented
AI Analysis
This addresses a fundamental parameter selection issue in unsupervised learning for data analysts, though it appears incremental as it builds on existing random cluster models.
The paper tackles the problem of determining how many clusters to form in K-Means classification by estimating the mean number of clusters using the dataset's sample size and a random cluster model, but no concrete results or numbers are provided.
Utilizing the sample size of a dataset, the random cluster model is employed in order to derive an estimate of the mean number of K-Means clusters to form during classification of a dataset.