LG OC MLJun 8, 2025

Modified K-means Algorithm with Local Optimality Guarantees

Mingyi Li, Michael R. Metel, Akiko Takeda

arXiv:2506.06990v24.11 citationsh-index: 6Has CodeICML

Originality Incremental advance

AI Analysis

This work addresses a theoretical gap in clustering algorithms for machine learning practitioners, though it is incremental as it builds on the well-established K-means framework.

The paper tackles the lack of rigorous local optimality guarantees in the K-means algorithm by proposing modifications that ensure local optimality under general Bregman divergences, with numerical experiments showing reduced clustering loss compared to standard K-means.

The K-means algorithm is one of the most widely studied clustering algorithms in machine learning. While extensive research has focused on its ability to achieve a globally optimal solution, there still lacks a rigorous analysis of its local optimality guarantees. In this paper, we first present conditions under which the K-means algorithm converges to a locally optimal solution. Based on this, we propose simple modifications to the K-means algorithm which ensure local optimality in both the continuous and discrete sense, with the same computational complexity as the original K-means algorithm. As the dissimilarity measure, we consider a general Bregman divergence, which is an extension of the squared Euclidean distance often used in the K-means algorithm. Numerical experiments confirm that the K-means algorithm does not always find a locally optimal solution in practice, while our proposed methods provide improved locally optimal solutions with reduced clustering loss. Our code is available at https://github.com/lmingyi/LO-K-means.

View on arXiv PDF Code

Similar