NELGDec 4, 2022

Can Evolutionary Clustering Have Theoretical Guarantees?

arXiv:2212.01771v27 citationsh-index: 28
Originality Incremental advance
AI Analysis

This work addresses the lack of theoretical foundations for evolutionary clustering methods, offering guarantees for practitioners in machine learning and data analysis, though it is incremental in extending existing algorithms to new theoretical contexts.

The paper proves that the GSEMO evolutionary algorithm achieves theoretical approximation guarantees for four clustering formulations (k-tMM, k-center, discrete k-median, and k-means), and extends this to discrete k-median clustering under individual fairness, providing bounds on both objective and fairness constraints.

Clustering is a fundamental problem in many areas, which aims to partition a given data set into groups based on some distance measure, such that the data points in the same group are similar while that in different groups are dissimilar. Due to its importance and NP-hardness, a lot of methods have been proposed, among which evolutionary algorithms are a class of popular ones. Evolutionary clustering has found many successful applications, but all the results are empirical, lacking theoretical support. This paper fills this gap by proving that the approximation performance of the GSEMO (a simple multi-objective evolutionary algorithm) for solving four formulations of clustering, i.e., $k$-tMM, $k$-center, discrete $k$-median and $k$-means, can be theoretically guaranteed. Furthermore, we consider clustering under fairness, which tries to avoid algorithmic bias, and has recently been an important research topic in machine learning. We prove that for discrete $k$-median clustering under individual fairness, the approximation performance of the GSEMO can be theoretically guaranteed with respect to both the objective function and the fairness constraint.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes