DSAILGMay 16, 2024

A Polynomial-Time Approximation for Pairwise Fair $k$-Median Clustering

arXiv:2405.10378v22 citationsh-index: 18
Originality Incremental advance
AI Analysis

This addresses fairness in clustering for multiple demographic groups, offering a practical solution with theoretical guarantees, though it is incremental relative to prior work on fair clustering.

The paper tackles the problem of pairwise fair k-median clustering with multiple groups, where cluster compositions must satisfy fairness constraints, and presents the first polynomial-time O(k^2·ℓ·t)-approximation algorithm that does not violate these constraints. It also provides hardness of approximation results, showing the problem is nearly as hard as uniform capacitated k-median.

In this work, we study pairwise fair clustering with $\ell \ge 2$ groups, where for every cluster $C$ and every group $i \in [\ell]$, the number of points in $C$ from group $i$ must be at most $t$ times the number of points in $C$ from any other group $j \in [\ell]$, for a given integer $t$. To the best of our knowledge, only bi-criteria approximation and exponential-time algorithms follow for this problem from the prior work on fair clustering problems when $\ell > 2$. In our work, focusing on the $\ell > 2$ case, we design the first polynomial-time $O(k^2\cdot \ell \cdot t)$-approximation for this problem with $k$-median cost that does not violate the fairness constraints. We complement our algorithmic result by providing hardness of approximation results, which show that our problem even when $\ell=2$ is almost as hard as the popular uniform capacitated $k$-median, for which no polynomial-time algorithm with an approximation factor of $o(\log k)$ is known.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes