MLLGApr 10, 2019

Attraction-Repulsion clustering with applications to fairness

arXiv:1904.05254v42 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses fairness in clustering for applications requiring demographic parity, though it is incremental as it builds on existing pre-processing methods.

The paper tackles the problem of diversity-enhancing clustering for fairness by introducing attraction-repulsion perturbations to distances, which penalizes homogeneity in protected attributes and improves diversity, as demonstrated with synthetic and real data.

We consider the problem of diversity enhancing clustering, i.e, developing clustering methods which produce clusters that favour diversity with respect to a set of protected attributes such as race, sex, age, etc. In the context of fair clustering, diversity plays a major role when fairness is understood as demographic parity. To promote diversity, we introduce perturbations to the distance in the unprotected attributes that account for protected attributes in a way that resembles attraction-repulsion of charged particles in Physics. These perturbations are defined through dissimilarities with a tractable interpretation. Cluster analysis based on attraction-repulsion dissimilarities penalizes homogeneity of the clusters with respect to the protected attributes and leads to an improvement in diversity. An advantage of our approach, which falls into a pre-processing set-up, is its compatibility with a wide variety of clustering methods and whit non-Euclidean data. We illustrate the use of our procedures with both synthetic and real data and provide discussion about the relation between diversity, fairness, and cluster structure. Our procedures are implemented in an R package freely available at https://github.com/HristoInouzhe/AttractionRepulsionClustering.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes