ML LGApr 10, 2019

Attraction-Repulsion clustering with applications to fairness

Eustasio del Barrio, Hristo Inouzhe, Jean-Michel Loubes

arXiv:1904.05254v43.22 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses fairness in clustering for applications requiring demographic parity, though it is incremental as it builds on existing pre-processing methods.

The paper tackles the problem of diversity-enhancing clustering for fairness by introducing attraction-repulsion perturbations to distances, which penalizes homogeneity in protected attributes and improves diversity, as demonstrated with synthetic and real data.

We consider the problem of diversity enhancing clustering, i.e, developing clustering methods which produce clusters that favour diversity with respect to a set of protected attributes such as race, sex, age, etc. In the context of fair clustering, diversity plays a major role when fairness is understood as demographic parity. To promote diversity, we introduce perturbations to the distance in the unprotected attributes that account for protected attributes in a way that resembles attraction-repulsion of charged particles in Physics. These perturbations are defined through dissimilarities with a tractable interpretation. Cluster analysis based on attraction-repulsion dissimilarities penalizes homogeneity of the clusters with respect to the protected attributes and leads to an improvement in diversity. An advantage of our approach, which falls into a pre-processing set-up, is its compatibility with a wide variety of clustering methods and whit non-Euclidean data. We illustrate the use of our procedures with both synthetic and real data and provide discussion about the relation between diversity, fairness, and cluster structure. Our procedures are implemented in an R package freely available at https://github.com/HristoInouzhe/AttractionRepulsionClustering.

View on arXiv PDF Code

Similar