ML LGMar 30, 2021

Statistical inference for individual fairness

Subha Maity, Songkai Xue, Mikhail Yurochkin, Yuekai Sun

arXiv:2103.16714v113.821 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the issue of ML models perpetuating biases like gender and racial biases, providing auditors with principled methods to assess fairness, though it is incremental in building on existing fairness concepts.

The paper tackles the problem of detecting violations of individual fairness in machine learning models by formalizing it as measuring susceptibility to adversarial attacks and developing statistical inference tools, such as confidence intervals and hypothesis tests, with demonstrated utility in a real-world case study.

As we rely on machine learning (ML) models to make more consequential decisions, the issue of ML models perpetuating or even exacerbating undesirable historical biases (e.g., gender and racial biases) has come to the fore of the public's attention. In this paper, we focus on the problem of detecting violations of individual fairness in ML models. We formalize the problem as measuring the susceptibility of ML models against a form of adversarial attack and develop a suite of inference tools for the adversarial cost function. The tools allow auditors to assess the individual fairness of ML models in a statistically-principled way: form confidence intervals for the worst-case performance differential between similar individuals and test hypotheses of model fairness with (asymptotic) non-coverage/Type I error rate control. We demonstrate the utility of our tools in a real-world case study.

View on arXiv PDF Code

Similar