LG DS GT MLDec 6, 2018

Differentially Private Fair Learning

Matthew Jagielski, Michael Kearns, Jieming Mao, Alina Oprea, Aaron Roth, Saeed Sharifi-Malvajerdi, Jonathan Ullman

arXiv:1812.02696v325.8170 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of ensuring fairness and privacy simultaneously in machine learning, which is crucial for applications like healthcare or hiring where sensitive data is restricted, though it builds incrementally on existing fairness methods.

The paper tackles the problem of learning predictive models that are both fair (ensuring equalized odds) and differentially private, especially when sensitive attributes cannot be explicitly collected. It introduces two algorithms that achieve these properties, showing new trade-offs between fairness, accuracy, and privacy, with milder trade-offs when group membership is used at test time.

Motivated by settings in which predictive models may be required to be non-discriminatory with respect to certain attributes (such as race), but even collecting the sensitive attribute may be forbidden or restricted, we initiate the study of fair learning under the constraint of differential privacy. We design two learning algorithms that simultaneously promise differential privacy and equalized odds, a 'fairness' condition that corresponds to equalizing false positive and negative rates across protected groups. Our first algorithm is a private implementation of the equalized odds post-processing approach of [Hardt et al., 2016]. This algorithm is appealingly simple, but must be able to use protected group membership explicitly at test time, which can be viewed as a form of 'disparate treatment'. Our second algorithm is a differentially private version of the oracle-efficient in-processing approach of [Agarwal et al., 2018] that can be used to find the optimal fair classifier, given access to a subroutine that can solve the original (not necessarily fair) learning problem. This algorithm is more complex but need not have access to protected group membership at test time. We identify new tradeoffs between fairness, accuracy, and privacy that emerge only when requiring all three properties, and show that these tradeoffs can be milder if group membership may be used at test time. We conclude with a brief experimental evaluation.

View on arXiv PDF

Similar