LG ITFeb 24, 2021

A Stochastic Optimization Framework for Fair Risk Minimization

Andrew Lowy, Sina Baharlouei, Rakesh Pavan, Meisam Razaviyayn, Ahmad Beirami

arXiv:2102.12586v519.930 citationsHas Code

Originality Highly original

AI Analysis

This work addresses the challenge of scalable fairness in machine learning for applications requiring large models and datasets, offering a practical solution for researchers and practitioners dealing with multiple sensitive attributes and non-binary targets.

The paper tackled the problem of fair classification with discrete sensitive attributes in large-scale settings, where existing fairness algorithms are impractical or lack convergence guarantees, and developed FERMI, the first stochastic in-processing fairness algorithm with proven convergence for demographic parity, equalized odds, and equal opportunity, achieving the most favorable tradeoffs between fairness violation and test accuracy across all tested setups, especially with small batch sizes and non-binary classification.

Despite the success of large-scale empirical risk minimization (ERM) at achieving high accuracy across a variety of machine learning tasks, fair ERM is hindered by the incompatibility of fairness constraints with stochastic optimization. We consider the problem of fair classification with discrete sensitive attributes and potentially large models and data sets, requiring stochastic solvers. Existing in-processing fairness algorithms are either impractical in the large-scale setting because they require large batches of data at each iteration or they are not guaranteed to converge. In this paper, we develop the first stochastic in-processing fairness algorithm with guaranteed convergence. For demographic parity, equalized odds, and equal opportunity notions of fairness, we provide slight variations of our algorithm--called FERMI--and prove that each of these variations converges in stochastic optimization with any batch size. Empirically, we show that FERMI is amenable to stochastic solvers with multiple (non-binary) sensitive attributes and non-binary targets, performing well even with minibatch size as small as one. Extensive experiments show that FERMI achieves the most favorable tradeoffs between fairness violation and test accuracy across all tested setups compared with state-of-the-art baselines for demographic parity, equalized odds, equal opportunity. These benefits are especially significant with small batch sizes and for non-binary classification with large number of sensitive attributes, making FERMI a practical, scalable fairness algorithm. The code for all of the experiments in this paper is available at: https://github.com/optimization-for-data-driven-science/FERMI.

View on arXiv PDF Code

Similar