Men Also Do Laundry: Multi-Attribute Bias Amplification
This addresses bias amplification in computer vision for fairness and deployment, but is incremental as it improves metrics rather than introducing a new mitigation method.
The paper tackles the problem of bias amplification in computer vision models, showing that existing metrics fail to capture multi-attribute correlations and can misleadingly indicate minimal bias, and proposes a new metric validated on gender bias in COCO and imSitu datasets, benchmarking mitigation methods.
As computer vision systems become more widely deployed, there is increasing concern from both the research community and the public that these systems are not only reproducing but amplifying harmful social biases. The phenomenon of bias amplification, which is the focus of this work, refers to models amplifying inherent training set biases at test time. Existing metrics measure bias amplification with respect to single annotated attributes (e.g., $\texttt{computer}$). However, several visual datasets consist of images with multiple attribute annotations. We show models can learn to exploit correlations with respect to multiple attributes (e.g., {$\texttt{computer}$, $\texttt{keyboard}$}), which are not accounted for by current metrics. In addition, we show current metrics can give the erroneous impression that minimal or no bias amplification has occurred as they involve aggregating over positive and negative values. Further, these metrics lack a clear desired value, making them difficult to interpret. To address these shortcomings, we propose a new metric: Multi-Attribute Bias Amplification. We validate our proposed metric through an analysis of gender bias amplification on the COCO and imSitu datasets. Finally, we benchmark bias mitigation methods using our proposed metric, suggesting possible avenues for future bias mitigation