CLCYLGMar 30, 2024

Addressing Both Statistical and Causal Gender Fairness in NLP Models

arXiv:2404.00463v131 citationsh-index: 5NAACL-HLT
Originality Incremental advance
AI Analysis

This work addresses fairness issues in NLP for users affected by biased AI systems, but it is incremental as it builds on existing debiasing techniques.

The paper tackled the problem of gender bias in NLP models by evaluating both statistical and causal debiasing methods, finding that individual methods reduce bias only in targeted metrics, but combinations of these techniques effectively reduce bias across both statistical and causal fairness metrics.

Statistical fairness stipulates equivalent outcomes for every protected group, whereas causal fairness prescribes that a model makes the same prediction for an individual regardless of their protected characteristics. Counterfactual data augmentation (CDA) is effective for reducing bias in NLP models, yet models trained with CDA are often evaluated only on metrics that are closely tied to the causal fairness notion; similarly, sampling-based methods designed to promote statistical fairness are rarely evaluated for causal fairness. In this work, we evaluate both statistical and causal debiasing methods for gender bias in NLP models, and find that while such methods are effective at reducing bias as measured by the targeted metric, they do not necessarily improve results on other bias metrics. We demonstrate that combinations of statistical and causal debiasing techniques are able to reduce bias measured through both types of metrics.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes