CLDec 20, 2022

Trustworthy Social Bias Measurement

arXiv:2212.11672v115 citationsh-index: 102
Originality Highly original
AI Analysis

This work addresses the need for reliable social bias measurement in NLP, offering a foundational approach that could improve fairness assessments across applications.

The paper tackles the problem of designing trustworthy measures of social bias in NLP by proposing a measurement framework grounded in social science, which is validated through a rigorous testing protocol showing evidence of overcoming prior deficiencies.

How do we design measures of social bias that we trust? While prior work has introduced several measures, no measure has gained widespread trust: instead, mounting evidence argues we should distrust these measures. In this work, we design bias measures that warrant trust based on the cross-disciplinary theory of measurement modeling. To combat the frequently fuzzy treatment of social bias in NLP, we explicitly define social bias, grounded in principles drawn from social science research. We operationalize our definition by proposing a general bias measurement framework DivDist, which we use to instantiate 5 concrete bias measures. To validate our measures, we propose a rigorous testing protocol with 8 testing criteria (e.g. predictive validity: do measures predict biases in US employment?). Through our testing, we demonstrate considerable evidence to trust our measures, showing they overcome conceptual, technical, and empirical deficiencies present in prior measures.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes