SEMar 12

How Fair is Software Fairness Testing?

Ann Barcomb, Mariana Pinheiro Bento, Giuseppe Destefanis, Sherlock Licorish, Cleyton Magalhães, Ronnie de Souza Santos, Mairieli Wessel

arXiv:2603.1251164.6

Predicted impact top 32% in SE · last 90 daysOriginality Synthesis-oriented

AI Analysis

This is a vision paper that highlights cultural and ethical limitations in fairness testing for AI systems, which could impact developers, researchers, and affected communities.

The paper critiques software fairness testing for treating fairness as universal rather than culturally situated, identifying issues with fairness metrics, datasets, and ethical concerns like data labeling practices and environmental impacts. It calls for rethinking fairness testing to respect cultural plurality and acknowledge the right to refuse algorithmic mediation.

Software fairness testing is a central method for evaluating AI systems, yet the meaning of fairness is often treated as fixed and universally applicable. This vision paper positions fairness testing as culturally situated and examines the problem across three dimensions. First, fairness metrics encode particular cultural values while marginalizing others. Second, test datasets are predominantly designed from Western contexts, excluding knowledge systems grounded in oral traditions, Indigenous languages, and non-digital communities. Third, fairness testing raises ethical concerns, including the reliance on low-paid data labeling in the Global South, and associated with this, the environmental costs of training and deploying large-scale models, which disproportionately affect climate-vulnerable populations. Addressing these issues requires rethinking fairness testing beyond universal metrics and moving toward evaluation frameworks that respect cultural plurality and acknowledge the right to refuse algorithmic mediation.

View on arXiv PDF

Similar