LG ML · Sep 28, 2025

Demographic-Agnostic Fairness without Harm

Zhongteng Cai, Mohammad Mahdi Khalili, Xueru Zhang

arXiv:2509.24077v17.11 citations · h-index: 16 · Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society

Originality: Incremental advance

AI Analysis

The paper addresses fairness in high-stakes domains such as healthcare, where demographic data may be unavailable. It offers a method intended to avoid accuracy loss while ensuring fairness, though the advance appears incremental because it builds on existing preference-based fairness work.

The paper tackles the problem of achieving fairness in machine learning without requiring demographic information. It proposes a demographic-agnostic fairness without harm (DAFH) algorithm that jointly learns a partition of the population into groups and a separate classifier for each group, and it reports that the method can outperform baselines in experiments.

As machine learning (ML) algorithms are increasingly used in social domains to make predictions about humans, there is a growing concern that these algorithms may exhibit biases against certain social groups. Numerous notions of fairness have been proposed in the literature to measure the unfairness of ML. Among them, one class that receives the most attention is parity-based, i.e., achieving fairness by equalizing treatment or outcomes for different social groups. However, achieving parity-based fairness often comes at the cost of lowering model accuracy and is undesirable for many high-stakes domains like healthcare. To avoid inferior accuracy, a line of research focuses on preference-based fairness, under which any group of individuals would experience the highest accuracy and collectively prefer the ML outcomes assigned to them if they were given the choice between various sets of outcomes. However, these works assume individual demographic information is known and fully accessible during training. In this paper, we relax this requirement and propose a novel demographic-agnostic fairness without harm (DAFH) optimization algorithm, which jointly learns a group classifier that partitions the population into multiple groups and a set of decoupled classifiers associated with these groups. Theoretically, we conduct sample complexity analysis and show that our method can outperform the baselines when demographic information is known and used to train decoupled classifiers. Experiments on both synthetic and real data validate the proposed method.
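The abstract does not spell out the optimization procedure, so the following is only a rough illustration of the general idea of learning a group classifier together with decoupled per-group classifiers. It is not the authors' DAFH algorithm: it swaps in a simple alternating heuristic with two groups and plain logistic regression. All names (`fit_partitioned`, `predict_partitioned`, `make_data`), the synthetic data, and the final fallback to a pooled model are assumptions made for this sketch.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def score(w, x):
    # w holds one weight per feature followed by a bias term.
    return sum(wj * xj for wj, xj in zip(w, x)) + w[-1]

def predict(w, x):
    return 1 if score(w, x) >= 0 else 0

def log_loss(w, x, t):
    p = min(max(sigmoid(score(w, x)), 1e-12), 1 - 1e-12)
    return -(t * math.log(p) + (1 - t) * math.log(1 - p))

def accuracy(w, X, y):
    return sum(predict(w, x) == t for x, t in zip(X, y)) / max(len(X), 1)

def fit_logreg(X, y, dim, epochs=100, lr=0.5):
    # Batch gradient descent; an empty group yields the all-zero model.
    w = [0.0] * (dim + 1)
    n = len(X)
    if n == 0:
        return w
    for _ in range(epochs):
        grad = [0.0] * (dim + 1)
        for x, t in zip(X, y):
            err = sigmoid(score(w, x)) - t
            for j in range(dim):
                grad[j] += err * x[j]
            grad[dim] += err
        for j in range(dim + 1):
            w[j] -= lr * grad[j] / n
    return w

def subset(X, y, groups, k):
    Xk = [x for x, g in zip(X, groups) if g == k]
    yk = [t for t, g in zip(y, groups) if g == k]
    return Xk, yk

def fit_partitioned(X, y, rounds=5, seed=0):
    # Hypothetical alternating heuristic (assumes rounds >= 1).
    rng = random.Random(seed)
    dim = len(X[0])
    pooled = fit_logreg(X, y, dim)            # single model for everyone
    groups = [rng.randint(0, 1) for _ in X]   # random initial partition
    for _ in range(rounds):
        # (a) fit one decoupled classifier per current group
        clfs = [fit_logreg(*subset(X, y, groups, k), dim) for k in (0, 1)]
        # (b) each sample prefers the classifier with lower loss on it
        prefer = [0 if log_loss(clfs[0], x, t) <= log_loss(clfs[1], x, t) else 1
                  for x, t in zip(X, y)]
        # (c) group classifier sees features only, so it works without labels
        gclf = fit_logreg(X, prefer, dim)
        groups = [predict(gclf, x) for x in X]
    final = []
    for k in (0, 1):
        Xk, yk = subset(X, y, groups, k)
        hk = fit_logreg(Xk, yk, dim)
        # Assumed safeguard: keep the pooled model for a group if the
        # decoupled one is less accurate on that group's training data.
        final.append(hk if accuracy(hk, Xk, yk) >= accuracy(pooled, Xk, yk) else pooled)
    return gclf, final, pooled

def predict_partitioned(model, x):
    gclf, clfs, _ = model
    return predict(clfs[predict(gclf, x)], x)

def make_data(n=120, seed=1):
    # Synthetic data: the latent group is sign(x0); the label rule flips by group.
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        x0, x1 = rng.uniform(-1, 1), rng.uniform(-1, 1)
        X.append([x0, x1])
        y.append(int(x1 > 0) if x0 > 0 else int(x1 < 0))
    return X, y
```

Calling `fit_partitioned(*make_data())` returns the group classifier, the two per-group classifiers, and the pooled baseline. Because of the fallback step, each learned group's training accuracy is never below what the pooled model achieves on that group. This is a much weaker, training-set-only stand-in for the "without harm" property discussed in the abstract, and no demographic attribute is used at any point.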
