LGAICYMLOct 4, 2025

Cost Efficient Fairness Audit Under Partial Feedback

arXiv:2510.03734v1h-index: 10
Originality Incremental advance
AI Analysis

This addresses the practical challenge of efficiently auditing fairness in high-stakes domains like lending, where full feedback is unavailable, though it builds incrementally on prior work on truncated samples and mixture models.

The paper tackles the problem of auditing classifier fairness when only partial feedback is available (true labels only for positively classified individuals), introducing a novel cost model for acquiring labeled data and developing algorithms that are more cost-effective than baselines. The results show algorithms outperform natural baselines by around 50% in audit cost on real-world datasets.

We study the problem of auditing the fairness of a given classifier under partial feedback, where true labels are available only for positively classified individuals, (e.g., loan repayment outcomes are observed only for approved applicants). We introduce a novel cost model for acquiring additional labeled data, designed to more accurately reflect real-world costs such as credit assessment, loan processing, and potential defaults. Our goal is to find optimal fairness audit algorithms that are more cost-effective than random exploration and natural baselines. In our work, we consider two audit settings: a black-box model with no assumptions on the data distribution, and a mixture model, where features and true labels follow a mixture of exponential family distributions. In the black-box setting, we propose a near-optimal auditing algorithm under mild assumptions and show that a natural baseline can be strictly suboptimal. In the mixture model setting, we design a novel algorithm that achieves significantly lower audit cost than the black-box case. Our approach leverages prior work on learning from truncated samples and maximum-a-posteriori oracles, and extends known results on spherical Gaussian mixtures to handle exponential family mixtures, which may be of independent interest. Moreover, our algorithms apply to popular fairness metrics including demographic parity, equal opportunity, and equalized odds. Empirically, we demonstrate strong performance of our algorithms on real-world fair classification datasets like Adult Income and Law School, consistently outperforming natural baselines by around 50% in terms of audit cost.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes