SD LGMay 9

Towards Trustworthy Audio Deepfake Detection: A Systematic Framework for Diagnosing and Mitigating Gender Bias

Aishwarya Fursule, Shruti Kshirsagar, Anderson R. Avila

arXiv:2605.0908774.3

Predicted impact top 41% in SD · last 90 daysOriginality Highly original

AI Analysis

For developers and deployers of audio deepfake detection systems, this work provides a systematic approach to diagnose and mitigate gender bias, highlighting the necessity of identifying bias sources before applying fixes.

This work presents the first diagnosis-first framework for identifying and mitigating gender bias in audio deepfake detection. Their diagnosis reveals bias stems from acoustic representation differences and gender leakage, not data imbalance, and their threshold adjustment reduces unfairness by 54-75% without accuracy loss.

Audio deepfake detection systems are increasingly deployed in high-stakes security applications, yet their fairness across demographic groups remains critically underexamined. Prior work measures gender disparity but does not investigate where it comes from or how to fix it systematically. We present the first diagnosis-first framework that identifies bias source before applying targeted mitigation, evaluated on two models, AASIST and Wav2Vec2+ResNet18, on ASVSpoof5. Our diagnosis shows that bias does not stem from imbalanced training data but from acoustic representation differences, gender leakage in learned features, and structural evaluation asymmetry. We test mitigation strategies across in-processing, post-processing and combined families, including novel methods introduced in this work. Adjusting the decision threshold separately per gender reduces unfairness by 54% to 75% at no cost to detection accuracy, and our new epoch-level fairness regularisation method outperforms existing per-batch approaches. Adversarial debiasing succeeds only when gender leakage is localised, and fails when it is diffuse, an outcome correctly predicted by our diagnosis before training. No single method fully closes the fairness gap, confirming that bias sources must be identified before fixes are applied and that fairer benchmark design is equally important

View on arXiv PDF

Similar