ASCLSDMar 28, 2021

Quantifying Bias in Automatic Speech Recognition

arXiv:2103.15122v2112 citations
Originality Synthesis-oriented
AI Analysis

This work addresses bias in ASR systems, which is a critical issue for fairness in speech technology, but it is incremental as it focuses on quantifying bias in a specific system rather than proposing a novel solution.

The paper quantifies bias in a Dutch state-of-the-art automatic speech recognition system by analyzing word error rates and phoneme-level errors across gender, age, regional accents, and non-native accents, and suggests mitigation strategies.

Automatic speech recognition (ASR) systems promise to deliver objective interpretation of human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs struggle with the large variation in speech due to e.g., gender, age, speech impairment, race, and accents. Many factors can cause the bias of an ASR system. Our overarching goal is to uncover bias in ASR systems to work towards proactive bias mitigation in ASR. This paper is a first step towards this goal and systematically quantifies the bias of a Dutch SotA ASR system against gender, age, regional accents and non-native accents. Word error rates are compared, and an in-depth phoneme-level error analysis is conducted to understand where bias is occurring. We primarily focus on bias due to articulation differences in the dataset. Based on our findings, we suggest bias mitigation strategies for ASR development.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes