CLCYAug 1, 2022

Global Performance Disparities Between English-Language Accents in Automatic Speech Recognition

arXiv:2208.01157v27 citationsh-index: 44
Originality Incremental advance
AI Analysis

This reveals a novel form of bias in ASR that could exacerbate global inequalities, though it is incremental by expanding beyond known demographic factors.

The study investigated bias in automatic speech recognition (ASR) services based on geopolitical alignment, finding that performance is statistically significantly worse for speakers from countries less politically aligned with the United States, across all tested services.

Past research has identified discriminatory automatic speech recognition (ASR) performance as a function of the racial group and nationality of the speaker. In this paper, we expand the discussion beyond bias as a function of the individual national origin of the speaker to look for bias as a function of the geopolitical orientation of their nation of origin. We audit some of the most popular English language ASR services using a large and global data set of speech from The Speech Accent Archive, which includes over 2,700 speakers of English born in 171 different countries. We show that, even when controlling for multiple linguistic covariates, ASR service performance has a statistically significant relationship to the political alignment of the speaker's birth country with respect to the United States' geopolitical power. This holds for all ASR services tested. We discuss this bias in the context of the historical use of language to maintain global and political power.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes