CLApr 30

AppTek Call-Center Dialogues: A Multi-Accent Long-Form Benchmark for English ASR

Eugen Beck, Sarah Beranek, Uma Moothiringote, Daniel Mann, Wilfried Michel, Katie Nguyen, Taylor Tragemann

arXiv:2604.2754331.6Has Code

Predicted impact top 42% in CL · last 90 daysOriginality Incremental advance

AI Analysis

It provides a new benchmark for evaluating English ASR robustness across diverse accents in conversational settings, addressing a gap in existing corpora.

The paper introduces the AppTek Call-Center Dialogues corpus, a multi-accent English ASR benchmark, and shows that ASR performance varies significantly across accents and segmentation methods, with good results on general American English not generalizing to other accents.

Evaluating English ASR systems for conversational AI applications remains difficult, as many publicly available corpora are either pre-segmented into short segments, consist of read or prepared speech, or lack explicit dialect annotations to evaluate robustness for a diverse user base. This work presents the AppTek Call-Center Dialogues corpus, a collection of spontaneous, role-played agent-customer conversations spanning fourteen English accents covering sixteen service-oriented scenarios. The dataset was commissioned specifically for evaluation and none of the audio or text was publicly available prior to release, reducing the risk of overlap with existing large-scale pretraining corpora. We benchmark a set of open-source ASR systems under different segmentation approaches. Results show substantial variation across accents and segmentation methods, indicating that good performance on general American English benchmarks does not necessarily generalize to other accents.

View on arXiv PDF

Similar