SDAILGASJan 25, 2025

Robust Cross-Etiology and Speaker-Independent Dysarthric Speech Recognition

arXiv:2501.14994v16 citationsh-index: 44ICASSP
Originality Incremental advance
AI Analysis

This addresses the problem of limited generalizability in dysarthric speech recognition systems for individuals with speech impairments, though it is incremental as it builds on existing Whisper models.

The paper tackled robust dysarthric speech recognition by developing a speaker-independent model using the Whisper model, achieving a CER of 6.99% and WER of 10.71% on the SAP-1005 dataset and demonstrating cross-etiology performance with CER of 25.08% and WER of 39.56% on the TORGO dataset.

In this paper, we present a speaker-independent dysarthric speech recognition system, with a focus on evaluating the recently released Speech Accessibility Project (SAP-1005) dataset, which includes speech data from individuals with Parkinson's disease (PD). Despite the growing body of research in dysarthric speech recognition, many existing systems are speaker-dependent and adaptive, limiting their generalizability across different speakers and etiologies. Our primary objective is to develop a robust speaker-independent model capable of accurately recognizing dysarthric speech, irrespective of the speaker. Additionally, as a secondary objective, we aim to test the cross-etiology performance of our model by evaluating it on the TORGO dataset, which contains speech samples from individuals with cerebral palsy (CP) and amyotrophic lateral sclerosis (ALS). By leveraging the Whisper model, our speaker-independent system achieved a CER of 6.99% and a WER of 10.71% on the SAP-1005 dataset. Further, in cross-etiology settings, we achieved a CER of 25.08% and a WER of 39.56% on the TORGO dataset. These results highlight the potential of our approach to generalize across unseen speakers and different etiologies of dysarthria.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes