CLJun 14, 2024

Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design

arXiv:2406.10304v17 citationsHas Code
Originality Synthesis-oriented
AI Analysis

It addresses a critical accessibility issue for dysarthric individuals in smart home environments, though the approach is incremental as it adapts existing methods to a new dataset.

This paper tackles the problem of voice wake-up word spotting for individuals with dysarthria, a motor speech disorder, by releasing the open-source Mandarin Dysarthria Speech Corpus (MDSC) and developing a customized system that achieves exceptional performance and robustness in handling speech intelligibility.

Smart home technology has gained widespread adoption, facilitating effortless control of devices through voice commands. However, individuals with dysarthria, a motor speech disorder, face challenges due to the variability of their speech. This paper addresses the wake-up word spotting (WWS) task for dysarthric individuals, aiming to integrate them into real-world applications. To support this, we release the open-source Mandarin Dysarthria Speech Corpus (MDSC), a dataset designed for dysarthric individuals in home environments. MDSC encompasses information on age, gender, disease types, and intelligibility evaluations. Furthermore, we perform comprehensive experimental analysis on MDSC, highlighting the challenges encountered. We also develop a customized dysarthria WWS system that showcases robustness in handling intelligibility and achieving exceptional performance. MDSC will be released on https://www.aishelltech.com/AISHELL_6B.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes