ASCLSDNov 5, 2024

Unified Pathological Speech Analysis with Prompt Tuning

arXiv:2411.04142v1h-index: 22
Originality Incremental advance
AI Analysis

This work addresses the need for efficient and connected analysis of pathological speech across multiple diseases, offering an incremental improvement over disease-specific models.

The paper tackles the problem of pathological speech analysis for multiple diseases by proposing a unified system using prompt tuning, which achieves competitive performance across Alzheimer's disease, depression, and Parkinson's disease while fine-tuning only a fraction of parameters.

Pathological speech analysis has been of interest in the detection of certain diseases like depression and Alzheimer's disease and attracts much interest from researchers. However, previous pathological speech analysis models are commonly designed for a specific disease while overlooking the connection between diseases, which may constrain performance and lower training efficiency. Instead of fine-tuning deep models for different tasks, prompt tuning is a much more efficient training paradigm. We thus propose a unified pathological speech analysis system for as many as three diseases with the prompt tuning technique. This system uses prompt tuning to adjust only a small part of the parameters to detect different diseases from speeches of possible patients. Our system leverages a pre-trained spoken language model and demonstrates strong performance across multiple disorders while only fine-tuning a fraction of the parameters. This efficient training approach leads to faster convergence and improved F1 scores by allowing knowledge to be shared across tasks. Our experiments on Alzheimer's disease, Depression, and Parkinson's disease show competitive results, highlighting the effectiveness of our method in pathological speech analysis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes