Connected Speech-Based Cognitive Assessment in Chinese and English
This work addresses the need for language-generalizable cognitive assessment tools, though it is incremental as it builds on existing speech-based methods with a new dataset.
The authors tackled the problem of assessing cognitive function through connected speech analysis by creating a bilingual benchmark dataset and baseline models for Mandarin Chinese and English speakers, achieving 59.2% unweighted average recall in diagnosis and a root mean squared error of 2.89 in score prediction.
We present a novel benchmark dataset and prediction tasks for investigating approaches to assess cognitive function through analysis of connected speech. The dataset consists of speech samples and clinical information for speakers of Mandarin Chinese and English with different levels of cognitive impairment as well as individuals with normal cognition. These data have been carefully matched by age and sex by propensity score analysis to ensure balance and representativity in model training. The prediction tasks encompass mild cognitive impairment diagnosis and cognitive test score prediction. This framework was designed to encourage the development of approaches to speech-based cognitive assessment which generalise across languages. We illustrate it by presenting baseline prediction models that employ language-agnostic and comparable features for diagnosis and cognitive test score prediction. The models achieved unweighted average recall was 59.2% in diagnosis, and root mean squared error of 2.89 in score prediction.