ASSDOct 24, 2019

Meta-learning for robust child-adult classification from speech

arXiv:1910.11400v221 citations
Originality Incremental advance
AI Analysis

This work addresses a domain-specific challenge in clinical applications for autism diagnosis, offering incremental improvements in speaker classification robustness.

The paper tackled the problem of robust child-adult speaker classification in dyadic conversations for autism diagnosis, using meta-learning to improve generalizability over speaker and channel variability, achieving up to 14.53% relative improvement in F1-scores compared to state-of-the-art methods.

Computational modeling of naturalistic conversations in clinical applications has seen growing interest in the past decade. An important use-case involves child-adult interactions within the autism diagnosis and intervention domain. In this paper, we address a specific sub-problem of speaker diarization, namely child-adult speaker classification in such dyadic conversations with specified roles. Training a speaker classification system robust to speaker and channel conditions is challenging due to inherent variability in the speech within children and the adult interlocutors. In this work, we propose the use of meta-learning, in particular, prototypical networks which optimize a metric space across multiple tasks. By modeling every child-adult pair in the training set as a separate task during meta-training, we learn a representation with improved generalizability compared to conventional supervised learning. We demonstrate improvements over state-of-the-art speaker embeddings (x-vectors) under two evaluation settings: weakly supervised classification (up to 14.53% relative improvement in F1-scores) and clustering (up to relative 9.66% improvement in cluster purity). Our results show that protonets can potentially extract robust speaker embeddings for child-adult classification from speech.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes