ASAICLApr 15, 2023

Evaluation of Speaker Anonymization on Emotional Speech

arXiv:2305.01759v110 citationsh-index: 23
Originality Synthesis-oriented
AI Analysis

This work addresses privacy threats in speech technology by assessing how anonymization affects emotional information, but it is incremental as it builds on existing methods and benchmarks.

The paper evaluated the speaker anonymization baseline system from the VoicePrivacy 2020 Challenge on emotional speech, finding that it fails to suppress speakers' emotions against informed attackers, with emotion recognition performance degrading by 15% relative to IEMOCAP data.

Speech data carries a range of personal information, such as the speaker's identity and emotional state. These attributes can be used for malicious purposes. With the development of virtual assistants, a new generation of privacy threats has emerged. Current studies have addressed the topic of preserving speech privacy. One of them, the VoicePrivacy initiative aims to promote the development of privacy preservation tools for speech technology. The task selected for the VoicePrivacy 2020 Challenge (VPC) is about speaker anonymization. The goal is to hide the source speaker's identity while preserving the linguistic information. The baseline of the VPC makes use of a voice conversion. This paper studies the impact of the speaker anonymization baseline system of the VPC on emotional information present in speech utterances. Evaluation is performed following the VPC rules regarding the attackers' knowledge about the anonymization system. Our results show that the VPC baseline system does not suppress speakers' emotions against informed attackers. When comparing anonymized speech to original speech, the emotion recognition performance is degraded by 15\% relative to IEMOCAP data, similar to the degradation observed for automatic speech recognition used to evaluate the preservation of the linguistic information.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes