SDNCAug 25, 2020

ANGUS: Real-time manipulation of vocal roughness for emotional speech transformations

arXiv:2008.11241v1Has Code
AI Analysis

This work provides a tool for experimental emotion research and affective computing by enabling real-time manipulation of vocal roughness, though it is incremental as it builds on existing concepts for emotional speech transformation.

The authors tackled the problem of simulating vocal roughness for emotional speech transformations by developing ANGUS, a real-time algorithm using amplitude modulation and time-domain filtering, which achieved comparable emotional negativity to state-of-the-art methods and was indistinguishable from non-transformed sounds by listeners.

Vocal arousal, the non-linear acoustic features taken on by human and animal vocalizations when highly aroused, has an important communicative function because it signals aversive states such as fear, pain or distress. In this work, we present a computationally-efficient, real-time voice transformation algorithm, ANGUS, which uses amplitude modulation and time-domain filtering to simulate roughness, an important component of vocal arousal, in arbitrary voice recordings. In a series of 4 studies, we show that ANGUS allows parametric control over the spectral features of roughness like the presence of sub-harmonics and noise; that ANGUS increases the emotional negativity perceived by listeners, to a comparable level as a non-real-time analysis/resynthesis algorithm from the state-of-the-art; that listeners cannot distinguish transformed and non-transformed sounds above chance level; and that ANGUS has a similar emotional effect on animal vocalizations and musical instrument sounds than on human vocalizations. A real-time implementation of ANGUS is made available as open-source software, for use in experimental emotion reseach and affective computing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes