SD, AS · Feb 11, 2021

Onoma-to-wave: Environmental sound synthesis from onomatopoeic words

arXiv:2102.05872v4 · 16 citations
Originality: Synthesis-oriented
AI Analysis

The paper addresses a domain-specific problem in audio synthesis and human-computer interaction, offering an incremental improvement over existing techniques.

The paper tackles environmental sound synthesis from onomatopoeic words, proposing a sequence-to-sequence framework that, in the authors' subjective experiments, achieves higher diversity and naturalness than conventional methods that use sound event labels alone.

In this paper, we propose a framework for environmental sound synthesis from onomatopoeic words. As one way of expressing an environmental sound, we can use an onomatopoeic word, which is a character sequence for phonetically imitating a sound. An onomatopoeic word is effective for describing diverse sound features. Therefore, using onomatopoeic words for environmental sound synthesis will enable us to generate diverse environmental sounds. To generate diverse sounds, we propose a method based on a sequence-to-sequence framework for synthesizing environmental sounds from onomatopoeic words. We also propose a method of environmental sound synthesis using onomatopoeic words and sound event labels. The use of sound event labels in addition to onomatopoeic words enables us to capture each sound event's feature depending on the input sound event label. Our subjective experiments show that our proposed methods achieve higher diversity and naturalness than conventional methods using sound event labels.
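
To make the two input types in the abstract concrete, here is a minimal sketch of how an onomatopoeic word and an optional sound event label might be turned into model inputs. It treats the word as a character sequence, as the abstract describes. The vocabulary, the label set, and every function name are assumptions made for illustration; the paper's actual preprocessing and network are not shown here.

```python
# Hypothetical character vocabulary and event-label set (not taken from the paper).
CHARS = "abcdefghijklmnopqrstuvwxyz"
PAD_ID, EOS_ID = 0, 1
CHAR_TO_ID = {c: i + 2 for i, c in enumerate(CHARS)}
EVENT_LABELS = ["bell", "clock", "whistle"]


def encode_onomatopoeia(word):
    """Map a word to a list of character ids, terminated by EOS_ID.

    Characters outside the assumed vocabulary are skipped.
    """
    ids = [CHAR_TO_ID[c] for c in word.lower() if c in CHAR_TO_ID]
    return ids + [EOS_ID]


def one_hot_event(label):
    """Encode a sound event label as a one-hot vector over EVENT_LABELS."""
    vec = [0.0] * len(EVENT_LABELS)
    vec[EVENT_LABELS.index(label)] = 1.0
    return vec


def build_model_input(word, label=None):
    """Bundle the inputs a sequence-to-sequence encoder could consume.

    With label=None this corresponds to the word-only setting; with a label
    it corresponds to the word-plus-event-label setting.
    """
    return {
        "char_ids": encode_onomatopoeia(word),
        "event": one_hot_event(label) if label is not None else None,
    }


if __name__ == "__main__":
    print(build_model_input("ring"))
    print(build_model_input("ring", label="bell"))
```

In a full system, the character ids would feed the encoder and the decoder would emit acoustic features frame by frame; the event vector is one plausible way to condition that decoder on the sound event.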
