SDCVLGASAug 2, 2021

Musical Speech: A Transformer-based Composition Tool

arXiv:2108.01043v12 citations
Originality Incremental advance
AI Analysis

This provides musicians with a novel tool for creating music from speech without requiring paired training datasets.

The authors developed a transformer-based tool that generates musical outlines from user speech for composition, demonstrating its effectiveness through examples created by musicians.

In this paper, we propose a new compositional tool that will generate a musical outline of speech recorded/provided by the user for use as a musical building block in their compositions. The tool allows any user to use their own speech to generate musical material, while still being able to hear the direct connection between their recorded speech and the resulting music. The tool is built on our proposed pipeline. This pipeline begins with speech-based signal processing, after which some simple musical heuristics are applied, and finally these pre-processed signals are passed through Transformer models trained on new musical tasks. We illustrate the effectiveness of our pipeline -- which does not require a paired dataset for training -- through examples of music created by musicians making use of our tool.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes