SDAILGDec 12, 2016

A Unit Selection Methodology for Music Generation Using Deep Neural Networks

arXiv:1612.03789v158 citations
Originality Incremental advance
AI Analysis

This work addresses music generation for computational creativity, but it is incremental as it builds on existing unit selection and deep learning techniques.

The paper tackled music generation by proposing a unit selection and concatenation method using deep neural networks, achieving competitive results in objective metrics and subjective listening tests compared to a note-level baseline.

Several methods exist for a computer to generate music based on data including Markov chains, recurrent neural networks, recombinancy, and grammars. We explore the use of unit selection and concatenation as a means of generating music using a procedure based on ranking, where, we consider a unit to be a variable length number of measures of music. We first examine whether a unit selection method, that is restricted to a finite size unit library, can be sufficient for encompassing a wide spectrum of music. We do this by developing a deep autoencoder that encodes a musical input and reconstructs the input by selecting from the library. We then describe a generative model that combines a deep structured semantic model (DSSM) with an LSTM to predict the next unit, where units consist of four, two, and one measures of music. We evaluate the generative model using objective metrics including mean rank and accuracy and with a subjective listening test in which expert musicians are asked to complete a forced-choiced ranking task. We compare our model to a note-level generative baseline that consists of a stacked LSTM trained to predict forward by one note.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes