SDAIASJul 10, 2019

Explicitly Conditioned Melody Generation: A Case Study with Interdependent RNNs

arXiv:1907.05208v114 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of generating higher-quality symbolic music for applications in music composition and AI creativity, though it is incremental as it builds on existing recurrent neural network methods by adding explicit conditioning.

The study investigated the impact of explicitly conditioning deep generative models with musically relevant information on monophonic melody generation, finding that such conditioning significantly improves learning and performance, as indicated by objective evaluations across two music genres and an informal subjective assessment of aesthetic quality.

Deep generative models for symbolic music are typically designed to model temporal dependencies in music so as to predict the next musical event given previous events. In many cases, such models are expected to learn abstract concepts such as harmony, meter, and rhythm from raw musical data without any additional information. In this study, we investigate the effects of explicitly conditioning deep generative models with musically relevant information. Specifically, we study the effects of four different conditioning inputs on the performance of a recurrent monophonic melody generation model. Several combinations of these conditioning inputs are used to train different model variants which are then evaluated using three objective evaluation paradigms across two genres of music. The results indicate musically relevant conditioning significantly improves learning and performance, and reveal how this information affects learning of musical features related to pitch and rhythm. An informal subjective evaluation suggests a corresponding improvement in the aesthetic quality of generations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes