SDAIASOct 21, 2023

Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions

arXiv:2310.14040v111 citationsh-index: 17
Originality Incremental advance
AI Analysis

This work addresses the incremental challenge of emotion control and efficiency in symbolic music generation for applications in creative AI and music technology.

The authors tackled the problem of generating symbolic music with controlled emotions and the slow sampling of diffusion models, achieving a model that can generate music with desired emotions and reduces computational cost by requiring only four denoising steps compared to thousands in current state-of-the-art methods.

Diffusion models have shown promising results for a wide range of generative tasks with continuous data, such as image and audio synthesis. However, little progress has been made on using diffusion models to generate discrete symbolic music because this new class of generative models are not well suited for discrete data while its iterative sampling process is computationally expensive. In this work, we propose a diffusion model combined with a Generative Adversarial Network, aiming to (i) alleviate one of the remaining challenges in algorithmic music generation which is the control of generation towards a target emotion, and (ii) mitigate the slow sampling drawback of diffusion models applied to symbolic music generation. We first used a trained Variational Autoencoder to obtain embeddings of a symbolic music dataset with emotion labels and then used those to train a diffusion model. Our results demonstrate the successful control of our diffusion model to generate symbolic music with a desired emotion. Our model achieves several orders of magnitude improvement in computational cost, requiring merely four time steps to denoise while the steps required by current state-of-the-art diffusion models for symbolic music generation is in the order of thousands.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes