SD · AI · AS · Aug 1, 2024

Expressive MIDI-format Piano Performance Generation

arXiv:2408.00900v2 · 1 citation · h-index: 2
AI Analysis

This work tackles the problem of generating expressive symbolic music for musicians and AI researchers, though it appears incremental as it builds on existing neural network approaches for music generation.

The authors developed a generative neural network to create expressive piano performances in MIDI format, addressing common criticisms of symbolic music generation by producing vivid micro-timing, polyphonic texture, dynamics, and sustain pedal effects, though the model was not fully trained and sometimes generated incoherent results.

This work presents a generative neural network that can generate expressive piano performances in MIDI format. The musical expressivity is reflected in vivid micro-timing, rich polyphonic texture, varied dynamics, and sustain pedal effects. The model is innovative in many aspects, from data processing to neural network design. We claim that this symbolic music generation model overcomes the common criticisms of symbolic music and can generate expressive music flows as good as, if not better than, those produced by raw-audio generation. One drawback is that, due to the limited time before submission, the model is not fine-tuned or sufficiently trained, so the output may sound incoherent and random at certain points. Despite that, the model shows a strong ability to generate expressive piano pieces.
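The four expressive dimensions named in the abstract can each be read off symbolic note data. The following is a minimal illustrative sketch, not the authors' data representation or model: the `Note` class, the helper functions, and the 0.25-beat grid are all hypothetical names and choices made for this example. It relies only on standard MIDI conventions: velocities lie in 0-127, and sustain pedal is controller 64, with values of 64 or more meaning pedal down.

```python
from dataclasses import dataclass


@dataclass
class Note:
    """Hypothetical note record; not the paper's encoding."""
    pitch: int       # MIDI pitch number, 0-127
    onset: float     # onset time in beats
    duration: float  # length in beats
    velocity: int    # MIDI velocity, 1-127


def micro_timing(notes, grid=0.25):
    """Signed deviation of each onset from the nearest grid point, in beats."""
    return [n.onset - round(n.onset / grid) * grid for n in notes]


def max_polyphony(notes):
    """Largest number of notes sounding at the same time."""
    events = []
    for n in notes:
        events.append((n.onset, 1))
        events.append((n.onset + n.duration, -1))
    # At equal times, process note-offs (-1) before note-ons (+1).
    events.sort(key=lambda e: (e[0], e[1]))
    active = peak = 0
    for _, delta in events:
        active += delta
        peak = max(peak, active)
    return peak


def velocity_range(notes):
    """Softest and loudest velocity, a rough proxy for dynamic variety."""
    velocities = [n.velocity for n in notes]
    return min(velocities), max(velocities)


def pedal_down_spans(cc64_events):
    """Turn time-sorted (time, value) CC64 events into (start, end) pedal spans."""
    spans = []
    start = None
    for time, value in cc64_events:
        if value >= 64 and start is None:
            start = time
        elif value < 64 and start is not None:
            spans.append((start, time))
            start = None
    return spans


notes = [
    Note(pitch=60, onset=0.02, duration=1.0, velocity=50),
    Note(pitch=64, onset=0.00, duration=1.0, velocity=70),
    Note(pitch=67, onset=0.98, duration=0.5, velocity=90),
]
pedal = [(0.0, 127), (0.5, 100), (1.5, 0), (2.0, 64), (3.0, 10)]

deviations = micro_timing(notes)
polyphony = max_polyphony(notes)
dynamics = velocity_range(notes)
spans = pedal_down_spans(pedal)
```

A score quantized to the grid would yield all-zero deviations and often a single velocity, which is the kind of flatness that symbolic generation is commonly criticized for. A performance-level representation keeps these values free to vary.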

Code Implementations: 1 repo