SDAIASMar 18, 2024

Notochord: a Flexible Probabilistic Model for Real-Time MIDI Performance

arXiv:2403.12000v14 citationsh-index: 5Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for low-latency, interpretable models in interactive music performance, though it is incremental as it builds on existing deep learning approaches for musical data.

The authors tackled the problem of real-time interactive music generation by developing Notochord, a deep probabilistic model for MIDI sequences, which achieves latency below ten milliseconds and supports diverse musical functions like steerable generation and harmonization.

Deep learning-based probabilistic models of musical data are producing increasingly realistic results and promise to enter creative workflows of many kinds. Yet they have been little-studied in a performance setting, where the results of user actions typically ought to feel instantaneous. To enable such study, we designed Notochord, a deep probabilistic model for sequences of structured events, and trained an instance of it on the Lakh MIDI dataset. Our probabilistic formulation allows interpretable interventions at a sub-event level, which enables one model to act as a backbone for diverse interactive musical functions including steerable generation, harmonization, machine improvisation, and likelihood-based interfaces. Notochord can generate polyphonic and multi-track MIDI, and respond to inputs with latency below ten milliseconds. Training code, model checkpoints and interactive examples are provided as open source software.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes