ASAILGSDNov 22, 2024

Open-Amp: Synthetic Data Framework for Audio Effect Foundation Models

arXiv:2411.14972v15 citationsh-index: 18Has CodeICASSP
Originality Incremental advance
AI Analysis

This addresses a data bottleneck for researchers and practitioners in musical audio processing and Music Information Retrieval, though it is incremental as it builds on existing emulation software and focuses on a specific domain.

The paper tackles the problem of limited and low-diversity audio effects datasets by introducing Open-Amp, a synthetic data framework that crowdsources neural network emulations of guitar amplifiers and effects, enabling large-scale and diverse data generation. The result is new state-of-the-art performance on multiple guitar effects classification tasks and successful transfer to unseen analog effects via latent space manipulation.

This paper introduces Open-Amp, a synthetic data framework for generating large-scale and diverse audio effects data. Audio effects are relevant to many musical audio processing and Music Information Retrieval (MIR) tasks, such as modelling of analog audio effects, automatic mixing, tone matching and transcription. Existing audio effects datasets are limited in scope, usually including relatively few audio effects processors and a limited amount of input audio signals. Our proposed framework overcomes these issues, by crowdsourcing neural network emulations of guitar amplifiers and effects, created by users of open-source audio effects emulation software. This allows users of Open-Amp complete control over the input signals to be processed by the effects models, as well as providing high-quality emulations of hundreds of devices. Open-Amp can render audio online during training, allowing great flexibility in data augmentation. Our experiments show that using Open-Amp to train a guitar effects encoder achieves new state-of-the-art results on multiple guitar effects classification tasks. Furthermore, we train a one-to-many guitar effects model using Open-Amp, and use it to emulate unseen analog effects via manipulation of its learned latent space, indicating transferability to analog guitar effects data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes