SDAILGASJun 18, 2023

MARBLE: Music Audio Representation Benchmark for Universal Evaluation

DeepMindMILA
arXiv:2306.10548v456 citationsh-index: 34Has Code
Originality Synthesis-oriented
AI Analysis

This addresses the problem of evaluating music AI models for researchers, though it is incremental as it builds on existing datasets and tasks.

The authors tackled the lack of a universal benchmark for music understanding in AI by introducing MARBLE, a comprehensive benchmark with 14 tasks across 8 datasets, which found that large-scale pre-trained models perform best but have room for improvement.

In the era of extensive intersection between art and Artificial Intelligence (AI), such as image generation and fiction co-creation, AI for music remains relatively nascent, particularly in music understanding. This is evident in the limited work on deep music representations, the scarcity of large-scale datasets, and the absence of a universal and community-driven benchmark. To address this issue, we introduce the Music Audio Representation Benchmark for universaL Evaluation, termed MARBLE. It aims to provide a benchmark for various Music Information Retrieval (MIR) tasks by defining a comprehensive taxonomy with four hierarchy levels, including acoustic, performance, score, and high-level description. We then establish a unified protocol based on 14 tasks on 8 public-available datasets, providing a fair and standard assessment of representations of all open-sourced pre-trained models developed on music recordings as baselines. Besides, MARBLE offers an easy-to-use, extendable, and reproducible suite for the community, with a clear statement on copyright issues on datasets. Results suggest recently proposed large-scale pre-trained musical language models perform the best in most tasks, with room for further improvement. The leaderboard and toolkit repository are published at https://marble-bm.shef.ac.uk to promote future music AI research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes