Aldo Aguilar

2papers

2 Papers

SDOct 25, 2021Code
Deep Learning Tools for Audacity: Helping Researchers Expand the Artist's Toolkit

Hugo Flores Garcia, Aldo Aguilar, Ethan Manilow et al.

We present a software framework that integrates neural networks into the popular open-source audio editing software, Audacity, with a minimal amount of developer effort. In this paper, we showcase some example use cases for both end-users and neural network developers. We hope that this work fosters a new level of interactivity between deep learning practitioners and end-users.

SDJul 14, 2021
Leveraging Hierarchical Structures for Few-Shot Musical Instrument Recognition

Hugo Flores Garcia, Aldo Aguilar, Ethan Manilow et al.

Deep learning work on musical instrument recognition has generally focused on instrument classes for which we have abundant data. In this work, we exploit hierarchical relationships between instruments in a few-shot learning setup to enable classification of a wider set of musical instruments, given a few examples at inference. We apply a hierarchical loss function to the training of prototypical networks, combined with a method to aggregate prototypes hierarchically, mirroring the structure of a predefined musical instrument hierarchy. These extensions require no changes to the network architecture and new levels can be easily added or removed. Compared to a non-hierarchical few-shot baseline, our method leads to a significant increase in classification accuracy and significant decrease mistake severity on instrument classes unseen in training.