SDLGMMJun 19, 2017

Kapre: On-GPU Audio Preprocessing Layers for a Quick Implementation of Deep Neural Network Models with Keras

arXiv:1706.05781v174 citations
Originality Synthesis-oriented
AI Analysis

This solves a problem for researchers in music and audio deep learning by providing a quick implementation tool, though it is incremental as it builds on existing Keras frameworks.

The paper tackles the heavy and tedious preprocessing stage in music research using deep neural networks by introducing Kapre, Keras layers for audio and music signal preprocessing, and reports that real-time on-GPU preprocessing adds a reasonable amount of computation.

We introduce Kapre, Keras layers for audio and music signal preprocessing. Music research using deep neural networks requires a heavy and tedious preprocessing stage, for which audio processing parameters are often ignored in parameter optimisation. To solve this problem, Kapre implements time-frequency conversions, normalisation, and data augmentation as Keras layers. We report simple benchmark results, showing real-time on-GPU preprocessing adds a reasonable amount of computation.

Code Implementations7 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes