SDAILGMMASApr 17, 2024

Music Enhancement with Deep Filters: A Technical Report for The ICASSP 2024 Cadenza Challenge

arXiv:2404.11116v12 citationsh-index: 372024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)
Originality Synthesis-oriented
AI Analysis

This work addresses music enhancement for audio processing applications, but it is incremental as it builds on existing methods.

The authors tackled music enhancement by incorporating deep filters from DeepfilterNet into a Spec-UNet-based network to improve a hybrid Demucs pipeline, resulting in incremental improvements in Signal-to-Distortion Ratio (SDR) and Hearing Aid Audio Quality Index (HAAQI) metrics.

In this challenge, we disentangle the deep filters from the original DeepfilterNet and incorporate them into our Spec-UNet-based network to further improve a hybrid Demucs (hdemucs) based remixing pipeline. The motivation behind the use of the deep filter component lies at its potential in better handling temporal fine structures. We demonstrate an incremental improvement in both the Signal-to-Distortion Ratio (SDR) and the Hearing Aid Audio Quality Index (HAAQI) metrics when comparing the performance of hdemucs against different versions of our model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes