ASSDJun 3, 2021

Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors

arXiv:2106.01902v21 citations
AI Analysis

This work addresses speech enhancement for applications like hearing aids or communication systems, but it is incremental as it builds on an existing beamformer method.

The paper tackled the problem of joint multi-channel dereverberation and noise reduction by generalizing a convolutional beamformer with an lp-norm cost function to control speech sparsity, resulting in improved objective speech quality metrics on the REVERB challenge dataset.

Recently, the convolutional weighted power minimization distortionless response (WPD) beamformer was proposed, which unifies multi-channel weighted prediction error dereverberation and minimum power distortionless response beamforming. To optimize the convolutional filter, the desired speech component is modeled with a time-varying Gaussian model, which promotes the sparsity of the desired speech component in the short-time Fourier transform domain compared to the noisy microphone signals. In this paper we generalize the convolutional WPD beamformer by using an lp-norm cost function, introducing an adjustable shape parameter which enables to control the sparsity of the desired speech component. Experiments based on the REVERB challenge dataset show that the proposed method outperforms the conventional convolutional WPD beamformer in terms of objective speech quality metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes