SDMay 9, 2016

Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks

arXiv:1605.02427v1126 citations
Originality Incremental advance
AI Analysis

This addresses a practical problem for speech processing systems by moving beyond single-noise scenarios, though it is incremental as it builds on existing DNN methods.

The paper tackles speech enhancement in real-world conditions with multiple simultaneous noises, proposing DNN-based strategies and a psychoacoustic training approach, achieving improved speech quality in office environments.

In this paper we consider the problem of speech enhancement in real-world like conditions where multiple noises can simultaneously corrupt speech. Most of the current literature on speech enhancement focus primarily on presence of single noise in corrupted speech which is far from real-world environments. Specifically, we deal with improving speech quality in office environment where multiple stationary as well as non-stationary noises can be simultaneously present in speech. We propose several strategies based on Deep Neural Networks (DNN) for speech enhancement in these scenarios. We also investigate a DNN training strategy based on psychoacoustic models from speech coding for enhancement of noisy speech

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes