SD LG ASJun 25, 2021

Deep Residual Echo Suppression with A Tunable Tradeoff Between Signal Distortion and Echo Suppression

arXiv:2106.13531v114.210 citations

Originality Incremental advance

AI Analysis

This work addresses echo cancellation for on-device audio processing, presenting an incremental improvement with tunable parameters.

The paper tackles the problem of residual echo suppression in double-talk scenarios by proposing a UNet-based method that allows a tunable tradeoff between signal distortion and echo suppression, achieving performance that meets timing and computational constraints for on-device applications.

In this paper, we propose a residual echo suppression method using a UNet neural network that directly maps the outputs of a linear acoustic echo canceler to the desired signal in the spectral domain. This system embeds a design parameter that allows a tunable tradeoff between the desired-signal distortion and residual echo suppression in double-talk scenarios. The system employs 136 thousand parameters, and requires 1.6 Giga floating-point operations per second and 10 Mega-bytes of memory. The implementation satisfies both the timing requirements of the AEC challenge and the computational and memory limitations of on-device applications. Experiments are conducted with 161~h of data from the AEC challenge database and from real independent recordings. We demonstrate the performance of the proposed system in real-life conditions and compare it with two competing methods regarding echo suppression and desired-signal distortion, generalization to various environments, and robustness to high echo levels.

View on arXiv PDF

Similar