SDASMay 31, 2021

EchoFilter: End-to-End Neural Network for Acoustic Echo Cancellation

arXiv:2105.14666v115 citations
Originality Incremental advance
AI Analysis

This work addresses echo cancellation for voice interaction systems, offering incremental improvements over linear adaptive filters by handling real-world distortions like noise and clipping.

The paper tackled acoustic echo cancellation in voice interaction by proposing an end-to-end neural network that processes time-domain audio waveforms, achieving improved performance in echo return loss enhancement and speech quality scores under noisy and nonlinear conditions.

Acoustic Echo Cancellation (AEC) whose aim is to suppress the echo originated from acoustic coupling between loudspeakers and microphones, plays a key role in voice interaction. Linear adaptive filter (AF) is always used for handling this problem. However, since there would be some severe effects in real scenarios, such nonlinear distortions, background noises, and microphone clipping, it would lead to considerable residual echo, giving poor performance in practice. In this paper, we propose an end-to-end network structure for echo cancellation, which is directly done on time-domain audio waveform. It is transformed to deep representation by temporal convolution, and modelled by Long Short-Term Memory (LSTM) for considering temporal property. Since time delay and severe reverberation may exist at the near-end with respect to the far-end, a local attention is employed for alignment. The network is trained using multitask learning by employing an auxiliary classification network for double-talk detection. Experiments show the superiority of our proposed method in terms of the echo return loss enhancement (ERLE) for single-talk periods and the perceptual evaluation of speech quality (PESQ) score for double-talk periods in background noise and nonlinear distortion scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes