You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Low-Complexity Acoustic Echo Cancellation with Neural Kalman Filtering
Dong Yang*, Fei Jiang*, Wei Wu, Xuefei Fang, and Muyong Cao
Tencent Technology
(*Equal contribution)
Abstract
The Kalman filter has been adopted in acoustic echo cancellation due to its robustness to double-talk, fast convergence, and tracking ability. The performance of Kalman filter is closely related to the accuracy of the state noise covariance and the observation noise covariance, which are either manually designed or estimated according to the echo cancellation results. The estimation error may lead to unacceptable results, especially when the echo path suffers abrupt changes, the tracking performance of the Kalman filter could be degraded significantly. In this paper, we propose the neural kalman filtering (NKF), which uses neural networks to implicitly model the covariance of the state noise and observation noise and to output the Kalman gain in real-time. Experimental results on both synthetic test sets and real-recorded test sets show that, the proposed NKF has superior convergence and re-convergence performance while ensuring low near-end speech degradation comparing with the state-of-the-art Kalman filters. Moreover, the proposed model size is merely 5.3 K and the RTF is as low as 0.09, which indicates that the NKF can be deployed in low-resource platforms.
Results
Synthetic Test Set
(Averaged ERLE curves of the synthetic double-talk test set. Abrupt echo path change occurs at the shaded region. RNN-AEC refers to the baseline model of Interspeech 2021 AEC challenge.)
(Mel spectrograms of the first test sample below. Abrupt echo path change occurs at 4.2 s.)