End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation (2024)

research-article

Authors: Thomas Haubner, Andreas Brendel, and Walter Kellermann

IEEE/ACM Transactions on Audio, Speech, and Language Processing, Volume 32

Pages 227 - 238

https://doi.org/10.1109/TASLP.2023.3325923

Published: 19 October 2023



Abstract

The attenuation of acoustic loudspeaker echoes remains one of the open challenges in achieving pleasant full-duplex hands-free speech communication. In many modern signal enhancement interfaces, this problem is addressed by a linear acoustic echo canceler which subtracts a loudspeaker echo estimate from the recorded microphone signal. To obtain precise echo estimates, the parameters of the echo canceler, i.e., the filter coefficients, need to be estimated quickly and precisely from the observed loudspeaker and microphone signals. This requires a sophisticated adaptation control that copes with high-power double-talk and rapidly tracks time-varying acoustic environments, which are often encountered with portable devices. In this paper, we address this problem by end-to-end deep learning. In particular, we propose inferring the step-size for a least mean squares frequency-domain adaptive filter update with a Deep Neural Network (DNN). Two different step-size inference approaches are investigated: broadband approaches, which use a single DNN to jointly infer step-sizes for all frequency bands, and narrowband methods, which exploit individual DNNs per frequency band. The discussion of the benefits and disadvantages of both approaches leads to a novel hybrid approach which shows improved echo cancellation while requiring only small DNN architectures. Furthermore, we investigate the effect of different loss functions, signal feature vectors, and DNN output layer architectures on the echo cancellation performance, from which we obtain valuable insights into the general design and functionality of DNN-based adaptation control algorithms.
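To make the role of the step-size concrete, the following is a minimal sketch of a generic overlap-save frequency-domain adaptive filter in which each frequency bin receives its own step-size. It is not the authors' implementation: the function names `fdaf_echo_canceler` and `step_size_fn`, the smoothing constant, and the power initialization are assumptions made for illustration, and `step_size_fn` is only a hypothetical stand-in for a trained DNN.

```python
import numpy as np


def fdaf_echo_canceler(x, y, block_len, step_size_fn, smooth=0.9, eps=1e-8):
    """Overlap-save frequency-domain adaptive filter with per-bin step-sizes.

    x: loudspeaker signal, y: microphone signal (1-D arrays of equal length).
    step_size_fn(X, E) returns one step-size per frequency bin
    (shape: block_len + 1). It is a hypothetical placeholder for a DNN.
    Returns the error signal, i.e. the microphone signal minus the echo estimate.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    N = block_len
    n_bins = N + 1                        # rfft of a length-2N frame
    W = np.zeros(n_bins, dtype=complex)   # filter coefficients in the DFT domain
    # Loudspeaker power per bin, initialized from the average signal power
    # (a simplification chosen for this sketch).
    psd = np.full(n_bins, 2 * N * np.var(x) + eps)
    x_buf = np.zeros(2 * N)
    e_out = np.zeros(len(y))

    for b in range(min(len(x), len(y)) // N):
        seg = slice(b * N, (b + 1) * N)
        # Slide the loudspeaker buffer: previous block followed by current block.
        x_buf = np.concatenate([x_buf[N:], x[seg]])
        X = np.fft.rfft(x_buf)
        # Echo estimate: the last N samples of the circular convolution are valid.
        d_hat = np.fft.irfft(X * W, n=2 * N)[N:]
        e = y[seg] - d_hat
        e_out[seg] = e
        E = np.fft.rfft(np.concatenate([np.zeros(N), e]))
        # Recursive estimate of the loudspeaker power used for normalization.
        psd = smooth * psd + (1.0 - smooth) * np.abs(X) ** 2
        # Per-bin step-size; in a DNN-based scheme this is where inference would run.
        mu = step_size_fn(X, E)
        grad = np.fft.irfft(mu * np.conj(X) * E / (psd + eps), n=2 * N)
        grad[N:] = 0.0                    # gradient constraint (linear convolution)
        W = W + np.fft.rfft(grad)
    return e_out
```

Passing a constant function such as `lambda X, E: np.full(block_len + 1, 0.5)` gives a conventional normalized update. A learned controller would instead return small values in bins dominated by double-talk and larger values after an echo path change.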


Index Terms

End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation
1. Applied computing
  1. Arts and humanities
    1. Sound and music computing
2. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Neural networks
3. Hardware
  1. Communication hardware, interfaces and storage
    1. Signal processing systems
4. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval

Index terms have been assigned to the content through auto-classification.



Published In

IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 32 (2024), 2883 pages

ISSN: 2329-9290

EISSN: 2329-9304


Publisher

IEEE Press
