[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3330393.3330399acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmsspConference Proceedingsconference-collections
research-article

Deep Neural Network Based Regression Approach for Acoustic Echo Cancellation

Published: 10 May 2019 Publication History

Abstract

An acoustic echo canceller (AEC) aims to remove the acoustic echo in the mixture signal received by the near-end microphone. The conventional method uses an adaptive finite impulse response (FIR) filter to identify a room impulse response (RIR)which is not robust to various wild scenarios. In this paper, we propose a deep neural network-based regression approach that directly estimates the amplitude spectrum of the near-end target signal from features extracted from the mixtures of near-end and far-end signals. Depend on the powerful modelling and generalizing ability of deep learning, the complex echo signal can be well eliminated. Experimental results show the effectiveness of the proposed method for echo removal in double-talk, background noise, RIR variation and nonlinear distortion scenarios. In addition, the proposed method generalizes well to real-life acoustic echoes recorded in vehicles.

References

[1]
Benesty, J., Gänsler, T., Morgan, D. R., Sondhi, M. M., & Gay, S. L. (2001). Advances in network and acoustic echo cancellation. Berlin: Springer.
[2]
Benesty, J., Paleologu, C., Gänsler, T., & Ciochină, S. (2011). A perspective on stereophonic acoustic echo cancellation (Vol. 4). Springer Science & Business Media.
[3]
D. Duttweiler. A Twelve-Channel Digital Echo Canceler. In IEEE Transactions on Communications vol. 26, no. 5, pp. 647--653, May 1978
[4]
Mahfoud Hamidia, Abderrahmane Amrouche. A new robust double-talk detector based on the Stockwell transform for acoustic echo cancellation. In Digital Signal Processing, Vol. 60, ISSN 1051-2004, Pages 99-112, 2017
[5]
Turbin, V., Gilloire, A., & Scalart, P. (1997, April). Comparison of three post-filtering algorithms for residual acoustic echo reduction. In icassp (p. 307). IEEE.
[6]
Schwarz, A., Hofmann, C., & Kellermann, W. (2013, October). Spectral feature-based nonlinear residual echo suppression. In Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on (pp. 1--4). IEEE.
[7]
Kuech, F., & Kellermann, W. (2007, April). Nonlinear residual echo suppression using a power filter model of the acoustic echo path. In Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on (Vol. 1, pp. I--73). IEEE.
[8]
Xu, Y., Du, J., Dai, L. R., & Lee, C. H. (2015). A regression approach to speech enhancement based on deep neural networks. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 23(1), 7--19.
[9]
Carbajal, G., Serizel, R., Vincent, E., & Humbert, E. (2018, April). Multiple-input neural network-based residual echo suppression. In ICASSP 2018-IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 1--5).
[10]
Valin, J. M. (2007). On adjusting the learning rate in frequency domain echo cancellation with double-talk. IEEE Transactions on Audio, Speech, and Language Processing, 15(3), 1030--1034.
[11]
Schwarz, A., Hofmann, C., & Kellermann, W. (2013, October). Spectral feature-based nonlinear residual echo suppression. In Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on (pp. 1--4). IEEE.
[12]
Enzner, G., Buchner, H., Favrot, A., & Kuech, F. (2014). Acoustic echo control. In Academic press library in signal processing (Vol. 4, pp. 807--877). Elsevier.
[13]
Vincent, E., Gribonval, R., & Févotte, C. (2006). Performance measurement in blind audio source separation. IEEE transactions on audio, speech, and language processing, 14(4), 1462--1469.
[14]
Zhang, H., & Wang, D. (2018). Deep Learning for Acoustic Echo Cancellation in Noisy and Double-Talk Scenarios. Training, 161(2), 322.
[15]
Rix, A. W., Beerends, J. G., Hollier, M. P., & Hekstra, A. P. (2001). Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs. In Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP'01). 2001 IEEE International Conference on (Vol. 2, pp. 749--752). IEEE.
[16]
Glorot, X., Bordes, A., & Bengio, Y. (2011, June). Deep sparse rectifier neural networks. In Proceedings of the fourteenth international conference on artificial intelligence and statistics (pp. 315--323).
[17]
Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167.
[18]
Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
[19]
Scheibler, R., Bezzam, E., & Dokmanić, I. (2018, April). Pyroomacoustics: A python package for audio room simulation and array processing algorithms. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 351--355). IEEE.
[20]
Allen, J. B., & Berkley, D. A. (1979). Image method for efficiently simulating small-room acoustics. The Journal of the Acoustical Society of America, 65(4), 943--950.
[21]
Lehmann, E. A., & Johansson, A. M. (2010). Diffuse reverberation model for efficient image-source simulation of room impulse responses. IEEE Transactions on Audio, Speech, and Language Processing, 18(6), 1429--1439.
[22]
Lamel, L. F., Kassel, R. H., & Seneff, S. (1989). Speech database development: Design and analysis of the acoustic-phonetic corpus. In Speech Input/Output Assessment and Speech Databases.

Cited By

View all
  • (2023)Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection2022 IEEE Spoken Language Technology Workshop (SLT)10.1109/SLT54892.2023.10022358(1052-1058)Online publication date: 9-Jan-2023
  • (2022)Time-Variance Aware Dynamic Kernel Generation for Real-Time Acoustic Echo CancellationIEEE Signal Processing Letters10.1109/LSP.2022.316435929(967-971)Online publication date: 2022
  • (2021)Deep Residual Echo Suppression With A Tunable Tradeoff Between Signal Distortion And Echo SuppressionICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP39728.2021.9414958(126-130)Online publication date: 6-Jun-2021
  • Show More Cited By

Index Terms

  1. Deep Neural Network Based Regression Approach for Acoustic Echo Cancellation

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICMSSP '19: Proceedings of the 2019 4th International Conference on Multimedia Systems and Signal Processing
    May 2019
    213 pages
    ISBN:9781450371711
    DOI:10.1145/3330393
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • Shenzhen University: Shenzhen University
    • Sun Yat-Sen University

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 May 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. deep learning
    2. echo cancellation
    3. neural network
    4. regression

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICMSSP 2019

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 28 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection2022 IEEE Spoken Language Technology Workshop (SLT)10.1109/SLT54892.2023.10022358(1052-1058)Online publication date: 9-Jan-2023
    • (2022)Time-Variance Aware Dynamic Kernel Generation for Real-Time Acoustic Echo CancellationIEEE Signal Processing Letters10.1109/LSP.2022.316435929(967-971)Online publication date: 2022
    • (2021)Deep Residual Echo Suppression With A Tunable Tradeoff Between Signal Distortion And Echo SuppressionICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP39728.2021.9414958(126-130)Online publication date: 6-Jun-2021
    • (2021)ICASSP 2021 Acoustic Echo Cancellation Challenge: Integrated Adaptive Echo Cancellation with Time Alignment and Deep Learning-Based Residual Echo Plus Noise SuppressionICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP39728.2021.9414462(146-150)Online publication date: 6-Jun-2021
    • (2021)A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer and Large Scale Synthetic DataICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP39728.2021.9413585(7128-7132)Online publication date: 6-Jun-2021
    • (2021)Textual Echo Cancellation2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)10.1109/ASRU51503.2021.9688214(548-555)Online publication date: 13-Dec-2021
    • (2021)A Conformer-Based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)10.1109/ASRU51503.2021.9687942(304-311)Online publication date: 13-Dec-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media