[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

A new robust double-talk detector based on the Stockwell transform for acoustic echo cancellation

Published: 01 January 2017 Publication History

Abstract

Despite great developments in the field of acoustic echo cancellation (AEC), the presence of double-talk remains difficult problem. The main role of double-talk detection (DTD) is to control adaptation of the filter coefficients by halting their update in double-talk situations. In this paper, we propose a new method of DTD based on a timefrequency analysis that uses the Stockwell transform (ST).The ST is a timefrequency spectral localization method that combines the characteristics of the short-time Fourier transform and the wavelet transform. This method provides better timefrequency resolution, especially for non-stationary signals. In the experimental tests, the normalized least mean squares (NLMS) algorithm is used to update the filter coefficients along with speech signals taken from the TIMIT database. The obtained results show better performance compared to existing methods in terms of misalignment convergence and speech intelligibility enhancement. Acoustic echo cancellation in communication systems based on adaptive filtering.A double-talk problem in acoustic echo cancellation process.The Stockwell transform for timefrequency analysis.Proposition of a new method of double-talk detection using the Stockwell transform.Evaluation of the proposed method, and discussion of the obtained results.

References

[1]
J. Benesty, T. Gnsler, D.R. Morgan, M.M. Sondhi, S.L. Gay, Advances in Network and Acoustic Echo Cancellation, Springer-Verlag, Berlin, Germany, 2001.
[2]
M. Hamidia, A. Amrouche, Influence of noisy channel on acoustic echo cancellation in mobile communication, in: 24th International Conference on Microelectronics, IEEE, Algeirs, Algeria, 2012, pp. 1-4.
[3]
M. Fukui, S. Shimauchi, Y. Hioka, A. Nakagawa, Y. Haneda, Double-talk robust acoustic echo cancellation for CD-quality hands-free videoconferencing system, IEEE Trans. Consum. Electron., 60 (2014) 468-475.
[4]
B. Widrow, S.D. Stearns, Adaptive Signal Processing, Prentice-Hall, Inc., Englewood Cliffs, NJ, 1985.
[5]
S. Haykin, Adaptive Filter Theory, Prentice-Hall, Englewood Cliffs, NJ, 2002.
[6]
K. Ozeki, T. Umeda, An adaptive filtering algorithm using an orthogonal projection to an affine subspace and its properties, Electron. Commun. Jpn., Part I, Commun., 67 (1984) 19-27.
[7]
I. Lee, C.E. Kinney, B. Lee, A.A. Kalker, Solving the acoustic echo cancellation problem in double-talk scenario using non-gaussianity of the near-end signal, in: Independent Component Analysis and Signal Separation, Springer, 2009, pp. 589-596.
[8]
E. Hnsler, G. Schmidt, Speech and Audio Processing in Adverse Environments, Springer Science & Business Media, 2008.
[9]
J. Benesty, D.R. Morgan, J.H. Cho, A new class of doubletalk detectors based on cross-correlation, IEEE Trans. Speech Audio Process., 8 (2000) 168-172.
[10]
T. Gnsler, J. Benesty, A frequencydomain double-talk detector based on a normalized cross-correlation vector, Signal Process., 81 (2001) 1783-1787.
[11]
J. Benesty, T. Gnsler, A multichannel acoustic echo canceler double-talk detector based on a normalized cross-correlation matrix, Eur. Trans. Telecommun., 13 (2002) 95-101.
[12]
T. Gnsler, J. Benesty, The fast normalized cross-correlation double-talk detector, Signal Process., 86 (2006) 1124-1139.
[13]
T. Gansler, M. Hansson, C.-J. Ivarsson, G. Salomonsson, A double-talk detector based on coherence, IEEE Trans. Commun., 44 (1996) 1421-1427.
[14]
I.J. Tashev, Coherence based double talk detector with soft decision, in: IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2012, pp. 165-168.
[15]
H. Bao, Y. Yang, J. Liu, X. Bao, Q. Yuan, A robust algorithm of double talk detection based on voice activity detection, in: International Conference on Audio Language and Image Processing, IEEE, 2010, pp. 12-15.
[16]
M.Z. Ikram, Double-talk detection in acoustic echo cancellers using zero-crossings rate, in: IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2015, pp. 1121-1125.
[17]
S. Cecchi, L. Romoli, F. Piazza, Multichannel double-talk detector based on fundamental frequency estimation, IEEE Signal Process. Lett., 23 (2016) 94-97.
[18]
A. Moukadem, Z. Bouguila, D.O. Abdeslam, A. Dieterlen, A new optimized Stockwell transform applied on synthetic and real non-stationary signals, Digit. Signal Process., 46 (2015) 226-238.
[19]
R.G. Stockwell, L. Mansinha, R. Lowe, Localization of the complex spectrum: the S transform, IEEE Trans. Signal Process., 44 (1996) 998-1001.
[20]
R. Stockwell, Why use the S-transform, in: AMS Pseudo-Differential Operators: Partial Differential Equations and TimeFrequency Analysis, 2007, pp. 279-309.
[21]
S. Roopa, S. Narasimhan, S-transform based on analytic discrete cosine transform for timefrequency analysis, Signal Process., 105 (2014) 207-215.
[22]
A. Benammar, R. Drai, A. Guessoum, Ultrasonic flaw detection using threshold modified S-transform, Ultrasonics, 54 (2014) 676-683.
[23]
J. Liu, D. Ma, M. Li, S-transform and its application in the spectrum analysis of seismic signal, in: Advances in Electronic Engineering, Communication and Management, vol. 1, Springer, 2012, pp. 81-86.
[24]
M.M. Sondhi, The history of echo cancellation, IEEE Signal Process. Mag., 23 (2006) 95-102.
[25]
K. Chen, P.-y. Xu, J. Lu, B.-l. Xu, An improved post-filter of acoustic echo canceller based on subband implementation, Appl. Acoust. (2009) 886-893.
[26]
M. Hamidia, A. Amrouche, Double-talk detector based on speech feature extraction for acoustic echo cancellation, in: 22nd International Conference on Software, Telecommunications and Computer Networks, SoftCOM, IEEE, Split, Croatie, 2014, pp. 393-397.
[27]
R. Le Bouquin-Jeanns, G. Faucon, Control of an adaptive echo canceller using a near-end speech detector, Signal Process., 81 (2001) 483-489.
[28]
M. Hamidia, A. Amrouche, Improved variable step-size NLMS adaptive filtering algorithm for acoustic echo cancellation, Digit. Signal Process., 49 (2016) 44-55.
[29]
L.R. Vega, H. Rey, A Rapid Introduction to Adaptive Filtering, Springer Science & Business Media, 2012.
[30]
D. Havelock, S. Kuwano, M. Vorlnder, Handbook of Signal Processing in Acoustics, Springer Science & Business Media, 2008.
[31]
S. Haykin, Adaptive Filter Theory, Prentice Hall Inc., New York, 1996.
[32]
K. Mohanaprasad, P. Arulmozhivarman, Wavelet based ICA using maximisation of non-Gaussianity for acoustic echo cancellation during double talk situation, Appl. Acoust., 97 (2015) 37-45.
[33]
A. Jain, S. Goel, K. Nathwani, R.M. Hegde, Robust acoustic echo cancellation using Kalman filter in double talk scenario, Speech Commun., 70 (2015) 65-75.
[34]
J. Gunther, Learning echo paths during continuous double-talk using semi-blind source separation, IEEE Trans. Audio Speech Lang. Process., 20 (2012) 646-660.
[35]
D.L. Duttweiler, A twelve-channel digital echo canceler, IEEE Trans. Commun., 26 (1978) 647-653.
[36]
G. Szwoch, A. Czyewski, M. Kulesza, A low complexity double-talk detector based on the signal envelope, Signal Process., 88 (2008) 2856-2862.
[37]
K.-H. Lee, J.-H. Chang, N.S. Kim, S. Kang, Y. Kim, Frequencydomain double-talk detection based on the Gaussian mixture model, IEEE Signal Process. Lett., 17 (2010) 453-456.
[38]
G.C. Sih, Doubletalk detection by means of spectral content, U.S. Patent and Trademark Office, Washington, DC, 1998.
[39]
K. Rahbar, Double talk detection method based on spectral acoustic properties, U.S. Patent and Trademark Office, Washington, DC, 2012.
[40]
S.Y. Low, S. Venkatesh, S. Nordholm, A spectral slit approach to doubletalk detection, IEEE Trans. Audio Speech Lang. Process., 20 (2012) 1074-1080.
[41]
C. Paleologu, J. Benesty, T. Gaensler, S. Ciochin, Class of double-talk detectors based on the holder inequality, in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2011, pp. 425-428.
[42]
G. Szwoch, A. Czyzewski, A. Ciarkowski, A double-talk detector using audio watermarking, J. Audio Eng. Soc., 57 (2009) 916-926.
[43]
A. Ciarkowski, A. Czyewski, Performance of watermarking-based DTD algorithm under time-varying echo path conditions, in: Intelligent Interactive Multimedia Systems and Services, Springer, Berlin, Heidelberg, 2010, pp. 69-78.
[44]
H. Urakami, Y. Kajikawa, A double-talk-detector using sound and image information, in: International Symposium on Communications and Information Technologies, 2010, pp. 447-452.
[45]
T.-A. Vu, H. Ding, M. Bouchard, A survey of double-talk detection schemes for echo cancellation applications, Can. Acoust., 32 (2004) 144-145.
[46]
V. Das, A. Kar, M. Chandra, Advanced adaptive algorithms for double talk detection in echo cancellers: a technical review, in: Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications, Springer, Bhubaneswar, India, 2015, pp. 297-305.
[47]
C. Schldt, F. Lindstrom, I. Claesson, A delay-based double-talk detector, IEEE Trans. Audio Speech Lang. Process., 20 (2012) 1725-1733.
[48]
M. Hamidia, A. Amrouche, A new structure for acoustic echo cancellation in double-talk scenario using auxiliary filter, in: 14th International Workshop on Acoustic Signal Enhancement, 2014, pp. 253-257.
[49]
M. Hamidia, A. Amrouche, Double-talk detection using the singular value decomposition for acoustic echo cancellation, in: IEEE International Conference on Communications, 2013, pp. 4745-4749.
[50]
W.M. Fisher, V. Zue, J. Bernstein, D.S. Pallett, An acousticphonetic data base, J. Acoust. Soc. Am., 81 (1987) S92-S93.
[51]
M. Djendi, A. Benallal, A. Guessoum, D. Berkani, Three new versions for the Newton type adaptive filtering algorithm, in: Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003, pp. 559-562.
[52]
M. Djendi, M. Bouchard, A. Guessoum, A. Benallal, D. Berkani, Improvement of the convergence speed and the tracking ability of the fast Newton type adaptive filtering (FNTF) algorithm, Signal Process., 86 (2006) 1704-1719.
[53]
J.H. Cho, D.R. Morgan, J. Benesty, An objective technique for evaluating doubletalk detectors in acoustic echo cancelers, IEEE Trans. Speech Audio Process., 7 (1999) 718-724.
[54]
ITU-T, P.862, Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs, 2001.

Cited By

View all
  • (2024)Stockwell-Transform based feature representation for detection and assessment of voice disordersInternational Journal of Speech Technology10.1007/s10772-024-10085-w27:1(101-119)Online publication date: 1-Mar-2024
  • (2023)Epoch Extraction from Telephonic Speech Signal using Stockwell TransformCircuits, Systems, and Signal Processing10.1007/s00034-023-02312-742:7(4238-4251)Online publication date: 26-Feb-2023
  • (2020)Fractional S-Transform and Its Properties: A Comprehensive SurveyWireless Personal Communications: An International Journal10.1007/s11277-020-07339-6113:4(2519-2541)Online publication date: 1-Aug-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Digital Signal Processing
Digital Signal Processing  Volume 60, Issue C
January 2017
189 pages

Publisher

Academic Press, Inc.

United States

Publication History

Published: 01 January 2017

Author Tags

  1. Acoustic echo cancellation
  2. Double-talk detection
  3. Geigel
  4. Singular value decomposition
  5. Stockwell transform

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Stockwell-Transform based feature representation for detection and assessment of voice disordersInternational Journal of Speech Technology10.1007/s10772-024-10085-w27:1(101-119)Online publication date: 1-Mar-2024
  • (2023)Epoch Extraction from Telephonic Speech Signal using Stockwell TransformCircuits, Systems, and Signal Processing10.1007/s00034-023-02312-742:7(4238-4251)Online publication date: 26-Feb-2023
  • (2020)Fractional S-Transform and Its Properties: A Comprehensive SurveyWireless Personal Communications: An International Journal10.1007/s11277-020-07339-6113:4(2519-2541)Online publication date: 1-Aug-2020
  • (2019)Deep Neural Network Based Regression Approach for Acoustic Echo CancellationProceedings of the 2019 4th International Conference on Multimedia Systems and Signal Processing10.1145/3330393.3330399(94-98)Online publication date: 10-May-2019
  • (2019)A New Approach of Adaptive Filtering Updating for Acoustic Echo CancellationSpeech and Computer10.1007/978-3-030-26061-3_16(150-159)Online publication date: 20-Aug-2019

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media