[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article
Free access

Time delay estimation in room acoustic environments: an overview

Published: 01 January 2006 Publication History

Abstract

Time delay estimation has been a research topic of significant practical importance in many fields (radar, sonar, seismology, geophysics, ultrasonics, hands-free communications, etc.). It is a first stage that feeds into subsequent processing blocks for identifying, localizing, and tracking radiating sources. This area has made remarkable advances in the past few decades, and is continuing to progress, with an aim to create processors that are tolerant to both noise and reverberation. This paper presents a systematic overview of the state-of-the-art of time-delay-estimation algorithms ranging from the simple cross-correlation method to the advanced blind channel identification based techniques. We discuss the pros and cons of each individual algorithm, and outline their inherent relationships. We also provide experimental results to illustrate their performance differences in room acoustic environments where reverberation and noise are commonly encountered.

References

[1]
{1} J. E. Ehrenberg, T. E. Ewart, and R. D. Morris, "Signal-processing techniques for resolving individual pulses in a multipath signal," Journal of the Acoustical Society of America, vol. 63, no. 6, pp. 1861-1865, 1978.
[2]
{2} N. L. Owsley and G. R. Swope, "Time delay estimation in a sensor array," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 519-523, 1981.
[3]
{3} R. J. Tremblay, G. C. Carter, and D. W. Lytle, "A practical approach to the estimation of amplitude and time-delay parameters of a composite signal," IEEE Journal of Oceanic Engineering , vol. 12, no. 1, pp. 273-278, 1987.
[4]
{4} R. Wu, J. Li, and Z.-S. Liu, "Super resolution time delay estimation via MODE-WRELAX," IEEE Transactions on Aerospace and Electronic Systems, vol. 35, no. 1, pp. 294-307, 1999.
[5]
{5} C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, no. 4, pp. 320-327, 1976.
[6]
{6} G. C. Carter, "Time delay estimation for passive sonar signal processing," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 463-470, 1981.
[7]
{7} G. C. Carter, "Coherence and time delay estimation," in Signal Processing Handbook, C. H. Chen, Ed., pp. 443-482, Marcel Dekker, New York, NY, USA, 1988.
[8]
{8} A. H. Quazi, "An overview on the time delay estimate in active and passive systems for target localization," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 527-533, 1981.
[9]
{9} G. C. Carter, Ed., Coherence and Time Delay Estimation: An Applied Tutorial for Research, Development, Test and Evaluation Engineers, IEEE Press, New York, NY, USA, 1993.
[10]
{10} M. Feder and E. Weinstein, "Parameter estimation of superimposed signals using the EM algorithm," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 36, no. 4, pp. 477-489, 1988.
[11]
{11} G. Su and M. Morf, "The signal subspace approach for multiple wide-band emitter location," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 31, no. 6, pp. 1502-1522, 1983.
[12]
{12} S. S. Reddi, "Multiple source location--a digital approach," IEEE Transactions on Aerospace and Electronic Systems, vol. 15, no. 1, pp. 95-105, 1979.
[13]
{13} T. G. Manickam, R. J. Vaccaro, and D. W. Tufts, "A least-squares algorithm for multipath time-delay estimation," IEEE Transactions on Signal Processing, vol. 42, no. 11, pp. 3229-3233, 1994.
[14]
{14} J.-J. Fuchs, "Multipath time-delay detection and estimation," IEEE Transactions on Signal Processing, vol. 47, no. 1, pp. 237-243, 1999.
[15]
{15} J. Benesty, "Adaptive eigenvalue decomposition algorithm for passive acoustic source localization," Journal of the Acoustical Society of America, vol. 107, no. 1, pp. 384-391, 2000.
[16]
{16} S. Doclo and M. Moonen, "Robust adaptive time delay estimation for speaker localization in noisy and reverberantacoustic environments," EURASIP Journal on Applied Signal Processing, vol. 2003, no. 11, pp. 1110-1124, 2003.
[17]
{17} T. G. Dvorkind and S. Gannot, "Approaches for time different of arrival estimation in a noisy and reververant environment," in Proceedings of International Workshop on Acoustic Echo and Noise Control (IWAENC '03), pp. 215-218, Kyoto, Japan, September 2003.
[18]
{18} J. C. Hassab and R. E. Boucher, "Performance of the generalized cross correlator in the presence of a strong spectral peak in the signal," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 549-555, 1981.
[19]
{19} L. E. Miller and J. S. Lee, "Error analysis of time delay estimation using a finite integration time correlator," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 490-496, 1981.
[20]
{20} J. P. Ianniello, "Time delay estimation via cross-correlation in the presence of large estimation errors," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 30, no. 6, pp. 998-1003, 1982.
[21]
{21} M. Azaria and D. Hertz, "Time delay estimation by generalized cross correlation methods," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 32, no. 2, pp. 280-285, 1984.
[22]
{22} Y. Bar-Shalom, F. Palmieri, A. Kumar, and H. M. Shertukde, "Analysis of wide-band cross correlation for time-delay estimation," IEEE Transactions on Signal Processing, vol. 41, no. 1, pp. 385-387, 1993.
[23]
{23} J. K. Tugnait, "Time delay estimation with unknown spatially correlated Gaussian noise," IEEE Transactions on Signal Processing , vol. 41, no. 2, pp. 549-558, 1993.
[24]
{24} Y. Wu, "Time delay estimation of non-Gaussian signal in unknown Gaussian noises using third-order cumulants," Electronics Letters, vol. 38, no. 16, pp. 930-931, 2002.
[25]
{25} Y. (Arden) Huang and J. Benesty, "A class of frequency-domain adaptive approaches to blind multichannel identification," IEEE Transactions on Signal Processing, vol. 51, no. 1, pp. 11-24, 2003.
[26]
{26} F. A. Reed, P. L. Feintuch, and N. J. Bershad, "Time delay estimation using the LMS adaptive filter--static behavior," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 561-571, 1981.
[27]
{27} D. M. Etter and S. D. Stearns, "Adaptive estimation of time delays in sampled data systems," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 582-587, 1981.
[28]
{28} D. H. Youn, N. Ahmed, and G. C. Carter, "On using the LMS algorithm for time delay estimation," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 30, no. 5, pp. 798-801, 1982.
[29]
{29} P. C. Ching and Y. T. Chan, "Adaptive time delay estimation with constraints," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 36, no. 4, pp. 599-602, 1988.
[30]
{30} H. C. So, P. C. Ching, and Y. T. Chan, "A new algorithm for explicit adaptation of time delay," IEEE Transactions on Signal Processing, vol. 42, no. 7, pp. 1816-1820, 1994.
[31]
{31} P. P. Moghaddam, H. Amindavar, and R. L. Kirlin, "A new time-delay estimation in multipath," IEEE Transactions on Signal Processing, vol. 51, no. 5, pp. 1129-1142, 2003.
[32]
{32} J. P. Ianniello, "Large and small error performance limits for multipath time delay estimation," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 34, no. 2, pp. 245-251, 1986.
[33]
{33} J. C. Hassab, "Contact localization and motion analysis in the ocean environment: a perspective," IEEE Journal of Oceanic Engineering, vol. 8, no. 3, pp. 136-147, 1983.
[34]
{34} F. El-Hawary, F. Aminzadeh, and G. A. N. Mbamalu, "The generalized Kalman filter approach to adaptive underwater target tracking," IEEE Journal of Oceanic Engineering, vol. 17, no. 1, pp. 129-137, 1992.
[35]
{35} C. S. Clay and H. Medwin, Acoustical Oceanography, John Wiley & Sons, New York, NY, USA, 1977.
[36]
{36} A. Stéphenne and B. Champagne, "Cepstral prefiltering for time delay estimation in reverberant environments," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '95), vol. 5, pp. 3055-3058, Detroit, Mich, USA, May 1995.
[37]
{37} M. S. Brandstein and H. F. Silverman, "A robust method for speech signal time-delay estimation in reverberant rooms," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97), vol. 1, pp. 375-378, Munich, Germany, April 1997.
[38]
{38} T. G. Dvorkind and S. Gannot, "Time difference of arrival estimation of speech source in a noisy and reverberant environment," Signal Processing, vol. 85, no. 1, pp. 177-204, 2005.
[39]
{39} Y. (Arden) Huang and J. Benesty, "Adaptive multichannel time delay estimation based on blind system identification for acoustic source localization," in Adaptive Signal Processing-- Applications to Real-World Problems, J. Benesty and Y. (Arden) Huang, Eds., chapter 8, pp. 227-248, Springer, Berlin, Germany, 2003.
[40]
{40} G. Jacovitti and G. Scarano, "Discrete time techniques for time delay estimation," IEEE Transactions on Signal Processing, vol. 41, no. 2, pp. 525-533, 1993.
[41]
{41} G. Jacovitti, A. Neri, and R. Cusani, "On a fast digital method of estimating the autocorrelation of a Gaussian stationary process," IEEE Transactions on Acoustics, Speech, and Signal Processing , vol. 32, no. 5, pp. 968-976, 1984.
[42]
{42} G. Jacovitti and R. Cusani, "An efficient technique for high correlation estimation," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 35, no. 5, pp. 654-660, 1987.
[43]
{43} J. Chen, J. Benesty, and Y. (Arden) Huang, "Performance of GCC- and AMDF-based time-delay estimation in practical reverberant environments," EURASIP Journal on Applied Signal Processing, vol. 2005, no. 1, pp. 25-36, 2005.
[44]
{44} G. C. Carter, A. H. Nuttall, and P. G. Cable, "The smoothed coherence transform," Proceedings of the IEEE, vol. 61, no. 10, pp. 1497-1498, 1973.
[45]
{45} P. R. Roth, "Effective measurements using digital signal analysis," IEEE Spectrum, vol. 8, no. 4, pp. 62-70, 1971.
[46]
{46} H. Wang and P. Chu, "Voice source localization for automatic camera pointing system in video conferencing," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97), vol. 1, pp. 187-190, Munich, Germany, April 1997.
[47]
{47} P. L. Feintuch, N. J. Bershad, and F. A. Reed, "Time delay estimation using the LMS adaptive filter--dynamic behavior," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 571-576, 1981.
[48]
{48} S. Haykin, "Radar array processing for angle of arrival estimation," in Array Signal Processing, S. Haykin, Ed., pp. 194-292, Prentice-Hall, Englewood Cliffs, NJ, USA, 1985.
[49]
{49} R. L. Kirlin, D. F. Moore, and R. F. Kubichek, "Improvement of delay measurements from sonar arrays via sequential state estimation," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 514-519, 1981.
[50]
{50} T. Nishiura, T. Yamada, S. Nakamura, and K. Shikano, "Localization of multiple sound sources based on a CSP analysis with a microphone array," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '00), vol. 2, pp. 1053-1055, Istanbul, Turkey, June 2000.
[51]
{51} S. M. Griebel and M. S. Brandstein, "Microphone array source localization using realizable delay vectors," in Proceedings of IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (WASPAA '01), pp. 71-74, New Platz, NY, USA, October 2001.
[52]
{52} J. H. DiBiase, H. F. Silverman, and M. S. Branstein, "Robust localization in reverberant rooms," in Microphone Arrays: Signal Processing Techniques and Applications, M. S. Branstein and D. B. Ward, Eds., chapter 8, pp. 157-180, Springer, New York, NY, USA, 2001.
[53]
{53} J. Chen, J. Benesty, and Y. (Arden) Huang, "Robust time delay estimation exploiting redundancy among multiple microphoens," IEEE Transactions on Speech and Audio Processing, vol. 11, no. 6, pp. 549-557, 2003.
[54]
{54} J. Benesty, J. Chen, and Y. (Arden) Huang, "Time-delay estimation via linear interpolation and cross correlation," IEEE Transactions on Speech and Audio Processing, vol. 12, no. 5, pp. 509-519, 2004.
[55]
{55} Y. (Arden) Huang, J. Benesty, and G. W. Elko, "Adaptive eigenvalue decomposition algorithm for real time acoustic source localization system," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '99), vol. 2, pp. 937-940, Phoenix, Ariz, USA, March 1999.
[56]
{56} G. Xu, H. Liu, L. Tong, and T. Kailath, "A least-squares approach to blind channel identification," IEEE Transactions on Signal Processing, vol. 43, no. 12, pp. 2982-2993, 1995.
[57]
{57} H.-F. Chen, X.-R. Cao, and J. Zhu, "Convergence of stochastic-approximation-based algorithms for blind channel identification," IEEE Transactions on Information Theory, vol. 48, no. 5, pp. 1214-1225, 2002.
[58]
{58} M. I. Gürelli and C. L. Nikias, "EVAM: an eigenvector-based algorithm for multichannel blind deconvolution of input colored signals," IEEE Transactions on Signal Processing, vol. 43, no. 1, pp. 134-149, 1995.
[59]
{59} L. Tong and S. Perreau, "Multichannel blind identification: from subspace to maximum likelihood methods," Proceedings of the IEEE, vol. 86, no. 10, pp. 1951-1968, 1998.
[60]
{60} Y. (Arden) Huang and J. Benesty, "Adaptive multi-channel least mean square and Newton algorithms for blind channel identification," Signal Processing, vol. 82, no. 8, pp. 1127-1138, 2002.
[61]
{61} H. V. Sorensen, D. L. Jones, M. T. Heideman, and C. S. Burrus, "Real-valued fast Fourier transform algorithms," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 35, no. 6, pp. 849-863, 1987.
[62]
{62} L. Fox, An Introduction to Numerical Linear Algebra, Clarendon Press, Oxford, UK, 1964.
[63]
{63} R. E. Boucher and J. C. Hassab, "Analysis of discrete implementation of generalized cross correlator," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 29, no. 3, pp. 609-611, 1981.
[64]
{64} R. Moddemeijer, "On the determination of the position of extrema of sampled correlators," IEEE Transactions on Signal Processing, vol. 39, no. 1, pp. 216-219, 1991.
[65]
{65} J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," Journal of the Acoustical Society of America, vol. 65, no. 4, pp. 943-950, 1979.
[66]
{66} B. Champagne, S. Bedard, and A. Stephenne, "Performance of time-delay estimation in the presence of room reverberation," IEEE Transactions on Speech and Audio Processing, vol. 4, no. 2, pp. 148-152, 1996.
[67]
{67} T. Gustafsson, B. D. Rao, and M. Trivedi, "Source localization in reverberant environments: modeling and statistical analysis," IEEE Transactions on Speech and Audio Processing, vol. 11, no. 6, pp. 791-803, 2003.

Cited By

View all
  • (2023)Vision‐audio fusion SLAM in dynamic environmentsCAAI Transactions on Intelligence Technology10.1049/cit2.122068:4(1364-1373)Online publication date: 13-Mar-2023
  • (2021)AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)10.1109/IROS51168.2021.9636585(6868-6875)Online publication date: 27-Sep-2021
  • (2020)Test and measurement assisted leak vibration signal analysis for leakages in metallic pipelinesProceedings of the 24th Pan-Hellenic Conference on Informatics10.1145/3437120.3437309(214-218)Online publication date: 20-Nov-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image EURASIP Journal on Advances in Signal Processing
EURASIP Journal on Advances in Signal Processing  Volume 2006, Issue
01 January
3089 pages

Publisher

Hindawi Limited

London, United Kingdom

Publication History

Published: 01 January 2006

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)24
  • Downloads (Last 6 weeks)3
Reflects downloads up to 20 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Vision‐audio fusion SLAM in dynamic environmentsCAAI Transactions on Intelligence Technology10.1049/cit2.122068:4(1364-1373)Online publication date: 13-Mar-2023
  • (2021)AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)10.1109/IROS51168.2021.9636585(6868-6875)Online publication date: 27-Sep-2021
  • (2020)Test and measurement assisted leak vibration signal analysis for leakages in metallic pipelinesProceedings of the 24th Pan-Hellenic Conference on Informatics10.1145/3437120.3437309(214-218)Online publication date: 20-Nov-2020
  • (2019)Leakage detection using leak noise correlation techniquesProceedings of the 23rd Pan-Hellenic Conference on Informatics10.1145/3368640.3368646(50-57)Online publication date: 28-Nov-2019
  • (2019)Sound Source Localization and Speech Enhancement Algorithm Based on Fixed BeamformingProceedings of the 2019 4th International Conference on Automation, Control and Robotics Engineering10.1145/3351917.3351932(1-7)Online publication date: 19-Jul-2019
  • (2018)Dereverberation and Beamforming in Far-Field Speaker Recognition2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP.2018.8462365(5254-5258)Online publication date: 15-Apr-2018
  • (2018)Correlation analysis of respiratory signals by using parallel coordinate plotsComputer Methods and Programs in Biomedicine10.1016/j.cmpb.2017.10.003153:C(41-51)Online publication date: 1-Jan-2018
  • (2017)Swarm Intelligence Based Particle Filter for Alternating Talker Localization and Tracking Using Microphone ArraysIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2017.269356625:6(1384-1397)Online publication date: 1-Jun-2017
  • (2017)A Consolidated Perspective on Multimicrophone Speech Enhancement and Source SeparationIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2016.264770225:4(692-730)Online publication date: 1-Apr-2017
  • (2016)TDOA Matrices: Algebraic Properties and Their Application to Robust Denoising With Missing DataIEEE Transactions on Signal Processing10.1109/TSP.2016.259369064:20(5242-5254)Online publication date: 15-Oct-2016
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media