[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Acoustic DOA estimation using space alternating sparse Bayesian learning

Published: 06 April 2021 Publication History

Abstract

Estimating the direction-of-arrival (DOA) of multiple acoustic sources is one of the key technologies for humanoid robots and drones. However, it is a most challenging problem due to a number of factors, including the platform size which puts a constraint on the array aperture. To overcome this problem, a high-resolution DOA estimation algorithm based on sparse Bayesian learning is proposed in this paper. A group sparse prior based hierarchical Bayesian model is introduced to encourage spatial sparsity of acoustic sources. To obtain approximate posteriors of the hidden variables, a variational Bayesian approach is proposed. Moreover, to reduce the computational complexity, the space alternating approach is applied to push the variational Bayesian inference to the scalar level. Furthermore, an acoustic DOA estimator is proposed to jointly utilize the estimated source signals from all frequency bins. Compared to state-of-the-art approaches, the high-resolution performance of the proposed approach is demonstrated in experiments with both synthetic and real data. The experiments show that the proposed approach achieves lower root mean square error (RMSE), false alert (FA), and miss-detection (MD) than other methods. Therefore, the proposed approach can be applied to some applications such as humanoid robots and drones to improve the resolution performance for acoustic DOA estimation especially when the size of the array aperture is constrained by the platform, preventing the use of traditional methods to resolve multiple sources.

References

[1]
Hornstein J., Lopes M., Santos-Victor J., and Lacerda F. Sound localization for humanoid robots - building audio-motor maps based on the HRTF 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems 2006 Beijing IEEE 1170-1176
[2]
Rascon C. and Meza I. Localization of sound sources in robotics: a review Robot. Auton. Syst. 2017 96 184-210
[3]
Strauss M., Mordel P., Miguet V., and Deleforge A. DREGON: dataset and methods for UAV-embedded sound source localization 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018 Madrid IEEE
[4]
Deleforge A., Carlo D. D., Strauss M., Serizel R., and Marcenaro L. Audio-based search and rescue with a drone: highlights from the IEEE signal processing cup 2019 student competition IEEE Signal Proc. Mag. 2019 36 5 138-144
[5]
Valin J. M., Michaud F., and Rouat J. Robust 3D localization and tracking of sound sources using beamforming and particle filtering 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings 2006 Toulouse IEEE 841-844
[6]
Zhang C., Florencio D., Ba D. E., and Zhang Z. Maximum likelihood sound source localization and beamforming for directional microphone arrays in distributed meetings IEEE Trans. Multimed. 2008 10 3 538-548
[7]
Farmani M., Pedersen M. S., Tan Z. -H., and Jensen J. Informed sound source localization using relative transfer functions for hearing aid applications IEEE/ACM Trans. Audio Speech Lang. Process. 2017 25 3 611-623
[8]
Van Trees H. L. Optimum array processing Part IV of Detection, Estimation, and Modulation Theory 2004 New York John Wiley and Sons 21-53
[9]
DiBiase J. H., Silverman H. F., and Brandstein M. S. Robust localization in reverberant rooms Microphone arrays 2001 Berlin, Heidelberg Springer 164-180
[10]
Krishnaveni V., Kesavamurthy T., and B A. Beamforming for direction-of-arrival (DOA) estimation-a survey Int. J. Comput. Appl. 2013 61 11 4-11
[11]
Schmidt R. Multiple emitter location and signal parameter estimation IEEE Trans. Antennas Propag. 1986 34 3 276-280
[12]
Roy R. and Kailath T. ESPRIT-estimation of signal parameters via rotational invariance techniques IEEE Trans. Acoustics Speech Sig. Process. 1989 37 7 984-995
[13]
Cox H., Zeskind R., and Owen M. Robust adaptive beamforming IEEE Trans Acoustics Speech Sig. Process. 1987 35 10 1365-1376
[14]
Feldman D. D. and Griffiths L. J. A projection approach for robust adaptive beamforming IEEE Trans Sig. Process. 1994 42 4 867-876
[15]
Pardini M., Lombardini F., and Gini F. The hybrid Cramér–Rao bound on broadside DOA estimation of extended sources in presence of array errors IEEE Trans Sig. Process. 2008 56 4 1726-1730
[16]
Khabbazibasmenj A., Vorobyov S. A., and Hassanien A. Robust adaptive beamforming based on steering vector estimation with as little as possible prior information IEEE Trans Sig. Process. 2012 60 6 2974-2987
[17]
Kintz A. L. and Gupta I. J. A modified MUSIC algorithm for direction of arrival estimation in the presence of antenna array manifold mismatch IEEE Trans. Antennas Propag. 2016 64 11 4836-4847
[18]
Malioutov D., Cetin M., and Willsky A. S. A sparse signal reconstruction perspective for source localization with sensor arrays IEEE Trans. Sig. Process. 2005 53 8 3010-3022
[19]
Wipf D. P. and Rao B. D. An empirical Bayesian strategy for solving the simultaneous sparse approximation problem IEEE Trans. Sig. Process. 2007 55 7 3704-3716
[20]
Fortunati S., Grasso R., Gini F., Greco M. S., and LePage K. Single-snapshot DOA estimation by using compressed sensing EURASIP J. Adv. Sig. Process. 2014 2014 1 1-17
[21]
Gerstoft P., Mecklenbrauker C. F., Xenaki A., and Nannuru S. Multisnapshot sparse Bayesian learning for DOA IEEE Sig. Process. Lett. 2016 23 10 1469-1473
[22]
Xenakia A., Boldt J. B., and Christensen M. G. Sound source localization and speech enhancement with sparse Bayesian learning beamforming J. Acoust. Soc. Am. 2018 143 6 3912-3921
[23]
Xenaki A., Gerstoft P., and Mosegaard K. Compressive beamforming J. Acoust. Soc. Am. 2014 136 1 260-271
[24]
Mecklenbräuker C. F., Gerstoft P., and Zöchmann E. c–LASSO and its dual for sparse signal estimation from array data Sig. Process. 2017 130 204-216
[25]
Wang X., Meng D., Huang M., and Wan L. Reweighted regularized sparse recovery for DOA estimation with unknown mutual coupling IEEE Commun. Lett. 2019 23 2 290-293
[26]
Z. Yang, J. Li, P. Stoica, L. Xie, C. Rama, T. Sergios, in Academic Press Library in Signal Processing. One, 7. Sparse methods for direction-of-arrival estimation (New York, 2018), pp. 509–581.
[27]
Tipping M. E. and Smola A. Sparse Bayesian learning and the relevance vector machine J. Mach. Learn. Res. 2001 59 1 211-244
[28]
Ji S., Xue Y., and Carin L. Bayesian compressive sensing IEEE Trans. Sig. Process. 2008 56 6 2346-2356
[29]
Babacan S. D., Molina R., and Katsaggelos A. K. Bayesian compressive sensing using laplace priors IEEE Trans. Image Process. 2010 19 1 53-63
[30]
Worley B. Scalable mean-field sparse bayesian learning IEEE Trans. Sig. Process. 2019 67 24 6314-6326
[31]
Wipf D. and Nagarajan S. Beamforming using the relevance vector machine Proceedings of the 24th International Conference on Machine Learning - ICML 07 2007 New York, USA ACM Press 1-8
[32]
Yang Z., Xie L., and Zhang C. Off-grid direction of arrival estimation using sparse Bayesian inference IEEE Trans. Sig. Process. 2013 61 1 38-43
[33]
Zhao L., Li X., Wang L., and Bi G. Computationally efficient wide-band DOA estimation methods based on sparse Bayesian framework IEEE Trans. Veh. Technol. 2017 66 12 11108-11121
[34]
Bai Z., Sun J., Jensen J. R., and Christensen M. G. Indoor sound source localization based on sparse Bayesian learning and compressed data 2019 27th European Signal Processing Conference (EUSIPCO) 2019 A Coruna, Spain IEEE 1-5
[35]
Bai Z., Jensen J. R., Sun J., and Christensen M. G. A sparse Bayesian learning based RIR reconstruction method for acoustic TOA and DOA estimation 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2019 New York, USA IEEE 1-5
[36]
Tipping M. E., Faul A., Avenue J. J. T., and Avenue J. J. T. Fast marginal likelihood maximisation for sparse Bayesian models Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics 2003 Key West JMLR 3-6
[37]
Duan H., Yang L., Fang J., and Li H. Fast inverse-free sparse Bayesian learning via relaxed evidence lower bound maximization IEEE Sig. Process. Lett. 2017 24 6 774-778
[38]
Thomas C. K. and Slock D. Space alternating variational Bayesian learning for LMMSE filtering 2018 26th European Signal Processing Conference (EUSIPCO) 2018 Rome, Italy IEEE 1-5
[39]
Wipf D. P. and Rao B. D. Sparse Bayesian learning for basis selection IEEE Trans. Sig. Process. 2004 52 8 2153-2164
[40]
Zhang Z. and Rao B. D. Sparse signal recovery with temporally correlated source vectors using sparse Bayesian learning IEEE J. Sel. Top. Sig. Process. 2011 5 5 912-926
[41]
Huang J. and Zhang T.The benefit of group sparsityAnn. Stat.20103841978-2004https://doi.org/10.1214/09-aos778
[42]
Bishop C. M. Approximate inference Pattern recognition and machine learning 2006 New York Springer 472-485
[43]
Tzikas D. G., Likas A. C., and Galatsanos N. P. The variational approximation for Bayesian inference IEEE Sig. Process. Mag. 2008 25 6 131-146
[44]
Fessler J. A. and Hero A. O. Space-alternating generalized expectation-maximization algorithm IEEE Trans. Sig. Process. 1994 42 10 2664-2677
[45]
Dorfan Y. and Gannot S. Tree-based recursive expectation-maximization algorithm for localization of acoustic sources IEEE/ACM Trans. Audio Speech Lang. Process. 2015 23 10 1692-1703
[46]
Li X., Ban Y., Girin L., Xavier A. P., and Horaud R. Online localization and tracking of multiple moving speakers in reverberant environments IEEE J. Sel. Top. Sig. Process. 2019 13 1 88-103
[47]
Birchfield S. T. and Gillmor D. K. Fast Bayesian acoustic localization IEEE International Conference on Acoustics Speech and Signal Processing 2002 Palo Alto, California IEEE 1-4
[48]
Traa J., Wingate D., Stein N. D., and Smaragdis P. Robust source localization and enhancement with a probabilistic steered response power model IEEE/ACM Trans. Audio Speech. Lang. Process. 2016 24 3 493-503
[49]
Nowak R. D. Distributed EM algorithms for density estimation and clustering in sensor networks IEEE Trans. Sig. Process. 2003 51 8 2245-2253
[50]
Dorfan Y., Hazan G., and Gannot S. Multiple acoustic sources localization using distributed expectation-maximization algorithm 2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA) 2014 Villers-les-Nancy, France IEEE 1-5
[51]
Lollmann H. W., Evers C., Schmidt A., Mellmann H., Barfuss H., Naylor P. A., and Kellermann W. The LOCATA challenge data corpus for acoustic source localization and tracking 2018 IEEE 10th Sensor Array and Multichannel Signal Processing Workshop (SAM) 2018 Sheffield IEEE 410-414
[52]
Sohn J., Kim N. S., and Sung W. A statistical model-based voice activity detection IEEE Sig. Process. Lett. 1999 6 1 1-3

Cited By

View all
  • (2023)Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimizationEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-023-00304-82023:1Online publication date: 4-Oct-2023
  • (2023)BLEselectProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35694826:4(1-28)Online publication date: 11-Jan-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image EURASIP Journal on Audio, Speech, and Music Processing
EURASIP Journal on Audio, Speech, and Music Processing  Volume 2021, Issue 1
Dec 2021
675 pages
ISSN:1687-4714
EISSN:1687-4722
Issue’s Table of Contents

Publisher

Hindawi Limited

London, United Kingdom

Publication History

Published: 06 April 2021
Accepted: 27 January 2021
Received: 14 September 2020

Author Tags

  1. Sparse Bayesian learning
  2. Acoustic DOA estimation
  3. Sound source localization

Qualifiers

  • Research-article

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimizationEURASIP Journal on Audio, Speech, and Music Processing10.1186/s13636-023-00304-82023:1Online publication date: 4-Oct-2023
  • (2023)BLEselectProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/35694826:4(1-28)Online publication date: 11-Jan-2023

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media