[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Two stage speaker verification using Self Organising Map and Multilayer Perceptron Neural Network

  • Conference paper
  • First Online:
Research and Development in Intelligent Systems XXVIII (SGAI 2011)

Abstract

In this paper a two stage speaker verification system is presented. The first stage contains a modified Self Organising Map (SOM) that filters speech data using cluster information extracted from three selected vowels for a claimed speaker. Filtered frames from the first stage are then fed into the second stage which consists of three Multi Layer Perceptron (MLP) networks; these networks acting as individual claimed speaker vowel verifiers. Sixty four Discrete Fourier Transform (DFT) spectrum components are used as the input feature vectors. The system provides a verification performance of 94.54% when evaluated using 50 speakers from the Centre for Spoken Language Understanding (CSLU2002) speaker verification database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 103.50
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 129.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Reynolds, D.A. and R.C. Rose: Robust text-independent speaker identification using Gaussian mixture speaker models. Speech and Audio Processing, IEEE Transactions on, 1995. 3(1): p. 72-83.

    Article  Google Scholar 

  2. Campbell, W.M., et al.: Support vector machines for speaker and language recognition. Computer Speech & Language, 2006. 20(2-3): p. 210-229.

    Article  Google Scholar 

  3. Seddik, H., A. Rahmouni, and M. Sayadi: Text independent speaker recognition using the Mel frequency cepstral coefficients and a neural network classifier. in Control, Communications and Signal Processing, First International Symposium on. 2004.

    Google Scholar 

  4. Oglesby, J. and J.S. Mason: Radial basis function networks for speaker recognition. in Acoustics, Speech, and Signal Processing, ICASSP-91., International Conference on. 1991.

    Google Scholar 

  5. Farrell, K.R., R.J. Mammone, and K.T. Assaleh: Speaker recognition using neural networks and conventional classifiers. Speech and Audio Processing, IEEE Transactions on, 1994. 2(1): p. 194-205.

    Article  Google Scholar 

  6. Kishore, S.P. and B. Yegnanarayana: Speaker verification: minimizing the channel effects using autoassociative neural network models. in Acoustics, Speech, and Signal Processing, ICASSP '00. Proceedings. IEEE International Conference on. 2000.

    Google Scholar 

  7. Mueen, F., et al: Speaker recognition using artificial neural networks. in Students Conference, ISCON '02. Proceedings. IEEE. 2002.

    Google Scholar 

  8. Kusumoputro, B., et al: Speaker identification in noisy environment using bispectrum analysis and probabilistic neural network. in Computational Intelligence and Multimedia Applications, ICCIMA 2001. Proceedings. Fourth International Conference on. 2001.

    Google Scholar 

  9. George, S., et al: Speaker recognition using dynamic synapse based neural networks with wavelet preprocessing. in Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on. 2001.

    Google Scholar 

  10. Monte, E., et al: Text independent speaker identification on noisy environments by means of self organizing maps. in Spoken Language, ICSLP 96. Proceedings., Fourth International Conference on. 1996.

    Google Scholar 

  11. Tashan, T., T. Allen, and L. Nolle: Vowel based speaker verification using self organising map. in The Eleventh IASTED International Conference on Artificial Intelligence and Applications (AIA 2011). 2011. Innsbruck, Austria: ACTA Press.

    Google Scholar 

  12. Han-Sheng, L. and R.J. Mammone: Speaker verification using phoneme-based neural tree networks and phonetic weighting scoring method. in Neural Networks for Signal Processing V. Proceedings of the IEEE Workshop. 1995.

    Google Scholar 

  13. Jayanna, H.S. and S.R.M. Prasanna: An experimental comparison of modelling techniques for speaker recognition under limited data condition. Sadhana-Academy Proceedings in Engineering Sciences, 2009. 34(5): p. 717-728.

    MATH  Google Scholar 

  14. Rabiner, L.R. and R.W. Schafer: Digital processing of speech signals. Prentice-Hall signal processing series. 1978, Englewood Cliffs, N.J.: Prentice-Hall.

    Google Scholar 

  15. Davis, S. and P. Mermelstein: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. Acoustics, Speech and Signal Processing, IEEE Transactions on, 1980. 28(4): p. 357-366.

    Article  Google Scholar 

  16. Sun, F., B. Li, and H. Chi: Some key factors in speaker recognition using neural networks approach. in Neural Networks, IEEE International Joint Conference on. 1991.

    Google Scholar 

  17. Delacretaz, D.P. and J. Hennebert: Text-prompted speaker verification experiments with phoneme specific MLPs. in Acoustics, Speech and Signal Processing, Proceedings of the IEEE International Conference on. 1998.

    Google Scholar 

  18. Sri Rama Murty, K., S.R. Mahadeva Prasanna, and B. Yegnanarayana: Speaker-specific information from residual phase. in Signal Processing and Communications, SPCOM '04. International Conference on. 2004.

    Google Scholar 

  19. Kohonen, T.: The self-organizing map. Proceedings of the IEEE, 1990. 78(9): p. 1464-1480.

    Article  Google Scholar 

  20. Lapidot, I., H. Guterman, and A. Cohen: Unsupervised speaker recognition based on competition between self-organizing maps. Neural Networks, IEEE Transactions on, 2002. 13(4): p. 877-887.

    Article  Google Scholar 

  21. Mafra, A.T. and M.G. Simoes: Text independent automatic speaker recognition using selforganizing maps. in Industry Applications Conference, 39th IAS Annual Meeting. Conference Record of the IEEE. 2004.

    Google Scholar 

  22. Hadjitodorov, S., B. Boyanov, and N. Dalakchieva: A two-level classifier for textindependent speaker identification. Speech Communication, 1997. 21(3): p. 209-217.

    Article  Google Scholar 

  23. Inal, M. and Y.S. Fatihoglu: Self organizing map and associative memory model hybrid classifier for speaker recognition. in Neural Network Applications in Electrical Engineering, NEUREL '02. 6th Seminar on. 2002.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tariq Tashan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag London Limited

About this paper

Cite this paper

Tashan, T., Allen, T. (2011). Two stage speaker verification using Self Organising Map and Multilayer Perceptron Neural Network. In: Bramer, M., Petridis, M., Nolle, L. (eds) Research and Development in Intelligent Systems XXVIII. SGAI 2011. Springer, London. https://doi.org/10.1007/978-1-4471-2318-7_8

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-2318-7_8

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-2317-0

  • Online ISBN: 978-1-4471-2318-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics