Abstract
In this paper a two stage speaker verification system is presented. The first stage contains a modified Self Organising Map (SOM) that filters speech data using cluster information extracted from three selected vowels for a claimed speaker. Filtered frames from the first stage are then fed into the second stage which consists of three Multi Layer Perceptron (MLP) networks; these networks acting as individual claimed speaker vowel verifiers. Sixty four Discrete Fourier Transform (DFT) spectrum components are used as the input feature vectors. The system provides a verification performance of 94.54% when evaluated using 50 speakers from the Centre for Spoken Language Understanding (CSLU2002) speaker verification database.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Reynolds, D.A. and R.C. Rose: Robust text-independent speaker identification using Gaussian mixture speaker models. Speech and Audio Processing, IEEE Transactions on, 1995. 3(1): p. 72-83.
Campbell, W.M., et al.: Support vector machines for speaker and language recognition. Computer Speech & Language, 2006. 20(2-3): p. 210-229.
Seddik, H., A. Rahmouni, and M. Sayadi: Text independent speaker recognition using the Mel frequency cepstral coefficients and a neural network classifier. in Control, Communications and Signal Processing, First International Symposium on. 2004.
Oglesby, J. and J.S. Mason: Radial basis function networks for speaker recognition. in Acoustics, Speech, and Signal Processing, ICASSP-91., International Conference on. 1991.
Farrell, K.R., R.J. Mammone, and K.T. Assaleh: Speaker recognition using neural networks and conventional classifiers. Speech and Audio Processing, IEEE Transactions on, 1994. 2(1): p. 194-205.
Kishore, S.P. and B. Yegnanarayana: Speaker verification: minimizing the channel effects using autoassociative neural network models. in Acoustics, Speech, and Signal Processing, ICASSP '00. Proceedings. IEEE International Conference on. 2000.
Mueen, F., et al: Speaker recognition using artificial neural networks. in Students Conference, ISCON '02. Proceedings. IEEE. 2002.
Kusumoputro, B., et al: Speaker identification in noisy environment using bispectrum analysis and probabilistic neural network. in Computational Intelligence and Multimedia Applications, ICCIMA 2001. Proceedings. Fourth International Conference on. 2001.
George, S., et al: Speaker recognition using dynamic synapse based neural networks with wavelet preprocessing. in Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on. 2001.
Monte, E., et al: Text independent speaker identification on noisy environments by means of self organizing maps. in Spoken Language, ICSLP 96. Proceedings., Fourth International Conference on. 1996.
Tashan, T., T. Allen, and L. Nolle: Vowel based speaker verification using self organising map. in The Eleventh IASTED International Conference on Artificial Intelligence and Applications (AIA 2011). 2011. Innsbruck, Austria: ACTA Press.
Han-Sheng, L. and R.J. Mammone: Speaker verification using phoneme-based neural tree networks and phonetic weighting scoring method. in Neural Networks for Signal Processing V. Proceedings of the IEEE Workshop. 1995.
Jayanna, H.S. and S.R.M. Prasanna: An experimental comparison of modelling techniques for speaker recognition under limited data condition. Sadhana-Academy Proceedings in Engineering Sciences, 2009. 34(5): p. 717-728.
Rabiner, L.R. and R.W. Schafer: Digital processing of speech signals. Prentice-Hall signal processing series. 1978, Englewood Cliffs, N.J.: Prentice-Hall.
Davis, S. and P. Mermelstein: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. Acoustics, Speech and Signal Processing, IEEE Transactions on, 1980. 28(4): p. 357-366.
Sun, F., B. Li, and H. Chi: Some key factors in speaker recognition using neural networks approach. in Neural Networks, IEEE International Joint Conference on. 1991.
Delacretaz, D.P. and J. Hennebert: Text-prompted speaker verification experiments with phoneme specific MLPs. in Acoustics, Speech and Signal Processing, Proceedings of the IEEE International Conference on. 1998.
Sri Rama Murty, K., S.R. Mahadeva Prasanna, and B. Yegnanarayana: Speaker-specific information from residual phase. in Signal Processing and Communications, SPCOM '04. International Conference on. 2004.
Kohonen, T.: The self-organizing map. Proceedings of the IEEE, 1990. 78(9): p. 1464-1480.
Lapidot, I., H. Guterman, and A. Cohen: Unsupervised speaker recognition based on competition between self-organizing maps. Neural Networks, IEEE Transactions on, 2002. 13(4): p. 877-887.
Mafra, A.T. and M.G. Simoes: Text independent automatic speaker recognition using selforganizing maps. in Industry Applications Conference, 39th IAS Annual Meeting. Conference Record of the IEEE. 2004.
Hadjitodorov, S., B. Boyanov, and N. Dalakchieva: A two-level classifier for textindependent speaker identification. Speech Communication, 1997. 21(3): p. 209-217.
Inal, M. and Y.S. Fatihoglu: Self organizing map and associative memory model hybrid classifier for speaker recognition. in Neural Network Applications in Electrical Engineering, NEUREL '02. 6th Seminar on. 2002.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag London Limited
About this paper
Cite this paper
Tashan, T., Allen, T. (2011). Two stage speaker verification using Self Organising Map and Multilayer Perceptron Neural Network. In: Bramer, M., Petridis, M., Nolle, L. (eds) Research and Development in Intelligent Systems XXVIII. SGAI 2011. Springer, London. https://doi.org/10.1007/978-1-4471-2318-7_8
Download citation
DOI: https://doi.org/10.1007/978-1-4471-2318-7_8
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-2317-0
Online ISBN: 978-1-4471-2318-7
eBook Packages: Computer ScienceComputer Science (R0)