Speech Recognition Using Stereo Vision Neural Networks with Competition and Cooperation

Sung-III Kim¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3497))

Included in the following conference series:

International Symposium on Neural Networks

1124 Accesses

Abstract

This paper describes the speech recognition based on stereoscopic vision neural networks(SVNN) that has a dynamic process of self-organization that has been proved to be successful in recognizing a depth perception in stereoscopic vision. This study has shown that the process has also been useful in recognizing human speech. In the stereoscopic vision neural networks, the similarities are first obtained by comparing input vocal signals with standard models. They are then given to a dynamic process in which both competitive and cooperative processes are conducted among neighboring similarities. Finally, only one winner neuron is finally detected through the dynamic process. In a comparative study, the average phoneme recognition accuracies on the SVNN was 6.6 % higher than the existing recognizer based on Hidden Markov Models(HMM) with the structures of a single mixture and three states. From the results, therefore, it was noticed that the speech recognizer using SVNN outperformed the conventional recognizer in phoneme recognition under the same conditions.

This work is supported by the Kyungnam University Research Fund, 2005.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 71.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 89.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Auxiliary Features from Laser-Doppler Vibrometer Sensor for Deep Neural Network Based Robust Speech Recognition

Article 27 September 2017

Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System

Spiking Cooperative Stereo-Matching at 2 ms Latency with Neuromorphic Hardware

References

Woodland, P.C., Leggestter, C.J., Odell, J.J., et al.: The 1994 HTK Large Vocabulary Speech Recognition System. In: Proc. IEEE Int. Con. on Acoustics, Speech, and Signal Processing, vol. 1, pp. 73–76 (1995)
Google Scholar
Lee, K.F., Hon, H.W.: Speaker-Independent Phone Recognition Using Hidden Markov Models. IEEE Tran. on Acoustic, Speech and Signal Processing 37, 1641–1648 (1989)
Article Google Scholar
Bourlard, H., Wellekens, C.J.: Links between Markov Models and Multi-layer Perceptrons. IEEE Tran. Patt. Anal. Machine Intell. 12, 1167–1178 (1990)
Article Google Scholar
Lang, J., Waibel, A., Hinton, G.E.: A Time-Delay Neural Network Architecture for Isolated Word Recognition. In: Artificial Neural Networks, Paradigms, Applications and Hardware Implementations, pp. 388–408. IEEE Press, New York (1992)
Google Scholar
Martinelli, G.: Hidden Control Neural Network. IEEE Tran. on Circuits and Systems, Analog and Signal Processing 41, 245–247 (1994)
Google Scholar
Reinmann, D., Haken, H.: Stereo Vision by Self-organization. Biol. Cybern. 71, 17–26 (1994)
Article Google Scholar
Amari, S., Arbib, M.A.: Competition and Cooperation in Neural Nets. Systems Neuroscience, pp. 119–165. Academic Press, London (1977)
Google Scholar
Yoshitomi, Y., Kanda, T., Kitazoe, T.: Neural Nets Pattern Recognition Equation for Stereo Vision. Trans. IPS, 29–38 (1998)
Google Scholar
Yoshitomi, Y., Kitazoe, T., Tomiyama, J., Tatebe, Y.: Sequential Stereo Vision and Phase Transition. In: Proc. Int. Sym. on Artificial Life and Robotics, pp. 318–323 (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Division of Electronic and Electrical Engineering, Kyungnam University, 449 Wolyoung-dong, Masan City, 631-701, Korea
Sung-III Kim

Authors

Sung-III Kim
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
Jun Wang
The Key Laboratory of Optoelectric Technology & Systems, Ministry of Education, China
Xiao-Feng Liao
Computational Intelligence Laboratory, School of Computer Science and Engineering, University of Electronic Science and Technology of China, 610054, Chengdu, P.R. China
Zhang Yi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, SI. (2005). Speech Recognition Using Stereo Vision Neural Networks with Competition and Cooperation. In: Wang, J., Liao, XF., Yi, Z. (eds) Advances in Neural Networks – ISNN 2005. ISNN 2005. Lecture Notes in Computer Science, vol 3497. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11427445_54

Download citation

DOI: https://doi.org/10.1007/11427445_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25913-8
Online ISBN: 978-3-540-32067-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Speech Recognition Using Stereo Vision Neural Networks with Competition and Cooperation

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Auxiliary Features from Laser-Doppler Vibrometer Sensor for Deep Neural Network Based Robust Speech Recognition

Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System

Spiking Cooperative Stereo-Matching at 2 ms Latency with Neuromorphic Hardware

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Speech Recognition Using Stereo Vision Neural Networks with Competition and Cooperation

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Auxiliary Features from Laser-Doppler Vibrometer Sensor for Deep Neural Network Based Robust Speech Recognition

Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System

Spiking Cooperative Stereo-Matching at 2 ms Latency with Neuromorphic Hardware

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation