Some Like It Gaussian. . .

Pavel Matějka³,
Petr Schwarz⁴,
Martin Karafiát⁴ &
…
Jan Černocký⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2448))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

576 Accesses

Abstract

In Hidden Markov models, speech features are modeled by Gaussian distributions. In this paper, we propose to gaussianize the features to better fit to this modeling. A distribution of the data is estimated and a transform function is derived. We have tested two methods of the transform estimation (global and speaker based). The results are reported on recognition of isolated Czech words (SpeechDat-E) with CI and CD models and on medium vocabulary continuous speech recognition task (SPINE). Gaussianized data provided in all three cases results superior to standard MFC coefficients proving, that the gaussianization is a cheap way to increase the recognition accuracy

Supported by Grant Agency of Czech Republic under project No. 102/02/0124.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 35.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 44.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

B. Gold and N. Morgan. Speech and audio signal processing. John Wiley & Sons, 2000.
Google Scholar
J. Pelecanos and S. Sridharan. Feature warping for robust speaker verification. In: Proc. Speaker Odyssey 2001 conference, June 2001.
Google Scholar
R. Singh, M. L. Seltzer, B. Raj, and R. M. Stern. Speech in noisy environments: robust automatic segmentation, feature extraction, and hypothesis combination. In: Proc. ICASSP 2001, Salt Lake City, Utah, USA, May 2001.
Google Scholar
H. van den Heuvel et al. Speechdat-east: Five multilingual speech databases for voice-operated teleservices completed. In: EuroSpeech 2001, Aalborg, Denmark, September 2001.
Google Scholar
S. Young, J. Jansen, J. Odell, D. Ollason, and P. Woodland. The HTK book. Entropics Cambridge Research Lab., Cambridge, UK, 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Elec. Eng. and Communication, VUT Brno, Brno
Pavel Matějka
Fac. of Inf. Technology, VUT Brno, Brno
Petr Schwarz, Martin Karafiát & Jan Černocký

Authors

Pavel Matějka
View author publications
You can also search for this author in PubMed Google Scholar
Petr Schwarz
View author publications
You can also search for this author in PubMed Google Scholar
Martin Karafiát
View author publications
You can also search for this author in PubMed Google Scholar
Jan Černocký
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics Department of Programming Systems and Communication, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka
Faculty of Informatics Department of Information Technologies, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Ivan Kopeček & Karel Pala &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Matějka, P., Schwarz, P., Karafiát, M., Černocký, J. (2002). Some Like It Gaussian. . .. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_44

Download citation

DOI: https://doi.org/10.1007/3-540-46154-X_44
Published: 23 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics