Abstract
In Hidden Markov models, speech features are modeled by Gaussian distributions. In this paper, we propose to gaussianize the features to better fit to this modeling. A distribution of the data is estimated and a transform function is derived. We have tested two methods of the transform estimation (global and speaker based). The results are reported on recognition of isolated Czech words (SpeechDat-E) with CI and CD models and on medium vocabulary continuous speech recognition task (SPINE). Gaussianized data provided in all three cases results superior to standard MFC coefficients proving, that the gaussianization is a cheap way to increase the recognition accuracy
Supported by Grant Agency of Czech Republic under project No. 102/02/0124.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
B. Gold and N. Morgan. Speech and audio signal processing. John Wiley & Sons, 2000.
J. Pelecanos and S. Sridharan. Feature warping for robust speaker verification. In: Proc. Speaker Odyssey 2001 conference, June 2001.
R. Singh, M. L. Seltzer, B. Raj, and R. M. Stern. Speech in noisy environments: robust automatic segmentation, feature extraction, and hypothesis combination. In: Proc. ICASSP 2001, Salt Lake City, Utah, USA, May 2001.
H. van den Heuvel et al. Speechdat-east: Five multilingual speech databases for voice-operated teleservices completed. In: EuroSpeech 2001, Aalborg, Denmark, September 2001.
S. Young, J. Jansen, J. Odell, D. Ollason, and P. Woodland. The HTK book. Entropics Cambridge Research Lab., Cambridge, UK, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Matějka, P., Schwarz, P., Karafiát, M., Černocký, J. (2002). Some Like It Gaussian. . .. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_44
Download citation
DOI: https://doi.org/10.1007/3-540-46154-X_44
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive