[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Some Like It Gaussian. . .

  • Conference paper
  • First Online:
Text, Speech and Dialogue (TSD 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2448))

Included in the following conference series:

  • 576 Accesses

Abstract

In Hidden Markov models, speech features are modeled by Gaussian distributions. In this paper, we propose to gaussianize the features to better fit to this modeling. A distribution of the data is estimated and a transform function is derived. We have tested two methods of the transform estimation (global and speaker based). The results are reported on recognition of isolated Czech words (SpeechDat-E) with CI and CD models and on medium vocabulary continuous speech recognition task (SPINE). Gaussianized data provided in all three cases results superior to standard MFC coefficients proving, that the gaussianization is a cheap way to increase the recognition accuracy

Supported by Grant Agency of Czech Republic under project No. 102/02/0124.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 35.99
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 44.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. B. Gold and N. Morgan. Speech and audio signal processing. John Wiley & Sons, 2000.

    Google Scholar 

  2. J. Pelecanos and S. Sridharan. Feature warping for robust speaker verification. In: Proc. Speaker Odyssey 2001 conference, June 2001.

    Google Scholar 

  3. R. Singh, M. L. Seltzer, B. Raj, and R. M. Stern. Speech in noisy environments: robust automatic segmentation, feature extraction, and hypothesis combination. In: Proc. ICASSP 2001, Salt Lake City, Utah, USA, May 2001.

    Google Scholar 

  4. H. van den Heuvel et al. Speechdat-east: Five multilingual speech databases for voice-operated teleservices completed. In: EuroSpeech 2001, Aalborg, Denmark, September 2001.

    Google Scholar 

  5. S. Young, J. Jansen, J. Odell, D. Ollason, and P. Woodland. The HTK book. Entropics Cambridge Research Lab., Cambridge, UK, 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Matějka, P., Schwarz, P., Karafiát, M., Černocký, J. (2002). Some Like It Gaussian. . .. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_44

Download citation

  • DOI: https://doi.org/10.1007/3-540-46154-X_44

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44129-8

  • Online ISBN: 978-3-540-46154-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics