[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1950280.1950311guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Slovak language model from internet text data

Published: 15 March 2010 Publication History

Abstract

Automatic speech recognition system is one of the parts of the multimodal dialogue system. It is necessary to create correct vocabulary and to generate suitable language model for this purpose. The main aim of this article is to describe a process of building statistical models of the Slovak language with large vocabulary trained on the text data gathered mainly from Internet sources. Several smoothing techniques for different sizes of vocabulary have been used in order to obtain an optimal model of the Slovak language. We have also employed pruning technique based on relative entropy for size reduction of a language model to find the maximum threshold of pruning with minimum degradation in recognition accuracy. Tests were performed by the decoder based on the HTK Toolkit.

References

[1]
Chollet, G., Esposito, A., Gentes, A., Horain, P., Karam, W., Li, Z., Pelachaud, C., Perrot, P., Petrovska-Delacrétaz, D., Zhou, D., Zouari, L.: Multimodal Human Machine Interactions in Virtual and Augmented Reality. In: Esposito, A., Hussain, A., Marinaro, M., Martone, R. (eds.) COST Action 2102. LNCS(LNAI), vol. 5398, pp. 1-23. Springer, Heidelberg (2009).
[2]
Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 2nd edn., p. 998. Prentice Hall, Englewood Cliffs (2009) ISBN-13 978-0-13-504196-3.
[3]
Chen, S.F., Goodman, J.: An Empirical Study of Smoothing Techniques for Language Modeling. Technical Report TR-10-98, p. 63 (1998).
[4]
Stolcke, A.: Entropy-based Pruning of Backoff Language Models. In: Proc. DARPA Broadcast News Transcription and Understanding Workshop, pp. 270-274 (1998).
[5]
Mirilovič, M., Juhár, J., Cižmár, A.: Large Vocabulary Continuous Speech Recognition in Slovak. In: Proc. of International Conference on Applied Electrical Engineering and Informatics, Athens, Greece, pp. 73-77 (2008) ISBN 978-80-553-0066-5.
[6]
Stolcke, A.: SRILM - An Extensible Language Modeling Toolkit. In: Proc. of the 7th International Conference on Spoken Language Processing, Denver, Colorado, pp. 901-904 (2002).
[7]
Cowan, I.A., Moore, D., Dines, J., Gatiza-Perez, D., Flynn, M., Wellner, P., Bourlard, H.: On the Use of Information Retrieval Measures for Speech Recognition Evaluation. In: IDIAP-RR-73, Martigny, Switzerland, p. 15 (2005).
[8]
Young, S., Odell, J., Ollason, D., Valtchev, V., Woodland, P., Evermann, G., Hain, T., Kershaw, D., Moore, G.: The HTK Book (v3.4). Cambridge University, Cambridge (2009).
[9]
Rusko, M., Trnka, M., Daržagín, S.: MobilDat-SK - A Mobile Telephone Extension to the SpeechDat-E SK Telephone Speech Database in Slovak. In: Proc. of the 11th International Conference Speech and Computer, SPECOM 2006, pp. 485-488 (2006).
[10]
Mirilovič, M., Juhár, J., Čižmár, A.: Comparison of Grapheme and Phoneme Based Acoustic Modeling in LVCSR Task in Slovak. In: Proc. of the 7th International Conference on Spoken Language Processing, Denver, Colorado, pp. 901-904 (2002).

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
Proceedings of the Third COST 2102 international training school conference on Toward autonomous, adaptive, and context-aware multimodal interfaces: theoretical and practical issues
March 2010
472 pages
ISBN:9783642181832
  • Editors:
  • Anna Esposito,
  • Antonietta M. Esposito,
  • Raffaele Martone,
  • Vincent C. Müller,
  • Gaetano Scarpetta

Sponsors

  • Provincia di Salerno: Provincia di Salerno
  • International Institute for Advanced Scientific Studies "E.R. Caianiello": International Institute for Advanced Scientific Studies "E.R. Caianiello"
  • ESF: European Science Foundation
  • Regione Campania
  • SERN: Società Italiana Reti Neuroniche

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 15 March 2010

Author Tags

  1. language model
  2. n-grams
  3. speech recognition
  4. spellchecking
  5. text normalization
  6. vocabulary

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Dec 2024

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media