Abstract
Our paper discusses the progress achieved during one-year effort with building the Czech LVCSR system for an automatic transcription of spontaneously pronounced testimonies in the MALACH project [1]. The difficulty of this task stems from the highly inflectional nature of the Czech language and is further multiplied by the presence of many colloquial words in spontaneous Czech speech and also by the need to handle emotional speech filled with disfluencies, heavy accents, age-related coarticulation and language switching. In this paper we concetrate mainly on the acoustic issues – the proper choice of the front-end parameterization, handling the non-speech events in acoustic modeling and especially the unsupervised usage of the MLLR adaptation technique. A method for selecting suitable language model data is also briefly mentioned.
Support for this work was provided by NSF (U.S.A.) under the Information Technology Research (ITR) program, NSF IIS Award No. 0122466 and by the Ministry of Education of the Czech Republic, projects No. MSM234200004 and No. LN00A063
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Psutka, J., Ircing, P., Psutka, J.V., Radová, V., Byrne, V., Hajič, J., Gustman, S., Ramabhadran, B.: Automatic Transcription of Czech Language Oral History in the MALACH Project: Resources and Initial Experiments. In: Proceedings of TSD 2002, Brno (2002)
Psutka, J., Ircing, P., Psutka, J.V., Radová, V., Byrne, W., Hajič, J., Mírovský, J., Gustman, S.: Large Vocabulary ASR for Spontaneous Czech in the MALACH Project. Submitted to Eurospeech (2003)
Psutka, J., Müller, L., Psutka, J.V.: Comparison of MFCC and PLP Parameterization in the Speaker Independent Continuous Speech Recognition Task. In: Proceedings of Eurospeech 2001, Aalborg (2001)
Psutka, J., Radová, V., Müller, L., Matoušek, J., Ircing, P., Graff, D.: Large Broadcast News and Read Speech Corpora of Spoken Czech. In: Proceedings of Eurospeech 2001, Aalborg (2001)
Young, S., et al.: The HTK Book. Entropic Inc., Cambridge (1999)
Stolcke, A.: SRILM - an Extensible Language Modeling Toolkit. In: Proceedings of ICSLP 2002, Denver (2002)
Mohri, M., Pereira, F., Riley, M.: Weighted Finite-State Transducers in Speech Recognition. In: Proceedings of ASR 2000, International Workshop on Automatic Speech Recognition: Challenges for the Next Millennium, Paris (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Psutka, J. et al. (2003). Towards Automatic Transcription of Spontaneous Czech Speech in the MALACH Project. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2003. Lecture Notes in Computer Science(), vol 2807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39398-6_30
Download citation
DOI: https://doi.org/10.1007/978-3-540-39398-6_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20024-6
Online ISBN: 978-3-540-39398-6
eBook Packages: Springer Book Archive