Abstract
This paper deals with the automatic segmentation for Czech Concatenative speech synthesis. Statistical approach to speech segmentation using hidden Markov models (HMMs) is applied in the baseline system [1]. Several experiments that concern various issues in the process of building the segmentation system, such as speech parameterization or HMM initialization problems, are described here. An objective comparison of various experimental automatic and manual segmentations is performed to find out the best settings of the segmentation system with respect to our single-female-speaker continuous speech corpus.
This research was supported by the Grant Agency of Czech Republic No. 102/02/P134 and the Ministry of Education of Czech Republic, project No. MSM235200004.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Matoušek, J., Psutka, J.: ARTIC: a New Czech Text-to-Speech System Using Statistical Approach to Speech Segment Database Construction. In: Proceedings of ICSLP 2000, Beijing, vol. IV, pp. 612–615 (2000)
Matoušek, J., Psutka, J., Krůta, J.: On Building Speech Corpus for Concatenation-Based Speech Synthesis. In: Proceedings of Eurospeech 2001, Ålborg, vol. 3, pp. 2047–2050 (2001)
Ljolje, A., Hirschberg, J., van Santen, J.P.H.: Automatic Speech Segmentation for Concatenative Inventory Selection. In: Progress in Speech Synthesis, pp. 305–311. Springer, Heidelberg (1996)
Psutka, J., Müller, L., Psutka, J.V.: Comparison of MFCC and PLP Parameterization in the Speaker Independent Continuous Speech Recognition Task. In: Proceedings of Eurospeeech 2001, Ålborg, pp. 1813–1816 (2001)
Young, S., et al.: The HTK Book (for HTK Version 3.2). Cambridge University Press. Cambridge (2002)
Kim, Y.-J., Conkie, A.: Automatic Segmentation Combining an HMM-Based Approach and Spectral Boundary Correction. In: Proceedings of ICSLP 2002, Denver, pp. 145–148 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Matoušek, J., Tihelka, D., Psutka, J. (2003). Experiments with Automatic Segmentation for Czech Speech Synthesis. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2003. Lecture Notes in Computer Science(), vol 2807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39398-6_41
Download citation
DOI: https://doi.org/10.1007/978-3-540-39398-6_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20024-6
Online ISBN: 978-3-540-39398-6
eBook Packages: Springer Book Archive