Abstract
Emotion speech synthesis is the most important process to generate the naturalness of utterances in text-to-speech system. The interjection utterances in Thai language are employed in express a number of emotions. This paper presents a study of the prosody parameters of the interjection utterances clipped from Thai utterances in the movies. The Thai emotional utterances from various movies have been analyzed and classified into 8 emotional types consisting of neutral, anger, happiness, sadness, fear, pleasant, unpleasant and surprise. The classification of prosodic features is based on fundamental frequency (F0), intensity and duration. This paper compares the prosodic features in the Thai language and other languages including English, Italian, French, Spanish and Arabic. The comparison results show that there are significant differences of prosodic features for each emotion in each language. Therefore, the quality of a text-to-speech system is based on the prosodic analysis of each language.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Tumtavitikul, A., Thitikannara, K.: Thai Intonation of Thai emotional speech. In: Proceedings of the 11th Australian International Conference on Speech Science & Technology, New Zealand, December 6-8 (2006)
Luksaneeyanawin, S., Intonation in Thai. Unpublished Doctoral Dissertation, University of Edinburgh (1983)
Cahn, J.: From Sad to Glad: Emotional Computer Voices. In: Proceedings of Speech Tech. 1988, Voice Input/Output Applications Conference and Exhibition, New York City, pp. 35–37 (1988)
Chuenwattanapranithi, S., Xu, Y., Thipakorn, B., Maneewongvatana, S.: Encoding emotions in speech with the size code. A perceptual investigation. Phonetica 65, 210–230 (2008)
Boersma, P.: Praat, a system for doing phonetics by computer. Glot. International 5(9-10), 341–345 (2001)
Burkhardt, F., Audibert, N., Malatesta, L., Türk, O., Arslan, L., Auberge, V.: Emotional Prosody – Does Culture Make A Difference? Speech Prosody, Dresden, Germany (2006)
Wutiwiwatchai, C., Furui, S.: Thai speech processing technology: a review. Speech Communication 49(1), 8–27 (2007)
Yimngam, S., Premchaisawadi, W., Kreesuradej, W.: Thai Emotion Words Analysis. In: The Eighth International Symposium on Natural Language Processing, SNLP (2009)
Dakkak, O., Ghneim, N., Abou Zliekha, M., Moubayed, S.: Emotion Inclusion in an Arabic Text-to-Speech. In:13th European Signal Processing Conference (2005)
Schröder, M.: Emotional Speech Synthesis - A Review. In: Proc. Eurospeech 2001, Aalborg, vol. 1, pp. 561–564 (2001)
Ser, W., Cen, L., Yu, Z.L.: A Hybrid PNN-GMM classification scheme for speech emotion recognition. In: ICPR 2008, pp. 1–4 (2008)
Ekman, P.: Basic emotions. In: Dalgleish, T., Power, M.J. (eds.) Handbook of Cognition & Emotion, pp. 301–320. John Wiley, New York (1999)
Schlosberg, H.: A scale for the judgement of facial expressions. Journal of Experimental Psychology 29, 497–510 (1941)
Mittrapiyanurak. P., Hansakunbuntheung, C., Tesprasit, V., Sornlertlamvanich, V.: Issues in Thai Text-to-Speech Synthesis: The NECTEC Approach. NECTEC, 483–495 (June 2000)
Tesprasit. V., Charoenpornsawat. P., Sornlertlamvanich. V.: A Context-Sensitive Homograph Disambiguation in Thai Text-to-Speech Synthesis. In: Proceedings of HLT-NAACL 2003 Short Papers, Edmonton, May-June 2003, pp. 103–103 (2003)
Cahn, J.E.: Generating expression in synthesized speech. Technical Report, MIT, Media Technology Laboratory, MA, USA (1990)
Drioli, C., Tisato, G., Cosi, P., Tesser, F.: Emotions and Voice Quality: Experiments with Sinusoidal Modeling. In: Proceedings of Voqual 2003, Voice Quality: Functions, Analysis and Synthesis, ISCA (2003)
Boula de Mareüil, P., Célérier, P., Toen, J.: Generation of Emotions by a Morphing Technique in English, French and Spanish. In: Proc. Speech Prosody, pp. 187–190 (2002)
Alm, C.O., Sproat, R.: Perceptions of Emotions in Expressive Storytelling. InterSpeech, 533–536 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yimngam, S., Premchaisawadi, W., Kreesuradej, W. (2011). Prosody Analysis of Thai Emotion Utterances. In: Muñoz, R., Montoyo, A., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2011. Lecture Notes in Computer Science, vol 6716. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22327-3_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-22327-3_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22326-6
Online ISBN: 978-3-642-22327-3
eBook Packages: Computer ScienceComputer Science (R0)