Abstract
In this paper, we propose a novel method for the perceptual evaluation of pronunciation quality in Computer Assisted Language Learning (CALL) for e-learning. The overall pronunciation quality score combines three components: a matching score, a perceptual score, and an asymmetric score. The matching score measures the acoustic distortion of the test speech; the perceptual score models the distortion as perceived by human listeners in the perceptual domain; and the asymmetric score captures the asymmetric perceptual effect of deletion errors and insertion errors in spoken English. The correlation coefficient between the predicted objective score and the subjective scores assigned by experts is 0.75, which is an improvement over current HMM-based methods.
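As a rough illustration of the evaluation described in the abstract, the sketch below combines three component scores into one overall score and computes a Pearson correlation coefficient between objective and subjective scores. The abstract does not state how the three scores are combined, so the weighted linear combination, the equal default weights, and the function names `overall_score` and `pearson` are all assumptions made for illustration, not the authors' method.

```python
from math import sqrt
from typing import Sequence


def overall_score(matching: float, perceptual: float, asymmetric: float,
                  weights: Sequence[float] = (1 / 3, 1 / 3, 1 / 3)) -> float:
    """Combine the three component scores into one overall score.

    ASSUMPTION: a weighted linear combination with equal default weights.
    The abstract only says the overall score is "the combination" of the
    three scores; the actual combination rule is not given there.
    """
    w_m, w_p, w_a = weights
    return w_m * matching + w_p * perceptual + w_a * asymmetric


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of co-deviations and of squared deviations from the means.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)
```

In use, `overall_score` would be applied to each test utterance, and `pearson` would then compare the resulting list of objective scores against the expert ratings for the same utterances.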
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, CL., Liu, J., Xia, SH. (2006). Perceptual Evaluation of Pronunciation Quality for Computer Assisted Language Learning. In: Pan, Z., Aylett, R., Diener, H., Jin, X., Göbel, S., Li, L. (eds) Technologies for E-Learning and Digital Entertainment. Edutainment 2006. Lecture Notes in Computer Science, vol 3942. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11736639_6
DOI: https://doi.org/10.1007/11736639_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33423-1
Online ISBN: 978-3-540-33424-8
eBook Packages: Computer Science (R0)