Abstract
This paper presents a novel approach to the identification of phonetic similarity using properties observed during the speech recognition process. Experiments are presented whereby specific phones are removed during the training phase of a statistical speech recognition system so that the behaviour of the system can be analysed to see which alternative phone is selected. The domain of the analysis is restricted to specific contexts and the alternatively recognised (or substituted) phones are analysed with respect to a number of factors namely, the common phonetic properties, the phonetic neighbourhood and the frequency of occurrence with respect to a particular corpus. The results indicate that a measure of phonetic similarity based on alternatively recognised observed properties can be predicted based on a combination of these factors and as such can serve as an important additional source of information for the purposes of modelling pronunciation variation.
This research is supported by the Science Foundation Ireland (Grant 07/CE/I1142) as part of the Centre for Next Generation Localisation ( www.cngl.ie ) at University College Dublin and Dublin City University. The opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of Science Foundation Ireland.
The research presented in this paper is a revised and extended version of the paper presented at LTC 2009, Poznań Poland.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Halberstadt, A., Glass, J.: Heterogeneous acoustic measurements for phonetic classification. In: Eurospeech Proceedings, pp. 401–404 (1997)
Scanlon, P., Ellis, D., Reilly, R.: Using broad phonetic group experts for improved speech recognition. IEEE Transactions on Audio, Speech and Language Processing 15(3), 803–812 (2007)
Garofolo, J., Lamel, L., Fisher, W., Fiscus, J., Pallett, D., Dahlgren, N.: The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus CDROM (1993)
Mauclair, J., Aioanei, D., Carson-Berndsen, J.: Exploiting phonetic and phonological similarities as a first step for robust speech recognition. In: EUSIPCO Proceedings (2009)
Van Thuan, P., Kubin, G.: Dwt-based phonetic groups classification using neural networks. In: ICASSP Proceedings, pp. 401–404 (2005)
Ghiselli-Crippa, T., El-Jaroudi, A.: Voiced-unvoiced-silence classification of speech using neural nets. In: IJCNN Proceedings, pp. 851–856 (1991)
The-International-Phonetic-Alphabet (2005), http://www.langsci.ucl.ac.uk/ipa/
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.: Hidden markov model toolkit (htk) (2009), http://htk.eng.cam.ac.uk/ , Version 3.4.1
IPDS, CD-ROM#2: The Kiel Corpus of Spontaneous Speech, vol. 1, Kiel, IPDS (1995)
Chomsky, N., Halle, M.: The sound pattern of english. Harper & Row, New York (1968)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kane, M., Mauclair, J., Carson-Berndsen, J. (2011). Automatic Identification of Phonetic Similarity Based on Underspecification. In: Vetulani, Z. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2009. Lecture Notes in Computer Science(), vol 6562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20095-3_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-20095-3_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20094-6
Online ISBN: 978-3-642-20095-3
eBook Packages: Computer ScienceComputer Science (R0)