Abstract
This paper introduces three grammar-segmentation methods capable of handling the large grammar issues associated with producing a real-time speech-enabled VXML bus travel application for London. Large grammars tend to produce relatively slow recognition interfaces and this work shows how this limitation can be successfully addressed. Comparative experimental results show that the novel last-word recognition based grammar segmentation method described here achieves an optimal balance between recognition rate, speed of processing and naturalness of interaction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zhao, B., Allen, T., & Bargiela, A. Evaluation of a mixed-initiative dialogue multimodal interface. In: Macintosh, A., Ellis, R. & Allen, T. (ed) Application and innovations in intelligent system XII, Springer-Verlag, London, 2004, 265–278.
Zhao, B., Allen, T., & Bargiela, A. Usability evaluation of a directed-dialogue speech-enabled query interface for the ATTAIN travel information system. RASC 2004, 265–278
Tang, M. Large vocabulary continuous speech recognition using linguistic features and constraints. PhD Thesis, MIT, USA, 2005.
Levow, G. Making sense of silence. Conference on Human Factors in Computing Systems, 1997, Workshop on Speech User Interface Design Challenge.
Deroo, 0. Hidden Markov Models and neural networks for speech recognition. PhD Thesis, Faculté Polytechnique de Mons, Belgium, 1998.
Young, S., Adda-Dekker, M., Aubet, X., Dugast, C., Gauvain, J., Kershaw, D., Lamel, L., Leeuwen, D., Pye, D., Robinson, A., A., Steeneken, H., & Woodland, P. Multilingual large vocabulary speech recognition: the European SQALE project. Computer Speech and Language, 11:73–89.
Peissner, M. What the relationship between correct recognition rates and usability measures can tell us about the quality of a speech application. WWDU 2002, 296–298.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag London Limited
About this paper
Cite this paper
Zhao, B., Allen, T., Bargiela, A. (2007). Speech-Enabled Interfaces for Travel Information Systems with Large Grammars. In: Ellis, R., Allen, T., Tuson, A. (eds) Applications and Innovations in Intelligent Systems XIV. SGAI 2006. Springer, London. https://doi.org/10.1007/978-1-84628-666-7_16
Download citation
DOI: https://doi.org/10.1007/978-1-84628-666-7_16
Publisher Name: Springer, London
Print ISBN: 978-1-84628-665-0
Online ISBN: 978-1-84628-666-7
eBook Packages: Computer ScienceComputer Science (R0)