Abstract
vAssist (Voice Controlled Assistive Care and Communication Services for the Home) is a European project in which several research institutes and companies have developed adapted spoken interfaces to support home care and communication services. This paper describes the spoken dialog system that has been built. Its natural language understanding module includes a novel reference resolver, and the system introduces a new hierarchical paradigm for modeling dialog tasks. The user-centered approach applied throughout the development process led to several experimental sessions with real users. Multilingual experiments carried out in Austria, France and Spain are described, along with their analyses and results in terms of both system performance and user experience. An additional experimental comparison of the RavenClaw and Disco-LFF dialog managers built into the vAssist spoken dialog system showed similar performance and user acceptance for the two.
Acknowledgements
The research presented here was conducted as part of the vAssist project (AAL-2010-3-106), which is partially funded by the European Ambient Assisted Living Joint Programme and the National Funding Agencies from Austria, France and Italy. It has also been partially supported by the Spanish Ministry of Science under grant TIN2014-54288-C4-4-R and by the Basque Government under grant IT685-13.
Copyright information
© 2017 Springer Science+Business Media Singapore
About this chapter
Cite this chapter
Olaso, J.M. et al. (2017). A Multi-lingual Evaluation of the vAssist Spoken Dialog System. Comparing Disco and RavenClaw. In: Jokinen, K., Wilcock, G. (eds) Dialogues with Social Robots. Lecture Notes in Electrical Engineering, vol 427. Springer, Singapore. https://doi.org/10.1007/978-981-10-2585-3_17
DOI: https://doi.org/10.1007/978-981-10-2585-3_17
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2584-6
Online ISBN: 978-981-10-2585-3
eBook Packages: Engineering, Engineering (R0)