A Multi-lingual Evaluation of the vAssist Spoken Dialog System. Comparing Disco and RavenClaw

Javier Mikel Olaso³,
Pierrick Milhorat⁴,
Julia Himmelsbach⁵,
Jérôme Boudy⁶,
Gérard Chollet⁴,
Stephan Schlögl⁷ &
María Inés Torres³

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 427)

Abstract

vAssist (Voice Controlled Assistive Care and Communication Services for the Home) is a European project for which several research institutes and companies have been working on the development of adapted spoken interfaces to support home care and communication services. This paper describes the spoken dialog system that has been built. Its natural language understanding module includes a novel reference resolver and it introduces a new hierarchical paradigm to model dialog tasks. The user-centered approach applied to the whole development process led to the setup of several experiment sessions with real users. Multilingual experiments carried out in Austria, France and Spain are described along with their analyses and results in terms of both system performance and user experience. An additional experimental comparison of the RavenClaw and Disco-LFF dialog managers built into the vAssist spoken dialog system highlighted similar performance and user acceptance.

Acknowledgements

The presented research is conducted as part of the vAssist project (AAL-2010-3-106), which is partially funded by the European Ambient Assisted Living Joint Programme and the National Funding Agencies from Austria, France and Italy. It has also been partially supported by the Spanish Ministry of Science under grant TIN2014-54288-C4-4-R and by the Basque Government under grant IT685-13.

Author information

Authors and Affiliations

Universidad del País Vasco UPV/EHU, Leioa, Spain
Javier Mikel Olaso & María Inés Torres
Télécom ParisTech, Paris, France
Pierrick Milhorat & Gérard Chollet
AIT Austrian Institute of Technology GmbH, Vienna, Austria
Julia Himmelsbach
Télécom SudParis, Évry, France
Jérôme Boudy
MCI Management Center Innsbruck, Innsbruck, Austria
Stephan Schlögl

Corresponding author

Correspondence to Javier Mikel Olaso.

Editor information

Editors and Affiliations

Institute of Behavioural Sciences, University of Helsinki, Helsinki, Finland
Kristiina Jokinen
University of Helsinki, Helsinki, Finland
Graham Wilcock

About this chapter

Cite this chapter

Olaso, J.M. et al. (2017). A Multi-lingual Evaluation of the vAssist Spoken Dialog System. Comparing Disco and RavenClaw. In: Jokinen, K., Wilcock, G. (eds) Dialogues with Social Robots. Lecture Notes in Electrical Engineering, vol 427. Springer, Singapore. https://doi.org/10.1007/978-981-10-2585-3_17

DOI: https://doi.org/10.1007/978-981-10-2585-3_17
Published: 25 December 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2584-6
Online ISBN: 978-981-10-2585-3
eBook Packages: Engineering, Engineering (R0)
