More Web Proxy on the site http://driver.im/

research-article

What’s The Talk on VUI Guidelines? A Meta-Analysis of Guidelines for Voice User Interface Design

Authors:

Christine Murad,

Heloisa Candello,

Cosmin MunteanuAuthors Info & Claims

CUI '23: Proceedings of the 5th International Conference on Conversational User Interfaces

Article No.: 19, Pages 1 - 16

https://doi.org/10.1145/3571884.3597129

Published: 19 July 2023 Publication History

Editorial Notes

The authors have requested minor, non-substantive changes to the VoR and, in accordance with ACM policies, a Corrected VoR was published on January 4, 2024. For reference purposes the VoR may still be accessed via the Supplemental Material section on this page.

Abstract

Over the past decade, voice user interface (VUI) design has been steadily growing, along with a growing VUI presence in consumer markets. However, there is currently a lack of widely-established guidelines for VUI design. While many sets of VUI guidelines have been proposed, they tend to be developed independently of each other, leading to a lack of consensus on appropriate guidelines for VUI design. This can hinder the wider adoption of practical VUI guidelines. To address this gap, we performed a large-scale meta-analysis of 336 VUI design guidelines that have been proposed in academic literature. Using thematic analysis, we present a unified and synthesized set of 14 guidelines, representing the most universally proposed principles of VUI design as captured by the 336 VUI guidelines identified in academic literature. We hope that this synthesized set can address several of the challenges to the adoption of VUI guidelines in design practice.

Supplemental Material

PDF File - 3597129-VoR

Version of Record for "What?s The Talk on VUI Guidelines? A Meta-Analysis of Guidelines for Voice User Interface Design" by Murad et al., Proceedings of the 5th International Conference on Conversational User Interfaces (CUI '23).

Download
1.16 MB

References

[1]

2022. Conversation Design. https://developers.google.com/assistant/conversation-design/welcome

[2]

2022. Get Started with the Guide | Alexa Design Guide. Amazon (Alexa). https://developer.amazon.com/en-US/docs/alexa/alexa-design/get-started.html

[3]

2022. Introduction - Siri - Human Interface Guidelines - Apple Developer. https://developer.apple.com/design/human-interface-guidelines/siri/overview/introduction/

[4]

Samer Al Moubayed, Gabriel Skantze, Jonas Beskow, Kalin Stefanov, and Joakim Gustafson. 2012. Multimodal Multiparty Social Interaction with the Furhat Head. In Proceedings of the 14th ACM International Conference on Multimodal Interaction(ICMI ’12). Association for Computing Machinery, New York, NY, USA, 293–294. https://doi.org/10.1145/2388676.2388736 event-place: Santa Monica, California, USA.

Digital Library

[5]

Ghazanfar Ali, Myungho Lee, and Jae-In Hwang. 2020. Automatic text-to-gesture rule generation for embodied conversational agents. COMPUTER ANIMATION AND VIRTUAL WORLDS 31, 4–5 (Jul 2020). https://doi.org/10.1002/cav.1944

[6]

M. Allison and L. M. Kendrick. 2013. Towards an expressive embodied conversational agent utilizing multi-ethnicity to augment solution focused therapy. In FLAIRS 2013 - Proceedings of the 26th International Florida Artificial Intelligence Research Society Conference. 332–337. www.scopus.com

[7]

Marco Almada and Juliano Maranhao. 2021. Voice-based diagnosis of covid-19: ethical and legal challenges. INTERNATIONAL DATA PRIVACY LAW 11, 1 (Feb 2021), 63–75. https://doi.org/10.1093/idpl/ipab004

[8]

Nuno Almeida, Samuel Silva, António Teixeira, Maksym Ketsmur, Diogo Guimarães, and Emanuel Fonseca. 2018. Multimodal Interaction for Accessible Smart Homes. In Proceedings of the 8th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-Exclusion(DSAI 2018). Association for Computing Machinery, New York, NY, USA, 63–70. https://doi.org/10.1145/3218585.3218595 event-place: Thessaloniki, Greece.

Digital Library

[9]

M. Anabuki, H. Kakuta, H. Yamamoto, and H. Tamura. 2000. Welbo: An embodied conversational agent living in mixed reality space. In Conference on Human Factors in Computing Systems - Proceedings. 10–11. www.scopus.com

[10]

Marco Avvenuti and Alessio Vecchio. 2009. Mobile Visual Access to Legacy Voice-Based Applications. In Proceedings of the 6th International Conference on Mobile Technology, Application & Systems(Mobility ’09). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/1710035.1710097 event-place: Nice, France.

Digital Library

[11]

Matthew P. Aylett, Per Ola Kristensson, Steve Whittaker, and Yolanda Vazquez-Alvarez. 2014. None of a CHInd. Proc. of CHI EA ’14 (2014), 749–760. https://doi.org/10.1145/2559206.2578868

Digital Library

[12]

Rajesh Balchandran, Mark E. Epstein, Gerasimos Potamianos, and Ladislav Seredi. 2008. A Multi-Modal Spoken Dialog System for Interactive TV. In Proceedings of the 10th International Conference on Multimodal Interfaces(ICMI ’08). Association for Computing Machinery, New York, NY, USA, 191–192. https://doi.org/10.1145/1452392.1452429 event-place: Chania, Crete, Greece.

Digital Library

[13]

N. O. Bernsen, H. Dybkjaer, and L. Dybkjaer. 1996. Principles for the design of cooperative spoken human-machine dialogue. In Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP ’96, Vol. 2. 729–732 vol.2. https://doi.org/10.1109/ICSLP.1996.607465

[14]

Niels O. Bernsen, Hans Dybkjær, and Laila Dybkjær. 1996. Cooperativity in human–machine and human–human spoken dialogue. Discourse Processes 21, 2 (March 1996), 213–236. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F618843297%3Faccountid%3D14771 ISBN: 0163-853X, 0163-853X.

[15]

Dan Bohus and Eric Horvitz. 2010. Facilitating Multiparty Dialog with Gaze, Gesture, and Speech. In International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction(ICMI-MLMI ’10). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/1891903.1891910 event-place: Beijing, China.

Digital Library

[16]

Jose A Borges, Israel Morales, and Nkstor J Rodriguez. 1996. Guidelines for Designing Usable World Wide Web Pages. Technical Report. http://delivery.acm.org/10.1145/260000/257320/p277-borges.pdf?ip=174.112.248.232&id=257320&acc=ACTIVE SERVICE&key=FD0067F557510FFB.148C9AE997532579.2370BB3FAC5962EF.4D4702B0C3E38B35&__acm__=1537318774_5ad8eb060e1eefa36cb28cf6616570b5

[17]

M. . Bourguet. 2006. Towards a taxonomy of error-handling strategies in recognition-based multi-modal human-computer interfaces. Signal Processing 86, 12 (2006), 3625–3643. www.scopus.com

Digital Library

[18]

Stacy M. Branham and Antony Rishin Mukkath Roy. 2019. Reading Between the Guidelines: How Commercial Voice Assistant Guidelines Hinder Accessibility for Blind Users. In Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility (Pittsburgh, PA, USA) (ASSETS ’19). Association for Computing Machinery, New York, NY, USA, 446–458. https://doi.org/10.1145/3308561.3353797

Digital Library

[19]

Virginia Braun and Victoria Clarke. 2006. Using thematic analysis in psychology. Qualitative Research in Psychology 3, 2 (Jan. 2006), 77–101. https://doi.org/10.1191/1478088706qp063oa

[20]

Robin N. Brewer, Leah Findlater, Joseph ’Jofish’ Kaye, Walter Lasecki, Cosmin Munteanu, and Astrid Weber. 2018. Accessible Voice Interfaces. In Companion of the 2018 ACM Conference on Computer Supported Cooperative Work and Social Computing(CSCW ’18). Association for Computing Machinery, New York, NY, USA, 441–446. https://doi.org/10.1145/3272973.3273006 event-place: Jersey City, NJ, USA.

Digital Library

[21]

Justine Cassell. 2000. Embodied conversational interface agents. Association for Computing Machinery.Communications of the ACM 43, 4 (2000), 70–78. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F237048747%3Faccountid%3D14771 ISBN: 00010782.

Digital Library

[22]

Leigh Clark, Benjamin R. Cowan, Abi Roper, Stephen Lindsay, and Owen Sheers. 2020. Speech Diversity and Speech Interfaces: Considering an Inclusive Future through Stammering. In Proceedings of the 2nd Conference on Conversational User Interfaces(CUI ’20). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3405755.3406139 event-place: Bilbao, Spain.

Digital Library

[23]

Leigh Clark, Philip Doyle, Diego Garaialde, Emer Gilmartin, Stephan Schlögl, Jens Edlund, Matthew Aylett, João Cabral, Cosmin Munteanu, Justin Edwards, and Benjamin R Cowan. 2019. The State of Speech in HCI: Trends, Themes and Challenges. Interacting with Computers 31, 4 (Dec. 2019), 349–371. https://doi.org/10.1093/iwc/iwz016

[24]

Leigh Clark, Nadia Pantidi, Orla Cooney, Philip Doyle, Diego Garaialde, Justin Edwards, Brendan Spillane, Emer Gilmartin, Christine Murad, Cosmin Munteanu, Vincent Wade, and Benjamin R. Cowan. 2019. What Makes a Good Conversation? Challenges in Designing Truly Conversational Agents. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems(CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–12. https://doi.org/10.1145/3290605.3300705 event-place: Glasgow, Scotland Uk.

Digital Library

[25]

Eric Corbett and Astrid Weber. 2016. What Can I Say? Addressing User Experience Challenges of a Mobile Voice User Interface for Accessibility. In Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services(MobileHCI ’16). Association for Computing Machinery, New York, NY, USA, 72–82. https://doi.org/10.1145/2935334.2935386 event-place: Florence, Italy.

Digital Library

[26]

Benjamin R Cowan, Nadia Pantidi, David Coyle, Kellie Morrissey, Peter Clarke, Sara Al-Shehri, David Earley, and Natasha Bandeira. 2017. "What Can I Help You With?": Infrequent Users’ Experiences of Intelligent Personal Assistants. In Proc. of MobileHCI ’17. 1–12. https://doi.org/10.1145/3098279.3098539

Digital Library

[27]

Colleen E Crangle, Lawrence M Fagan, Robert W Carlson, Mark S Erlbaum, David D Sherertz, and Mark S Tuttle. 1998. Collaborative conversational interfaces. International Journal of Speech Technology 2 (1998), 187–200. https://doi.org/10.1007/BF02111207

[28]

Andreea Danielescu. 2020. Eschewing Gender Stereotypes in Voice Assistants to Promote Inclusion. In Proceedings of the 2nd Conference on Conversational User Interfaces(CUI ’20). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3405755.3406151 event-place: Bilbao, Spain.

Digital Library

[29]

Alan Lopes de Sousa Freitas, Vinícius Paes de Camargo, Heloise Manica Paris Teixeira, Renato Balancieri, and Thelma Elita Colanzi. 2017. Gesture and Voice-Based Natural User Interface for Electronic Whiteboard System in a Medical Emergency Department. In Proceedings of the XVI Brazilian Symposium on Human Factors in Computing Systems(IHC 2017). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3160504.3160534 event-place: Joinville, Brazil.

Digital Library

[30]

Carlos Delgado Kloos, Carlos Alario-Hoyos, Pedro J. Munoz-Merino, Cristina Catalan Aguirre, and Nuria Gonzalez Castro. 2019. Principles for the Design of an Educational Voice Assistant for Learning Java. In SUSTAINABLE ICT, EDUCATION AND LEARNING(IFIP Advances in Information and Communication Technology, Vol. 564), Tatnall, A and Mavengere, N (Ed.). 99–106. https://doi.org/10.1007/978-3-030-28764-1_12 ISSN: 1868-4238.

[31]

Laila Dybkjær, Niels Ole Bernsen, and Hans Dybkjær. 1996. Grice Incorporated: Cooperativity in Spoken Dialogue. In Proceedings of the 16th Conference on Computational Linguistics - Volume 1(COLING ’96). Association for Computational Linguistics, USA, 328–333. https://doi.org/10.3115/992628.992686 event-place: Copenhagen, Denmark.

Digital Library

[32]

F. Ebbers, J. Zibuschka, C. Zimmermann, and O. Hinz. 2020. User preferences for privacy features in digital assistants. Electronic Markets (2020). https://doi.org/10.1007/s12525-020-00447-y

[33]

S. Estes, J. Helleberg, K. Long, M. Pollack, and M. Quezada. 2018. Guidelines for speech interactions between pilot and cognitive assistant. In 2018 Integrated Communications, Navigation, Surveillance Conference (ICNS). 3H2–1–3H2–10. https://doi.org/10.1109/ICNSURV.2018.8384875

[34]

Raymond Fok, Harmanpreet Kaur, Skanda Palani, Martez E. Mott, and Walter S. Lasecki. 2018. Towards More Robust Speech Interactions for Deaf and Hard of Hearing Users. In Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility(ASSETS ’18). Association for Computing Machinery, New York, NY, USA, 57–67. https://doi.org/10.1145/3234695.3236343 event-place: Galway, Ireland.

Digital Library

[35]

Natalie Friedman, Andrea Cuadra, Ruchi Patel, Shiri Azenkot, Joel Stein, and Wendy Ju. 2019. Voice Assistant Strategies and Opportunities for People with Tetraplegia. In The 21st International ACM SIGACCESS Conference on Computers and Accessibility(ASSETS ’19). Association for Computing Machinery, New York, NY, USA, 575–577. https://doi.org/10.1145/3308561.3354605 event-place: Pittsburgh, PA, USA.

Digital Library

[36]

Lokesh Fulfagar, Anupriya Gupta, Arpit Mathur, and Abhishek Shrivastava. 2021. Development and Evaluation of Usability Heuristics for Voice User Interfaces. In Design for Tomorrow—Volume 1(Smart Innovation, Systems and Technologies), Amaresh Chakrabarti, Ravi Poovaiah, Prasad Bokil, and Vivek Kant (Eds.). Springer, Singapore, 375–385. https://doi.org/10.1007/978-981-16-0041-8_32

[37]

Kotaro Funakoshi, Mikio Nakano, Kazuki Kobayashi, Takanori Komatsu, and Seiji Yamada. 2010. Non-Humanlike Spoken Dialogue: A Design Perspective. In Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue(SIGDIAL ’10). Association for Computational Linguistics, USA, 176–184. event-place: Tokyo, Japan.

[38]

M. Funk, C. Cunningham, D. Kanver, C. Saikalis, and R. Pansare. 2020. Usable and Acceptable Response Delays of Conversational Agents in Automotive User Interfaces. In Proceedings - 12th International ACM Conference on Automotive User Interfaces and Interactive Vehicular Applications, AutomotiveUI 2020. 262–269. https://doi.org/10.1145/3409120.3410651

Digital Library

[39]

Anushay Furqan, Chelsea Myers, and Jichen Zhu. 2017. Learnability through Adaptive Discovery Tools in Voice User Interfaces. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems(CHI EA ’17). Association for Computing Machinery, New York, NY, USA, 1617–1623. https://doi.org/10.1145/3027063.3053166 event-place: Denver, Colorado, USA.

Digital Library

[40]

Abraham Glasser, Vaishnavi Mande, and Matt Huenerfauth. 2020. Accessibility for Deaf and Hard of Hearing Users: Sign Language Conversational User Interfaces. In Proceedings of the 2nd Conference on Conversational User Interfaces(CUI ’20). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3405755.3406158 event-place: Bilbao, Spain.

Digital Library

[41]

S Gopalakrishnan and P Ganeshkumar. 2013. Systematic Reviews and Meta-analysis: Understanding the Best Evidence in Primary Healthcare. J Family Med Prim Care (2013). https://doi.org/10.4103/2249-4863.109934

[42]

Mardé Greeff, Louis Coetzee, and Martin Pistorius. 2008. Usability Evaluation of the South African National Accessibility Portal Interactive Voice Response System. In Proceedings of the 2008 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists on IT Research in Developing Countries: Riding the Wave of Technology(SAICSIT ’08). Association for Computing Machinery, New York, NY, USA, 76–85. https://doi.org/10.1145/1456659.1456669 event-place: Wilderness, South Africa.

Digital Library

[43]

Mohammad Hadian, Thamer Altuwaiyan, Xiaohui Liang, and Wei Li. 2017. Efficient and Privacy-Preserving Voice-Based Search over Mhealth Data. In Proceedings of the Second IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies(CHASE ’17). IEEE Press, 96–101. https://doi.org/10.1109/CHASE.2017.66 event-place: Philadelphia, Pennsylvania.

Digital Library

[44]

Aki Halonen, Sami Hyrynsalmi, Kai K. Kimppa, Timo Knuutila, Jouni Smed, and Harri Hakonen. 2012. Towards Usability Heuristics for Games Utilizing Speech Recognition. In 4TH ASIAN CONFERENCE ON IN℡LIGENT GAMES AND SIMULATION - 4TH ASIAN SIMULATION TECHNOLOGY CONFERENCE, Inaba, M and Hosoi, K and Thawonmas, R and Nakamura, A and Uemura, M (Ed.). 51–55.

[45]

X. Han and T. Yeh. 2020. How does your alexa behave?: Evaluating voice applications by design guidelines using an automatic voice crawler. In CEUR Workshop Proceedings, Vol. 2848.

[46]

Danula Hettiachchi, Zhanna Sarsenbayeva, Fraser Allison, Niels van Berkel, Tilman Dingler, Gabriele Marini, Vassilis Kostakos, and Jorge Goncalves. 2020. "Hi! I Am the Crowd Tasker" Crowdsourcing through Digital Voice Assistants. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems(CHI ’20). Association for Computing Machinery, New York, NY, USA, 1–14. https://doi.org/10.1145/3313831.3376320 event-place: Honolulu, HI, USA.

Digital Library

[47]

K. S. Hone and C. Baber. 2001. Designing habitable dialogues for speech-based interaction with computers. International Journal of Human-Computer Studies 54, 4 (2001), 637–662. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F619618966%3Faccountid%3D14771 ISBN: 1071-5819, 1071-5819.

Digital Library

[48]

Tamino Huxohl, Marian Pohling, Birte Carlmeyer, Britta Wrede, and Thomas Hermann. 2019. Interaction guidelines for personal voice assistants in smart homes. In 2019 10th international conference on speech technology and human-computer dialogue, SpeD 2019. 1–10. https://doi.org/10.1109/SPED.2019.8906642

[49]

Rodolfo Inostroza, Cristian Rusu, Silvana Roncagliolo, Cristhy Jimenez, and Virginica Rusu. 2012. Usability Heuristics for Touchscreen-based Mobile Devices. In 2012 Ninth International Conference on Information Technology - New Generations. IEEE, 662–667. https://doi.org/10.1109/ITNG.2012.134

Digital Library

[50]

Lopatovska Irene, Alice L. Griffin, Kelsey Gallagher, Ballingall Caitlin, Clair Rock, and Mildred Velazquez. 2020. User recommendations for intelligent personal assistants. Journal of Librarianship and Information Science 52, 2 (2020), 577–591. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F2389579821%3Faccountid%3D14771 ISBN: 0961-0006.

[51]

Ing-Marie Jonsson and Nils Dahlback. 2011. I Can’t Hear You? Drivers Interacting with Male or Female Voices in Native or Non-native Language. In UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION: CONTEXT DIVERSITY, PT 3(Lecture Notes in Computer Science, Vol. 6767), Stephanidis, C (Ed.). 298–305. ISSN: 0302-9743 Issue: 3.

[52]

C. A. Kamm and M. A. Walker. 1997. Design and evaluation of spoken dialog systems. In 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings. 11–18. https://doi.org/10.1109/ASRU.1997.658969

[53]

Junhan Kim, Yoojung Kim, Byungjoon Kim, Sukyung Yun, Minjoon Kim, and Joongseek Lee. 2018. Can a Machine Tend to Teenagers’ Emotional Needs? A Study with Conversational Agents. In Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems(CHI EA ’18). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3170427.3188548 event-place: Montreal QC, Canada.

Digital Library

[54]

Junhan Kim, Yoojung Kim, Byungjoon Kim, Sukyung Yun, Minjoon Kim, and Joongseek Lee. 2018. Can a Machine Tend to Teenagers’ Emotional Needs? A Study with Conversational Agents. In Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems(CHI EA ’18). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3170427.3188548 event-place: Montreal QC, Canada.

Digital Library

[55]

Sunyoung Kim. 2021. Exploring how older adults use a smart Speaker-Based voice assistant in their first interactions: Qualitative study. JMIR MHEALTH AND UHEALTH 9, 1 (Jan 2021). https://doi.org/10.2196/20427

[56]

Y. Kim, M. Reza, J. McGrenere, and D. Yoon. 2021. Designers characterize naturalness in voice user interfaces: Their goals, practices, and challenges. https://doi.org/10.1145/3411764.3445579

Digital Library

[57]

Raina Langevin, Ross J Lordon, Thi Avrahami, Benjamin R. Cowan, Tad Hirsch, and Gary Hsieh. 2021. Heuristic Evaluation of Conversational Agents. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 632, 15 pages. https://doi.org/10.1145/3411764.3445312

Digital Library

[58]

Martha Larson, Nelleke Oostdijk, and Frederik Zuiderveen Borgesius. 2021. Not directly stated, not explicitly stored: Conversational agents and the privacy threat of implicit information. In Adjunct proceedings of the 29th ACM conference on user modeling, adaptation and personalization(UMAP ’21). Association for Computing Machinery, New York, NY, USA, 388–391. https://doi.org/10.1145/3450614.3463601

Digital Library

[59]

Minha Lee and Sangsu Lee. 2021. “I Don’t Know Exactly but I Know a Little”: Exploring Better Responses of Conversational Agents with Insufficient Information. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI EA ’21). Association for Computing Machinery, New York, NY, USA, Article 427, 5 pages. https://doi.org/10.1145/3411763.3451812

Digital Library

[60]

Alessandro Liberati, Douglas G. Altman, Jennifer Tetzlaff, Cynthia Mulrow, Peter C. Gøtzsche, John P. A. Ioannidis, Mike Clarke, P. J. Devereaux, Jos Kleijnen, and David Moher. 2009. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. PLoS medicine 6, 7 (Jul 2009), e1000100. https://doi.org/10.1371/journal.pmed.1000100

[61]

Isabella Loddo and Dario Martini. 2017. The cocktail party effect. An inclusive vision of conversational interactions. The Design Journal 20 (2017), 4076. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F1936558654%3Faccountid%3D14771 ISBN: 14606925.

[62]

Ewa Luger and Abigail Sellen. 2016. "Like Having a Really Bad PA": The Gulf between User Expectation and Experience of Conversational Agents. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems - CHI ’16. 5286–5297. https://doi.org/10.1145/2858036.2858288

Digital Library

[63]

Oussama Metatla, Alison Oldfield, Taimur Ahmed, Antonis Vafeas, and Sunny Miglani. 2019. Voice User Interfaces in Schools: Co-Designing for Inclusion with Visually-Impaired and Sighted Pupils. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems(CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–15. https://doi.org/10.1145/3290605.3300608 event-place: Glasgow, Scotland Uk.

Digital Library

[64]

Aarthi Easwara Moorthy and Kim Phuong L. Vu. 2015. Privacy Concerns for Use of Voice Activated Personal Assistant in the Public Space. International Journal of Human-Computer Interaction 31, 4 (2015), 307–335. https://doi.org/10.1080/10447318.2014.986642

[65]

Cosmin Munteanu, Ben Cowan, Keisuke Nakamura, Pourang Irani, Sharon Oviatt, Matthew Aylett, Gerald Penn, Shimei Pan, Nikhil Sharma, Frank Rudzicz, and Randy Gomez. 2017. Designing Speech, Acoustic and Multimodal Interactions. In Proc. of CHI EA ’17. 601–608. https://doi.org/10.1145/3027063.3027086

Digital Library

[66]

Christine Murad and Cosmin Munteanu. 2019. "I Don’t Know What You’re Talking about, HALexa": The Case for Voice User Interface Guidelines. In Proceedings of the 1st International Conference on Conversational User Interfaces(CUI ’19). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3342775.3342795 event-place: Dublin, Ireland.

Digital Library

[67]

Christine Murad, Cosmin Munteanu, Leigh Clark, and Benjamin R. Cowan. 2018. Design guidelines for hands-free speech interaction. In Proc. of MobileHCI ’18. ACM Press, New York, New York, USA, 269–276. https://doi.org/10.1145/3236112.3236149

Digital Library

[68]

Christine Murad, Cosmin Munteanu, Benjamin R. Cowan, and Leigh Clark. 2019. Revolution or Evolution? Speech Interaction and HCI Design Guidelines. IEEE PERVASIVE COMPUTING 18, 2 (June 2019), 33–45. https://doi.org/10.1109/MPRV.2019.2906991

Digital Library

[69]

Christine Murad, Cosmin Munteanu, Benjamin R. Cowan, Leigh Clark, Martin Porcheron, Heloisa Candello, Stephan Schlögl, Matthew P. Aylett, Jaisie Sin, Robert J. Moore, Grace Hughes, and Andrew Ku. 2021. Let’s Talk About CUIs: Putting Conversational User Interface Design Into Practice. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI EA ’21). Association for Computing Machinery, New York, NY, USA, Article 98, 6 pages. https://doi.org/10.1145/3411763.3441336

Digital Library

[70]

Christine Murad, Cosmin Munteanu, Benjamin R. Cowan, and Leigh Clark. 2021. Finding a New Voice: Transitioning Designers from GUI to VUI Design. In CUI 2021 - 3rd Conference on Conversational User Interfaces(CUI ’21). Association for Computing Machinery, New York, NY, USA, 1–12. https://doi.org/10.1145/3469595.3469617

Digital Library

[71]

Christine Murad, Humaira Tasnim, and Cosmin Munteanu. 2022. “Voice-First Interfaces in a GUI-First Design World”: Barriers and Opportunities to Supporting VUI Designers On-the-Job. In Proceedings of the 4th Conference on Conversational User Interfaces (Glasgow, United Kingdom) (CUI ’22). Association for Computing Machinery, New York, NY, USA, Article 17, 10 pages. https://doi.org/10.1145/3543829.3543842

Digital Library

[72]

Chelsea M. Myers. 2019. Adaptive suggestions to increase learnability for voice user interfaces. In Proceedings of the 24th International Conference on Intelligent User Interfaces Companion - IUI ’19. ACM Press, New York, New York, USA, 159–160. https://doi.org/10.1145/3308557.3308727

Digital Library

[73]

Chelsea M. Myers, Anushay Furqan, and Jichen Zhu. 2019. The Impact of User Characteristics and Preferences on Performance with an Unfamiliar Voice User Interface. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems(CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–9. https://doi.org/10.1145/3290605.3300277 event-place: Glasgow, Scotland Uk.

Digital Library

[74]

T. J. Ndwe, M. E. Dlodlo, and D. J. Mashao. 2008. Usability engineering of an interactive voice response system in a diverse-cultured and multilingual setting. In Innovative Techniques in Instruction Technology, E-Learning, E-Assessment, and Education. 554–559. www.scopus.com

[75]

Jakob Nielsen. 1994. Enhancing the explanatory power of usability heuristics. Proc. of CHI ’94 (1994), 152–158. https://doi.org/10.1145/191666.191729

Digital Library

[76]

Donald Norman. 1988. The Design of Everyday Things. Doubled Currency (1988).

[77]

Matthew J. Page, Joanne E. McKenzie, Patrick M. Bossuyt, Isabelle Boutron, Tammy C. Hoffmann, Cynthia D. Mulrow, Larissa Shamseer, Jennifer M. Tetzlaff, Elie A. Akl, Sue E. Brennan, Roger Chou, Julie Glanville, Jeremy M. Grimshaw, Asbjørn Hróbjartsson, Manoj M. Lalu, Tianjing Li, Elizabeth W. Loder, Evan Mayo-Wilson, Steve McDonald, Luke A. McGuinness, Lesley A. Stewart, James Thomas, Andrea C. Tricco, Vivian A. Welch, Penny Whiting, and David Moher. 2021. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. 134 (Jun 2021), 178–189. https://doi.org/10.1016/j.jclinepi.2021.03.001

[78]

N. Patel, S. Agarwal, N. Rajput, A. Nanavati, P. Dave, and T. S. Parikh. 2008. Experiences designing a voice interface for rural India. In 2008 IEEE Spoken Language Technology Workshop. 21–24. https://doi.org/10.1109/SLT.2008.4777830

[79]

Cathy Pearl. 2016. Designing Voice User Interfaces: Principles of Conversational Experiences (1st ed.). O’Reilly Media, Inc.

[80]

David Pinelle, Nelson Wong, and Tadeusz Stach. 2008. Heuristic evaluation for games: usability principles for video game design. Proceedings of SIGCHI Conference on Human Factors in Computing Systems (2008), 1453–1462. https://doi.org/10.1145/1357054.1357282

Digital Library

[81]

Dominik Pins, Alexander Boden, Britta Essing, and Gunnar Stevens. 2020. "Miss Understandable": A Study on How Users Appropriate Voice Assistants and Deal with Misunderstandings. In Proceedings of Mensch Und Computer 2020 (Magdeburg, Germany) (MuC ’20). Association for Computing Machinery, New York, NY, USA, 349–359. https://doi.org/10.1145/3404983.3405511

Digital Library

[82]

V. Raveendran, M. R. Sanjeev, N. Paul, and Jijina K.P.2016. Speech only interface approach for personal computing environment. In 2016 IEEE International Conference on Engineering and Technology (ICETECH). 372–377. https://doi.org/10.1109/ICETECH.2016.7569279

[83]

Steven Ross, Elizabeth Brownholtz, and Robert Armes. 2004. Voice User Interface Principles for a Conversational Agent. In Proceedings of the 9th International Conference on Intelligent User Interfaces(IUI ’04). Association for Computing Machinery, New York, NY, USA, 364–365. https://doi.org/10.1145/964442.964536 event-place: Funchal, Madeira, Portugal.

Digital Library

[84]

V. F. M. Salvador and L. de Assis Moura. 2010. Heuristic evaluation for automatic radiology reporting transcription systems. In 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010). 292–295. https://doi.org/10.1109/ISSPA.2010.5605467

[85]

Robert M. Schumacher, Mary L. Hardzinski, and Amy L. Schwartz. 1995. Increasing the Usability of Interactive Voice Response Systems: Research and Guidelines for Phone-Based Interfaces. Human factors 37, 2 (June 1995), 251. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F1311858959%3Faccountid%3D14771 ISBN: 0018-7208.

[86]

J Sherwani, Dong Yu, and Tim Paek. 2007. Voicepedia: towards speech-based access to unstructured information.Interspeech (2007), 2–5. http://research.microsoft.com/pubs/78835/VoicePedia-Interspeech2007.pdf

[87]

J.Y. Shin and J. Huh-Yoo. 2020. Designing everyday conversational agents for managing health and wellness: A study of alexa skills reviews. In ACM International Conference Proceeding Series. 50–61. https://doi.org/10.1145/3421937.3422024

Digital Library

[88]

Ben Shneiderman. 2000. The limits of speech recognition. Commun. ACM 43, 9 (2000), 63–65. https://doi.org/10.1145/348941.348990

Digital Library

[89]

Shoupu Chen, Z. Kazi, M. Beitler, M. Salganicoff, D. Chester, and R. Foulds. 1996. Gesture-speech based HMI for a rehabilitation robot. In Proceedings of SOUTHEASTCON ’96. 29–36. https://doi.org/10.1109/SECON.1996.510021

[90]

Bernhard Suhm. 2003. Towards Best Practices for Speech User Interface Design. In Proc. of EuroSpeech ’03. 2217–2220.

[91]

Alistair Sutcliffe and Brian Gault. 2004. Heuristic evaluation of virtual reality applications. Interacting with Computers 16, 4 (2004), 831–849. https://doi.org/10.1016/j.intcom.2004.05.001

[92]

Vanessa Tobisch, Markus Funk, and Adam Emfield. 2020. Dealing with Input Uncertainty in Automotive Voice Assistants. In 12th International Conference on Automotive User Interfaces and Interactive Vehicular Applications (Virtual Event, DC, USA) (AutomotiveUI ’20). Association for Computing Machinery, New York, NY, USA, 161–168. https://doi.org/10.1145/3409120.3410660

Digital Library

[93]

Tandy Trower. 1997. Creating Conversational Interfaces for Interactive Software Agents. In CHI ’97 Extended Abstracts on Human Factors in Computing Systems(CHI EA ’97). Association for Computing Machinery, New York, NY, USA, 198–199. https://doi.org/10.1145/1120212.1120341 event-place: Atlanta, Georgia.

Digital Library

[94]

Carla Tubin, João Pedro Mazuco Rodriguez, and Ana Carolina Bertoletti de Marchi. 2021. User experience with conversational agent: a systematic review of assessment methods. (Dec 2021). https://doi.org/10.6084/m9.figshare.17168875.v1

[95]

M. Vimalkumar, S.K. Sharma, J.B. Singh, and Y.K. Dwivedi. 2021. ‘Okay google, what about my privacy?’: User’s privacy perceptions and acceptance of voice based digital assistants. Computers in Human Behavior 120 (2021).

[96]

Z. Wei and J. A. Landay. 2018. Evaluating Speech-Based Smart Devices Using New Usability Heuristics. IEEE Pervasive Computing 17, 2 (2018), 84–96. www.scopus.com

Digital Library

[97]

J. Weizenbaum. 1966. ELIZA- A computer program for the study of natural language communication between men and machine. Commun. ACM 9 (1966), 36–45. https://doi.org/10.1145/365153.365168

Digital Library

[98]

Kathryn Whitenton. 2016. Voice Interaction UX: Brave New World...Same Old Story. https://www.nngroup.com/articles/voice-interaction-ux/

[99]

Y. Xu, S.M. Branham, X. Deng, P. Collins, and M. Warschauer. 2021. Are current voice interfaces designed to support children’s language development?. In Conference on Human Factors in Computing Systems - Proceedings. https://doi.org/10.1145/3411764.3445271

Digital Library

[100]

Y. Xu and M. Warschauer. 2020. A content analysis of voice-based apps on the market for early literacy development. In Proceedings of the Interaction Design and Children Conference, IDC 2020. 361–371. https://doi.org/10.1145/3392063.3394418

Digital Library

[101]

X. Yang and M. Aurisicchio. 2021. Designing conversational agents: A self-determination theory approach. In Conference on Human Factors in Computing Systems - Proceedings. https://doi.org/10.1145/3411764.3445445

Digital Library

[102]

Nicole Yankelovich, Gina-Anne Levow, and Matt Marx. 1995. Designing SpeechActs: Issues in Speech User Interfaces. In Proc. of CHI ’95. 369–376. https://doi.org/10.1145/223904.223952

Digital Library

[103]

G. Yeratziotis and D. Van Greunen. 2013. Making ICT accessible for the deaf. In 2013 IST-Africa Conference Exhibition. 1–9.

[104]

L. Zhou. 2007. Natural language interface for information management on mobile devices. Behaviour & Information Technology 26, 3 (2007), 197–207. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F621775007%3Faccountid%3D14771 ISBN: 0144-929X, 0144-929X.

Digital Library

Cited By

Abhari SMcMurray JRandhawa TBin Noon GHanjahanja-Phiri TMcNeil HManning FDebergue PTeague JPelegrini Morita P(2024)Exploring the Landscape of Standards and Guidelines in AgeTech Design and Development: Scoping Review and Thematic AnalysisJMIR Aging10.2196/581967(e58196)Online publication date: 31-Oct-2024
https://doi.org/10.2196/58196
Iñiguez-Carrillo AAréchiga DRangel-Romero MEspíndola-Barajas AFarzan RLópez CCardoso Llach DQuercia DMustafa MNiu SWong-Villacrés M(2024)Bridging Communication Gaps Using Augmented Reality: Designing a User Conversational Interface for Hearing Impaired StudentCompanion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing10.1145/3678884.3681905(553-557)Online publication date: 11-Nov-2024
https://dl.acm.org/doi/10.1145/3678884.3681905
Alghamdi EHalvey MNicol E(2024)System and User Strategies to Repair Conversational Breakdowns of Spoken Dialogue Systems: A Scoping ReviewProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665558(1-13)Online publication date: 8-Jul-2024
https://dl.acm.org/doi/10.1145/3640794.3665558
Show More Cited By

Index Terms

What’s The Talk on VUI Guidelines? A Meta-Analysis of Guidelines for Voice User Interface Design
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. HCI design and evaluation methods
  2. Interaction design
    1. Interaction design process and methods

Recommendations

“Voice-First Interfaces in a GUI-First Design World”: Barriers and Opportunities to Supporting VUI Designers On-the-Job
CUI '22: Proceedings of the 4th Conference on Conversational User Interfaces

Voice user interfaces (VUIs) are currently experiencing rapid growth as commercial devices like Google Home, Amazon Echo, and Apple Homepod are adopted by users. However, due to the pace of this growth, the tech industry has had to adapt quickly and ...
Finding a New Voice: Transitioning Designers from GUI to VUI Design
CUI '21: Proceedings of the 3rd Conference on Conversational User Interfaces

As Voice User Interfaces (VUIs) become widely popular, designers must handle new usability challenges. However, compared to other established domains such as Graphical User Interfaces (GUIs), VUI designers have fewer resources (training support, ...
Design guidelines for hands-free speech interaction
MobileHCI '18: Proceedings of the 20th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct

As research on speech interfaces continues to grow in the field of HCI, there is a need to develop design guidelines that help solve usability and learnability issues that exist in hands-free speech interfaces. While several sets of established ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

CUI '23: Proceedings of the 5th International Conference on Conversational User Interfaces

July 2023

504 pages

ISBN:9798400700149

DOI:10.1145/3571884

Editors:
Minha Lee
Eindhoven University of Technology, Netherlands
,
Cosmin Munteanu
University of Waterloo, Canada
,
Martin Porcheron
Bold Insight, UK
,
Johanne Trippas
RMIT University, Australia
,
Sarah Theres Völkel
Google, Germany

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGCHI: ACM Special Interest Group on Computer-Human Interaction

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

CUI '23

Sponsor:

SIGCHI

CUI '23: ACM conference on Conversational User Interfaces

July 19 - 21, 2023

Eindhoven, Netherlands

Acceptance Rates

Overall Acceptance Rate 34 of 100 submissions, 34%

Upcoming Conference

CUI '25

Sponsor:
sigchi

ACM Conversational User Interfaces 2025

July 7 - 9, 2025

Waterloo , ON , Canada

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
549
Total Downloads

Downloads (Last 12 months)376
Downloads (Last 6 weeks)49

Reflects downloads up to 20 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Abhari SMcMurray JRandhawa TBin Noon GHanjahanja-Phiri TMcNeil HManning FDebergue PTeague JPelegrini Morita P(2024)Exploring the Landscape of Standards and Guidelines in AgeTech Design and Development: Scoping Review and Thematic AnalysisJMIR Aging10.2196/581967(e58196)Online publication date: 31-Oct-2024
https://doi.org/10.2196/58196
Iñiguez-Carrillo AAréchiga DRangel-Romero MEspíndola-Barajas AFarzan RLópez CCardoso Llach DQuercia DMustafa MNiu SWong-Villacrés M(2024)Bridging Communication Gaps Using Augmented Reality: Designing a User Conversational Interface for Hearing Impaired StudentCompanion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing10.1145/3678884.3681905(553-557)Online publication date: 11-Nov-2024
https://dl.acm.org/doi/10.1145/3678884.3681905
Alghamdi EHalvey MNicol E(2024)System and User Strategies to Repair Conversational Breakdowns of Spoken Dialogue Systems: A Scoping ReviewProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665558(1-13)Online publication date: 8-Jul-2024
https://dl.acm.org/doi/10.1145/3640794.3665558
Alizadeh FTolmie PLee MWintersberger PPins DStevens G(2024)Voice Assistants' Accountability through Explanatory DialoguesProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665557(1-12)Online publication date: 8-Jul-2024
https://dl.acm.org/doi/10.1145/3640794.3665557
Murad CMunteanu CPenn G(2024)Conversational Voice Interfaces: Translating Research Into Actionable DesignExtended Abstracts of the CHI Conference on Human Factors in Computing Systems10.1145/3613905.3636277(1-3)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613905.3636277

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents