[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3571884.3597129acmconferencesArticle/Chapter ViewAbstractPublication PagescuiConference Proceedingsconference-collections
research-article

What’s The Talk on VUI Guidelines? A Meta-Analysis of Guidelines for Voice User Interface Design

Published: 19 July 2023 Publication History

Editorial Notes

The authors have requested minor, non-substantive changes to the VoR and, in accordance with ACM policies, a Corrected VoR was published on January 4, 2024. For reference purposes the VoR may still be accessed via the Supplemental Material section on this page.

Abstract

Over the past decade, voice user interface (VUI) design has been steadily growing, along with a growing VUI presence in consumer markets. However, there is currently a lack of widely-established guidelines for VUI design. While many sets of VUI guidelines have been proposed, they tend to be developed independently of each other, leading to a lack of consensus on appropriate guidelines for VUI design. This can hinder the wider adoption of practical VUI guidelines. To address this gap, we performed a large-scale meta-analysis of 336 VUI design guidelines that have been proposed in academic literature. Using thematic analysis, we present a unified and synthesized set of 14 guidelines, representing the most universally proposed principles of VUI design as captured by the 336 VUI guidelines identified in academic literature. We hope that this synthesized set can address several of the challenges to the adoption of VUI guidelines in design practice.

Supplemental Material

PDF File - 3597129-VoR
Version of Record for "What?s The Talk on VUI Guidelines? A Meta-Analysis of Guidelines for Voice User Interface Design" by Murad et al., Proceedings of the 5th International Conference on Conversational User Interfaces (CUI '23).

References

[1]
2022. Conversation Design. https://developers.google.com/assistant/conversation-design/welcome
[2]
2022. Get Started with the Guide | Alexa Design Guide. Amazon (Alexa). https://developer.amazon.com/en-US/docs/alexa/alexa-design/get-started.html
[3]
2022. Introduction - Siri - Human Interface Guidelines - Apple Developer. https://developer.apple.com/design/human-interface-guidelines/siri/overview/introduction/
[4]
Samer Al Moubayed, Gabriel Skantze, Jonas Beskow, Kalin Stefanov, and Joakim Gustafson. 2012. Multimodal Multiparty Social Interaction with the Furhat Head. In Proceedings of the 14th ACM International Conference on Multimodal Interaction(ICMI ’12). Association for Computing Machinery, New York, NY, USA, 293–294. https://doi.org/10.1145/2388676.2388736 event-place: Santa Monica, California, USA.
[5]
Ghazanfar Ali, Myungho Lee, and Jae-In Hwang. 2020. Automatic text-to-gesture rule generation for embodied conversational agents. COMPUTER ANIMATION AND VIRTUAL WORLDS 31, 4–5 (Jul 2020). https://doi.org/10.1002/cav.1944
[6]
M. Allison and L. M. Kendrick. 2013. Towards an expressive embodied conversational agent utilizing multi-ethnicity to augment solution focused therapy. In FLAIRS 2013 - Proceedings of the 26th International Florida Artificial Intelligence Research Society Conference. 332–337. www.scopus.com
[7]
Marco Almada and Juliano Maranhao. 2021. Voice-based diagnosis of covid-19: ethical and legal challenges. INTERNATIONAL DATA PRIVACY LAW 11, 1 (Feb 2021), 63–75. https://doi.org/10.1093/idpl/ipab004
[8]
Nuno Almeida, Samuel Silva, António Teixeira, Maksym Ketsmur, Diogo Guimarães, and Emanuel Fonseca. 2018. Multimodal Interaction for Accessible Smart Homes. In Proceedings of the 8th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-Exclusion(DSAI 2018). Association for Computing Machinery, New York, NY, USA, 63–70. https://doi.org/10.1145/3218585.3218595 event-place: Thessaloniki, Greece.
[9]
M. Anabuki, H. Kakuta, H. Yamamoto, and H. Tamura. 2000. Welbo: An embodied conversational agent living in mixed reality space. In Conference on Human Factors in Computing Systems - Proceedings. 10–11. www.scopus.com
[10]
Marco Avvenuti and Alessio Vecchio. 2009. Mobile Visual Access to Legacy Voice-Based Applications. In Proceedings of the 6th International Conference on Mobile Technology, Application & Systems(Mobility ’09). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/1710035.1710097 event-place: Nice, France.
[11]
Matthew P. Aylett, Per Ola Kristensson, Steve Whittaker, and Yolanda Vazquez-Alvarez. 2014. None of a CHInd. Proc. of CHI EA ’14 (2014), 749–760. https://doi.org/10.1145/2559206.2578868
[12]
Rajesh Balchandran, Mark E. Epstein, Gerasimos Potamianos, and Ladislav Seredi. 2008. A Multi-Modal Spoken Dialog System for Interactive TV. In Proceedings of the 10th International Conference on Multimodal Interfaces(ICMI ’08). Association for Computing Machinery, New York, NY, USA, 191–192. https://doi.org/10.1145/1452392.1452429 event-place: Chania, Crete, Greece.
[13]
N. O. Bernsen, H. Dybkjaer, and L. Dybkjaer. 1996. Principles for the design of cooperative spoken human-machine dialogue. In Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP ’96, Vol. 2. 729–732 vol.2. https://doi.org/10.1109/ICSLP.1996.607465
[14]
Niels O. Bernsen, Hans Dybkjær, and Laila Dybkjær. 1996. Cooperativity in human–machine and human–human spoken dialogue. Discourse Processes 21, 2 (March 1996), 213–236. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F618843297%3Faccountid%3D14771 ISBN: 0163-853X, 0163-853X.
[15]
Dan Bohus and Eric Horvitz. 2010. Facilitating Multiparty Dialog with Gaze, Gesture, and Speech. In International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction(ICMI-MLMI ’10). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/1891903.1891910 event-place: Beijing, China.
[16]
Jose A Borges, Israel Morales, and Nkstor J Rodriguez. 1996. Guidelines for Designing Usable World Wide Web Pages. Technical Report. http://delivery.acm.org/10.1145/260000/257320/p277-borges.pdf?ip=174.112.248.232&id=257320&acc=ACTIVE SERVICE&key=FD0067F557510FFB.148C9AE997532579.2370BB3FAC5962EF.4D4702B0C3E38B35&__acm__=1537318774_5ad8eb060e1eefa36cb28cf6616570b5
[17]
M. . Bourguet. 2006. Towards a taxonomy of error-handling strategies in recognition-based multi-modal human-computer interfaces. Signal Processing 86, 12 (2006), 3625–3643. www.scopus.com
[18]
Stacy M. Branham and Antony Rishin Mukkath Roy. 2019. Reading Between the Guidelines: How Commercial Voice Assistant Guidelines Hinder Accessibility for Blind Users. In Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility (Pittsburgh, PA, USA) (ASSETS ’19). Association for Computing Machinery, New York, NY, USA, 446–458. https://doi.org/10.1145/3308561.3353797
[19]
Virginia Braun and Victoria Clarke. 2006. Using thematic analysis in psychology. Qualitative Research in Psychology 3, 2 (Jan. 2006), 77–101. https://doi.org/10.1191/1478088706qp063oa
[20]
Robin N. Brewer, Leah Findlater, Joseph ’Jofish’ Kaye, Walter Lasecki, Cosmin Munteanu, and Astrid Weber. 2018. Accessible Voice Interfaces. In Companion of the 2018 ACM Conference on Computer Supported Cooperative Work and Social Computing(CSCW ’18). Association for Computing Machinery, New York, NY, USA, 441–446. https://doi.org/10.1145/3272973.3273006 event-place: Jersey City, NJ, USA.
[21]
Justine Cassell. 2000. Embodied conversational interface agents. Association for Computing Machinery.Communications of the ACM 43, 4 (2000), 70–78. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F237048747%3Faccountid%3D14771 ISBN: 00010782.
[22]
Leigh Clark, Benjamin R. Cowan, Abi Roper, Stephen Lindsay, and Owen Sheers. 2020. Speech Diversity and Speech Interfaces: Considering an Inclusive Future through Stammering. In Proceedings of the 2nd Conference on Conversational User Interfaces(CUI ’20). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3405755.3406139 event-place: Bilbao, Spain.
[23]
Leigh Clark, Philip Doyle, Diego Garaialde, Emer Gilmartin, Stephan Schlögl, Jens Edlund, Matthew Aylett, João Cabral, Cosmin Munteanu, Justin Edwards, and Benjamin R Cowan. 2019. The State of Speech in HCI: Trends, Themes and Challenges. Interacting with Computers 31, 4 (Dec. 2019), 349–371. https://doi.org/10.1093/iwc/iwz016
[24]
Leigh Clark, Nadia Pantidi, Orla Cooney, Philip Doyle, Diego Garaialde, Justin Edwards, Brendan Spillane, Emer Gilmartin, Christine Murad, Cosmin Munteanu, Vincent Wade, and Benjamin R. Cowan. 2019. What Makes a Good Conversation? Challenges in Designing Truly Conversational Agents. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems(CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–12. https://doi.org/10.1145/3290605.3300705 event-place: Glasgow, Scotland Uk.
[25]
Eric Corbett and Astrid Weber. 2016. What Can I Say? Addressing User Experience Challenges of a Mobile Voice User Interface for Accessibility. In Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services(MobileHCI ’16). Association for Computing Machinery, New York, NY, USA, 72–82. https://doi.org/10.1145/2935334.2935386 event-place: Florence, Italy.
[26]
Benjamin R Cowan, Nadia Pantidi, David Coyle, Kellie Morrissey, Peter Clarke, Sara Al-Shehri, David Earley, and Natasha Bandeira. 2017. "What Can I Help You With?": Infrequent Users’ Experiences of Intelligent Personal Assistants. In Proc. of MobileHCI ’17. 1–12. https://doi.org/10.1145/3098279.3098539
[27]
Colleen E Crangle, Lawrence M Fagan, Robert W Carlson, Mark S Erlbaum, David D Sherertz, and Mark S Tuttle. 1998. Collaborative conversational interfaces. International Journal of Speech Technology 2 (1998), 187–200. https://doi.org/10.1007/BF02111207
[28]
Andreea Danielescu. 2020. Eschewing Gender Stereotypes in Voice Assistants to Promote Inclusion. In Proceedings of the 2nd Conference on Conversational User Interfaces(CUI ’20). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3405755.3406151 event-place: Bilbao, Spain.
[29]
Alan Lopes de Sousa Freitas, Vinícius Paes de Camargo, Heloise Manica Paris Teixeira, Renato Balancieri, and Thelma Elita Colanzi. 2017. Gesture and Voice-Based Natural User Interface for Electronic Whiteboard System in a Medical Emergency Department. In Proceedings of the XVI Brazilian Symposium on Human Factors in Computing Systems(IHC 2017). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3160504.3160534 event-place: Joinville, Brazil.
[30]
Carlos Delgado Kloos, Carlos Alario-Hoyos, Pedro J. Munoz-Merino, Cristina Catalan Aguirre, and Nuria Gonzalez Castro. 2019. Principles for the Design of an Educational Voice Assistant for Learning Java. In SUSTAINABLE ICT, EDUCATION AND LEARNING(IFIP Advances in Information and Communication Technology, Vol. 564), Tatnall, A and Mavengere, N (Ed.). 99–106. https://doi.org/10.1007/978-3-030-28764-1_12 ISSN: 1868-4238.
[31]
Laila Dybkjær, Niels Ole Bernsen, and Hans Dybkjær. 1996. Grice Incorporated: Cooperativity in Spoken Dialogue. In Proceedings of the 16th Conference on Computational Linguistics - Volume 1(COLING ’96). Association for Computational Linguistics, USA, 328–333. https://doi.org/10.3115/992628.992686 event-place: Copenhagen, Denmark.
[32]
F. Ebbers, J. Zibuschka, C. Zimmermann, and O. Hinz. 2020. User preferences for privacy features in digital assistants. Electronic Markets (2020). https://doi.org/10.1007/s12525-020-00447-y
[33]
S. Estes, J. Helleberg, K. Long, M. Pollack, and M. Quezada. 2018. Guidelines for speech interactions between pilot and cognitive assistant. In 2018 Integrated Communications, Navigation, Surveillance Conference (ICNS). 3H2–1–3H2–10. https://doi.org/10.1109/ICNSURV.2018.8384875
[34]
Raymond Fok, Harmanpreet Kaur, Skanda Palani, Martez E. Mott, and Walter S. Lasecki. 2018. Towards More Robust Speech Interactions for Deaf and Hard of Hearing Users. In Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility(ASSETS ’18). Association for Computing Machinery, New York, NY, USA, 57–67. https://doi.org/10.1145/3234695.3236343 event-place: Galway, Ireland.
[35]
Natalie Friedman, Andrea Cuadra, Ruchi Patel, Shiri Azenkot, Joel Stein, and Wendy Ju. 2019. Voice Assistant Strategies and Opportunities for People with Tetraplegia. In The 21st International ACM SIGACCESS Conference on Computers and Accessibility(ASSETS ’19). Association for Computing Machinery, New York, NY, USA, 575–577. https://doi.org/10.1145/3308561.3354605 event-place: Pittsburgh, PA, USA.
[36]
Lokesh Fulfagar, Anupriya Gupta, Arpit Mathur, and Abhishek Shrivastava. 2021. Development and Evaluation of Usability Heuristics for Voice User Interfaces. In Design for Tomorrow—Volume 1(Smart Innovation, Systems and Technologies), Amaresh Chakrabarti, Ravi Poovaiah, Prasad Bokil, and Vivek Kant (Eds.). Springer, Singapore, 375–385. https://doi.org/10.1007/978-981-16-0041-8_32
[37]
Kotaro Funakoshi, Mikio Nakano, Kazuki Kobayashi, Takanori Komatsu, and Seiji Yamada. 2010. Non-Humanlike Spoken Dialogue: A Design Perspective. In Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue(SIGDIAL ’10). Association for Computational Linguistics, USA, 176–184. event-place: Tokyo, Japan.
[38]
M. Funk, C. Cunningham, D. Kanver, C. Saikalis, and R. Pansare. 2020. Usable and Acceptable Response Delays of Conversational Agents in Automotive User Interfaces. In Proceedings - 12th International ACM Conference on Automotive User Interfaces and Interactive Vehicular Applications, AutomotiveUI 2020. 262–269. https://doi.org/10.1145/3409120.3410651
[39]
Anushay Furqan, Chelsea Myers, and Jichen Zhu. 2017. Learnability through Adaptive Discovery Tools in Voice User Interfaces. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems(CHI EA ’17). Association for Computing Machinery, New York, NY, USA, 1617–1623. https://doi.org/10.1145/3027063.3053166 event-place: Denver, Colorado, USA.
[40]
Abraham Glasser, Vaishnavi Mande, and Matt Huenerfauth. 2020. Accessibility for Deaf and Hard of Hearing Users: Sign Language Conversational User Interfaces. In Proceedings of the 2nd Conference on Conversational User Interfaces(CUI ’20). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3405755.3406158 event-place: Bilbao, Spain.
[41]
S Gopalakrishnan and P Ganeshkumar. 2013. Systematic Reviews and Meta-analysis: Understanding the Best Evidence in Primary Healthcare. J Family Med Prim Care (2013). https://doi.org/10.4103/2249-4863.109934
[42]
Mardé Greeff, Louis Coetzee, and Martin Pistorius. 2008. Usability Evaluation of the South African National Accessibility Portal Interactive Voice Response System. In Proceedings of the 2008 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists on IT Research in Developing Countries: Riding the Wave of Technology(SAICSIT ’08). Association for Computing Machinery, New York, NY, USA, 76–85. https://doi.org/10.1145/1456659.1456669 event-place: Wilderness, South Africa.
[43]
Mohammad Hadian, Thamer Altuwaiyan, Xiaohui Liang, and Wei Li. 2017. Efficient and Privacy-Preserving Voice-Based Search over Mhealth Data. In Proceedings of the Second IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies(CHASE ’17). IEEE Press, 96–101. https://doi.org/10.1109/CHASE.2017.66 event-place: Philadelphia, Pennsylvania.
[44]
Aki Halonen, Sami Hyrynsalmi, Kai K. Kimppa, Timo Knuutila, Jouni Smed, and Harri Hakonen. 2012. Towards Usability Heuristics for Games Utilizing Speech Recognition. In 4TH ASIAN CONFERENCE ON IN℡LIGENT GAMES AND SIMULATION - 4TH ASIAN SIMULATION TECHNOLOGY CONFERENCE, Inaba, M and Hosoi, K and Thawonmas, R and Nakamura, A and Uemura, M (Ed.). 51–55.
[45]
X. Han and T. Yeh. 2020. How does your alexa behave?: Evaluating voice applications by design guidelines using an automatic voice crawler. In CEUR Workshop Proceedings, Vol. 2848.
[46]
Danula Hettiachchi, Zhanna Sarsenbayeva, Fraser Allison, Niels van Berkel, Tilman Dingler, Gabriele Marini, Vassilis Kostakos, and Jorge Goncalves. 2020. "Hi! I Am the Crowd Tasker" Crowdsourcing through Digital Voice Assistants. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems(CHI ’20). Association for Computing Machinery, New York, NY, USA, 1–14. https://doi.org/10.1145/3313831.3376320 event-place: Honolulu, HI, USA.
[47]
K. S. Hone and C. Baber. 2001. Designing habitable dialogues for speech-based interaction with computers. International Journal of Human-Computer Studies 54, 4 (2001), 637–662. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F619618966%3Faccountid%3D14771 ISBN: 1071-5819, 1071-5819.
[48]
Tamino Huxohl, Marian Pohling, Birte Carlmeyer, Britta Wrede, and Thomas Hermann. 2019. Interaction guidelines for personal voice assistants in smart homes. In 2019 10th international conference on speech technology and human-computer dialogue, SpeD 2019. 1–10. https://doi.org/10.1109/SPED.2019.8906642
[49]
Rodolfo Inostroza, Cristian Rusu, Silvana Roncagliolo, Cristhy Jimenez, and Virginica Rusu. 2012. Usability Heuristics for Touchscreen-based Mobile Devices. In 2012 Ninth International Conference on Information Technology - New Generations. IEEE, 662–667. https://doi.org/10.1109/ITNG.2012.134
[50]
Lopatovska Irene, Alice L. Griffin, Kelsey Gallagher, Ballingall Caitlin, Clair Rock, and Mildred Velazquez. 2020. User recommendations for intelligent personal assistants. Journal of Librarianship and Information Science 52, 2 (2020), 577–591. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F2389579821%3Faccountid%3D14771 ISBN: 0961-0006.
[51]
Ing-Marie Jonsson and Nils Dahlback. 2011. I Can’t Hear You? Drivers Interacting with Male or Female Voices in Native or Non-native Language. In UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION: CONTEXT DIVERSITY, PT 3(Lecture Notes in Computer Science, Vol. 6767), Stephanidis, C (Ed.). 298–305. ISSN: 0302-9743 Issue: 3.
[52]
C. A. Kamm and M. A. Walker. 1997. Design and evaluation of spoken dialog systems. In 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings. 11–18. https://doi.org/10.1109/ASRU.1997.658969
[53]
Junhan Kim, Yoojung Kim, Byungjoon Kim, Sukyung Yun, Minjoon Kim, and Joongseek Lee. 2018. Can a Machine Tend to Teenagers’ Emotional Needs? A Study with Conversational Agents. In Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems(CHI EA ’18). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3170427.3188548 event-place: Montreal QC, Canada.
[54]
Junhan Kim, Yoojung Kim, Byungjoon Kim, Sukyung Yun, Minjoon Kim, and Joongseek Lee. 2018. Can a Machine Tend to Teenagers’ Emotional Needs? A Study with Conversational Agents. In Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems(CHI EA ’18). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3170427.3188548 event-place: Montreal QC, Canada.
[55]
Sunyoung Kim. 2021. Exploring how older adults use a smart Speaker-Based voice assistant in their first interactions: Qualitative study. JMIR MHEALTH AND UHEALTH 9, 1 (Jan 2021). https://doi.org/10.2196/20427
[56]
Y. Kim, M. Reza, J. McGrenere, and D. Yoon. 2021. Designers characterize naturalness in voice user interfaces: Their goals, practices, and challenges. https://doi.org/10.1145/3411764.3445579
[57]
Raina Langevin, Ross J Lordon, Thi Avrahami, Benjamin R. Cowan, Tad Hirsch, and Gary Hsieh. 2021. Heuristic Evaluation of Conversational Agents. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 632, 15 pages. https://doi.org/10.1145/3411764.3445312
[58]
Martha Larson, Nelleke Oostdijk, and Frederik Zuiderveen Borgesius. 2021. Not directly stated, not explicitly stored: Conversational agents and the privacy threat of implicit information. In Adjunct proceedings of the 29th ACM conference on user modeling, adaptation and personalization(UMAP ’21). Association for Computing Machinery, New York, NY, USA, 388–391. https://doi.org/10.1145/3450614.3463601
[59]
Minha Lee and Sangsu Lee. 2021. “I Don’t Know Exactly but I Know a Little”: Exploring Better Responses of Conversational Agents with Insufficient Information. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI EA ’21). Association for Computing Machinery, New York, NY, USA, Article 427, 5 pages. https://doi.org/10.1145/3411763.3451812
[60]
Alessandro Liberati, Douglas G. Altman, Jennifer Tetzlaff, Cynthia Mulrow, Peter C. Gøtzsche, John P. A. Ioannidis, Mike Clarke, P. J. Devereaux, Jos Kleijnen, and David Moher. 2009. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. PLoS medicine 6, 7 (Jul 2009), e1000100. https://doi.org/10.1371/journal.pmed.1000100
[61]
Isabella Loddo and Dario Martini. 2017. The cocktail party effect. An inclusive vision of conversational interactions. The Design Journal 20 (2017), 4076. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F1936558654%3Faccountid%3D14771 ISBN: 14606925.
[62]
Ewa Luger and Abigail Sellen. 2016. "Like Having a Really Bad PA": The Gulf between User Expectation and Experience of Conversational Agents. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems - CHI ’16. 5286–5297. https://doi.org/10.1145/2858036.2858288
[63]
Oussama Metatla, Alison Oldfield, Taimur Ahmed, Antonis Vafeas, and Sunny Miglani. 2019. Voice User Interfaces in Schools: Co-Designing for Inclusion with Visually-Impaired and Sighted Pupils. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems(CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–15. https://doi.org/10.1145/3290605.3300608 event-place: Glasgow, Scotland Uk.
[64]
Aarthi Easwara Moorthy and Kim Phuong L. Vu. 2015. Privacy Concerns for Use of Voice Activated Personal Assistant in the Public Space. International Journal of Human-Computer Interaction 31, 4 (2015), 307–335. https://doi.org/10.1080/10447318.2014.986642
[65]
Cosmin Munteanu, Ben Cowan, Keisuke Nakamura, Pourang Irani, Sharon Oviatt, Matthew Aylett, Gerald Penn, Shimei Pan, Nikhil Sharma, Frank Rudzicz, and Randy Gomez. 2017. Designing Speech, Acoustic and Multimodal Interactions. In Proc. of CHI EA ’17. 601–608. https://doi.org/10.1145/3027063.3027086
[66]
Christine Murad and Cosmin Munteanu. 2019. "I Don’t Know What You’re Talking about, HALexa": The Case for Voice User Interface Guidelines. In Proceedings of the 1st International Conference on Conversational User Interfaces(CUI ’19). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3342775.3342795 event-place: Dublin, Ireland.
[67]
Christine Murad, Cosmin Munteanu, Leigh Clark, and Benjamin R. Cowan. 2018. Design guidelines for hands-free speech interaction. In Proc. of MobileHCI ’18. ACM Press, New York, New York, USA, 269–276. https://doi.org/10.1145/3236112.3236149
[68]
Christine Murad, Cosmin Munteanu, Benjamin R. Cowan, and Leigh Clark. 2019. Revolution or Evolution? Speech Interaction and HCI Design Guidelines. IEEE PERVASIVE COMPUTING 18, 2 (June 2019), 33–45. https://doi.org/10.1109/MPRV.2019.2906991
[69]
Christine Murad, Cosmin Munteanu, Benjamin R. Cowan, Leigh Clark, Martin Porcheron, Heloisa Candello, Stephan Schlögl, Matthew P. Aylett, Jaisie Sin, Robert J. Moore, Grace Hughes, and Andrew Ku. 2021. Let’s Talk About CUIs: Putting Conversational User Interface Design Into Practice. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI EA ’21). Association for Computing Machinery, New York, NY, USA, Article 98, 6 pages. https://doi.org/10.1145/3411763.3441336
[70]
Christine Murad, Cosmin Munteanu, Benjamin R. Cowan, and Leigh Clark. 2021. Finding a New Voice: Transitioning Designers from GUI to VUI Design. In CUI 2021 - 3rd Conference on Conversational User Interfaces(CUI ’21). Association for Computing Machinery, New York, NY, USA, 1–12. https://doi.org/10.1145/3469595.3469617
[71]
Christine Murad, Humaira Tasnim, and Cosmin Munteanu. 2022. “Voice-First Interfaces in a GUI-First Design World”: Barriers and Opportunities to Supporting VUI Designers On-the-Job. In Proceedings of the 4th Conference on Conversational User Interfaces (Glasgow, United Kingdom) (CUI ’22). Association for Computing Machinery, New York, NY, USA, Article 17, 10 pages. https://doi.org/10.1145/3543829.3543842
[72]
Chelsea M. Myers. 2019. Adaptive suggestions to increase learnability for voice user interfaces. In Proceedings of the 24th International Conference on Intelligent User Interfaces Companion - IUI ’19. ACM Press, New York, New York, USA, 159–160. https://doi.org/10.1145/3308557.3308727
[73]
Chelsea M. Myers, Anushay Furqan, and Jichen Zhu. 2019. The Impact of User Characteristics and Preferences on Performance with an Unfamiliar Voice User Interface. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems(CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–9. https://doi.org/10.1145/3290605.3300277 event-place: Glasgow, Scotland Uk.
[74]
T. J. Ndwe, M. E. Dlodlo, and D. J. Mashao. 2008. Usability engineering of an interactive voice response system in a diverse-cultured and multilingual setting. In Innovative Techniques in Instruction Technology, E-Learning, E-Assessment, and Education. 554–559. www.scopus.com
[75]
Jakob Nielsen. 1994. Enhancing the explanatory power of usability heuristics. Proc. of CHI ’94 (1994), 152–158. https://doi.org/10.1145/191666.191729
[76]
Donald Norman. 1988. The Design of Everyday Things. Doubled Currency (1988).
[77]
Matthew J. Page, Joanne E. McKenzie, Patrick M. Bossuyt, Isabelle Boutron, Tammy C. Hoffmann, Cynthia D. Mulrow, Larissa Shamseer, Jennifer M. Tetzlaff, Elie A. Akl, Sue E. Brennan, Roger Chou, Julie Glanville, Jeremy M. Grimshaw, Asbjørn Hróbjartsson, Manoj M. Lalu, Tianjing Li, Elizabeth W. Loder, Evan Mayo-Wilson, Steve McDonald, Luke A. McGuinness, Lesley A. Stewart, James Thomas, Andrea C. Tricco, Vivian A. Welch, Penny Whiting, and David Moher. 2021. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. 134 (Jun 2021), 178–189. https://doi.org/10.1016/j.jclinepi.2021.03.001
[78]
N. Patel, S. Agarwal, N. Rajput, A. Nanavati, P. Dave, and T. S. Parikh. 2008. Experiences designing a voice interface for rural India. In 2008 IEEE Spoken Language Technology Workshop. 21–24. https://doi.org/10.1109/SLT.2008.4777830
[79]
Cathy Pearl. 2016. Designing Voice User Interfaces: Principles of Conversational Experiences (1st ed.). O’Reilly Media, Inc.
[80]
David Pinelle, Nelson Wong, and Tadeusz Stach. 2008. Heuristic evaluation for games: usability principles for video game design. Proceedings of SIGCHI Conference on Human Factors in Computing Systems (2008), 1453–1462. https://doi.org/10.1145/1357054.1357282
[81]
Dominik Pins, Alexander Boden, Britta Essing, and Gunnar Stevens. 2020. "Miss Understandable": A Study on How Users Appropriate Voice Assistants and Deal with Misunderstandings. In Proceedings of Mensch Und Computer 2020 (Magdeburg, Germany) (MuC ’20). Association for Computing Machinery, New York, NY, USA, 349–359. https://doi.org/10.1145/3404983.3405511
[82]
V. Raveendran, M. R. Sanjeev, N. Paul, and Jijina K.P.2016. Speech only interface approach for personal computing environment. In 2016 IEEE International Conference on Engineering and Technology (ICETECH). 372–377. https://doi.org/10.1109/ICETECH.2016.7569279
[83]
Steven Ross, Elizabeth Brownholtz, and Robert Armes. 2004. Voice User Interface Principles for a Conversational Agent. In Proceedings of the 9th International Conference on Intelligent User Interfaces(IUI ’04). Association for Computing Machinery, New York, NY, USA, 364–365. https://doi.org/10.1145/964442.964536 event-place: Funchal, Madeira, Portugal.
[84]
V. F. M. Salvador and L. de Assis Moura. 2010. Heuristic evaluation for automatic radiology reporting transcription systems. In 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010). 292–295. https://doi.org/10.1109/ISSPA.2010.5605467
[85]
Robert M. Schumacher, Mary L. Hardzinski, and Amy L. Schwartz. 1995. Increasing the Usability of Interactive Voice Response Systems: Research and Guidelines for Phone-Based Interfaces. Human factors 37, 2 (June 1995), 251. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F1311858959%3Faccountid%3D14771 ISBN: 0018-7208.
[86]
J Sherwani, Dong Yu, and Tim Paek. 2007. Voicepedia: towards speech-based access to unstructured information.Interspeech (2007), 2–5. http://research.microsoft.com/pubs/78835/VoicePedia-Interspeech2007.pdf
[87]
J.Y. Shin and J. Huh-Yoo. 2020. Designing everyday conversational agents for managing health and wellness: A study of alexa skills reviews. In ACM International Conference Proceeding Series. 50–61. https://doi.org/10.1145/3421937.3422024
[88]
Ben Shneiderman. 2000. The limits of speech recognition. Commun. ACM 43, 9 (2000), 63–65. https://doi.org/10.1145/348941.348990
[89]
Shoupu Chen, Z. Kazi, M. Beitler, M. Salganicoff, D. Chester, and R. Foulds. 1996. Gesture-speech based HMI for a rehabilitation robot. In Proceedings of SOUTHEASTCON ’96. 29–36. https://doi.org/10.1109/SECON.1996.510021
[90]
Bernhard Suhm. 2003. Towards Best Practices for Speech User Interface Design. In Proc. of EuroSpeech ’03. 2217–2220.
[91]
Alistair Sutcliffe and Brian Gault. 2004. Heuristic evaluation of virtual reality applications. Interacting with Computers 16, 4 (2004), 831–849. https://doi.org/10.1016/j.intcom.2004.05.001
[92]
Vanessa Tobisch, Markus Funk, and Adam Emfield. 2020. Dealing with Input Uncertainty in Automotive Voice Assistants. In 12th International Conference on Automotive User Interfaces and Interactive Vehicular Applications (Virtual Event, DC, USA) (AutomotiveUI ’20). Association for Computing Machinery, New York, NY, USA, 161–168. https://doi.org/10.1145/3409120.3410660
[93]
Tandy Trower. 1997. Creating Conversational Interfaces for Interactive Software Agents. In CHI ’97 Extended Abstracts on Human Factors in Computing Systems(CHI EA ’97). Association for Computing Machinery, New York, NY, USA, 198–199. https://doi.org/10.1145/1120212.1120341 event-place: Atlanta, Georgia.
[94]
Carla Tubin, João Pedro Mazuco Rodriguez, and Ana Carolina Bertoletti de Marchi. 2021. User experience with conversational agent: a systematic review of assessment methods. (Dec 2021). https://doi.org/10.6084/m9.figshare.17168875.v1
[95]
M. Vimalkumar, S.K. Sharma, J.B. Singh, and Y.K. Dwivedi. 2021. ‘Okay google, what about my privacy?’: User’s privacy perceptions and acceptance of voice based digital assistants. Computers in Human Behavior 120 (2021).
[96]
Z. Wei and J. A. Landay. 2018. Evaluating Speech-Based Smart Devices Using New Usability Heuristics. IEEE Pervasive Computing 17, 2 (2018), 84–96. www.scopus.com
[97]
J. Weizenbaum. 1966. ELIZA- A computer program for the study of natural language communication between men and machine. Commun. ACM 9 (1966), 36–45. https://doi.org/10.1145/365153.365168
[98]
Kathryn Whitenton. 2016. Voice Interaction UX: Brave New World...Same Old Story. https://www.nngroup.com/articles/voice-interaction-ux/
[99]
Y. Xu, S.M. Branham, X. Deng, P. Collins, and M. Warschauer. 2021. Are current voice interfaces designed to support children’s language development?. In Conference on Human Factors in Computing Systems - Proceedings. https://doi.org/10.1145/3411764.3445271
[100]
Y. Xu and M. Warschauer. 2020. A content analysis of voice-based apps on the market for early literacy development. In Proceedings of the Interaction Design and Children Conference, IDC 2020. 361–371. https://doi.org/10.1145/3392063.3394418
[101]
X. Yang and M. Aurisicchio. 2021. Designing conversational agents: A self-determination theory approach. In Conference on Human Factors in Computing Systems - Proceedings. https://doi.org/10.1145/3411764.3445445
[102]
Nicole Yankelovich, Gina-Anne Levow, and Matt Marx. 1995. Designing SpeechActs: Issues in Speech User Interfaces. In Proc. of CHI ’95. 369–376. https://doi.org/10.1145/223904.223952
[103]
G. Yeratziotis and D. Van Greunen. 2013. Making ICT accessible for the deaf. In 2013 IST-Africa Conference Exhibition. 1–9.
[104]
L. Zhou. 2007. Natural language interface for information management on mobile devices. Behaviour & Information Technology 26, 3 (2007), 197–207. http://myaccess.library.utoronto.ca/login?qurl=https%3A%2F%2Fsearch.proquest.com%2Fdocview%2F621775007%3Faccountid%3D14771 ISBN: 0144-929X, 0144-929X.

Cited By

View all
  • (2024)Exploring the Landscape of Standards and Guidelines in AgeTech Design and Development: Scoping Review and Thematic AnalysisJMIR Aging10.2196/581967(e58196)Online publication date: 31-Oct-2024
  • (2024)Bridging Communication Gaps Using Augmented Reality: Designing a User Conversational Interface for Hearing Impaired StudentCompanion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing10.1145/3678884.3681905(553-557)Online publication date: 11-Nov-2024
  • (2024)System and User Strategies to Repair Conversational Breakdowns of Spoken Dialogue Systems: A Scoping ReviewProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665558(1-13)Online publication date: 8-Jul-2024
  • Show More Cited By

Index Terms

  1. What’s The Talk on VUI Guidelines? A Meta-Analysis of Guidelines for Voice User Interface Design

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CUI '23: Proceedings of the 5th International Conference on Conversational User Interfaces
      July 2023
      504 pages
      ISBN:9798400700149
      DOI:10.1145/3571884
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 19 July 2023

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Design
      2. Design guidelines
      3. Speech interfaces
      4. User experience design
      5. Voice user interfaces

      Qualifiers

      • Research-article
      • Research
      • Refereed limited

      Conference

      CUI '23
      Sponsor:
      CUI '23: ACM conference on Conversational User Interfaces
      July 19 - 21, 2023
      Eindhoven, Netherlands

      Acceptance Rates

      Overall Acceptance Rate 34 of 100 submissions, 34%

      Upcoming Conference

      CUI '25
      ACM Conversational User Interfaces 2025
      July 7 - 9, 2025
      Waterloo , ON , Canada

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)376
      • Downloads (Last 6 weeks)49
      Reflects downloads up to 20 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Exploring the Landscape of Standards and Guidelines in AgeTech Design and Development: Scoping Review and Thematic AnalysisJMIR Aging10.2196/581967(e58196)Online publication date: 31-Oct-2024
      • (2024)Bridging Communication Gaps Using Augmented Reality: Designing a User Conversational Interface for Hearing Impaired StudentCompanion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing10.1145/3678884.3681905(553-557)Online publication date: 11-Nov-2024
      • (2024)System and User Strategies to Repair Conversational Breakdowns of Spoken Dialogue Systems: A Scoping ReviewProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665558(1-13)Online publication date: 8-Jul-2024
      • (2024)Voice Assistants' Accountability through Explanatory DialoguesProceedings of the 6th ACM Conference on Conversational User Interfaces10.1145/3640794.3665557(1-12)Online publication date: 8-Jul-2024
      • (2024)Conversational Voice Interfaces: Translating Research Into Actionable DesignExtended Abstracts of the CHI Conference on Human Factors in Computing Systems10.1145/3613905.3636277(1-3)Online publication date: 11-May-2024

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media