[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/302979.303160acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
Article
Free access

Patterns of entry and correction in large vocabulary continuous speech recognition systems

Published: 01 May 1999 Publication History

Abstract

A study was conducted to evaluate user performance and satisfaction in completion of a set of text creation tasks using three commercially available continuous speech recognition systems. The study also compared user performance on similar tasks using keyboard input. One part of the study (Initial Use) involved 24 users who enrolled, received training and carried out practice tasks, and then completed a set of transcription and composition tasks in a single session. In a parallel effort (Extended Use), four researchers used speech recognition to carry out real work tasks over 10 sessions with each of the three speech recognition software products. This paper presents results from the Initial Use phase of the study along with some preliminary results from the Extended Use phase. We present details of the kinds of usability and system design problems likely in current systems and several common patterns of error correction that we found.

References

[1]
Clark, H. H. & Brennan, S. E. (1991). Grounding in communication. In J. Levine, L. B. Resnick, and S. D. Behrand (Eds.), Shared Cognition: Thinking as Social Practice. APA Books, Washington.
[2]
Danis, C. & Karat, J. (1995). Technology-driven design of speech recognition systems, in G. Olson and S. Schuon (eds.) Symposium on designing interactive systems. ACM: New York, 17-24.
[3]
Gould, J. D., Conti, J., & Hovanyecz, T, (1983). Composing letters with a simulated listening typewriter. Communications of the ACM, 26, 4, 295- 308.
[4]
Karat, J. (1995). Scenario use in the design of a speech recognition system. In J. Carroll (ed.) Scenario-based design. New York: Wiley.
[5]
Kidd, A. (1994). The marks are on the knowledge worker, in Proceedings of CH1 '94 (Boston M.A, April 1994), ACM Press, 186-191.
[6]
Lai, J. & Vergo, J. (1997). MedSpeak: Report Creation with Continuous Speech Recognition, in Proceedings of CHI '97 (Atlanta GA, March 1997), ACM Press, 431 - 438.
[7]
Laurel, B. (1993). Computers as Theatre. Adison Wesley, New York.
[8]
Ogozalek, V.Z., & Praag, J.V. (1986). Comparison of elderly and younger users on keyboard and voice input computer-based composition tasks, in Proceedings of CH1 '86, ACM Press, 205-211.
[9]
Oviatt, S. (1995). Predicting spoken disfluencies during human-computer interaction. Computer Speech and Language, 9, 19-35.
[10]
Yankelovich, N., Levow, G. A., & Marx, M. (1995). Designing SpeechActs: Issues in speech user interfaces, in Proceedings of CHI ~95 (Denver CO, May 1995), ACM Press, 369-376.

Cited By

View all
  • (2024)Governing Open Vocabulary Data Leaks Using an Edge LLM through Programming by ExampleProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36997608:4(1-31)Online publication date: 21-Nov-2024
  • (2024)Towards Detecting and Mitigating Cognitive Bias in Spoken Conversational SearchAdjunct Proceedings of the 26th International Conference on Mobile Human-Computer Interaction10.1145/3640471.3680245(1-10)Online publication date: 21-Sep-2024
  • (2024)Rambler: Supporting Writing With Speech via LLM-Assisted Gist ManipulationProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642217(1-19)Online publication date: 11-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CHI '99: Proceedings of the SIGCHI conference on Human Factors in Computing Systems
May 1999
632 pages
ISBN:0201485591
DOI:10.1145/302979
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1999

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. analysis methods
  2. input techniques
  3. speech recognition
  4. speech user interfaces

Qualifiers

  • Article

Conference

CHI99
Sponsor:
CHI99: Conference on Human Factors in Computing Systems
May 15 - 20, 1999
Pennsylvania, Pittsburgh, USA

Acceptance Rates

CHI '99 Paper Acceptance Rate 78 of 312 submissions, 25%;
Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

Upcoming Conference

CHI 2025
ACM CHI Conference on Human Factors in Computing Systems
April 26 - May 1, 2025
Yokohama , Japan

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)198
  • Downloads (Last 6 weeks)32
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Governing Open Vocabulary Data Leaks Using an Edge LLM through Programming by ExampleProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36997608:4(1-31)Online publication date: 21-Nov-2024
  • (2024)Towards Detecting and Mitigating Cognitive Bias in Spoken Conversational SearchAdjunct Proceedings of the 26th International Conference on Mobile Human-Computer Interaction10.1145/3640471.3680245(1-10)Online publication date: 21-Sep-2024
  • (2024)Rambler: Supporting Writing With Speech via LLM-Assisted Gist ManipulationProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642217(1-19)Online publication date: 11-May-2024
  • (2024)Improving Error Correction and Text Editing Using Voice and Mouse Multimodal InterfaceInternational Journal of Human–Computer Interaction10.1080/10447318.2024.2352932(1-24)Online publication date: 22-May-2024
  • (2024)Voice user interfaces for effortless navigation in medical virtual reality environmentsComputers & Graphics10.1016/j.cag.2024.104069(104069)Online publication date: Sep-2024
  • (2023)YouTube et l’initiation politique des jeunes adultes. Quand les commentaires amorcent une pratique délibérativeRevue française des sciences de l’information et de la communication10.4000/rfsic.1403926Online publication date: 2023
  • (2023)Gist and Verbatim: Understanding Speech to Inform New Interfaces for Verbal Text CompositionProceedings of the 5th International Conference on Conversational User Interfaces10.1145/3571884.3597134(1-11)Online publication date: 19-Jul-2023
  • (2023)A Challenge for Bringing a BCI Closer to Motor Control: The “Interface Uncanny Valley” Hypothesis2023 IEEE Ural-Siberian Conference on Computational Technologies in Cognitive Science, Genomics and Biomedicine (CSGB)10.1109/CSGB60362.2023.10329830(242-247)Online publication date: 28-Sep-2023
  • (2023)‘I finally understand my mistakes’ – the benefits of screencast feedbackTechnology, Pedagogy and Education10.1080/1475939X.2023.225813433:1(43-55)Online publication date: 20-Sep-2023
  • (2022)Typist Experiment: an Investigation of Human-to-Human Dictation via Role-play to Inform Voice-based Text AuthoringProceedings of the ACM on Human-Computer Interaction10.1145/35557586:CSCW2(1-33)Online publication date: 11-Nov-2022
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media