Takagi et al., 2015 - Google Patents

Evaluation of real-time captioning by machine recognition with human support

Takagi et al., 2015

Document ID: 12909274583501498734
Author: Takagi H; Itoh T; Shinkawa K
Publication year: 2015
Publication venue: Proceedings of the 12th International Web for All Conference

External Links

Cited by

Snippet

Verbal meetings are important at work, but employees who are deaf or hard of hearing (DHH) find it difficult to participate. Manual real-time captioning is a solution, but professional stenographers are too expensive for routine use. We are exploring the possibilities of real …

Continue reading at dl.acm.org (other versions)

238000011156 evaluation 0 title description 2

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/40—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/56—Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems

Similar Documents

Publication	Publication Date	Title
Braun	2019	Technology and interpreting
US10678501B2 (en)	2020-06-09	Context based identification of non-relevant verbal communications
Romero-Fresco	2020	Subtitling through speech recognition: Respeaking
US11483273B2 (en)	2022-10-25	Chat-based interaction with an in-meeting virtual assistant
Kawas et al.	2016	Improving real-time captioning experiences for deaf and hard of hearing students
US11170782B2 (en)	2021-11-09	Real-time audio transcription, video conferencing, and online collaboration system and methods
US8407049B2 (en)	2013-03-26	Systems and methods for conversation enhancement
Kushalnagar et al.	2014	Accessibility evaluation of classroom captions
Romero-Fresco	2018	Respeaking: Subtitling through speech recognition
US20070100626A1 (en)	2007-05-03	System and method for improving speaking ability
Bain et al.	2005	Accessibility, transcription, and access everywhere
US10613825B2 (en)	2020-04-07	Providing electronic text recommendations to a user based on what is discussed during a meeting
Wald	2011	Crowdsourcing correction of speech recognition captioning errors
Seita et al.	2018	Behavioral changes in speakers who are automatically captioned in meetings with deaf or hard-of-hearing peers
Chmiel et al.	2017	Ear–voice span and pauses in intra-and interlingual respeaking: An exploratory study into temporal aspects of the respeaking process
US20220405492A1 (en)	2022-12-22	Systems, methods, and apparatus for switching between and displaying translated text and transcribed text in the original spoken language
US20210264812A1 (en)	2021-08-26	Language learning system and method
Hirvonen et al.	2018	How are translations created? Using multimodal conversation analysis to study a team translation process
US20190121860A1 (en)	2019-04-25	Conference And Call Center Speech To Text Machine Translation Engine
Takagi et al.	2015	Evaluation of real-time captioning by machine recognition with human support
Matamala et al.	2017	The Use of Respeaking for the Transcription of Non-Fictional Genres: An Exploratory Study.
Silber-Varod et al.	2020	Positioning oneself in different roles: Structural and lexical measures of power relations between speakers in Map Task Corpus
Wald et al.	2007	Enhancing the usability of real-time speech recognition captioning through personalised displays and real-time multiple speaker editing and annotation
Wald	2013	Concurrent collaborative captioning
US10657202B2 (en)	2020-05-19	Cognitive presentation system and method