Takagi et al., 2015 - Google Patents
Evaluation of real-time captioning by machine recognition with human supportTakagi et al., 2015
- Document ID
- 12909274583501498734
- Author
- Takagi H
- Itoh T
- Shinkawa K
- Publication year
- Publication venue
- Proceedings of the 12th International Web for All Conference
External Links
Snippet
Verbal meetings are important at work, but employees who are deaf or hard of hearing (DHH) find it difficult to participate. Manual real-time captioning is a solution, but professional stenographers are too expensive for routine use. We are exploring the possibilities of real …
- 238000011156 evaluation 0 title description 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/40—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/56—Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Braun | Technology and interpreting | |
US10678501B2 (en) | Context based identification of non-relevant verbal communications | |
Romero-Fresco | Subtitling through speech recognition: Respeaking | |
US11483273B2 (en) | Chat-based interaction with an in-meeting virtual assistant | |
Kawas et al. | Improving real-time captioning experiences for deaf and hard of hearing students | |
US11170782B2 (en) | Real-time audio transcription, video conferencing, and online collaboration system and methods | |
US8407049B2 (en) | Systems and methods for conversation enhancement | |
Kushalnagar et al. | Accessibility evaluation of classroom captions | |
Romero-Fresco | Respeaking: Subtitling through speech recognition | |
US20070100626A1 (en) | System and method for improving speaking ability | |
Bain et al. | Accessibility, transcription, and access everywhere | |
US10613825B2 (en) | Providing electronic text recommendations to a user based on what is discussed during a meeting | |
Wald | Crowdsourcing correction of speech recognition captioning errors | |
Seita et al. | Behavioral changes in speakers who are automatically captioned in meetings with deaf or hard-of-hearing peers | |
Chmiel et al. | Ear–voice span and pauses in intra-and interlingual respeaking: An exploratory study into temporal aspects of the respeaking process | |
US20220405492A1 (en) | Systems, methods, and apparatus for switching between and displaying translated text and transcribed text in the original spoken language | |
US20210264812A1 (en) | Language learning system and method | |
Hirvonen et al. | How are translations created? Using multimodal conversation analysis to study a team translation process | |
US20190121860A1 (en) | Conference And Call Center Speech To Text Machine Translation Engine | |
Takagi et al. | Evaluation of real-time captioning by machine recognition with human support | |
Matamala et al. | The Use of Respeaking for the Transcription of Non-Fictional Genres: An Exploratory Study. | |
Silber-Varod et al. | Positioning oneself in different roles: Structural and lexical measures of power relations between speakers in Map Task Corpus | |
Wald et al. | Enhancing the usability of real-time speech recognition captioning through personalised displays and real-time multiple speaker editing and annotation | |
Wald | Concurrent collaborative captioning | |
US10657202B2 (en) | Cognitive presentation system and method |