[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Takagi et al., 2015 - Google Patents

Evaluation of real-time captioning by machine recognition with human support

Takagi et al., 2015

Document ID
12909274583501498734
Author
Takagi H
Itoh T
Shinkawa K
Publication year
Publication venue
Proceedings of the 12th International Web for All Conference

External Links

Snippet

Verbal meetings are important at work, but employees who are deaf or hard of hearing (DHH) find it difficult to participate. Manual real-time captioning is a solution, but professional stenographers are too expensive for routine use. We are exploring the possibilities of real …
Continue reading at dl.acm.org (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/289Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems

Similar Documents

Publication Publication Date Title
Braun Technology and interpreting
US10678501B2 (en) Context based identification of non-relevant verbal communications
Romero-Fresco Subtitling through speech recognition: Respeaking
US11483273B2 (en) Chat-based interaction with an in-meeting virtual assistant
Kawas et al. Improving real-time captioning experiences for deaf and hard of hearing students
US11170782B2 (en) Real-time audio transcription, video conferencing, and online collaboration system and methods
US8407049B2 (en) Systems and methods for conversation enhancement
Kushalnagar et al. Accessibility evaluation of classroom captions
Romero-Fresco Respeaking: Subtitling through speech recognition
US20070100626A1 (en) System and method for improving speaking ability
Bain et al. Accessibility, transcription, and access everywhere
US10613825B2 (en) Providing electronic text recommendations to a user based on what is discussed during a meeting
Wald Crowdsourcing correction of speech recognition captioning errors
Seita et al. Behavioral changes in speakers who are automatically captioned in meetings with deaf or hard-of-hearing peers
Chmiel et al. Ear–voice span and pauses in intra-and interlingual respeaking: An exploratory study into temporal aspects of the respeaking process
US20220405492A1 (en) Systems, methods, and apparatus for switching between and displaying translated text and transcribed text in the original spoken language
US20210264812A1 (en) Language learning system and method
Hirvonen et al. How are translations created? Using multimodal conversation analysis to study a team translation process
US20190121860A1 (en) Conference And Call Center Speech To Text Machine Translation Engine
Takagi et al. Evaluation of real-time captioning by machine recognition with human support
Matamala et al. The Use of Respeaking for the Transcription of Non-Fictional Genres: An Exploratory Study.
Silber-Varod et al. Positioning oneself in different roles: Structural and lexical measures of power relations between speakers in Map Task Corpus
Wald et al. Enhancing the usability of real-time speech recognition captioning through personalised displays and real-time multiple speaker editing and annotation
Wald Concurrent collaborative captioning
US10657202B2 (en) Cognitive presentation system and method