Wegmann, 1994 - Google Patents

Final Technical Report on Phase I SBIR Study on "Semi-Automated Speech Transcription Systems" at Dragon Systems Co. Semi-Automated Speech Transcription …

Document ID: 5016415517964384310
Author: Wegmann S
Publication year: 1994

Snippet

This report describes preliminary explorations towards the design of a semi-automatic transcription system. Current transcription practices were studied and are described in this report. The promising results of several speech recognition experiments as well as a topic …

Continue reading at citeseerx.ist.psu.edu (PDF) (other versions)

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems

Similar Documents

Publication	Publication Date	Title
US6505153B1 (en)	2003-01-07	Efficient method for producing off-line closed captions
Hauptmann et al.	1997	Informedia: News-on-demand multimedia information acquisition and retrieval
US6434520B1 (en)	2002-08-13	System and method for indexing and querying audio archives
US20070118373A1 (en)	2007-05-24	System and method for generating closed captions
JP2007519987A (en)	2007-07-19	Integrated analysis system and method for internal and external audiovisual data
WO2001020596A1 (en)	2001-03-22	Method and apparatus to determine and use audience affinity and aptitude
Haubold et al.	2007	Alignment of speech to highly imperfect text transcriptions
Roy et al.	1997	Speaker identification based text to audio alignment for an audio retrieval system
Wilcox et al.	1998	Annotation and segmentation for multimedia indexing and retrieval
JP2004302175A (en)	2004-10-28	System, method, and program for speech recognition
Jacobs et al.	2023	Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili
Brown et al.	1994	Video mail retrieval by voice: An overview of the Cambridge/Olivetti retrieval system
Munteanu et al.	2006	Measuring the acceptable word error rate of machine-generated webcast transcripts
Wegmann	1994	Final Technical Report on Phase I SBIR Study on "Semi-Automated Speech Transcription Systems" at Dragon Systems Co. Semi-Automated Speech Transcription System Study
Lim et al.	2022	Developing an automatic speech recognizer for Filipino with English code-switching in news broadcast
Nouza et al.	2015	System for producing subtitles to internet audio-visual documents
Kubala et al.	1997	Broadcast news transcription
Saz et al.	2014	Background-tracking acoustic features for genre identification of broadcast shows
Hansen et al.	2000	Audio stream phrase recognition for a national gallery of the spoken word: "one small step"
Álvarez et al.	2010	APyCA: Towards the automatic subtitling of television content in Spanish
Jones et al.	1995	Video mail retrieval using voice: an overview of the Stage 2 system
Nouza et al.	2006	A system for information retrieval from large records of Czech spoken data
Chaloupka et al.	2020	Optical character recognition for audio-visual broadcast transcription system
Rigoll	2001	The ALERT system: Advanced broadcast speech recognition technology for selective dissemination of multimedia information
Teja et al.	2023	A Novel Approach in the Automatic Generation of Regional Language Subtitles for Videos in English