Wegmann, 1994 - Google Patents
Final Technical Report (• 1• on Phase I SBIR Study on" Semi-Automated Speech Transcription Systems at Dragon Systems Co• Semi-Automated Speech Transcription …Wegmann, 1994
View PDF- Document ID
- 5016415517964384310
- Author
- Wegmann S
- Publication year
External Links
Snippet
This report describes preliminary explorations towards the design of a semi-automatic transcription system. Current transcription practices were studied and are described in this report. The promising results of several speech recognition experiments as well as a topic …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6505153B1 (en) | Efficient method for producing off-line closed captions | |
Hauptmann et al. | Informedia: News-on-demand multimedia information acquisition and retrieval | |
US6434520B1 (en) | System and method for indexing and querying audio archives | |
US20070118373A1 (en) | System and method for generating closed captions | |
JP2007519987A (en) | Integrated analysis system and method for internal and external audiovisual data | |
WO2001020596A1 (en) | Method and apparatus to determine and use audience affinity and aptitude | |
Haubold et al. | Alignment of speech to highly imperfect text transcriptions | |
Roy et al. | Speaker identification based text to audio alignment for an audio retrieval system | |
Wilcox et al. | Annotation and segmentation for multimedia indexing and retrieval | |
JP2004302175A (en) | System, method, and program for speech recognition | |
Jacobs et al. | Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili | |
Brown et al. | Video mail retrieval by voice: An overview of the Cambridge/Olivetti retrieval system | |
Munteanu et al. | Measuring the acceptable word error rate of machine-generated webcast transcripts | |
Wegmann | Final Technical Report (• 1• on Phase I SBIR Study on" Semi-Automated Speech Transcription Systems at Dragon Systems Co• Semi-Automated Speech Transcription System Study | |
Lim et al. | Developing an automatic speech recognizer for filipino with english code-switching in news broadcast | |
Nouza et al. | System for producing subtitles to internet audio-visual documents | |
Kubala et al. | Broadcast news transcription | |
Saz et al. | Background-tracking acoustic features for genre identification of broadcast shows | |
Hansen et al. | Audio stream phrase recognition for a national gallery of the spoken word:" one small step". | |
Álvarez et al. | APyCA: Towards the automatic subtitling of television content in Spanish | |
Jones et al. | Video mail retrieval using voice: an overview of the Stage 2 system | |
Nouza et al. | A system for information retrieval from large records of Czech spoken data | |
Chaloupka et al. | Optical character recognition for audio-visual broadcast transcription system | |
Rigoll | The ALERT system: Advanced broadcast speech recognition technology for selective dissemination of multimedia information | |
Teja et al. | A Novel Approach in the Automatic Generation of Regional Language Subtitles for Videos in English |