Jin et al., 1995 - Google Patents
Output-based objective speech quality using vector quantization techniquesJin et al., 1995
- Document ID
- 9913892813280301691
- Author
- Jin C
- Kubichek R
- Publication year
- Publication venue
- Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers
External Links
Snippet
Output-based speech quality (OBQ) refers to an objective speech quality measure that uses only received speech without access to the input speech record. This paper proposes two new OBQ measures and evaluates their performance. Perceptual linear prediction (PLP) …
- 238000000034 method 0 title description 7
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108900725B (en) | Voiceprint recognition method and device, terminal equipment and storage medium | |
CN1121681C (en) | Speech processing | |
US6188981B1 (en) | Method and apparatus for detecting voice activity in a speech signal | |
US4815134A (en) | Very low rate speech encoder and decoder | |
EP1536414B1 (en) | Method and apparatus for multi-sensory speech enhancement | |
US6477490B2 (en) | Audio signal compression method, audio signal compression apparatus, speech signal compression method, speech signal compression apparatus, speech recognition method, and speech recognition apparatus | |
US9786300B2 (en) | Single-sided speech quality measurement | |
Mermelstein | Evaluation of a segmental SNR measure as an indicator of the quality of ADPCM coded speech | |
JP2002366174A (en) | Method for covering g.729 annex b compliant voice activity detection circuit | |
Liang et al. | Output-based objective speech quality | |
Picovici et al. | Output-based objective speech quality measure using self-organizing map | |
Jin et al. | Output-based objective speech quality using vector quantization techniques | |
Kubichek et al. | Advances in objective voice quality assessment | |
US7013266B1 (en) | Method for determining speech quality by comparison of signal properties | |
JP2953238B2 (en) | Sound quality subjective evaluation prediction method | |
Lam et al. | Objective speech quality measure for cellular phone | |
Kim | A cue for objective speech quality estimation in temporal envelope representations | |
Li et al. | Output-based objective speech quality measurement using continuous Hidden Markov Models | |
Dimolitsas | Subjective assessment methods for the measurement of digital speech coder quality | |
Wang et al. | Mapping methods for output-based objective speech quality assessment using data mining | |
Falk et al. | Enhanced non-intrusive speech quality measurement using degradation models | |
KR100701253B1 (en) | System and Methods of Speech Coding for Server?Based Speech Recognition in Mobile Communication Environments | |
Rahdari et al. | An ensemble learning model for single-ended speech quality assessment using multiple-level signal decomposition method | |
Beritelli et al. | A psychoacoustic auditory model to evaluate the performance of a voice activity detector | |
Audhkhasi et al. | Two-scale auditory feature based non-intrusive speech quality evaluation |