Liang et al., 1994 - Google Patents
Output-based objective speech qualityLiang et al., 1994
- Document ID
- 5371675932734049097
- Author
- Liang J
- Kubichek R
- Publication year
- Publication venue
- Proceedings of IEEE Vehicular Technology Conference (VTC)
External Links
Snippet
Objective speech quality measures automatically assess performance of communication systems without the need for human listeners. Typical objective quality methods are based on some distortion measure between the known input speech record and the received …
- 238000000034 method 0 abstract description 13
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/69—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Avila et al. | Non-intrusive speech quality assessment using neural networks | |
Liang et al. | Output-based objective speech quality | |
US6446038B1 (en) | Method and system for objectively evaluating speech | |
Kubichek | Mel-cepstral distance measure for objective speech quality assessment | |
Rix et al. | Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs | |
US6609092B1 (en) | Method and apparatus for estimating subjective audio signal quality from objective distortion measures | |
US6651041B1 (en) | Method for executing automatic evaluation of transmission quality of audio signals using source/received-signal spectral covariance | |
US9786300B2 (en) | Single-sided speech quality measurement | |
JPH10505718A (en) | Analysis of audio quality | |
Rix | Perceptual speech quality assessment-a review | |
Dubey et al. | Non-intrusive speech quality assessment using several combinations of auditory features | |
Dimolitsas | Objective speech distortion measures and their relevance to speech quality assessments | |
Kitawaki et al. | Quality assessment of speech coding and speech synthesis systems | |
Kondo et al. | Speech quality | |
Dubey et al. | Comparison of subjective and objective speech quality assessment for different degradation/noise conditions | |
Picovici et al. | Output-based objective speech quality measure using self-organizing map | |
Barnwell III | Objective measures for speech quality testing | |
Huber et al. | Single-ended speech quality prediction based on automatic speech recognition | |
Dimolitsas | Subjective assessment methods for the measurement of digital speech coder quality | |
Kim | A cue for objective speech quality estimation in temporal envelope representations | |
Li et al. | Output-based objective speech quality measurement using continuous Hidden Markov Models | |
Takahashi et al. | On non-reference speech intelligibility estimation using DNN noise reduction | |
Meky et al. | Prediction of speech quality using radial basis functions neural networks | |
Jin et al. | Output-based objective speech quality using vector quantization techniques | |
Mittag et al. | Single-ended packet loss rate estimation of transmitted speech signals |