Jin et al., 1995 - Google Patents

Output-based objective speech quality using vector quantization techniques

Jin et al., 1995

Document ID: 9913892813280301691
Author: Jin C; Kubichek R
Publication year: 1995
Publication venue: Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers

External Links

Cited by

Snippet

Output-based speech quality (OBQ) refers to an objective speech quality measure that uses only received speech without access to the input speech record. This paper proposes two new OBQ measures and evaluates their performance. Perceptual linear prediction (PLP) …

Continue reading at ieeexplore.ieee.org (other versions)

238000000034 method 0 title description 7

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates

Similar Documents

Publication	Publication Date	Title
CN108900725B (en)	2020-05-29	Voiceprint recognition method and device, terminal equipment and storage medium
CN1121681C (en)	2003-09-17	Speech processing
US6188981B1 (en)	2001-02-13	Method and apparatus for detecting voice activity in a speech signal
US4815134A (en)	1989-03-21	Very low rate speech encoder and decoder
EP1536414B1 (en)	2012-05-23	Method and apparatus for multi-sensory speech enhancement
US6477490B2 (en)	2002-11-05	Audio signal compression method, audio signal compression apparatus, speech signal compression method, speech signal compression apparatus, speech recognition method, and speech recognition apparatus
US9786300B2 (en)	2017-10-10	Single-sided speech quality measurement
Mermelstein	1979	Evaluation of a segmental SNR measure as an indicator of the quality of ADPCM coded speech
JP2002366174A (en)	2002-12-20	Method for covering g.729 annex b compliant voice activity detection circuit
Liang et al.	1994	Output-based objective speech quality
Picovici et al.	2003	Output-based objective speech quality measure using self-organizing map
Jin et al.	1995	Output-based objective speech quality using vector quantization techniques
Kubichek et al.	1992	Advances in objective voice quality assessment
US7013266B1 (en)	2006-03-14	Method for determining speech quality by comparison of signal properties
JP2953238B2 (en)	1999-09-27	Sound quality subjective evaluation prediction method
Lam et al.	1996	Objective speech quality measure for cellular phone
Kim	2004	A cue for objective speech quality estimation in temporal envelope representations
Li et al.	2003	Output-based objective speech quality measurement using continuous Hidden Markov Models
Dimolitsas	1993	Subjective assessment methods for the measurement of digital speech coder quality
Wang et al.	2014	Mapping methods for output-based objective speech quality assessment using data mining
Falk et al.	2006	Enhanced non-intrusive speech quality measurement using degradation models
KR100701253B1 (en)	2007-03-29	System and Methods of Speech Coding for Server?Based Speech Recognition in Mobile Communication Environments
Rahdari et al.	2014	An ensemble learning model for single-ended speech quality assessment using multiple-level signal decomposition method
Beritelli et al.	2000	A psychoacoustic auditory model to evaluate the performance of a voice activity detector
Audhkhasi et al.	2010	Two-scale auditory feature based non-intrusive speech quality evaluation