Hines et al., 2013 - Google Patents

Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA

Hines et al., 2013

Document ID: 17020290135915797090
Author: Hines A; Skoglund J; Kokaram A; Harte N
Publication year: 2013
Publication venue: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing

External Links

Cited by

Snippet

The Virtual Speech Quality Objective Listener (ViSQOL) is a new objective speech quality model. It is a signal based full reference metric that uses a spectro-temporal measure of similarity between a reference and a test speech signal. ViSQOL aims to predict the overall …

Continue reading at arrow.tudublin.ie (PDF) (other versions)

230000015556 catabolic process 0 title abstract description 17

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/69—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/22—Supervisory, monitoring, management, i.e. operation, administration, maintenance or testing arrangements
- H04M3/2236—Quality of speech transmission monitoring
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition

Similar Documents

Publication	Publication Date	Title
Hines et al.	2013	Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA
Hines et al.	2015	ViSQOL: an objective speech quality model
Hines et al.	2012	ViSQOL: The virtual speech quality objective listener
US9524733B2 (en)	2016-12-20	Objective speech quality metric
JP5542206B2 (en)	2014-07-09	Method and system for determining perceptual quality of an audio system
Rix	2004	Perceptual speech quality assessment-a review
JP6522508B2 (en)	2019-05-29	Method for evaluating intelligibility of degraded speech signal and device therefor
EP3120356B1 (en)	2018-05-02	Method of and apparatus for evaluating quality of a degraded speech signal
CN104269180B (en)	2018-04-13	A kind of quasi- clean speech building method for speech quality objective assessment
Kandadai et al.	2008	Audio quality assessment using the mean structural similarity measure
EP2410517B1 (en)	2017-02-22	Method and system for the integral and diagnostic assessment of listening speech quality
Ding et al.	2007	Non-intrusive single-ended speech quality assessment in VoIP
Möller et al.	2013	Speech quality prediction for artificial bandwidth extension algorithms.
Avila et al.	2019	Intrusive quality measurement of noisy and enhanced speech based on i-vector similarity
EP2388779A1 (en)	2011-11-23	Method for estimating speech quality
Mittag et al.	2018	Single-ended packet loss rate estimation of transmitted speech signals
Möller et al.	2010	Towards a universal scale for perceptual value
Voran	2021	Measuring speech quality of system input while observing only system output
Wang et al.	2012	Objective Intelligibility Assessment of Text-to-Speech System using Template Constrained Generalized Posterior Probability.
Hines et al.	2013	Monitoring the effects of temporal clipping on voip speech quality
JP4116955B2 (en)	2008-07-09	Voice quality objective evaluation apparatus and voice quality objective evaluation method
Zhang et al.	2014	Performance analyze of QoE-based speech quality evaluation model
Sharma et al.	2011	Short-time objective assessment of speech quality
Hines et al.	2016	Monitoring voip speech quality for chopped and clipped speech
Ghimire	2012	Speech intelligibility measurement on the basis of ITU-T Recommendation P. 863