Hines et al., 2013 - Google Patents
Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQAHines et al., 2013
View PDF- Document ID
- 17020290135915797090
- Author
- Hines A
- Skoglund J
- Kokaram A
- Harte N
- Publication year
- Publication venue
- 2013 IEEE International Conference on Acoustics, Speech and Signal Processing
External Links
Snippet
The Virtual Speech Quality Objective Listener (ViSQOL) is a new objective speech quality model. It is a signal based full reference metric that uses a spectro-temporal measure of similarity between a reference and a test speech signal. ViSQOL aims to predict the overall …
- 230000015556 catabolic process 0 title abstract description 17
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/69—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/22—Supervisory, monitoring, management, i.e. operation, administration, maintenance or testing arrangements
- H04M3/2236—Quality of speech transmission monitoring
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Hines et al. | Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA | |
Hines et al. | ViSQOL: an objective speech quality model | |
Hines et al. | ViSQOL: The virtual speech quality objective listener | |
US9524733B2 (en) | Objective speech quality metric | |
JP5542206B2 (en) | Method and system for determining perceptual quality of an audio system | |
Rix | Perceptual speech quality assessment-a review | |
JP6522508B2 (en) | Method for evaluating intelligibility of degraded speech signal and device therefor | |
EP3120356B1 (en) | Method of and apparatus for evaluating quality of a degraded speech signal | |
CN104269180B (en) | A kind of quasi- clean speech building method for speech quality objective assessment | |
Kandadai et al. | Audio quality assessment using the mean structural similarity measure | |
EP2410517B1 (en) | Method and system for the integral and diagnostic assessment of listening speech quality | |
Ding et al. | Non-intrusive single-ended speech quality assessment in VoIP | |
Möller et al. | Speech quality prediction for artificial bandwidth extension algorithms. | |
Avila et al. | Intrusive quality measurement of noisy and enhanced speech based on i-vector similarity | |
EP2388779A1 (en) | Method for estimating speech quality | |
Mittag et al. | Single-ended packet loss rate estimation of transmitted speech signals | |
Möller et al. | Towards a universal scale for perceptual value | |
Voran | Measuring speech quality of system input while observing only system output | |
Wang et al. | Objective Intelligibility Assessment of Text-to-Speech System using Template Constrained Generalized Posterior Probability. | |
Hines et al. | Monitoring the effects of temporal clipping on voip speech quality | |
JP4116955B2 (en) | Voice quality objective evaluation apparatus and voice quality objective evaluation method | |
Zhang et al. | Performance analyze of QoE-based speech quality evaluation model | |
Sharma et al. | Short-time objective assessment of speech quality | |
Hines et al. | Monitoring voip speech quality for chopped and clipped speech | |
Ghimire | Speech intelligibility measurement on the basis of ITU-T Recommendation P. 863 |