Hines et al., 2013 - Google Patents

Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA

Document ID
17020290135915797090
Author
Hines A
Skoglund J
Kokaram A
Harte N
Publication year
2013
Publication venue
2013 IEEE International Conference on Acoustics, Speech and Signal Processing

Snippet

The Virtual Speech Quality Objective Listener (ViSQOL) is a new objective speech quality model. It is a signal-based, full-reference metric that uses a spectro-temporal measure of similarity between a reference and a test speech signal. ViSQOL aims to predict the overall …
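
To make the idea of a full-reference, spectro-temporal comparison concrete, here is a minimal sketch in Python. It is not the ViSQOL algorithm, whose details are not given in this snippet. It assumes a plain short-time Fourier spectrogram and a per-frame cosine similarity, and the names `log_spectrogram` and `toy_similarity` are hypothetical, chosen only for illustration.

```python
import numpy as np

def log_spectrogram(signal, frame_len=256, hop=128):
    """Log-magnitude short-time spectrum, shape (frames, bins)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))

def toy_similarity(reference, test):
    """Mean per-frame cosine similarity of two equal-length signals.

    Illustrative stand-in only; NOT the similarity measure used by ViSQOL.
    """
    ref = log_spectrogram(reference)
    deg = log_spectrogram(test)
    num = np.sum(ref * deg, axis=1)
    den = np.linalg.norm(ref, axis=1) * np.linalg.norm(deg, axis=1) + 1e-12
    return float(np.mean(num / den))

# Synthetic stand-ins: a 440 Hz tone as the "reference", plus a noisy copy.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
reference = np.sin(2 * np.pi * 440 * t)
noisy = reference + 0.5 * rng.standard_normal(t.size)

score_clean = toy_similarity(reference, reference)
score_noisy = toy_similarity(reference, noisy)
print(score_clean, score_noisy)
```

The point of the sketch is only that a full-reference metric needs both the clean and the degraded signal, and that the score falls as the degraded spectrogram departs from the reference. Mapping such a score onto a perceptual quality scale is a separate step that this example does not attempt.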

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
    • G10L25/69 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/22 Supervisory, monitoring, management, i.e. operation, administration, maintenance or testing arrangements
    • H04M3/2236 Quality of speech transmission monitoring
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/90 Pitch determination of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/26 Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition

Similar Documents

Publication | Title
Hines et al. | Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA
Hines et al. | ViSQOL: an objective speech quality model
Hines et al. | ViSQOL: The virtual speech quality objective listener
US9524733B2 (en) | Objective speech quality metric
JP5542206B2 (en) | Method and system for determining perceptual quality of an audio system
Rix | Perceptual speech quality assessment-a review
JP6522508B2 (en) | Method for evaluating intelligibility of degraded speech signal and device therefor
EP3120356B1 (en) | Method of and apparatus for evaluating quality of a degraded speech signal
CN104269180B (en) | A kind of quasi- clean speech building method for speech quality objective assessment
Kandadai et al. | Audio quality assessment using the mean structural similarity measure
EP2410517B1 (en) | Method and system for the integral and diagnostic assessment of listening speech quality
Ding et al. | Non-intrusive single-ended speech quality assessment in VoIP
Möller et al. | Speech quality prediction for artificial bandwidth extension algorithms.
Avila et al. | Intrusive quality measurement of noisy and enhanced speech based on i-vector similarity
EP2388779A1 (en) | Method for estimating speech quality
Mittag et al. | Single-ended packet loss rate estimation of transmitted speech signals
Möller et al. | Towards a universal scale for perceptual value
Voran | Measuring speech quality of system input while observing only system output
Wang et al. | Objective Intelligibility Assessment of Text-to-Speech System using Template Constrained Generalized Posterior Probability.
Hines et al. | Monitoring the effects of temporal clipping on voip speech quality
JP4116955B2 (en) | Voice quality objective evaluation apparatus and voice quality objective evaluation method
Zhang et al. | Performance analyze of QoE-based speech quality evaluation model
Sharma et al. | Short-time objective assessment of speech quality
Hines et al. | Monitoring voip speech quality for chopped and clipped speech
Ghimire | Speech intelligibility measurement on the basis of ITU-T Recommendation P. 863