[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Ru et al., 2003 - Google Patents

The synergy between speech production and perception

Ru et al., 2003

View PDF
Document ID
12501241264198966035
Author
Ru P
Chi T
Shamma S
Publication year
Publication venue
The Journal of the Acoustical Society of America

External Links

Snippet

Speech intelligibility is known to be relatively unaffected by certain deformations of the acoustic spectrum. These include translations, stretching or contracting dilations, and shearing of the spectrum (represented along the logarithmic frequency axis). It is argued …
Continue reading at www.researchgate.net (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/66Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/26Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition

Similar Documents

Publication Publication Date Title
Kuwabara et al. Acoustic characteristics of speaker individuality: Control and conversion
Chi et al. Multiresolution spectrotemporal analysis of complex sounds
Cooke et al. The auditory organization of speech and other sources in listeners and computational models
Story Phrase-level speech simulation with an airway modulation model of speech production
Childers et al. Voice conversion
Pitton et al. Time-frequency analysis and auditory modeling for automatic recognition of speech
US7376553B2 (en) Fractal harmonic overtone mapping of speech and musical sounds
Mamun et al. Prediction of speech intelligibility using a neurogram orthogonal polynomial measure (NOPM)
Stevens et al. Some acoustical and perceptual correlates of nasal vowels
Ru et al. The synergy between speech production and perception
Jeon et al. Speech analysis in a model of the central auditory system
Assmann et al. Modeling the perception of frequency-shifted vowels
Přibilová et al. Non-linear frequency scale mapping for voice conversion in text-to-speech system with cepstral description
Hermansky et al. The effective second formant F2'and the vocal tract front-cavity
Malathi et al. Speech enhancement via smart larynx of variable frequency for laryngectomee patient for Tamil language syllables using RADWT algorithm
Rodriguez et al. A fuzzy information space approach to speech signal non‐linear analysis
Exter et al. DNN-Based Automatic Speech Recognition as a Model for Human Phoneme Perception.
Ruinskiy et al. Stochastic models of pitch jitter and amplitude shimmer for voice modification
Bu et al. Perceptual speech processing and phonetic feature mapping for robust vowel recognition
Kawahara Exemplar-based voice quality analysis and control using a high quality auditory morphing procedure based on straight
Miller-Ockhuizen et al. Acoustics of contrastive palatal affricates predict phonological patterning
Hagmüller Speech enhancement for disordered and substitution voices
Teixeira et al. A software tool to study Portuguese vowels
Ru Perception-based multi-resolution auditory processing of acoustic signals
Elhilali Neural basis and computational strategies for auditory processing