Ru et al., 2003 - Google Patents
The synergy between speech production and perceptionRu et al., 2003
View PDF- Document ID
- 12501241264198966035
- Author
- Ru P
- Chi T
- Shamma S
- Publication year
- Publication venue
- The Journal of the Acoustical Society of America
External Links
Snippet
Speech intelligibility is known to be relatively unaffected by certain deformations of the acoustic spectrum. These include translations, stretching or contracting dilations, and shearing of the spectrum (represented along the logarithmic frequency axis). It is argued …
- 238000004519 manufacturing process 0 title abstract description 31
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Kuwabara et al. | Acoustic characteristics of speaker individuality: Control and conversion | |
Chi et al. | Multiresolution spectrotemporal analysis of complex sounds | |
Cooke et al. | The auditory organization of speech and other sources in listeners and computational models | |
Story | Phrase-level speech simulation with an airway modulation model of speech production | |
Childers et al. | Voice conversion | |
Pitton et al. | Time-frequency analysis and auditory modeling for automatic recognition of speech | |
US7376553B2 (en) | Fractal harmonic overtone mapping of speech and musical sounds | |
Mamun et al. | Prediction of speech intelligibility using a neurogram orthogonal polynomial measure (NOPM) | |
Stevens et al. | Some acoustical and perceptual correlates of nasal vowels | |
Ru et al. | The synergy between speech production and perception | |
Jeon et al. | Speech analysis in a model of the central auditory system | |
Assmann et al. | Modeling the perception of frequency-shifted vowels | |
Přibilová et al. | Non-linear frequency scale mapping for voice conversion in text-to-speech system with cepstral description | |
Hermansky et al. | The effective second formant F2'and the vocal tract front-cavity | |
Malathi et al. | Speech enhancement via smart larynx of variable frequency for laryngectomee patient for Tamil language syllables using RADWT algorithm | |
Rodriguez et al. | A fuzzy information space approach to speech signal non‐linear analysis | |
Exter et al. | DNN-Based Automatic Speech Recognition as a Model for Human Phoneme Perception. | |
Ruinskiy et al. | Stochastic models of pitch jitter and amplitude shimmer for voice modification | |
Bu et al. | Perceptual speech processing and phonetic feature mapping for robust vowel recognition | |
Kawahara | Exemplar-based voice quality analysis and control using a high quality auditory morphing procedure based on straight | |
Miller-Ockhuizen et al. | Acoustics of contrastive palatal affricates predict phonological patterning | |
Hagmüller | Speech enhancement for disordered and substitution voices | |
Teixeira et al. | A software tool to study Portuguese vowels | |
Ru | Perception-based multi-resolution auditory processing of acoustic signals | |
Elhilali | Neural basis and computational strategies for auditory processing |