Ru et al., 2003 - Google Patents

The synergy between speech production and perception

Ru et al., 2003

Document ID: 12501241264198966035
Author: Ru P; Chi T; Shamma S
Publication year: 2003
Publication venue: The Journal of the Acoustical Society of America

External Links

Cited by

Snippet

Speech intelligibility is known to be relatively unaffected by certain deformations of the acoustic spectrum. These include translations, stretching or contracting dilations, and shearing of the spectrum (represented along the logarithmic frequency axis). It is argued …

Continue reading at www.researchgate.net (PDF) (other versions)

238000004519 manufacturing process 0 title abstract description 31

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition

Similar Documents

Publication	Publication Date	Title
Kuwabara et al.	1995	Acoustic characteristics of speaker individuality: Control and conversion
Chi et al.	2005	Multiresolution spectrotemporal analysis of complex sounds
Cooke et al.	2001	The auditory organization of speech and other sources in listeners and computational models
Story	2013	Phrase-level speech simulation with an airway modulation model of speech production
Childers et al.	1989	Voice conversion
Pitton et al.	1996	Time-frequency analysis and auditory modeling for automatic recognition of speech
US7376553B2 (en)	2008-05-20	Fractal harmonic overtone mapping of speech and musical sounds
Mamun et al.	2015	Prediction of speech intelligibility using a neurogram orthogonal polynomial measure (NOPM)
Stevens et al.	1987	Some acoustical and perceptual correlates of nasal vowels
Ru et al.	2003	The synergy between speech production and perception
Jeon et al.	2007	Speech analysis in a model of the central auditory system
Assmann et al.	2002	Modeling the perception of frequency-shifted vowels
Přibilová et al.	2006	Non-linear frequency scale mapping for voice conversion in text-to-speech system with cepstral description
Hermansky et al.	1989	The effective second formant F2'and the vocal tract front-cavity
Malathi et al.	2019	Speech enhancement via smart larynx of variable frequency for laryngectomee patient for Tamil language syllables using RADWT algorithm
Rodriguez et al.	2000	A fuzzy information space approach to speech signal non‐linear analysis
Exter et al.	2016	DNN-Based Automatic Speech Recognition as a Model for Human Phoneme Perception.
Ruinskiy et al.	2008	Stochastic models of pitch jitter and amplitude shimmer for voice modification
Bu et al.	2000	Perceptual speech processing and phonetic feature mapping for robust vowel recognition
Kawahara	2003	Exemplar-based voice quality analysis and control using a high quality auditory morphing procedure based on straight
Miller-Ockhuizen et al.	2003	Acoustics of contrastive palatal affricates predict phonological patterning
Hagmüller	2009	Speech enhancement for disordered and substitution voices
Teixeira et al.	1997	A software tool to study Portuguese vowels
Ru	2000	Perception-based multi-resolution auditory processing of acoustic signals
Elhilali	2004	Neural basis and computational strategies for auditory processing