Vitela et al., 2015 - Google Patents

Phoneme categorization relying solely on high-frequency energy

Vitela et al., 2015

Document ID: 15507933289488072602
Author: Vitela A; Monson B; Lotto A
Publication year: 2015
Publication venue: The Journal of the Acoustical Society of America

External Links

Cited by

Snippet

Speech perception studies generally focus on the acoustic information present in the frequency regions below 6 kHz. Recent evidence suggests that there is perceptually relevant information in the higher frequencies, including information affecting speech …

Continue reading at pubs.aip.org (HTML) (other versions)

238000001914 filtration 0 abstract description 4

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0202—Applications
- G10L21/0205—Enhancement of intelligibility of clean or coded speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

Similar Documents

Publication	Publication Date	Title
Vitela et al.	2015	Phoneme categorization relying solely on high-frequency energy
Stone et al.	2012	Notionally steady background noise acts primarily as a modulation masker of speech
Cooke et al.	2010	Spectral and temporal changes to speech produced in the presence of energetic and informational maskers
Hopkins et al.	2008	Effects of moderate cochlear hearing loss on the ability to benefit from temporal fine structure information in speech
Li et al.	2012	A psychoacoustic method for studying the necessary and sufficient perceptual cues of American English fricative consonants in noise
Healy et al.	2017	An algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker
Fogerty et al.	2012	The relative importance of consonant and vowel segments to the recognition of words and sentences: Effects of age and hearing loss
Souza et al.	2006	Measuring the acoustic effects of compression amplification on speech in noise
Chen et al.	2011	Predicting the intelligibility of vocoded and wideband Mandarin Chinese
Steinmetzger et al.	2015	The role of periodicity in perceiving speech in quiet and in background noise
Monson et al.	2011	Detection of high-frequency energy changes in sustained vowels produced by singers
Gnansia et al.	2009	Effects of spectral smearing and temporal fine structure degradation on speech masking release
Gonzalez et al.	2005	Gender and speaker identification as a function of the number of channels in spectrally reduced speech
Mackersie et al.	2011	Effects of fundamental frequency and vocal-tract length cues on sentence segregation by listeners with hearing loss
Lu et al.	2009	Speech production modifications produced in the presence of low-pass and high-pass filtered noise
Wang et al.	2008	Speech perception of noise with binary gains
Westermann et al.	2015	The influence of informational masking in reverberant, multi-talker environments
Mi et al.	2013	English vowel identification in long-term speech-shaped noise and multi-talker babble for English and Chinese listeners
Monson et al.	2019	The maximum audible low-pass cutoff frequency for speech
Bosker et al.	2020	Enhanced amplitude modulations contribute to the Lombard intelligibility benefit: Evidence from the Nijmegen Corpus of Lombard Speech
Cabrera et al.	2014	The role of spectro-temporal fine structure cues in lexical-tone discrimination for French and Mandarin listeners
Deroche et al.	2014	Roles of the target and masker fundamental frequencies in voice segregation
Monson et al.	2022	On the use of the TIMIT, QuickSIN, NU-6, and other widely used bandlimited speech materials for speech perception experiments
Bhargava et al.	2012	Effects of low-pass filtering on intelligibility of periodically interrupted speech
Bhattacharya et al.	2011	Combined spectral and temporal enhancement to improve cochlear-implant speech perception