Vitela et al., 2015 - Google Patents
Phoneme categorization relying solely on high-frequency energyVitela et al., 2015
View HTML- Document ID
- 15507933289488072602
- Author
- Vitela A
- Monson B
- Lotto A
- Publication year
- Publication venue
- The Journal of the Acoustical Society of America
External Links
Snippet
Speech perception studies generally focus on the acoustic information present in the frequency regions below 6 kHz. Recent evidence suggests that there is perceptually relevant information in the higher frequencies, including information affecting speech …
- 238000001914 filtration 0 abstract description 4
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0202—Applications
- G10L21/0205—Enhancement of intelligibility of clean or coded speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Vitela et al. | Phoneme categorization relying solely on high-frequency energy | |
Stone et al. | Notionally steady background noise acts primarily as a modulation masker of speech | |
Cooke et al. | Spectral and temporal changes to speech produced in the presence of energetic and informational maskers | |
Hopkins et al. | Effects of moderate cochlear hearing loss on the ability to benefit from temporal fine structure information in speech | |
Li et al. | A psychoacoustic method for studying the necessary and sufficient perceptual cues of American English fricative consonants in noise | |
Healy et al. | An algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker | |
Fogerty et al. | The relative importance of consonant and vowel segments to the recognition of words and sentences: Effects of age and hearing loss | |
Souza et al. | Measuring the acoustic effects of compression amplification on speech in noise | |
Chen et al. | Predicting the intelligibility of vocoded and wideband Mandarin Chinese | |
Steinmetzger et al. | The role of periodicity in perceiving speech in quiet and in background noise | |
Monson et al. | Detection of high-frequency energy changes in sustained vowels produced by singers | |
Gnansia et al. | Effects of spectral smearing and temporal fine structure degradation on speech masking release | |
Gonzalez et al. | Gender and speaker identification as a function of the number of channels in spectrally reduced speech | |
Mackersie et al. | Effects of fundamental frequency and vocal-tract length cues on sentence segregation by listeners with hearing loss | |
Lu et al. | Speech production modifications produced in the presence of low-pass and high-pass filtered noise | |
Wang et al. | Speech perception of noise with binary gains | |
Westermann et al. | The influence of informational masking in reverberant, multi-talker environments | |
Mi et al. | English vowel identification in long-term speech-shaped noise and multi-talker babble for English and Chinese listeners | |
Monson et al. | The maximum audible low-pass cutoff frequency for speech | |
Bosker et al. | Enhanced amplitude modulations contribute to the Lombard intelligibility benefit: Evidence from the Nijmegen Corpus of Lombard Speech | |
Cabrera et al. | The role of spectro-temporal fine structure cues in lexical-tone discrimination for French and Mandarin listeners | |
Deroche et al. | Roles of the target and masker fundamental frequencies in voice segregation | |
Monson et al. | On the use of the TIMIT, QuickSIN, NU-6, and other widely used bandlimited speech materials for speech perception experiments | |
Bhargava et al. | Effects of low-pass filtering on intelligibility of periodically interrupted speech | |
Bhattacharya et al. | Combined spectral and temporal enhancement to improve cochlear-implant speech perception |