[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Vitela et al., 2015 - Google Patents

Phoneme categorization relying solely on high-frequency energy

Vitela et al., 2015

View HTML
Document ID
15507933289488072602
Author
Vitela A
Monson B
Lotto A
Publication year
Publication venue
The Journal of the Acoustical Society of America

External Links

Snippet

Speech perception studies generally focus on the acoustic information present in the frequency regions below 6 kHz. Recent evidence suggests that there is perceptually relevant information in the higher frequencies, including information affecting speech …
Continue reading at pubs.aip.org (HTML) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0202Applications
    • G10L21/0205Enhancement of intelligibility of clean or coded speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/66Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/26Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

Similar Documents

Publication Publication Date Title
Vitela et al. Phoneme categorization relying solely on high-frequency energy
Stone et al. Notionally steady background noise acts primarily as a modulation masker of speech
Cooke et al. Spectral and temporal changes to speech produced in the presence of energetic and informational maskers
Hopkins et al. Effects of moderate cochlear hearing loss on the ability to benefit from temporal fine structure information in speech
Li et al. A psychoacoustic method for studying the necessary and sufficient perceptual cues of American English fricative consonants in noise
Healy et al. An algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker
Fogerty et al. The relative importance of consonant and vowel segments to the recognition of words and sentences: Effects of age and hearing loss
Souza et al. Measuring the acoustic effects of compression amplification on speech in noise
Chen et al. Predicting the intelligibility of vocoded and wideband Mandarin Chinese
Steinmetzger et al. The role of periodicity in perceiving speech in quiet and in background noise
Monson et al. Detection of high-frequency energy changes in sustained vowels produced by singers
Gnansia et al. Effects of spectral smearing and temporal fine structure degradation on speech masking release
Gonzalez et al. Gender and speaker identification as a function of the number of channels in spectrally reduced speech
Mackersie et al. Effects of fundamental frequency and vocal-tract length cues on sentence segregation by listeners with hearing loss
Lu et al. Speech production modifications produced in the presence of low-pass and high-pass filtered noise
Wang et al. Speech perception of noise with binary gains
Westermann et al. The influence of informational masking in reverberant, multi-talker environments
Mi et al. English vowel identification in long-term speech-shaped noise and multi-talker babble for English and Chinese listeners
Monson et al. The maximum audible low-pass cutoff frequency for speech
Bosker et al. Enhanced amplitude modulations contribute to the Lombard intelligibility benefit: Evidence from the Nijmegen Corpus of Lombard Speech
Cabrera et al. The role of spectro-temporal fine structure cues in lexical-tone discrimination for French and Mandarin listeners
Deroche et al. Roles of the target and masker fundamental frequencies in voice segregation
Monson et al. On the use of the TIMIT, QuickSIN, NU-6, and other widely used bandlimited speech materials for speech perception experiments
Bhargava et al. Effects of low-pass filtering on intelligibility of periodically interrupted speech
Bhattacharya et al. Combined spectral and temporal enhancement to improve cochlear-implant speech perception