Calandruccio et al., 2017 - Google Patents
Effectiveness of two-talker maskers that differ in talker congruity and perceptual similarity to the target speechCalandruccio et al., 2017
View HTML- Document ID
- 1108824577981695900
- Author
- Calandruccio L
- Buss E
- Bowdrie K
- Publication year
- Publication venue
- Trends in Hearing
External Links
Snippet
Previous work has shown that masked-sentence recognition is particularly poor when the masker is composed of two competing talkers, a finding that is attributed to informational masking. Informational masking tends to be largest when the target and masker talkers are …
- 230000000873 masking 0 abstract description 79
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0202—Applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Holmes et al. | Familiar voices are more intelligible, even if they are not recognized as familiar | |
Johnsrude et al. | Swinging at a cocktail party: Voice familiarity aids speech perception in the presence of a competing voice | |
MacPherson et al. | Variations in the slope of the psychometric functions for speech intelligibility: A systematic survey | |
Lash et al. | Expectation and entropy in spoken word recognition: Effects of age and hearing acuity | |
Ozimek et al. | Polish sentence matrix test for speech intelligibility measurement in noise | |
Calandruccio et al. | Effectiveness of two-talker maskers that differ in talker congruity and perceptual similarity to the target speech | |
Koeritzer et al. | The impact of age, background noise, semantic ambiguity, and hearing loss on recognition memory for spoken sentences | |
Markham et al. | The effect of talker-and listener-related factors on intelligibility for a real-word, open-set perception test | |
Brunnegård et al. | Untrained listeners' ratings of speech disorders in a group with cleft palate: A comparison with speech and language pathologists ‚ratings | |
Trine et al. | Extended high frequencies provide both spectral and temporal information to improve speech-in-speech recognition | |
Liu et al. | Perception of a native vowel contrast by Dutch monolingual and bilingual infants: A bilingual perceptual lead | |
Hustad et al. | Variability and diagnostic accuracy of speech intelligibility scores in children | |
Aydelott et al. | Normal adult aging and the contextual influences affecting speech and meaningful sound perception | |
McCloy et al. | Talker versus dialect effects on speech intelligibility: A symmetrical study | |
Nuesse et al. | Measuring speech recognition with a matrix test using synthetic speech | |
Holmes et al. | How long does it take for a voice to become familiar? Speech intelligibility and voice recognition are differentially sensitive to voice training | |
Fontan et al. | Predicting speech perception in older listeners with sensorineural hearing loss using automatic speech recognition | |
Schlueter et al. | Normal and time-compressed speech: How does learning affect speech recognition thresholds in noise? | |
Rotman et al. | Rapid perceptual learning: A potential source of individual differences in speech perception under adverse conditions? | |
Morse-Fortier et al. | The effects of musical training on speech detection in the presence of informational and energetic masking | |
Potter et al. | Effect of vocal-pitch difference on automatic attention to voice changes in audio messages | |
Chen et al. | Masking effects in the perception of multiple simultaneous talkers in normal-hearing and cochlear implant listeners | |
Gordon-Salant et al. | Recognition of accented English in quiet by younger normal-hearing listeners and older listeners with normal-hearing and hearing loss | |
Hoyte et al. | Components of speech prosody and their use in detection of syntactic structure by older adults | |
Ibelings et al. | Speech recognition and listening effort of meaningful sentences using synthetic speech |