Saleem et al., 2020 - Google Patents
Multi-scale decomposition based supervised single channel deep speech enhancementSaleem et al., 2020
- Document ID
- 14185038128538454095
- Author
- Saleem N
- Khattak M
- Publication year
- Publication venue
- Applied Soft Computing
External Links
Snippet
Speech signals reaching our ears are in general contaminated by the background noise distortion which is detrimental to both speech quality and intelligibility. In this paper, we propose a nonlinear multi-scale decomposition-based deep speech enhancement method …
- 238000000354 decomposition reaction 0 title abstract description 19
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0202—Applications
- G10L21/0205—Enhancement of intelligibility of clean or coded speech
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Saleem et al. | Multi-scale decomposition based supervised single channel deep speech enhancement | |
Saraiva et al. | Daily streamflow forecasting in Sobradinho Reservoir using machine learning models coupled with wavelet transform and bootstrapping | |
Balaji et al. | Combining statistical models using modified spectral subtraction method for embedded system | |
Hu et al. | DCCRN: Deep complex convolution recurrent network for phase-aware speech enhancement | |
US20220223144A1 (en) | Method and apparatus for speech source separation based on a convolutional neural network | |
CN111564160B (en) | Voice noise reduction method based on AEWGAN | |
CN107967920A (en) | A kind of improved own coding neutral net voice enhancement algorithm | |
Kang et al. | DNN-based monaural speech enhancement with temporal and spectral variations equalization | |
Ram et al. | Speech enhancement through improvised conditional generative adversarial networks | |
Venkateswarlu et al. | Speech intelligibility quality in telugu speech patterns using a wavelet-based hybrid threshold transform method | |
CN112331232A (en) | A Speech Emotion Recognition Method Combining CGAN Spectrogram Denoising and Bilateral Filtering Spectrogram Enhancement | |
Luo et al. | Delayless generative fixed-filter active noise control based on deep learning and bayesian filter | |
Taghia et al. | A frequency-domain adaptive line enhancer with step-size control based on mutual information for harmonic noise reduction | |
Wang et al. | Mask estimation incorporating phase-sensitive information for speech enhancement | |
Zhang et al. | Pipelined set-membership approach to adaptive Volterra filtering | |
Soliman et al. | Performance enhancement of speaker identification systems using speech encryption and cancelable features | |
Wang et al. | Impulsive noise detection by double noise detector and removal using adaptive neural-fuzzy inference system | |
Zhang et al. | Supervised attention multi-scale temporal convolutional network for monaural speech enhancement | |
Liu | Representation of digital image by fuzzy neural network | |
Wei et al. | Analysis and implementation of low‐power perceptual multiband noise reduction for the hearing aids application | |
Singh et al. | A wavelet packet based approach for speech enhancement using modulation channel selection | |
Mousa et al. | Adaptive noise cancellation algorithms sensitivity to parameters | |
Yüksel et al. | Performance enhancement of image impulse noise filters by image rotation and fuzzy processing | |
Yu et al. | Endangered Tujia language speech enhancement research based on improved DCGAN | |
Yüksel | A simple neuro-fuzzy method for improving the performances of impulse noise filters for digital images |