Saleem et al., 2020 - Google Patents

Multi-scale decomposition based supervised single channel deep speech enhancement

Saleem et al., 2020

Document ID: 14185038128538454095
Author: Saleem N; Khattak M
Publication year: 2020
Publication venue: Applied Soft Computing

External Links

Cited by

Snippet

Speech signals reaching our ears are in general contaminated by the background noise distortion which is detrimental to both speech quality and intelligibility. In this paper, we propose a nonlinear multi-scale decomposition-based deep speech enhancement method …

Continue reading at www.sciencedirect.com (other versions)

238000000354 decomposition reaction 0 title abstract description 19

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0202—Applications
- G10L21/0205—Enhancement of intelligibility of clean or coded speech
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks

Similar Documents

Publication	Publication Date	Title
Saleem et al.	2020	Multi-scale decomposition based supervised single channel deep speech enhancement
Saraiva et al.	2021	Daily streamflow forecasting in Sobradinho Reservoir using machine learning models coupled with wavelet transform and bootstrapping
Balaji et al.	2020	Combining statistical models using modified spectral subtraction method for embedded system
Hu et al.	2020	DCCRN: Deep complex convolution recurrent network for phase-aware speech enhancement
US20220223144A1 (en)	2022-07-14	Method and apparatus for speech source separation based on a convolutional neural network
CN111564160B (en)	2022-10-18	Voice noise reduction method based on AEWGAN
CN107967920A (en)	2018-04-27	A kind of improved own coding neutral net voice enhancement algorithm
Kang et al.	2018	DNN-based monaural speech enhancement with temporal and spectral variations equalization
Ram et al.	2020	Speech enhancement through improvised conditional generative adversarial networks
Venkateswarlu et al.	2022	Speech intelligibility quality in telugu speech patterns using a wavelet-based hybrid threshold transform method
CN112331232A (en)	2021-02-05	A Speech Emotion Recognition Method Combining CGAN Spectrogram Denoising and Bilateral Filtering Spectrogram Enhancement
Luo et al.	2023	Delayless generative fixed-filter active noise control based on deep learning and bayesian filter
Taghia et al.	2016	A frequency-domain adaptive line enhancer with step-size control based on mutual information for harmonic noise reduction
Wang et al.	2019	Mask estimation incorporating phase-sensitive information for speech enhancement
Zhang et al.	2016	Pipelined set-membership approach to adaptive Volterra filtering
Soliman et al.	2017	Performance enhancement of speaker identification systems using speech encryption and cancelable features
Wang et al.	2011	Impulsive noise detection by double noise detector and removal using adaptive neural-fuzzy inference system
Zhang et al.	2024	Supervised attention multi-scale temporal convolutional network for monaural speech enhancement
Liu	2002	Representation of digital image by fuzzy neural network
Wei et al.	2014	Analysis and implementation of low‐power perceptual multiband noise reduction for the hearing aids application
Singh et al.	2017	A wavelet packet based approach for speech enhancement using modulation channel selection
Mousa et al.	2011	Adaptive noise cancellation algorithms sensitivity to parameters
Yüksel et al.	2010	Performance enhancement of image impulse noise filters by image rotation and fuzzy processing
Yu et al.	2019	Endangered Tujia language speech enhancement research based on improved DCGAN
Yüksel	2005	A simple neuro-fuzzy method for improving the performances of impulse noise filters for digital images