[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Saleem et al., 2020 - Google Patents

Multi-scale decomposition based supervised single channel deep speech enhancement

Saleem et al., 2020

Document ID
14185038128538454095
Author
Saleem N
Khattak M
Publication year
Publication venue
Applied Soft Computing

External Links

Snippet

Speech signals reaching our ears are in general contaminated by the background noise distortion which is detrimental to both speech quality and intelligibility. In this paper, we propose a nonlinear multi-scale decomposition-based deep speech enhancement method …
Continue reading at www.sciencedirect.com (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
    • G10L25/09Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0202Applications
    • G10L21/0205Enhancement of intelligibility of clean or coded speech
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/06Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the analysis technique using neural networks

Similar Documents

Publication Publication Date Title
Saleem et al. Multi-scale decomposition based supervised single channel deep speech enhancement
Saraiva et al. Daily streamflow forecasting in Sobradinho Reservoir using machine learning models coupled with wavelet transform and bootstrapping
Balaji et al. Combining statistical models using modified spectral subtraction method for embedded system
Hu et al. DCCRN: Deep complex convolution recurrent network for phase-aware speech enhancement
US20220223144A1 (en) Method and apparatus for speech source separation based on a convolutional neural network
CN111564160B (en) Voice noise reduction method based on AEWGAN
CN107967920A (en) A kind of improved own coding neutral net voice enhancement algorithm
Kang et al. DNN-based monaural speech enhancement with temporal and spectral variations equalization
Ram et al. Speech enhancement through improvised conditional generative adversarial networks
Venkateswarlu et al. Speech intelligibility quality in telugu speech patterns using a wavelet-based hybrid threshold transform method
CN112331232A (en) A Speech Emotion Recognition Method Combining CGAN Spectrogram Denoising and Bilateral Filtering Spectrogram Enhancement
Luo et al. Delayless generative fixed-filter active noise control based on deep learning and bayesian filter
Taghia et al. A frequency-domain adaptive line enhancer with step-size control based on mutual information for harmonic noise reduction
Wang et al. Mask estimation incorporating phase-sensitive information for speech enhancement
Zhang et al. Pipelined set-membership approach to adaptive Volterra filtering
Soliman et al. Performance enhancement of speaker identification systems using speech encryption and cancelable features
Wang et al. Impulsive noise detection by double noise detector and removal using adaptive neural-fuzzy inference system
Zhang et al. Supervised attention multi-scale temporal convolutional network for monaural speech enhancement
Liu Representation of digital image by fuzzy neural network
Wei et al. Analysis and implementation of low‐power perceptual multiband noise reduction for the hearing aids application
Singh et al. A wavelet packet based approach for speech enhancement using modulation channel selection
Mousa et al. Adaptive noise cancellation algorithms sensitivity to parameters
Yüksel et al. Performance enhancement of image impulse noise filters by image rotation and fuzzy processing
Yu et al. Endangered Tujia language speech enhancement research based on improved DCGAN
Yüksel A simple neuro-fuzzy method for improving the performances of impulse noise filters for digital images