Parchami et al., 2017
Model-based estimation of late reverberant spectral variance using modified weighted prediction error method
- Document ID
- 6510503863871114006
- Author
- Parchami M
- Zhu W
- Champagne B
- Publication year
- 2017
- Publication venue
- Speech Communication
Snippet
In this paper, we propose a new approach to estimate the late reverberant spectral variance (LRSV) for speech dereverberation in the short-time Fourier transform (STFT) domain. Our approach uses a model-based scheme involving the estimation of a smoothing (shape) …
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0635—Training updating or merging of old and new templates; Mean values; Weighting
- G10L2015/0636—Threshold criteria for the updating
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Parchami et al. | | Recent developments in speech enhancement in the short-time Fourier transform domain |
US7295972B2 (en) | | Method and apparatus for blind source separation using two sensors |
US11133019B2 (en) | | Signal processor and method for providing a processed audio signal reducing noise and reverberation |
US8849657B2 (en) | | Apparatus and method for isolating multi-channel sound source |
US10127919B2 (en) | | Determining noise and sound power level differences between primary and reference channels |
Braun et al. | | Online dereverberation for dynamic scenarios using a Kalman filter with an autoregressive model |
Cohen | | Speech enhancement using super-Gaussian speech models and noncausal a priori SNR estimation |
Zhang et al. | | A novel fast nonstationary noise tracking approach based on MMSE spectral power estimator |
Parchami et al. | | Speech dereverberation using weighted prediction error with correlated inter-frame speech components |
Li et al. | | Multichannel online dereverberation based on spectral magnitude inverse filtering |
Habets et al. | | Dereverberation |
Wang et al. | | Mask estimation incorporating phase-sensitive information for speech enhancement |
Martín-Doñas et al. | | Dual-channel DNN-based speech enhancement for smartphones |
Braun et al. | | Late reverberation PSD estimation for single-channel dereverberation using relative convolutive transfer functions |
US9875748B2 (en) | | Audio signal noise attenuation |
Sadjadi et al. | | Blind reverberation mitigation for robust speaker identification |
Parchami et al. | | Model-based estimation of late reverberant spectral variance using modified weighted prediction error method |
Sunnydayal et al. | | A survey on statistical based single channel speech enhancement techniques |
Ravi et al. | | A survey on speech enhancement methodologies |
Tashev et al. | | Unified framework for single channel speech enhancement |
KR20200095370A (en) | | Detection of fricatives in speech signals |
CN103187068B (en) | | Priori signal-to-noise ratio estimation method, device and noise inhibition method based on Kalman |
Parchami et al. | | Speech reverberation suppression for time-varying environments using weighted prediction error method with time-varying autoregressive model |
Jukić et al. | | Speech dereverberation with convolutive transfer function approximation using MAP and variational deconvolution approaches |
Aichner et al. | | Convolutive blind source separation for noisy mixtures |