Fischer et al., 2018 - Google Patents

Robust constrained MFMVDR filtering for single-microphone speech enhancement

Document ID: 17026858458236549028
Author: Fischer D; Doclo S
Publication year: 2018
Publication venue: 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC)

Snippet

The multi-frame minimum variance distortionless response (MFMVDR) filter for single-microphone speech enhancement exploits speech correlation across consecutive time frames. This filter is designed to avoid speech distortion while minimizing the total signal …
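
For orientation, the standard multi-frame MVDR formulation from the literature can be sketched as below. The notation is an assumption made for illustration and may not match the paper's: $\mathbf{y}$ stacks the $N$ most recent noisy STFT coefficients of one frequency bin, $\boldsymbol{\Phi}_y = E\{\mathbf{y}\mathbf{y}^H\}$ is its correlation matrix, and $\boldsymbol{\gamma}_x$ is the normalized speech interframe correlation vector.

```latex
\begin{align}
\mathbf{w}_{\mathrm{MFMVDR}}
  &= \arg\min_{\mathbf{w}} \; \mathbf{w}^H \boldsymbol{\Phi}_y \mathbf{w}
     \quad \text{subject to} \quad \mathbf{w}^H \boldsymbol{\gamma}_x = 1 \\
  &= \frac{\boldsymbol{\Phi}_y^{-1} \boldsymbol{\gamma}_x}
          {\boldsymbol{\gamma}_x^H \boldsymbol{\Phi}_y^{-1} \boldsymbol{\gamma}_x}
\end{align}
```

In this textbook form the linear constraint is what makes the filter distortionless with respect to the correlated speech component; in practice $\boldsymbol{\gamma}_x$ has to be estimated, and the closed-form solution is known to be sensitive to errors in that estimate.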


Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

Similar Documents

Publication	Publication Date	Title
Braun et al.	2018	Evaluation and comparison of late reverberation power spectral density estimators
Togami et al.	2013	Optimized speech dereverberation from probabilistic perspective for time varying acoustic transfer function
Krueger et al.	2010	Speech enhancement with a GSC-like structure employing eigenvector-based transfer function ratios estimation
Wang et al.	2015	Noise power spectral density estimation using MaxNSR blocking matrix
Schasse et al.	2014	Estimation of subband speech correlations for noise reduction via MVDR processing
Kodrasi et al.	2018	Analysis of eigenvalue decomposition-based late reverberation power spectral density estimation
Tammen et al.	2021	Deep multi-frame MVDR filtering for single-microphone speech enhancement
Thuene et al.	2016	Maximum-likelihood approach to adaptive multichannel-Wiener postfiltering for wind-noise reduction
Tammen et al.	2020	DNN-based speech presence probability estimation for multi-frame single-microphone speech enhancement
Fischer et al.	2018	Robust constrained MFMVDR filtering for single-microphone speech enhancement
Nesta et al.	2013	Blind source extraction for robust speech recognition in multisource noisy environments
Habets et al.	2018	Dereverberation
Zhao et al.	2015	Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction
Hoang et al.	2021	Joint maximum likelihood estimation of power spectral densities and relative acoustic transfer functions for acoustic beamforming
Fischer et al.	2020	Subspace-based speech correlation vector estimation for single-microphone multi-frame MVDR filtering
Taseska et al.	2017	DOA-informed source extraction in the presence of competing talkers and background noise
Mirabilii et al.	2020	Spatial coherence-aware multi-channel wind noise reduction
Zhang et al.	2023	SDW-SWF: Speech Distortion Weighted Single-Channel Wiener Filter for Noise Reduction
Fischer et al.	2020	Robust constrained MFMVDR filters for single-channel speech enhancement based on spherical uncertainty set
Tammen et al.	2018	Complexity reduction of eigenvalue decomposition-based diffuse power spectral density estimators using the power method
Taghia et al.	2013	Dual-channel noise reduction based on a mixture of circular-symmetric complex Gaussians on unit hypersphere
Dietzen et al.	2021	Instantaneous PSD estimation for speech enhancement based on generalized principal components
Fischer et al.	2019	Comparison of parameter estimation methods for single-microphone multi-frame Wiener filtering
Fischer et al.	2016	Single-microphone speech enhancement using MVDR filtering and Wiener post-filtering
Fischer et al.	2018	Evaluation of Robust Constrained MFMVDR Filtering for Single-Channel Speech Enhancement