Ceolini et al., 2018 - Google Patents

Speaker Activity Detection and Minimum Variance Beamforming for Source Separation.

Document ID: 15978486240747240117
Author: Ceolini E; Anumula J; Huber A; Kiselev I; Liu S
Publication year: 2018
Publication venue: Interspeech

Snippet

This work proposes a framework that renders minimum variance beamforming blind, allowing for source separation in real-world environments with an ad hoc multi-microphone setup, using no assumptions other than knowing the number of speakers. The framework …
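
As a rough illustration of the idea named in the snippet, the sketch below uses a speaker activity mask to pick the frames from which spatial covariances are estimated, then computes minimum variance (MVDR) weights for a single frequency bin. It is not the authors' implementation: the function names, the eigenvector-based steering estimate, the diagonal loading, and the synthetic data are all assumptions made for this example, and the activity mask is simply given rather than detected.

```python
import numpy as np


def masked_covariance(frames, active):
    """Spatial covariance averaged over the frames where `active` is True.

    frames: (n_frames, n_mics) complex STFT values for one frequency bin.
    active: (n_frames,) boolean mask (assumed to come from an activity detector).
    """
    sel = frames[active]
    # Sum of x x^H over the selected frames, divided by their count.
    return sel.T @ sel.conj() / max(len(sel), 1)


def estimate_steering(target_cov, noise_cov):
    """Hypothetical steering estimate: principal eigenvector of the
    covariance difference, scaled so that microphone 0 is the reference."""
    _, vecs = np.linalg.eigh(target_cov - noise_cov)
    d = vecs[:, -1]  # eigh sorts eigenvalues in ascending order
    return d / d[0]


def mvdr_weights(noise_cov, steering, diag_load=1e-6):
    """MVDR weights w = R^-1 d / (d^H R^-1 d) for one frequency bin."""
    n_mics = noise_cov.shape[0]
    # Diagonal loading (an assumption of this sketch) keeps the solve stable.
    load = diag_load * np.trace(noise_cov).real / n_mics
    r_inv_d = np.linalg.solve(noise_cov + load * np.eye(n_mics), steering)
    return r_inv_d / (steering.conj() @ r_inv_d)


# Synthetic single-bin example: noise only in the first half of the frames,
# target speaker plus noise in the second half.
rng = np.random.default_rng(0)
n_mics, n_frames = 4, 400
d_true = np.exp(-1j * rng.uniform(0, 2 * np.pi, n_mics))
d_true = d_true / d_true[0]
noise = (rng.standard_normal((n_frames, n_mics))
         + 1j * rng.standard_normal((n_frames, n_mics))) / np.sqrt(2)
speech = rng.standard_normal(n_frames) + 1j * rng.standard_normal(n_frames)
active = np.arange(n_frames) >= n_frames // 2
frames = noise + active[:, None] * speech[:, None] * d_true[None, :]

noise_cov = masked_covariance(frames, ~active)
target_cov = masked_covariance(frames, active)
d_hat = estimate_steering(target_cov, noise_cov)
w = mvdr_weights(noise_cov, d_hat)
output = frames @ w.conj()  # y = w^H x for every frame

print("distortionless response w^H d:", np.vdot(w, d_hat))
print("noise power at reference mic:", noise_cov[0, 0].real)
print("noise power after beamforming:", np.vdot(w, noise_cov @ w).real)
```

The sketch handles one target and one frequency bin; a multi-speaker setting would repeat the covariance and weight computation per speaker and per bin, using one activity mask for each speaker.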

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/40—Arrangements for obtaining a desired directivity characteristic
- H04R25/407—Circuits for combining signals of a plurality of transducers
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/43—Signal processing in hearing aids to enhance the speech intelligibility
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/004—Monitoring arrangements; Testing arrangements for microphones
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2430/00—Signal processing covered by H04R, not provided for in its groups
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

Similar Documents

Publication	Publication Date	Title
Braun et al.	2018	Evaluation and comparison of late reverberation power spectral density estimators
Cauchi et al.	2015	Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech
US9100734B2 (en)	2015-08-04	Systems, methods, apparatus, and computer-readable media for far-field multi-source tracking and separation
Zohourian et al.	2017	Binaural speaker localization integrated into an adaptive beamformer for hearing aids
Taseska et al.	2014	Informed spatial filtering for sound extraction using distributed microphone arrays
WO2014032738A1 (en)	2014-03-06	Apparatus and method for providing an informed multichannel speech presence probability estimation
Taherian et al.	2019	Deep learning based multi-channel speaker recognition in noisy and reverberant environments
Zohourian et al.	2016	Binaural speaker localization and separation based on a joint ITD/ILD model and head movement tracking
DEREVERBERATION et al.	2014	REVERB Workshop 2014
Schwartz et al.	2016	Joint maximum likelihood estimation of late reverberant and speech power spectral density in noisy environments
Jarrett et al.	2014	Noise reduction in the spherical harmonic domain using a tradeoff beamformer and narrowband DOA estimates
Jin et al.	2017	Multi-channel noise reduction for hands-free voice communication on mobile phones
Marquardt et al.	2017	Noise power spectral density estimation for binaural noise reduction exploiting direction of arrival estimates
Hoang et al.	2021	Joint maximum likelihood estimation of power spectral densities and relative acoustic transfer functions for acoustic beamforming
Kovalyov et al.	2023	Dsenet: Directional signal extraction network for hearing improvement on edge devices
Gößling et al.	2019	RTF-steered binaural MVDR beamforming incorporating multiple external microphones
Tammen et al.	2019	Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares
Zohourian et al.	2018	GSC-based binaural speaker separation preserving spatial cues
Ceolini et al.	2018	Speaker Activity Detection and Minimum Variance Beamforming for Source Separation.
Aroudi et al.	2020	Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding
Hadad et al.	2017	Comparison of two binaural beamforming approaches for hearing aids
Pfeifenberger et al.	2014	Blind source extraction based on a direction-dependent a-priori SNR
Ji et al.	2017	Coherence-based dual-channel noise reduction algorithm in a complex noisy environment
Zheng et al.	2016	Statistical analysis and improvement of coherent-to-diffuse power ratio estimators for dereverberation
Hammer et al.	2020	FCN approach for dynamically locating multiple speakers