Qian et al., 2022 - Google Patents

Speaker front‐back disambiguity using multi‐channel speech signals

Qian et al., 2022

Document ID: 11607703097124362362
Author: Qian X; Yang J; Brutti A
Publication year: 2022
Publication venue: Electronics Letters

External Links

Cited by

Snippet

This paper tackles the front‐back disambiguity problem in speaker localization when the audio signals are captured by a symmetric microphone array. To this end, a deep neural network is proposed with an attention‐based mechanism designed to assign different …

Continue reading at ietresearch.onlinelibrary.wiley.com (PDF) (other versions)

230000004807 localization 0 abstract description 8

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/15—Aspects of sound capture and related signal processing for recording or reproduction
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification

Similar Documents

Publication	Publication Date	Title
Grumiaux et al.	2022	A survey of sound source localization with deep learning methods
US9008329B1 (en)	2015-04-14	Noise reduction using multi-feature cluster tracker
Zhang et al.	2022	Ambiear: mmwave based voice recognition in nlos scenarios
Yang et al.	2022	Deepear: Sound localization with binaural microphones
Jiang et al.	2020	Deep and CNN fusion method for binaural sound source localisation
Wang et al.	2022	Deep-learning-assisted sound source localization from a flying drone
Dai et al.	2021	Blind source separation‐based IVA‐Xception model for bird sound recognition in complex acoustic environments
Paikrao et al.	2023	Consumer Personalized Gesture Recognition in UAV Based Industry 5.0 Applications
Malek et al.	2020	Block‐online multi‐channel speech enhancement using deep neural network‐supported relative transfer function estimates
Liu et al.	2022	Head‐related transfer function–reserved time‐frequency masking for robust binaural sound source localization
Ding et al.	2020	Joint estimation of binaural distance and azimuth by exploiting deep neural networks
Liu et al.	2024	Wavoice: An mmWave-Assisted Noise-Resistant Speech Recognition System
Wu et al.	2021	Sound source localization based on multi-task learning and image translation network
Koizumi et al.	2017	Informative acoustic feature selection to maximize mutual information for collecting target sources
Qian et al.	2022	Speaker front‐back disambiguity using multi‐channel speech signals
US20230352040A1 (en)	2023-11-02	Audio source feature separation and target audio source generation
Jayaram et al.	2023	HRTF Estimation in the Wild
Liu et al.	2021	Binaural sound source localization based on weighted template matching
Deleforge et al.	2019	Audio-motor integration for robot audition
CN116868267A (en)	2023-10-10	Multi-channel speech compression system and method
Liu et al.	2021	Reverberation aware deep learning for environment tolerant microphone array DOA estimation
Wu et al.	2023	Multi-speaker DoA Estimation Using Audio and Visual Modality
Jahanirad et al.	2017	Blind source computer device identification from recorded VoIP calls for forensic investigation
Mathews	2021	Development and evaluation of spherical microphone array-enabled systems for immersive multi-user environments
Chen et al.	2024	Hearable devices with sound bubbles