Bai et al., 2020 - Google Patents
Audio enhancement and intelligent classification of household sound events using a sparsely deployed array
- Document ID
- 8454485220942241872
- Author
- Bai M
- Lan S
- Huang J
- Hsu Y
- So H
- Publication year
- Publication venue
- The Journal of the Acoustical Society of America
Snippet
A household sound event classification system consisting of an audio localization and enhancement front-end cascaded with an intelligent classification back-end is presented. The front-end is composed of a sparsely deployed microphone array and a preprocessing …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Perotin et al. | | CRNN-based multiple DoA estimation using acoustic intensity features for Ambisonics recordings |
Zhang et al. | | Deep learning based binaural speech separation in reverberant environments |
Wang et al. | | Robust speaker localization guided by deep learning-based time-frequency masking |
Freiberger | | Development and evaluation of source localization algorithms for coincident microphone arrays |
CN109597022A (en) | | The operation of sound bearing angle, the method, apparatus and equipment for positioning target audio |
Wan et al. | | Sound source localization based on discrimination of cross-correlation functions |
Bai et al. | | Audio enhancement and intelligent classification of household sound events using a sparsely deployed array |
Vesperini et al. | | Localizing speakers in multiple rooms by using deep neural networks |
Pujol et al. | | BeamLearning: An end-to-end deep learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data |
Liu et al. | | Deep learning assisted sound source localization using two orthogonal first-order differential microphone arrays |
CN111863020A (en) | | Voice signal processing method, device, equipment and storage medium |
Pertilä et al. | | Multichannel source activity detection, localization, and tracking |
Ding et al. | | Joint estimation of binaural distance and azimuth by exploiting deep neural networks |
Zheng et al. | | Spectral mask estimation using deep neural networks for inter-sensor data ratio model based robust DOA estimation |
Aarabi et al. | | Robust sound localization using conditional time–frequency histograms |
Wu et al. | | Sound source localization based on multi-task learning and image translation network |
Kühne et al. | | A novel fuzzy clustering algorithm using observation weighting and context information for reverberant blind speech separation |
CN118053443A (en) | | Target speaker tracking method and system with selective hearing |
Dang et al. | | An iteratively reweighted steered response power approach to multisource localization using a distributed microphone network |
Zhang et al. | | Sound event localization and classification using WASN in Outdoor Environment |
Zhang et al. | | Binaural Reverberant Speech Separation Based on Deep Neural Networks. |
Dwivedi et al. | | Spherical harmonics domain-based approach for source localization in presence of directional interference |
Hu et al. | | A generalized network based on multi-scale densely connection and residual attention for sound source localization and detection |
Pasha et al. | | Distributed microphone arrays, emerging speech and audio signal processing platforms: A review |
Huang et al. | | DOA estimation using two independent convolutional neural networks with residual blocks |