Azetsu et al., 2007 - Google Patents
Blind separation and sound localization by using frequency-domain ICAAzetsu et al., 2007
- Document ID
- 3726115841397243019
- Author
- Azetsu T
- Uchino E
- Suetake N
- Publication year
- Publication venue
- Soft Computing
External Links
Snippet
The independent component analysis (ICA) in the frequency domain is a method to deal with a blind signal separation problem in which propagation time delays are included in the mixing process of signals. We propose an extended method of the frequency-domain ICA …
- 238000000926 separation method 0 title abstract description 22
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/624—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on a separation criterion, e.g. independent component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00496—Recognising patterns in signals and combinations thereof
- G06K9/0057—Source localisation; Inverse modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00496—Recognising patterns in signals and combinations thereof
- G06K9/00503—Preprocessing, e.g. filtering
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9420368B2 (en) | Time-frequency directional processing of audio signals | |
Mittal et al. | Signal/noise KLT based approach for enhancing speech degraded by colored noise | |
Nesta et al. | Generalized state coherence transform for multidimensional TDOA estimation of multiple sources | |
Benesty et al. | Frequency-domain blind source separation | |
Makino et al. | Blind source separation of convolutive mixtures of speech in frequency domain | |
Kim et al. | Blind source separation exploiting higher-order frequency dependencies | |
Nesta et al. | Convolutive BSS of short mixtures by ICA recursively regularized across frequencies | |
Sawada et al. | Measuring dependence of bin-wise separated signals for permutation alignment in frequency-domain BSS | |
Pedersen et al. | Convolutive blind source separation methods | |
Yang et al. | Under-determined convolutive blind source separation combining density-based clustering and sparse reconstruction in time-frequency domain | |
Boashash et al. | Robust multisensor time–frequency signal processing: A tutorial review with illustrations of performance enhancement in selected application areas | |
Wang et al. | A region-growing permutation alignment approach in frequency-domain blind source separation of speech mixtures | |
US10410641B2 (en) | Audio source separation | |
Knaak et al. | Geometrically constrained independent component analysis | |
EP3050056B1 (en) | Time-frequency directional processing of audio signals | |
Rao et al. | A denoising approach to multisensor signal estimation | |
Cobos et al. | Maximum a posteriori binary mask estimation for underdetermined source separation using smoothed posteriors | |
Li et al. | An EM algorithm for audio source separation based on the convolutive transfer function | |
Kim et al. | Efficient online target speech extraction using DOA-constrained independent component analysis of stereo data for robust speech recognition | |
Das et al. | ICA methods for blind source separation of instantaneous mixtures: A case study | |
Hoffmann et al. | Using information theoretic distance measures for solving the permutation problem of blind source separation of speech signals | |
Albataineh et al. | A RobustICA-based algorithmic system for blind separation of convolutive mixtures | |
Azetsu et al. | Blind separation and sound localization by using frequency-domain ICA | |
Mitianoudis | Audio source separation using independent component analysis | |
Fontaine et al. | Scalable source localization with multichannel α-stable distributions |