Mogami et al., 2018 - Google Patents

Independent deeply learned matrix analysis for multichannel audio source separation

Document ID: 427619975562848294
Author: Mogami S; Sumino H; Kitamura D; Takamune N; Takamichi S; Saruwatari H; Ono N
Publication year: 2018
Publication venue: 2018 26th European Signal Processing Conference (EUSIPCO)

Snippet

In this paper, we address a multichannel audio source separation task and propose a new efficient method called independent deeply learned matrix analysis (IDLMA). IDLMA estimates the demixing matrix in a blind manner and updates the time-frequency structures …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

Similar Documents

Publication	Publication Date	Title
Mogami et al.	2018	Independent deeply learned matrix analysis for multichannel audio source separation
Makishima et al.	2019	Independent deeply learned matrix analysis for determined audio source separation
Kitamura et al.	2018	Determined blind source separation with independent low-rank matrix analysis
Shimada et al.	2019	Unsupervised speech enhancement based on multichannel NMF-informed beamforming for noise-robust automatic speech recognition
Nugraha et al.	2016	Multichannel audio source separation with deep neural networks
Scheibler et al.	2019	Independent vector analysis with more microphones than sources
Kitamura et al.	2015	Efficient multichannel nonnegative matrix factorization exploiting rank-1 spatial model
Sawada et al.	2012	Efficient algorithms for multichannel extensions of Itakura-Saito nonnegative matrix factorization
Mogami et al.	2017	Independent low-rank matrix analysis based on complex Student's t-distribution for blind audio source separation
Yoshii et al.	2018	Independent low-rank tensor analysis for audio source separation
Mitsui et al.	2017	Blind source separation based on independent low-rank matrix analysis with sparse regularization for time-series activity
Zhang et al.	2022	End-to-end dereverberation, beamforming, and speech recognition in a cocktail party
Kubo et al.	2019	Efficient full-rank spatial covariance estimation using independent low-rank matrix analysis for blind source separation
Li et al.	2022	FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures
Kitamura et al.	2015	Relaxation of rank-1 spatial constraint in overdetermined blind source separation
Boeddeker et al.	2022	An initialization scheme for meeting separation with spatial mixture models
Kitamura et al.	2017	Experimental analysis of optimal window length for independent low-rank matrix analysis
Nugraha et al.	2018	Deep neural network based multichannel audio source separation
Kang et al.	2019	A low-complexity permutation alignment method for frequency-domain blind source separation
Wang	2024	USDnet: Unsupervised Speech Dereverberation via Neural Forward Filtering
Saleem et al.	2018	Low rank sparse decomposition model based speech enhancement using gammatone filterbank and Kullback–Leibler divergence
Bando et al.	2023	Neural fast full-rank spatial covariance analysis for blind source separation
Mitsui et al.	2017	Independent low-rank matrix analysis based on parametric majorization-equalization algorithm
Hasumi et al.	2023	PoP-IDLMA: Product-of-Prior Independent Deeply Learned Matrix Analysis for Multichannel Music Source Separation
Sawada et al.	2023	Multi-frame full-rank spatial covariance analysis for underdetermined blind source separation and dereverberation