[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Mogami et al., 2018 - Google Patents

Independent deeply learned matrix analysis for multichannel audio source separation

Mogami et al., 2018

View PDF
Document ID
427619975562848294
Author
Mogami S
Sumino H
Kitamura D
Takamune N
Takamichi S
Saruwatari H
Ono N
Publication year
Publication venue
2018 26th European Signal Processing Conference (EUSIPCO)

External Links

Snippet

In this paper, we address a multichannel audio source separation task and propose a new efficient method called independent deeply learned matrix analysis (IDLMA). IDLMA estimates the demixing matrix in a blind manner and updates the time-frequency structures …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

Similar Documents

Publication Publication Date Title
Mogami et al. Independent deeply learned matrix analysis for multichannel audio source separation
Makishima et al. Independent deeply learned matrix analysis for determined audio source separation
Kitamura et al. Determined blind source separation with independent low-rank matrix analysis
Shimada et al. Unsupervised speech enhancement based on multichannel NMF-informed beamforming for noise-robust automatic speech recognition
Nugraha et al. Multichannel audio source separation with deep neural networks
Scheibler et al. Independent vector analysis with more microphones than sources
Kitamura et al. Efficient multichannel nonnegative matrix factorization exploiting rank-1 spatial model
Sawada et al. Efficient algorithms for multichannel extensions of Itakura-Saito nonnegative matrix factorization
Mogami et al. Independent low-rank matrix analysis based on complex student's t-distribution for blind audio source separation
Yoshii et al. Independent low-rank tensor analysis for audio source separation
Mitsui et al. Blind source separation based on independent low-rank matrix analysis with sparse regularization for time-series activity
Zhang et al. End-to-end dereverberation, beamforming, and speech recognition in a cocktail party
Kubo et al. Efficient full-rank spatial covariance estimation using independent low-rank matrix analysis for blind source separation
Li et al. FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures
Kitamura et al. Relaxation of rank-1 spatial constraint in overdetermined blind source separation
Boeddeker et al. An initialization scheme for meeting separation with spatial mixture models
Kitamura et al. Experimental analysis of optimal window length for independent low-rank matrix analysis
Nugraha et al. Deep neural network based multichannel audio source separation
Kang et al. A low-complexity permutation alignment method for frequency-domain blind source separation
Wang USDnet: Unsupervised Speech Dereverberation via Neural Forward Filtering
Saleem et al. Low rank sparse decomposition model based speech enhancement using gammatone filterbank and Kullback–Leibler divergence
Bando et al. Neural fast full-rank spatial covariance analysis for blind source separation
Mitsui et al. Independent low-rank matrix analysis based on parametric majorization-equalization algorithm
Hasumi et al. PoP-IDLMA: Product-of-Prior Independent Deeply Learned Matrix Analysis for Multichannel Music Source Separation
Sawada et al. Multi-frame full-rank spatial covariance analysis for underdetermined blind source separation and dereverberation