[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN103559888B - Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle - Google Patents

Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle Download PDF

Info

Publication number
CN103559888B
CN103559888B CN201310548773.9A CN201310548773A CN103559888B CN 103559888 B CN103559888 B CN 103559888B CN 201310548773 A CN201310548773 A CN 201310548773A CN 103559888 B CN103559888 B CN 103559888B
Authority
CN
China
Prior art keywords
matrix
rank
noisy speech
low
sparse matrix
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310548773.9A
Other languages
Chinese (zh)
Other versions
CN103559888A (en
Inventor
孙成立
须明
王希敏
谢坚筱
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
KEY LABORATORY OF SCIENCE AND TECHNOLOGY ON AVIONICS INTEGRATION TECHNOLOGIES
Original Assignee
KEY LABORATORY OF SCIENCE AND TECHNOLOGY ON AVIONICS INTEGRATION TECHNOLOGIES
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by KEY LABORATORY OF SCIENCE AND TECHNOLOGY ON AVIONICS INTEGRATION TECHNOLOGIES filed Critical KEY LABORATORY OF SCIENCE AND TECHNOLOGY ON AVIONICS INTEGRATION TECHNOLOGIES
Priority to CN201310548773.9A priority Critical patent/CN103559888B/en
Publication of CN103559888A publication Critical patent/CN103559888A/en
Application granted granted Critical
Publication of CN103559888B publication Critical patent/CN103559888B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Circuit For Audible Band Transducer (AREA)
  • Noise Elimination (AREA)

Abstract

The invention discloses a kind of based on non-negative low-rank with the sound enhancement method of sparse matrix decomposition principle.First noisy speech signal is smoothed by the method, framing and discrete Fourier transform, obtains noisy speech frequency spectrum;Then the noisy speech amplitude spectrum of every frame is sequentially arranged one noisy speech time-frequency matrix of composition as column vector, by noisy speech time-frequency matrix is carried out non-negative low-rank and sparse matrix decomposition, it is thus achieved that the low-rank of non-negative and sparse matrix;Utilize sparse matrix and noisy speech phase reconstruction to strengthen voice spectrum, obtain the enhancing voice of forms of time and space finally by inverse Fourier transform.The present invention is strong to noise adaptation, be made without end-point detection and model training, parameter the most easily regulate, and strong noise environment performance is good, has good application prospect.

Description

Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle
Technical field
The present invention relates to signal processing field, it is adaptable to the noise suppressed of noisy speech, be based particularly on non-negative low-rank and sparse square The sound enhancement method of battle array decomposition principle.
Background technology
Voice signal is Human communication's information means the most natural, maximally effective.Along with the mankind enter the information age, in the urgent need to Use the voice processing technology of advanced person to promote that human society is intelligent.As far back as 2000, Bill Gates was just it is proposed that " following 10 Year is the epoch of voice ".Recent years, successively release intelligent sound service, intelligence along with companies such as Fructus Mali pumilae, Google, Microsofts Voice industry has become as the new industry in areas of information technology, and user cognition degree and market scale the most gradually expand.Especially It is that the smart mobile phone that Fructus Mali pumilae is released recently has voice assistant's function, and the voice " cloud " that University of Science and Technology news fly is released, and makes intelligence Voice technology can face more wide application.But, inevitably by from ring around in voice communication and application process Border, communication media and the interference of inside communication equipment noise, had a strong impact on the actual application of intelligent sound technology.
Speech enhan-cement is the effective technology solving sound pollution.Speech enhan-cement is by the suppression noise interference to voice so that strengthen The voice signal processed is minimum with the distortion between original clean voice signal.Come in the past few decades, emerged in large numbers many speech enhan-cement Algorithm, typical algorithm includes spectrum-subtraction, based on spectral amplitude least mean-square error, subspace method, wavelet de-noising method.In noise In the environment of the highest, speech enhan-cement has been obtained for effectively solving.But, due to the multiformity of noise and voice in natural environment The complexity of signal itself, voice enhancement algorithm is different according to the difference of applied environment, and this makes its research work difficulty very big, The Speech Enhancement problem of very noisy and multiple noise circumstance is solved the most very well.
In existing voice enhancement algorithm, many methods attempt to use the pdf model of voice signal and noise signal to come Remove noise signal in big degree, but research in recent years shows that certain single distribution can not be applicable to all of voice or make an uproar Sound, needs mathematical model and model algorithm for estimating the most flexibly are to adapt to the feature of signal self.Additionally, increase at existing voice In strong algorithms, Noise Estimation is the early stage indispensability work of voice enhancement algorithm.By Noise Estimation can obtain noise power spectrum and The prior weight of voice signal, the improvement to speech enhan-cement effect is most important.Existing sound enhancement method passes through end-speech The voice signal collected is divided into noise segment and noisy speech section by some detection, utilizes noise segment to estimate and updates Noise Estimation amount, But this is a kind of suboptimal estimation mode, the instantaneous noise of noise segment and noisy speech section being not exclusively consistent in reality, therefore, This noise estimation method always brings error, moreover existing voice end-point detection technology is at low signal-to-noise ratio and nonstationary noise environment Under the most immature, easily cause erroneous judgement, can cause voice exists the biggest residual noise.
In recent years compressive sensing theory research shows, many actual observed quantities can be attributed to a low-rank component and sparse point The pattern that amount is added, by low-rank and the sparse matrix decomposition of matrix, can recover former from big noise or exceptional value contamination data Beginning data message.The low-rank of matrix and sparse matrix decomposition have been used to image enhaucament, video object detection, data mining etc. and are permitted Many sciemtifec and technical spheres.
Stationary random noise and periodic noise are modal two kinds of noise class.Stationary random noise's single order and second-order statistic Describing its stochastic process, its average and auto-correlation function are unrelated with the time, owing to the Fourier of stochastic signal auto-correlation function becomes Changing is power spectrum, therefore the time-frequency matrix of stationary random noise be an order number be the low-rank matrix of 1.Equally, if noise is week Phase property noise, owing to its time-frequency matrix only has value at some fixed frequency, its rectangular array vector has stronger dependency, must Also it is so a low-rank matrix.
In sum, the time-frequency rectangular array vector of background noise has the strongest dependency, and therefore the time-frequency matrix of noise has low Order.Relatively for background noise, speech source signals value in major part time frequency point is zero or close to zero, only minority Sample point value is relatively big, so speech source signals has certain openness, applicable sparse matrix describes.Therefore, it can Consider to use for reference the low-rank of matrix and sparse resolution theory solves Speech Enhancement problem.Chinese patent disclose a kind of based on low-rank with The single channel of sparse matrix decomposition is made an uproar separation method (publication number: CN102915742A) without supervision language.The method first by Noisy speech time domain waveform is transformed to time-frequency domain thus obtains the amplitude spectrum of noisy speech by Short Time Fourier Transform;Utilize low-rank with The amplitude spectrum of noisy speech is decomposed into noise amplitude spectrum, voice amplitudes spectrum and residual noise amplitude spectrum three by sparse matrix decomposition algorithm Person's sum;Finally, the voice time domain waveform that inverse Fourier transform reconstructs from the amplitude spectrum of voice in short-term is utilized.The method Deficiency is low-rank and sparse matrix decomposition not to be added nonnegativity restriction, is easily caused isolated from noisy speech amplitude spectrum Voice amplitudes spectrum containing negative result.And the amplitude spectrum of reality is non-negative physical quantity, negative value phenomenon should not occur.Negative value width Degree spectrum not only causes resolution error, and can produce the music noise that human ear is felt to feel bad, thus affects phonetic hearing quality.
The present invention devises a kind of based on non-negative low-rank with the sound enhancement method of sparse matrix decomposition principle, and the method uses non-negative Low-rank and sparse matrix decomposition principle decompose noisy speech amplitude spectrum, and the voice amplitudes spectrum that decomposition can be made to obtain meets nonnegativity, has Effect improves low-rank and sparse matrix decomposition effect.The method has strong robustness, be made without end-point detection and parameter is few The advantages such as easy regulation, are suitable for the speech enhan-cement task under strong noise environment.
Summary of the invention
The technical problem to be solved is to provide a kind of based on non-negative low-rank and the speech enhan-cement side of sparse matrix decomposition principle Method, by introducing noise and the low-rank of voice and sparse constraint and nonnegativity restriction carries out low-rank and sparse in time-frequency domain Matrix decomposition, it is achieved the language of noisy speech is made an uproar separation.
The present invention takes techniques below scheme, based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle, uses non-negative Low-rank and sparse matrix decomposition method isolate voice signal from noisy speech, and implementation step is as follows:
(1) discrete noisy speech signal being carried out pretreatment, pretreatment includes signal smoothing and framing;
(2) noisy speech signal after framing is carried out discrete Fourier transform, obtain noisy speech frequency spectrum;
(3) in a frequency domain, using the spectrum amplitude of every frame voice as column vector, it is sequentially arranged, by several speech frame structures Become noisy speech time-frequency matrix;
(4) utilize non-negative low-rank and sparse matrix decomposition algorithm that noisy speech time-frequency matrix is decomposed, it is thus achieved that the low-rank square of non-negative Battle array and sparse matrix;Decomposing expression formula is:
Y=L+S+E meets rank (L)≤r, | | S | |0≤h,L≥0,S≥0;
Wherein: Y is noisy speech time-frequency matrix;L is low-rank matrix, the amplitude spectrum of corresponding noise;S is sparse matrix, corresponding language The amplitude spectrum of sound, | | S | |0Representing the non-zero element number that sparse matrix S contains, the order of rank (L) representing matrix L, E is residual matrix, R and h represents low-rank and sparse constraint upper limit parameter;
(5) the phase spectrum reconstruct utilizing sparse matrix S and noisy speech strengthens voice spectrum, then by inverse Fourier transform, Enhancing voice to forms of time and space.
The processing procedure that discrete noisy speech signal carries out in described step (1) pretreatment is:
(1) P point arest neighbors signal average is used to carry out signal smoothing, in order to the amplitude wave-shape of smooth noisy speech;
(2) to noisy speech signal framing, the window function that framing uses is Hamming window, and a length of 200 points of window, each interframe moves Overlap to count be 80 points.
The step calculating low-rank matrix L and sparse matrix S is as follows:
(1) initialize: Y0=Y;L0=S0=[0]N×K
Iterations initial value i=1;Maximum iteration time imax=103;Relative error threshold value δ=10-3
(2) use NMF renewal low-rank matrix: (W, H)=NMF (Yi-1), Li=WH;W∈RN×r, H ∈ Rr×K
NMF represent Non-negative Matrix Factorization, NMF represent Non-negative Matrix Factorization, W and H be order be the NMF decomposition result of r, the survey of NMF Degree function selects Itakura-Saito to estimate;
(3) Soft-thresholding operator is used to update sparse matrix: Si=(Yi-1-Li+Si-1> λ) (Yi-1-Li+Si-1-λ);
Wherein: symbolRepresenting matrix correspondence position element product, λ is thresholding constant;λ is relevant with noise level, it is recommended that value λ=σ, wherein σ is the mean square deviation of noise;
(4) superposition matrix: Y is updatedi=Li+Si
(5) if i reach maximum iteration time i=imax orStop iteration, the estimated value of output L and S LiAnd Si;Otherwise jump to step (2), i=i+1;Continue executing with iterative process.
The phase spectrum utilizing sparse matrix and noisy speech in described step (5) reconstructs and strengthens voice spectrum:
Wherein: ∠ Y (n, k) is the spectral phase of noisy speech, and S is sparse matrix, S (n, k) is sparse matrix spectrum amplitude angle value,For the enhancing voice spectrum of reconstruct, n is time frame index, and k is frequency indices.
The sound enhancement method that the present invention provides, by non-negative low-rank and sparse matrix decomposition, can make the low-rank matrix that decomposition obtains It it is all nonnegative value with the element in sparse matrix.The method is made without end-point detection and model training, have strong robustness, The advantages such as parameter the most easily regulation, the speech enhan-cement task being particularly suitable under strong noise environment.
Accompanying drawing explanation
Fig. 1 is the speech-enhancement system block diagram of the present invention.
Detailed description of the invention
In conjunction with accompanying drawing, the invention will be further described, sees Fig. 1, based on non-negative low-rank and the voice of sparse matrix decomposition principle Enhancement Method, including step in detail below:
1) noisy speech signal y (t) is carried out pretreatment 101;Pretreatment 101 stage includes signal smoothing and framing so that it is be easy to Subsequent processes.Signal smoothing refer to use y (t) P point arest neighbors signal average to calculate noisy speech signal currency, in order to smooth The amplitude wave-shape of noisy speech signal.In the present invention, the value of P is 3, i.e.Framing uses Window function be Hamming window, a length of 200 points of window, it is 80 points that the overlap that each interframe moves is counted;
2) noisy speech signal after framing is carried out DFT (discrete Fourier transform) 102, obtain signal spectrum, signal spectrum bag Include the amplitude spectrum 104 of signal | Y (n, k) | and phase spectrum 103 ∠ Y (n, k).Wherein n represents frame index, n=1,2 ..., N;K represents frequency Rate indexes, k=1, and 2 ..., k;N is total time frame number;K is that Fourier transformation is counted;
3) in a frequency domain, being arranged as column vector order by the amplitude spectrum 104 of every frame voice, so several speech frames just constitute one The noisy time-frequency matrix Y of individual N × K.
4) noisy time-frequency matrix Y is carried out NLSMD (non-negative low-rank and sparse matrix decomposition) 105, calculate non-negative low-rank matrix L and non- Negative sparse matrix S.
Y=L+S+E meets rank (L)≤r, | | S | |0≤h,L≥0,S≥0
Here the amplitude spectrum of L correspondence noise;The amplitude spectrum of S correspondence voice;||S||0The non-zero element number that representing matrix S contains;E For residual matrix;The order of rank (L) representing matrix L;R and h represents low-rank and sparse constraint upper limit parameter;Through contrast test, r Preferable noise reduction is obtained during value 1~3.
The calculating process of NLSMD (non-negative low-rank and sparse matrix decomposition) 105 is as follows:
1. initialize: Y0=Y;L0=S0=[0]N×K;Iterations i=1;Maximum iteration time imax=103;Relative error threshold Value δ=10-3
2. use Non-negative Matrix Factorization renewal low-rank matrix: (W, H)=NMF (Yi-1), Li=WH;W∈RN×r, H ∈ Rr×K
Wherein: LiThe estimated value through ith iteration L, NMF represent Non-negative Matrix Factorization, W and H be order be r NMF decompose As a result, owing to W and H is nonnegative value, so LiInevitable is also nonnegative matrix.The measure function of NMF algorithm can select Euclidean Distance, Kullback-Leibler estimate and estimate with Itakura-Saito.Through contrast test, use Itakura-Saito to estimate and obtain Good effect.Therefore, the present invention uses the NMF method estimated based on Itakura-Saito to calculate L.
3. Soft-thresholding operator is used to update sparse matrix Si: Si=(Yi-1-Li+Si-1> λ) (Yi-1-Li+ Si-1-λ);
Wherein: symbolRepresenting matrix correspondence position element product, λ is threshold value, and the value of λ is relevant with noise intensity, pushes away Recommending value λ=σ, wherein σ is noise mean square deviation.
4. superposition matrix: Y is updatedi=Li+Si
If 5. i reach maximum iteration time i=imax orStop iteration, estimated value Li of output L and S And Si;Otherwise jump to step 2., i=i+1, continue executing with iterative process;
5) sparse matrix S and the reconstruct of noisy speech phase spectrum is utilized to strengthen voice spectrum, owing to human ear is unwise to the phase information of sound Sense, can with the phase place ∠ Y of noisy speech frequency spectrum (n, k) replaces the phase place strengthening voice, obtains strengthening the complex number spectrum of voice:
6) the complex number spectrum matrix strengthening voice is expanded into vector, it is carried out IDFT (inverse discrete Fourier transform) 106, obtain The discrete time strengthening voice represents:
Wherein:Rectangular array vector temporally frame sequential is concatenated into the operation of one-dimensional vector by vec function representation.

Claims (1)

1. based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle, by non-negative low-rank and sparse matrix decomposition method Isolating voice signal from noisy speech, discrete noisy speech signal is carried out pretreatment, pretreatment includes signal smoothing and divides Frame;Noisy speech signal after framing is carried out discrete Fourier transform, obtains noisy speech frequency spectrum;In a frequency domain, by every frame The spectrum amplitude of voice, as column vector, is sequentially arranged, and is made up of noisy speech time-frequency matrix several speech frames;Profit With non-negative low-rank and sparse matrix decomposition algorithm, noisy speech time-frequency matrix is decomposed, it is thus achieved that the low-rank matrix of non-negative and sparse Matrix;Decomposing expression formula is:
Y=L+S+E meets rank (L)≤r, | | S | |0≤h,L≥0,S≥0;
Wherein: Y is noisy speech time-frequency matrix;L is low-rank matrix, the amplitude spectrum of corresponding noise;S is sparse matrix, corresponding language The amplitude spectrum of sound, | | S | |0Representing the non-zero element number that sparse matrix S contains, the order of rank (L) representing matrix L, E is residual matrix, R and h represents low-rank and sparse constraint upper limit parameter;The phase spectrum reconstruct utilizing sparse matrix S and noisy speech strengthens voice spectrum, Then by inverse Fourier transform, the enhancing voice of forms of time and space is obtained;It is characterized in that, calculate low-rank matrix L and sparse matrix The step of S is as follows:
(1) initialize: Y0=Y;L0=S0=[0]N×K
Iterations initial value i=1;Maximum iteration time imax=103;Relative error threshold value δ=10-3
(2) use NMF renewal low-rank matrix: (W, H)=NMF (Yi-1), Li=WH;W∈RN×r, H ∈ Rr×K
NMF represent Non-negative Matrix Factorization, W and H be order be the NMF decomposition result of r, the measure function of NMF selects Itakura-Saito Estimate;
(3) Soft-thresholding operator is used to update sparse matrix: Si=(Yi-1-Li+Si-1> λ) (Yi-1-Li+Si-1-λ);
Wherein: symbol representing matrix correspondence position element product, λ is thresholding constant;λ is relevant with noise level, it is recommended that value λ=σ, wherein σ is the mean square deviation of noise;
(4) superposition matrix: Y is updatedi=Li+Si
(5) if i reach maximum iteration time i=imax orStop iteration, the estimated value of output L and S LiAnd Si;Otherwise jump to step (2), i=i+1;Continue executing with iterative process.
CN201310548773.9A 2013-11-07 2013-11-07 Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle Expired - Fee Related CN103559888B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310548773.9A CN103559888B (en) 2013-11-07 2013-11-07 Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310548773.9A CN103559888B (en) 2013-11-07 2013-11-07 Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle

Publications (2)

Publication Number Publication Date
CN103559888A CN103559888A (en) 2014-02-05
CN103559888B true CN103559888B (en) 2016-10-05

Family

ID=50014118

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310548773.9A Expired - Fee Related CN103559888B (en) 2013-11-07 2013-11-07 Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle

Country Status (1)

Country Link
CN (1) CN103559888B (en)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104021797A (en) * 2014-06-19 2014-09-03 南昌大学 Voice signal enhancement method based on frequency domain sparse constraint
EP3010017A1 (en) * 2014-10-14 2016-04-20 Thomson Licensing Method and apparatus for separating speech data from background data in audio communication
CN104751855A (en) * 2014-11-25 2015-07-01 北京理工大学 Speech enhancement method in music background based on non-negative matrix factorization
CN104505100B (en) * 2015-01-06 2017-12-12 中国人民解放军理工大学 A kind of unsupervised sound enhancement method based on robust Non-negative Matrix Factorization and data fusion
US10528147B2 (en) 2017-03-06 2020-01-07 Microsoft Technology Licensing, Llc Ultrasonic based gesture recognition
US10276179B2 (en) 2017-03-06 2019-04-30 Microsoft Technology Licensing, Llc Speech enhancement with low-order non-negative matrix factorization
CN108573711A (en) * 2017-03-09 2018-09-25 中国科学院声学研究所 A kind of single microphone speech separating method based on NMF algorithms
US10984315B2 (en) 2017-04-28 2021-04-20 Microsoft Technology Licensing, Llc Learning-based noise reduction in data produced by a network of sensors, such as one incorporated into loose-fitting clothing worn by a person
CN107528648A (en) * 2017-10-23 2017-12-29 北京邮电大学 A kind of blind frequency spectrum sensing method and device based on low-rank sparse matrix decomposition
CN108899045A (en) * 2018-06-29 2018-11-27 中国航空无线电电子研究所 Subspace sound enhancement method based on constraint low-rank and sparse decomposition
CN108986834B (en) * 2018-08-22 2023-04-07 中国人民解放军陆军工程大学 Bone conduction voice blind enhancement method based on codec framework and recurrent neural network
CN109036452A (en) * 2018-09-05 2018-12-18 北京邮电大学 A kind of voice information processing method, device, electronic equipment and storage medium
CN109215671B (en) * 2018-11-08 2022-12-02 西安电子科技大学 Voice enhancement system and method based on MFrSRRPCA algorithm
WO2020113575A1 (en) * 2018-12-07 2020-06-11 广东省智能制造研究所 Sound classification method, device and medium based on semi-nonnegative materix factorization with constraint
CN109658944B (en) * 2018-12-14 2020-08-07 中国电子科技集团公司第三研究所 Helicopter acoustic signal enhancement method and device
CN111863014B (en) * 2019-04-26 2024-09-17 北京嘀嘀无限科技发展有限公司 Audio processing method, device, electronic equipment and readable storage medium
CN111312270B (en) * 2020-02-10 2022-11-22 腾讯科技(深圳)有限公司 Voice enhancement method and device, electronic equipment and computer readable storage medium
CN111402909B (en) * 2020-03-02 2023-07-07 东华大学 Speech enhancement method based on constant frequency domain transformation
CN114913874B (en) * 2021-02-08 2024-06-18 北京小米移动软件有限公司 Voice signal processing method and device, electronic equipment and storage medium
CN113129872B (en) * 2021-04-06 2023-03-14 新疆大学 Voice enhancement method based on deep compressed sensing
CN117935834B (en) * 2024-03-12 2024-05-28 深圳市声优创科技有限公司 Intelligent audio noise reduction method and equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101441872A (en) * 2007-11-19 2009-05-27 三菱电机株式会社 Denoising acoustic signals using constrained non-negative matrix factorization
CN102855884A (en) * 2012-09-11 2013-01-02 中国人民解放军理工大学 Speech time scale modification method based on short-term continuous nonnegative matrix decomposition
CN102915742A (en) * 2012-10-30 2013-02-06 中国人民解放军理工大学 Single-channel monitor-free voice and noise separating method based on low-rank and sparse matrix decomposition

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20100111499A (en) * 2009-04-07 2010-10-15 삼성전자주식회사 Apparatus and method for extracting target sound from mixture sound
US8080724B2 (en) * 2009-09-14 2011-12-20 Electronics And Telecommunications Research Institute Method and system for separating musical sound source without using sound source database

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101441872A (en) * 2007-11-19 2009-05-27 三菱电机株式会社 Denoising acoustic signals using constrained non-negative matrix factorization
CN102855884A (en) * 2012-09-11 2013-01-02 中国人民解放军理工大学 Speech time scale modification method based on short-term continuous nonnegative matrix decomposition
CN102915742A (en) * 2012-10-30 2013-02-06 中国人民解放军理工大学 Single-channel monitor-free voice and noise separating method based on low-rank and sparse matrix decomposition

Also Published As

Publication number Publication date
CN103559888A (en) 2014-02-05

Similar Documents

Publication Publication Date Title
CN103559888B (en) Based on non-negative low-rank and the sound enhancement method of sparse matrix decomposition principle
Sigg et al. Speech enhancement using generative dictionary learning
CN111445905B (en) Mixed voice recognition network training method, mixed voice recognition method, device and storage medium
CN107564513B (en) Voice recognition method and device
CN109524020B (en) Speech enhancement processing method
CN102915742B (en) Single-channel monitor-free voice and noise separating method based on low-rank and sparse matrix decomposition
CN104685562B (en) Method and apparatus for reconstructing echo signal from noisy input signal
Talmon et al. Single-channel transient interference suppression with diffusion maps
CN104505100B (en) A kind of unsupervised sound enhancement method based on robust Non-negative Matrix Factorization and data fusion
CN111508518B (en) Single-channel speech enhancement method based on joint dictionary learning and sparse representation
Mavaddaty et al. A novel speech enhancement method by learnable sparse and low-rank decomposition and domain adaptation
Wang Multi-band multi-centroid clustering based permutation alignment for frequency-domain blind speech separation
Li et al. A multi-objective learning speech enhancement algorithm based on IRM post-processing with joint estimation of SCNN and TCNN
JP2013068938A (en) Signal processing apparatus, signal processing method, and computer program
Nie et al. Exploiting spectro-temporal structures using NMF for DNN-based supervised speech separation
Kang et al. NMF-based speech enhancement incorporating deep neural network.
Xu et al. Methods for learning adaptive dictionary in underdetermined speech separation
JP5807914B2 (en) Acoustic signal analyzing apparatus, method, and program
Casebeer et al. Deep tensor factorization for spatially-aware scene decomposition
Narayanaswamy et al. Audio source separation via multi-scale learning with dilated dense u-nets
Fang et al. Integrating statistical uncertainty into neural network-based speech enhancement
CN102509268B (en) Immune-clonal-selection-based nonsubsampled contourlet domain image denoising method
Ben Messaoud et al. Sparse representations for single channel speech enhancement based on voiced/unvoiced classification
Zdunek Improved convolutive and under-determined blind audio source separation with MRF smoothing
Ben et al. Chirp signal denoising based on convolution neural network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20161005

Termination date: 20171107