CN105845152A

CN105845152A - Method for detecting audio signal echoes

Info

Publication number: CN105845152A
Application number: CN201610169131.1A
Authority: CN
Inventors: 杜鹢; 杨俊�; 郑泽俊
Original assignee: CHENGDU PTLC COMMUNICATIONS TECHNOLOGY Co Ltd
Current assignee: CHENGDU PTLC COMMUNICATIONS TECHNOLOGY Co Ltd
Priority date: 2016-03-23
Filing date: 2016-03-23
Publication date: 2016-08-10

Abstract

The invention discloses a method for detecting audio signal echoes. The method comprises following steps: sectional calculation is performed on power cepstrum of the audio signal to obtain kurtosis values and the kurtosis values are compared with a preset value to determine whether there are echoes in the audio signal according to the comparison results; thereby, whether there are echoes in the audio signal transmission process is detected in real time without reference signals and the detection accuracy is high; the problems that traditional detection methods are subject to reference resources and therefore the detection accuracy is difficult to guaranteed are solved.

Description

A kind of audio signal echo detection method

Technical field

The present invention relates to audio transmission system technical field, be specifically related to a kind of audio signal echo detection method.

Background technology

Along with the fast development of communication technology, people are more and more higher to the requirement of communication quality, and at audio frequency During transmission, the existence of echo is it is difficult to ensure that communication quality.In order to eliminate echo, and then improve communication Quality, lifting Consumer's Experience, detect whether to there is echo most important in sound transmission course in real time.Pass System echo checking system be use anechoic audio signal as reference signal, with the audio frequency of current transmission Signal compares, thus judges whether the audio signal of current transmission exists echo, but this dependence is with reference to letter Number detection method be limited by reference source, accuracy it is difficult to ensure that.

Summary of the invention

The technical problem to be solved is on the basis of without necessarily referring to signal, it is achieved believe at audio frequency Number transmitting procedure detects whether to there is echo in real time, it is therefore intended that provide a kind of audio signal detection of echoes side Method, solves current detection method and relies on reference source, there is detection inaccurate in the case of reference source instability Problem.

The present invention is achieved through the following technical solutions:

A kind of audio signal echo detection method, comprises the steps:

A, continuous print audio signal is carried out sub-frame processing；

B, calculate the power cepstrum of every frame audio signal；

C, to power cepstrum segmentation, calculate the kurtosis numerical value of every section；

D, comparison peak number of degrees value and setting threshold size, it is judged that whether audio signal exists echo.

Especially, described step B specifically includes:

B1, the audio signal that every frame is to be measured is carried out Fourier transformation, it is thus achieved that the power spectrum of every frame audio signal；

B2, power spectrum is carried out logarithm operation；

B3, the power spectrum through logarithm operation is carried out inverse Fourier transform, it is thus achieved that power cepstrum.

Especially, described step C specifically includes:

C1, power cepstrum to every frame audio signal carry out segmentation；

C2, calculate the kurtosis numerical value of every section of power cepstrum；

C3, obtain every frame audio signal maximum peak number of degrees value.

Especially, described step D specifically includes:

D1, comparison every frame audio signal maximum peak number of degrees value and setting threshold size；

D2, when the maximum peak number of degrees value of N continuous frame audio signal is all higher than setting threshold value, then judge this sound Frequently there is echo in signal, and N is positive integer.

The present invention compared with prior art, has such advantages as and beneficial effect:

A kind of audio signal echo detection method of the present invention, it is possible on the basis of without necessarily referring to signal, Realize detecting whether to exist echo during audio signal transmission in real time, detection accuracy is high, and customer service passes The detection method of system is limited by reference source, the accuracy of detection it is difficult to ensure that problem.

Accompanying drawing explanation

Accompanying drawing described herein is used for providing being further appreciated by the embodiment of the present invention, constitutes the application's A part, is not intended that the restriction to the embodiment of the present invention.In the accompanying drawings:

The audio signal echo detection method flow chart that Fig. 1 provides for the embodiment of the present invention 1.

Detailed description of the invention

For making the object, technical solutions and advantages of the present invention clearer, below in conjunction with embodiment and attached Figure, the present invention is described in further detail, and exemplary embodiment and the explanation thereof of the present invention are only used for Explain the present invention, not as a limitation of the invention.

Embodiment 1

As it is shown in figure 1, the audio signal echo detection method flow chart that Fig. 1 provides for the embodiment of the present invention 1.

In the present embodiment, audio signal echo detection method specifically includes:

S101, continuous print audio signal is carried out sub-frame processing.

Audio signal can not directly carry out signal processing as quasi-steady state signal, in the present embodiment, is carrying out letter Number process before to continuous print signal framing, every frame length about 20-30ms, the audio signal after framing is as stable state Signal is further carried out processing.

S102, the audio signal that every frame is to be measured is carried out Fourier transformation, it is thus achieved that the power of every frame audio signal Spectrum.

Power spectrum is the abbreviation of power spectral density function, and it is defined as the signal power in per unit band.Its table Show that signal power is along with the situation of change of frequency, i.e. signal power are in the distribution situation of frequency domain.The present embodiment In audio signal x (n) that certain frame is to be measured is carried out Fourier transformation, it is thus achieved that the power spectrum of audio signal is F (x (n)).

S103, power spectrum is carried out logarithm operation.

The power spectrum of audio signal x (n) is taken natural logrithm is ln (| F (x (n)) |²)。

S104, the power spectrum through logarithm operation is carried out inverse Fourier transform, it is thus achieved that power cepstrum.

Power cepstrum refers to the Fourier inversion that the Fourier transform spectrum of audio signal is carried out after logarithm operation again, When there is echo in audio signal, can form more sharp-pointed signal in power cepstrum, the present embodiment uses This characteristic is as the standard judging echo.For audio signal x (n), its power cepstrum PCx (n)= F-1 [ln (| F (x (n)) | 2)], wherein, F (.) represents Fourier transformation, and F-1 (.) represents Fourier inversion, ln (.) Represent and take natural logrithm.

S105, power cepstrum to every frame audio signal carry out segmentation.

Power cepstrum segmentation to every frame audio signal calculates kurtosis numerical value, if certain section exists echo, then should The kurtosis numerical value of echo section can be relatively big, in this, as the standard judging echo.

S106, calculate the kurtosis numerical value of every section of power cepstrum.

Kurtosis characterizes probability distribution density curve characteristic number of peak value height at meansigma methods.Coefficient of kurtosis is the biggest, The distribution of sample data just has more extremum, then its residual value must be more concentrated on around mode, Its distribution is inevitable the most precipitous.

The kurtosis computational methods of the stochastic audio signal are defined as: the fourth central square of stochastic variable and the ratio of variance Value.Wherein, central moment defines: for positive integer k, if E (X) exists, and E [| X-E (X) |^k)] < ∞, Then claim E{ [X-E (X)]^kIt it is the k rank central moment of stochastic variable X.If the variance of X is the second-order moment around mean of X, I.e. D (X)=E{ [X-E (X)]²}；Variance defines: variance is to weigh stochastic variable at theory of probability and statistical variance Or the tolerance of dispersion degree during one group of data.In theory of probability, variance is used for measuring stochastic variable and its mathematic expectaion Departure degree between (i.e. average).Variance (sample variance) in statistics be each data respectively with its The difference of average square the average of sum.

In the present embodiment, the concrete formula that kurtosis calculates is:

K u r t o s i s = (Σ_{i = 1}^{N} {(Y_{i} - \overset{&OverBar;}{Y})}^{4}) / ((N - 1) S^{4})

Wherein, N is power spectrum cepstrum data number, and i is power spectrum data index, Y_iEach for power spectrum Numerical value,For the meansigma methods of power spectrum,S⁴It is four degree variance,

S107, obtain every frame audio signal maximum peak number of degrees value.

The kurtosis numerical value calculated for power cepstrum segmentation, it is thus achieved that every frame audio signal maximum peak number of degrees value, looks for Go out the signal segment that kurtosis value is maximum.

S108, comparison every frame audio signal maximum peak number of degrees value and setting threshold size.

The threshold value of default suitable size is as thresholding, and the peak every frame audio signal power cepstrum segmentation calculated Number of degrees value is compared with the threshold value of setting.

S109, when the maximum peak number of degrees value of N continuous frame audio signal is all higher than setting threshold value, then judging should There is echo in audio signal.

If N continuous frame audio signal maximum peak number of degrees value all occurs more than the threshold value set, then judge this audio frequency There is echo in signal, otherwise it is assumed that this audio signal does not has echo.

Technical scheme by the power cepstrum segmentation of audio signal being calculated kurtosis numerical value, and with in advance If threshold value compare, and then judge whether audio signal exists echo according to comparative result, thus not On the basis of needing to refer to signal, it is achieved detect whether in real time to there is echo during audio signal transmission, Detection accuracy is high, and the traditional detection method of customer service is limited by reference source, the accuracy of detection it is difficult to ensure that Problem.

Above-described detailed description of the invention, is carried out the purpose of the present invention, technical scheme and beneficial effect Further describe, be it should be understood that the detailed description of the invention that the foregoing is only the present invention, The protection domain being not intended to limit the present invention, all within the spirit and principles in the present invention, that is done is any Amendment, equivalent, improvement etc., should be included within the scope of the present invention.

Claims

1. an audio signal echo detection method, it is characterised in that comprise the steps:

A, continuous print audio signal is carried out sub-frame processing；

B, calculate the power cepstrum of every frame audio signal；

C, to power cepstrum segmentation, calculate the kurtosis numerical value of every section;

2. audio signal echo detection method as claimed in claim 1, it is characterised in that described step B specifically includes:

B2, power spectrum is carried out logarithm operation；

3. audio signal echo detection method as claimed in claim 1, it is characterised in that described step C specifically includes:

C1, power cepstrum to every frame audio signal carry out segmentation；

C3, obtain every frame audio signal maximum peak number of degrees value.

4. audio signal echo detection method as claimed in claim 1, it is characterised in that described step D specifically includes:

D2, when the maximum peak number of degrees value of N continuous frame audio signal is all higher than setting threshold value, then judge that this audio signal exists echo, N is positive integer.