CN116072140A

CN116072140A - Howling processing method of interphone

Info

Publication number: CN116072140A
Application number: CN202310014214.3A
Authority: CN
Inventors: 侯丹; 施钟鸣
Original assignee: Wuxi Sicomm Communication Technology Co ltd
Current assignee: Wuxi Sicomm Communication Technology Co ltd
Priority date: 2023-01-05
Filing date: 2023-01-05
Publication date: 2023-05-05

Abstract

The invention relates to the technical field of interphones, in particular to a howling processing method of an interphone, which comprises the steps that firstly audio is subjected to Hamming window windowing; then, a band-pass filter is used for limiting the frequency band of the signal subjected to windowing processing to fr0-fr1, and energy is calculated; then, the fr0-fr1 time domain audio is converted into frequency domain audio through FFT; finding out the frequency points of the top three energy ranks in the fr0-fr1 range and recording f0, f1 and f2; continuously tracking for a period of time, counting a histogram, and recording frequency points F0, F1 and F2 of the top three frequency points; if a certain frequency point is in a high energy range for a long time, determining that the signal is howling; the trap cuts off energy at the frequency points required to be restrained; and converting the frequency domain signal subjected to notch processing into a time domain signal through inverse fast Fourier transform and outputting the time domain signal. According to the howling characteristics of the interphone, the invention automatically identifies and automatically suppresses the howling characteristics, and is easy to realize on an interphone platform.

Description

Howling processing method of interphone

Technical Field

The invention relates to the technical field of interphones, in particular to a howling processing method of an interphone.

Background

The development of traditional interphone equipment is today, except for digital reform, the improvement of voice communication quality, especially the suppression of noise under abnormal environment, and the howling is one of them. When the communication distance of the traditional interphone is relatively short, the sound emitted by the loudspeaker of the receiving equipment is transmitted back to the microphone of the transmitting equipment to form self-excitation, so that howling is often caused. Since the conventional interphone is a single built-in microphone and there is no treatment of howling suppression, it is necessary to manually pull a physical distance or reduce a volume to solve the problem.

The existing echo cancellation system algorithm aiming at howling is extremely complex, the tone quality can be damaged to a certain extent, and a method for automatically identifying and automatically suppressing howling, which is more practical, aiming at interphones is lacking.

Disclosure of Invention

The invention provides a howling processing method of an interphone, which has the advantages of simple algorithm, easy realization and good effect.

In order to achieve the purpose of the invention, the technical scheme adopted is as follows: a method for howling processing of an intercom, the method comprising:

s1, collecting audio of an interphone, wherein the audio is subjected to Hamming window windowing;

s2, limiting the signal frequency range after windowing to fr0-fr1 through a band-pass filter, and calculating energy;

s3, converting fr0-fr1 time domain audio into frequency domain audio through FFT;

s4, finding out frequency points of the top three energy ranks in the fr0-fr1 range and recording f0, f1 and f2;

s5, continuously tracking for a period of time, counting a histogram, and recording frequency points F0, F1 and F2 of the top three frequency points;

s6, if a certain frequency point is in a high energy range for a long time, determining that the signal is howling;

s7, deleting energy at a frequency point required to be restrained by the wave trap;

s8, converting the frequency domain signal subjected to the notch processing into a time domain signal through inverse fast Fourier transform and outputting the time domain signal.

As an optimization scheme of the invention, the frequency range fr0-fr1 is 1000hz-3000hz.

As an optimization of the present invention, 50 10ms times are continuously tracked in step S5.

As an optimization scheme of the invention, in step S7, the trap cuts off the energy to 24-64dB at the frequency point required to be restrained.

As an optimization scheme of the invention, the audio collected by the interphone is the audio collected by the microphone of 10 ms.

As an optimization scheme of the invention, the outputs of the step S3, the step S5 and the step S6 are processed by the step S7 wave trap.

The invention has the positive effects that: 1) According to the howling characteristics of the interphone, the invention automatically identifies and automatically suppresses the howling characteristics, and is easy to realize on an interphone platform;

2) The invention uses the simplest digital signal processing, adopts a feedforward mode, is stable and reliable, does not damage the tone quality, and is easy to realize on each platform.

Drawings

The invention will be described in further detail with reference to the drawings and the detailed description.

Fig. 1 is a flow chart of the method of the present invention.

Detailed Description

As shown in fig. 1, the invention discloses a howling processing method of an interphone, wherein the input of the method is audio acquired by a microphone of 10ms, and the output of the method is the audio of 10ms processed by howling, so that the real-time effect is achieved. The method comprises the following steps:

a: the audio passes through the hamming window.

B: band-pass filtering, the range fr0 to fr1 hz, fr0 to fr1 hz is the frequency range where intercom howling is most likely to occur. It is generally considered to be 1000hz to 3000hz.

C: FFT (fast fourier transform) converts a time domain signal into a frequency domain signal. The 256-point FFT is described below. A 256-point FFT according to the nyquist principle then contains 128 frequency points of energy.

D: and (3) finding out three frequency points f0, f1 and f2 with the maximum energy in 128 points according to the result of C, and adding 1 to the corresponding frequency point in the frequency histogram in E.

E: the peak count histogram is accumulated (e.g., 50 times). The section outputs 128 numbers, each representing the number of times that it becomes one of the maximum three times in 50 times at this frequency point, as a criterion.

F: the trap cuts off the energy at the frequency point requiring the suppression, which can reach 24-64dB.

G: the iFFT (inverse fast fourier transform) converts the notch-processed frequency domain signal into a time domain signal output.

H: from the 128-point data given by E, a peak was found. The larger the peak data is, the longer the energy is concentrated at this frequency point, and the tendency is to determine howling. Otherwise, the normal audio is the step S6.

The invention includes two parts howling detection and howling suppression. Wherein A, B, C, D, E in fig. 1 is a howling detection section; f and G are howling suppressing parts. And will be described in detail below.

Howling detection

Howling range (1 khz-3 khz) of the intercom and howling characteristics (single frequency point or two-three frequency points switched back and forth), which are responsible for detection, are prepared for call suppression. The specific detection procedure is described below.

Howling occurrence determination flow:

assuming that the frequency band of the howling is fr0 to fr1 (unit hertz), the method collects new data of 10ms at the microphone end each time, limits the signal frequency band to fr0 to fr1 through a band-pass filter, calculates the total energy (figure 1A), and considers that the howling is likely to be continuously recognized if a certain threshold is exceeded (figure 1B); then, the fr0-fr1 time domain audio is converted into frequency domain audio through FFT (figure 1C), the frequency points of the top three energy ranks in the fr0-fr1 range are found and recorded as f0, f1 and f2; continuously tracking 50 time of 10ms, counting a histogram, if one or two frequency points are in a high energy range for a long time, judging that the frequency points are howling, and recording frequency points F0, F1 and F2 of the top three frequency points in units of hertz (D, E in fig. 1).

Howling disappearance determination flow:

because the non-howling sound energy is more dispersed, the energy is not concentrated over the spectrum for a long period of time. In order to ensure normal communication, if the frequency points of the top three ranks are scattered on the histogram, the howling is eliminated, and suppression is not needed.

Howling suppression:

the module is responsible for processing howling that has occurred, and if it is determined that howling has occurred, only the frequency points where howling has occurred are notched (F0, F1, F2 in fig. 1E), thereby preserving the quality of normal voice communication as much as possible. For three frequency points F0, F1, F2, etc. recorded in the flow 1, the energy thereof is reduced in the frequency domain, respectively. The longer the continuous time of occurrence, the greater the suppression amplitude (e.g., 24dB for a continuous 500ms energy decrease, 30dB for a continuous 1000ms energy decrease, and so on).

While the foregoing is directed to embodiments of the present invention, other and further details of the invention may be had by the present invention, it should be understood that the foregoing description is merely illustrative of the present invention and that no limitations are intended to the scope of the invention, except insofar as modifications, equivalents, improvements or modifications are within the spirit and principles of the invention.

Claims

1. The method for processing the howling of the interphone is characterized by comprising the following steps of: the method comprises the following steps:

2. The howling processing method of an intercom as in claim 1, wherein: fr0-fr1 has a frequency in the range of 1000hz to 3000hz.

3. The howling processing method of an intercom as in claim 2, wherein: the tracking continues for 50 10ms times in step S5.

4. A howling processing method of an intercom as in claim 3, wherein: in step S7, the trap cuts off the energy to 24-64dB at the frequency point where suppression is required.

5. The howling processing method of an intercom as in claim 4, wherein: the audio collected by the interphone is the audio collected by a microphone of 10 ms.

6. The howling processing method of an intercom as in claim 5, wherein: the outputs of the step S3, the step S5 and the step S6 are processed by the step S7 wave trap.