CN116072140A - Howling processing method of interphone - Google Patents
Howling processing method of interphone Download PDFInfo
- Publication number
- CN116072140A CN116072140A CN202310014214.3A CN202310014214A CN116072140A CN 116072140 A CN116072140 A CN 116072140A CN 202310014214 A CN202310014214 A CN 202310014214A CN 116072140 A CN116072140 A CN 116072140A
- Authority
- CN
- China
- Prior art keywords
- howling
- frequency
- energy
- interphone
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 11
- 238000012545 processing Methods 0.000 claims abstract description 8
- 238000000034 method Methods 0.000 claims description 11
- 230000001629 suppression Effects 0.000 claims description 9
- 238000005457 optimization Methods 0.000 description 5
- 238000004891 communication Methods 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 238000011895 specific detection Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0324—Details of processing therefor
- G10L21/034—Automatic adjustment
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
The invention relates to the technical field of interphones, in particular to a howling processing method of an interphone, which comprises the steps that firstly audio is subjected to Hamming window windowing; then, a band-pass filter is used for limiting the frequency band of the signal subjected to windowing processing to fr0-fr1, and energy is calculated; then, the fr0-fr1 time domain audio is converted into frequency domain audio through FFT; finding out the frequency points of the top three energy ranks in the fr0-fr1 range and recording f0, f1 and f2; continuously tracking for a period of time, counting a histogram, and recording frequency points F0, F1 and F2 of the top three frequency points; if a certain frequency point is in a high energy range for a long time, determining that the signal is howling; the trap cuts off energy at the frequency points required to be restrained; and converting the frequency domain signal subjected to notch processing into a time domain signal through inverse fast Fourier transform and outputting the time domain signal. According to the howling characteristics of the interphone, the invention automatically identifies and automatically suppresses the howling characteristics, and is easy to realize on an interphone platform.
Description
Technical Field
The invention relates to the technical field of interphones, in particular to a howling processing method of an interphone.
Background
The development of traditional interphone equipment is today, except for digital reform, the improvement of voice communication quality, especially the suppression of noise under abnormal environment, and the howling is one of them. When the communication distance of the traditional interphone is relatively short, the sound emitted by the loudspeaker of the receiving equipment is transmitted back to the microphone of the transmitting equipment to form self-excitation, so that howling is often caused. Since the conventional interphone is a single built-in microphone and there is no treatment of howling suppression, it is necessary to manually pull a physical distance or reduce a volume to solve the problem.
The existing echo cancellation system algorithm aiming at howling is extremely complex, the tone quality can be damaged to a certain extent, and a method for automatically identifying and automatically suppressing howling, which is more practical, aiming at interphones is lacking.
Disclosure of Invention
The invention provides a howling processing method of an interphone, which has the advantages of simple algorithm, easy realization and good effect.
In order to achieve the purpose of the invention, the technical scheme adopted is as follows: a method for howling processing of an intercom, the method comprising:
s1, collecting audio of an interphone, wherein the audio is subjected to Hamming window windowing;
s2, limiting the signal frequency range after windowing to fr0-fr1 through a band-pass filter, and calculating energy;
s3, converting fr0-fr1 time domain audio into frequency domain audio through FFT;
s4, finding out frequency points of the top three energy ranks in the fr0-fr1 range and recording f0, f1 and f2;
s5, continuously tracking for a period of time, counting a histogram, and recording frequency points F0, F1 and F2 of the top three frequency points;
s6, if a certain frequency point is in a high energy range for a long time, determining that the signal is howling;
s7, deleting energy at a frequency point required to be restrained by the wave trap;
s8, converting the frequency domain signal subjected to the notch processing into a time domain signal through inverse fast Fourier transform and outputting the time domain signal.
As an optimization scheme of the invention, the frequency range fr0-fr1 is 1000hz-3000hz.
As an optimization of the present invention, 50 10ms times are continuously tracked in step S5.
As an optimization scheme of the invention, in step S7, the trap cuts off the energy to 24-64dB at the frequency point required to be restrained.
As an optimization scheme of the invention, the audio collected by the interphone is the audio collected by the microphone of 10 ms.
As an optimization scheme of the invention, the outputs of the step S3, the step S5 and the step S6 are processed by the step S7 wave trap.
The invention has the positive effects that: 1) According to the howling characteristics of the interphone, the invention automatically identifies and automatically suppresses the howling characteristics, and is easy to realize on an interphone platform;
2) The invention uses the simplest digital signal processing, adopts a feedforward mode, is stable and reliable, does not damage the tone quality, and is easy to realize on each platform.
Drawings
The invention will be described in further detail with reference to the drawings and the detailed description.
Fig. 1 is a flow chart of the method of the present invention.
Detailed Description
As shown in fig. 1, the invention discloses a howling processing method of an interphone, wherein the input of the method is audio acquired by a microphone of 10ms, and the output of the method is the audio of 10ms processed by howling, so that the real-time effect is achieved. The method comprises the following steps:
a: the audio passes through the hamming window.
B: band-pass filtering, the range fr0 to fr1 hz, fr0 to fr1 hz is the frequency range where intercom howling is most likely to occur. It is generally considered to be 1000hz to 3000hz.
C: FFT (fast fourier transform) converts a time domain signal into a frequency domain signal. The 256-point FFT is described below. A 256-point FFT according to the nyquist principle then contains 128 frequency points of energy.
D: and (3) finding out three frequency points f0, f1 and f2 with the maximum energy in 128 points according to the result of C, and adding 1 to the corresponding frequency point in the frequency histogram in E.
E: the peak count histogram is accumulated (e.g., 50 times). The section outputs 128 numbers, each representing the number of times that it becomes one of the maximum three times in 50 times at this frequency point, as a criterion.
F: the trap cuts off the energy at the frequency point requiring the suppression, which can reach 24-64dB.
G: the iFFT (inverse fast fourier transform) converts the notch-processed frequency domain signal into a time domain signal output.
H: from the 128-point data given by E, a peak was found. The larger the peak data is, the longer the energy is concentrated at this frequency point, and the tendency is to determine howling. Otherwise, the normal audio is the step S6.
The invention includes two parts howling detection and howling suppression. Wherein A, B, C, D, E in fig. 1 is a howling detection section; f and G are howling suppressing parts. And will be described in detail below.
Howling detection
Howling range (1 khz-3 khz) of the intercom and howling characteristics (single frequency point or two-three frequency points switched back and forth), which are responsible for detection, are prepared for call suppression. The specific detection procedure is described below.
Howling occurrence determination flow:
assuming that the frequency band of the howling is fr0 to fr1 (unit hertz), the method collects new data of 10ms at the microphone end each time, limits the signal frequency band to fr0 to fr1 through a band-pass filter, calculates the total energy (figure 1A), and considers that the howling is likely to be continuously recognized if a certain threshold is exceeded (figure 1B); then, the fr0-fr1 time domain audio is converted into frequency domain audio through FFT (figure 1C), the frequency points of the top three energy ranks in the fr0-fr1 range are found and recorded as f0, f1 and f2; continuously tracking 50 time of 10ms, counting a histogram, if one or two frequency points are in a high energy range for a long time, judging that the frequency points are howling, and recording frequency points F0, F1 and F2 of the top three frequency points in units of hertz (D, E in fig. 1).
Howling disappearance determination flow:
because the non-howling sound energy is more dispersed, the energy is not concentrated over the spectrum for a long period of time. In order to ensure normal communication, if the frequency points of the top three ranks are scattered on the histogram, the howling is eliminated, and suppression is not needed.
Howling suppression:
the module is responsible for processing howling that has occurred, and if it is determined that howling has occurred, only the frequency points where howling has occurred are notched (F0, F1, F2 in fig. 1E), thereby preserving the quality of normal voice communication as much as possible. For three frequency points F0, F1, F2, etc. recorded in the flow 1, the energy thereof is reduced in the frequency domain, respectively. The longer the continuous time of occurrence, the greater the suppression amplitude (e.g., 24dB for a continuous 500ms energy decrease, 30dB for a continuous 1000ms energy decrease, and so on).
While the foregoing is directed to embodiments of the present invention, other and further details of the invention may be had by the present invention, it should be understood that the foregoing description is merely illustrative of the present invention and that no limitations are intended to the scope of the invention, except insofar as modifications, equivalents, improvements or modifications are within the spirit and principles of the invention.
Claims (6)
1. The method for processing the howling of the interphone is characterized by comprising the following steps of: the method comprises the following steps:
s1, collecting audio of an interphone, wherein the audio is subjected to Hamming window windowing;
s2, limiting the signal frequency range after windowing to fr0-fr1 through a band-pass filter, and calculating energy;
s3, converting fr0-fr1 time domain audio into frequency domain audio through FFT;
s4, finding out frequency points of the top three energy ranks in the fr0-fr1 range and recording f0, f1 and f2;
s5, continuously tracking for a period of time, counting a histogram, and recording frequency points F0, F1 and F2 of the top three frequency points;
s6, if a certain frequency point is in a high energy range for a long time, determining that the signal is howling;
s7, deleting energy at a frequency point required to be restrained by the wave trap;
s8, converting the frequency domain signal subjected to the notch processing into a time domain signal through inverse fast Fourier transform and outputting the time domain signal.
2. The howling processing method of an intercom as in claim 1, wherein: fr0-fr1 has a frequency in the range of 1000hz to 3000hz.
3. The howling processing method of an intercom as in claim 2, wherein: the tracking continues for 50 10ms times in step S5.
4. A howling processing method of an intercom as in claim 3, wherein: in step S7, the trap cuts off the energy to 24-64dB at the frequency point where suppression is required.
5. The howling processing method of an intercom as in claim 4, wherein: the audio collected by the interphone is the audio collected by a microphone of 10 ms.
6. The howling processing method of an intercom as in claim 5, wherein: the outputs of the step S3, the step S5 and the step S6 are processed by the step S7 wave trap.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310014214.3A CN116072140A (en) | 2023-01-05 | 2023-01-05 | Howling processing method of interphone |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310014214.3A CN116072140A (en) | 2023-01-05 | 2023-01-05 | Howling processing method of interphone |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116072140A true CN116072140A (en) | 2023-05-05 |
Family
ID=86183225
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310014214.3A Pending CN116072140A (en) | 2023-01-05 | 2023-01-05 | Howling processing method of interphone |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116072140A (en) |
-
2023
- 2023-01-05 CN CN202310014214.3A patent/CN116072140A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101430882B (en) | Method and apparatus for restraining wind noise | |
US8977545B2 (en) | System and method for multi-channel noise suppression | |
CA2527461C (en) | Reverberation estimation and suppression system | |
CN112004177B (en) | Howling detection method, microphone volume adjustment method and storage medium | |
CN102160296B (en) | Method and apparatus for detecting double talk | |
EP2352148B1 (en) | Noise gate, sound collection device, and noise removal method | |
US20020103636A1 (en) | Frequency-domain post-filtering voice-activity detector | |
CN110536215A (en) | Method, apparatus, calculating and setting and the storage medium of Audio Signal Processing | |
US20130182866A1 (en) | Sound processing apparatus and sound processing method | |
CN110956975B (en) | Echo cancellation method and device | |
CN105427868A (en) | Method for eliminating noise of VOIP system bidirectional duplex hand-free voice | |
CN113196733B (en) | Acoustic echo cancellation using low frequency near-end speech detection | |
CN106453762A (en) | A method and system for processing voice whistlers in an audio system | |
CN215187351U (en) | Pickup device | |
CN115348507A (en) | Impulse noise suppression method, system, readable storage medium and computer equipment | |
CN100508031C (en) | Method for identifying and eliminating echo generated by speech at remote end in SCDMA handset | |
US11373669B2 (en) | Acoustic processing method and acoustic device | |
CN116072140A (en) | Howling processing method of interphone | |
CN116962934B (en) | Pickup noise reduction method and system | |
CN109862463A (en) | Earphone audio playback method, earphone and its computer readable storage medium | |
JP3310225B2 (en) | Noise level time variation calculation method and apparatus, and noise reduction method and apparatus | |
CN112637438B (en) | Entrance guard double-end intercom method and system based on single-line transmission | |
CN109068213A (en) | A kind of earphone volume control method and device | |
JP2002064617A (en) | Echo suppression method and echo suppression equipment | |
WO2020203258A1 (en) | Echo suppression device, echo suppression method, and echo suppression program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |