[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN116072140A - Howling processing method of interphone - Google Patents

Howling processing method of interphone Download PDF

Info

Publication number
CN116072140A
CN116072140A CN202310014214.3A CN202310014214A CN116072140A CN 116072140 A CN116072140 A CN 116072140A CN 202310014214 A CN202310014214 A CN 202310014214A CN 116072140 A CN116072140 A CN 116072140A
Authority
CN
China
Prior art keywords
howling
frequency
energy
interphone
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310014214.3A
Other languages
Chinese (zh)
Inventor
侯丹
施钟鸣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuxi Sicomm Communication Technology Co ltd
Original Assignee
Wuxi Sicomm Communication Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuxi Sicomm Communication Technology Co ltd filed Critical Wuxi Sicomm Communication Technology Co ltd
Priority to CN202310014214.3A priority Critical patent/CN116072140A/en
Publication of CN116072140A publication Critical patent/CN116072140A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324Details of processing therefor
    • G10L21/034Automatic adjustment
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention relates to the technical field of interphones, in particular to a howling processing method of an interphone, which comprises the steps that firstly audio is subjected to Hamming window windowing; then, a band-pass filter is used for limiting the frequency band of the signal subjected to windowing processing to fr0-fr1, and energy is calculated; then, the fr0-fr1 time domain audio is converted into frequency domain audio through FFT; finding out the frequency points of the top three energy ranks in the fr0-fr1 range and recording f0, f1 and f2; continuously tracking for a period of time, counting a histogram, and recording frequency points F0, F1 and F2 of the top three frequency points; if a certain frequency point is in a high energy range for a long time, determining that the signal is howling; the trap cuts off energy at the frequency points required to be restrained; and converting the frequency domain signal subjected to notch processing into a time domain signal through inverse fast Fourier transform and outputting the time domain signal. According to the howling characteristics of the interphone, the invention automatically identifies and automatically suppresses the howling characteristics, and is easy to realize on an interphone platform.

Description

Howling processing method of interphone
Technical Field
The invention relates to the technical field of interphones, in particular to a howling processing method of an interphone.
Background
The development of traditional interphone equipment is today, except for digital reform, the improvement of voice communication quality, especially the suppression of noise under abnormal environment, and the howling is one of them. When the communication distance of the traditional interphone is relatively short, the sound emitted by the loudspeaker of the receiving equipment is transmitted back to the microphone of the transmitting equipment to form self-excitation, so that howling is often caused. Since the conventional interphone is a single built-in microphone and there is no treatment of howling suppression, it is necessary to manually pull a physical distance or reduce a volume to solve the problem.
The existing echo cancellation system algorithm aiming at howling is extremely complex, the tone quality can be damaged to a certain extent, and a method for automatically identifying and automatically suppressing howling, which is more practical, aiming at interphones is lacking.
Disclosure of Invention
The invention provides a howling processing method of an interphone, which has the advantages of simple algorithm, easy realization and good effect.
In order to achieve the purpose of the invention, the technical scheme adopted is as follows: a method for howling processing of an intercom, the method comprising:
s1, collecting audio of an interphone, wherein the audio is subjected to Hamming window windowing;
s2, limiting the signal frequency range after windowing to fr0-fr1 through a band-pass filter, and calculating energy;
s3, converting fr0-fr1 time domain audio into frequency domain audio through FFT;
s4, finding out frequency points of the top three energy ranks in the fr0-fr1 range and recording f0, f1 and f2;
s5, continuously tracking for a period of time, counting a histogram, and recording frequency points F0, F1 and F2 of the top three frequency points;
s6, if a certain frequency point is in a high energy range for a long time, determining that the signal is howling;
s7, deleting energy at a frequency point required to be restrained by the wave trap;
s8, converting the frequency domain signal subjected to the notch processing into a time domain signal through inverse fast Fourier transform and outputting the time domain signal.
As an optimization scheme of the invention, the frequency range fr0-fr1 is 1000hz-3000hz.
As an optimization of the present invention, 50 10ms times are continuously tracked in step S5.
As an optimization scheme of the invention, in step S7, the trap cuts off the energy to 24-64dB at the frequency point required to be restrained.
As an optimization scheme of the invention, the audio collected by the interphone is the audio collected by the microphone of 10 ms.
As an optimization scheme of the invention, the outputs of the step S3, the step S5 and the step S6 are processed by the step S7 wave trap.
The invention has the positive effects that: 1) According to the howling characteristics of the interphone, the invention automatically identifies and automatically suppresses the howling characteristics, and is easy to realize on an interphone platform;
2) The invention uses the simplest digital signal processing, adopts a feedforward mode, is stable and reliable, does not damage the tone quality, and is easy to realize on each platform.
Drawings
The invention will be described in further detail with reference to the drawings and the detailed description.
Fig. 1 is a flow chart of the method of the present invention.
Detailed Description
As shown in fig. 1, the invention discloses a howling processing method of an interphone, wherein the input of the method is audio acquired by a microphone of 10ms, and the output of the method is the audio of 10ms processed by howling, so that the real-time effect is achieved. The method comprises the following steps:
a: the audio passes through the hamming window.
B: band-pass filtering, the range fr0 to fr1 hz, fr0 to fr1 hz is the frequency range where intercom howling is most likely to occur. It is generally considered to be 1000hz to 3000hz.
C: FFT (fast fourier transform) converts a time domain signal into a frequency domain signal. The 256-point FFT is described below. A 256-point FFT according to the nyquist principle then contains 128 frequency points of energy.
D: and (3) finding out three frequency points f0, f1 and f2 with the maximum energy in 128 points according to the result of C, and adding 1 to the corresponding frequency point in the frequency histogram in E.
E: the peak count histogram is accumulated (e.g., 50 times). The section outputs 128 numbers, each representing the number of times that it becomes one of the maximum three times in 50 times at this frequency point, as a criterion.
F: the trap cuts off the energy at the frequency point requiring the suppression, which can reach 24-64dB.
G: the iFFT (inverse fast fourier transform) converts the notch-processed frequency domain signal into a time domain signal output.
H: from the 128-point data given by E, a peak was found. The larger the peak data is, the longer the energy is concentrated at this frequency point, and the tendency is to determine howling. Otherwise, the normal audio is the step S6.
The invention includes two parts howling detection and howling suppression. Wherein A, B, C, D, E in fig. 1 is a howling detection section; f and G are howling suppressing parts. And will be described in detail below.
Howling detection
Howling range (1 khz-3 khz) of the intercom and howling characteristics (single frequency point or two-three frequency points switched back and forth), which are responsible for detection, are prepared for call suppression. The specific detection procedure is described below.
Howling occurrence determination flow:
assuming that the frequency band of the howling is fr0 to fr1 (unit hertz), the method collects new data of 10ms at the microphone end each time, limits the signal frequency band to fr0 to fr1 through a band-pass filter, calculates the total energy (figure 1A), and considers that the howling is likely to be continuously recognized if a certain threshold is exceeded (figure 1B); then, the fr0-fr1 time domain audio is converted into frequency domain audio through FFT (figure 1C), the frequency points of the top three energy ranks in the fr0-fr1 range are found and recorded as f0, f1 and f2; continuously tracking 50 time of 10ms, counting a histogram, if one or two frequency points are in a high energy range for a long time, judging that the frequency points are howling, and recording frequency points F0, F1 and F2 of the top three frequency points in units of hertz (D, E in fig. 1).
Howling disappearance determination flow:
because the non-howling sound energy is more dispersed, the energy is not concentrated over the spectrum for a long period of time. In order to ensure normal communication, if the frequency points of the top three ranks are scattered on the histogram, the howling is eliminated, and suppression is not needed.
Howling suppression:
the module is responsible for processing howling that has occurred, and if it is determined that howling has occurred, only the frequency points where howling has occurred are notched (F0, F1, F2 in fig. 1E), thereby preserving the quality of normal voice communication as much as possible. For three frequency points F0, F1, F2, etc. recorded in the flow 1, the energy thereof is reduced in the frequency domain, respectively. The longer the continuous time of occurrence, the greater the suppression amplitude (e.g., 24dB for a continuous 500ms energy decrease, 30dB for a continuous 1000ms energy decrease, and so on).
While the foregoing is directed to embodiments of the present invention, other and further details of the invention may be had by the present invention, it should be understood that the foregoing description is merely illustrative of the present invention and that no limitations are intended to the scope of the invention, except insofar as modifications, equivalents, improvements or modifications are within the spirit and principles of the invention.

Claims (6)

1. The method for processing the howling of the interphone is characterized by comprising the following steps of: the method comprises the following steps:
s1, collecting audio of an interphone, wherein the audio is subjected to Hamming window windowing;
s2, limiting the signal frequency range after windowing to fr0-fr1 through a band-pass filter, and calculating energy;
s3, converting fr0-fr1 time domain audio into frequency domain audio through FFT;
s4, finding out frequency points of the top three energy ranks in the fr0-fr1 range and recording f0, f1 and f2;
s5, continuously tracking for a period of time, counting a histogram, and recording frequency points F0, F1 and F2 of the top three frequency points;
s6, if a certain frequency point is in a high energy range for a long time, determining that the signal is howling;
s7, deleting energy at a frequency point required to be restrained by the wave trap;
s8, converting the frequency domain signal subjected to the notch processing into a time domain signal through inverse fast Fourier transform and outputting the time domain signal.
2. The howling processing method of an intercom as in claim 1, wherein: fr0-fr1 has a frequency in the range of 1000hz to 3000hz.
3. The howling processing method of an intercom as in claim 2, wherein: the tracking continues for 50 10ms times in step S5.
4. A howling processing method of an intercom as in claim 3, wherein: in step S7, the trap cuts off the energy to 24-64dB at the frequency point where suppression is required.
5. The howling processing method of an intercom as in claim 4, wherein: the audio collected by the interphone is the audio collected by a microphone of 10 ms.
6. The howling processing method of an intercom as in claim 5, wherein: the outputs of the step S3, the step S5 and the step S6 are processed by the step S7 wave trap.
CN202310014214.3A 2023-01-05 2023-01-05 Howling processing method of interphone Pending CN116072140A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310014214.3A CN116072140A (en) 2023-01-05 2023-01-05 Howling processing method of interphone

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310014214.3A CN116072140A (en) 2023-01-05 2023-01-05 Howling processing method of interphone

Publications (1)

Publication Number Publication Date
CN116072140A true CN116072140A (en) 2023-05-05

Family

ID=86183225

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310014214.3A Pending CN116072140A (en) 2023-01-05 2023-01-05 Howling processing method of interphone

Country Status (1)

Country Link
CN (1) CN116072140A (en)

Similar Documents

Publication Publication Date Title
CN101430882B (en) Method and apparatus for restraining wind noise
US8977545B2 (en) System and method for multi-channel noise suppression
CA2527461C (en) Reverberation estimation and suppression system
CN112004177B (en) Howling detection method, microphone volume adjustment method and storage medium
CN102160296B (en) Method and apparatus for detecting double talk
EP2352148B1 (en) Noise gate, sound collection device, and noise removal method
US20020103636A1 (en) Frequency-domain post-filtering voice-activity detector
CN110536215A (en) Method, apparatus, calculating and setting and the storage medium of Audio Signal Processing
US20130182866A1 (en) Sound processing apparatus and sound processing method
CN110956975B (en) Echo cancellation method and device
CN105427868A (en) Method for eliminating noise of VOIP system bidirectional duplex hand-free voice
CN113196733B (en) Acoustic echo cancellation using low frequency near-end speech detection
CN106453762A (en) A method and system for processing voice whistlers in an audio system
CN215187351U (en) Pickup device
CN115348507A (en) Impulse noise suppression method, system, readable storage medium and computer equipment
CN100508031C (en) Method for identifying and eliminating echo generated by speech at remote end in SCDMA handset
US11373669B2 (en) Acoustic processing method and acoustic device
CN116072140A (en) Howling processing method of interphone
CN116962934B (en) Pickup noise reduction method and system
CN109862463A (en) Earphone audio playback method, earphone and its computer readable storage medium
JP3310225B2 (en) Noise level time variation calculation method and apparatus, and noise reduction method and apparatus
CN112637438B (en) Entrance guard double-end intercom method and system based on single-line transmission
CN109068213A (en) A kind of earphone volume control method and device
JP2002064617A (en) Echo suppression method and echo suppression equipment
WO2020203258A1 (en) Echo suppression device, echo suppression method, and echo suppression program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination