CN115348507A - Impulse noise suppression method, system, readable storage medium and computer equipment - Google Patents
Impulse noise suppression method, system, readable storage medium and computer equipment Download PDFInfo
- Publication number
- CN115348507A CN115348507A CN202210946827.6A CN202210946827A CN115348507A CN 115348507 A CN115348507 A CN 115348507A CN 202210946827 A CN202210946827 A CN 202210946827A CN 115348507 A CN115348507 A CN 115348507A
- Authority
- CN
- China
- Prior art keywords
- audio
- signal
- frame
- impulse noise
- short
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 34
- 230000001629 suppression Effects 0.000 title claims abstract description 30
- 230000005236 sound signal Effects 0.000 claims abstract description 102
- 230000008030 elimination Effects 0.000 claims abstract description 10
- 238000003379 elimination reaction Methods 0.000 claims abstract description 10
- 230000006870 function Effects 0.000 claims description 23
- 238000005070 sampling Methods 0.000 claims description 19
- 238000012545 processing Methods 0.000 claims description 14
- 238000004590 computer program Methods 0.000 claims description 13
- 238000000605 extraction Methods 0.000 claims description 9
- 230000008569 process Effects 0.000 claims description 7
- 238000004364 calculation method Methods 0.000 claims description 6
- 239000000284 extract Substances 0.000 claims description 5
- 230000002401 inhibitory effect Effects 0.000 abstract 1
- 230000009467 reduction Effects 0.000 description 10
- 238000004422 calculation algorithm Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000003491 array Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 210000001260 vocal cord Anatomy 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 210000000613 ear canal Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2410/00—Microphones
- H04R2410/01—Noise reduction using microphones having different directional characteristics
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Abstract
Description
技术领域technical field
本发明涉及音频信号处理技术领域,特别涉及一种脉冲噪声抑制方法、系统、可读存储介质及计算机设备。The invention relates to the technical field of audio signal processing, in particular to an impulse noise suppression method, system, readable storage medium and computer equipment.
背景技术Background technique
随着科技的飞速发展和高技术信息化综合水平的提高,人们对于视频/音频服务的需求呈指数地增长。当人们进行视频/音频服务,例如:语音通话、视频通话、听音乐等,通常会佩戴耳机,随着耳机的使用率的增加,耳机的音频优化效果也变得越来越重要。With the rapid development of science and technology and the improvement of the comprehensive level of high-tech informationization, people's demand for video/audio services is growing exponentially. When people perform video/audio services, such as: voice calls, video calls, listening to music, etc., they usually wear earphones. As the usage rate of earphones increases, the audio optimization effect of earphones becomes more and more important.
当人们在佩戴耳机进行视频/音频服务时,若处于嘈杂的环境下,会影响视频/音频服务的使用舒适度,现有技术中,通常是通过采用环境声降噪的方式将环境中的声音音频降低,进而减小环境声对人们的入耳声的影响,以提升人们的体验感和使用舒适度。When people wear headphones for video/audio services, if they are in a noisy environment, it will affect the comfort of using video/audio services. In the prior art, the sound in the environment is usually reduced The audio frequency is reduced, thereby reducing the impact of ambient sound on people's in-ear sound, so as to improve people's experience and comfort.
然而当面临突发性噪声时,由于环境声降噪无法识别该突发性噪声,该突发性噪声会直接入耳,进而影响用户的体验感及舒适性。However, when faced with sudden noise, since the ambient noise reduction cannot recognize the sudden noise, the sudden noise will directly enter the ear, thereby affecting the user's experience and comfort.
发明内容Contents of the invention
基于此,本发明的目的是提供一种脉冲噪声抑制方法、系统、可读存储介质及计算机设备,以至少解决上述相关技术中的不足。Based on this, the object of the present invention is to provide an impulse noise suppression method, system, readable storage medium and computer equipment, so as to at least solve the above-mentioned deficiencies in the related art.
本发明提出一种脉冲噪声抑制方法,包括:The present invention proposes a method for suppressing impulse noise, including:
实时获取当前环境的音频信号,并对所述音频信号进行音频特征提取,以得到所述音频信号的多个音频特征;Acquire the audio signal of the current environment in real time, and perform audio feature extraction on the audio signal to obtain multiple audio features of the audio signal;
利用音频数据库对各所述音频特征进行特征判别,以识别出所述音频信号中的脉冲噪声信号;Using the audio database to perform feature discrimination on each of the audio features to identify the impulse noise signal in the audio signal;
判断所述脉冲噪声信号是否大于预设音频阈值;judging whether the impulse noise signal is greater than a preset audio threshold;
若所述脉冲噪声信号大于预设音频阈值,则利用限幅函数抑制所述脉冲噪声信号中的脉冲样本幅值,并对处理后的脉冲噪声信号进行动态范围压扩,以得到初步脉冲噪声信号;If the impulsive noise signal is greater than the preset audio threshold, a clipping function is used to suppress the pulse sample amplitude in the impulsive noise signal, and dynamic range companding is performed on the processed impulsive noise signal to obtain a preliminary impulsive noise signal ;
对所述初步脉冲噪声信号进行反馈消除,以抑制所述音频信号中脉冲噪声信号,得到目标信号。Feedback elimination is performed on the preliminary impulse noise signal to suppress the impulse noise signal in the audio signal to obtain a target signal.
进一步的,对所述音频信号进行音频特征提取,以得到所述音频信号的多个音频特征的步骤包括:Further, the step of performing audio feature extraction on the audio signal to obtain multiple audio features of the audio signal includes:
获取所述音频信号的音频参数,并基于所述音频信号的单位用帧以及所述音频参数提取所述音频信号中的多个音频帧特征;Acquiring audio parameters of the audio signal, and extracting multiple audio frame features in the audio signal based on the unit frame of the audio signal and the audio parameters;
将各所述音频帧特征依次通过均值、方差以及标准差处理,以得到各所述音频帧特征所对应的音频段特征。Each of the audio frame features is sequentially processed through mean value, variance and standard deviation to obtain the audio segment feature corresponding to each of the audio frame features.
进一步的,利用音频数据库对各所述音频特征进行特征判别,以识别出所述音频信号中的脉冲噪声信号的步骤包括:Further, the step of using the audio database to perform feature discrimination on each of the audio features to identify the impulse noise signal in the audio signal includes:
获取所述音频信号的采样频率以及各所述音频段特征的各帧内数据;Obtain the sampling frequency of the audio signal and the data in each frame of each audio segment feature;
根据所述采样频率计算出各所述帧内数据的短时能量和短时过零数,并根据各所述帧内数据的短时能量和短时过零数以及短时能量阈值和过零数阈值识别出各所述音频段特征中的脉冲噪声信号。Calculate the short-term energy and the number of short-time zero crossings of the data in each frame according to the sampling frequency, and calculate the short-term energy, the number of short-term zero crossings and the short-term energy threshold and the number of zero crossings of the data in each frame The number threshold identifies impulsive noise signals in each of the audio segment features.
进一步的,各所述帧内数据的短时能量的计算公式为:Further, the calculation formula of the short-term energy of the data in each frame is:
式中,En为短时能量,x(m)为帧内数据,w(n)为窗口函数,N为采样频率所对应的数据帧样点数。In the formula, En is the short-term energy, x(m) is the intra-frame data, w( n ) is the window function, and N is the number of data frame samples corresponding to the sampling frequency.
进一步的,各所述帧内数据的短时过零数的计算公式为:Further, the formula for calculating the number of short-term zero-crossings of the data in each frame is:
式中,Zn为短时过零数,x(m)为帧内数据,sgn[·]为符号函数、即w(n)为窗口函数,N为采样频率所对应的数据帧样点数。In the formula, Z n is the short-term zero-crossing number, x(m) is the intra-frame data, sgn[ ] is the sign function, namely w(n) is a window function, and N is the number of data frame samples corresponding to the sampling frequency.
进一步的,根据各所述帧内数据的短时能量和短时过零数以及短时能量阈值和过零数阈值识别出各所述音频段特征中的脉冲噪声信号的步骤包括:Further, the step of identifying the impulsive noise signal in each of the audio segment features according to the short-term energy and the number of short-time zero-crossings and the short-term energy threshold and the number of zero-crossings of the data in each said frame includes:
当所述帧内数据的短时能量大于等于短时能量阈值、且该帧内数据的短时过零数小于过零数阈值时,判定所述帧内数据为语音信号;When the short-term energy of the data in the frame is greater than or equal to the short-term energy threshold, and the number of short-term zero-crossings of the data in the frame is less than the threshold of the number of zero-crossings, it is determined that the data in the frame is a speech signal;
当所述帧内数据的短时能量小于短时能量阈值、且该帧内数据的短时过零数大于过零数阈值时,判定所述帧内数据为脉冲噪声信号。When the short-term energy of the intra-frame data is less than a short-term energy threshold and the number of short-term zero-crossings of the intra-frame data is greater than the zero-crossing threshold, it is determined that the intra-frame data is an impulse noise signal.
本发明还提出一种脉冲噪声抑制系统,包括:The present invention also proposes an impulse noise suppression system, including:
音频信号获取模块,用于实时获取当前环境的音频信号,并对所述音频信号进行音频特征提取,以得到所述音频信号的多个音频特征;The audio signal acquisition module is used to acquire the audio signal of the current environment in real time, and perform audio feature extraction on the audio signal to obtain multiple audio features of the audio signal;
特征判别模块,用于利用音频数据库对各所述音频特征进行特征判别,以识别出所述音频信号中的脉冲噪声信号;A feature discrimination module, configured to perform feature discrimination on each of the audio features using the audio database, so as to identify the impulse noise signal in the audio signal;
判断模块,用于判断所述脉冲噪声信号是否大于预设音频阈值;A judging module, configured to judge whether the impulse noise signal is greater than a preset audio threshold;
噪声信号抑制模块,用于若所述脉冲噪声信号大于预设音频阈值,则利用限幅函数抑制所述脉冲噪声信号中的脉冲样本幅值,并对处理后的脉冲噪声信号进行动态范围压扩,以得到初步脉冲噪声信号;A noise signal suppression module, configured to use a clipping function to suppress the pulse sample amplitude in the pulse noise signal if the pulse noise signal is greater than a preset audio threshold, and perform dynamic range companding on the processed pulse noise signal , to get the preliminary impulse noise signal;
反馈消除模块,用于对所述初步脉冲噪声信号进行反馈消除,以抑制所述音频信号中脉冲噪声信号,得到目标信号。The feedback elimination module is configured to perform feedback elimination on the preliminary impulse noise signal to suppress the impulse noise signal in the audio signal to obtain a target signal.
进一步的,所述音频信号获取模块包括:Further, the audio signal acquisition module includes:
音频参数获取单元,用于获取所述音频信号的音频参数,并基于所述音频信号的单位用帧以及所述音频参数提取所述音频信号中的多个音频帧特征;An audio parameter acquisition unit, configured to acquire audio parameters of the audio signal, and extract features of multiple audio frames in the audio signal based on the unit frame of the audio signal and the audio parameters;
特征处理单元,用于将各所述音频帧特征依次通过均值、方差以及标准差处理,以得到各所述音频帧特征所对应的音频段特征。The feature processing unit is configured to sequentially process each of the audio frame features through mean value, variance, and standard deviation to obtain the audio segment features corresponding to each of the audio frame features.
进一步的,所述特征判别模块包括:Further, the feature discrimination module includes:
特征获取单元,用于获取所述音频信号的采样频率以及各所述音频段特征的各帧内数据;A feature acquisition unit, configured to acquire the sampling frequency of the audio signal and each intra-frame data of each audio segment feature;
特征判别单元,用于根据所述采样频率计算出各所述帧内数据的短时能量和短时过零数,并根据各所述帧内数据的短时能量和短时过零数以及短时能量阈值和过零数阈值识别出各所述音频段特征中的脉冲噪声信号。The feature discrimination unit is used to calculate the short-term energy and the number of short-term zero crossings of the data in each frame according to the sampling frequency, and calculate the short-term energy, the number of short-term zero crossings and the number of short-term zero crossings of the data in each frame The impulsive noise signal in each of the audio segment features is identified by using the temporal energy threshold and the zero crossing threshold.
进一步的,所述特征判别单元还用于:Further, the feature discrimination unit is also used for:
当所述帧内数据的短时能量大于等于短时能量阈值、且该帧内数据的短时过零数小于过零数阈值时,判定所述帧内数据为语音信号;When the short-term energy of the data in the frame is greater than or equal to the short-term energy threshold, and the number of short-term zero-crossings of the data in the frame is less than the threshold of the number of zero-crossings, it is determined that the data in the frame is a speech signal;
当所述帧内数据的短时能量小于短时能量阈值、且该帧内数据的短时过零数大于过零数阈值时,判定所述帧内数据为脉冲噪声信号。When the short-term energy of the intra-frame data is less than a short-term energy threshold and the number of short-term zero-crossings of the intra-frame data is greater than the zero-crossing threshold, it is determined that the intra-frame data is an impulse noise signal.
本发明还提出一种可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现上述的脉冲噪声抑制方法。The present invention also proposes a readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the above-mentioned impulse noise suppression method is realized.
本发明还提出一种计算机设备,包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现上述的脉冲噪声抑制方法。The present invention also proposes a computer device, comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, and the above-mentioned impulse noise suppression is realized when the processor executes the computer program method.
与现有技术相比,本发明的有益效果是:通过实时获取环境的音频信号,并对音频信号进行音频特征提取,并对提取到的音频特征进行特征判别,以识别出音频信号中的脉冲噪声信号,通过特征判别的方式识别出脉冲噪声信号,以便于对其进行抑制处理;具体的,当脉冲噪声信号大于预设阈值,则利用限幅函数抑制脉冲噪声信号中的脉冲样本幅值、并进行动态范围压扩,以提升脉冲降噪性能,进一步的,对脉冲噪声信号进行反馈消除,进而抑制音频信号中的脉冲噪声信号,以避免突发性噪声进入到人耳中,影响舒适度。Compared with the prior art, the beneficial effect of the present invention is: by acquiring the audio signal of the environment in real time, performing audio feature extraction on the audio signal, and performing feature discrimination on the extracted audio feature, to identify the pulse in the audio signal For the noise signal, the impulsive noise signal is identified by means of feature discrimination, so as to suppress it; specifically, when the impulsive noise signal is greater than the preset threshold, the limiting function is used to suppress the pulse sample amplitude, And dynamic range companding is carried out to improve the performance of pulse noise reduction. Further, the pulse noise signal is eliminated by feedback, thereby suppressing the pulse noise signal in the audio signal, so as to prevent sudden noise from entering the human ear and affecting comfort. .
附图说明Description of drawings
图1为本发明第一实施例中脉冲噪声抑制方法的流程图;Fig. 1 is the flowchart of the impulse noise suppressing method in the first embodiment of the present invention;
图2为图1中步骤S101的详细流程图;Fig. 2 is the detailed flowchart of step S101 in Fig. 1;
图3为本发明第一实施例中音频段特征的示意图;FIG. 3 is a schematic diagram of audio segment features in the first embodiment of the present invention;
图4为图1中步骤S102的详细流程图;Fig. 4 is the detailed flowchart of step S102 in Fig. 1;
图5为本发明第一实施例中脉冲噪声抑制前后的对比曲线图;Fig. 5 is a comparison graph before and after impulse noise suppression in the first embodiment of the present invention;
图6为本发明第二实施例中脉冲噪声抑制系统的结构框图;FIG. 6 is a structural block diagram of the impulse noise suppression system in the second embodiment of the present invention;
图7为本发明第三实施例中计算机设备的结构框图。Fig. 7 is a structural block diagram of a computer device in a third embodiment of the present invention.
主要元件符号说明:Description of main component symbols:
如下具体实施方式将结合上述附图进一步说明本发明。The following specific embodiments will further illustrate the present invention in conjunction with the above-mentioned drawings.
具体实施方式Detailed ways
为了便于理解本发明,下面将参照相关附图对本发明进行更全面的描述。附图中给出了本发明的若干实施例。但是,本发明可以以许多不同的形式来实现,并不限于本文所描述的实施例。相反地,提供这些实施例的目的是使对本发明的公开内容更加透彻全面。In order to facilitate the understanding of the present invention, the present invention will be described more fully below with reference to the associated drawings. Several embodiments of the invention are shown in the drawings. However, the present invention can be embodied in many different forms and is not limited to the embodiments described herein. Rather, these embodiments are provided so that the disclosure of the present invention will be thorough and complete.
需要说明的是,当元件被称为“固设于”另一个元件,它可以直接在另一个元件上或者也可以存在居中的元件。当一个元件被认为是“连接”另一个元件,它可以是直接连接到另一个元件或者可能同时存在居中元件。本文所使用的术语“垂直的”、“水平的”、“左”、“右”以及类似的表述只是为了说明的目的。It should be noted that when an element is referred to as being “fixed on” another element, it may be directly on the other element or there may be an intervening element. When an element is referred to as being "connected to" another element, it can be directly connected to the other element or intervening elements may also be present. The terms "vertical," "horizontal," "left," "right," and similar expressions are used herein for purposes of illustration only.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本发明。本文所使用的术语“及/或”包括一个或多个相关的所列项目的任意的和所有的组合。、实施例一Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field of the invention. The terms used herein in the description of the present invention are for the purpose of describing specific embodiments only, and are not intended to limit the present invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items. , embodiment one
请参阅图1,所示为本发明第一实施例中的脉冲噪声抑制方法,所述方法具体包括步骤S101至S105:Please refer to FIG. 1, which shows the impulse noise suppression method in the first embodiment of the present invention, the method specifically includes steps S101 to S105:
S101,实时获取当前环境的音频信号,并对所述音频信号进行音频特征提取,以得到所述音频信号的多个音频特征;S101, acquire an audio signal of the current environment in real time, and perform audio feature extraction on the audio signal, so as to obtain multiple audio features of the audio signal;
进一步的,请参阅图2,所述步骤S101具体包括步骤S1011~S1012:Further, please refer to FIG. 2, the step S101 specifically includes steps S1011-S1012:
S1011,获取所述音频信号的音频参数,并基于所述音频信号的单位用帧以及所述音频参数提取所述音频信号中的多个音频帧特征;S1011. Acquire audio parameters of the audio signal, and extract features of multiple audio frames in the audio signal based on a unit frame of the audio signal and the audio parameters;
S1012,将各所述音频帧特征依次通过均值、方差以及标准差处理,以得到各所述音频帧特征所对应的音频段特征。S1012, sequentially process each of the audio frame features through mean value, variance, and standard deviation to obtain audio segment features corresponding to each of the audio frame features.
在具体实施时,对于不同的音频信号的识别过程中,由于音频信号具有短时特性,利用音频信号的单位用帧,该单位用帧通常为10~40ms(在本实施例中为20ms),获取该音频信号的音频参数,并利用上述的单位用帧以及音频参数提取上述音频信号中的多个音频帧特征,所述音频帧特征包括时域特征、频域特征以及声学感知特征:During specific implementation, in the identification process for different audio signals, because the audio signal has short-term characteristics, the unit frame of the audio signal is utilized, and the unit frame is usually 10~40ms (20ms in the present embodiment), Obtain the audio parameters of the audio signal, and use the above-mentioned unit frame and audio parameters to extract multiple audio frame features in the above-mentioned audio signal, the audio frame features include time-domain features, frequency-domain features and acoustic perception features:
时域特征是利用了音频信号在时域上面的信息,可以理解成横轴为时间,纵轴为音频信号。通过过零数、短时能量、音量、自相关系数等来描述音频信号在时间上的信息。The time domain feature utilizes the information of the audio signal in the time domain, which can be understood as the time on the horizontal axis and the audio signal on the vertical axis. The time information of the audio signal is described by the number of zero crossings, short-term energy, volume, autocorrelation coefficient, etc.
过零数是指:音频信号由正到负、由负到正过程中,通过零的次数。例如:浊音具有较低的过零数,清音具有较高的过零数(发音时声带振动的是浊音,声带不振动的是清音)。The number of zero crossings refers to the number of times the audio signal passes through zero in the process of going from positive to negative and from negative to positive. For example: voiced sounds have a lower zero-crossing number, and unvoiced sounds have a higher zero-crossing number (voiced sounds are voiced when the vocal cords vibrate during pronunciation, and voiceless sounds are not vibrated when the vocal cords are not vibrated).
短时能量则是:通过能量的计算方法用于监测浊音和清音的转换时刻。在本实施例中,短时能量大的地方过零数低,短时能量小的地方过零数高。The short-term energy is: it is used to monitor the conversion moment of voiced and unvoiced sounds through the calculation method of energy. In this embodiment, the number of zero crossings is low where the short-term energy is large, and the number of zero crossings is high where the short-term energy is small.
频域特征是把时域波形信号转换到频谱,然后进行计算。The frequency domain feature is to convert the time domain waveform signal into a frequency spectrum, and then perform calculations.
进一步的,将上述各音频帧特征依次通过均值、方差以及标准差处理,进而得到各音频帧特征所对应的音频段特征(详细请参阅图3)。Further, the above audio frame features are sequentially processed through the mean value, variance and standard deviation, and then the audio segment features corresponding to each audio frame feature are obtained (see FIG. 3 for details).
S102,利用音频数据库对各所述音频特征进行特征判别,以识别出所述音频信号中的脉冲噪声信号;S102, using the audio database to perform feature discrimination on each of the audio features, so as to identify the impulse noise signal in the audio signal;
进一步的,请参阅图4,所述步骤S102具体包括步骤S1021~S1022:Further, please refer to FIG. 4, the step S102 specifically includes steps S1021-S1022:
S1021,获取所述音频信号的采样频率以及各所述音频段特征的各帧内数据;S1021. Obtain the sampling frequency of the audio signal and the intra-frame data of each audio segment feature;
S1022,根据所述采样频率计算出各所述帧内数据的短时能量和短时过零数,并根据各所述帧内数据的短时能量和短时过零数以及短时能量阈值和过零数阈值识别出各所述音频段特征中的脉冲噪声信号。S1022. Calculate the short-term energy and the number of short-term zero crossings of the data in each frame according to the sampling frequency, and calculate the short-term energy, the number of short-time zero crossings and the short-term energy threshold and A zero-crossing threshold identifies impulsive noise signals in each of said audio segment features.
在具体实施时,由于音频信号是非稳态信号,具有短时平稳特性,绝大多数噪声是稳态的,如白噪声/机器噪声,并且音频信号中能量绝大部分包含在低频带内,而噪音信号通常能量较小且含有较高频段的信息;In actual implementation, since the audio signal is a non-stationary signal with short-term stationary characteristics, most of the noise is steady-state, such as white noise/machine noise, and most of the energy in the audio signal is contained in the low-frequency band, while Noise signals usually have less energy and contain information in higher frequency bands;
因此,在本实施例中,通过获取上述音频信号的采样频率以及各音频段特征的各帧内数据,其中,在本实施例中,采样频率为8KHz,意味着每一帧包含有160个样点数,根据该采样频率以及对应的样点数计算出各帧内数据的短时能量即为该音频信号的短时能量,对帧内样点数平移1,再计算出对应两相邻样点的乘积,其中符号为负的点即为此处过零,将帧内所有负数乘积数目计算出该帧的短时过零数,各所述帧内数据的短时能量的计算公式为:Therefore, in this embodiment, by obtaining the sampling frequency of the above-mentioned audio signal and the data in each frame of each audio segment feature, wherein, in this embodiment, the sampling frequency is 8KHz, which means that each frame contains 160 samples The number of points, according to the sampling frequency and the corresponding number of samples, calculate the short-term energy of the data in each frame, which is the short-term energy of the audio signal, shift the number of samples in the frame by 1, and then calculate the product corresponding to two adjacent samples , where the point with a negative sign is the zero-crossing here, and the number of short-term zero-crossings of the frame is calculated by multiplying all the negative numbers in the frame. The calculation formula of the short-term energy of the data in each frame is:
式中,En为短时能量,x(m)为帧内数据,w(n)为窗口函数,N为采样频率所对应的数据帧样点数。In the formula, En is the short-term energy, x(m) is the intra-frame data, w( n ) is the window function, and N is the number of data frame samples corresponding to the sampling frequency.
各所述帧内数据的短时过零数的计算公式为:The calculation formula of the short-term zero-crossing number of the data in each said frame is:
式中,Zn为短时过零数,x(m)为帧内数据,sgn[·]为符号函数、即w(n)为窗口函数,N为采样频率所对应的数据帧样点数。In the formula, Z n is the short-term zero-crossing number, x(m) is the intra-frame data, sgn[ ] is the sign function, namely w(n) is a window function, and N is the number of data frame samples corresponding to the sampling frequency.
利用音频数据库得到脉冲噪声信号的短时能量阈值和过零数阈值,并利用脉冲噪声信号的短时能量阈值和过零数阈值以及上述得到的各所述帧内数据的短时能量和短时过零数来识别出各音频段特征中的脉冲噪声信号;Use the audio database to obtain the short-term energy threshold and the zero-crossing threshold of the impulse noise signal, and use the short-term energy threshold and the zero-crossing threshold of the impulse noise signal and the short-term energy and short-time of the data in each frame obtained above The number of zero crossings is used to identify the impulsive noise signal in the characteristics of each audio segment;
具体的,当所述帧内数据的短时能量大于等于短时能量阈值、且该帧内数据的短时过零数小于过零数阈值时,判定所述帧内数据为语音信号;Specifically, when the short-term energy of the data in the frame is greater than or equal to the short-term energy threshold, and the number of short-term zero-crossings of the data in the frame is less than the threshold of the number of zero-crossings, it is determined that the data in the frame is a speech signal;
即:当En≥STE阈值(短时能量阈值),且Zn<ZCC阈值(过零数阈值)时判定该帧内数据为语音信号;That is: when E n ≥ STE threshold (short-term energy threshold), and Z n < ZCC threshold (zero-crossing threshold), it is determined that the data in the frame is a speech signal;
当所述帧内数据的短时能量小于短时能量阈值、且该帧内数据的短时过零数大于过零数阈值时,判定所述帧内数据为脉冲噪声信号;When the short-term energy of the data in the frame is less than the short-term energy threshold and the number of short-term zero-crossings of the data in the frame is greater than the threshold of the number of zero-crossings, it is determined that the data in the frame is an impulse noise signal;
即:当En<STE阈值(短时能量阈值),且Zn>ZCC阈值(过零数阈值)时判定该帧内数据为脉冲噪声信号。That is: when E n <STE threshold (short-term energy threshold), and Z n >ZCC threshold (zero-crossing threshold), it is determined that the data in the frame is an impulse noise signal.
S103,判断所述脉冲噪声信号是否大于预设音频阈值;S103, judging whether the impulse noise signal is greater than a preset audio threshold;
在具体实施时,判断上述的脉冲噪声信号的分贝是否大于预设音频阈值(在本实施例中,该预设音频阈值为85dB),当脉冲噪声信号的分贝大于预设音频阈值,则意味着该脉冲噪声信号影响人耳的听觉感受,因此,需要对该脉冲噪声信号进行对应的抑制处理;During specific implementation, it is judged whether the decibel of the above-mentioned impulse noise signal is greater than the preset audio threshold (in this embodiment, the preset audio threshold is 85dB), when the decibel of the impulse noise signal is greater than the preset audio threshold, it means The impulsive noise signal affects the auditory experience of the human ear, therefore, it is necessary to perform corresponding suppression processing on the impulsive noise signal;
当脉冲噪声信号的分贝不大于预设音频阈值,则意味着该脉冲噪声信号属于耳道内声压的承受范围内,针对该脉冲噪声信号的处理,仅需要将除该脉冲噪声信号外的正常语音信号进行增强即可。When the decibel of the impulse noise signal is not greater than the preset audio threshold, it means that the impulse noise signal belongs to the tolerance range of the sound pressure in the ear canal. For the processing of the impulse noise signal, only the normal speech except the impulse noise signal needs to be The signal can be enhanced.
需要说明的是,在一些可选实施例中,由于在佩戴耳机时,通过耳机的耳塞会对外部噪声进行一定的降噪功能,通常该物理降噪能够达到15~20dB的降噪效果,因此,当该耳机具备物理降噪功能时,该预设音频阈值为105dB。It should be noted that, in some optional embodiments, when the earphone is worn, the earplug of the earphone will perform a certain noise reduction function on external noise, usually the physical noise reduction can achieve a noise reduction effect of 15-20dB, so , when the headset has a physical noise reduction function, the preset audio threshold is 105dB.
S104,若所述脉冲噪声信号大于预设音频阈值,则利用限幅函数抑制所述脉冲噪声信号中的脉冲样本幅值,并对处理后的脉冲噪声信号进行动态范围压扩,以得到初步脉冲噪声信号;S104. If the impulse noise signal is greater than the preset audio threshold, use a clipping function to suppress the pulse sample amplitude in the impulse noise signal, and perform dynamic range companding on the processed impulse noise signal to obtain a preliminary impulse noise signal;
在具体实施时,若所述脉冲噪声信号大于预设音频阈值,则利用限幅函数抑制所述脉冲噪声信号中的脉冲样本幅值,并对处理后的脉冲噪声信号进行动态范围压扩,以得到初步脉冲噪声信号;In specific implementation, if the impulse noise signal is greater than the preset audio threshold, then use the clipping function to suppress the pulse sample amplitude in the impulse noise signal, and perform dynamic range companding on the processed impulse noise signal, so as to Obtain a preliminary impulse noise signal;
具体的,输入音频信号,并设置窗函数类型和窗长、带噪信号STFT、利用时频谱图,设置首次筛选阈值、选出可能脉冲噪声的窗口,设置相关性检测系数、再次筛选、由公式返回噪声时域位置,通过信号的修复和重建,进而将处理后的音频信号输出,其中,修复算法步骤包含设置门限带通限制器、带噪信号通过限制器、设置改进中值滤波参数、阈值过滤信号重建、得到重建后的信号。Specifically, input the audio signal, and set the window function type and window length, the STFT of the noisy signal, and use the time spectrogram, set the threshold for the first screening, select the window for possible impulse noise, set the correlation detection coefficient, and filter again, according to the formula Return the position of the noise in the time domain, and then output the processed audio signal through signal repair and reconstruction. The repair algorithm steps include setting the threshold band-pass limiter, the noisy signal through the limiter, setting the improved median filter parameters, threshold The filtered signal is reconstructed to obtain the reconstructed signal.
在得到上述的处理后的音频信号后,对音频信号的动态范围进行压缩或限制,把信号的最大电平和最小电平之间的相对变化范围加以减小,从而达到减小失真、降低噪声的效果。After the above-mentioned processed audio signal is obtained, the dynamic range of the audio signal is compressed or limited, and the relative variation range between the maximum level and the minimum level of the signal is reduced, so as to reduce distortion and noise. Effect.
本实施例结合基于递归平均的噪声估计算法和动态范围压扩算法,提出了一种动态调整噪声拟制临界点的动态范围扩算法。通过对噪声水平的动态评估,实时调整噪声拟制阈值,在进行拾音增强的同时进一步针对瞬间产生的高噪音进行降噪处理,避免了动态范围压扩算法在放大语音的同时,噪声也被放大的问题。In this embodiment, a noise estimation algorithm based on recursive averaging and a dynamic range companding algorithm are combined to propose a dynamic range expansion algorithm that dynamically adjusts the critical point of noise simulation. Through the dynamic evaluation of the noise level, the noise imitation threshold is adjusted in real time, and the high noise generated at the moment is further processed for noise reduction while the sound pickup is enhanced, so as to avoid the noise being amplified by the dynamic range companding algorithm while amplifying the voice. Amplify the problem.
S105,对所述初步脉冲噪声信号进行反馈消除,以抑制所述音频信号中脉冲噪声信号,得到目标信号。S105. Perform feedback elimination on the preliminary impulse noise signal to suppress the impulse noise signal in the audio signal to obtain a target signal.
在具体实施时,对上述的初步脉冲噪声信号反馈消除脉冲噪声信号的动作指令,将该初步脉冲噪声信号中超大信号进行闭锁反馈处理,进而得到抑制该脉冲噪声信号或者消除该脉冲噪声信号。During specific implementation, the action instruction for eliminating the impulse noise signal is fed back to the above-mentioned preliminary impulse noise signal, and the super large signal in the preliminary impulse noise signal is subjected to blocking feedback processing, and then the impulse noise signal is suppressed or the impulse noise signal is eliminated.
具体的,在电路上会控制增益输出,限制初步脉冲噪声信号在声音信号内。对瞬间产生的高噪音进行降噪处理,控制增益输出,当噪声信号超过设定的门限值时,自动切除语音增强电路,阻断受话器与功放电路的连接,并延时至冲击噪声结束后200mS以上。同时对送话器信号进行处理,对突发的脉冲信号进行限幅平滑抑制处理,避免将大的噪声信号传送给接收端。Specifically, the gain output is controlled on the circuit to limit the preliminary impulse noise signal within the sound signal. Perform noise reduction processing for instantaneous high noise, control gain output, when the noise signal exceeds the set threshold value, automatically cut off the voice enhancement circuit, block the connection between the receiver and the power amplifier circuit, and delay until the end of the impact noise More than 200mS. At the same time, the microphone signal is processed, and the burst pulse signal is limited and smoothed to suppress the processing, so as to avoid transmitting large noise signals to the receiving end.
请参阅图5,所示为经过脉冲噪声抑制后的测试曲线,其中,L1表示空场(噪声环境等效为0dB基准值)0dB的音频信号曲线,L2表示降噪曲线(即佩戴耳机时采集到的音频信号曲线),L3表示拾音增强曲线,L4表示脉冲噪声抑制后的音频信号曲线。Please refer to Figure 5, which shows the test curve after impulse noise suppression, where L1 represents the audio signal curve of 0dB in the empty field (the noise environment is equivalent to the 0dB reference value), and L2 represents the noise reduction curve (that is, when wearing headphones The audio signal curve obtained), L3 represents the pickup enhancement curve, and L4 represents the audio signal curve after impulse noise suppression.
综上,本发明上述实施例当中的脉冲噪声抑制方法,通过实时获取环境的音频信号,并对音频信号进行音频特征提取,并对提取到的音频特征进行特征判别,以识别出音频信号中的脉冲噪声信号,通过特征判别的方式识别出脉冲噪声信号,以便于对其进行抑制处理;具体的,当脉冲噪声信号大于预设阈值,则利用限幅函数抑制脉冲噪声信号中的脉冲样本幅值、并进行动态范围压扩,以提升脉冲降噪性能,进一步的,对脉冲噪声信号进行反馈消除,进而抑制音频信号中的脉冲噪声信号,以避免突发性噪声进入到人耳中,影响舒适度。In summary, the impulsive noise suppression method in the above-mentioned embodiments of the present invention obtains the audio signal of the environment in real time, extracts audio features from the audio signal, and performs feature discrimination on the extracted audio features to identify the audio signal in the audio signal. For impulse noise signals, the impulse noise signals are identified by means of feature discrimination, so as to suppress them; specifically, when the impulse noise signals are greater than the preset threshold, the limiting function is used to suppress the pulse sample amplitude in the impulse noise signals , and perform dynamic range companding to improve the performance of pulse noise reduction. Further, the pulse noise signal is eliminated by feedback, thereby suppressing the pulse noise signal in the audio signal, so as to avoid sudden noise entering the human ear and affecting comfort Spend.
实施例二Embodiment two
本发明另一方面还提出一种脉冲噪声抑制系统,请查阅图6,所示为本发明第二实施例中的脉冲噪声抑制系统,所述系统包括:Another aspect of the present invention also proposes an impulse noise suppression system, please refer to Figure 6, which shows the impulse noise suppression system in the second embodiment of the present invention, the system includes:
音频信号获取模块11,用于实时获取当前环境的音频信号,并对所述音频信号进行音频特征提取,以得到所述音频信号的多个音频特征;The audio
进一步的,所述音频信号获取模块11包括:Further, the audio
音频参数获取单元,用于获取所述音频信号的音频参数,并基于所述音频信号的单位用帧以及所述音频参数提取所述音频信号中的多个音频帧特征;An audio parameter acquisition unit, configured to acquire audio parameters of the audio signal, and extract features of multiple audio frames in the audio signal based on the unit frame of the audio signal and the audio parameters;
特征处理单元,用于将各所述音频帧特征依次通过均值、方差以及标准差处理,以得到各所述音频帧特征所对应的音频段特征。The feature processing unit is configured to sequentially process each of the audio frame features through mean value, variance, and standard deviation to obtain the audio segment features corresponding to each of the audio frame features.
特征判别模块12,用于利用音频数据库对各所述音频特征进行特征判别,以识别出所述音频信号中的脉冲噪声信号;The
进一步的,所述特征判别模块12包括:Further, the
特征获取单元,用于获取所述音频信号的采样频率以及各所述音频段特征的各帧内数据;A feature acquisition unit, configured to acquire the sampling frequency of the audio signal and each intra-frame data of each audio segment feature;
特征判别单元,用于根据所述采样频率计算出各所述帧内数据的短时能量和短时过零数,并根据各所述帧内数据的短时能量和短时过零数以及短时能量阈值和过零数阈值识别出各所述音频段特征中的脉冲噪声信号。The feature discrimination unit is used to calculate the short-term energy and the number of short-term zero crossings of the data in each frame according to the sampling frequency, and calculate the short-term energy, the number of short-term zero crossings and the number of short-term zero crossings of the data in each frame The impulsive noise signal in each of the audio segment features is identified by using the temporal energy threshold and the zero crossing threshold.
在一些可选实施例中,所述特征判别单元还用于:In some optional embodiments, the feature discrimination unit is also used for:
当所述帧内数据的短时能量大于等于短时能量阈值、且该帧内数据的短时过零数小于过零数阈值时,判定所述帧内数据为语音信号;When the short-term energy of the data in the frame is greater than or equal to the short-term energy threshold, and the number of short-term zero-crossings of the data in the frame is less than the threshold of the number of zero-crossings, it is determined that the data in the frame is a speech signal;
当所述帧内数据的短时能量小于短时能量阈值、且该帧内数据的短时过零数大于过零数阈值时,判定所述帧内数据为脉冲噪声信号。When the short-term energy of the intra-frame data is less than a short-term energy threshold and the number of short-term zero-crossings of the intra-frame data is greater than the zero-crossing threshold, it is determined that the intra-frame data is an impulse noise signal.
判断模块13,用于判断所述脉冲噪声信号是否大于预设音频阈值;A judging
噪声信号抑制模块14,用于若所述脉冲噪声信号大于预设音频阈值,则利用限幅函数抑制所述脉冲噪声信号中的脉冲样本幅值,并对处理后的脉冲噪声信号进行动态范围压扩,以得到初步脉冲噪声信号;The noise
反馈消除模块15,用于对所述初步脉冲噪声信号进行反馈消除,以抑制所述音频信号中脉冲噪声信号,得到目标信号。The
上述各模块、单元被执行时所实现的功能或操作步骤与上述方法实施例大体相同,在此不再赘述。The functions or operation steps realized by the above-mentioned modules and units when executed are substantially the same as those of the above-mentioned method embodiments, and will not be repeated here.
本发明实施例所提供的脉冲噪声抑制系统,其实现原理及产生的技术效果和前述方法实施例相同,为简要描述,装置实施例部分未提及之处,可参考前述方法实施例中相应内容。The impulsive noise suppression system provided by the embodiment of the present invention has the same realization principle and technical effect as the foregoing method embodiment. For the sake of brief description, for the parts not mentioned in the device embodiment, please refer to the corresponding content in the foregoing method embodiment. .
实施例三Embodiment Three
本发明还提出一种计算机设备,请参阅图7,所示为本发明第三实施例中的计算机设备,包括存储器10、处理器20以及存储在所述存储器10上并可在所述处理器20上运行的计算机程序30,所述处理器20执行所述计算机程序30时实现上述的脉冲噪声抑制方法。The present invention also proposes a computer device, please refer to FIG. 7, which shows a computer device in the third embodiment of the present invention, including a
其中,存储器10至少包括一种类型的可读存储介质,所述可读存储介质包括闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、磁性存储器、磁盘、光盘等。存储器10在一些实施例中可以是计算机设备的内部存储单元,例如该计算机设备的硬盘。存储器10在另一些实施例中也可以是外部存储装置,例如插接式硬盘,智能存储卡(Smart MediaCard,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,存储器10还可以既包括计算机设备的内部存储单元也包括外部存储装置。存储器10不仅可以用于存储安装于计算机设备的应用软件及各类数据,还可以用于暂时地存储已经输出或者将要输出的数据。Wherein, the
其中,处理器20在一些实施例中可以是电子控制单元(Electronic ControlUnit,简称ECU,又称行车电脑)、中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器或其他数据处理芯片,用于运行存储器10中存储的程序代码或处理数据,例如执行访问限制程序等。Wherein, the
需要指出的是,图7示出的结构并不构成对计算机设备的限定,在其它实施例当中,该计算机设备可以包括比图示更少或者更多的部件,或者组合某些部件,或者不同的部件布置。It should be noted that the structure shown in FIG. 7 does not constitute a limitation to the computer device. In other embodiments, the computer device may include fewer or more components than those shown in the illustration, or combine certain components, or be different layout of the components.
本发明实施例还提出一种可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时实现如上述的脉冲噪声抑制方法。The embodiment of the present invention also provides a readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the above-mentioned impulse noise suppression method is realized.
本领域技术人员可以理解,在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,“计算机可读介质”可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。Those skilled in the art will understand that the logic and/or steps shown in the flowchart or otherwise described herein, for example, can be considered as a sequenced list of executable instructions for implementing logical functions, which can be specifically implemented in on any computer-readable medium for use by an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or other system that can fetch instructions from an instruction execution system, apparatus, or device and execute them), or Used in conjunction with these instruction execution systems, devices or equipment. For the purposes of this specification, a "computer-readable medium" may be any device that can contain, store, communicate, propagate or transmit a program for use in or in conjunction with an instruction execution system, device or device.
计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或多个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。More specific examples (non-exhaustive list) of computer-readable media include the following: electrical connection with one or more wires (electronic device), portable computer disk case (magnetic device), random access memory (RAM), Read Only Memory (ROM), Erasable and Editable Read Only Memory (EPROM or Flash Memory), Fiber Optic Devices, and Portable Compact Disc Read Only Memory (CDROM). In addition, the computer-readable medium may even be paper or other suitable medium on which the program can be printed, since the program can be read, for example, by optically scanning the paper or other medium, followed by editing, interpretation or other suitable processing if necessary. processing to obtain the program electronically and store it in computer memory.
应当理解,本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或它们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that various parts of the present invention can be realized by hardware, software, firmware or their combination. In the embodiments described above, various steps or methods may be implemented by software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or combination of the following techniques known in the art: Discrete logic circuits, ASICs with suitable combinational logic gates, Programmable Gate Arrays (PGAs), Field Programmable Gate Arrays (FPGAs), etc.
以上所述实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The various technical features of the above-mentioned embodiments can be combined arbitrarily. To make the description concise, all possible combinations of the various technical features in the above-mentioned embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, should be considered as within the scope of this specification.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only represent several implementation modes of the present application, and the description thereof is relatively specific and detailed, but it should not be construed as limiting the scope of the patent for the invention. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present application, and these all belong to the protection scope of the present application. Therefore, the scope of protection of the patent application should be based on the appended claims.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210946827.6A CN115348507A (en) | 2022-08-09 | 2022-08-09 | Impulse noise suppression method, system, readable storage medium and computer equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210946827.6A CN115348507A (en) | 2022-08-09 | 2022-08-09 | Impulse noise suppression method, system, readable storage medium and computer equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115348507A true CN115348507A (en) | 2022-11-15 |
Family
ID=83950815
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210946827.6A Pending CN115348507A (en) | 2022-08-09 | 2022-08-09 | Impulse noise suppression method, system, readable storage medium and computer equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115348507A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116614742A (en) * | 2023-07-20 | 2023-08-18 | 江西红声技术有限公司 | Clear voice transmitting and receiving noise reduction earphone |
CN116758934A (en) * | 2023-08-18 | 2023-09-15 | 深圳市微克科技有限公司 | Method, system and medium for realizing intercom function of intelligent wearable device |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104658544A (en) * | 2013-11-20 | 2015-05-27 | 大连佑嘉软件科技有限公司 | Method for inhibiting transient noise in voice |
CN106157967A (en) * | 2015-04-28 | 2016-11-23 | 杜比实验室特许公司 | Impulse noise mitigation |
CN107305774A (en) * | 2016-04-22 | 2017-10-31 | 腾讯科技(深圳)有限公司 | Speech detection method and device |
CN110503973A (en) * | 2019-08-28 | 2019-11-26 | 浙江大华技术股份有限公司 | Audio signal Transient Noise suppressing method, system and storage medium |
-
2022
- 2022-08-09 CN CN202210946827.6A patent/CN115348507A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104658544A (en) * | 2013-11-20 | 2015-05-27 | 大连佑嘉软件科技有限公司 | Method for inhibiting transient noise in voice |
CN106157967A (en) * | 2015-04-28 | 2016-11-23 | 杜比实验室特许公司 | Impulse noise mitigation |
CN107305774A (en) * | 2016-04-22 | 2017-10-31 | 腾讯科技(深圳)有限公司 | Speech detection method and device |
CN110503973A (en) * | 2019-08-28 | 2019-11-26 | 浙江大华技术股份有限公司 | Audio signal Transient Noise suppressing method, system and storage medium |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116614742A (en) * | 2023-07-20 | 2023-08-18 | 江西红声技术有限公司 | Clear voice transmitting and receiving noise reduction earphone |
CN116758934A (en) * | 2023-08-18 | 2023-09-15 | 深圳市微克科技有限公司 | Method, system and medium for realizing intercom function of intelligent wearable device |
CN116758934B (en) * | 2023-08-18 | 2023-11-07 | 深圳市微克科技有限公司 | A method, system and medium for realizing the intercom function of smart wearable devices |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9812147B2 (en) | System and method for generating an audio signal representing the speech of a user | |
US9196258B2 (en) | Spectral shaping for speech intelligibility enhancement | |
CN102016995B (en) | An apparatus for processing an audio signal and method thereof | |
CN103874002A (en) | Audio processing device comprising reduced artifacts | |
TW200910793A (en) | System and method for adaptive intelligent noise suppression | |
CN112086093A (en) | Automatic speech recognition system for countering audio attack based on perception | |
CN115348507A (en) | Impulse noise suppression method, system, readable storage medium and computer equipment | |
CN112004177A (en) | Howling detection method, microphone volume adjustment method and medium | |
CN117321681A (en) | Speech optimization in noisy environments | |
Kim et al. | Nonlinear enhancement of onset for robust speech recognition. | |
CN112312258B (en) | Intelligent earphone with hearing protection and hearing compensation | |
TWI451405B (en) | Hearing aid and method of enhancing speech output in real time | |
JP2000081900A (en) | Sound absorbing method, and device and program recording medium therefor | |
JP4922427B2 (en) | Signal correction device | |
CN116312606A (en) | High-frequency noise suppression method and device, terminal equipment and storage medium | |
CN114023352B (en) | Voice enhancement method and device based on energy spectrum depth modulation | |
CN112153533B (en) | Method and device for eliminating sound breaking of audio signal, storage medium and terminal | |
CN105812995B (en) | A kind of noise-filtering method and apparatus | |
CN117912485A (en) | Speech band extension method, noise reduction audio device, and storage medium | |
CN115914971A (en) | Wind noise detection method and device, earphone and storage medium | |
JP2024526679A (en) | Data Augmentation for Speech Improvement | |
EP2063420A1 (en) | Method and assembly to enhance the intelligibility of speech | |
CN114143667A (en) | Volume adjusting method, storage medium and electronic device | |
CN115240700B (en) | Acoustic device and sound processing method thereof | |
Cordourier Maruri et al. | V-speech: Noise-robust speech capturing glasses using vibration sensors |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |