CN106157978A

CN106157978A - Speech signal processing device and audio signal processing method

Info

Publication number: CN106157978A
Application number: CN201510177492.6A
Authority: CN
Inventors: 杜博仁; 张嘉仁; 曾凯盟
Original assignee: Acer Inc
Current assignee: Acer Inc
Priority date: 2015-04-15
Filing date: 2015-04-15
Publication date: 2016-11-23
Anticipated expiration: 2035-04-15
Also published as: CN106157978B

Abstract

The present invention provides a kind of speech signal processing device and audio signal processing method.The present invention calculates the value of the interpolated parameter function corresponding with sampled signal window according to three sampled values of continuous print in sampled signal window, and calculates in down-scaled signals window the interpolated value between adjacent two sampled points according to the value of interpolated parameter function.There is the situation of distorted signals in voice signal after the present invention can be effectively prevented from frequency reducing.

Description

Speech signal processing device and audio signal processing method

Technical field

The present invention relates to a kind of signal processing apparatus, particularly relate to a kind of speech signal processing device and voice Signal processing method.

Background technology

For Hearing Impaired, it often cannot clearly receive the voice signal of higher-frequency, example Such as consonant signal, but the signal for low frequency can clearly be heard.But, signal is being dropped After Pin, owing to time span is elongated, the signal value between two sampled signals of continuous print need to utilize interpolation Mode is tried to achieve, if such as by acoustical signal when high-frequency signal reduces to the low frequency signal of only half frequency, Time span will become twice originally, the new signal between sampled signal and sampled signal, it is necessary to by interior The mode inserted is tried to achieve.Owing to the Property comparison of acoustical signal is close to string ripple, if with the side of general arithmetic mean Formula asks for the signal value of interpolation, often makes the signal after frequency reducing the situation of distorted signals occur.

Summary of the invention

The present invention provides a kind of speech signal processing device and audio signal processing method, can be effectively prevented from There is the situation of distorted signals in voice signal after frequency reducing.

The speech signal processing device of the present invention includes processing unit, and it receives the sampled signal including sequence The sampled speech signal of window, and calculate and each sampling according to three sampled values of continuous print in each sampled signal window The value of the interpolated parameter function that signal window is corresponding, down sample voice signal, to produce the fall including sequence Frequently the down-scaled signals of signal window, calculates each fall according to the value of interpolated parameter function corresponding to each down-scaled signals window Frequently interpolated value between adjacent two sampled points in signal window.

In one embodiment of this invention, above-mentioned speech signal processing device also includes sampling unit, its coupling Connecing processing unit, primary speech signal of sampling, to produce sampled speech signal, in processing unit also judges Whether the value inserting parametric function is less than higher limit and more than or equal to lower limit, if the value of interpolated parameter function is not Less than higher limit or not more than or equal to lower limit, the value of modified interpolation parametric function.

In one embodiment of this invention, if wherein the value of interpolated parameter function is more than or equal to higher limit, will The value of interpolated parameter function is modified to higher limit, if the value of interpolated parameter function is less than lower limit, by interpolation The value of parametric function is modified to lower limit.

In one embodiment of this invention, above-mentioned higher limit and lower limit are associated with the frequency of primary speech signal Rate and the sample frequency of sampling unit.

In one embodiment of this invention, above-mentioned processing unit is also according to continuous print three in each sampled signal window Trigonometric function relation between individual sampled value calculates the interpolated parameter function that each sampled signal window is corresponding.

In one embodiment of this invention, above-mentioned interpolated parameter function is trigonometric function.

The audio signal processing method of the present invention comprises the following steps.Sampling primary speech signal, to produce Sampled speech signal including the sampled signal window of sequence.Adopt according to continuous print three in each sampled signal window Sample value calculates the value of interpolated parameter function corresponding to each sampled signal window.Down sample voice signal, to produce The down-scaled signals of the raw down-scaled signals window including sequence.According to the interpolated parameter letter that each down-scaled signals window is corresponding The value of number calculates the interpolated value in each down-scaled signals window between adjacent two sampled points.

In one embodiment of this invention, above-mentioned audio signal processing method also includes, it is judged that interpolated parameter Whether the value of function is less than higher limit and more than or equal to lower limit, if the value of interpolated parameter function is not less than upper Limit value or more than or equal to lower limit, the value of modified interpolation parametric function.

In one embodiment of this invention, wherein higher limit and lower limit are associated with the frequency of primary speech signal Rate and the sample frequency of sampling unit.

In one embodiment of this invention, above-mentioned audio signal processing method includes, according to each sampled signal In window, the trigonometric function relation between three sampled values of continuous print calculates the interpolated parameter that each sampled signal window is corresponding Function.

Based on above-mentioned, embodiments of the invention calculate according to three sampled values of continuous print in sampled signal window The value of the interpolated parameter function corresponding with sampled signal window, and calculate fall according to the value of interpolated parameter function Frequently interpolated value between adjacent two sampled points in signal window, to obtain accurate interpolated value, is effectively prevented from fall There is the situation of distorted signals in voice signal after Pin.

For the features described above of the present invention and advantage can be become apparent, special embodiment below, and coordinate Accompanying drawing is described in detail below.

Accompanying drawing explanation

Fig. 1 is shown as the schematic diagram of the speech signal processing device of one embodiment of the invention；

Fig. 2 illustrates the schematic diagram of the down-scaled signals of one embodiment of the invention；

Fig. 3 illustrates the schematic flow sheet of the audio signal processing method of one embodiment of the invention.

Description of reference numerals:

102: processing unit；

104: sampling unit；

S1: primary speech signal；

S2: sampled speech signal；

S3: down-scaled signals；

Wm, Wm+1: down-scaled signals window；

S (2n), s (2n+2), s (2n+4), s (2n+6), s (2n+8): sampled point；

S (2n+1), s (2n+3), s (2n+5), s (2n+7): interpolated point；

S302～S312: step.

Detailed description of the invention

Fig. 1 is shown as the schematic diagram of the speech signal processing device of one embodiment of the invention, refer to Fig. 1. Speech signal processing device includes processing unit 102 and sampling unit 104, and processing unit 102 couples Sampling unit 104, wherein processing unit 102 can such as be implemented with CPU, and list of sampling Unit 104 then can such as implement with logic circuit, but is not limited.Sampling unit 104 can be sampled former Beginning voice signal S1, to produce sampled speech signal S2, wherein sampled speech signal S2 includes a sequence Sampled signal window.Processing unit 102 can be according to three sampled value meters of continuous print in each sampled signal window Calculate the value of the interpolated parameter function corresponding with each sampled signal window, additionally, also can down sample voice letter Number S2 is to produce the down-scaled signals of the down-scaled signals window including a sequence, and according to each down-scaled signals window institute The value of corresponding interpolated parameter function calculates the interpolation in each down-scaled signals window between adjacent two sampled points Value, wherein interpolated parameter function is trigonometric function, such as SIN function or cosine function, but not as Limit.

For example, Fig. 2 illustrates the schematic diagram of the down-scaled signals of one embodiment of the invention, refer to Fig. 2. In fig. 2, the part of black circle is the sampled point of sampling unit 104, and the part of soft dot is then The interpolated point calculated by processing unit 102.It is assumed herein that m-th is adopted in sampled speech signal S2 In sample signal window, the sampled value at time point n isWherein m is positive integer, and n is 0 or just Integer.It addition, in the present embodiment, frequency reducing letter obtained after sampled speech signal S2 is carried out frequency reducing The half of the frequency that frequency is sampled speech signal S2 of number S3, if assuming m in down-scaled signals S3 Time point n in individual down-scaled signals window Wm (the m-th sampled signal window of its corresponding sampled speech signal S2) Sampled value be s_m(n), then before and after frequency reducing, the corresponding relation of same sampled point can be shown below:

s_{m} (2 n) = \hat{B} 2_{m} (n) - - - (1)

Processing unit 102 can calculate and each according to three sampled values of continuous print in each sampled signal window The interpolated parameter function that sampled signal window is corresponding, such as, the interpolation ginseng corresponding to m-th sampled signal window Number function C_mG () can be according to sampling unit 104 three sampled values of continuous sampling in sampled signal windowAndBetween trigonometric function relation try to achieve, in sampled signal Interpolated parameter function corresponding in the time range of window can be shown below:

C_{m} (g) = \frac{\hat{B} 2_{m} (2 g) + \hat{B} 2_{m} (2 g + 2) + 2 \hat{B} 2_{m} (2 g + 1)}{4 \hat{B} 2_{m} (2 g + 1)} - - - (2)

Wherein g is 0 or positive integer, C_mG () is the interpolated parameter function functional value at time point g, interpolation Parametric function C_mG () is trigonometric function.

Produce owing to speech signal processing device there may be noise during signal processing, and cause The composition that the value of the interpolated parameter function calculated comprises noise, so asks for affecting processing unit 102 The degree of accuracy of interpolated value.Whether processing unit 102 can fall within default by the value judging interpolated parameter function In the range of whether inspect the value of interpolated parameter function by noise jamming, such as can determine whether interpolated parameter letter Whether the value of number is less than higher limit and more than or equal to lower limit, if the value of interpolated parameter function is not less than the upper limit It is worth or more than or equal to lower limit, then the value of representation parameter function is by noise jamming, processing unit 102 Can the value of modified interpolation parametric function, to remove the noise contribution included in the value of interpolated parameter function. Such as, if the value of interpolated parameter function is more than or equal to higher limit, processing unit 102 can be by interpolated parameter letter The value of number is modified to higher limit, if the value of interpolated parameter function is less than lower limit, processing unit 102 can be by The value of interpolated parameter function is modified to lower limit, and if the value of interpolated parameter function less than higher limit and is more than Equal to lower limit, then it is not required to the value of interpolation parametric function is modified.For example, in the reality of Fig. 2 Execute in example, interpolated parameter function C_mG the correcting mode of the value of () can be with following formula subrepresentation:

C_{m} (g) = \{\begin{matrix} C_{m} (g), 0.5 \leq C_{m} (g) < 1 \\ 0.5, C_{m} (g) < 0.5 \\ 1, C_{m} (g) &GreaterEqual; 1 \end{matrix} - - - (3)

The most above-mentioned higher limit and lower limit are respectively 1 and 0.5 in the embodiment of fig. 2, if voice Signal processing apparatus by effect of noise, and makes interpolated parameter function during signal processing C_mG the value of () is more than or equal to 1, then processing unit 102 is by interpolated parameter function C_mG the value of () is modified to 1, And if interpolated parameter function C_mG the value of () is less than 0.5, then processing unit 102 is by interpolated parameter function C_m(g) Value be modified to 0.5.It should be noted that higher limit and the lower limit of formula (3) are only exemplary embodiment, It is not limited thereto.Wherein the situation of higher limit and the interference of lower limit visual actual noise adjusts, such as Higher limit and lower limit can be adjusted according to the sample frequency of the frequency of primary speech signal Yu sampling unit.

After the value obtaining interpolated parameter function, processing unit 102 just can be counted according to interpolated parameter function Calculate the interpolated value between adjacent two sampled points in down-scaled signals window.As a example by the embodiment of Fig. 2, believe in frequency reducing In number window Wm interpolated point s (2n+1) between the sampled point s (2n), s (2n+2) of sampling unit 104 with And the interpolated point s (2n+3) between sampled point s (2n+2), s (2n+4) can distinguish shown in following formula:

s (2 n + 1) = \frac{s (2 n) + s (2 n + 2)}{2 \sqrt{C_{m} (\frac{n}{2})}} - - - (4)

s (2 n + 3) = \frac{s (2 n + 2) + s (2 n + 4)}{2 \sqrt{C_{m} (\frac{n}{2})}} - - - (5)

In formula (4), formula (5), n is 0 or positive even numbers.The rest may be inferred, sampled point in other down-scaled signals windows Between interpolated value can also identical mode try to achieve, such as sampled point in the down-scaled signals window Wm+1 of Fig. 2 Interpolated point s (2n+5) between s (2n+4), s (2n+6) and between sampled point s (2n+6), s (2n+8) Interpolated point s (2n+7) can also the embodiment of Fig. 2 try to achieve, those skilled in the art should be according to above-mentioned The method of embodiment pushes away to obtain its embodiment, thus does not repeats them here.

As it has been described above, the present embodiment is the interpolated value utilizing trigonometric function to come between estimating sampling point, according to interior Slotting parametric function calculates the interpolated value in down-scaled signals window between adjacent two sampled points, due to trigonometric function Characteristic is more similar to the characteristic of acoustical signal, therefore merely utilizes arithmetical average compared to prior art Asking for interpolated value, the calculation of the present embodiment can obtain more accurate interpolated value, and can effectively keep away Exempt from the voice signal after frequency reducing and the situation of distorted signals occurs.

Fig. 3 illustrates the schematic flow sheet of the audio signal processing method of one embodiment of the invention, refer to figure 3.From above-described embodiment, the audio signal processing method of speech signal processing device can include following step Suddenly.First, primary speech signal of sampling, to produce the sampled speech of the sampled signal window including a sequence Signal (step S302).Then, each sampling is calculated according to three sampled values of continuous print in each sampled signal window The value (step S304) of the interpolated parameter function that signal window is corresponding, wherein interpolated parameter function can be according to respectively adopting In sample signal window, the trigonometric function relation between three sampled values of continuous print calculates and obtains, and interpolated parameter function can For trigonometric function.Afterwards, can then judge whether the value of interpolated parameter function is less than higher limit and is more than In lower limit (step S306), if the value of interpolated parameter function is not less than higher limit or not more than or equal to lower limit Value, then the value (step S308) of modified interpolation parametric function, to remove unnecessary noise.The wherein upper limit The situation of value and the interference of lower limit visual actual noise adjusts, such as can be according to the frequency of primary speech signal The sample frequency of rate and sampling unit adjusts higher limit and lower limit, and the repairing of the value of interpolated parameter function Positive way can for example, when the value of interpolated parameter function is more than or equal to higher limit, by interpolated parameter function Value be modified to higher limit, when interpolated parameter function value less than lower limit time, by interpolated parameter function Value is modified to lower limit.After the value having revised interpolated parameter function, can then down sample voice signal, To produce the down-scaled signals (step S310) of the down-scaled signals window including a sequence, then believe according to each frequency reducing The value of number interpolated parameter function that window is corresponding calculates the interpolated value in each down-scaled signals window between adjacent two sampled points (step S312).If on the contrary, the value of interpolated parameter function is less than higher limit and more than or equal to lower limit, Then it is directly entered step S310, down sample voice signal.

In sum, embodiments of the invention utilize trigonometric function to carry out the interpolated value between estimating sampling point, also Just it is based on interpolated parameter function to the interpolated value calculating in down-scaled signals window between adjacent two sampled points, due to The characteristic of trigonometric function is more similar to the characteristic of acoustical signal, therefore compared to prior art, can obtain more For accurate interpolated value, and the voice signal after frequency reducing can be effectively prevented from the situation of distorted signals occurs.

Last it is noted that various embodiments above is only in order to illustrate technical scheme, rather than to it Limit；Although the present invention being described in detail with reference to foregoing embodiments, the common skill of this area Art personnel it is understood that the technical scheme described in foregoing embodiments still can be modified by it, Or the most some or all of technical characteristic is carried out equivalent；And these amendments or replacement, and The essence not making appropriate technical solution departs from the scope of various embodiments of the present invention technical scheme.

Claims

1. a speech signal processing device, it is characterised in that including:

Processing unit, receives the sampled speech signal of the sampled signal window including sequence, and according to each described In sampled signal window, three sampled values of continuous print calculate the interpolated parameter letter corresponding with each described sampled signal window The value of number, sampled speech signal described in frequency reducing, to produce the down-scaled signals of the down-scaled signals window including sequence, Phase in each described down-scaled signals window is calculated according to the value of interpolated parameter function corresponding to each described down-scaled signals window Interpolated value between adjacent two sampled points.

Speech signal processing device the most according to claim 1, it is characterised in that also include:

Sampling unit, couples described processing unit, and primary speech signal of sampling, to produce described sampling language Tone signal, described processing unit also judges whether the value of described interpolated parameter function is less than higher limit and is more than Equal to lower limit, if the value of described interpolated parameter function is not less than described higher limit or not more than or equal to described Lower limit, revises the value of described interpolated parameter function.

Speech signal processing device the most according to claim 2, it is characterised in that if described interpolation The value of parametric function is more than or equal to described higher limit, the value of described interpolated parameter function is modified on described Limit value, if the value of described interpolated parameter function is less than described lower limit, by the value of described interpolated parameter function It is modified to described lower limit.

Speech signal processing device the most according to claim 3, it is characterised in that described higher limit The frequency of described primary speech signal and the sample frequency of described sampling unit it is associated with described lower limit.

Speech signal processing device the most according to claim 1, it is characterised in that described process list Unit also calculates each institute according to trigonometric function relation between three sampled values of continuous print in each described sampled signal window State the interpolated parameter function that sampled signal window is corresponding.

Speech signal processing device the most according to claim 5, it is characterised in that described interpolation is joined Number function is trigonometric function.

7. an audio signal processing method, it is characterised in that including:

Sampling primary speech signal, to produce the sampled speech signal of the sampled signal window including sequence；

Calculate and each described sampled signal window pair according to three sampled values of continuous print in each described sampled signal window The value of the interpolated parameter function answered；

Sampled speech signal described in frequency reducing, to produce the down-scaled signals of the down-scaled signals window including sequence；With And

Each described down-scaled signals window is calculated according to the value of interpolated parameter function corresponding to each described down-scaled signals window In interpolated value between adjacent two sampled points.

Audio signal processing method the most according to claim 7, it is characterised in that also include:

Judge whether the value of described interpolated parameter function is less than higher limit and more than or equal to lower limit, if described The value of interpolated parameter function, not less than described higher limit or not more than or equal to described lower limit, is revised described interior Insert the value of parametric function.

Audio signal processing method the most according to claim 8, it is characterised in that if described interpolation The value of parametric function is more than or equal to described higher limit, the value of described interpolated parameter function is modified on described Limit value, if the value of described interpolated parameter function is less than described lower limit, by the value of described interpolated parameter function It is modified to described lower limit.

Audio signal processing method the most according to claim 9, it is characterised in that the described upper limit Value and described lower limit are associated with the frequency of described primary speech signal and the sampling frequency of described sampling unit Rate.