JPH098680A

JPH098680A - Voice decoding device

Info

Publication number: JPH098680A
Application number: JP7173886A
Authority: JP
Inventors: Ichiro Matsumoto; 一郎松本; Osamu Watanabe; 治渡辺
Original assignee: Kokusai Electric Corp
Current assignee: Kokusai Electric Corp
Priority date: 1995-06-19
Filing date: 1995-06-19
Publication date: 1997-01-10
Anticipated expiration: 2019-10-27
Also published as: JP3580906B2

Abstract

PURPOSE: To reduce the abnormal change of the false background noise outputted to a silent section at the time when the level of the background noise as the last frame of a sound section is quickly changed by disturbance or the like with respect to the voice decoding device which receives and decodes the intermittent transmission signal transmitted only in sound sections in the adaptive difference PCM encoding communication. CONSTITUTION: A controller 4 outputs control signals (d) and (e) indicating a sound section or a silent section in accordance with a sound detection flag (b) separated from a reception demodulated signal (a) by a demultiplexer 3. An auxiliary prediction coefficient holder 7 is provided, and a prediction coefficient (i) of the last frame in a prediction coefficient holder 6 which extracts, updates, and stores a prediction coefficient (g) of each frame used for decoding of voice encoded data (c) by a decoder 5 is stored and updated plural times, and the average value of these plural stored values is obtained as a prediction coefficient h=f for reproducing of the false background noise in the silent section.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は音声符号化通信における
音声復号装置に関し、特に、符号化方式に適応差分ＰＣ
Ｍ（ＡＤＰＣＭ）方式を適用した音声復号装置に関する
ものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech decoding device in speech coding communication, and more particularly to an adaptive difference PC for a coding system.
The present invention relates to a speech decoding device to which the M (ADPCM) system is applied.

【０００２】[0002]

【従来の技術】音声による通信が行われている場合、通
話者のどちらか一方が発声している時間率は約３５％で
あるといわれている。近年、個人主体の通信であるパー
ソナルコミュニケーションがその範囲を拡大してきてい
る。そこでは携帯に便利な端末を利用した音声通信が主
体である。このような携帯端末に要求される事項とし
て、第１にコードレス化が挙げられる。第２に携帯に便
利なように電池が使用され長時間にわたる使用に耐える
必要があるため回路消費電力の低減化が要求されてい
る。2. Description of the Related Art When voice communication is performed, it is said that the time rate at which either one of the callers speaks is about 35%. In recent years, the range of personal communication, which is a communication mainly based on individuals, has been expanding. Here, voice communication using a portable terminal is mainly performed. As a matter required for such a portable terminal, firstly, there is cordlessness. Secondly, it is required to reduce circuit power consumption because a battery is used for convenience of carrying and it is required to endure long-term use.

【０００３】回路消費電力の従来の低減方法として、音
声の発声時間率に着目し発声している時のみ送信回路を
動作させ、その他の送信時間は回路を休止状態にする方
法がある。このような技術を実現するためには送信側に
音声検出機能を設けて有音区間のみ送信する不連続送信
装置を付加すればよい。その場合に問題となるのは受信
側である。すなわち、受信側では再生音声が断続する為
に非常に不愉快な音声になる。この原因は、音声を伝送
しているときは音声に背景雑音が重畳されているが、音
声がないときは背景雑音が伝送されない。すなわち、背
景雑音は音声信号がある時のみ変調されて伝送されるた
めであることが知られている。このような問題点を解決
する方法として、受信側で音声信号が伝送されて来ない
無音区間は送信側の背景雑音に類似した擬似背景雑音を
発生させる方法が知られている。As a conventional method of reducing the circuit power consumption, there is a method of paying attention to the utterance time rate of a voice and operating the transmission circuit only when uttering a voice, and putting the circuit in a rest state during the other transmission time. In order to realize such a technique, a voice detection function may be provided on the transmission side to add a discontinuous transmission device that transmits only a sound period. In that case, the problem is on the receiving side. That is, the reproduced sound is intermittent on the receiving side, which makes the sound very unpleasant. The cause is that the background noise is superimposed on the voice when the voice is transmitted, but the background noise is not transmitted when there is no voice. That is, it is known that the background noise is modulated and transmitted only when there is a voice signal. As a method for solving such a problem, a method is known in which a pseudo background noise similar to the background noise on the transmitting side is generated in a silent section in which a voice signal is not transmitted on the receiving side.

【０００４】本発明は受信側の復号器を対象とするもの
であるが、受信側の復号器には送信側の符号化器と同じ
構成部分があるので、送信側のＡＤＰＣＭ符号化器につ
いて概要を説明する。図５は送信側符号化器のブロック
図である。図において、５１は６４ｋｂ／ｓのμ則ＰＣ
Ｍ入力信号を線形１３ビットＰＣＭに変換する均一ＰＣ
Ｍ変換器である。５２は均一ＰＣＭ変換器５１の出力か
ら適応予測器５３の出力である予測信号ｊを差し引いて
差分信号ｋを得る減算器５２である。この差分信号ｋは
適応量子化器５４により量子化され、ＡＤＰＣＭ音声符
号化器の出力として３２ｋｂ／ｓの音声データが伝送路
に送出される。一方、適応逆量子化器５６は、３２ｋｂ
／ｓの出力音声データを適応逆量子化して量子化差分信
号ｍを出力する。加算器５５は、量子化差分信号ｍと予
測信号ｊを加算して再生信号ｎを出力する。適応予測器
５３は、予測係数を算出しそれを用いて量子化差分信号
ｍおよび再生信号ｎから予測信号ｊを生成して出力す
る。The present invention is intended for the decoder on the receiving side, but since the decoder on the receiving side has the same constituent parts as the encoder on the transmitting side, the outline of the ADPCM encoder on the transmitting side will be described. Will be explained. FIG. 5 is a block diagram of the transmission side encoder. In the figure, 51 is a μ-law PC of 64 kb / s
Uniform PC for converting M input signal to linear 13-bit PCM
M converter. Reference numeral 52 is a subtracter 52 that subtracts the prediction signal j that is the output of the adaptive predictor 53 from the output of the uniform PCM converter 51 to obtain the difference signal k. This differential signal k is quantized by the adaptive quantizer 54, and 32 kb / s voice data is sent to the transmission line as the output of the ADPCM voice encoder. On the other hand, the adaptive inverse quantizer 56 has a capacity of 32 kb.
The output voice data of / s is adaptively dequantized and the quantized difference signal m is output. The adder 55 adds the quantized difference signal m and the prediction signal j and outputs the reproduction signal n. The adaptive predictor 53 calculates a prediction coefficient and uses it to generate and output a prediction signal j from the quantized difference signal m and the reproduction signal n.

【０００５】適応予測器５３が予測信号ｊを生成するた
めに算出する予測係数は、ある時点の標本値を、隣接す
る過去の２つの標本値で予測するための係数であり、そ
の値は音色を表す。例えば、予測係数は、音声信号の場
合と背景雑音の場合とでは異なった発生分布となり、同
じ背景雑音でも、平坦な周波数特性を持つ白色雑音と、
一般によく知られているピンク雑音のように傾きを持っ
た雑音とでも発生分布が異なる。一例として、図６は入
力信号と予測係数の波形例図であり、適応予測器５３で
計算される予測係数のうちａ１，ａ２と音声の時間的変
化を示す。区間Ａ，Ｂ，Ｃの音声のある区間と、区間
Ｄ，Ｅの背景雑音のみの区間では予測係数の値が異なる
ことが分かる。ここで音声に重畳している背景雑音は、
例えば、傾いた周波数特性をもつ実際の空気調節装置
（エアコン）の雑音である。従来技術、本発明は、とも
に擬似背景雑音を発生させる一手段としてこの予測係数
に注目したものである。その一例を説明する。The prediction coefficient calculated by the adaptive predictor 53 to generate the prediction signal j is a coefficient for predicting a sample value at a certain time point by two adjacent sample values in the past, and the value is a tone color. Represents For example, the prediction coefficient has different occurrence distributions in the case of a voice signal and the case of background noise, and even with the same background noise, white noise having a flat frequency characteristic,
The generation distribution is different even for noise having a slope such as the well-known pink noise. As an example, FIG. 6 is a waveform example diagram of the input signal and the prediction coefficient, and shows a1 and a2 of the prediction coefficients calculated by the adaptive predictor 53 and the temporal change of the voice. It can be seen that the value of the prediction coefficient is different between the sections of the sections A, B, and C where the voice is present and the sections D and E of the background noise only. Here, the background noise superimposed on the voice is
For example, it is noise of an actual air conditioner (air conditioner) having a tilted frequency characteristic. The prior art and the present invention both focus on this prediction coefficient as one means for generating pseudo background noise. An example will be described.

【０００６】図４は従来の復号装置のブロック図であ
る。図４において、１はアンテナである。２は送信側の
符号化器からの音声の有り無しを判断する音声検出器の
音声検出フラグとＡＤＰＣＭ符号化データとが多重され
て変調された変調波を受信復調する受信復調器である。
３は受信復調された信号ａをＡＤＰＣＭ符号化データｃ
と音声検出フラグｂに分離する多重分離器である。４は
多重分離器３からの音声検出フラグｂを受け、電力保持
器、ＡＤＰＣＭ復号器５、予測係数保持器６にそれぞれ
を後に記述する制御を行うための制御信号を出力する制
御器である。５は制御器４からの制御信号ｄを受けて、
擬似背景雑音用ランダムＡＤＰＣＭデータを発生すると
ともに、予測係数保持器６と予測係数データの送受を行
い、多重分離器３からのＡＤＰＣＭ符号化データｃを復
号するＡＤＰＣＭ復号器である。６はＡＤＰＣＭ復号器
５の予測係数ｇをフレーム毎に平均値を計算して、制御
器４からの制御信号ｅにより有音区間は平均値の記憶の
更新を行い、無音区間は更新を中止する予測係数保持器
である。８はスピーカである。FIG. 4 is a block diagram of a conventional decoding device. In FIG. 4, 1 is an antenna. Reference numeral 2 denotes a reception demodulator that receives and demodulates a modulated wave that is obtained by multiplexing a voice detection flag of the voice detector that determines the presence or absence of voice from the encoder on the transmission side and ADPCM coded data.
3 is the ADPCM encoded data c of the received and demodulated signal a
And a voice detection flag b. Reference numeral 4 denotes a controller that receives the voice detection flag b from the demultiplexer 3 and outputs a control signal to the power holder, the ADPCM decoder 5, and the prediction coefficient holder 6 for performing control described later. 5 receives the control signal d from the controller 4,
The ADPCM decoder generates random ADPCM data for pseudo background noise, transmits and receives the prediction coefficient data to and from the prediction coefficient holder 6, and decodes the ADPCM coded data c from the demultiplexer 3. Reference numeral 6 calculates the average value of the prediction coefficient g of the ADPCM decoder 5 for each frame, updates the storage of the average value in the voiced section and stops the update in the silent section by the control signal e from the controller 4. It is a prediction coefficient holder. Reference numeral 8 is a speaker.

【０００７】図４と図７を用いて動作を説明する。ま
ず、図４において、音声検出フラグとＡＤＰＣＭ符号化
データが多重化され変調された変調波をアンテナ１で受
け、受信復調器２で受信復調して多重分離器３に復調信
号ａを送る。ここでいう音声検出フラグとは、送信側の
符号化器の入力音声の、音声が有る部分（有音区間）と
無い部分（無音区間）を音声検出器で検出した結果を示
すものである。多重分離器３は、音声検出フラグｂとＡ
ＤＰＣＭ符号化データｃに分離する。この場合、例とし
て、ＡＤＰＣＭ符号化データｃは５ｍｓｅｃを１フレー
ムとしている。５ｍｓｅｃ毎に音声検出フラグｂが制御
器４に送られる。制御器４は、音声検出フラグｂを受け
て、有音区間，無音区間のいずれかを示す制御信号ｄ，
ｅを出力する。ＡＤＰＣＭ符号器５は、無音区間では変
調信号が途絶えてしまうため、自ら擬似背景雑音を生成
するためにＡＤＰＣＭ符号化データｃが取り得る範囲の
ランダムなデータを内部生成し、そのデータを予測係数
保持器６から与えられる予測係数ｆを用いて復号する。The operation will be described with reference to FIGS. 4 and 7. First, in FIG. 4, a modulated wave in which a voice detection flag and ADPCM coded data are multiplexed and modulated is received by an antenna 1, received and demodulated by a reception demodulator 2, and a demodulation signal a is sent to a demultiplexer 3. The voice detection flag mentioned here indicates the result of detection by the voice detector of a portion with voice (voiced section) and a portion without voice (voiceless section) of the input voice of the encoder on the transmission side. The demultiplexer 3 uses the voice detection flags b and A
Separate into DPCM encoded data c. In this case, as an example, the ADPCM coded data c has 5 msec as one frame. The voice detection flag b is sent to the controller 4 every 5 msec. Upon receiving the voice detection flag b, the controller 4 receives a control signal d, which indicates either a voiced section or a silent section.
Output e. The ADPCM encoder 5 internally generates random data within a range that the ADPCM encoded data c can take in order to generate pseudo background noise, because the modulated signal is cut off in the silent section, and the data is held as a prediction coefficient. Decoding is performed using the prediction coefficient f given from the device 6.

【０００８】予測係数保持器６は、送信断時のＡＤＰＣ
Ｍ復号器５で生成した擬似背景雑音に、実際の雑音のス
ペクトラム情報を付加するために、ＡＤＰＣＭ復号器５
内の予測係数ｇを抽出してフレーム毎に平均値を求めて
更新保持する。制御信号ｅが有音区間から無音区間に変
わったとき、その直前のフレーム、即ち、有音区間の最
後のフレームの予測係数の平均値を保持して、更新を中
止する。ＡＤＰＣＭ復号器５は、この更新を中止して保
持されている予測係数ｆを使用して内部で発生したラン
ダムデータの復号を行い、擬似背景雑音を出力する。The prediction coefficient holder 6 is used for ADPC when transmission is interrupted.
In order to add the spectrum information of the actual noise to the pseudo background noise generated by the M decoder 5, the ADPCM decoder 5
The prediction coefficient g in is extracted, the average value is calculated for each frame, and the updated value is held. When the control signal e changes from the voiced section to the silent section, the average value of the prediction coefficient of the immediately preceding frame, that is, the last frame of the voiced section is held and the update is stopped. The ADPCM decoder 5 cancels this update, decodes the internally generated random data using the prediction coefficient f held, and outputs pseudo background noise.

【０００９】符号化器側の音声検出器には、有音区間か
ら無音区間に変化したとき、会話の語尾切れを防ぐため
に約数１００ｍｓｅｃの間有音区間と判定するハングオ
ーバーと一般に言われている機能が備わっているため、
有音区間の最後のフレームの予測係数値は背景雑音の値
となる。この効果で、ランダムなデータを用いて復号し
ても実際の符号化器側の音声に重畳されている背景雑音
と似た音色を持つ雑音が生成される。It is generally said that the speech detector on the encoder side is a hangover in which when a voiced section is changed to a silent section, it is judged as a voiced section for about several 100 msec in order to prevent the ending of conversation. Is equipped with
The prediction coefficient value of the last frame of the voiced section is the value of background noise. With this effect, even if decoding is performed using random data, noise having a tone color similar to the background noise superimposed on the actual voice on the encoder side is generated.

【００１０】図７は擬似背景雑音に対する予測係数を説
明する波形例図であり、外乱のない場合を示す。図にお
いて、（Ａ）は符号化器の入力音声信号（音声＋背景雑
音）を示し、（Ｂ）はこれに対する予測係数のうち、例
として予測係数ａ１の時間変化を示す。符号化器側の音
声検出処理で、音声が音声検出フラグ（Ｃ）のように理
想的に検出されたとすると、符号化器は音声信号（Ｄ）
の音声データだけを断続送信する。（Ｅ）はこの音声信
号（Ｄ）に対する予測係数ａ１である。従来技術では、
音声信号（Ｄ）のように区間ａ，ｂ，ｃに擬似背景雑音
を挿入するために、有音区間から無音区間に変わる直前
のフレーム、ここでは、予測係数保持器６で保持される
ｔ，ｔ−１，ｔ−２の予測係数ａ１の平均値（Ｆ）を使
用してそれぞれ区間ａ，ｂ，ｃのランダムデータを復号
し、擬似背景雑音を出力する。つまり復号器５が復号に
用いる予測係数は予測係数ａ１（Ｇ）のようになる。こ
の状態では無音区間ａ，ｂ，ｃの擬似背景雑音の音色は
予測係数ａ１（Ｆ）のように区間ａ，ｂ，ｃでほぼ同じ
値の実際の背景雑音の予測係数値になっているため、違
和感のない連続性のある擬似背景雑音になる。FIG. 7 is a waveform example diagram for explaining the prediction coefficient for the pseudo background noise, showing the case where there is no disturbance. In the figure, (A) shows the input speech signal (speech + background noise) of the encoder, and (B) shows the temporal change of the prediction coefficient a1 as an example among the prediction coefficients for this. If the speech is ideally detected like the speech detection flag (C) in the speech detection processing on the encoder side, the encoder outputs the speech signal (D).
Only the voice data of is transmitted intermittently. (E) is a prediction coefficient a1 for this audio signal (D). In the prior art,
In order to insert the pseudo background noise into the sections a, b, and c like the voice signal (D), the frame immediately before the transition from the voiced section to the silence section, here, t held by the prediction coefficient holder 6, Random data of sections a, b, and c are decoded using the average value (F) of the prediction coefficients a1 of t-1 and t-2, and pseudo background noise is output. That is, the prediction coefficient used by the decoder 5 for decoding is the prediction coefficient a1 (G). In this state, the tone color of the pseudo background noise in the silent sections a, b, and c is the same as the prediction coefficient value of the actual background noise in the sections a, b, and c, as in the prediction coefficient a1 (F). , It becomes a pseudo-background noise that is continuous without any discomfort.

【００１１】[0011]

【発明が解決しようとする課題】次に、従来技術の問題
点を図８を用いて説明する。図８は外乱のある場合の予
測係数の波形例図である。有音区間の最後の背景雑音の
フレーム（ｔ−１）に急に変化する外乱等が加わった場
合、例えば、符号化器側のマイクロホンに異音が入った
場合や伝送路上での誤りによる場合等、または受信した
音声検出フラグが誤った判定結果を示した場合、例え
ば、符号化器側の音声検出器が判定誤りを起こして有音
区間中に無音区間と判定してしまった場合や伝送路上で
音声検出フラグに誤りが生じてしまった場合等、入力音
声信号（Ａ）、予測係数（Ｂ）、音声検出フラグ（Ｃ）
で示したように、有音区間の最終フレームｔ−１は、入
力音声（Ａ），予測係数ａ１（Ｂ）ともに背景雑音とレ
ベルが異なった状態になっている。従来方式では、有音
区間の最後のフレームの予測係数の平均値のみを用いて
引き続く無音区間の擬似背景雑音を生成している。即
ち、区間ｂの予測係数として予測係数ａ１（Ｅ）のＺの
部分の平均値が用いられるため、予測係数ａ１（Ｆ）の
差ｘ，ｙのように、区間ａと区間ｃとがかなり異なった
値が用いられるため、区間ｂの擬似背景雑音の音色が異
なり、区間ａ，ｂ，ｃでは、（Ｇ）に示すように連続性
のない違和感のあるものになってしまうという欠点があ
る。Next, problems of the prior art will be described with reference to FIG. FIG. 8 is a waveform example diagram of the prediction coefficient when there is a disturbance. When a sudden disturbance or the like is added to the last background noise frame (t-1) in the voiced section, for example, when an abnormal noise is input to the microphone on the encoder side or due to an error on the transmission path. Etc., or when the received voice detection flag indicates an incorrect determination result, for example, when the voice detector on the encoder side causes a determination error and determines that there is a silent section in the voiced section or transmission When an error occurs in the voice detection flag on the road, the input voice signal (A), the prediction coefficient (B), the voice detection flag (C)
As shown in, the final frame t-1 in the voiced section has a level different from the background noise in both the input speech (A) and the prediction coefficient a1 (B). In the conventional method, only the average value of the prediction coefficient of the last frame in the voiced section is used to generate the pseudo background noise in the subsequent silence section. That is, since the average value of the Z part of the prediction coefficient a1 (E) is used as the prediction coefficient of the section b, the section a and the section c are considerably different like the difference x and y of the prediction coefficient a1 (F). Since different values are used, the tone color of the pseudo background noise in the section b is different, and in the sections a, b, and c, there is a discontinuity and discomfort as shown in (G).

【００１２】本発明の目的は、従来技術の問題点である
短時間の断続受信に伴う擬似背景雑音生成におよぼす悪
影響を低減させ、ＡＤＰＣＭ音声符号化方式において、
有音区間の最後の部分、無声になった直前にレベルや背
景雑音と音色の違いが大きい外乱があっても、再生され
る擬似背景雑音に違和感が生じないようにした音声復号
装置を提供することにある。An object of the present invention is to reduce the adverse effect on the pseudo background noise generation due to intermittent reception for a short time, which is a problem of the prior art, and to realize the ADPCM voice coding system.
(EN) Provided is a voice decoding device which prevents discomfort in reproduced pseudo background noise even when there is a disturbance having a large difference in tone color from the last part of a voiced section or immediately before becoming unvoiced. Especially.

【００１３】[0013]

【課題を解決するための手段】本発明の音声復号装置
は、音声信号の適応差分ＰＣＭ符号化信号と、発声期間
と該発声期間から無声期間になった直後の一定時間の背
景雑音期間とを含む有音区間と次の有音区間までの無音
区間のいずれかを示す音声検出フラグとが多重変調さ
れ、前記有音区間のみ送信され無音区間の送信が停止さ
れる断続送信信号を受信して復調する受信復調器と、該
受信復調器の出力を適応差分ＰＣＭ符号化信号と音声検
出フラグとに分離する多重分離器と、該多重分離器から
出力される音声検出フラグに従って有音区間と無音区間
のいずれかを示す制御信号を出力する制御器と、前記多
重分離器から出力される有音区間の適応差分ＰＣＭ符号
化信号を前記制御器からの制御信号が有音区間を示すと
き予測係数を用いて復号し再生音声として出力するとと
もに、該制御信号が無音区間を示すとき内部で発生させ
たランダム符号を外部から与えられるの予測係数平均値
によって復号し無音区間の再生雑音として擬似背景雑音
を出力するＡＤＰＣＭ復号器と、前記制御器からの制御
信号が有音区間を示すとき該ＡＤＰＣＭ復号器の前記予
測係数を抽出しフレーム毎の平均値を求めて記憶，更新
し、前記制御器からの制御信号が無音区間を示すとき記
憶，更新を停止して直前の前記背景雑音期間の予測係数
を前記ＡＤＰＣＭ復号器に与える予測係数保持器とが備
えられた音声復号装置において、前記予測係数保持器が
求めた前記背景雑音期間の予測係数平均値を抽出記憶
し、過去の値に重み付けを与えて求めた予測係数を前記
予測係数保持器に与えて前記ＡＤＰＣＭ復号器に対する
無音区間の予測係数平均値とする補助予測係数保持器を
設けたことを特徴とするものである。A speech decoding apparatus of the present invention comprises an adaptive differential PCM coded signal of a speech signal, a utterance period and a background noise period of a fixed time immediately after the utterance period becomes a silent period. A voice detection flag indicating any one of a voiced section including a voiced section up to the next voiced section is multi-modulated, and an intermittent transmission signal is transmitted in which only the voiced section is transmitted and the transmission of the voiceless section is stopped. A reception demodulator for demodulation, a demultiplexer for separating the output of the reception demodulator into an adaptive differential PCM coded signal and a voice detection flag, and a voiced section and a silence according to the voice detection flag output from the demultiplexer. A controller that outputs a control signal indicating any one of the sections, and a prediction coefficient for the adaptive differential PCM coded signal of the sound section output from the demultiplexer when the control signal from the controller indicates the sound section. Using ADPCM for outputting a reproduced voice and outputting a pseudo background noise as reproduced noise in a silent section by decoding a random code generated internally when the control signal indicates a silent section by an average value of a prediction coefficient given from the outside. When the control signal from the decoder and the controller indicates a voiced section, the prediction coefficient of the ADPCM decoder is extracted, an average value for each frame is obtained, stored and updated, and the control signal from the controller is In a speech decoding apparatus provided with a prediction coefficient holder that stops storing and updating when indicating a silent section and gives the prediction coefficient of the immediately preceding background noise period to the ADPCM decoder, the prediction coefficient holder calculates The average value of the prediction coefficient of the background noise period is extracted and stored, and the prediction coefficient obtained by weighting the past value is given to the prediction coefficient holder to give the ADPCM recovery value. It is characterized in that an auxiliary prediction coefficients cage for the prediction coefficient average value of a silent interval for vessels.

【００１４】さらに、前記補助予測係数保持器は、前記
予測係数保持器がフレーム毎に平均値を求めた予測係数
のうち前記背景雑音期間の予測係数を過去の複数回にわ
たって抽出し順次更新記憶する予測係数メモリと、該予
測係数メモリの複数回の記憶値の平均値を算出して前記
予測係数保持器に与える予測係数とする平均値算出器と
を備えたことを特徴とし、または請求項１記載の補助予
測係数保持器は、前記予測係数保持器がフレーム毎の平
均値を求める毎に抽出した該フレームの前記背景雑音期
間の予測係数に０．５未満の正数を第１の乗算値として
乗算する第１の乗算器と、該第１の乗算器からの入力値
と他方の入力値とを加算して前記前記予測係数保持器に
与える予測係数とする加算器と、該加算器の出力を更新
記憶して出力する予測係数メモリと、該予測係数メモリ
の出力値に１から前記第１の乗算値を差し引いた値を第
２の乗算値として乗算し前記加算器の前記他方の入力値
とする第２の乗算器とを備えたことを特徴とするもので
ある。Further, the auxiliary prediction coefficient holder extracts the prediction coefficient of the background noise period from the prediction coefficients for which the prediction coefficient holder has calculated the average value for each frame over a plurality of past times, and sequentially updates and stores it. 3. A prediction coefficient memory, and an average value calculator for calculating an average value of stored values of the prediction coefficient memory for a plurality of times and providing a prediction coefficient to the prediction coefficient holder. According to the auxiliary prediction coefficient holder described above, the prediction coefficient of the background noise period of the frame extracted each time the prediction coefficient holder obtains an average value for each frame is multiplied by a first positive value less than 0.5. A first multiplier that multiplies the input value from the first multiplier, an adder that adds the input value from the first multiplier and the other input value to obtain a prediction coefficient to be given to the prediction coefficient holder, and Update and store output and output And a second coefficient multiplier for multiplying the output value of the prediction coefficient memory by subtracting the first multiplication value from 1 as a second multiplication value and setting the other input value of the adder. It is characterized by having and.

【００１５】[0015]

【実施例】図１は本発明の実施例を示すブロック図であ
る。図１の１はアンテナ、２は音声検出フラグとＡＤＰ
ＣＭ符号化データとが多重変調された変調波を受信復調
する受信復調器、３は受信復調信号ａをＡＤＰＣＭ符号
化データｃと音声検出フラグｂに分離する多重分離器、
４は音声検出フラグｂを受け、制御信号ｄ，ｅを出力す
る制御器、５はＡＤＰＣＭ復号器であり、以上は従来回
路と同じである。６予測係数保持器であり、有音区間は
ＡＤＰＣＭ復号器５の内部変数である予測係数ｇを抽出
してフレーム毎に平均値を計算して平均値を記憶更新
し、無音区間は更新を停止する。７は本発明で付加した
補助予測係数保持器、８はスピーカである。FIG. 1 is a block diagram showing an embodiment of the present invention. In FIG. 1, 1 is an antenna, 2 is a voice detection flag and ADP
A reception demodulator for receiving and demodulating a modulated wave in which CM coded data is subjected to multiplex modulation, a demultiplexer 3 for separating a reception demodulated signal a into ADPCM coded data c and a voice detection flag b,
Reference numeral 4 is a controller that receives the voice detection flag b and outputs control signals d and e. Reference numeral 5 is an ADPCM decoder. The above is the same as the conventional circuit. 6 predictive coefficient holder, the predictive coefficient g which is an internal variable of the ADPCM decoder 5 is extracted in the voiced section, the average value is calculated for each frame, the average value is stored and updated, and the update is stopped in the silent section. To do. Reference numeral 7 is an auxiliary prediction coefficient holder added in the present invention, and 8 is a speaker.

【００１６】〔動作〕予測係数保持器６及び補助予測係
数保持器７の動作以外は従来技術と同じであるので、従
来技術と異なる点を詳細に説明する。予測係数保持器６
は、制御信号ｅに従って有音区間はＡＤＰＣＭ復号器５
内で計算される予測係数ｇを抽出しフレーム毎に平均値
を計算して保持更新を行い、有音区間から無音区間に変
化したとき更新を停止して最終フレームの値ｉを補助予
測係数保持器７に出力する。補助予測係数保持器７は、
予測係数保持器６で計算された最終フレームの値ｉが入
力される毎に、過去の有音区間から無音区間に変化した
ときの直前の予測係数値を用いて擬似背景雑音を生成す
るための予測係数値に対して従来技術の問題点にある影
響を軽減する処理を行う。例えば、過去の最終フレーム
のデータに重み付けを与えて現在のデータの影響を抑圧
する、または、過去数フレームのデータの平均をとるこ
とにより過去のデータに重み付けを与える処理を行って
予測係数保持器６に処理後の値ｈを出力する。予測係数
保持器６はこの値ｈを無音区間のｆとして出力し、ＡＤ
ＰＣＭ復号器５はこの値ｆを用いて擬似背景雑音を生成
する。[Operation] The operation is the same as the prior art except for the operations of the prediction coefficient holder 6 and the auxiliary prediction coefficient holder 7, and the differences from the prior art will be described in detail. Prediction coefficient holder 6
Is the ADPCM decoder 5 for the voiced section according to the control signal e.
The prediction coefficient g calculated in the above is extracted, the average value is calculated for each frame, and the update is held, and when the sound section changes to the silent section, the update is stopped and the value i of the last frame is held as the auxiliary prediction coefficient. Output to the container 7. The auxiliary prediction coefficient holder 7 is
Every time the value i of the final frame calculated by the prediction coefficient holder 6 is input, the pseudo background noise is generated by using the prediction coefficient value immediately before the change from the past sound section to the silent section. A process for reducing the influence of the problem of the conventional technique on the prediction coefficient value is performed. For example, weighting the data of the last frame in the past to suppress the influence of the current data, or performing the process of weighting the past data by averaging the data of the past several frames to perform the prediction coefficient storage. The processed value h is output to 6. The prediction coefficient holder 6 outputs this value h as f in the silent section, and AD
The PCM decoder 5 uses this value f to generate pseudo background noise.

【００１７】図２は本発明の要部をなす補助予測係数保
持器７の第１の実施例を示すブロック図である。図２に
おいて、２１は平均値算出器、２２は予測係数メモリで
ある。平均値算出器２１はメモリ２２の複数回の最終フ
レームの予測係数の平均値を算出し、予測係数保持器６
に対してその平均値ｈを出力する。予測係数メモリ２２
は複数回の最終フレーム、例えば、ｔ〜ｔ−４の５フレ
ーム分の過去の予測係数値を順次記憶更新する。FIG. 2 is a block diagram showing a first embodiment of the auxiliary prediction coefficient holder 7 which forms the essential part of the present invention. In FIG. 2, 21 is an average value calculator, and 22 is a prediction coefficient memory. The average value calculator 21 calculates the average value of the prediction coefficients of the last frame of the memory 22 a plurality of times, and the prediction coefficient holder 6
, The average value h is output. Prediction coefficient memory 22
Sequentially stores and updates the past prediction coefficient values for a plurality of final frames, for example, 5 frames from t to t-4.

【００１８】〔動作〕有音区間から無音区間への変化が
検出されると、補助予測係数保持器７は予測係数保持器
６から最終フレームの予測係数平均値ｉを抽出する。メ
モリ２２は、最終フレームの予測係数平均値が入力され
る毎にメモリ内の値を更新し記憶していく。ここではｔ
は現在入力されたｉの値であり、ｔ−１は１つ過去に入
力された値、同様にｔ−４まで計５つの値が記憶されて
おり、新たな値ｉが入力される毎にそれぞれの値が順次
過去のメモリへと更新される。次に平均値算出器２１は
メモリ２２から出力される５つの値の平均値を算出し
て、その平均値ｈを予測係数保持器６に供給する。予測
係数保持器６はその値ｈを値ｆとして復号器５に与え
る。この処理により、図８の（Ｆ）の領域ｂの予測係数
は破線ｌのようになり、領域ｃでは破線ｋのように正規
の値に滑らかに近づいていき、Ｚのような影響を受けて
も領域ａ，領域ｂ，領域ｃに極端な差が生ずることな
く、従来の差ｘはｍになり、ｙはｎになり違和感が極め
て低減される。[Operation] When the change from the voiced section to the silent section is detected, the auxiliary prediction coefficient holder 7 extracts the prediction coefficient average value i of the final frame from the prediction coefficient holder 6. The memory 22 updates and stores the value in the memory every time the average value of the prediction coefficients of the final frame is input. Where t
Is the value of i that is currently input, t-1 is the value that was input one time ago, and a total of 5 values up to t-4 are stored. Each time a new value i is input, Each value is sequentially updated in the past memory. Next, the average value calculator 21 calculates the average value of the five values output from the memory 22, and supplies the average value h to the prediction coefficient holder 6. The prediction coefficient holder 6 gives the value h to the decoder 5 as the value f. By this processing, the prediction coefficient of the region b in FIG. 8F becomes as shown by the broken line l, and in the region c, it smoothly approaches the normal value as shown by the broken line k, and is affected by Z. Also, the conventional difference x becomes m and y becomes n without any extreme difference between the regions a, b, and c, and the discomfort is greatly reduced.

【００１９】次に、図３は本発明の要部をなす補助予測
係数保持器７の第２の実施例を示すブロック図である。
図３において、３１は加算器、３２，３３は乗算器，３
４は予測係数メモリである。乗算器３２には、０＜α＜
０．５の乗算値αが設定され、現在入力された最終フレ
ームの予測係数平均値ｉの重み付けを軽くする。一方の
乗算器３３には、１−α＝βとする乗算値βが設定さ
れ、加算器３１から出力された前回の予測係数平均値ｈ
の重み付けを重くするように作用する。例えば、α＝
０．０５、β＝１−α＝０．９５なる乗算値を設定する
と、加算器３１は、現在の入力値ｉの５％と前回の加算
器３１からの出力値ｈの９５％を加算して出力すること
となる。予測係数メモリ３４は、加算器３１の出力値ｈ
を順次記憶更新し、記憶した値ｐを乗算器３３に与えて
乗算値βが乗算されて次回の加算器３１の一方の入力と
する。Next, FIG. 3 is a block diagram showing a second embodiment of the auxiliary prediction coefficient holder 7 forming the essential part of the present invention.
In FIG. 3, 31 is an adder, 32 and 33 are multipliers, 3
4 is a prediction coefficient memory. The multiplier 32 has 0 <α <
A multiplication value α of 0.5 is set to lighten the weighting of the currently input prediction coefficient average value i of the last frame. A multiplication value β that sets 1−α = β is set in one of the multipliers 33, and the previous prediction coefficient average value h output from the adder 31 is set.
Acts to increase the weighting of. For example, α =
When a multiplication value of 0.05 and β = 1-α = 0.95 is set, the adder 31 adds 5% of the current input value i and 95% of the previous output value h from the adder 31. Will be output. The prediction coefficient memory 34 uses the output value h of the adder 31.
Are sequentially stored and updated, and the stored value p is given to the multiplier 33 and multiplied by the multiplication value β to be one input of the adder 31 for the next time.

【００２０】有音区間から無音区間に変化すると、補助
予測係数保持器７は予測係数保持器６から最終フレーム
の予測係数値ｉを抽出する。加算器３１、乗算器３２，
３３では次の式（１）の計算がされる。When the voiced section is changed to the silent section, the auxiliary prediction coefficient holder 7 extracts the prediction coefficient value i of the final frame from the prediction coefficient holder 6. Adder 31, multiplier 32,
At 33, the following equation (1) is calculated.

【数１】ｈ＝ｐ×０．９５＋ｉ×０．０５（１）つまり、ｈは前回のｉの値を含んだ予測係数値となり、
現在入力されたｉの値には大きくは左右されることはな
い。このことは、仮にある時点のｉの値が急激に変わっ
ても、出力されるｈの値にはあまり影響することはな
く、また適切な値に落ち着いていく。予測係数メモリ３
４は新しいｈを更新記憶する。初期値としてはここでは
０としている（短時間で適切な値に落ち着く。）この処理により、図２の第１の実施例と同様に過去の値
によって現在の値の急激な変動が抑えられるという効果
が得られる。## EQU00001 ## h = p.times.0.95 + i.times.0.05 (1) That is, h is the prediction coefficient value that includes the value of the previous i,
It does not greatly depend on the value of i currently input. This means that even if the value of i at a certain point changes abruptly, it will not affect the output value of h so much and will settle to an appropriate value. Prediction coefficient memory 3
4 updates and stores the new h. The initial value is set to 0 here (it settles to an appropriate value in a short time.) By this processing, it is said that the rapid change of the present value is suppressed by the past value as in the first embodiment of FIG. The effect is obtained.

【００２１】[0021]

【発明の効果】以上詳細に説明したように、過去の最終
フレームの予測係数を利用して擬似背景雑音を生成する
ことにより、短時間の急激な変化をする信号等が入って
来た場合もその信号の影響をそのまま受けることがなく
音声に重畳している背景雑音とほぼ同じスペクトラム情
報をもった擬似背景雑音を生成することができるため、
受話者に与える違和感が軽減される。As described in detail above, the pseudo background noise is generated by using the prediction coefficient of the last frame in the past, so that even when a signal or the like which changes abruptly in a short time comes in. Since it is possible to generate pseudo background noise having almost the same spectrum information as the background noise superimposed on the voice without being affected by the signal as it is,
The feeling of strangeness given to the listener is reduced.

[Brief description of drawings]

【図１】本発明の実施例を示す音声復号装置の構成例図
である。FIG. 1 is a configuration example diagram of a speech decoding apparatus showing an embodiment of the present invention.

【図２】本発明の図１の補助予測係数保持器の第１の実
施例を示す構成例図である。FIG. 2 is a configuration diagram showing a first embodiment of the auxiliary prediction coefficient holder shown in FIG. 1 of the present invention.

【図３】本発明の図１の補助予測係数保持器の第２の実
施例を示す構成例図である。FIG. 3 is a configuration diagram showing a second embodiment of the auxiliary prediction coefficient holder shown in FIG. 1 of the present invention.

【図４】従来の音声復号装置の構成例図である。FIG. 4 is a diagram showing a configuration example of a conventional speech decoding device.

【図５】ＡＤＰＣＭ符号化方式の送信側符号化器のブロ
ック図である。FIG. 5 is a block diagram of a transmission side encoder of the ADPCM encoding system.

【図６】入力信号と予測信号の波形例図である。FIG. 6 is a waveform example diagram of an input signal and a prediction signal.

【図７】擬似背景雑音に対する予測係数を説明する波形
例図である。FIG. 7 is a waveform example diagram illustrating a prediction coefficient for pseudo background noise.

【図８】外乱がある場合の擬似背景雑音に対する予測係
数を説明する波形例図である。FIG. 8 is a waveform example diagram illustrating a prediction coefficient for pseudo background noise when there is a disturbance.

[Explanation of symbols]

１アンテナ２受信復調器３多重分離器４制御器５ＡＤＰＣＭ復号器６予測係数保持器７補助予測係数保持器８スピーカａ復調信号ｂ音声検出フラグｃＡＤＰＣＭ符号化信号ｅ制御信号ｆ擬似敗勢雑音に与える予測係数値ｇ予測係数２１平均値算出器２２予測係数メモリ３１加算器３２乗算器３３乗算器３４予測係数メモリ５１均一ＰＣＭ変換器５２減算器５３適応予測器５４適応量子化器５５加算器５６適応逆量子化器ｊ予測信号ｍ量子化差分信号ｎ再生信号 1 antenna 2 reception demodulator 3 demultiplexer 4 controller 5 ADPCM decoder 6 prediction coefficient holder 7 auxiliary prediction coefficient holder 8 speaker a demodulation signal b voice detection flag c ADPCM coded signal e control signal f pseudo spurious noise Prediction coefficient value given to g g Prediction coefficient 21 Average value calculator 22 Prediction coefficient memory 31 Adder 32 Multiplier 33 Multiplier 34 Prediction coefficient memory 51 Uniform PCM converter 52 Subtractor 53 Adaptive predictor 54 Adaptive quantizer 55 Adder 56 Adaptive inverse quantizer j Prediction signal m Quantization difference signal n Reproduction signal

Claims

[Claims]

1. A voiced section including an adaptive differential PCM coded signal of a voice signal, a voiced period and a background noise period of a fixed time immediately after the voiced period is changed to a voiceless period and a next voiced section. A reception demodulator for receiving and demodulating an intermittent transmission signal in which a voice detection flag indicating any one of the silent sections is subjected to multiple modulation, and only the voiced section is transmitted and transmission in the silent section is stopped; A demultiplexer that separates an output into an adaptive differential PCM coded signal and a voice detection flag, and control that outputs a control signal indicating either a voiced section or a silent section according to the voice detection flag output from the demultiplexer And an adaptive difference PC of a voiced section output from the demultiplexer
When the control signal from the controller indicates a voiced section, the M coded signal is decoded using a prediction coefficient and is output as reproduced voice, and when the control signal indicates a silent section, a random code generated internally is output. An ADPCM decoder which decodes according to an average value of prediction coefficients given from the outside and outputs pseudo background noise as reproduction noise in a silent section; and when the control signal from the controller indicates a sound section, the ADPCM decoder
The prediction coefficient of the PCM decoder is extracted, the average value for each frame is obtained and stored and updated, and when the control signal from the controller indicates a silent section, the storage and update are stopped and the immediately preceding background noise period In a speech decoding device provided with a prediction coefficient holder for giving a prediction coefficient to the ADPCM decoder, the prediction coefficient average value of the background noise period obtained by the prediction coefficient holder is extracted and stored, and a past value is weighted. To the ADP to give the prediction coefficient obtained by
A speech decoding apparatus, characterized in that an auxiliary prediction coefficient holder for setting a prediction coefficient average value in a silent section for a CM decoder is provided.

2. The auxiliary prediction coefficient holder retains the prediction coefficient of the background noise period among the prediction coefficients for which the prediction coefficient holder has calculated the average value for each frame over a plurality of past times, and sequentially updates and stores it. The prediction coefficient memory and an average value calculator that calculates an average value of a plurality of values stored in the prediction coefficient memory and uses the prediction coefficient as a prediction coefficient to be given to the prediction coefficient holder. Voice decoding device.

3. The auxiliary prediction coefficient holder according to claim 1,
A first multiplier that multiplies the prediction coefficient of the background noise period of the frame extracted each time the prediction coefficient holder calculates the average value for each frame, by using a positive number less than 0.5 as a first multiplication value; , An adder for adding the input value from the first multiplier and the other input value to obtain a prediction coefficient given to the prediction coefficient holder, and a prediction for updating and storing the output of the adder A coefficient memory and a second multiplier for multiplying an output value of the prediction coefficient memory by subtracting the first multiplication value from 1 as a second multiplication value and setting the other input value of the adder. The speech decoding apparatus according to claim 1, further comprising: