JP2002198870A

JP2002198870A - Echo processing device

Info

Publication number: JP2002198870A
Application number: JP2000399136A
Authority: JP
Inventors: Masaya Takahashi; 真哉高橋; Tadashi Yamaura; 正山浦; Hirohisa Tazaki; 裕久田崎; Fumihiro Matsuoka; 文啓松岡
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2000-12-27
Filing date: 2000-12-27
Publication date: 2002-07-12

Abstract

PROBLEM TO BE SOLVED: To provide an echo processing device in which a pseudo background noise with high quality is generated by a simple process. SOLUTION: The device is provided with a sound coding means for generating and inputting coded data equivalent to a background noise by using an analytic parameter for coding obtained in an analytic frame to be previously judged as a noise section in an analytic frame judged as a frame during which only a near end background noise component and an echo signal are contained in a transmission signal.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、固定電話、車載
電話および携帯電話などの音声通信において、通信路や
スピーカとマイク間の反響路で生じる、送信音声信号に
含まれるエコーを低減するエコー処理装置に関するもの
である。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an echo processing for reducing an echo included in a transmission audio signal, which is generated in a communication path or an echo path between a speaker and a microphone, in voice communication such as a fixed telephone, a vehicle-mounted telephone, and a portable telephone. It concerns the device.

【０００２】[0002]

【従来の技術】従来のエコー処理装置には特開平１１−
１７５８９号公報に開示されたものがあった。図７は特
開平１１−１７５８９号公報に開示された従来のエコー
処理装置を簡略化して示したブロック図であり、説明の
ためエコー処理装置本体に加え、送信信号を符号化する
音声符号化手段と、受信信号を復号化する音声復号化手
段及び、遠端側で音声信号を符号化する音声符号化手段
と音声信号を復号化する音声復号化手段を付加してい
る。2. Description of the Related Art A conventional echo processing apparatus is disclosed in
No. 17589 was disclosed. FIG. 7 is a simplified block diagram of a conventional echo processing apparatus disclosed in Japanese Patent Application Laid-Open No. 11-17589. For the sake of explanation, in addition to the echo processing apparatus main body, audio encoding means for encoding a transmission signal Voice decoding means for decoding the received signal, voice coding means for coding the voice signal at the far end, and voice decoding means for decoding the voice signal.

【０００３】図７において、１は擬似エコー生成手段、
２と７は加算手段、３はエコーキャンセラー手段、４と
５は有音無音判定手段、６はエコーサプレス手段、８は
擬似背景騒音生成手段であり、これ等によりエコー処理
装置本体を構成している。９は音声符号化手段、１０は
音声復号化手段であり、これ等により音声符号化復号化
手段１１を構成している。また、１８は遠端側の音声復
号化手段、１９は遠端側の音声符号化手段である。In FIG. 7, reference numeral 1 denotes a pseudo echo generating means,
2 and 7 are addition means, 3 is an echo canceller means, 4 and 5 are sound / silence determination means, 6 is echo suppression means, 8 is pseudo background noise generation means, and these constitute an echo processing apparatus main body. I have. Reference numeral 9 denotes an audio encoding unit, and reference numeral 10 denotes an audio decoding unit. These components constitute an audio encoding / decoding unit 11. Reference numeral 18 denotes a far-end speech decoding unit, and reference numeral 19 denotes a far-end speech encoding unit.

【０００４】次に動作について説明する。遠端側の話者
が発声した音声信号は音声符号化手段１９で符号化さ
れ、通信路を経由して符号化データＣＲとして近端側の
音声復号化手段１０に入力される。音声復号化手段１０
は符号化データＣＲを復号し、アナログ信号として受信
信号ＲＩ（ｉ）を出力する。有音無音判定手段５は受信
信号ＲＩ（ｉ）のパワーによって受信信号ＲＩ（ｉ）が
有音であるか無音であるかを判定し、その結果をエコー
サプレス手段６と擬似背景騒音生成手段８に出力する。
また、受信信号ＲＩ（ｉ）はそのまま受信信号ＲＯ
（ｉ）となり、反響路を含む外部に出力される。Next, the operation will be described. The speech signal uttered by the far end speaker is encoded by the speech encoding unit 19 and input to the near end speech decoding unit 10 as encoded data CR via a communication path. Voice decoding means 10
Decodes the encoded data CR and outputs a received signal RI (i) as an analog signal. The sound / silence determining means 5 determines whether the received signal RI (i) is sound or no sound based on the power of the received signal RI (i), and determines the result as the echo suppressor 6 and the pseudo background noise generating means 8. Output to
Also, the received signal RI (i) is directly received signal RO
(I), which is output to the outside including the echo path.

【０００５】一方、エコーキャンセラー手段３に入力さ
れた送信信号ＳＩ（ｉ）には、アナログ信号に変換され
た受信信号ＲＯ（ｉ）が反響路を通じてエコーとなって
入力されるエコー信号と、近端話者が発声する音声信号
と、近端の背景騒音とが含まれる。エコーキャンセラー
手段３は反響路の伝達特性を推定して擬似エコー生成手
段１で擬似エコー信号を生成し、受信信号ＳＩ（ｉ）か
ら加算手段２を用いて擬似エコー信号を差し引き、残差
信号ＳＡ（ｉ）を求める。有音無音判定手段４は残差信
号ＳＡ（ｉ）のパワーによって、残差信号ＳＡ（ｉ）が
有音か無音かを判定し、判定結果をエコーサプレス手段
６と擬似背景騒音生成手段８に出力する。なお、パワー
はサンプルの二乗和を求めることで算出する。On the other hand, the transmission signal SI (i) input to the echo canceller means 3 includes a reception signal RO (i) converted into an analog signal and an echo signal input as an echo through an echo path. The speech signal uttered by the end speaker and the background noise at the near end are included. The echo canceller means 3 estimates the transfer characteristic of the echo path, generates a pseudo echo signal by the pseudo echo generation means 1, subtracts the pseudo echo signal from the received signal SI (i) by using the addition means 2, and generates a residual signal SA. Find (i). The voiced / silence determining means 4 determines whether the residual signal SA (i) is voiced or silent based on the power of the residual signal SA (i), and sends the determination result to the echo suppressor 6 and the pseudo background noise generating means 8. Output. The power is calculated by calculating the sum of squares of the samples.

【０００６】エコーサプレス手段６は、受信信号ＲＩ
（ｉ）が有音無音判定手段５で有音と判定され、残差信
号ＳＡ（ｉ）が有音無音判定手段４で無音と判定される
区間は、残差信号ＳＡ（ｉ）に含まれる信号はエコー信
号のみであると判断し、残差信号ＳＡ（ｉ）の振幅を抑
圧することでエコー信号を抑圧する。また、有音無音判
定手段５の判定結果と有音無音判定手段４の判定結果が
両方とも有音の場合や有音無音判定手段５の判定結果が
無音の場合はＳＡ（ｉ）には振幅の抑圧は行わない。The echo suppressor 6 receives the received signal RI
The section where (i) is determined to be sound by the sound / silence determination means 5 and the residual signal SA (i) is determined to be silent by the sound / silence determination means 4 is included in the residual signal SA (i). It is determined that the signal is only an echo signal, and the echo signal is suppressed by suppressing the amplitude of the residual signal SA (i). If both the determination result of the voiced / silent determination means 5 and the determination result of the voiced / silence determination means 4 are sound, or if the determination result of the voiced / silence determination means 5 is silent, the amplitude is SA (i). Is not suppressed.

【０００７】擬似背景騒音生成手段８は受信信号ＲＩ
（ｉ）が有音無音判定手段５で無音と判定された区間の
スペクトルパラメータ（線形予測係数）を算出し保存し
ておく。そして、エコーサプレス手段６が残差信号ＳＡ
（ｉ）の振幅を抑圧している区間では、無音区間で求め
たスペクトルパラメータとホワイトノイズ（白色雑音と
称され、周波数スペクトルの形状がフラットな雑音であ
る）を用いた合成フィルタ処理を行って擬似背景騒音を
生成する。この擬似背景騒音はエコーサプレス手段６で
振幅抑圧された信号に加算手段７を介して加算され、加
算手段７から送信信号ＳＯ（ｉ）が得られる。[0007] The pseudo background noise generating means 8 receives the received signal RI.
(I) Calculates and stores the spectrum parameter (linear prediction coefficient) of the section determined to be silent by the voiced / silent determining means 5. Then, the echo suppressor 6 outputs the residual signal SA.
In the section in which the amplitude of (i) is suppressed, a synthesis filter process is performed using the spectrum parameters obtained in the silent section and white noise (referred to as white noise, which has a flat frequency spectrum shape). Generates pseudo background noise. This pseudo background noise is added to the signal whose amplitude has been suppressed by the echo suppressor 6 via the adder 7, and a transmission signal SO (i) is obtained from the adder 7.

【０００８】この処理により、エコ−サプレス手段６で
一旦エコー信号と共に抑圧された背景騒音が、実際の背
景騒音に近いスペクトル特性を持つ擬似背景騒音によっ
て埋め合わされるので、背景騒音の抑圧で生じる聴覚上
の違和感が軽減される。次に送信信号ＳＯ（ｉ）は音声
符号化手段９で符号化され、符号化データＣＳとして出
力される。符号化データＣＳは通信路を経由して遠端側
の音声復号化手段１８に入力される。音声復号化手段１
８は符号化データを復号化して音声信号を出力する。By this processing, the background noise once suppressed together with the echo signal by the eco-suppressing means 6 is compensated for by the pseudo background noise having a spectral characteristic close to that of the actual background noise. The above discomfort is reduced. Next, the transmission signal SO (i) is encoded by the audio encoding means 9 and output as encoded data CS. The encoded data CS is input to the voice decoding means 18 on the far end side via the communication path. Voice decoding means 1
Numeral 8 decodes the encoded data and outputs an audio signal.

【０００９】[0009]

【発明が解決しようとする課題】従来のエコー処理装置
は上記のように構成されているので、擬似背景騒音を生
成する際、スペクトルパラメータを算出する処理と、こ
のスペクトルパラメータとホワイトノイズから擬似背景
騒音を生成する合成フィルタ処理を行う必要があり、処
理が複雑で演算量が大きいという課題があった。Since the conventional echo processing apparatus is configured as described above, when generating a pseudo background noise, a process of calculating a spectrum parameter and a process of calculating a pseudo background from the spectrum parameter and white noise are performed. It is necessary to perform synthesis filter processing for generating noise, and there is a problem that the processing is complicated and the amount of calculation is large.

【００１０】この発明は上記のような従来の課題を解決
するためになされたもので、簡易な処理で品質の良い擬
似背景騒音を生成するエコー処理装置を提供することを
目的とする。SUMMARY OF THE INVENTION The present invention has been made to solve the above-mentioned conventional problems, and has as its object to provide an echo processing apparatus which generates high-quality pseudo background noise by simple processing.

【００１１】[0011]

【発明が解決するための手段】この発明に係るエコー処
理装置は、送信信号には近端の背景騒音成分と受信信号
が反響路によって反響したエコー信号のみが存在するこ
とを判定するエコー判定手段と、送信信号を符号化する
と共に、エコー判定手段において、送信信号には近端の
背景騒音成分とエコー信号のみが存在すると判定された
分析フレームでは、予め騒音区間と判定される分析フレ
ームで求めた符号化用の分析パラメータを用い、背景騒
音に相当する符号化データを生成し出力する音声符号化
手段とを備えたものである。An echo processing apparatus according to the present invention is characterized in that an echo determining means determines that only a near-end background noise component and a received signal are echoed by an echo path in a transmission signal. And the transmission signal is encoded, and in the analysis frame in which the echo determination means determines that only the near-end background noise component and the echo signal are present in the transmission signal, the analysis frame determined in advance is a noise section. Speech encoding means for generating and outputting encoded data corresponding to background noise using the encoded analysis parameters.

【００１２】この発明に係るエコー処理装置は、音声符
号化手段の音声符号化方式がＣＥＬＰ方式であり、背景
騒音に相当する符号化データが線形予測係数と適応音源
符号と駆動音源符号とこれら両音源符号のゲインで構成
されているものである。In the echo processing apparatus according to the present invention, the speech encoding method of the speech encoding means is the CELP method, and the encoded data corresponding to the background noise is a linear prediction coefficient, an adaptive excitation code, a driving excitation code, and both of them. It is composed of the gain of the excitation code.

【００１３】この発明に係るエコー処理装置は、エコー
判定手段において、送信信号には近端の背景騒音成分と
エコー信号のみが存在すると判定された分析フレームで
は、予め騒音区間と判定される複数の分析フレームで求
めた複数の符号化されたスペクトルパラメータをランダ
ムに選んで順次出力するものである。In the echo processing apparatus according to the present invention, in the analysis frame in which only the near-end background noise component and the echo signal are present in the transmission signal in the echo determination means, a plurality of analysis frames previously determined to be noise sections are provided. A plurality of coded spectral parameters obtained in the analysis frame are randomly selected and sequentially output.

【００１４】この発明に係るエコー処理装置は、エコー
判定手段において、送信信号には近端の背景騒音成分と
エコー信号のみが存在すると判定された分析フレームで
は、予め騒音区間と判定される複数の分析フレームで求
めた複数のスペクトルパラメータをランダムに選んで符
号化して順次出力するものである。In the echo processing apparatus according to the present invention, in the analysis frame in which the echo determination means determines that only the near-end background noise component and the echo signal are present in the transmission signal, a plurality of the noise frames are determined in advance as noise sections. A plurality of spectral parameters obtained in the analysis frame are randomly selected, encoded, and sequentially output.

【００１５】この発明に係るエコー処理装置は、エコー
判定手段において、送信信号には近端の背景騒音成分と
エコー信号のみが存在すると判定された分析フレームで
は、予め騒音区間と判定される複数の分析フレームにお
けるスペクトルパラメータを平均化し、平均化したスペ
クトルパラメータの各次元の値を分析フレーム毎に揺ら
がせて符号化して順次出力するものである。In the echo processing apparatus according to the present invention, in the analysis frame in which only the near-end background noise component and the echo signal are present in the transmission signal in the echo determination means, a plurality of noise sections previously determined as noise sections are provided. The spectrum parameters in the analysis frame are averaged, and the values of each dimension of the averaged spectrum parameter are fluctuated for each analysis frame, encoded, and sequentially output.

【００１６】この発明に係るエコー処理装置は、選択さ
れたスペクトルパラメータが求められた分析フレームと
同じ分析フレームで求めた適応音源符号と駆動音源符号
及び両音源符号に対応する符号化されたゲインを選択
し、順次出力するものである。According to the echo processing apparatus of the present invention, the adaptive excitation code, the driving excitation code, and the coded gains corresponding to both excitation codes, which are obtained in the same analysis frame in which the selected spectral parameter is obtained, are obtained. And outputs them sequentially.

【００１７】この発明に係るエコー処理装置は、エコー
判定手段において、送信信号には近端の背景騒音成分と
エコー信号のみが存在すると判定された分析フレームで
は、予め騒音区間と判定される複数の分析フレームで求
めた複数の適応音源符号及びそれぞれの適応音源符号に
対応する符号化されたゲインのセットをランダムに選ん
で順次出力し、また予め騒音区間と判定される複数の分
析フレームで求めた複数の駆動音源符号及びそれぞれの
駆動音源符号に対応する符号化されたゲインのセットを
ランダムに選んで順次出力するものである。In the echo processing apparatus according to the present invention, in the analysis frame in which the echo determination means determines that only the near-end background noise component and the echo signal are present in the transmission signal, a plurality of the noise frames are determined in advance as noise sections. A plurality of adaptive excitation codes determined in the analysis frame and a set of encoded gains corresponding to the respective adaptive excitation codes are randomly selected and sequentially output, and also determined in a plurality of analysis frames previously determined to be noise sections. A plurality of drive excitation codes and a set of coded gains corresponding to the respective drive excitation codes are randomly selected and sequentially output.

【００１８】この発明に係るエコー処理装置は、適応音
源符号のゲインと駆動音源符号のゲインは符号化する前
の値を求めて保持し、選択される都度符号化して出力す
るものである。In the echo processing apparatus according to the present invention, the gain of the adaptive excitation code and the gain of the driving excitation code are obtained and held before encoding, and are encoded and output each time they are selected.

【００１９】この発明に係るエコー処理装置は、予め騒
音区間と判定される複数の分析フレームで求めた目標信
号の平均パワーを基準に、選択された適応音源符号と駆
動音源符号のゲインを修正して符号化して出力するもの
である。The echo processing apparatus according to the present invention corrects the gains of the selected adaptive excitation code and driving excitation code based on the average power of the target signal obtained in advance in a plurality of analysis frames determined as noise sections. And outputs it.

【００２０】この発明に係るエコー処理装置は、予め騒
音区間と判定される分析フレームで駆動音源符号のゲイ
ンを求める際、適応音源符号のゲインはゼロ乃至ゼロに
近い値を代入した後に駆動音源符号のゲインを求めるも
のである。In the echo processing apparatus according to the present invention, when the gain of the driving excitation code is obtained in an analysis frame which is determined to be a noise section in advance, the adaptive excitation code gain is set to a value between zero and a value close to zero. Is obtained.

【００２１】この発明に係るエコー処理装置は、エコー
判定手段において送信信号には近端の背景騒音成分とエ
コー信号のみが存在すると判定された分析フレームで
は、ＤＴＸ（ＤｉｓｃｏｎｔｉｎｕｏｕｓＴｒａｓｍ
ｉｓｓｉｏｎ）における無音フラグを出力すると共に、
ＤＴＸにおける無音区間処理のモードで内部動作する音
声符号化手段を備えるものである。In the echo processing apparatus according to the present invention, a DTX (Discontinuous Transaction) is used for an analysis frame in which only a near-end background noise component and an echo signal are present in the transmission signal in the echo determination means.
issue) and output the silence flag in
It is provided with a speech encoding unit that operates internally in a silent section processing mode in DTX.

【００２２】この発明に係るエコー処理装置は、予め騒
音区間と判定される複数の分析フレームで求めた複数の
スペクトルパラメータをランダムに選んで平均化して符
号化し、順次出力する音声符号化手段を備えるものであ
る。The echo processing apparatus according to the present invention comprises a voice coding means for randomly selecting, averaging and coding a plurality of spectral parameters obtained in advance in a plurality of analysis frames determined to be noise sections, and sequentially outputting them. Things.

【００２３】この発明に係るエコー処理装置は、符号化
手段を備え、この符号化手段は、エコー判定手段の判定
結果に基づいて、送信信号中にエコー成分が多く含まれ
ている場合には、送信信号の符号化データの代わりに背
景騒音に相当する予め記憶された符号化データを送信デ
ータとして出力する背景騒音生成手段を備えたものであ
る。The echo processing apparatus according to the present invention includes an encoding unit. The encoding unit, based on a result of the determination by the echo determining unit, includes: The image processing apparatus further includes a background noise generation unit that outputs coded data corresponding to background noise stored in advance instead of coded data of the transmission signal as transmission data.

【００２４】[0024]

【発明の実施の形態】実施の形態１．図１は、この発明
に係るエコー処理装置の構成を示すブロック図である。
また、図２は図１の音声符号化手段の詳細構成を示すブ
ロック図である。従来のエコー処理装置は擬似背景騒音
生成手段を設け、擬似背景騒音を生成する処理を行って
いたが、図１に示す実施の形態１によるエコー処理装置
では、音声符号化手段１３において擬似背景騒音に相当
する符号化データを生成するものである。DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiment 1 FIG. 1 is a block diagram showing a configuration of an echo processing device according to the present invention.
FIG. 2 is a block diagram showing a detailed configuration of the audio encoding unit in FIG. The conventional echo processing apparatus has provided a pseudo-background noise generating means to perform processing for generating pseudo-background noise. However, in the echo processing apparatus according to the first embodiment shown in FIG. Is generated.

【００２５】以下、図１と図２を用いて、この発明の実
施の形態１について説明する。図１において、図７に示
す符号と同一の符号は、同一または相当部分を示すの
で、動作が同じものについては説明を省略する。１２は
エコー判定手段、１３はエコー判定手段１２の判定結果
と有音無音判定手段４の判定結果を入力する音声符号化
手段である。The first embodiment of the present invention will be described below with reference to FIGS. In FIG. 1, the same reference numerals as those shown in FIG. 7 indicate the same or corresponding parts, and therefore, description of the same operations will be omitted. Reference numeral 12 denotes an echo determination unit, and 13 denotes a voice encoding unit that inputs the determination result of the echo determination unit 12 and the determination result of the voiced / silence determination unit 4.

【００２６】また、音声符号化手段１３の具体的構成を
示す図２において、５１は適応音源符号帳、５２は駆動
音源符号帳、５３および５４はアンプ、５５は加算手
段、５６は線形予測分析手段、５７は合成フィルタ手
段、５８は加算手段、５９は音源探索手段、６０は背景
騒音生成手段、６１は多重化手段である。In FIG. 2 showing a specific configuration of the speech coding means 13, reference numeral 51 denotes an adaptive excitation codebook, 52 denotes a driving excitation codebook, 53 and 54 denote amplifiers, 55 denotes addition means, and 56 denotes linear prediction analysis. Means, 57 is a synthesis filter means, 58 is an addition means, 59 is a sound source search means, 60 is a background noise generation means, and 61 is a multiplexing means.

【００２７】図１において、エコー判定手段１２は音声
復号化手段１０から出力された受信信号ＲＩ（ｉ）と、
エコーキャンセラー手段３から出力された残差信号ＳＡ
（ｉ）を入力し、分析フレーム毎にそれぞれのパワーを
ＲＰ，ＳＡＰとして求める。そして、例えば（１）式に
示す条件が成り立つか否かを判定する。In FIG. 1, echo determination means 12 receives a received signal RI (i) output from speech decoding means 10,
Residual signal SA output from echo canceller means 3
(I) is input, and the respective powers are obtained as RP and SAP for each analysis frame. Then, for example, it is determined whether or not the condition shown in Expression (1) is satisfied.

【００２８】ＲＰ＞ＳＡＰで且つＳＡＰ＜ＴＨ・・（１）ＲＰ，ＳＡＰは単なる記号である。ＴＨは予め設定した
固定の閾値である。（１）式の条件が成立する分析フレ
ームでは、残差信号ＳＡ（ｉ）にはエコー信号のみが含
まれていると判定し（以降、“エコー残留フレーム”と
呼ぶ）、判定結果を音声符号化手段１３に出力する。RP> SAP and SAP <TH (1) RP and SAP are simply symbols. TH is a fixed threshold value set in advance. In the analysis frame satisfying the condition of the expression (1), it is determined that only the echo signal is included in the residual signal SA (i) (hereinafter, referred to as “echo residual frame”), and the determination result is expressed as a speech code. Output to the converting means 13.

【００２９】有音無音判定手段４は残差信号ＳＡ（ｉ）
が有音であるか無音であるかを例えば残差信号ＳＡ
（ｉ）のパワーやスペクトルを求めて判定し、その判定
結果を音声符号化手段１３に出力する。なお、パワーは
サンプルの二乗和を求めることによって算出する。The sound / non-speech determining means 4 outputs the residual signal SA (i)
Is a sound or a silence, for example, the residual signal SA
The power and spectrum of (i) are obtained and determined, and the result of the determination is output to the speech encoding unit 13. The power is calculated by calculating the sum of squares of the samples.

【００３０】音声符号化手段１３は例えば図２で示すよ
うな携帯電話等の標準音声コーデック方式で非常に良く
使われるＣＥＬＰ符号化方式で構成され、分析フレーム
毎に残差信号ＳＡ（ｉ）を符号化すると共に、有音無音
判定手段４から出力された有音無音判定結果とエコー判
定手段１２から出力されたエコー判定結果に従い、擬似
背景騒音に相当する符号化データを生成する。The speech encoding means 13 is constituted by a CELP encoding scheme which is very often used in a standard speech codec scheme for a cellular phone or the like as shown in FIG. 2, and converts the residual signal SA (i) for each analysis frame. In addition to encoding, the encoded data corresponding to the pseudo background noise is generated according to the voiced / silent determination result output from the voiced / non-voiced determination unit 4 and the echo determination result output from the echo determination unit 12.

【００３１】以下、図２を用い、音声符号化手段１３が
擬似背景騒音に相当する符号化データを生成する動作に
ついて詳しく説明する。The operation of the speech encoding means 13 for generating encoded data corresponding to pseudo background noise will be described in detail below with reference to FIG.

【００３２】図２において、背景騒音生成手段６０以外
は一般的なＣＥＬＰ符号化方式の符号化部の構成と同様
の構成であるので、先ず背景騒音生成手段６０以外の動
作（即ちＣＥＬＰ符号化方式の基本動作）を簡単に説明
する。In FIG. 2, since the configuration other than the background noise generating means 60 is the same as the configuration of the coding section of the general CELP coding method, first, the operation other than the background noise generating means 60 (ie, the CELP coding method) ) Will be briefly described.

【００３３】線形予測分析手段５６は残差信号ＳＡ
（ｉ）を入力し、分析フレーム単位に線形予測分析を行
い、スペクトルパラメータとして例えばＬＳＰ(Ｌｉｎ
ｅＳｐｅｃｔｒｕｍＰａｉｒ)を求めて量子化し合
成フィルタ手段５７に出力する。また、量子化したＬＳ
Ｐを符号化して背景騒音生成手段６０と多重化手段６１
に出力する。適応音源符号帳５１は過去の音源信号を蓄
えている。この適応音源符号帳５１内の音源信号は、ラ
グと呼ばれる長さ（可変長）で切り出され、切り出した
信号をサブフレーム長になるまでラグ周期で繰り返して
適応音源が生成される。また駆動音源符号帳５２は複数
の雑音信号ベクトルで構成されている。The linear prediction analysis means 56 outputs the residual signal SA
(I), a linear predictive analysis is performed for each analysis frame, and for example, LSP (Lin
e Spectrum Pair) is obtained and quantized and output to the synthesis filter means 57. Also, the quantized LS
P and encodes the background noise generation means 60 and the multiplexing means 61
Output to Adaptive excitation codebook 51 stores past excitation signals. The excitation signal in adaptive excitation codebook 51 is cut out at a length (variable length) called a lag, and the cut-out signal is repeated at a lag cycle until the subframe length is reached, to generate an adaptive excitation. Driving excitation codebook 52 is composed of a plurality of noise signal vectors.

【００３４】適応音源符号帳５１の適応音源と駆動音源
符号帳５２の駆動音源はサブフレーム長単位で順次読み
出され、それぞれアンプ５３，５４でゲインを与えられ
て増幅された後、加算手段５５で加算されて音源信号と
なる。合成フィルタ手段５７は上記線形予測分析手段５
６からのＬＳＰと加算手段５５からの音源信号を用いて
合成音声を合成する。加算手段５８は合成フイルタ手段
５７で合成された合成音声と残差信号ＳＡ（ｉ）との加
算を行い歪みを求める。音源探索手段５９はその歪みが
最小になる場合の適応音源のラグと駆動音源符号および
それらに対するゲインを探索して符号化し、背景騒音生
成手段６０と多重化手段６１に出力する。The adaptive excitation of the adaptive excitation codebook 51 and the driving excitation of the driving excitation codebook 52 are sequentially read out in units of subframe lengths, amplified by amplifiers 53 and 54 and given gains, respectively. And a sound source signal is obtained. The synthesis filter means 57 is provided for the linear prediction analysis means 5.
The synthesized speech is synthesized by using the LSP from S.6 and the sound source signal from the adding means 55. The addition means 58 adds the synthesized speech synthesized by the synthesis filter means 57 and the residual signal SA (i) to obtain distortion. The sound source searching means 59 searches and encodes the lag of the adaptive sound source and the driving sound source code and the gain for them when the distortion is minimized, and outputs the lag to the background noise generating means 60 and the multiplexing means 61.

【００３５】多重化手段６１はこれらの符号化データを
多重化し、符号化データＣＳとして通信路に出力する。
符号化データＣＳは通信路を経て遠端側の音声復号化手
段１８に伝達され、音声復号化手段１８で復号化されて
復号化信号が得れられる。The multiplexing means 61 multiplexes these coded data and outputs them as coded data CS to the communication path.
The coded data CS is transmitted to the voice decoding means 18 on the far end via the communication path, and is decoded by the voice decoding means 18 to obtain a decoded signal.

【００３６】以上がＣＥＬＰ符号化方式の動作である。
次に背景騒音生成手段６０の動作を説明する。背景騒音
生成手段６０は上記で説明した符号化データ以外に、図
１の有音無音判定手段４から出力される有音無音判定結
果とエコー判定手段１２から出力されるエコー判定結果
を入力する。The above is the operation of the CELP coding system.
Next, the operation of the background noise generating means 60 will be described. The background noise generation unit 60 receives, in addition to the encoded data described above, the voiced / silent determination result output from the voiced / silence determination unit 4 in FIG. 1 and the echo determination result output from the echo determination unit 12.

【００３７】先ず背景騒音生成手段６０は、予め有音無
音判定手段４から出力される有音無音判定結果が無音で
ある分析フレームで求まったＬＳＰを例えば図３（ａ）
に示すように最近６フレーム分常に蓄えておく。また、
有音無音判定結果が無音である上記と同じ分析フレーム
で求められた適応音源とゲインの符号及び駆動音源とゲ
インの符号を図３（ｂ），（ｃ）に示すように最近１２
サブフレーム分（１分析フレームは２サブフレームに相
当）常に蓄えておく。First, the background noise generating means 60 calculates the LSP previously obtained from the voiced / silent determining means 4 output from the voiced / silent determining means 4 in the analysis frame in which the voice is silent, for example, as shown in FIG.
As shown in (1), the last six frames are always stored. Also,
As shown in FIGS. 3 (b) and 3 (c), the adaptive sound source and the sign of the gain and the sign of the driving sound source and the gain obtained in the same analysis frame in which the result of the sound / silence determination is silence are the latest 12 as shown in FIGS.
Subframes (one analysis frame corresponds to two subframes) are always stored.

【００３８】そして、エコー判定手段１２の判定結果が
エコー残留フレームとされた分析フレームでは、背景騒
音生成手段６０は以下に説明する動作を行う。このと
き、先に説明したＣＥＬＰ符号化の基本動作による音声
符号化処理は行われない。Then, in the analysis frame in which the determination result of the echo determination means 12 is an echo residual frame, the background noise generation means 60 performs the operation described below. At this time, the speech encoding process by the basic operation of the CELP encoding described above is not performed.

【００３９】背景騒音生成手段６０は、エコー残留フレ
ームでは先ず、蓄えた６つのＬＳＰから乱数を用いてラ
ンダムに一つ選出し（図３参照）、多重化手段６１に出
力する。次に、選出されたＬＳＰが分析された分析フレ
ームに対応する２つのサブフレームにおける適応音源符
号とそのゲインの符号及び駆動音源符号とそのゲインの
符号のセットを選出し（図３（ｂ），（ｃ）の点線参
照）、多重化手段６１に出力する。The background noise generating means 60 first randomly selects one of the stored six LSPs from the stored six LSPs using random numbers (see FIG. 3), and outputs it to the multiplexing means 61. Next, a set of an adaptive excitation code and its gain code and a drive excitation code and its gain code in two subframes corresponding to the analysis frame in which the selected LSP is analyzed are selected (FIG. 3B, (See the dotted line in (c)) and output to the multiplexing means 61.

【００４０】多重化手段６１には、過去の無音区間（背
景騒音区間）における同一の分析フレームで得られた符
号化データが順次出力されるので、復号化した場合に得
られる復号化信号は、実際の背景騒音としての周波数特
性とパワーを持ち、良好な品質を示す。また同じものを
繰り返さず違う分析フレームの符号化データがランダム
に順次出力されるので、この符号化データを受信する復
号化手段で復号化される復号化信号には、背景騒音とし
て不適当な周期性が無い。The multiplexing means 61 sequentially outputs coded data obtained in the same analysis frame in a past silent section (background noise section), so that a decoded signal obtained by decoding is: It has frequency characteristics and power as actual background noise, and shows good quality. In addition, since the encoded data of different analysis frames are sequentially output at random without repeating the same, the decoded signal decoded by the decoding unit that receives the encoded data includes an inappropriate period as background noise. There is no sex.

【００４１】また、背景騒音生成手段６０が出力する符
号化データＣＳの内容は、先にＣＥＬＰ符号化の基本動
作で説明した符号化データＣＳの内容と同一である。よ
って、通信路を経て符号化データＣＳを受信し復号化す
る遠端側の音声復号化手段１８では、符号化データＣＳ
が背景騒音生成手段６０で生成したものか否かに因ら
ず、通常のＣＥＬＰの復号化処理によって音声区間の音
声信号とエコー残留フレーム区間の背景騒音を復号化し
て生成する。The contents of the coded data CS output from the background noise generating means 60 are the same as the contents of the coded data CS described in the basic operation of the CELP coding. Therefore, the voice decoding means 18 on the far end that receives and decodes the coded data CS via the communication path,
Is generated by decoding the speech signal in the speech section and the background noise in the echo residual frame section by ordinary CELP decoding processing, regardless of whether or not is generated by the background noise generation means 60.

【００４２】以上説明したように、実施の形態１によれ
ば、エコー残留フレームでは、ＣＥＬＰ方式に基づく音
声符号化手段１３が予め無音区間の同一の分析フレーム
で求められたＬＳＰ、適応音源符号、駆動音源符号、符
号化されたゲインのセットをランダムに選択して順次出
力して背景騒音に相当する符号化データを生成する様に
したので、擬似背景騒音生成のために新たな手段を設け
ることなく、一般に普及しているＣＥＬＰ方式を利用す
る簡易な構成と方法により、送信信号にエコーのみが存
在する区間において周期性が無く背景騒音としての特徴
を持つ品質の良い擬似背景騒音を生成することができ
る。As described above, according to the first embodiment, in the echo residual frame, the speech encoding means 13 based on the CELP method uses the LSP, adaptive excitation code, Since the drive excitation code and the set of coded gains are randomly selected and sequentially output to generate coded data corresponding to background noise, a new means for generating pseudo background noise is provided. In addition, by using a simple configuration and method using the widely-used CELP method, a high-quality pseudo-background noise having no periodicity and a characteristic as a background noise is generated in a section where only an echo exists in a transmission signal. Can be.

【００４３】実施の形態２．実施の形態１の背景騒音生
成手段６０は、ＬＳＰを選択した分析フレームを基準に
適応音源符号と駆動音源符号およびそれらのゲインを選
択したが、適応音源符号とそのゲインのセット、及び駆
動音源符号とそのゲインのセットをそれぞれ別々にラン
ダムに選択しても良い。Embodiment 2 The background noise generating means 60 according to the first embodiment selects the adaptive excitation code, the driving excitation code, and their gain based on the analysis frame from which the LSP has been selected. And a set of gains thereof may be individually and randomly selected.

【００４４】以上説明したように、実施の形態２によれ
ば、ＬＳＰと適応音源符号と駆動音源符号およびそれら
のゲインをそれぞれ別々にランダムに選択するようにし
たので、背景騒音生成手段６０で生成される擬似背景騒
音は、よりランダム雑音性が増し、偏った特性を持たな
い効果がある。As described above, according to the second embodiment, the LSP, the adaptive excitation code, the driving excitation code, and their gains are individually and randomly selected. The simulated background noise has an effect that the random noise property is further increased and does not have a biased characteristic.

【００４５】実施の形態３．実施の形態１の背景騒音生
成手段６０は、符号化されたＬＳＰ及びゲインを予め保
持しそれを出力したが、符号化される前における実際の
ＬＳＰあるいはゲインの値を保持し、出力する際に符号
化しても良い。Embodiment 3 The background noise generation unit 60 according to the first embodiment holds and outputs the encoded LSP and gain in advance, but retains and outputs the actual LSP or gain value before encoding. It may be encoded.

【００４６】以上説明したように、実施の形態３によれ
ば、実際のＬＳＰあるいはゲインの値を保持して符号化
し直すので、例えば前の分析フレームのＬＳＰ（あるい
はゲイン）と現フレームのＬＳＰ（あるいはゲイン）の
差分を符号化するような他のパラメータとの関連性を利
用して当該パラメータを符号化する方式を用いる場合に
も対応できる効果がある。As described above, according to the third embodiment, since the actual LSP or gain value is retained and re-encoded, for example, the LSP (or gain) of the previous analysis frame and the LSP (current gain) of the current frame are Alternatively, there is an effect that it is possible to cope with a case of using a method of encoding the parameter by utilizing the association with another parameter such as encoding the difference of the gain).

【００４７】実施の形態４．実施の形態１の背景騒音生
成手段６０では、予め無音区間で求めた適応音源符号及
び駆動音源符号のゲインをそのまま出力した。しかし、
無音区間の分析フレームで得られた音源信号（適応音源
符号帳内）を目標信号とし、そのパワーの平均値を基準
音源パワーとして求め、背景騒音生成手段６０が選択し
た適応音源符号と駆動音源符号及び両音源符号のゲイン
を用いて生成する音源信号のパワーがこの基準音源パワ
ーと一致するように、両音源符号のゲインを修正し、そ
れを符号化しても良い。また、このとき、適応音源符号
のゲインをゼロないしゼロに近い値にして駆動音源符号
のみで生成する音源信号のパワーが基準音源パワーと一
致するように、駆動音源のゲインを求めても良い。Embodiment 4 In the background noise generating means 60 of the first embodiment, the gains of the adaptive excitation code and the driving excitation code previously obtained in the silent section are output as they are. But,
The excitation signal (in the adaptive excitation codebook) obtained in the analysis frame of the silent section is used as the target signal, the average value of the power is determined as the reference excitation power, and the adaptive excitation code and the driving excitation code selected by the background noise generation unit 60 are selected. Alternatively, the gain of both excitation codes may be corrected and coded so that the power of the excitation signal generated using the gain of both excitation codes matches the reference excitation power. Also, at this time, the gain of the driving excitation may be obtained by setting the gain of the adaptive excitation code to zero or a value close to zero so that the power of the excitation signal generated only by the driving excitation code matches the reference excitation power.

【００４８】以上説明したように、実施の形態４によれ
ば、背景騒音生成手段６０が生成する符号化データで生
成する音源信号のパワーが基準音源パワーに一致するの
で、大きなパワー変動の無い安定した擬似背景騒音が生
成できる効果がある。また、適応音源のゲインをゼロに
して適応音源を使用しないようにしたので、適応音源符
号帳の内容が誤って有音区間の特徴を持った場合でも、
適応音源符号帳の内容によらず良好な品質の擬似背景騒
音が生成できる。As described above, according to the fourth embodiment, since the power of the excitation signal generated by the encoded data generated by the background noise generation means 60 matches the reference excitation power, there is no stable power fluctuation. There is an effect that the generated pseudo background noise can be generated. In addition, since the adaptive excitation is not used by setting the gain of the adaptive excitation to zero, even if the content of the adaptive excitation codebook has the characteristic of a voiced section by mistake,
Good quality pseudo background noise can be generated regardless of the contents of the adaptive excitation codebook.

【００４９】実施の形態５．実施の形態４における背景
騒音生成手段６０は、無音区間における音源信号を目標
信号として基準音源パワーを求め、背景騒音生成手段６
０が生成する符号化データで生成する音源信号のパワー
がこの基準音源パワーに一致するようにしたが、無音区
間における残差信号ＳＡ（ｉ）を目標信号とし、そのパ
ワーの平均値を基準音声パワーとして求め、背景騒音生
成手段６０が選択したＬＳＰと適応音源符号と駆動音源
符号及び両音源符号のゲインを用いて合成する合成信号
のパワーが基準音声パワーと一致するように、両音源ゲ
インを修正しても良い。Embodiment 5 The background noise generation means 60 according to the fourth embodiment obtains a reference sound source power using the sound source signal in a silent section as a target signal, and
The power of the excitation signal generated by the coded data generated by 0 is set to match the reference excitation power. However, the residual signal SA (i) in the silent section is set as the target signal, and the average value of the power is used as the reference sound. The two-source excitation gain is calculated so that the power of the synthesized signal synthesized using the LSP selected by the background noise generation means 60, the adaptive excitation code, the driving excitation code, and the gain of both excitation codes matches the reference audio power. May be modified.

【００５０】以上説明したように、実施の形態５によれ
ば、背景騒音生成手段６０が生成する符号化データで生
成する合成信号のパワーが基準音声パワーに一致するの
で、パワー変動の無い安定した擬似背景騒音が生成でき
る効果がある。As described above, according to the fifth embodiment, since the power of the synthesized signal generated by the encoded data generated by the background noise generating means 60 matches the reference audio power, the power does not fluctuate and is stable. There is an effect that pseudo background noise can be generated.

【００５１】実施の形態６．図４は、この発明の実施の
形態６に係るエコー処理装置の構成を示すブロック図で
ある。また、図５は図４の音声符号化手段１４の詳細構
成を示すブロック図である。実施の形態１で説明した図
１では、エコー判定手段１２の判定結果を音声符号化手
段１３に直接入力したが、この実施の形態６に係るエコ
ー処理装置は、携帯電話の様な無線通信端末における音
声処理で一般に用いられるＤＴＸ（Ｄｉｓｃｏｎｔｉｎ
ｕｏｕｓＴｒａｎｓｍｉｓｓｉｏｎ）制御手段１５を
介する構成としている。このＤＴＸ制御手段１５は、送
信信号が無音区間の場合は無線による送信出力をオフ
し、その区間の消費電力を低減するものである。ＤＴＸ
制御手段１５による送信制御方法や快適雑音は例えば、
第三世代携帯電話の標準規格である３ＧＴＳ２６．
０９３“ＡＭＲＳｐｅｅｃｈＣｏｄｅｃ;Ｓｏｕｒ
ｃｅＣｏｎｔｒｏｌｌｅｄＲａｔｅＯｐｅｒａｔ
ｉｏｎ”に記載されている。Embodiment 6 FIG. FIG. 4 is a block diagram showing a configuration of an echo processing device according to Embodiment 6 of the present invention. FIG. 5 is a block diagram showing a detailed configuration of the speech encoding unit 14 of FIG. In FIG. 1 described in the first embodiment, the determination result of the echo determination unit 12 is directly input to the voice encoding unit 13. However, the echo processing device according to the sixth embodiment is a wireless communication terminal such as a mobile phone. DTX (Discintin) commonly used in audio processing in
The configuration is via a uuos transmission (control) unit 15. This DTX control means 15 turns off the wireless transmission output when the transmission signal is in a silent section, and reduces power consumption in that section. DTX
The transmission control method and the comfort noise by the control means 15 include, for example,
3G TS which is a standard for third generation mobile phones
093 "AMR Speech Codec; Sour
ce Controlled Rate Operat
ion ".

【００５２】以下、図４と図５を用いて、この発明によ
る実施の形態６の動作を説明する。図４と図５におい
て、図１と図２に示す符号と同一の符号は、同一または
相当部分を示すので、動作が同じものについては説明を
省略する。図４において、１４はＤＴＸ制御手段１５の
出力する制御情報を入力する音声符号化手段、１６は符
号化データＣＳとフラグ情報ＤＴＸを入力し無線信号に
変調する変調手段、１７は無線信号を入力し符号化デー
タＣＲを出力する復調手段である。また図５において、
６２はＤＴＸ制御手段１５からのフラグ情報ＤＴＸを入
力する背景騒音生成手段である。また、２０は遠端側に
おいて無線信号から符号化データとフラグ情報ＤＴＸを
復調する復調手段、２１は遠端側において符号化データ
を変調し無線出力する変調手段である。The operation of the sixth embodiment according to the present invention will be described below with reference to FIGS. 4 and 5, the same reference numerals as those shown in FIGS. 1 and 2 denote the same or corresponding parts, and a description of the same operations will be omitted. In FIG. 4, reference numeral 14 denotes voice coding means for inputting control information output from the DTX control means 15, 16 denotes modulation means for inputting the coded data CS and flag information DTX and modulates them into a radio signal, and 17 denotes a radio signal. And a demodulation means for outputting encoded data CR. In FIG. 5,
Reference numeral 62 denotes a background noise generation unit that inputs the flag information DTX from the DTX control unit 15. Reference numeral 20 denotes demodulation means for demodulating coded data and flag information DTX from a radio signal on the far end side, and reference numeral 21 denotes modulation means for modulating coded data on the far end side and wirelessly outputting the same.

【００５３】ＤＴＸ制御手段１５は有音無音判定手段４
とエコー判定手段１２の判定結果を入力し、エコー判定
手段１２の判定結果がエコー残留フレームを示す場合
か、もしくは有音無音判定手段４の判定結果が無音であ
る場合は、フラグ情報ＤＴＸを無線送信オフを意味する
「０」に設定して音声符号化手段１４に出力すると共に
変調手段１６に出力する。またＤＴＸ制御手段１５は、
エコー判定手段１２の判定結果がエコー残留フレームで
無く、且つ有音無音判定手段４の判定結果が有音である
場合は、フラグ情報ＤＴＸを無線送信オンを意味する
「１」に設定して音声符号化手段１４に出力すると共に
変調手段１６に出力する。The DTX control means 15 is a sound / silence determination means 4
And the determination result of the echo determination means 12 is input. If the determination result of the echo determination means 12 indicates an echo residual frame, or if the determination result of the voiced / silent determination means 4 is silent, the flag information DTX is transmitted wirelessly. It is set to “0” meaning transmission off and output to the voice encoding means 14 and to the modulation means 16. The DTX control means 15
If the result of the determination by the echo determination means 12 is not an echo residual frame and the result of the determination by the voiced / silence determination means 4 is voiced, the flag information DTX is set to "1" which means that wireless transmission is on, and the Output to the encoding means 14 and output to the modulation means 16.

【００５４】音声符号化手段１４は残差信号ＳＡ（ｉ）
を符号化する機能と、ＤＴＸ制御手段１５からのフラグ
情報ＤＴＸと有音無音判定手段４からの有音無音判定結
果とエコー判定手段１２からのエコー判定結果に応じて
背景騒音に相当する符号化データを生成する機能を有す
る。以降、図５を用いて音声符号化手段１４の動作を詳
しく説明する。The voice coding means 14 generates a residual signal SA (i)
In accordance with the flag information DTX from the DTX control unit 15, the voiced / silent determination result from the voiced / silent determination unit 4, and the echo determination result from the echo determination unit 12, It has a function to generate data. Hereinafter, the operation of the speech encoding unit 14 will be described in detail with reference to FIG.

【００５５】音声符号化手段１４はフラグ情報ＤＴＸが
「１」の場合（無線送信オン）、受信信号ＳＡ（ｉ）を
実施の形態１で説明したＣＥＬＰ方式によって符号化
し、符号化データＣＳを変調手段１６に出力する。変調
手段１６はこの符号化データを連続的に変調して無線出
力する。When the flag information DTX is "1" (wireless transmission ON), the voice coding means 14 codes the received signal SA (i) by the CELP method described in the first embodiment and modulates the coded data CS. Output to means 16. The modulating means 16 continuously modulates the encoded data and outputs it wirelessly.

【００５６】また、音声符号化手段１４はフラグ情報Ｄ
ＴＸが「０」で且つエコー判定結果がエコー残留フレー
ムで無いことを示している場合、ＤＴＸ処理モードに入
り、例えば３ＧＴＳ２６．０９９２“ＡＭＲＳｐ
ｅｅｃｈＣｏｄｅｃ;Ｃｏｎｆｏｒｔｎｏｉｓｅ
ａｓｐｅｃｔｓ”に示される様な方法で間欠的に背景騒
音を符号化する。このモードでは背景騒音生成手段６２
は例えば６フレーム毎に線形予測分析手段５６から入力
した符号化されたＬＳＰを多重化手段６１に出力する。
また同じ６フレーム毎に残差信号ＳＡ（ｉ）のパワーを
求めて符号化して多重化手段６１に出力する。多重化手
段６１はこの符号化データを多重化して変調手段１６に
出力し、変調手段１６は間欠的（６フレーム毎）にこの
符号化データを変調して無線出力する。なお、変調手段
は常にフラグ情報ＤＴＸを変調して無線出力する。この
ことで変調手段１６における消費電力がセーブされる。The speech encoding means 14 outputs the flag information D
When TX is “0” and the echo determination result indicates that the frame is not an echo residual frame, a DTX processing mode is entered, for example, 3G TS 26.092 “AMR Sp
ech Codec; Comfort noise
Aspects ", the background noise is intermittently coded by a method as shown in FIG.
Outputs the encoded LSP input from the linear prediction analysis means 56 to the multiplexing means 61, for example, every six frames.
Further, the power of the residual signal SA (i) is obtained and encoded for each of the same six frames, and output to the multiplexing means 61. The multiplexing means 61 multiplexes the coded data and outputs the multiplexed data to the modulation means 16, and the modulation means 16 intermittently (every six frames) modulates the coded data and wirelessly outputs the modulated data. The modulating means always modulates the flag information DTX and outputs it wirelessly. This saves power consumption in the modulating means 16.

【００５７】変調手段１６で変調された無線信号は通信
路を経由して遠端側の復調手段２０に入力され、復調手
段２０において符号化データとフラグ情報ＤＴＸに復調
され、音声復号化手段１８に入力される。このとき、符
号化データは間欠的に音声復号化手段１８に入力され
る。音声復号化手段１８は入力したフラグ情報ＤＴＸが
「０」であることによってＤＴＸ処理モードに入り、例
えば３ＧＴＳ２６．０９９２“ＡＭＲＳｐｅｅｃ
ｈＣｏｄｅｃ;Ｃｏｎｆｏｒｔｎｏｉｓｅａｓｐｅ
ｃｔｓ”に示される様な方法で、受信したＬＳＰとパワ
ーを用いて背景騒音に相当する信号を復号化する。The radio signal modulated by the modulating means 16 is input to the far-end demodulating means 20 via the communication path, and is demodulated into coded data and flag information DTX by the demodulating means 20. Is input to At this time, the encoded data is intermittently input to the audio decoding unit 18. The audio decoding means 18 enters the DTX processing mode when the input flag information DTX is “0”, and for example, 3G TS 26.0992 “AMR Spec”
h Codec; Comfort noiseaspe
cts ", a signal corresponding to background noise is decoded using the received LSP and power.

【００５８】また、音声符号化手段１４はフラグ情報Ｄ
ＴＸが「０」で且つエコー判定結果がエコー残留フレー
ムを示している場合、背景騒音生成を伴うＤＴＸ処理モ
ードに入る。このＤＴＸ処理モードでは背景騒音生成手
段６２は、図６に示すように、予め無音区間で求めて保
存したＬＳＰとパワーのセット中から乱数によりランダ
ムに選択手段６５で複数選択し、それぞれ平均化手段６
３で平均化した後、符号化手段６４で符号化して６フレ
ーム毎に多重化手段６１に出力する。このとき例えば全
体でＬＳＰとパワーのセットをＮ個保存すると、Ｎより
小さい数Ｍだけランダムに選択するようにする。多重化
手段６１はこの符号化データを多重化して変調手段１６
に出力し、変調手段１６は間欠的（６フレーム毎）にこ
の符号化データを変調して無線出力する。The voice encoding means 14 outputs the flag information D
If TX is “0” and the echo determination result indicates an echo residual frame, the DTX processing mode involving background noise generation is entered. In this DTX processing mode, as shown in FIG. 6, the background noise generator 62 randomly selects a plurality of LSPs and powers from a set of LSPs and powers previously obtained and stored in a silent section by random numbers using a random number. 6
After averaging in 3, the data is encoded by the encoding means 64 and output to the multiplexing means 61 every six frames. At this time, for example, if N sets of LSPs and powers are stored in total, a number M smaller than N is selected at random. The multiplexing means 61 multiplexes the encoded data and modulates the
The modulation means 16 intermittently (every six frames) modulates this encoded data and outputs it wirelessly.

【００５９】なお、背景騒音生成手段６２はフラグ情報
ＤＴＸの値とエコー判定結果に因らず有音無音判定結果
が無音の場合、線形予測分析手段５６から入力したＬＳ
Ｐを予め最近数フレーム分常に保存し、同様に、ＬＳＰ
を保存した同じ分析フレームにおける受信信号ＳＡ
（ｉ）のパワーを求めて保存しておくものとする。The background noise generation means 62 outputs the LS input from the linear prediction analysis means 56 when the sound / non-speech judgment result is silence regardless of the value of the flag information DTX and the echo judgment result.
P is always stored in advance for the last few frames.
Signal SA in the same analysis frame storing
The power of (i) is obtained and stored.

【００６０】通信路と復調手段２０を経てこの符号化デ
ータを間欠的に入力する音声復号化手段１８は、入力し
たフラグ情報ＤＴＸが「０」であることによってＤＴＸ
処理モードに入っており、受信したＬＳＰとパワーを用
いて背景騒音に相当する信号を復号化する。このときＬ
ＳＰとパワーは無音区間の分析フレームで予め求められ
たものをランダム選択し平均化されているので、遠端側
の音声復号化手段１８で復号化される復号化信号は、適
度に変化する背景騒音としてのスペクトル特性を持ち良
好な品質を有する。The speech decoding means 18 intermittently inputs the encoded data via the communication path and the demodulation means 20.
In the processing mode, a signal corresponding to background noise is decoded using the received LSP and power. Then L
Since the SP and the power are randomly selected and averaged from those previously obtained in the analysis frame of the silent section, the decoded signal decoded by the voice decoding means 18 on the far end side has a moderately changing background. It has spectral characteristics as noise and has good quality.

【００６１】以上説明したように、実施の形態６によれ
ば、エコー判定結果がエコー残留フレームである場合
は、音声符号化手段１４が無線通信用音声符号化に一般
的に使われるＤＴＸ制御手法を利用し、予め無音区間で
求めたＬＳＰと受信信号のパワーをランダムに選択して
順次出力して背景騒音に相当する符号化データを生成す
る様にしたので、擬似背景騒音生成のために新たな手段
を設けることなく簡易な構成と方法により、送信信号に
エコーのみが存在する区間において品質の良い擬似背景
騒音を生成することができる。As described above, according to the sixth embodiment, when the echo determination result is an echo residual frame, the speech encoding means 14 uses the DTX control method generally used for speech encoding for wireless communication. Is used to randomly select and sequentially output the power of the LSP and the received signal obtained in the silent section in advance to generate encoded data corresponding to the background noise. By using a simple configuration and method without providing any simple means, it is possible to generate high-quality pseudo background noise in a section in which only an echo exists in a transmission signal.

【００６２】実施の形態７．実施の形態６のエコー処理
装置は、予め無音区間の分析フレームで求めたＬＳＰと
パワーを、エコー残留フレームにおいて６フレーム毎に
その都度ランダムに選択して平均化した。しかし、エコ
ー残留フレームと判定された最初の分析フレームで求め
たＬＳＰの各次元の値を乱数を用いて微小にランダムに
揺らがせて新たなＬＳＰを６フレーム毎に生成しても良
い。Embodiment 7 The echo processing apparatus according to the sixth embodiment randomly selects and averages the LSP and the power obtained in advance in the analysis frame of the silent section every six frames in the echo residual frame. However, a new LSP may be generated every six frames by slightly fluctuating the value of each dimension of the LSP obtained in the first analysis frame determined as the echo residual frame using random numbers.

【００６３】以上で説明したように、実施の形態７によ
れば、間欠的に送信されるＬＳＰが微小に変化する様に
したので、送信信号にエコーのみが存在する区間におい
て、大きな変動の無い品質の良い擬似背景騒音を生成す
ることができる。As described above, according to the seventh embodiment, the intermittently transmitted LSP is made to change slightly, so that there is no large fluctuation in the section where only the echo exists in the transmission signal. High quality pseudo background noise can be generated.

【００６４】[0064]

【発明の効果】以上のように、この発明によれば、送信
信号には近端の背景騒音成分と、受信信号が反響路によ
って反響したエコー信号のみが存在することを判定する
エコー判定手段と、送信信号を符号化すると共に、エコ
ー判定手段において送信信号には近端の背景騒音成分と
エコー信号のみが存在すると判定された分析フレームで
は、予め騒音区間と判定される分析フレームで求めた符
号化用の分析パラメータを用い、背景騒音に相当する符
号化データを生成し出力する音声符号化手段を備えるよ
うに構成したので、擬似背景騒音生成のために新たな手
段を設けることなく、簡易な構成により、送信信号にエ
コーのみが存在する区間において品質の良い擬似背景騒
音を生成することができる効果がある。As described above, according to the present invention, there is provided an echo judgment means for judging that only a near-end background noise component exists in a transmission signal and an echo signal in which a reception signal is echoed by an echo path. In the analysis frame in which the transmission signal is encoded and only the near-end background noise component and the echo signal are present in the transmission signal by the echo determination means, the code determined in advance in the analysis frame determined as the noise section is used. Is configured to include a speech encoding unit that generates and outputs encoded data corresponding to background noise using analysis parameters for quantization, so that there is no need to provide a new unit for generating pseudo background noise, and a simple With the configuration, there is an effect that high-quality pseudo background noise can be generated in a section in which only an echo exists in a transmission signal.

【００６５】この発明によれば、音声符号化方式がＣＥ
ＬＰ方式であり、背景騒音に相当する符号化データがス
ペクトルパラメータと適応音源符号と駆動音源符号とこ
れら両音源符号のゲインを用いる音声符号化手段を備え
るように構成したので、携帯電話などの標準音声コーデ
ック方式として一般に広く使われるＣＥＬＰ方式に簡単
に適用できる効果がある。According to the present invention, the speech encoding method is CE
It is an LP method, and the encoded data corresponding to the background noise is configured to include the speech parameter using the spectrum parameter, the adaptive excitation code, the driving excitation code, and the gain of these two excitation codes. There is an effect that it can be easily applied to the CELP system which is widely used as a voice codec system.

【００６６】この発明によれば、エコー判定手段におい
て送信信号には近端の背景騒音成分とエコー信号のみが
存在すると判定された分析フレームでは、予め騒音区間
と判定される複数の分析フレームで求めた複数の符号化
されたスペクトルパラメータをランダムに選んで順次出
力する音声符号化手段を備えるように構成したので、背
景騒音のスペクトル特徴を有し、送信信号にエコーのみ
が存在する区間において周期性が無く背景騒音としての
特徴を持つ品質の良い擬似背景騒音を生成することがで
きる効果がある。According to the present invention, in the analysis frame in which only the near-end background noise component and the echo signal are present in the transmission signal by the echo determination means, a plurality of analysis frames previously determined to be noise sections are obtained. And a speech encoding unit for randomly selecting a plurality of encoded spectral parameters and sequentially outputting the selected encoded spectral parameters. There is an effect that it is possible to generate a high-quality pseudo background noise having characteristics as a background noise without any noise.

【００６７】この発明によれば、エコー判定手段におい
て送信信号には近端の背景騒音成分とエコー信号のみが
存在すると判定された分析フレームでは、予め騒音区間
と判定される複数の分析フレームで求めた複数のスペク
トルパラメータをランダムに選んで符号化して順次出力
する音声符号化手段を備えたので、異なる分析フレーム
で得たスペクトルパラメータや他のパラメータとの関連
性を利用してスペクトルパラメータを符号化する方式を
用いる場合にも対応できる効果がある。According to the present invention, in the analysis frame in which the echo determination means determines that only the near-end background noise component and the echo signal are present in the transmission signal, the analysis frame is obtained in advance from a plurality of analysis frames that are determined to be noise sections. Speech coding means that randomly selects and encodes a plurality of spectral parameters and sequentially outputs the plurality of spectral parameters, so that the spectral parameters obtained in different analysis frames and the relevance with other parameters are used to encode the spectral parameters. There is also an effect that can be applied to the case where a method of performing the above is used.

【００６８】この発明によれば、エコー判定手段におい
て送信信号には近端の背景騒音成分とエコー信号のみが
存在すると判定された分析フレームでは、予め騒音区間
と判定される複数の分析フレームのスペクトルパラメー
タを平均化し、平均化したスペクトルパラメータの各次
元の値を揺らがせて符号化して順次出力する音声符号化
手段を備えるように構成したので、大きな変動の無い品
質の良い擬似背景騒音を生成することができる効果があ
る。According to the present invention, in the analysis frame for which it is determined that only the near-end background noise component and the echo signal are present in the transmission signal by the echo determination means, the spectrum of a plurality of analysis frames previously determined to be noise sections is determined. Since the apparatus is provided with an audio encoding means for averaging the parameters, fluctuating and encoding the values of each dimension of the averaged spectral parameters, and sequentially outputting them, a high quality pseudo background noise without large fluctuation is generated. There is an effect that can be.

【００６９】この発明によれば、ランダムに選択された
スペクトルパラメータが求められた分析フレームと同じ
分析フレームで求めた適応音源符号と駆動音源符号及び
両音源符号に対応する符号化されたゲインを選択し、順
次出力する音声符号化手段を備えるように構成したの
で、実際の背景騒音のスペクトル特徴とパワーを持つ品
質の良い擬似背景騒音を生成することができる効果があ
る。According to the present invention, the adaptive excitation code, the driving excitation code, and the coded gain corresponding to both excitation codes, which are determined in the same analysis frame in which the spectral parameters randomly selected are determined, are selected. In addition, since the apparatus is configured to include the audio encoding means for sequentially outputting, it is possible to generate high quality pseudo background noise having the spectral characteristics and power of the actual background noise.

【００７０】この発明によれば、エコー判定手段におい
て、送信信号には近端の背景騒音成分とエコー信号のみ
が存在すると判定された分析フレームでは、予め騒音区
間と判定される複数の分析フレームで求めた複数の適応
音源符号及びそれぞれの適応音源符号に対応する符号化
されたゲインのセットをランダムに選んで順次出力し、
また、予め騒音区間と判定される複数の分析フレームで
求めた複数の駆動音源符号及びそれぞれの駆動音源符号
に対応する符号化されたゲインのセットをランダムに選
んで順次出力する音声符号化手段を備えるように構成し
たので、偏った特性を持たずランダム性の大きい品質の
良い擬似背景騒音を生成することができる効果がある。According to the present invention, in the analysis frame in which only the near-end background noise component and the echo signal are present in the transmission signal in the echo determination means, a plurality of analysis frames previously determined to be noise sections are used. A plurality of determined adaptive excitation codes and a set of encoded gains corresponding to the respective adaptive excitation codes are randomly selected and sequentially output,
Further, there is provided a speech encoding unit which randomly selects a plurality of drive excitation codes obtained in a plurality of analysis frames determined in advance as noise sections and sets of encoded gains corresponding to the respective drive excitation codes, and sequentially outputs the selected sets. With this configuration, there is an effect that it is possible to generate high-quality pseudo-background noise with large randomness without biased characteristics.

【００７１】この発明によれば、適応音源符号のゲイン
と駆動音源符号のゲインは符号化する前の値を求めて保
持し、選択される都度符号化して出力する音声符号化手
段を備えるように構成したので、異なる分析フレームで
得たゲインや他のパラメータとの関連性を利用してゲイ
ンを符号化する方式を用いる場合にも対応できる効果が
ある。According to the present invention, the gain of the adaptive excitation code and the gain of the driving excitation code are obtained and retained before encoding, and are provided with a speech encoding means for encoding and outputting each time they are selected. With this configuration, there is an effect that it is possible to cope with a case where a method of encoding a gain using the gain obtained in different analysis frames and the relationship with other parameters is used.

【００７２】この発明によれば、予め騒音区間と判定さ
れる複数の分析フレームで求めた目標信号の平均パワー
を基準に、選択された適応音源符号と駆動音源符号のゲ
インを修正して符号化して出力する音声符号化手段を備
えるように構成したので、大きなパワー変動の無い安定
した擬似背景騒音を生成できる効果がある。According to the present invention, the gains of the selected adaptive excitation code and driving excitation code are corrected based on the average power of the target signal obtained in advance in a plurality of analysis frames determined as noise sections, and the coding is performed. Is configured to include the voice encoding means for outputting the pseudo-background noise with no large power fluctuation.

【００７３】この発明によれば、予め騒音区間と判定さ
れる分析フレームで駆動音源符号のゲインを求める際、
適応音源符号のゲインはゼロ乃至ゼロに近い値を代入し
た後に駆動音源符号のゲインを求める音声符号化手段を
備えるように構成したので、適応音源符号帳の内容によ
らず良好な品質の擬似背景騒音が生成できる効果があ
る。According to the present invention, when the gain of the driving excitation code is obtained in the analysis frame previously determined to be a noise section,
Since the adaptive excitation code gain is configured to include a speech encoding unit that obtains the gain of the driving excitation code after substituting a value of zero to a value close to zero, a pseudo background of good quality regardless of the contents of the adaptive excitation codebook. There is an effect that noise can be generated.

【００７４】この発明によれば、エコー判定手段におい
て、送信信号には近端の背景騒音成分とエコー信号のみ
が存在すると判定された分析フレームでは、ＤＴＸにお
ける無音フラグを出力すると共に、ＤＴＸにおける無音
区間処理のモードで内部動作する音声符号化手段を備え
るように構成したので、一般的に使われるＤＴＸ制御方
法を利用し、擬似背景騒音生成のための新たな手段を設
けることなく簡易な構成と方法により、品質の良い擬似
背景騒音を生成できる効果がある。According to the present invention, in the analysis frame in which it is determined that only the near-end background noise component and the echo signal are present in the transmission signal by the echo determination means, the DTX silence flag is output and the DTX silence flag is output. Since it is configured to include a voice encoding unit that operates internally in the section processing mode, a simple configuration without using a new unit for generating pseudo background noise using a commonly used DTX control method is provided. The method has an effect that high-quality pseudo background noise can be generated.

【００７５】この発明によれば、予め騒音区間と判定さ
れる複数の分析フレームで求めた複数のスペクトルパラ
メータをランダムに選んで平均化して符号化し、順次出
力する音声符号化手段を備えるように構成したので、適
度に変化する背景騒音としてのスペクトル特性を持つ品
質の良い擬似背景騒音を生成できる効果がある。According to the present invention, there is provided a speech encoding means for randomly selecting, averaging and encoding a plurality of spectrum parameters obtained in advance in a plurality of analysis frames determined as noise sections, and sequentially outputting the speech parameters. Therefore, there is an effect that it is possible to generate high-quality pseudo background noise having a spectral characteristic as background noise that changes appropriately.

【００７６】この発明によれば、符号化手段は、エコー
判定手段の判定結果に基づいて、送信信号中にエコー成
分が多く含まれている場合には、送信信号の符号化デー
タの代わりに背景騒音に相当する予め記憶された符号化
データを送信データとして出力する背景騒音生成手段を
備えるように構成したので、送信信号中にエコー成分が
多く含まれている場合でも、背景騒音に相当する符号化
データを送信データとして出力できる効果がある。According to the present invention, when the transmission signal contains a large amount of echo components based on the determination result of the echo determination means, the encoding means replaces the encoded data of the transmission signal with the background data. Since the apparatus is provided with background noise generation means for outputting coded data corresponding to noise stored in advance as transmission data, even if a transmission signal contains a large amount of echo components, a code corresponding to background noise is generated. There is an effect that the converted data can be output as transmission data.

[Brief description of the drawings]

【図１】この発明の実施の形態１から実施の形態５に
よるエコー処理装置のブロック図である。FIG. 1 is a block diagram of an echo processing device according to Embodiments 1 to 5 of the present invention.

【図２】この発明の実施の形態１から実施の形態５に
よるエコー処理装置に備えられた音声符号化手段のブロ
ック図である。FIG. 2 is a block diagram of a speech encoding unit provided in the echo processing device according to the first to fifth embodiments of the present invention.

【図３】この発明の実施の形態１から実施の形態５に
よる背景騒音生成の動作を説明する動作説明図である。FIG. 3 is an operation explanatory diagram illustrating an operation of generating background noise according to Embodiments 1 to 5 of the present invention.

【図４】この発明の実施の形態６によるエコー処理装
置のブロック図である。FIG. 4 is a block diagram of an echo processing device according to a sixth embodiment of the present invention.

【図５】この発明の実施の形態６によるエコー処理装
置に備えられた音声符号化手段のブロック図である。FIG. 5 is a block diagram of a speech encoding unit provided in an echo processing device according to a sixth embodiment of the present invention.

【図６】この発明の実施の形態６による背景騒音生成
の動作を説明する動作説明図であるFIG. 6 is an operation explanatory diagram illustrating an operation of generating background noise according to Embodiment 6 of the present invention;

【図７】従来のエコー処理装置のブロック図である。FIG. 7 is a block diagram of a conventional echo processing device.

[Explanation of symbols]

１擬似エコー生成手段、２加算手段、４有音無音
判定手段、１０音声復号化手段、１１音声符号化復
号化手段、１２エコー判定手段、１３音声符号化手
段、１４音声符号化手段、１５ＤＴＸ制御手段、１
６変調手段、１７復調手段、５１適応音源符号
帳、５２駆動音源符号帳、５３アンプ、５４アン
プ、５５加算手段、５６線形予測分析手段、５７
合成フィルタ手段、５８加算手段、５９音源探索手
段、６０背景騒音生成手段、６１多重化手段、６２
背景騒音生成手段、６３平均化手段、６４符号化手
段、６５選択手段。REFERENCE SIGNS LIST 1 pseudo echo generation means, 2 addition means, 4 voiced / silence determination means, 10 voice decoding means, 11 voice coding / decoding means, 12 echo determination means, 13 voice coding means, 14 voice coding means, 15 DTX Control means, 1
6 modulation means, 17 demodulation means, 51 adaptive excitation codebook, 52 driving excitation codebook, 53 amplifier, 54 amplifier, 55 addition means, 56 linear prediction analysis means, 57
Synthesis filter means, 58 addition means, 59 sound source search means, 60 background noise generation means, 61 multiplexing means, 62
Background noise generating means, 63 averaging means, 64 coding means, 65 selecting means.

───────────────────────────────────────────────────── フロントページの続き (72)発明者田崎裕久東京都千代田区丸の内二丁目２番３号三菱電機株式会社内 (72)発明者松岡文啓東京都千代田区丸の内二丁目２番３号三菱電機株式会社内Ｆターム(参考） 5D020 CC06 5D045 CA01 5K027 AA11 AA16 BB03 DD10 DD18 5K046 HH11 HH79 ──────────────────────────────────────────────────続き Continuing on the front page (72) Inventor Hirohisa Tazaki 2-3-2 Marunouchi, Chiyoda-ku, Tokyo Mitsui Electric Co., Ltd. (72) Inventor Fumihiro Matsuoka 2-3-2 Marunouchi, Chiyoda-ku, Tokyo F term in Mitsubishi Electric Corporation (reference) 5D020 CC06 5D045 CA01 5K027 AA11 AA16 BB03 DD10 DD18 5K046 HH11 HH79

Claims

[Claims]

1. An echo determining means for determining that only a near-end background noise component and an echo signal in which a received signal is echoed by an echo path are present in a transmitted signal, and the transmission signal is encoded, and the echo signal is encoded. In the determination unit, in the analysis frame determined that only the near-end background noise component and the echo signal are present in the transmission signal, an analysis parameter for encoding obtained in advance in the analysis frame determined to be a noise section is used. And an audio encoding means for generating and outputting encoded data corresponding to background noise.

2. The speech encoding method according to claim 1, wherein the speech encoding method of the speech encoding means is CE.
2. The echo processing apparatus according to claim 1, wherein the coded data corresponding to the background noise is an LP coding system, and the coded data corresponding to the background noise is composed of a spectrum parameter, an adaptive excitation code, a driving excitation code, and a gain of both excitation codes.

3. An analysis frame in which only a near-end background noise component and an echo signal are present in the transmission signal by the echo determination means, a plurality of analysis frames previously determined from a plurality of analysis frames determined to be noise sections. 2. The method according to claim 1, wherein the coded spectral parameters are randomly selected and sequentially output.
Or the echo processing apparatus according to claim 2.

4. An analysis frame in which it is determined that only a near-end background noise component and an echo signal are present in a transmission signal by the echo determination means, a plurality of analysis frames previously determined from a plurality of analysis frames determined to be noise sections. 3. The echo processing apparatus according to claim 1, wherein the spectral parameters are randomly selected, encoded, and sequentially output.

5. An analysis frame in which only a near-end background noise component and an echo signal are present in the transmission signal by the echo determination means, averages the spectral parameters in a plurality of analysis frames previously determined to be noise sections. 3. The echo processing apparatus according to claim 1, wherein the dimensional values of the spectral parameters that have been converted and averaged are fluctuated and encoded for each analysis frame and sequentially output.

6. An adaptive excitation code, a driving excitation code, and an encoded gain corresponding to both excitation codes determined in the same analysis frame as the analysis frame from which the selected spectral parameter is determined, and sequentially output. The echo processing device according to any one of claims 2 to 5.

7. An analysis frame in which only a near-end background noise component and an echo signal are present in the transmission signal by the echo determination means, a plurality of analysis frames previously determined from a plurality of analysis frames determined to be noise sections. An adaptive excitation code and a set of coded gains corresponding to the respective adaptive excitation codes are randomly selected and sequentially output, and a plurality of driving excitation codes and a plurality of driving excitation codes determined in advance in a plurality of analysis frames determined to be noise sections in advance. The echo processing apparatus according to any one of claims 2 to 5, wherein a set of coded gains corresponding to the driving excitation code is randomly selected and sequentially output.

8. The echo processing apparatus according to claim 6, wherein a gain of the adaptive excitation code and a gain of the driving excitation code are obtained and held before encoding, and are encoded and output each time they are selected. .

9. A method for modifying the gains of a selected adaptive excitation code and a driving excitation code based on an average power of a target signal obtained in a plurality of analysis frames previously determined to be a noise section, encoding and outputting the modified excitation code and the driving excitation code. The echo processor according to any one of claims 6 to 8.

10. The gain of a driving excitation code after obtaining a gain of an adaptive excitation code from zero to a value close to zero when obtaining the gain of the driving excitation code in an analysis frame previously determined to be a noise section. The echo processing device according to the above.

11. An analysis frame in which it is determined that only a near-end background noise component and an echo signal are present in the transmission signal by the echo determination means, outputs a silence flag in the DTX control means, and outputs a silence flag in the DTX control means. 2. The echo processing apparatus according to claim 1, further comprising a speech encoding unit that operates internally in a section processing mode.

12. The echo processing apparatus according to claim 1, further comprising a voice coding unit for randomly selecting a plurality of spectral parameters obtained in a plurality of analysis frames determined in advance as noise sections, averaging and encoding the resulting parameters, and sequentially outputting the selected parameters. apparatus.

13. An echo determination means for determining a state in which a transmission signal contains more echo components of a received signal reflected by an echo path than voice components of a near-end speaker, and encoding the transmission signal. Encoding means for outputting the encoded signal, based on the determination result of the echo determining means, when the transmission signal contains a large amount of echo components. An echo processing apparatus, comprising: a background noise generating unit that outputs, as transmission data, encoded data corresponding to background noise instead of data as transmission data.