JPH0946233A

JPH0946233A - Sound encoding method/device and sound decoding method/ device

Info

Publication number: JPH0946233A
Application number: JP7195277A
Authority: JP
Inventors: Seiji Sasaki; 誠司佐々木; Osamu Watanabe; 治渡辺; Ichiro Matsumoto; 一郎松本
Original assignee: Kokusai Electric Corp
Current assignee: Kokusai Electric Corp
Priority date: 1995-07-31
Filing date: 1995-07-31
Publication date: 1997-02-14

Abstract

PROBLEM TO BE SOLVED: To transmit a wide band sound signal by transmitting a spectrum envelope information parameter and a sound source information parameter to a first frequency band signal and transmitting only the spectrum envelope information parameter to a second frequency band signal. SOLUTION: A transmission orthogonal mirror filter 60 divides an input sound signal a4 into a low band component c4 and a high band component b4. The component c4 is encoding-processed by a highly efficient sound encoder 62 and a sound information parameter e4 is outputted. On the other hand, the component b4 is processed by a linear analyzer 61 and a linear prediction coefficient d4 for representing spectrum envelope is outputted. The coefficient d4 from the analyzer 61 and the parameter e4 from the encoder 62 are encoded and are made into frames by an encoder 63. Then, a data string f4 being the result is transmitted to a transmission line. Thus, the wide band sound signal can be encoded and transmitted as well in mobile communication taken place by using a radio line limited in information transmission speed.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は電話における音声信
号を符号化・復号する音声符号化・復号方法に関し、特
に、広帯域音声信号の符号化に適した音声符号化・復号
方法、さらには、これを利用した音声符号化・復号装置
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice encoding / decoding method for encoding / decoding a voice signal in a telephone, and more particularly to a voice encoding / decoding method suitable for encoding a wideband voice signal, and further to this. The present invention relates to a voice encoding / decoding device using the.

【０００２】[0002]

【従来の技術】一般に、従来の電話装置では、通信する
音声の伝送を通常の電話帯域（３００〜３４００Ｈｚ）
の信号により行っていたが、近年の高背景雑音下におけ
る電話による通話では、了解性の改善が期待されてい
る。2. Description of the Related Art Generally, in a conventional telephone device, communication voice is transmitted in a normal telephone band (300 to 3400 Hz).
However, it is expected that the intelligibility will be improved in the recent telephone conversations under high background noise.

【０００３】かかる高背景雑音下での通話においての了
解性を改善するためには、伝送する音声信号の帯域を電
話帯域信号（３００〜３４００Ｈｚ）から広帯域（５０
〜７０００Ｈｚ）にすることが有効である。また、かか
る広帯域音声信号の符号化方法としては、例えば、TTC
標準JT−G722として標準化されている、いわゆるSB-ADP
CM（Sub Band‐Adaptive Differencial Pulse Code Mod
ulation）と呼ばれるものがある。In order to improve the intelligibility in a call under such high background noise, the band of the voice signal to be transmitted is wide band (50 to 50 Hz) from the telephone band signal (300 to 3400 Hz).
Up to 7000 Hz) is effective. In addition, as a method of encoding such a wideband speech signal, for example, TTC
So-called SB-ADP standardized as standard JT-G722
CM (Sub Band‐Adaptive Differencial Pulse Code Mod
There is something called ulation).

【０００４】[0004]

【発明が解決しようとする課題】上記従来技術になるSB
一ADPCMによる符号化速度は、通常、４８〜６４ｋｂｐ
ｓであるが、これを移動無線通信へ適用することを考え
た場合、次のような問題点ががあった。すなわち、移動
無線通信では、周波数の有効利用のため、１チャネルが
占有する周波数帯域幅を出来る限り狭くする必要があ
り、このため、音声情報の符号化速度は制限されてしま
う。一例としては、日本の自動車電話システムでの音声
符号化速度（誤り訂正を含む）は、フルレートで１１．
２ｋｂｐｓ、ハーフレートで５．６ｋｂｐｓが用いられ
ている。しかし、上記のSB−ADPCMは、移動無線通信の
音声符号化速度に比較して、その音声符号化速度が４８
〜６４ｋｂｐｓと高いことから、そのまま移動無線通信
システムに使用することは適当ではない。The above-mentioned prior art SB
The encoding speed by one ADPCM is usually 48 to 64 kbp
However, when applying this to mobile radio communication, there were the following problems. That is, in mobile radio communication, in order to effectively use the frequency, it is necessary to make the frequency bandwidth occupied by one channel as narrow as possible, which limits the coding rate of voice information. As an example, the voice coding rate (including error correction) in the Japanese car telephone system is 11.
2 kbps and half rate 5.6 kbps are used. However, the above-mentioned SB-ADPCM has a voice coding rate of 48% as compared with the voice coding rate of mobile radio communication.
Since it is as high as ˜64 kbps, it is not appropriate to use it as it is in a mobile radio communication system.

【０００５】そこで、本発明では、上記の従来技術に鑑
み、情報伝送速度に制限のある無線回線を使用して行わ
れる移動通信においても、広帯域音声信号を符号化して
伝送することを可能とする音声符号化方式及びその装置
に関するものであり、すなわち、高い背景雑音下での通
話の了解性を向上させることが出来る広帯域音声信号の
符号化処理を、移動無線通信に適用可能な低い速度で実
現することの可能な音声符号化・復号方法を提供し、さ
らには、かかる方法を利用した音声符号化装置と音声復
号装置を提供することを目的とする。Therefore, in view of the above-mentioned prior art, the present invention makes it possible to code and transmit a wideband voice signal even in mobile communication performed using a wireless line having a limited information transmission rate. The present invention relates to a voice coding system and a device therefor, that is, a wideband voice signal coding process capable of improving the intelligibility of a call under high background noise is realized at a low speed applicable to mobile radio communication. It is an object of the present invention to provide a speech encoding / decoding method capable of performing the above, and further to provide a speech encoding device and a speech decoding device using such a method.

【０００６】[0006]

【課題を解決するための手段】本発明は、伝送する入力
音声信号を第１の周波数帯域信号と第２の周波数帯域信
号に分割した後、前記第１の周波数帯域信号に対して
は、高能率音声符号化処理を行ってその出力であるスペ
クトル包絡情報パラメータと音源情報パラメータを送信
し、さらに前記第２の周波数帯域信号に対しては、スペ
クトル包絡情報パラメータのみを抽出して送信すること
を特徴とする音声符号化方法を開示する。According to the present invention, after dividing an input voice signal to be transmitted into a first frequency band signal and a second frequency band signal, a high frequency signal is added to the first frequency band signal. Performing an efficient speech coding process, transmitting the spectrum envelope information parameter and the sound source information parameter, which are the outputs, and extracting and transmitting only the spectrum envelope information parameter for the second frequency band signal. A featured speech coding method is disclosed.

【０００７】さらに本発明は、前記記載の音声符号化方
法により符号化されて送信された音声信号を復号する方
法であって、前記送信された音声信号のスペクトル包絡
情報パラメータと音源情報パラメータとを受信し、前記
第１の周波数帯域信号に対しては、当該信号対応のスペ
クトル包絡情報パラメータ及び音源情報パラメータから
高能率音声復号処理により復号を行い、前記第２の周波
数帯域信号に対しては、当該信号対応のスペクトル包絡
情報パラメータ及び前記第１の周波数帯域信号の音源情
報パラメータから推定された擬似音源パラメータを用い
て復号を行い、さらに、これらの復号により生成された
２つの音声信号を加算することにより再生音声信号を合
成することを特徴とする音声復号方法を開示する。Further, the present invention is a method for decoding a voice signal transmitted by being encoded by the voice encoding method described above, wherein the spectral envelope information parameter and the sound source information parameter of the transmitted voice signal are For the first frequency band signal, the first frequency band signal is decoded by a high-efficiency speech decoding process from the spectrum envelope information parameter and the sound source information parameter corresponding to the signal, and for the second frequency band signal, Decoding is performed using the spectral envelope information parameter corresponding to the signal and the pseudo sound source parameter estimated from the sound source information parameter of the first frequency band signal, and further, the two voice signals generated by these decoding are added. Disclosed is a speech decoding method characterized by synthesizing reproduced speech signals.

【０００８】さらに本発明は、伝送する音声信号を第１
の周波数帯域信号と第２の周波数帯域信号に分割する帯
域分割器と、前記帯域分割器で分割された前記第１の周
波数帯域信号を高能率音声符号化処理し、その結果であ
るスペクトル包絡情報パラメータ及び音源情報パラメー
タを出力する高能率音声符号化器と、前記帯域分割器で
分割された第２の周波数帯域信号を線形予測分析し、そ
の結果であるスペクトル包絡情報パラメータのみを出力
する線形予測分析器と、前記高能率音声符号化器から出
力された前記第１の周波数帯域信号のスペクトル包絡情
報パラメータ及び音源情報パラメータと、前記線形予測
分析器から出力された前記第２の周波数帯域信号のスペ
クトル包絡情報パラメータとを符号化し、これらを多重
化して伝送路に送出する符号化器とを備えたことを特徴
とする音声符号化装置を開示する。Furthermore, the present invention provides a first audio signal to be transmitted.
Band divider for dividing the frequency band signal into the second frequency band signal and the second frequency band signal, and the high efficiency speech coding processing of the first frequency band signal divided by the band divider, and the resulting spectrum envelope information. High-efficiency speech encoder that outputs parameters and excitation information parameters, and linear prediction that performs linear prediction analysis on the second frequency band signal divided by the band divider, and outputs only the spectrum envelope information parameter that is the result. An analyzer, a spectrum envelope information parameter and a sound source information parameter of the first frequency band signal output from the high-efficiency speech coder, and a second frequency band signal output from the linear prediction analyzer. Speech coding, characterized in that it has a coder for coding a spectrum envelope information parameter, and multiplexing these and sending them to a transmission path. To disclose the location.

【０００９】さらに本発明は、前記記載の音声符号化装
置により符号化されて送信された音声信号を復号する装
置であって、前記多重化されて伝送路に送出された信号
を受信し、前記第１の周波数帯域信号のスペクトル包絡
情報パラメータ及び音源情報パラメータと前記第２の周
波数帯域信号のスペクトル包絡情報パラメータとを分離
してとり出す分離回路と、前記分離回路からの前記第１
の周波数帯域信号のスペクトル包絡情報パラメータ及び
音源情報パラメータから高能率音声復号処理により前記
第１の周波数帯域信号を再生する高能率音声復号器と、
前記分離回路からの前記第２の周波数帯域信号のスペク
トル包絡情報パラメータと共に、前記第１の周波数帯域
信号の音源情報パラメータに基づいて得られるパラメー
タを前記第２の周波数帯域信号用の音源情報パラメータ
として用い、高能率音声復号処理により前記第２の周波
数帯域信号を再生する周波成分復元回路と、前記再生さ
れた第１及び第２の周波数帯域信号を加算することによ
り再生音声信号を合成する帯域合成器とを備えたことを
特徴とする音声復号装置を開示する。Further, the present invention is a device for decoding a voice signal encoded and transmitted by the voice encoding device described above, wherein the multiplexed signal sent to a transmission line is received, and A separation circuit for separating and extracting a spectrum envelope information parameter and a sound source information parameter of a first frequency band signal and a spectrum envelope information parameter of the second frequency band signal, and the first circuit from the separation circuit.
A high-efficiency speech decoder for reproducing the first frequency band signal from the spectrum envelope information parameter and the sound source information parameter of the frequency band signal by a high-efficiency speech decoding process,
A parameter obtained based on the sound source information parameter of the first frequency band signal together with the spectrum envelope information parameter of the second frequency band signal from the separation circuit is used as the sound source information parameter for the second frequency band signal. A frequency component restoration circuit for reproducing the second frequency band signal by high-efficiency audio decoding processing and a band synthesis for combining the reproduced voice signal by adding the reproduced first and second frequency band signals. Disclosed is a speech decoding device characterized by comprising:

【００１０】[0010]

【発明の実施の形態】以下、本発明の実施の形態につい
て、添付の図面を参照しながら説明を行う。まず、図１
には、本発明の音声符号化方法により広帯域音声信号を
符号化し、有線通信、または無線回線を利用して移動体
との間で行われる移動通信（電話）を実現する装置の全
体構成が示されている。なお、この装置は、図において
一点鎖線で示すように、基本的には、広帯域音声符号化
装置１００と広帯域音声複号装置２００とから構成され
ている。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the accompanying drawings. First, FIG.
1 shows an overall configuration of an apparatus that implements mobile communication (telephone) with a mobile body by encoding a wideband voice signal by the voice encoding method of the present invention and using a wired communication or a wireless line. Has been done. Note that this device is basically composed of a wideband speech coding device 100 and a wideband speech decoding device 200, as indicated by the alternate long and short dash line in the figure.

【００１１】まず、図１の広帯域音声符号化装置１００
は、帯域分割器６０を構成する送信直交ミラーフィルタ
と、線形予測分析器６１と、高性能音声符号化器６２
と、そして、符号化器６３とから構成されている。さら
に、かかる構成の広帯域音声符号化装置１００において
は、受話器のマイクにより入力され、そして、７ｋＨｚ
で帯域制限された後、１６ｋＨｚで標本化され、さら
に、１６ビットで量子化された広帯域の入力音声信号ａ
４は、送信直交ミラーフィルタ６０により、低域成分ｃ
４（０〜４ｋＨｚ）と高域成分ｂ４（４〜７ｋＨｚ）に
分割される。その後、この分割された入力音声信号の低
域成分ｃ４は、一般的な高能率音声符号化器６２によ
り、例えぱ４０ｍｓ長のフレーム単位で符号化処理さ
れ、これにより、音声情報パラメータｅ４が出力され
る。なお、高能率音声符号化器６２の構成例については
後で詳細を説明する。First, the wideband speech coding apparatus 100 shown in FIG.
Is a transmission quadrature mirror filter that constitutes the band divider 60, a linear prediction analyzer 61, and a high performance speech encoder 62.
And an encoder 63. Furthermore, in the wideband speech coding apparatus 100 having such a configuration, the signal is input by the microphone of the handset, and then 7 kHz.
Wideband input audio signal a sampled at 16 kHz and quantized with 16 bits after being band-limited by
4 is a low-frequency component c due to the transmission quadrature mirror filter 60.
4 (0 to 4 kHz) and the high frequency component b4 (4 to 7 kHz). Thereafter, the divided low frequency component c4 of the input voice signal is encoded by the general high efficiency voice encoder 62 in a frame unit of, for example, 40 ms length, whereby the voice information parameter e4 is output. To be done. The configuration example of the high-efficiency speech encoder 62 will be described later in detail.

【００１２】一方、分割された入力音声信号の高域成分
ｂ４は、例えぱ４０ｍｓ長のフレーム単位で線形予測分
析器６１により処理され、スペクトル包絡を表すための
線形予測係数ｄ４が出力される。なお、通常の電話帯域
音声に対する音声符号化処理では分析次数として８〜１
０の次数のものが用いられるが、この線形予測分析器６
１で高域成分を符号化する際には、人間の聴覚特性上は
分解能が低いことから、スペクトル包絡の外形が大まか
に分かる程度に情報を伝送すれぱ良く、そのため、ここ
での分折次数は２次程度でよい。On the other hand, the high-frequency component b4 of the divided input speech signal is processed by the linear prediction analyzer 61 in a frame unit of, for example, 40 ms, and the linear prediction coefficient d4 for expressing the spectrum envelope is output. It should be noted that in the speech coding processing for ordinary telephone band speech, the analysis order is 8 to 1
A linear prediction analyzer 6 having a degree of 0 is used.
When a high frequency component is encoded with 1, the resolution is low due to human auditory characteristics, so it is sufficient to transmit information to the extent that the outline of the spectral envelope can be roughly understood. Therefore, the division order here Is about the second order.

【００１３】上記線形予測分析器６１からの高域成分の
線形予測係数ｄ４と、高能率音声符号化器６２からの低
域成分の音声情報パラメータｅ４は、さらに、符号化器
６３により符号化及びフレーム化が行われ、その結果で
あるデータ列ｆ４が伝送路に送出されることとなる。The high-frequency component linear prediction coefficient d4 from the linear prediction analyzer 61 and the low-frequency component speech information parameter e4 from the high-efficiency speech encoder 62 are further encoded and encoded by the encoder 63. The data is framed, and the resulting data string f4 is sent to the transmission path.

【００１４】次に、上記高能率音声符号化器６２の具体
的な構成の詳細について説明する。この高能率音声符号
化器６２の具体的な構成の一例が図２に示されている。
この図において、入力音声信号の低域成分ｃ４は、ま
ず、線形予測分折器１１により、４０ｍｓのフレーム単
位でかつ分析次数１０で処理され、その結果として、ス
ペクトル包絡情報である線形予測係数ｂ１が出力され
る。続いて、スペクトル包絡パラメータ量子化器１２
は、出力された線形予測係数ｂ１を量子化し、もって、
量子化線形予測係数ｃ１を出力する。スペクトル包絡パ
ラメータ逆量子化器１３は、入力した量子化線形予測係
数ｃ１を逆量子化し、もって、線形予測係数ｄ１を再生
する。さらに、この高能率音声符号化器６２では、フレ
ーム電力計算・正規化器２６により、入力音声信号の低
域成分ｃ４から入力音声信号のフレーム電力が計算さ
れ、それにより入力音声信号を正規化し、もって、フレ
ーム電力情報ｘ１及び正規化された入力音声信号ｗ１を
出力する。Next, details of a specific configuration of the high efficiency speech encoder 62 will be described. FIG. 2 shows an example of a specific configuration of the high efficiency speech encoder 62.
In this figure, the low-frequency component c4 of the input speech signal is first processed by the linear prediction splitter 11 in 40 ms frame units and in the analysis order 10, and as a result, the linear prediction coefficient b1 which is the spectral envelope information. Is output. Then, the spectrum envelope parameter quantizer 12
Quantizes the output linear prediction coefficient b1, and
The quantized linear prediction coefficient c1 is output. The spectral envelope parameter dequantizer 13 dequantizes the input quantized linear prediction coefficient c1 to reproduce the linear prediction coefficient d1. Further, in this high-efficiency speech encoder 62, the frame power calculator / normalizer 26 calculates the frame power of the input speech signal from the low-frequency component c4 of the input speech signal, thereby normalizing the input speech signal, Accordingly, the frame power information x1 and the normalized input audio signal w1 are output.

【００１５】続いて、局部復号器２５は、図からも明ら
かなように、適応符号帳１４、雑音符号帳１５、利得符
号帳１６、乗算器１７、１８、加算器１９、そして、線
形予測フィルタ２０によって構成されており、最終的に
は、例えば１０ｍｓ長のサブフレーム単位で、再生信号
ｎ１を合成して出力するものである。この局部復号器２
５の内部構成について説明すると、まず、適応符号帳１
４は、音源信号の周期性を有する波形パターンを、例え
ば２５６種類（ベクトル数＝２５６、ベクトルの次元＝
８０）蓄積しており、同様に、雑音符号帳１５は、音源
信号の雑音性を有する波形パターンを、例えば５１２種
類（ベクトル数：５１２、ベクトルの次元：８０）蓄積
している。さらに、利得符号帳１６は、上記の適応符号
帳１４と雑音符号帳１５に乗算される利得の組を、例え
ば１２８種類（ベクトル数：１２８、ベクトルの次元：
２）蓄積している。また、上記高能率音声符号化器６２
は、さらに、聴覚重み付けフィルタ２１、距離計算器２
２、符号帳探索制御器２３、そして符号化器２４を備え
ている。Subsequently, the local decoder 25, as is clear from the figure, has an adaptive codebook 14, a noise codebook 15, a gain codebook 16, multipliers 17 and 18, an adder 19, and a linear prediction filter. Finally, the reproduction signal n1 is finally combined and output in units of subframes having a length of 10 ms, for example. This local decoder 2
The internal configuration of 5 will be described. First, the adaptive codebook 1
4 is a waveform pattern having periodicity of the sound source signal, for example, 256 types (vector number = 256, vector dimension =
80) are accumulated, and similarly, the noise codebook 15 accumulates, for example, 512 types of waveform patterns having noise characteristics of the excitation signal (vector number: 512, vector dimension: 80). Further, the gain codebook 16 sets, for example, 128 types of gain sets by which the adaptive codebook 14 and the noise codebook 15 are multiplied (the number of vectors: 128, the dimension of vector:
2) It is accumulating. In addition, the high efficiency speech encoder 62
Further includes a hearing weighting filter 21 and a distance calculator 2.
2, a codebook search controller 23, and an encoder 24.

【００１６】かかる構成において、上記の適応符号帳１
４及び雑音符号帳１５は、それぞれ、符号帳探索制御器
２３から出力される制御信号ｈ１、ｉ１に従い、信号波
形の形状を表す形状ベクトルｅ１とベクトルの番号を表
すインデックスｑ１、そして、信号波形の形状を表す形
状ベクトルｆ１とベクトルの番号を表すインデックスｒ
１を出力する。ここで、ｑ１はピッチ周期に対応する。
また、上記利得符号帳１６は、上述と同様に、符号帳探
索制御器２３から出力される制御信号ｊ１に従い、適応
符号帳１４の乗算器１７に対する利得ｇ１、雑音符号帳
１５の乗算器１８に対する利得ｕ１、及び、インデック
スｓ１を出力する。これにより乗算器１７、１８は、形
状ベクトルｅ１、ｆ１と利得ｇ１、ｕ１とをそれぞれ乗
算し、もって、利得調整された形状ベクトルｋ１、ｌ１
を出力する。また、加算器１９はこれらの形状ベクトル
ｋ１と形状ベクトルｌ１を加算し、音源信号ｍ１を出力
する。In such a configuration, the above adaptive codebook 1
4 and the random codebook 15, according to the control signals h1 and i1 output from the codebook search controller 23, respectively, a shape vector e1 indicating the shape of the signal waveform, an index q1 indicating the number of the vector, and a signal waveform Shape vector f1 representing the shape and index r representing the vector number
Outputs 1. Here, q1 corresponds to the pitch period.
Further, the gain codebook 16 has the gain g1 for the multiplier 17 of the adaptive codebook 14 and the multiplier 18 of the noise codebook 15 according to the control signal j1 output from the codebook search controller 23, as described above. The gain u1 and the index s1 are output. As a result, the multipliers 17 and 18 multiply the shape vectors e1 and f1 by the gains g1 and u1, respectively, so that the gain-adjusted shape vectors k1 and l1 are obtained.
Is output. Further, the adder 19 adds the shape vector k1 and the shape vector l1 and outputs a sound source signal m1.

【００１７】この音源信号ｍ１を入力とする上記線形予
測合成フィルタ２０は、上記スペクトル包絡パラメータ
逆量子化器１３により逆量子化された線形予測係数ｄ１
を用い、これによりスペクトル包絡特性を音源信号に付
加し、再生信号ｎ１を合成して出力する。また、聴覚重
み付けフィルタ２１は、上記線形予測分折器１１からの
線形予測係数ｂ１を用い、上記フレーム電力計算・正規
化器２６で正規化された入力信号ｗ１及び局部復号器２
５の出力である再生信号ｎ１を聴覚重み付けし、これに
より、聴覚重み付けされた入力信号ｏ１と局部復号器出
力ｖ１とを出力する。距離計算器２２は、上記聴覚重み
付けされた入力信号ｏ１と局部復号器出力ｖ１からこれ
らの距離ｐ１を計算する。符号帳探索制御器２３は、計
算された距離ｐ１を人力とし、この距離ｐが最小となる
ようなベクトルを出力するように、上記局部復号器２５
の適応符号帳１４、雑音符号帳１５、そして、利得符号
帳１６を制御する。The linear prediction synthesis filter 20 having the excitation signal m1 as an input, the linear prediction coefficient d1 dequantized by the spectrum envelope parameter dequantizer 13 is used.
, The spectral envelope characteristic is added to the sound source signal, and the reproduced signal n1 is synthesized and output. Further, the perceptual weighting filter 21 uses the linear prediction coefficient b1 from the linear prediction divider 11 and the input signal w1 normalized by the frame power calculator / normalizer 26 and the local decoder 2
5, the reproduced signal n1 which is the output of FIG. 5 is perceptually weighted, thereby outputting the perceptually weighted input signal o1 and the local decoder output v1. The distance calculator 22 calculates these distances p1 from the perceptually weighted input signal o1 and the local decoder output v1. The codebook search controller 23 uses the calculated distance p1 as a human power, and outputs the vector that minimizes the distance p so as to output the local decoder 25.
The adaptive codebook 14, the noise codebook 15, and the gain codebook 16 are controlled.

【００１８】また、上記スペクトル包絡パラメータ量子
化器１２、適応符号帳１４、雑音符号帳１５、利得符号
帳１６、及びフレーム電力計算・正規化器２６から出力
される伝送パラメータである量子化線形予測係数ｃ１、
インデックスｑ１、ｒ１、ｓ１及びフレーム電力情報ｘ
１は、符号化器２４により、符号化、フレーム化され、
これにより伝送データ列ｔ１となる。なお、この符号化
器２４からの出力である伝送データ列ｔ１は、図１にお
ける音声情報パラメータｅ４に対応する。Quantized linear prediction, which is a transmission parameter output from the spectrum envelope parameter quantizer 12, adaptive codebook 14, noise codebook 15, gain codebook 16, and frame power calculator / normalizer 26. Coefficient c1,
Indexes q1, r1, s1 and frame power information x
1 is encoded and framed by the encoder 24,
As a result, the transmission data string t1 is obtained. The transmission data string t1 output from the encoder 24 corresponds to the voice information parameter e4 in FIG.

【００１９】次に、本発明の広帯域音声復号装置２００
について説明する。この広帯域音声復号装置２００は上
記図１にその概略構成が示されており、さらに、添付の
図３には、その詳細な構成が示されている。これらの図
からも明らかなように、この広帯域音声復号装置２００
は、分離回路７０と、高能率音声復号器７１と、高周波
成分復元回路７２、そして、帯域合成器８２を構成する
送信直交ミラーフィルタとから構成されている。Next, the wideband speech decoding apparatus 200 of the present invention.
Will be described. The schematic configuration of the wideband speech decoding apparatus 200 is shown in FIG. 1, and the detailed configuration thereof is shown in FIG. 3 attached hereto. As is clear from these figures, this wideband speech decoding device 200
Is composed of a separation circuit 70, a high-efficiency speech decoder 71, a high-frequency component restoration circuit 72, and a transmission quadrature mirror filter which constitutes a band synthesizer 82.

【００２０】ここでは、この広帯域音声復号装置２００
の詳細について、図３を参照しながら説明する。高周波
成分復元回路７２は、利得補正回路７３と、適応符号帳
７５と、利得補正回路７７と、雑音発生器７８と、２つ
の乗算器７６、７９と、加算器８０と、線形予測合成フ
ィルタ８１と、利得補正回路８３と、そして、乗算器８
４とから構成されている。分離回路７０からは６種の信
号、ｂ５、ｃ５、ｄ５、ｅ５、ｆ５、ｒ５が出力され、
また、高能率音声復号器７１からは信号ｇ５、ｓ５、ｔ
５が出力されている。Here, this wideband speech decoding apparatus 200 is used.
Will be described in detail with reference to FIG. The high frequency component restoration circuit 72 includes a gain correction circuit 73, an adaptive codebook 75, a gain correction circuit 77, a noise generator 78, two multipliers 76 and 79, an adder 80, and a linear prediction synthesis filter 81. , Gain correction circuit 83, and multiplier 8
And 4. The separation circuit 70 outputs six types of signals, b5, c5, d5, e5, f5, and r5,
In addition, signals g5, s5, t from the high-efficiency speech decoder 71
5 is output.

【００２１】かかる構成の広帯域音声復号装置２００に
おいて、まず、有線または無線回線により伝送されてき
たデータ列ａ５は、分離回路７０により分離、復元さ
れ、もって、高域成分の線形予測係数ｂ５、低域成分の
線形予測係数ｄ５、音源情報である適応符号帳に対応す
るインデックスｃ５、雑音符号帳に対応するインデック
スｅ５、利得符号帳に対応するインデックスｆ５、及
び、フレーム電力情報ｒ５が再生される。信号はこの順
に、図１の信号ｄ４、図２の信号ｃ１、ｑ１、ｒ１、ｓ
１、ｘ１に対応する復号信号であり、これらのうち、信
号ｄ５、ｃ５、ｅ５、ｆ５、及びｒ５を入力として高能
率音声復号器７１は、低域成分を再生し、信号ｇ５とし
てこれを出力する。なお、この高能率音声復号器７１
は、一般的に電話帯域音声の復号処理に用いられるもの
であり、その詳細な構成を、添付の図４に示す。In the wideband speech decoding apparatus 200 having such a configuration, first, the data string a5 transmitted by the wired or wireless line is separated and restored by the separation circuit 70, so that the linear prediction coefficient b5 of the high frequency component and low The linear prediction coefficient d5 of the band component, the index c5 corresponding to the adaptive codebook which is the excitation information, the index e5 corresponding to the noise codebook, the index f5 corresponding to the gain codebook, and the frame power information r5 are reproduced. The signals are, in this order, the signal d4 of FIG. 1 and the signals c1, q1, r1, s of FIG.
1 and x1, which are decoded signals, of which the signals d5, c5, e5, f5, and r5 are input, and the high-efficiency audio decoder 71 reproduces the low frequency component and outputs this as the signal g5. To do. Note that this high-efficiency speech decoder 71
Is generally used for the decoding processing of telephone band voice, and its detailed configuration is shown in FIG. 4 attached.

【００２２】すなわち、この図４の高能率音声復号器７
１では、適応符号帳３１、雑音符号帳３２、利得符号帳
３３は、それぞれ、図２に符号１４、１５、１６で示し
たものと同様の同様の内容のデータを蓄積しており、こ
れにより、インデックスｃ５、ｅ５、ｆ５を入力とし
て、形状ペクトルｆ２、ｇ２、利得ｊ２、ｋ２を出力す
る。さらに、乗算器３４、３５は、上記の形状ペクトル
と利得をそれぞれ乗算し、すなわち、形状ペクトルｆ２
と利得ｊ２（ｆ２×ｊ２）、形状ペクトルｇ２と利得ｋ
２（ｇ２×ｋ２）を乗算し、もって、利得調整された形
状ベクトルｈ２、ｉ２を出力する。一方、加算器３９
は、上記の形状ベクトルｈ２、ｉ２を加算し、音源信号
ｌ２を出力する。ここまでが高能率音声復号器７１にお
けるサブフレーム（１０ｍｓ）単位の処理であり、以
後、ポストフィルタ３８までの処理はフレーム（４０ｍ
ｓ）単位で実行される。That is, the high efficiency speech decoder 7 of FIG.
1, the adaptive codebook 31, the noise codebook 32, and the gain codebook 33 store data of similar contents to those shown by reference numerals 14, 15, and 16 in FIG. 2, respectively. , And inputs the indexes c5, e5, and f5, and outputs the shape vectors f2 and g2 and the gains j2 and k2. Further, the multipliers 34 and 35 respectively multiply the above-described shape vector and the gain, that is, the shape vector f2.
And gain j2 (f2 × j2), shape vector g2 and gain k
2 (g2 × k2) is multiplied, and the gain-adjusted shape vectors h2 and i2 are output. On the other hand, the adder 39
Outputs the sound source signal 12 by adding the above shape vectors h2 and i2. The processing up to this point has been performed in units of subframes (10 ms) in the high-efficiency speech decoder 71, and thereafter, the processing up to the post filter 38 is performed in frames (40 m).
s) is executed in units.

【００２３】線形予測合成フィルタ３６は、量子化線形
予測係数ｄ５を用いてスペクトル包絡特性を音源信号ｌ
２に付加し、これにより、再生信号ｍ２を合成する。そ
の後、乗算器３７はこの再生信号ｍ２とフレーム電力精
報ｒ５を乗算し、これによりフレーム電力情報を付加し
た再生信号ｏ２を出力する。そして、ポストフィルタ３
８は、量子化線形予測係数ｄ５を用いて、再生信号ｏ２
の聴感上の品質を向上させるためのポストフィルタリン
グ処理を行い、もって高能率音声復号器７１の再生出力
である低域成分ｇ５を出力する。また、この高能率音声
復号器７１の利得符号帳３３から出力される利得ｊ２、
ｋ２は、上記図３において、高域周波数成分復元回路７
２へ信号ｓ５、ｔ５として入力されている。The linear predictive synthesis filter 36 uses the quantized linear predictive coefficient d5 to determine the spectrum envelope characteristic of the sound source signal l.
2 and the reproduced signal m2 is synthesized. After that, the multiplier 37 multiplies the reproduction signal m2 by the frame power information r5, and outputs the reproduction signal o2 to which the frame power information is added. And the post filter 3
8 uses the quantized linear prediction coefficient d5 to reproduce signal o2
The post-filtering process for improving the perceptual quality is performed to output the low-frequency component g5 which is the reproduction output of the high-efficiency audio decoder 71. In addition, the gain j2 output from the gain codebook 33 of the high efficiency speech decoder 71,
k2 is the high frequency component restoration circuit 7 in FIG.
2 are input as signals s5 and t5.

【００２４】再び、図３に戻り、広帯域音声復号装置２
００を構成する高域周波数成分復元回路７２は、上記分
離回路７０からの高域成分の線形予測係数ｂ５、低域成
分の適応符号長のインデックスｃ５、そして、上記高能
率音声符号器７１内で発生された利得符号帳出力ｓ５、
ｔ５、さらには、フレーム電力情報ｒ５を利用し、もっ
て、高域成分を再生し、これを信号ｘ５として出力す
る。すなわち、これを言い換えれば、高周波数帯域信号
に対応するスペクトル包絡情報パラメータと、前記低周
波数帯域信号の音源情報パラメータから推定された擬似
の音源パラメータを用いて、高周波数帯域信号を再生す
るのである。Returning to FIG. 3 again, the wideband speech decoding device 2
The high frequency component restoring circuit 72 that composes 00 is a high-frequency component linear prediction coefficient b5 from the separating circuit 70, an adaptive code length index c5 of the low-frequency component, and the high-efficiency speech encoder 71. The generated gain codebook output s5,
At t5, further, the frame power information r5 is used to reproduce the high frequency component and output it as the signal x5. In other words, in other words, by using the spectral envelope information parameter corresponding to the high frequency band signal and the pseudo sound source parameter estimated from the sound source information parameter of the low frequency band signal, the high frequency band signal is reproduced. .

【００２５】このような働きの高域周波数成分復元回路
７２の構成について、以下に説明すると、まず、適応符
号帳７５は分離回路７０からのインデックスｃ５を入力
し、周期性を有する波形の形状ベクトルｊ５を出カす
る。利得補正回路７３は、上記高能率音声符号器７１か
らの適応符号帳の利得ｓ５に係数を乗算して補正した係
数ｈ５を出力する。なお、ここで用いる係数は、高域で
は低域に比較し周期性が低いことを考慮して、１．０よ
り小さい値に設定する。続いて、乗算器７６は、上記の
形状ベクトルｊ５に上記の補正された利得ｈ５を乗算
し、もって、利得調整された形状ベクトルｋ５を出力す
る。The structure of the high frequency component restoring circuit 72 having such a function will be described below. First, the adaptive codebook 75 receives the index c5 from the separating circuit 70 and receives the waveform shape vector having periodicity. Output j5. The gain correction circuit 73 multiplies the gain s5 of the adaptive codebook from the high efficiency speech encoder 71 by a coefficient and outputs a corrected coefficient h5. It should be noted that the coefficient used here is set to a value smaller than 1.0 in consideration of the low periodicity in the high range as compared with the low range. Then, the multiplier 76 multiplies the shape vector j5 by the corrected gain h5, and outputs the gain-adjusted shape vector k5.

【００２６】一方、利得補正回路７７は、上記高能率音
声符号器７１からの雑音符号帳の利得ｔ５に係数を乗算
して補正した利得ｌ５を出力する。なお、ここで用いる
係数は、高域が強調され過ぎないように、やはり１．０
より小さい値に設定される。続いて、乗算器７９は、雑
音発生器７８から出力される白色雑音ｍ５に上記の補正
された利得ｌ５を乗算し、もって、利得調整された形状
ベクトルｎ５を出力する。その後、加算器８０は、上記
２つの乗算器７６、７９からの出力ｋ５とｎ５とを加算
し、これにより音源信号ｏ５を出力する。ここまでが、
上記高域周波数成分復元回路７２におけるサブフレーム
（１０ｍｓ）単位の処理であり、これ以後の処理は、フ
レーム（４０ｍｓ）単位で実行される。すなわち、線形
予測合成フィルタ８１は、量子化線形予測係数ｂ５を用
いてスペクトル包絡特性を音源信号ｏ５に付加し、もっ
て、再生信号ｖ５を合成する。On the other hand, the gain correction circuit 77 outputs a corrected gain l5 obtained by multiplying the gain t5 of the noise codebook from the high efficiency speech encoder 71 by a coefficient. Note that the coefficient used here is also 1.0 so that the high frequencies are not overemphasized.
Set to a smaller value. Then, the multiplier 79 multiplies the white noise m5 output from the noise generator 78 by the above-described corrected gain l5, and outputs the gain-adjusted shape vector n5. After that, the adder 80 adds the outputs k5 and n5 from the two multipliers 76 and 79, and outputs the sound source signal o5. So far,
This is processing in the sub-frame (10 ms) unit in the high frequency component restoring circuit 72, and the subsequent processing is executed in frame (40 ms) units. That is, the linear prediction synthesis filter 81 adds the spectrum envelope characteristic to the sound source signal o5 by using the quantized linear prediction coefficient b5, and thus synthesizes the reproduction signal v5.

【００２７】さらに、上記高域周波数成分復元回路７２
では、利得補正回路８３が上記分離回路７０からのフレ
ーム電力情報ｒ５に係数を乗算して補正したフレーム電
力情報ｗ５を出力している。なお、ここで用いる係数
は、やはり高域が強調され過ぎないように１．０より小
さい値が設定されている。そして、乗算器８４は、上記
の再生信号ｖ５と上記補正されたフレーム電力情報ｗ５
を乗算し、もって、フレーム電力情報を付加した再生信
号ｘ５を出力する。その後、図１に示したように帯域合
成器８２を構成する受信直交ミラーフィルタにより、再
生された低域（０〜４ｋＨｚ）成分ｇ５と高域（４〜７
ｋＨｚ）成分ｘ５とを合成し、もって、再生された広帯
域音声信号ｑ５を出力することなる。Further, the high frequency component restoration circuit 72
Then, the gain correction circuit 83 outputs the corrected frame power information w5 by multiplying the frame power information r5 from the separation circuit 70 by a coefficient. The coefficient used here is set to a value smaller than 1.0 so that the high frequencies are not emphasized too much. Then, the multiplier 84 uses the reproduction signal v5 and the corrected frame power information w5.
And the reproduction signal x5 to which the frame power information is added is output. After that, as shown in FIG. 1, by the reception quadrature mirror filter that constitutes the band synthesizer 82, the reproduced low frequency component (0 to 4 kHz) g5 and high frequency component (4 to 7).
(kHz) component x5, and the reproduced wideband audio signal q5 is output.

【００２８】以上に詳述した広帯域音声符号化装置１０
０及び広帯域音声復号装置２００を備えた装置における
音声情報符号化速度は、下記の表１のようになる。Wideband speech coding apparatus 10 described in detail above.
0 and the speech information coding rate in the apparatus including the wideband speech decoding apparatus 200 are as shown in Table 1 below.

【表１】この表１からも明らかなように、音声情報符号化速度
は、１３７×２５フレーム＝３．４２５ｋｂｐｓとな
る。また、伝送路符号化をも含めた全体の符号化速度を
５．６ｋｂｐｓとすれば、残りの２．１７５ｋｂｐｓを
誤り訂正などの対策に使用することが出来る。[Table 1] As is clear from Table 1, the audio information coding rate is 137 × 25 frames = 3.425 kbps. Further, if the overall coding rate including the transmission path coding is 5.6 kbps, the remaining 2.175 kbps can be used for countermeasures such as error correction.

【００２９】なお、上記に詳述した本発明になる装置で
は、受信側では高域成分を近似（疑似）的に再生してい
るため、再生された高域成分には多少の歪みが発生す
る。この歪みは受信側での背景雑音が小さい時程目立つ
ようになる。そのため背景雑音が小さい環境下では、再
生出力として広帯域信号よりも低域成分信号のみを出力
したほうが望ましいといえる。従って、本発明の広帯域
音声復号装置（図３）に対し、さらに、図５に示すよう
な広帯域再生信号／低域再生信号出力切替え器９０を付
加することが堤案されている。この図５において、出力
切替え器９０は、上記図３の広帯域再生信号ｑ５及び低
域再生信号ｇ５を入力し、制御信号ａ６により、上記の
２つの信号ｑ５、ｇ５のうちのどちらか一方を出力でき
るような切替え機能を有するものであり、その出力を再
生信号ｂ６とする。In the apparatus according to the present invention described in detail above, since the high frequency component is reproduced approximately (pseudo) on the receiving side, some distortion occurs in the reproduced high frequency component. . This distortion becomes more noticeable when the background noise on the receiving side is small. Therefore, in an environment where the background noise is small, it can be said that it is desirable to output only the low-frequency component signal as a reproduction output rather than the wideband signal. Therefore, it is proposed to add a wideband reproduction signal / low range reproduction signal output switch 90 as shown in FIG. 5 to the wideband speech decoding apparatus (FIG. 3) of the present invention. In FIG. 5, the output switcher 90 receives the wideband reproduction signal q5 and the low-frequency reproduction signal g5 shown in FIG. 3, and outputs either one of the above two signals q5 and g5 by the control signal a6. It has such a switching function that its output is the reproduction signal b6.

【００３０】また、上記の広帯域再生信号／低域再生信
号出力切替え器９０において、どちらの信号を違択する
かの情報である制御信号ａ６の発生方法としては、例え
ば手動式と自動式が考えられる。前者は、例えば受聴者
が受信装置の操作パネル上のスイッチを手動で操作する
とにより発生させるものであり、後者は、背景雑音パワ
ーの測定結果基づいて自動的に発生させるものである。Further, in the wide band reproduction signal / low range reproduction signal output switch 90, as a method of generating the control signal a6 which is information as to which signal is to be selected, for example, a manual type and an automatic type are considered. To be The former is generated, for example, by a listener manually operating a switch on the operation panel of the receiving device, and the latter is automatically generated based on the measurement result of the background noise power.

【００３１】さらに、上記の実施例においては、広帯域
音声入力信号の分割した低周波数帯域と高周波数帯域の
うち、特に、高周波数帯域については、送信側では、線
形予測分析器６１により、そのスペクトル包絡情報パラ
メータのみを抽出して送信することとし、また、受信側
では、高周波成分復元回路７２において、高周波数帯域
信号に対応する音源パラメータを低周波数帯域信号の音
源情報パラメータから擬似的に推定し、これを利用して
高周波数帯域信号を再生している。しかしながら、本発
明はこのような実施例だけに限定されるものではなく、
上記の実施例とは逆に、送信側では、線形予測分析器に
より、低周波数帯域信号のスペクトル包絡情報パラメー
タのみを抽出して送信することとし、受信側では、周波
成分復元回路において、低周波数帯域信号に対応する音
源パラメータを高周波数帯域信号の音源情報パラメータ
から擬似的に推定し、これを利用して低周波数帯域信号
を再生するようにすることも可能である。Further, in the above embodiment, of the divided low-frequency band and high-frequency band of the wideband speech input signal, particularly the high-frequency band, on the transmitting side, the spectrum thereof is determined by the linear prediction analyzer 61. It is assumed that only the envelope information parameter is extracted and transmitted, and on the receiving side, in the high frequency component restoration circuit 72, the sound source parameter corresponding to the high frequency band signal is pseudo-estimated from the sound source information parameter of the low frequency band signal. , Is used to reproduce high frequency band signals. However, the present invention is not limited to such an embodiment,
On the contrary to the above embodiment, on the transmission side, the linear prediction analyzer is used to extract and transmit only the spectrum envelope information parameter of the low frequency band signal, and on the reception side, in the frequency component restoration circuit, the low frequency It is also possible to artificially estimate the sound source parameter corresponding to the band signal from the sound source information parameter of the high frequency band signal, and use this to reproduce the low frequency band signal.

【００３２】[0032]

【発明の効果】以上の詳細な説明からも明らかなよう
に、本発明になる音声符号化・復号方法、及び、これを
利用した音声符号化・復号装置によれば、広帯域音声信
号の符号化処理及び復号処理を、移動無線通信または有
線通信に適用可能な低速度で簡易に実現でき、これによ
り、情報伝送速度に制限のある無線回線を使用して行わ
れる移動通信においても広帯域音声信号を符号化して伝
送することを可能とし、もって、高い背景雑音下での通
話の了解性を向上させることが可能となるいう、技術的
にも極めて優れた効果を発揮する。As is clear from the above detailed description, according to the audio encoding / decoding method and the audio encoding / decoding apparatus using the same of the present invention, the encoding of the wideband audio signal is performed. The processing and decoding processing can be easily realized at a low speed applicable to mobile wireless communication or wired communication, which enables broadband audio signals to be transmitted even in mobile communication performed using a wireless line having a limited information transmission speed. This makes it possible to code and transmit, and thus it is possible to improve the intelligibility of a call under high background noise, which is an extremely technically excellent effect.

[Brief description of drawings]

【図１】本発明の音声符号化・復号方法を実施するため
の広帯域音声符号化・復号装置のブロック図である。FIG. 1 is a block diagram of a wideband speech encoding / decoding device for implementing a speech encoding / decoding method of the present invention.

【図２】上記図１に示した広帯域音声符号化・復号装置
の高能率音声符号化器の詳細構造を示すためのブロック
図である。FIG. 2 is a block diagram showing a detailed structure of a high-efficiency speech encoder of the wideband speech encoding / decoding apparatus shown in FIG.

【図３】上記図１に示した装置の広帯域音声復号装置の
構造を示すブロック図である。3 is a block diagram showing a structure of a wideband speech decoding apparatus of the apparatus shown in FIG.

【図４】上記図１に示した広帯域音声復号装置の高能率
音声復号器の詳細を示すブロック図である。FIG. 4 is a block diagram showing details of a high-efficiency speech decoder of the wideband speech decoding device shown in FIG.

【図５】本発明の変形例になる広帯域／低域再生信号出
力切替器の具体的構成を示すブロック図である。FIG. 5 is a block diagram showing a specific configuration of a wideband / lowband reproduction signal output switch according to a modified example of the present invention.

[Explanation of symbols]

６０帯域分割器６１線形予測分析器６２高性能音声符号化器６３符号化器７１高能率音声復号器７２高周波成分復元回路８２帯域合成器９０出力切替器１００広帯域音声符号化装置２００広帯域音声復号装置 60 band divider 61 linear prediction analyzer 62 high-performance speech encoder 63 encoder 71 high-efficiency speech decoder 72 high-frequency component restoration circuit 82 band synthesizer 90 output switcher 100 wide-band speech encoder 200 wide-band speech decoder

Claims

[Claims]

1. An input speech signal is divided into a first frequency band signal and a second frequency band signal, and then the first frequency band signal is subjected to high-efficiency speech coding processing and its output. The speech coding method is characterized in that the spectrum envelope information parameter and the sound source information parameter are transmitted, and for the second frequency band signal, only the spectrum envelope information parameter is extracted and transmitted.

2. The speech coding method according to claim 1, wherein the first frequency band signal is a low frequency band signal and the second frequency band signal is a high frequency band signal. Speech coding method.

3. The speech coding method according to claim 1, wherein the first frequency band signal is a high frequency band signal and the second frequency band signal is a low frequency band signal. Speech coding method.

4. A method for decoding a voice signal transmitted by being encoded by the voice encoding method according to claim 1, comprising a spectrum envelope information parameter and a sound source information parameter of the transmitted voice signal. Received the first
For the frequency band signal, the decoding is performed by the high-efficiency speech decoding process from the spectrum envelope information parameter and the sound source information parameter corresponding to the signal, and for the second frequency band signal, the spectrum envelope corresponding to the signal. Decoding is performed using the information parameter and the pseudo sound source parameter estimated from the sound source information parameter of the first frequency band signal, and further, the reproduced sound signal is obtained by adding the two sound signals generated by these decoding. A speech decoding method characterized by synthesizing.

5. The audio decoding method according to claim 4, wherein the synthesized reproduced audio signal and the audio signal generated by decoding the first frequency band signal can be switched and output. A voice decoding method characterized by.

6. The speech decoding method according to claim 4 or 5, wherein the first frequency band signal is a low frequency band signal, and the second frequency band signal is a high frequency band signal. Characteristic speech decoding method.

7. The speech decoding method according to claim 4 or 5, wherein the first frequency band signal is a high frequency band signal and the second frequency band signal is a low frequency band signal. Characteristic speech decoding method.

8. An audio signal is a first frequency band signal and a second frequency band signal.
And a first frequency band signal divided by the band divider for high efficiency speech coding, and outputs the resulting spectrum envelope information parameter and excitation information parameter. A high-efficiency speech coder, a linear prediction analyzer that performs linear prediction analysis on the second frequency band signal divided by the band divider, and outputs a spectrum envelope information parameter as a result, the high-efficiency speech Encoding the spectrum envelope information parameter and the excitation information parameter of the first frequency band signal output from the encoder, and the spectrum envelope information parameter of the second frequency band signal output from the linear prediction analyzer And a coder that multiplexes these and sends them to a transmission path.

9. The speech coding apparatus according to claim 8, wherein the first frequency band signal is a low frequency band signal, and the second frequency band signal is a high frequency band signal. Speech coding device.

10. The speech coding apparatus according to claim 8, wherein the first frequency band signal is a high frequency band signal, and the second frequency band signal is a low frequency band signal. Speech coding device.

11. A device for decoding a voice signal encoded and transmitted by the voice encoding device according to claim 8, wherein the device receives the multiplexed and transmitted signal on a transmission line, A separation circuit that separates and extracts a spectrum envelope information parameter and a sound source information parameter of the first frequency band signal and a spectrum envelope information parameter of the second frequency band signal; and the first frequency from the separation circuit. A high-efficiency speech decoder that reproduces the first frequency band signal by a high-efficiency speech decoding process from a spectrum envelope information parameter and a sound source information parameter of the band signal, and a spectrum envelope of the second frequency band signal from the separation circuit. Together with the information parameter, a parameter obtained based on the sound source information parameter of the first frequency band signal is set to the second frequency. By using as a sound source information parameter for a band signal, a frequency component restoration circuit that reproduces the second frequency band signal by high-efficiency audio decoding processing, and by adding the reproduced first and second frequency band signals. A voice decoding device comprising: a band synthesizer for synthesizing a reproduced voice signal.

12. The audio decoding device according to claim 11, wherein the output switching for switching and outputting the synthesized reproduced audio signal and the audio signal generated by decoding the first frequency band signal. A speech decoding apparatus characterized by comprising a receiver.

13. The speech decoding device according to claim 11, wherein the first frequency band signal is a low frequency band signal and the second frequency band signal is a high frequency band signal. A characteristic voice decoding device.

14. The speech decoding apparatus according to claim 11 or 12, wherein the first frequency band signal is a high frequency band signal and the second frequency band signal is a low frequency band signal. A characteristic voice decoding device.