JP4977471B2

JP4977471B2 - Encoding apparatus and encoding method

Info

Publication number: JP4977471B2
Application number: JP2006542421A
Authority: JP
Inventors: 正浩押切; 宏幸江原; 幸司吉田
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 2004-11-05
Filing date: 2005-11-02
Publication date: 2012-07-18
Anticipated expiration: 2025-11-02
Also published as: EP1798724A1; EP2752849B1; RU2387024C2; CN102201242A; US8204745B2; RU2500043C2; WO2006049204A1; US7769584B2; US20100256980A1; EP2752843A1; US20080052066A1; CN102184734B; BRPI0517716A; BRPI0517716B1; CN102184734A; CN101048814B; EP1798724B1; RU2009147514A; JPWO2006049204A1; KR101220621B1

Description

本発明は、音声信号、オーディオ信号等を符号化する符号化装置及び符号化方法に関する。 The present invention, audio signal relates to a coding apparatus and coding how that turn into codes an audio signal and the like.

移動体通信システムにおける電波資源等の有効利用のために、音声信号を低ビットレートで圧縮することが要求されている。その一方で、ユーザからは通話音声の品質向上や臨場感の高い通話サービスの実現が望まれている。この実現には、音声信号の高品質化のみならず、より帯域の広いオーディオ信号等の音声以外の信号をも高品質に符号化できることが望ましい。 In order to effectively use radio resources and the like in mobile communication systems, it is required to compress audio signals at a low bit rate. On the other hand, users are demanded to improve the quality of call voice and realize a call service with a high presence. For this realization, it is desirable not only to improve the quality of the audio signal, but also to encode a signal other than audio such as an audio signal having a wider bandwidth with high quality.

このような相反する要求に対し、複数の符号化技術を階層的に統合するアプローチが有望視されている。具体的には、音声信号に適したモデルで入力信号を低ビットレートで符号化する第１レイヤ部と、入力信号と第１レイヤ復号信号との残差信号を音声以外の信号にも適したモデルで符号化する第２レイヤ部を階層的に組み合わせる構成を採る。このような階層構造を持つ符号化方式は、符号化部より得られるビットストリームにスケーラビリティ性（ビットストリームの一部の情報からでも復号信号が得られること）を有するため、スケーラブル符号化と呼ばれる。スケーラブル符号化はその性質から、ビットレートの異なるネットワーク間の通信にも柔軟に対応できる。この特徴は、ＩＰプロトコルで多様なネットワークが統合されていく今後のネットワーク環境に適したものといえる。 In response to such conflicting demands, an approach that hierarchically integrates a plurality of encoding techniques is considered promising. Specifically, a first layer unit that encodes an input signal at a low bit rate with a model suitable for a speech signal, and a residual signal between the input signal and the first layer decoded signal is also suitable for a signal other than speech. A configuration is adopted in which the second layer parts encoded by the model are hierarchically combined. An encoding method having such a hierarchical structure is called scalable encoding because the bit stream obtained from the encoding unit has scalability (a decoded signal can be obtained even from partial information of the bit stream). Because of its nature, scalable coding can flexibly support communication between networks with different bit rates. This feature can be said to be suitable for a future network environment in which various networks are integrated by the IP protocol.

従来のスケーラブル符号化としては、例えば非特許文献１記載のものがある。この文献では、MPEG-4（Moving Picture Experts Group phase-4）で規格化された技術を用いてスケーラブル符号化を構成する方法について述べられている。具体的には、第１レイヤ部（基本レイヤ部）では、CELP（Code Excited Linear Prediction：符号励振線形予測）を用いて音声信号つまり原信号を符号化し、第２レイヤ部（拡張レイヤ部）では、例えばAAC（Advanced Audio Coder）やTwinVQ（Transform Domain Weighted Interleave Vector Quantization：周波数領域重み付きインターリーブベクトル量子化）のような変換符号化を用いて残差信号を符号化する。ここで、残差信号は、第１レイヤ部で得られた符号化コードを復号したもの（第１レイヤ復号信号）を原信号から減算することにより得られる信号である。
三木弼一編著、「MPEG-4の全て」、初版、（株）工業調査会、１９９８年９月３０日、ｐ．１２６−１２７ As a conventional scalable coding, there exists a thing of a nonpatent literature 1, for example. This document describes a method for configuring scalable coding using a technique standardized by MPEG-4 (Moving Picture Experts Group phase-4). Specifically, in the first layer unit (base layer unit), a speech signal, that is, an original signal is encoded using CELP (Code Excited Linear Prediction), and in the second layer unit (enhancement layer unit) For example, the residual signal is encoded using transform coding such as AAC (Advanced Audio Coder) or TwinVQ (Transform Domain Weighted Interleave Vector Quantization). Here, the residual signal is a signal obtained by subtracting, from the original signal, a decoded version of the encoded code obtained in the first layer unit (first layer decoded signal).
Edited by Junichi Miki, “All about MPEG-4”, first edition, Industrial Research Co., Ltd., September 30, 1998, p. 126-127

しかしながら、上記従来技術においては、第２レイヤ部での変換符号化は、原信号から第１レイヤ復号信号を減じて得られる残差信号に対して行われる。よって、原信号に含まれる主要な情報の一部が、第１レイヤ部を介することにより取り除かれることがある。この場合、残差信号の特性が、雑音系列に近い特性となる。したがって、例えばAACやTwinVQのように楽音信号を効率的に符号化するよう設計された変換符号化を、第２レイヤ部に用いる場合、上記特性を持つ残差信号を符号化して復号信号の高品質化を図るには、多くのビットを配分する必要がある。その結果、ビットレートが大きくなってしまうという問題があった。 However, in the above prior art, the transform coding in the second layer unit is performed on the residual signal obtained by subtracting the first layer decoded signal from the original signal. Therefore, some of the main information included in the original signal may be removed by passing through the first layer unit. In this case, the characteristic of the residual signal is a characteristic close to a noise sequence. Therefore, for example, when transform coding designed to efficiently encode a musical tone signal, such as AAC or TwinVQ, is used for the second layer portion, the residual signal having the above characteristics is encoded to increase the decoding signal. In order to improve quality, it is necessary to allocate many bits. As a result, there is a problem that the bit rate is increased.

本発明の目的は、かかる点に鑑みてなされたものであり、第２レイヤ部またはそれよりも上位のレイヤ部で低ビットレートの符号化を行っても高品質な復号信号を得ることができる符号化装置及び符号化方法を提供することである。 The object of the present invention has been made in view of this point, and a high-quality decoded signal can be obtained even if low bit rate encoding is performed in the second layer unit or higher layer unit. to provide a coding apparatus and coding how.

本発明の符号化装置は、原信号から、低周波帯域の符号化情報と高周波帯域の符号化情報とを生成する符号化装置であって、前記低周波帯域の符号化情報の復号信号から低周波帯域の第１スペクトルを算出する第１スペクトル算出手段と、前記原信号から第２スペクトルを算出する第２スペクトル算出手段と、前記第１スペクトルを内部状態として有するフィルタを用いて、前記フィルタの特性を示すパラメータを、前記第１スペクトルと前記第２スペクトルの高周波帯域部との類似具合を示す第１パラメータとして出力する第１パラメータ算出手段と、スペクトル残差の候補を複数記録しているスペクトル残差形状符号帳の中から一つのスペクトル残差の候補の符号を、前記第１スペクトルと前記第２スペクトルの高周波帯域部との変動成分を示す第２パラメータとして出力する第２パラメータ算出手段と、前記出力される第１パラメータおよび第２パラメータの中から、前記第２スペクトルの高周波帯域部と最も類似する推定値を生成する前記第１パラメータと前記第２パラメータを同時に決定する決定手段と、前記決定された第１パラメータと第２パラメータとを前記高周波帯域の符号化情報として符号化する符号化手段と、を有する構成を採る。 An encoding apparatus according to the present invention is an encoding apparatus that generates low frequency band encoded information and high frequency band encoded information from an original signal, wherein the low frequency band encoded information is reduced from the decoded signal. A first spectrum calculating means for calculating a first spectrum of a frequency band; a second spectrum calculating means for calculating a second spectrum from the original signal; and a filter having the first spectrum as an internal state. A first parameter calculating means for outputting a parameter indicating a characteristic as a first parameter indicating the degree of similarity between the first spectrum and the high frequency band portion of the second spectrum; and a spectrum in which a plurality of spectral residual candidates are recorded. the sign of the candidate of one spectral residuals from the residual shape codebook, variations formed between the first spectrum and the second spectrum of the high frequency band portion A second parameter calculating means for outputting as a second parameter indicating, among the first parameter and the second parameter is the output, the first to produce an estimate of the most similar to the high frequency band portion of the second spectrum A configuration is adopted that includes determination means for simultaneously determining a parameter and the second parameter, and encoding means for encoding the determined first parameter and second parameter as encoding information of the high frequency band.

本発明の符号化方法は、原信号から、低周波帯域の符号化情報と高周波帯域の符号化情報とを生成する符号化方法であって、前記低周波帯域の符号化情報の復号信号から低周波帯域の第１スペクトルを算出する第１スペクトル算出ステップと、前記原信号から第２スペクトルを算出する第２スペクトル算出ステップと、前記第１スペクトルを内部状態として有するフィルタを用いて、前記フィルタの特性を示すパラメータを、前記第１スペクトルと前記第２スペクトルの高周波帯域部との類似具合を示す第１パラメータとして算出する第１パラメータ算出ステップと、スペクトル残差の候補を複数記録しているスペクトル残差形状符号帳の中から一つのスペクトル残差の候補の符号を、前記第１スペクトルと前記第２スペクトルの高周波帯域部との変動成分を示す第２パラメータとして算出する第２パラメータ算出ステップと、前記算出された第１パラメータおよび第２パラメータの中から、前記第２スペクトルの高周波帯域部と最も類似する推定値を生成する前記第１パラメータと前記第２パラメータを同時に決定する決定ステップと、前記決定された第１パラメータと第２パラメータとを前記高周波帯域の符号化情報として符号化する符号化ステップと、を有するようにした。 The encoding method of the present invention is an encoding method for generating low-frequency band encoded information and high-frequency band encoded information from an original signal, wherein the low-frequency band encoded information is decoded from the decoded signal. A first spectrum calculating step for calculating a first spectrum of a frequency band; a second spectrum calculating step for calculating a second spectrum from the original signal; and a filter having the first spectrum as an internal state. A first parameter calculating step for calculating a parameter indicating characteristics as a first parameter indicating the degree of similarity between the first spectrum and the high frequency band portion of the second spectrum, and a spectrum in which a plurality of spectral residual candidates are recorded the sign of the candidate of one spectral residuals from the residual shape codebook, the first spectrum and the second spectrum of a high frequency band Generating a second parameter calculating step of calculating a second parameter indicating a fluctuation component, from the first parameter and the second parameter the calculated, the estimated value most similar to the high frequency band portion of the second spectrum and A determination step for simultaneously determining the first parameter and the second parameter, and an encoding step for encoding the determined first parameter and the second parameter as encoding information of the high frequency band. I made it.

本発明によれば、第２レイヤ部またはそれよりも上位のレイヤ部で低ビットレートの符号化を行っても高品質な復号信号を得ることができる。 According to the present invention, a high-quality decoded signal can be obtained even when low bit rate encoding is performed in the second layer unit or higher layer unit.

本発明は、スケーラブル符号化の上位レイヤに適した変換符号化に関し、より具体的には、当該変換符号化におけるスペクトルの効率的な符号化法に関する。 The present invention relates to transform coding suitable for an upper layer of scalable coding, and more specifically to an efficient spectrum coding method in the transform coding.

その主な特徴の１つは、第１レイヤ復号信号を周波数分析して得られるスペクトル（第１レイヤ復号スペクトル）を内部状態（フィルタ状態）として持つフィルタを用いてフィルタリング処理を行い、その出力信号を原スペクトルの高域部の推定値とする。ここで、原スペクトルとは、遅延調整された原信号を周波数分析して得られるスペクトルのことである。そして、原スペクトルの高域部に最も類似する出力信号を生成するときのフィルタ情報を符号化して復号化部へ伝送する。フィルタ情報のみを符号化すれば良いため、低ビットレート化が図れる。 One of the main features is that filtering processing is performed using a filter having a spectrum (first layer decoded spectrum) obtained by frequency analysis of the first layer decoded signal as an internal state (filter state), and an output signal thereof Is the estimated value of the high frequency part of the original spectrum. Here, the original spectrum is a spectrum obtained by frequency analysis of the delay-adjusted original signal. Then, the filter information for generating the output signal most similar to the high frequency part of the original spectrum is encoded and transmitted to the decoding unit. Since only the filter information needs to be encoded, the bit rate can be reduced.

本発明のある実施の形態では、スペクトル残差の候補が複数記録されているスペクトル残差形状符号帳を用いて、前述のフィルタにスペクトル残差を与えてフィルタリング処理を行う。また、他の実施の形態では、第１レイヤ復号スペクトルをフィルタの内部状態に
格納する前に第１レイヤ復号スペクトルの誤差成分を符号化して、第１レイヤ復号スペクトルの品質を向上させてから、フィルタリング処理による原スペクトルの高域部の推定を行う。また、さらに他の実施の形態では、第１レイヤ復号スペクトルの誤差成分を符号化する際に、第１レイヤ復号スペクトルの符号化の性能と第１レイヤ復号スペクトルを使った高域スペクトルの推定の性能とがいずれも高くなるように第１レイヤ復号スペクトルの誤差成分の符号化を行う。 In an embodiment of the present invention, a filtering process is performed by giving a spectral residual to the aforementioned filter using a spectral residual shape codebook in which a plurality of spectral residual candidates are recorded. In another embodiment, the error component of the first layer decoded spectrum is encoded before storing the first layer decoded spectrum in the internal state of the filter to improve the quality of the first layer decoded spectrum. The high-frequency part of the original spectrum is estimated by filtering processing. In still another embodiment, when the error component of the first layer decoded spectrum is encoded, the encoding performance of the first layer decoded spectrum and the estimation of the high frequency spectrum using the first layer decoded spectrum are performed. The error component of the first layer decoded spectrum is encoded so that both performances are high.

以下、本発明の実施の形態について、添付図面を参照して詳細に説明する。なお、各実施の形態では、複数のレイヤからなる階層構造を有するスケーラブル符号化を行う。また、各実施の形態では、一例として、（１）スケーラブル符号化の階層構造は、第１レイヤ（基本レイヤまたは下位レイヤ）と第１レイヤより上位にある第２レイヤ（拡張レイヤまたは上位レイヤ）の２階層とする、（２）第２レイヤの符号化では、周波数領域で符号化（変換符号化）を行う、（３）第２レイヤの符号化における変換方式にはＭＤＣＴ（Modified Discrete Cosine Transform；変形離散コサイン変換）を使用する、（４）第２レイヤの符号化では、全帯域を複数のサブバンドに分割する場合は、全帯域をＢａｒｋスケールで等間隔に分割し各サブバンドを各臨界帯域に対応付ける、（５）第１レイヤの入力信号のサンプリングレート（Ｆ１）と第２レイヤの入力信号のサンプリングレート（Ｆ２）には、Ｆ２はＦ１以上（Ｆ１≦Ｆ２）の関係がある、ものとする。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. In each embodiment, scalable coding having a hierarchical structure composed of a plurality of layers is performed. In each embodiment, as an example, (1) the hierarchical structure of scalable coding includes a first layer (basic layer or lower layer) and a second layer (an enhancement layer or upper layer) that is higher than the first layer. (2) In the second layer encoding, encoding (transform encoding) is performed in the frequency domain. (3) The transform method in the second layer encoding is MDCT (Modified Discrete Cosine Transform). (4) In the encoding of the second layer, when dividing the entire band into a plurality of subbands, the entire band is divided into equal intervals on the Bark scale and each subband is divided into each subband. (5) F2 is greater than or equal to F1 (F1 ≦ F2) for the sampling rate (F1) of the input signal of the first layer and the sampling rate (F2) of the input signal of the second layer that correspond to the critical band It is assumed that there is a relationship.

（実施の形態１）
図１は、例えば音声符号化装置等を形成する符号化装置１００の構成を示すブロック図である。符号化装置１００は、ダウンサンプリング部１０１、第１レイヤ符号化部１０２、第１レイヤ復号化部１０３、多重化部１０４、第２レイヤ符号化部１０５および遅延部１０６を有する。 (Embodiment 1)
FIG. 1 is a block diagram illustrating a configuration of an encoding apparatus 100 that forms, for example, a speech encoding apparatus. The encoding apparatus 100 includes a downsampling unit 101, a first layer encoding unit 102, a first layer decoding unit 103, a multiplexing unit 104, a second layer encoding unit 105, and a delay unit 106.

図１において、サンプリングレートがＦ２の音声信号やオーディオ信号（原信号）はダウンサンプリング部１０１に与えられ、ダウンサンプリング部１０１にてサンプリング変換処理が行われ、サンプリングレートがＦ１の信号が生成され、第１レイヤ符号化部１０２に与えられる。第１レイヤ符号化部１０２は、サンプリングレートがＦ１の信号を符号化して得られる符号化コードを第１レイヤ復号化部１０３および多重化部１０４に出力する。 In FIG. 1, an audio signal or an audio signal (original signal) having a sampling rate of F2 is given to the downsampling unit 101, and sampling conversion processing is performed in the downsampling unit 101 to generate a signal having a sampling rate of F1, The signal is provided to first layer encoding section 102. First layer encoding section 102 outputs the encoded code obtained by encoding the signal with sampling rate F1 to first layer decoding section 103 and multiplexing section 104.

第１レイヤ復号化部１０３は、第１レイヤ符号化部１０２から出力された符号化コードから第１レイヤ復号信号を生成して第２レイヤ符号化部１０５に出力する。 First layer decoding section 103 generates a first layer decoded signal from the encoded code output from first layer encoding section 102 and outputs the first layer decoded signal to second layer encoding section 105.

遅延部１０６は、原信号に対して所定の長さの遅延を与えて第２レイヤ符号化部１０５に出力する。この遅延は、ダウンサンプリング部１０１、第１レイヤ符号化部１０２および第１レイヤ復号化部１０３で生じる時間遅れを調整するためのものである。 Delay section 106 gives a delay of a predetermined length to the original signal and outputs the delayed signal to second layer encoding section 105. This delay is for adjusting a time delay generated in the downsampling unit 101, the first layer encoding unit 102, and the first layer decoding unit 103.

第２レイヤ符号化部１０５は、遅延部１０６から出力された原信号を、第１レイヤ復号化部１０３から出力された第１レイヤ復号信号を用いて符号化を行う。そして、この符号化により得られる符号化コードを多重化部１０４に出力する。 Second layer encoding section 105 encodes the original signal output from delay section 106 using the first layer decoded signal output from first layer decoding section 103. Then, the encoded code obtained by this encoding is output to multiplexing section 104.

多重化部１０４は、第１レイヤ符号化部１０２から出力された符号化コードと第２レイヤ符号化部１０５から出力された符号化コードとを多重化し、ビットストリームとして出力する。 The multiplexing unit 104 multiplexes the encoded code output from the first layer encoding unit 102 and the encoded code output from the second layer encoding unit 105, and outputs it as a bit stream.

次いで、第２レイヤ符号化部１０５についてより詳細に説明する。第２レイヤ符号化部１０５の構成を図２に示す。第２レイヤ符号化部１０５は、周波数領域変換部２０１、拡張帯域符号化部２０２、周波数領域変換部２０３および聴覚マスキング算出部２０４を有
する。 Next, second layer encoding section 105 will be described in more detail. The configuration of second layer encoding section 105 is shown in FIG. Second layer encoding section 105 includes frequency domain conversion section 201, extension band encoding section 202, frequency domain conversion section 203, and auditory masking calculation section 204.

図２において、周波数領域変換部２０１は、第１レイヤ復号化部１０３から出力された第１レイヤ復号信号をＭＤＣＴ変換により周波数分析してＭＤＣＴ係数（第１レイヤ復号スペクトル）を算出する。そして、第１レイヤ復号スペクトルを拡張帯域符号化部２０２に出力する。 In FIG. 2, the frequency domain transform unit 201 performs frequency analysis on the first layer decoded signal output from the first layer decoding unit 103 by MDCT transform to calculate MDCT coefficients (first layer decoded spectrum). Then, the first layer decoded spectrum is output to extended band coding section 202.

周波数領域変換部２０３は、遅延部１０６から出力された原信号をＭＤＣＴ変換により周波数分析してＭＤＣＴ係数（原スペクトル）を算出する。そして、原スペクトルを拡張帯域符号化部２０２に出力する。 The frequency domain transform unit 203 performs frequency analysis on the original signal output from the delay unit 106 by MDCT transform to calculate MDCT coefficients (original spectrum). Then, the original spectrum is output to extended band coding section 202.

聴覚マスキング算出部２０４は、遅延部１０６から出力された原信号を用いて、帯域毎の聴覚マスキングを算出し、この聴覚マスキングを拡張帯域符号化部２０２に通知する。 The auditory masking calculation unit 204 calculates auditory masking for each band using the original signal output from the delay unit 106 and notifies the extended band encoding unit 202 of the auditory masking.

ここで、人間の聴覚特性には、ある信号が聞こえているときにその信号と周波数の近い音が耳に入ってきてもその音が聞こえにくい、という聴覚マスキング特性がある。上記聴覚マスキングは効率的なスペクトル符号化を実現するために用いられる。このスペクトル符号化では、人間の聴覚マスキング特性を利用して聴感上許容される量子化歪を定量化し、その許容される量子化歪に応じた符号化法を適用する。 Here, human auditory characteristics include an auditory masking characteristic that when a signal is heard, even if a sound having a frequency close to that of the signal enters the ear, the sound is difficult to hear. The auditory masking is used to realize efficient spectral coding. In this spectral coding, quantization distortion that is permissible for auditory perception is quantified using human auditory masking characteristics, and a coding method corresponding to the permissible quantization distortion is applied.

拡張帯域符号化部２０２は、図３に示すように、振幅調整部３０１、フィルタ状態設定部３０２、フィルタリング部３０３、ラグ設定部３０４、スペクトル残差形状符号帳３０５、探索部３０６、スペクトル残差ゲイン符号帳３０７、乗算器３０８、拡張スペクトル復号化部３０９およびスケールファクタ符号化部３１０を有する。 As shown in FIG. 3, the extension band encoding unit 202 includes an amplitude adjustment unit 301, a filter state setting unit 302, a filtering unit 303, a lag setting unit 304, a spectrum residual shape codebook 305, a search unit 306, and a spectrum residual. A gain codebook 307, a multiplier 308, an extended spectrum decoding unit 309, and a scale factor encoding unit 310 are included.

振幅調整部３０１には、周波数領域変換部２０１から第１レイヤ復号スペクトル｛Ｓ１（ｋ）；０≦ｋ＜Ｎｎ｝、周波数領域変換部２０３から原スペクトル｛Ｓ２（ｋ）；０≦ｋ＜Ｎｗ｝が与えられる。ここで、第１レイヤ復号スペクトルのスペクトル点数をＮｎ、原スペクトルのスペクトル点数をＮｗと表し、Ｎｎ＜Ｎｗの関係がある。 The amplitude adjustment unit 301 includes the first layer decoded spectrum {S1 (k); 0 ≦ k <Nn} from the frequency domain transform unit 201, and the original spectrum {S2 (k); 0 ≦ k <Nw from the frequency domain transform unit 203. } Is given. Here, the spectrum score of the first layer decoded spectrum is expressed as Nn, the spectrum score of the original spectrum is expressed as Nw, and there is a relationship of Nn <Nw.

振幅調整部３０１は、第１レイヤ復号スペクトル｛Ｓ１（ｋ）；０≦ｋ＜Ｎｎ｝の最大振幅スペクトルと最小振幅スペクトルの比（ダイナミックレンジ）が、原スペクトルの高域部｛Ｓ２（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝のダイナミックレンジに近づくよう振幅調整を行う。具体的には、次の式（１）に示すように、振幅スペクトルのべき乗をとる。

The amplitude adjustment unit 301 is configured such that the ratio (dynamic range) between the maximum amplitude spectrum and the minimum amplitude spectrum of the first layer decoded spectrum {S1 (k); 0 ≦ k <Nn} is the high frequency portion {S2 (k) of the original spectrum. The amplitude is adjusted so as to approach the dynamic range of Nn ≦ k <Nw}. Specifically, as shown in the following equation (1), the power of the amplitude spectrum is taken.

ここで、sign()は正号／負号を返す関数、γは０≦γ≦１の範囲にある実数を表す。振幅調整部３０１は、振幅調整後の第１レイヤ復号スペクトルのダイナミックレンジが原スペクトルの高域部｛Ｓ２（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝のダイナミックレンジに最も近づくときのγ（振幅調整係数）を、あらかじめ用意された複数の候補の中から選択し、その符号化コードを多重化部１０４に出力する。 Here, sign () represents a function that returns a positive / negative sign, and γ represents a real number in the range of 0 ≦ γ ≦ 1. The amplitude adjustment unit 301 uses the γ (amplitude adjustment coefficient) when the dynamic range of the first layer decoded spectrum after amplitude adjustment is closest to the dynamic range of the high-frequency part {S2 (k); Nn ≦ k <Nw} of the original spectrum. ) Is selected from a plurality of candidates prepared in advance, and the encoded code is output to the multiplexing unit 104.

フィルタ状態設定部３０２は、振幅調整後の第１レイヤ復号スペクトル｛Ｓ１’（ｋ）；０≦ｋ＜Ｎｎ｝を後述するピッチフィルタの内部状態に設定する。具体的には、振幅調整後の第１レイヤ復号スペクトル｛Ｓ１’（ｋ）；０≦ｋ＜Ｎｎ｝を生成スペクトルバッファ｛Ｓ（ｋ）；０≦ｋ＜Ｎｎ｝に代入し、フィルタリング部３０３へ出力する。ここで、生成スペクトルバッファＳ（ｋ）は、０≦ｋ＜Ｎｗの範囲で定義される配列変数である
。後述するフィルタリング処理によって（Ｎｗ−Ｎｎ）点の原スペクトルの推定値（以下「推定原スペクトル」と言う）の候補が生成される。 The filter state setting unit 302 sets the first layer decoded spectrum {S1 ′ (k); 0 ≦ k <Nn} after amplitude adjustment to the internal state of the pitch filter described later. Specifically, the first layer decoded spectrum {S1 ′ (k); 0 ≦ k <Nn} after amplitude adjustment is substituted into the generated spectrum buffer {S (k); 0 ≦ k <Nn}, and the filtering unit 303 Output to. Here, the generated spectrum buffer S (k) is an array variable defined in the range of 0 ≦ k <Nw. A candidate of an estimated value (hereinafter referred to as “estimated original spectrum”) of the (Nw−Nn) point original spectrum is generated by a filtering process described later.

ラグ設定部３０４は、探索部３０６からの指示に従い、ラグＴを予め定められた探索範囲ＴＭＩＮ〜ＴＭＡＸの中で漸次的に少しずつ変化させながら、フィルタリング部３０３に順次出力する。 In accordance with an instruction from the search unit 306, the lag setting unit 304 sequentially outputs the lag T to the filtering unit 303 while gradually changing the lag T within a predetermined search range TMIN to TMAX.

スペクトル残差形状符号帳３０５は、複数のスペクトル残差形状ベクトルの候補を格納している。また、探索部３０６からの指示に従い、全ての候補の中から、または、あらかじめ限定された候補の中から、スペクトル残差形状ベクトルを順次出力する。 The spectral residual shape codebook 305 stores a plurality of spectral residual shape vector candidates. Further, in accordance with an instruction from the search unit 306, spectral residual shape vectors are sequentially output from all candidates or from candidates limited in advance.

同様に、スペクトル残差ゲイン符号帳３０７は複数のスペクトル残差ゲインの候補を格納している。また、探索部３０６からの指示に従い、全ての候補の中から、または、あらかじめ限定された候補の中から、スペクトル残差ゲインを順次出力する。 Similarly, the spectral residual gain codebook 307 stores a plurality of spectral residual gain candidates. Further, in accordance with an instruction from the search unit 306, the spectral residual gain is sequentially output from all candidates or from candidates limited in advance.

乗算器３０８は、スペクトル残差形状符号帳３０５から出力されるスペクトル残差形状ベクトルと、スペクトル残差ゲイン符号帳３０７から出力されるスペクトル残差ゲインと、を乗じて、スペクトル残差形状ベクトルをゲイン調整する。そして、ゲイン調整されたスペクトル残差形状ベクトルをフィルタリング部３０３に出力する。 The multiplier 308 multiplies the spectral residual shape vector output from the spectral residual shape codebook 305 by the spectral residual gain output from the spectral residual gain codebook 307 to obtain the spectral residual shape vector. Adjust the gain. The gain-adjusted spectral residual shape vector is output to the filtering unit 303.

フィルタリング部３０３は、フィルタ状態設定部３０２で設定されたピッチフィルタの内部状態と、ラグ設定部３０４から出力されるラグＴと、ゲイン調整されたスペクトル残差形状ベクトルとを用いてフィルタリング処理を行い、推定原スペクトルを算出する。ここで、ピッチフィルタの伝達関数は、次の式（２）で表される。また、このフィルタリング処理は、次の式（３）のように表される。

The filtering unit 303 performs a filtering process using the internal state of the pitch filter set by the filter state setting unit 302, the lag T output from the lag setting unit 304, and the spectral residual shape vector after gain adjustment. The estimated original spectrum is calculated. Here, the transfer function of the pitch filter is expressed by the following equation (2). Further, this filtering process is expressed as the following equation (3).

ここで、Ｃ（ｉ，ｋ）は第ｉ番目のスペクトル残差形状ベクトル、ｇ（ｊ）は第ｊ番目のスペクトル残差形状ゲインを表す。範囲Ｎｎ≦ｋ＜Ｎｗに含まれる生成スペクトルバッファＳ（ｋ）がフィルタリング部３０３の出力信号（つまり、推定原スペクトル）として探索部３０６に出力される。図４に、生成スペクトルバッファ、振幅調整後の第１レイヤ復号スペクトル、フィルタリング部３０３の出力信号の相互関係を示す。 Here, C (i, k) represents the i-th spectral residual shape vector, and g (j) represents the j-th spectral residual shape gain. The generated spectrum buffer S (k) included in the range Nn ≦ k <Nw is output to the search unit 306 as an output signal of the filtering unit 303 (that is, an estimated original spectrum). FIG. 4 shows the correlation among the generated spectrum buffer, the first layer decoded spectrum after amplitude adjustment, and the output signal of the filtering unit 303.

探索部３０６は、ラグ設定部３０４、スペクトル残差形状符号帳３０５およびスペクトル残差ゲイン符号帳３０７に、ラグ、スペクトル残差形状およびスペクトル残差ゲインの出力をそれぞれ指示する。 Search section 306 instructs lag setting section 304, spectral residual shape codebook 305, and spectral residual gain codebook 307 to output lag, spectral residual shape, and spectral residual gain, respectively.

また、探索部３０６は、原スペクトルの高域部｛Ｓ２（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝とフィルタリング部３０３の出力信号｛Ｓ（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝との間の歪Ｅを算出する。そして、合成による分析手法（ＡｂＳ；ＡｎａｌｙｓｉｓｂｙＳｙｎｔｈｅｓｉｓ）により、最も歪が小さくなるときのラグ、スペクトル残差形状ベクトルおよびスペクトル残差ゲインの組み合わせを決定する。このとき、聴覚マスキング算出部２０４から出力された聴覚マスキングを利用して、聴感的に最も歪の小さい組み合わせを選択する。この歪をＥとすると、歪Ｅは、例えば聴覚マスキングにより定まる重み関数ｗ（ｋ）を用いて式
（４）によって表される。ここで、重み関数ｗ（ｋ）は、聴覚マスキングの大きい（歪が聞こえ難い）周波数では小さな値をとり、聴覚マスキングの小さい（歪が聞こえ易い）周波数では大きな値をとる。

The search unit 306 also performs distortion E between the high-frequency part {S2 (k); Nn ≦ k <Nw} of the original spectrum and the output signal {S (k); Nn ≦ k <Nw} of the filtering unit 303. Is calculated. Then, a combination of a lag, a spectrum residual shape vector, and a spectrum residual gain when the distortion becomes the smallest is determined by an analysis method (AbS; Analysis by Synthesis) by synthesis. At this time, the auditory masking output from the auditory masking calculation unit 204 is used to select the combination with the smallest audible distortion. Assuming that this distortion is E, the distortion E is expressed by Expression (4) using a weight function w (k) determined by auditory masking, for example. Here, the weighting function w (k) takes a small value at a frequency where auditory masking is large (distortion is difficult to hear) and takes a large value at a frequency where auditory masking is small (distortion is easy to hear).

探索部３０６により決定されたラグの符号化コード、スペクトル残差形状ベクトルの符号化コードおよびスペクトル残差ゲインの符号化コードは、多重化部１０４および拡張スペクトル復号化部３０９に出力される。 The lag encoding code, spectral residual shape vector encoding code, and spectral residual gain encoding code determined by search section 306 are output to multiplexing section 104 and extended spectrum decoding section 309.

上記のＡｂＳによる符号化コード決定法においては、ラグ、スペクトル残差形状ベクトルおよびスペクトル残差ゲインを同時に決定しても良いし、あるいは、演算量を削減するために各パラメータを順次（例えば、ラグ、スペクトル残差形状ベクトル、スペクトル残差ゲインの順に）決定しても良い。 In the above-described coding code determination method using AbS, the lag, the spectral residual shape vector, and the spectral residual gain may be determined at the same time, or each parameter is sequentially set (for example, the lag is reduced in order to reduce the calculation amount). , In the order of spectral residual shape vector and spectral residual gain).

拡張スペクトル復号化部３０９は、振幅調整部３０１より出力される振幅調整係数の符号化コードならびに探索部３０６より出力されるラグの符号化コード、スペクトル残差形状ベクトルの符号化コードおよびスペクトル残差ゲインの符号化コードを復号し、原スペクトルの推定値（推定原スペクトル）を生成する。 The extended spectrum decoding unit 309 includes an amplitude adjustment coefficient encoding code output from the amplitude adjustment unit 301, a lag encoding code output from the search unit 306, a spectral residual shape vector encoding code, and a spectral residual. The encoded code of the gain is decoded, and an estimated value (estimated original spectrum) of the original spectrum is generated.

具体的には、まず、復号された振幅調整係数γを用いて前述の式（１）に従い第１レイヤ復号スペクトル｛Ｓ１（ｋ）；０≦ｋ＜Ｎｎ｝の振幅調整を行う。次に、振幅調整された第１レイヤ復号スペクトルをフィルタの内部状態として用いるとともに、それぞれ復号されたラグ、スペクトル残差形状ベクトルおよびスペクトル残差ゲインを用いて、前述の式（３）に従ってフィルタリング処理を行い、推定原スペクトル｛Ｓ（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝を生成する。生成された推定原スペクトルはスケールファクタ符号化部３１０に出力される。 Specifically, first, amplitude adjustment of the first layer decoded spectrum {S1 (k); 0 ≦ k <Nn} is performed using the decoded amplitude adjustment coefficient γ according to the above-described equation (1). Next, the amplitude-adjusted first layer decoded spectrum is used as an internal state of the filter, and filtering processing is performed according to the above-described equation (3) using the decoded lag, spectral residual shape vector, and spectral residual gain, respectively. To generate an estimated original spectrum {S (k); Nn ≦ k <Nw}. The generated estimated original spectrum is output to scale factor encoding section 310.

スケールファクタ符号化部３１０は、周波数領域変換部２０３より出力される原スペクトルの高域部｛Ｓ２（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝と拡張スペクトル復号化部３０９より出力される推定原スペクトル｛Ｓ（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝とを用いて、聴覚マスキングを利用して聴感上最も適した推定原スペクトルのスケールファクタ（スケーリング係数）を符号化し、その符号化コードを多重化部１０４に出力する。 The scale factor encoding unit 310 includes a high-frequency part {S2 (k); Nn ≦ k <Nw} of the original spectrum output from the frequency domain conversion unit 203 and an estimated original spectrum {output from the extended spectrum decoding unit 309 { Using S (k); Nn ≦ k <Nw}, the scale factor (scaling coefficient) of the estimated original spectrum most suitable for hearing is encoded using auditory masking, and the encoded code is multiplexed by the multiplexing unit 104. Output to.

すなわち、第２レイヤ符号化コードは、振幅調整部３０１から出力される符号化コード（振幅調整係数）、探索部３０６から出力される符号化コード（ラグ、スペクトル残差形状ベクトル、スペクトル残差ゲイン）およびスケールファクタ符号化部３１０から出力される符号化コード（スケールファクタ）の組み合わせからなる。 That is, the second layer encoded code includes an encoded code (amplitude adjustment coefficient) output from the amplitude adjustment unit 301, and an encoded code (lag, spectrum residual shape vector, spectrum residual gain) output from the search unit 306. ) And the encoding code (scale factor) output from the scale factor encoding unit 310.

なお、本実施の形態では、帯域Ｎｎ〜Ｎｗに対して拡張帯域符号化部２０２を適用して一組の符号化コード（振幅調整係数、ラグ、スペクトル残差形状ベクトル、スペクトル残差ゲイン、スケールファクタ）を決定する構成について説明しているが、帯域Ｎｎ〜Ｎｗを複数の帯域に分割し各帯域に対して拡張帯域符号化部２０２を適用する構成にしても良い。この場合、帯域毎に符号化コード（振幅調整係数、ラグ、スペクトル残差形状ベクトル、スペクトル残差ゲイン、スケールファクタ）を決定し、多重化部１０４に出力することになる。例えば、帯域Ｎｎ〜ＮｗをＭ個の帯域に分割して各帯域で拡張帯域符号化部２０２を適用すると、Ｍ組の符号化コード（振幅調整係数、ラグ、スペクトル残差形状ベク
トル、スペクトル残差ゲイン、スケールファクタ）が得られることになる。 In the present embodiment, the extended band coding unit 202 is applied to the bands Nn to Nw, and a set of coded codes (amplitude adjustment coefficient, lag, spectrum residual shape vector, spectrum residual gain, scale) However, a configuration may be adopted in which the bands Nn to Nw are divided into a plurality of bands and the extended band encoding unit 202 is applied to each band. In this case, an encoding code (amplitude adjustment coefficient, lag, spectral residual shape vector, spectral residual gain, scale factor) is determined for each band and is output to the multiplexing unit 104. For example, when the bands Nn to Nw are divided into M bands and the extended band encoding unit 202 is applied to each band, M sets of encoded codes (amplitude adjustment coefficient, lag, spectrum residual shape vector, spectrum residual) Gain, scale factor).

また、複数帯域でそれぞれ独立の符号化コードを送らずに、隣接する帯域同士で一部の符号化コードを共有しても良い。例えば、帯域Ｎｎ〜ＮｗをＭ個の帯域に分割し、隣接する２つの帯域で共通の振幅調整係数を用いる場合、振幅調整係数の符号化コードの数はＭ／２個となり、それ以外の符号化コードの数はそれぞれＭ個となる。 Further, a part of the encoded codes may be shared between adjacent bands without sending independent encoded codes in a plurality of bands. For example, when the bands Nn to Nw are divided into M bands and a common amplitude adjustment coefficient is used in two adjacent bands, the number of encoded codes of the amplitude adjustment coefficient is M / 2, and the other codes The number of conversion codes is M.

なお、本実施の形態は１次のＡＲ型ピッチフィルタを用いた場合について説明してきた。しかしながら、本発明が適用できるフィルタは１次のＡＲ型ピッチフィルタに限定されず、伝達関数が次の式（５）で表されるフィルタにも本発明を適用することができる。フィルタ次数を規定するパラメータＬおよびＭの大きいピッチフィルタを用いるほど多様な特性を表現でき、品質が向上する可能性がある。ただし、次数が大きくなるほどフィルタ係数の符号化ビットを多く配分する必要が出てくるため、実用的なビット配分の元で適切なピッチフィルタの伝達関数を決めておく必要がある。

In the present embodiment, the case where a primary AR type pitch filter is used has been described. However, a filter to which the present invention can be applied is not limited to a first-order AR type pitch filter, and the present invention can also be applied to a filter whose transfer function is expressed by the following equation (5). The more the pitch filter having the parameters L and M that define the filter order is used, the more various characteristics can be expressed and the quality may be improved. However, since it becomes necessary to allocate more encoded bits of the filter coefficient as the order increases, it is necessary to determine an appropriate pitch filter transfer function based on practical bit allocation.

なお、本実施の形態では聴覚マスキングを用いることを前提としているが、聴覚マスキングを用いない構成であっても良い。その場合、図２の聴覚マスキング算出部２０４を第２レイヤ符号化部１０５に設ける必要が無くなり、装置全体の演算量を削減できる。 In this embodiment, it is assumed that auditory masking is used, but a configuration that does not use auditory masking may be used. In this case, it is not necessary to provide the auditory masking calculation unit 204 of FIG. 2 in the second layer encoding unit 105, and the calculation amount of the entire apparatus can be reduced.

ここで、多重化部１０４から出力されるビットストリームの構成について、図５を用いて説明する。ビットストリームのＭＳＢ（Most Significant Bit）から順に、第１レイヤ符号化コード、第２レイヤ符号化コードが格納されている。さらに、第２レイヤ符号化コードは、スケールファクタ、振幅調整係数、ラグ、スペクトル残差ゲイン、スペクトル残差形状ベクトルの順に格納され、後者の情報ほどＬＳＢ（Least Significant Bit）に近い位置に配置されている。このビットストリームの構成は、各符号化コードの符号欠落に対する感度（符号化コードが欠落したときにどの程度復号信号の品質を劣化させるか）に対して、符号誤り感度の高い（大きく劣化する）ものほどＭＳＢに近い位置に配置されている。この構成によれば、伝送路上でビットストリームを部分的に破棄する場合にＬＳＢから順に破棄することで、破棄による劣化を最小限に抑えることができる。ＬＳＢ側から優先的にビットストリームを破棄するネットワーク構成の一例では、図５のように区切られた各符号化コードを別々のパケットで伝送し、各パケットに優先順位付けをして、優先制御のできるパケット網を使う構成が挙げられる。ただし、ネットワーク構成は前述のものに限定されない。 Here, the configuration of the bit stream output from the multiplexing unit 104 will be described with reference to FIG. A first layer encoded code and a second layer encoded code are stored sequentially from the MSB (Most Significant Bit) of the bit stream. Further, the second layer encoded code is stored in the order of scale factor, amplitude adjustment coefficient, lag, spectral residual gain, spectral residual shape vector, and the latter information is arranged at a position closer to LSB (Least Significant Bit). ing. The configuration of this bit stream is high in code error sensitivity (deteriorates greatly) with respect to the sensitivity to code loss of each encoded code (how much the quality of the decoded signal is degraded when the encoded code is lost). The closer one is to the MSB. According to this configuration, when the bit stream is partially discarded on the transmission path, the degradation due to the discarding can be minimized by discarding sequentially from the LSB. In an example of a network configuration in which the bit stream is discarded preferentially from the LSB side, each encoded code divided as shown in FIG. 5 is transmitted in separate packets, prioritized for each packet, A configuration that uses a packet network that can be used. However, the network configuration is not limited to that described above.

また、図５のように符号誤り感度の高い符号化パラメータほどＭＳＢに近い位置に配置されるビットストリーム構成において、ＭＳＢに近いビットほど強い誤り検出・誤り訂正がかけられるようなチャネル符号化を適用すれば、復号品質の劣化を最小限に抑えられるという効果が得られる。例えば、誤り検出、誤り訂正の手法としてはＣＲＣ符号やＲＳ符号などが適用できる。 In addition, in the bit stream configuration in which the coding parameters with higher code error sensitivity are arranged closer to the MSB as shown in FIG. 5, channel coding is applied such that stronger error detection and error correction is applied to the bits closer to the MSB. By doing so, it is possible to obtain an effect that degradation of decoding quality can be minimized. For example, a CRC code, an RS code, or the like can be applied as a method for error detection and error correction.

図６は、例えば音声復号化装置等を形成する復号化装置６００の構成を示すブロック図である。 FIG. 6 is a block diagram illustrating a configuration of a decoding device 600 that forms, for example, a speech decoding device.

復号化装置６００は、符号化装置１００から出力されたビットストリームを第１レイヤ符号化コードと第２レイヤ符号化コードとに分離する分離部６０１、第１レイヤ符号化コードを復号する第１レイヤ復号化部６０２および第２レイヤ符号化コードを復号する第２レイヤ復号化部６０３を有する。 Decoding apparatus 600 includes a separation unit 601 that separates the bitstream output from encoding apparatus 100 into a first layer encoded code and a second layer encoded code, and a first layer that decodes the first layer encoded code. It has the decoding part 602 and the 2nd layer decoding part 603 which decodes a 2nd layer encoding code.

分離部６０１は、符号化装置１００から送出されたビットストリームを受信し、第１レイヤの符号化コードと第２レイヤの符号化コードとに分離し、第１レイヤ復号化部６０２と第２レイヤ復号化部６０３にそれぞれ出力する。 Separating section 601 receives the bitstream sent from encoding apparatus 100, separates it into a first layer encoded code and a second layer encoded code, and first layer decoding section 602 and second layer The data are output to the decryption unit 603, respectively.

第１レイヤ復号化部６０２は、第１レイヤ符号化コードから第１レイヤ復号信号を生成して、第２レイヤ復号化部６０３に出力する。また、生成された第１レイヤ復号信号を、必要に応じて、最低限の品質が担保された復号信号（第１レイヤ復号信号）として出力する。 First layer decoding section 602 generates a first layer decoded signal from the first layer encoded code, and outputs the first layer decoded signal to second layer decoding section 603. In addition, the generated first layer decoded signal is output as a decoded signal (first layer decoded signal) with guaranteed minimum quality, if necessary.

第２レイヤ復号化部６０３は、第１レイヤ復号信号と第２レイヤ符号化コードとを用いて、高品質の復号信号（ここでは、第２レイヤ復号信号と称す）を生成し、必要に応じてこの復号信号を出力する。 Second layer decoding section 603 generates a high-quality decoded signal (herein referred to as a second layer decoded signal) using the first layer decoded signal and the second layer encoded code, and if necessary, This decoded signal is output.

このように、第１レイヤ復号信号によって再生音声の最低限の品質が担保され、第２レイヤ復号信号によって再生音声の品質を高めることができる。また、出力する信号を第１レイヤ復号信号または第２レイヤ復号信号のどちらにするかは、ネットワーク環境（パケットロスの発生等）によって第２レイヤ符号化コードが得られるかどうか、または、アプリケーションやユーザの設定等に依存する。 In this way, the minimum quality of the reproduced sound is ensured by the first layer decoded signal, and the quality of the reproduced sound can be enhanced by the second layer decoded signal. In addition, whether the output signal is the first layer decoded signal or the second layer decoded signal depends on whether the second layer encoded code can be obtained depending on the network environment (occurrence of packet loss, etc.) Depends on user settings.

第２レイヤ復号化部６０３の構成を、図７を用いて詳細に行う。図７において、第２レイヤ復号化部６０３は、拡張帯域復号化部７０１、周波数領域変換部７０２および時間領域変換部７０３を有する。 The configuration of second layer decoding section 603 will be described in detail using FIG. In FIG. 7, second layer decoding section 603 has extended band decoding section 701, frequency domain transform section 702, and time domain transform section 703.

周波数領域変換部７０２は、第１レイヤ復号化部６０２から入力された第１レイヤ復号信号を周波数領域のパラメータ（例えばＭＤＣＴ係数など）に変換し、そのパラメータをスペクトル点数がＮｎの第１レイヤ復号スペクトルとして拡張帯域復号化部７０１に出力する。 Frequency domain transform section 702 transforms the first layer decoded signal input from first layer decoding section 602 into frequency domain parameters (for example, MDCT coefficients), and the parameters are subjected to first layer decoding with a spectrum score of Nn. The spectrum is output to extended band decoding section 701 as a spectrum.

拡張帯域復号化部７０１は、分離部６０１から入力された第２レイヤ符号化コード（この構成では拡張帯域符号化コードと同一）から各種パラメータ（振幅調整係数、ラグ、スペクトル残差形状ベクトル、スペクトル残差ゲイン、スケールファクタ）を復号する。また、復号された各種パラメータと周波数領域変換部７０２から出力された第１レイヤ復号スペクトルとを用いて帯域拡張された第２の復号スペクトルであってスペクトル点数がＮｗの第２のスペクトルを生成する。そして、第２の復号スペクトルを時間領域変換部７０３に出力する。 The extension band decoding unit 701 uses various parameters (amplitude adjustment coefficient, lag, spectrum residual shape vector, spectrum) from the second layer encoded code (in this configuration, the same as the extension band encoded code) input from the separation unit 601. (Residual gain, scale factor). Also, a second decoded spectrum that is band-extended using the various decoded parameters and the first layer decoded spectrum output from the frequency domain transform section 702 and having a spectrum score of Nw is generated. . Then, the second decoded spectrum is output to time domain transform section 703.

時間領域変換部７０３は、第２の復号スペクトルを時間領域の信号に変換した後、必要に応じて適切な窓掛けおよび重ね合わせ加算等の処理を行って、フレーム間に生じる不連続を回避し、第２レイヤ復号信号を出力する。 The time domain conversion unit 703 converts the second decoded spectrum into a time domain signal, and then performs appropriate processing such as windowing and superposition addition as necessary to avoid discontinuities between frames. The second layer decoded signal is output.

次に、拡張帯域復号化部７０１の詳細な説明を、図８を用いて行う。図８において、拡張帯域復号化部７０１は、分離部８０１、振幅調整部８０２、フィルタ状態設定部８０３、フィルタリング部８０４、スペクトル残差形状符号帳８０５、スペクトル残差ゲイン符号帳８０６、乗算器８０７、スケールファクタ復号化部８０８、スケーリング部８０９およびスペクトル合成部８１０を有する。 Next, detailed description of the extended band decoding unit 701 will be given with reference to FIG. In FIG. 8, the extended band decoding unit 701 includes a separation unit 801, an amplitude adjustment unit 802, a filter state setting unit 803, a filtering unit 804, a spectral residual shape codebook 805, a spectral residual gain codebook 806, and a multiplier 807. A scale factor decoding unit 808, a scaling unit 809, and a spectrum synthesis unit 810.

分離部８０１は、分離部６０１から入力される拡張帯域符号化コードを振幅調整係数符号化コード、ラグ符号化コード、残差形状符号化コード、残差ゲイン符号化コード、スケールファクタ符号化コード、に分離する。また、振幅調整係数符号化コードを振幅調整部８０２に、ラグ符号化コードをフィルタリング部８０４に、残差形状符号化コードをスペクトル残差形状符号帳８０５に、残差ゲイン符号化コードをスペクトル残差ゲイン符号帳８０６に、スケールファクタ符号化コードをスケールファクタ復号化部８０８に、それぞれ出力する。 The separation unit 801 converts the extension band encoded code input from the separation unit 601 into an amplitude adjustment coefficient encoded code, a lag encoded code, a residual shape encoded code, a residual gain encoded code, a scale factor encoded code, To separate. Also, the amplitude adjustment coefficient encoded code is stored in the amplitude adjuster 802, the lag encoded code is stored in the filtering unit 804, the residual shape encoded code is stored in the spectral residual shape codebook 805, and the residual gain encoded code is stored in the spectral residual. The scale factor encoded code is output to the difference gain codebook 806 to the scale factor decoding unit 808, respectively.

振幅調整部８０２は、分離部８０１から入力された振幅調整係数符号化コードを復号し、復号された振幅調整係数を用いて、別途周波数領域変換部７０２から入力された第１レイヤ復号スペクトルの振幅を調整し、振幅調整後の第１レイヤ復号スペクトルをフィルタ状態設定部８０３に出力する。振幅調整は、前述の式（１）で表される方法で行う。ここで、Ｓ１（ｋ）は第１レイヤ復号スペクトル、Ｓ１’（ｋ）は振幅調整後の第１レイヤ復号スペクトルを表す。 The amplitude adjustment unit 802 decodes the amplitude adjustment coefficient encoded code input from the separation unit 801, and uses the decoded amplitude adjustment coefficient to separately detect the amplitude of the first layer decoded spectrum input from the frequency domain transform unit 702. The first layer decoded spectrum after amplitude adjustment is output to the filter state setting section 803. The amplitude adjustment is performed by the method represented by the above-described equation (1). Here, S1 (k) represents the first layer decoded spectrum, and S1 '(k) represents the first layer decoded spectrum after amplitude adjustment.

フィルタ状態設定部８０３は、前述の式（２）で表される伝達関数のピッチフィルタのフィルタ状態に振幅調整後の第１レイヤ復号スペクトルを設定する。具体的には振幅調整後の第１レイヤ復号スペクトル｛Ｓ１’（ｋ）；０≦ｋ＜Ｎｎ｝を生成スペクトルバッファＳ（ｋ）に代入し、フィルタリング部８０４へ出力する。ここで、Ｔはピッチフィルタのラグである。また、生成スペクトルバッファＳ（ｋ）は、ｋ＝０〜Ｎｗ−１の範囲で定義される配列変数であり、本フィルタリング処理によって（Ｎｗ−Ｎｎ）点のスペクトルが生成される。 The filter state setting unit 803 sets the first layer decoded spectrum after amplitude adjustment to the filter state of the pitch filter of the transfer function represented by the above equation (2). Specifically, the amplitude-adjusted first layer decoded spectrum {S1 ′ (k); 0 ≦ k <Nn} is substituted into the generated spectrum buffer S (k) and output to the filtering unit 804. Here, T is a pitch filter lag. The generated spectrum buffer S (k) is an array variable defined in a range of k = 0 to Nw−1, and a spectrum of (Nw−Nn) points is generated by this filtering process.

フィルタリング部８０４は、分離部８０１から入力されたラグＴを復号し、復号されたラグＴを用いて、フィルタ状態設定部８０３から入力された生成スペクトルバッファＳ（ｋ）に対してフィルタリング処理を行う。具体的には、前述の式（３）に示される方法によって出力スペクトル｛Ｓ（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝を生成する。ここで、ｇ（ｊ）は残差ゲイン符号化コードｊにより表されるスペクトル残差ゲイン、Ｃ（ｉ，ｋ）は残差形状符号化コードｉにより表されるスペクトル残差形状ベクトルをそれぞれ示しており、ｇ（ｊ）・Ｃ（ｉ，ｋ）は乗算器８０７から入力される。生成されたフィルタリング部８０４の出力スペクトル｛Ｓ（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝はスケーリング部８０９へ出力される。 The filtering unit 804 decodes the lag T input from the separation unit 801, and performs a filtering process on the generated spectrum buffer S (k) input from the filter state setting unit 803 using the decoded lag T. . Specifically, the output spectrum {S (k); Nn ≦ k <Nw} is generated by the method shown in the above-described equation (3). Here, g (j) represents a spectral residual gain represented by a residual gain encoding code j, and C (i, k) represents a spectral residual shape vector represented by a residual shape encoding code i. G (j) · C (i, k) is input from the multiplier 807. The generated output spectrum {S (k); Nn ≦ k <Nw} of the filtering unit 804 is output to the scaling unit 809.

スペクトル残差形状符号帳８０５は、分離部８０１から入力された残差形状符号化コードを復号し、復号結果に対応するスペクトル残差形状ベクトルＣ（ｉ，ｋ）を乗算器８０７へ出力する。 The spectral residual shape codebook 805 decodes the residual shape encoded code input from the separation unit 801 and outputs a spectral residual shape vector C (i, k) corresponding to the decoding result to the multiplier 807.

スペクトル残差ゲイン符号帳８０６は、分離部８０１から入力された残差ゲイン符号化コードを復号し、復号結果に対応するスペクトル残差ゲインｇ（ｊ）を乗算器８０７へ出力する。 The spectral residual gain codebook 806 decodes the residual gain encoded code input from the separation unit 801 and outputs a spectral residual gain g (j) corresponding to the decoding result to the multiplier 807.

乗算器８０７は、スペクトル残差形状符号帳８０５から入力されたスペクトル残差形状ベクトルＣ（ｉ，ｋ）と、スペクトル残差ゲイン符号帳８０６から入力されたスペクトル残差ゲインｇ（ｊ）と、の乗算結果をフィルタリング部８０４へ出力する。 The multiplier 807 has a spectral residual shape vector C (i, k) input from the spectral residual shape codebook 805, a spectral residual gain g (j) input from the spectral residual gain codebook 806, and Is output to the filtering unit 804.

スケールファクタ復号化部８０８は、分離部８０１から入力されたスケールファクタ符号化コードを復号し、復号されたスケールファクタをスケーリング部８０９へ出力する。 The scale factor decoding unit 808 decodes the scale factor encoded code input from the separation unit 801 and outputs the decoded scale factor to the scaling unit 809.

スケーリング部８０９は、フィルタリング部８０４から与えられた出力スペクトル｛Ｓ（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝に、スケールファクタ復号化部８０８から入力されたスケール
ファクタを乗じて、その乗算結果をスペクトル合成部８１０に出力する。 The scaling unit 809 multiplies the output spectrum {S (k); Nn ≦ k <Nw} given from the filtering unit 804 by the scale factor inputted from the scale factor decoding unit 808, and spectrally synthesizes the multiplication result. Output to the unit 810.

スペクトル合成部８１０は、周波数領域変換部７０２より与えられる第１レイヤ復号スペクトル｛Ｓ１（ｋ）；０≦ｋ＜Ｎｎ｝と、スケーリング部８０９から出力されるスケーリング後の生成スペクトルバッファの高域部｛Ｓ（ｋ）；Ｎｎ≦ｋ＜Ｎｗ｝を結合して得られるスペクトルを第２の復号スペクトルとして時間領域変換部７０３に出力する。 Spectrum synthesis section 810 includes first layer decoded spectrum {S1 (k); 0 ≦ k <Nn} given from frequency domain transform section 702, and a high-frequency section of the generated spectrum buffer after scaling output from scaling section 809 A spectrum obtained by combining {S (k); Nn ≦ k <Nw} is output to the time domain transforming unit 703 as a second decoded spectrum.

（実施の形態２）
図９に、本発明の実施の形態２に係る第２レイヤ符号化部１０５の構成を示す。図９において図２と同一名称を持つブロックは、同様の機能を有するので、ここではその詳細な説明を省略する。図２と図９の違いは、周波数領域変換部２０１と拡張帯域符号化部２０２との間に第１スペクトル符号化部９０１が存在する点にある。第１スペクトル符号化部９０１は、周波数領域変換部２０１より出力される第１レイヤ復号スペクトルの品質を向上させ、そのときの符号化コード（第１のスペクトル符号化コード）を多重化部１０４に出力するとともに、品質向上された第１レイヤ復号スペクトル（第１の復号スペクトル）を拡張帯域符号化部２０２に与える。拡張帯域符号化部２０２は、前述の処理を第１の復号スペクトルを用いて行い、その結果として拡張帯域符号化コードを出力する。すなわち、本実施の形態の第２レイヤ符号化コードは、拡張帯域符号化コードと第１のスペクトル符号化コードとの組み合わせからなる。したがって、本実施の形態では、多重化部１０４は、第１レイヤ符号化コード、拡張帯域符号化コードおよび第１のスペクトル符号化コードを多重化して、ビットストリームを生成する。 (Embodiment 2)
FIG. 9 shows the configuration of second layer encoding section 105 according to Embodiment 2 of the present invention. In FIG. 9, blocks having the same names as those in FIG. 2 have the same functions, and therefore detailed description thereof is omitted here. The difference between FIG. 2 and FIG. 9 is that a first spectrum encoding unit 901 exists between the frequency domain transform unit 201 and the extended band encoding unit 202. First spectrum encoding section 901 improves the quality of the first layer decoded spectrum output from frequency domain transform section 201, and the encoded code (first spectrum encoded code) at that time is sent to multiplexing section 104. The first layer decoded spectrum (first decoded spectrum) with improved quality is output to the extended band coding unit 202. The extension band encoding unit 202 performs the above-described processing using the first decoded spectrum, and outputs an extension band encoded code as a result. That is, the second layer encoded code of the present embodiment is a combination of the extended band encoded code and the first spectrum encoded code. Therefore, in the present embodiment, multiplexing section 104 multiplexes the first layer encoded code, the extended band encoded code, and the first spectrum encoded code to generate a bit stream.

次に、第１スペクトル符号化部９０１の詳細を、図１０を用いて説明する。第１スペクトル符号化部９０１は、スケーリング係数符号化部１００１、スケーリング係数復号化部１００２、微細スペクトル符号化部１００３、多重化部１００４、微細スペクトル復号化部１００５、正規化部１００６、減算器１００７および加算器１００８を有する。 Next, details of the first spectrum encoding unit 901 will be described with reference to FIG. The first spectrum encoding unit 901 includes a scaling coefficient encoding unit 1001, a scaling coefficient decoding unit 1002, a fine spectrum encoding unit 1003, a multiplexing unit 1004, a fine spectrum decoding unit 1005, a normalization unit 1006, and a subtractor 1007. And an adder 1008.

減算器１００７は、原スペクトルから第１レイヤ復号スペクトルを減じて残差スペクトルを生成し、残差スペクトルをスケーリング係数符号化部１００１および正規化部１００６に出力する。スケーリング係数符号化部１００１は、残差スペクトルのスペクトル概形を表すスケーリング係数を算出し、当該スケーリング係数を符号化し、その符号化コードを多重化部１００４およびスケーリング係数復号化部１００２に出力する。 The subtractor 1007 generates a residual spectrum by subtracting the first layer decoded spectrum from the original spectrum, and outputs the residual spectrum to the scaling coefficient encoding unit 1001 and the normalizing unit 1006. The scaling coefficient encoding unit 1001 calculates a scaling coefficient representing the spectral outline of the residual spectrum, encodes the scaling coefficient, and outputs the encoded code to the multiplexing unit 1004 and the scaling coefficient decoding unit 1002.

スケーリング係数の符号化においては、聴覚マスキングを用いても良い。例えば、聴覚マスキングを用いてスケーリング係数の符号化に必要なビット配分を決定し、そのビット配分情報に基づき符号化を行う。このとき、全くビットが配分されない帯域がある場合には、その帯域のスケーリング係数は符号化されないことになる。これにより、スケーリング係数の符号化を効率化することができる。 Auditory masking may be used in encoding the scaling factor. For example, bit allocation necessary for encoding the scaling coefficient is determined using auditory masking, and encoding is performed based on the bit allocation information. At this time, if there is a band to which no bits are allocated, the scaling coefficient of that band is not encoded. Thereby, the encoding of a scaling factor can be made efficient.

スケーリング係数復号化部１００２は、入力されたスケーリング係数符号化コードからスケーリング係数を復号し、復号されたスケーリング係数を正規化部１００６、微細スペクトル符号化部１００３および微細スペクトル復号化部１００５に出力する。 The scaling coefficient decoding unit 1002 decodes the scaling coefficient from the input scaling coefficient encoded code, and outputs the decoded scaling coefficient to the normalization unit 1006, the fine spectrum encoding unit 1003, and the fine spectrum decoding unit 1005. .

正規化部１００６は、スケーリング係数復号化部１００２より与えられるスケーリング係数を用いて、減算器１００７より与えられる残差スペクトルの正規化を行い、正規化後の残差スペクトルを微細スペクトル符号化部１００３に出力する。 The normalization unit 1006 normalizes the residual spectrum provided from the subtractor 1007 using the scaling coefficient provided from the scaling coefficient decoding unit 1002, and the normalized residual spectrum is subjected to the fine spectrum encoding unit 1003. Output to.

微細スペクトル符号化部１００３は、スケーリング係数復号化部１００２から入力されたスケーリング係数を用いて各帯域の聴覚的重要度を算出し、各帯域に割り当てられるビット数を求め、このビット数の条件のもとで正規化後の残差スペクトル（微細スペクトル
）の符号化を行う。そして、この符号化によって得られた微細スペクトル符号化コードを多重化部１００４および微細スペクトル復号化部１００５に出力する。 The fine spectrum encoding unit 1003 calculates the auditory importance of each band using the scaling coefficient input from the scaling coefficient decoding unit 1002, obtains the number of bits allocated to each band, and determines the condition of the number of bits. Originally, the normalized residual spectrum (fine spectrum) is encoded. Then, the fine spectrum encoded code obtained by this encoding is output to multiplexing section 1004 and fine spectrum decoding section 1005.

なお、正規化後の残差スペクトルの符号化の際には、聴覚マスキングを用いて聴感的な歪を小さくするように符号化しても良い。また、聴覚的重要度の算出に第１レイヤ復号スペクトルの情報を用いるようにしても良い。その場合、第１レイヤ復号スペクトルを微細スペクトル符号化部１００３へ入力するように構成する。 In addition, when encoding the residual spectrum after normalization, it may be encoded using auditory masking so as to reduce auditory distortion. Moreover, you may make it use the information of a 1st layer decoding spectrum for calculation of auditory importance. In this case, the first layer decoded spectrum is input to the fine spectrum encoding unit 1003.

スケーリング係数符号化部１００１および微細スペクトル符号化部１００３より出力される符号化コードは、多重化部１００４にて多重化され、第１のスペクトル符号化コードとして多重化部１０４に出力される。 The encoded codes output from the scaling coefficient encoding unit 1001 and the fine spectrum encoding unit 1003 are multiplexed by the multiplexing unit 1004 and output to the multiplexing unit 104 as the first spectrum encoded code.

微細スペクトル復号化部１００５は、スケーリング係数復号化部１００２から入力されたスケーリング係数を用いて各帯域の聴覚的重要度を算出し、各帯域に割り当てられたビット数を求め、スケーリング係数と微細スペクトル符号化部１００３から入力された微細スペクトル符号化コードとから各帯域の残差スペクトルを復号し、復号された残差スペクトルを加算器１００８へ出力する。なお、聴覚的重要度の算出に第１レイヤ復号スペクトルの情報を用いるようにしても良い。その場合、第１レイヤ復号スペクトルを微細スペクトル復号化部１００５へ入力するように構成する。 The fine spectrum decoding unit 1005 calculates the auditory importance of each band using the scaling coefficient input from the scaling coefficient decoding unit 1002, obtains the number of bits assigned to each band, and calculates the scaling coefficient and the fine spectrum. The residual spectrum of each band is decoded from the fine spectrum encoded code input from encoding section 1003, and the decoded residual spectrum is output to adder 1008. In addition, you may make it use the information of a 1st layer decoding spectrum for calculation of auditory importance. In this case, the first layer decoded spectrum is configured to be input to the fine spectrum decoding unit 1005.

加算器１００８は、復号された残差スペクトルと第１レイヤ復号スペクトルとを加算して第１の復号スペクトルを生成し、生成された第１の復号スペクトルを拡張帯域符号化部２０２へ出力する。 Adder 1008 adds the decoded residual spectrum and the first layer decoded spectrum to generate a first decoded spectrum, and outputs the generated first decoded spectrum to extension band coding section 202.

このように本実施の形態によれば、第１レイヤ復号スペクトルの品質を改善した後に、品質改善後のスペクトル、つまり第１のスペクトルを使って拡張帯域符号化部２０２で高域部（Ｎｎ≦ｋ＜Ｎｗ）のスペクトルを生成することにより、帯域拡張された復号信号の品質を改善することができる。 As described above, according to the present embodiment, after improving the quality of the first layer decoded spectrum, the extended band encoding unit 202 uses the spectrum after the quality improvement, that is, the first spectrum, in the high band part (Nn ≦ By generating a spectrum of k <Nw), it is possible to improve the quality of the band-extended decoded signal.

本実施の形態の第２レイヤ復号化部６０３の構成を、図１１を用いて詳細に行う。図１１において図７と同一名称のブロックは、同一の機能を有するため、ここではその詳細な説明を省略する。図１１において、第２レイヤ復号化部６０３は、分離部１１０１、第１スペクトル復号化部１１０２、拡張帯域復号化部７０１、周波数領域変換部７０２および時間領域変換部７０３を有する。 The configuration of second layer decoding section 603 of the present embodiment will be described in detail using FIG. In FIG. 11, since the block having the same name as FIG. 7 has the same function, its detailed description is omitted here. In FIG. 11, second layer decoding section 603 has demultiplexing section 1101, first spectrum decoding section 1102, extended band decoding section 701, frequency domain transform section 702 and time domain transform section 703.

分離部１１０１は、第２レイヤ符号化コードを、第１のスペクトル符号化コード、拡張帯域符号化コード、に分離し、第１のスペクトル符号化コードを第１スペクトル復号化部１１０２に、拡張帯域符号化コードを拡張帯域復号化部７０１に、それぞれ出力する。 Separating section 1101 separates the second layer encoded code into a first spectrum encoded code and an extended band encoded code, and sends the first spectrum encoded code to first spectrum decoding section 1102 The encoded code is output to extended band decoding section 701, respectively.

周波数領域変換部７０２は、第１レイヤ復号化部６０２から入力された第１レイヤ復号信号を周波数領域のパラメータ（例えばＭＤＣＴ係数など）に変換し、このパラメータを第１レイヤ復号スペクトルとして第１スペクトル復号化部１１０２に出力する。 Frequency domain transform section 702 transforms the first layer decoded signal input from first layer decoding section 602 into a frequency domain parameter (for example, MDCT coefficient), and uses this parameter as the first layer decoded spectrum as the first spectrum. The data is output to the decoding unit 1102.

第１スペクトル復号化部１１０２は、分離部１１０１から入力された第１のスペクトル符号化コードを復号して得られる第１レイヤの符号化誤差の量子化スペクトルを、周波数領域変換部７０２から入力された第１レイヤ復号スペクトルに加える。そして、その加算結果を第１の復号スペクトルとして拡張帯域復号化部７０１へ出力する。 First spectrum decoding section 1102 receives from frequency domain transform section 702 the quantized spectrum of the first layer encoding error obtained by decoding the first spectrum encoded code input from demultiplexing section 1101. Added to the first layer decoded spectrum. Then, the addition result is output to extended band decoding section 701 as the first decoded spectrum.

ここで、第１スペクトル復号化部１１０２の説明を、図１２を用いて詳細に行う。第１スペクトル復号化部１１０２は、分離部１２０１、スケーリング係数復号化部１２０２、
微細スペクトル復号化部１２０３およびスペクトル復号部１２０４を有する。 Here, the first spectrum decoding section 1102 will be described in detail with reference to FIG. The first spectrum decoding unit 1102 includes a separation unit 1201, a scaling coefficient decoding unit 1202,
A fine spectrum decoding unit 1203 and a spectrum decoding unit 1204 are provided.

分離部１２０１は、入力された第１のスペクトル符号化コードから、スケーリング係数を表す符号化コードと、微細スペクトル（スペクトル微細構造）を表す符号化コードと、を分離し、スケーリング係数符号化コードをスケーリング係数復号化部１２０２に、微細スペクトル符号化コードを微細スペクトル復号化部１２０３に、それぞれ出力する。 The separation unit 1201 separates an encoded code representing a scaling coefficient and an encoded code representing a fine spectrum (spectral fine structure) from the input first spectrum encoded code, and obtains the scaling coefficient encoded code. The fine spectrum encoded code is output to scaling coefficient decoding section 1202 and fine spectrum decoding section 1203, respectively.

スケーリング係数復号化部１２０２は、入力されたスケーリング係数符号化コードからスケーリング係数を復号し、復号されたスケーリング係数をスペクトル復号部１２０４および微細スペクトル復号化部１２０３に出力する。 The scaling coefficient decoding unit 1202 decodes the scaling coefficient from the input scaling coefficient encoded code, and outputs the decoded scaling coefficient to the spectrum decoding unit 1204 and the fine spectrum decoding unit 1203.

微細スペクトル復号化部１２０３は、スケーリング係数復号化部１２０２から入力されたスケーリング係数を用いて各帯域の聴覚的重要度を算出し、各帯域の微細スペクトルに割り当てられたビット数を求める。また、分離部１２０１から入力された微細スペクトル符号化コードから各帯域の微細スペクトルを復号し、復号された微細スペクトルをスペクトル復号部１２０４へ出力する。 The fine spectrum decoding unit 1203 calculates the auditory importance of each band using the scaling coefficient input from the scaling coefficient decoding unit 1202, and obtains the number of bits assigned to the fine spectrum of each band. In addition, the fine spectrum of each band is decoded from the fine spectrum encoded code input from separation section 1201, and the decoded fine spectrum is output to spectrum decoding section 1204.

なお、聴覚的重要度の算出に第１レイヤ復号スペクトルの情報を用いるようにしても良い。その場合、第１レイヤ復号スペクトルを微細スペクトル復号化部１２０３へ入力するように構成する。 In addition, you may make it use the information of a 1st layer decoding spectrum for calculation of auditory importance. In that case, the first layer decoded spectrum is configured to be input to the fine spectrum decoding unit 1203.

スペクトル復号部１２０４は、周波数領域変換部７０２から与えられた第１レイヤ復号スペクトルと、スケーリング係数復号化部１２０２から入力されたスケーリング係数と、微細スペクトル復号化部１２０３から入力された微細スペクトルと、から第１の復号スペクトルを復号し、この復号スペクトルを拡張帯域復号化部７０１へ出力する。 Spectrum decoding section 1204, first layer decoded spectrum provided from frequency domain transform section 702, scaling coefficient input from scaling coefficient decoding section 1202, fine spectrum input from the fine spectrum decoding section 1203, The first decoded spectrum is decoded, and this decoded spectrum is output to extended band decoding section 701.

なお、本実施の形態の拡張帯域符号化部２０２には、スペクトル残差形状符号帳３０５およびスペクトル残差ゲイン符号帳３０７を設けなくとも良い。この場合の拡張帯域符号化部２０２の構成は、図１３に示される。また、拡張帯域復号化部７０１には、スペクトル残差形状符号帳８０５およびスペクトル残差ゲイン符号帳８０６を設けなくとも良い。この場合の拡張帯域復号化部７０１の構成は、図１４に示される。なお、図１３および図１４にそれぞれ示されるフィルタリング部１３０１、１４０１の出力信号は、次の式（６）で表される。

The extended band encoding unit 202 of the present embodiment does not have to be provided with the spectrum residual shape codebook 305 and the spectrum residual gain codebook 307. The configuration of extension band encoding section 202 in this case is shown in FIG. In addition, extended band decoding section 701 does not have to provide spectral residual shape codebook 805 and spectral residual gain codebook 806. The configuration of extended band decoding section 701 in this case is shown in FIG. Note that the output signals of the

filtering units

1301 and 1401 shown in FIGS. 13 and 14 are expressed by the following equation (6).

本実施の形態では、第１レイヤ復号スペクトルの品質を改善した後に、この品質改善後のスペクトルを使って拡張帯域符号化部２０２で高域部（Ｎｎ≦ｋ＜Ｎｗ）のスペクトルを生成する。この構成によれば、復号信号の品質を改善することができる。この利点は、スペクトル残差形状符号帳およびスペクトル残差ゲイン符号帳の有無に関わらず享受できる。 In the present embodiment, after the quality of the first layer decoded spectrum is improved, a spectrum of a high frequency band (Nn ≦ k <Nw) is generated by extension band coding section 202 using the spectrum after the quality improvement. According to this configuration, the quality of the decoded signal can be improved. This advantage can be enjoyed regardless of the presence or absence of the spectral residual shape codebook and the spectral residual gain codebook.

なお、第１スペクトル符号化部９０１では、低域部（０≦ｋ＜Ｎｎ）のスペクトルの符号化を行う際に、全帯域（０≦ｋ＜Ｎｗ）の符号化歪が最も小さくなるように低域部（０≦ｋ＜Ｎｎ）のスペクトルの符号化を行っても良い。この場合、拡張帯域符号化部２０２では、高域部（Ｎｎ≦ｋ＜Ｎｗ）の符号化まで行われる。また、この場合、第１スペクトル符号化部９０１において、低域部の符号化結果が高域部の符号化に与える影響も考慮して低域部の符号化を行うことになる。したがって、全帯域のスペクトルが最適になるよう低域部のスペクトルの符号化が為されるようになるため、品質が向上するという効果が得
られる。 In the first spectrum encoding unit 901, when encoding the spectrum of the low band part (0 ≦ k <Nn), the encoding distortion of the entire band (0 ≦ k <Nw) is minimized. You may encode the spectrum of a low-pass part (0 <= k <Nn). In this case, the extension band encoding unit 202 performs the encoding up to the high band part (Nn ≦ k <Nw). In this case, in the first spectrum encoding unit 901, the low band part is encoded in consideration of the influence of the low band part encoding result on the high band part encoding. Accordingly, since the spectrum of the low frequency band is encoded so that the spectrum of the entire band is optimized, the effect of improving the quality can be obtained.

（実施の形態３）
本発明の実施の形態３に係る第２レイヤ符号化部１０５の構成を図１５に示す。図１５において図９と同一名称のブロックは同一の機能を有するため、ここではその詳細な説明を省略する。 (Embodiment 3)
The configuration of second layer coding section 105 according to Embodiment 3 of the present invention is shown in FIG. In FIG. 15, blocks having the same names as those in FIG. 9 have the same functions, and thus detailed description thereof is omitted here.

図９との違いは、復号機能を有し且つ拡張帯域符号化コードを求める拡張帯域符号化部１５０１と、その拡張帯域符号化コードを用いて第２の復号スペクトルを生成し原スペクトルから第２の復号スペクトルを減じて求められる誤差スペクトルを符号化する第２スペクトル符号化部１５０２と、が設けられた点にある。前述の誤差スペクトルを第２スペクトル符号化部１５０２にて符号化することでより高品質な復号スペクトルを生成できるようになり、復号化装置で得られる復号信号の品質を向上させることができる。 The difference from FIG. 9 is that an extension band encoding unit 1501 that has a decoding function and obtains an extension band encoded code, generates a second decoded spectrum using the extension band encoded code, and generates a second decoded spectrum from the original spectrum. And a second spectrum encoding unit 1502 for encoding an error spectrum obtained by subtracting the decoded spectrum. By encoding the error spectrum described above with the second spectrum encoding unit 1502, a higher quality decoded spectrum can be generated, and the quality of the decoded signal obtained by the decoding apparatus can be improved.

拡張帯域符号化部１５０１は、図３に示された拡張帯域符号化部２０２と同様に拡張帯域符号化コードを生成して出力する。また、拡張帯域符号化部１５０１は、図８に示される拡張帯域復号化部７０１と同様の構成を内包し、拡張帯域復号化部７０１と同様に第２の復号スペクトルを生成する。この第２の復号スペクトルは、第２スペクトル符号化部１５０２に出力される。すなわち、本実施の形態の第２レイヤ符号化コードは、拡張帯域符号化コード、第１のスペクトル符号化コードおよび第２のスペクトル符号化コードからなる。 The extension band encoding unit 1501 generates and outputs an extension band encoded code in the same manner as the extension band encoding unit 202 shown in FIG. Also, the extended band encoding unit 1501 includes the same configuration as the extended band decoding unit 701 shown in FIG. 8, and generates the second decoded spectrum in the same manner as the extended band decoding unit 701. This second decoded spectrum is output to second spectrum encoding section 1502. That is, the second layer encoded code of the present embodiment includes an extended band encoded code, a first spectrum encoded code, and a second spectrum encoded code.

なお、拡張帯域符号化部１５０１の構成において、図３および図８で共通の名称を持つブロックは共有化されていても良い。 In the configuration of extended band encoding section 1501, blocks having common names in FIGS. 3 and 8 may be shared.

第２スペクトル符号化部１５０２は、図１６に示すように、スケーリング係数符号化部１６０１、スケーリング係数復号化部１６０２、微細スペクトル符号化部１６０３、多重化部１６０４、正規化部１６０５および減算器１６０６を有する。 As shown in FIG. 16, the second spectrum encoding unit 1502 includes a scaling coefficient encoding unit 1601, a scaling coefficient decoding unit 1602, a fine spectrum encoding unit 1603, a multiplexing unit 1604, a normalization unit 1605, and a subtracter 1606. Have

減算器１６０６は、原スペクトルから第２の復号スペクトルを減じて残差スペクトルを生成し、残差スペクトルをスケーリング係数符号化部１６０１および正規化部１６０５に出力する。スケーリング係数符号化部１６０１は、残差スペクトルのスペクトル概形を表すスケーリング係数を算出し、当該スケーリング係数を符号化し、スケーリング係数符号化コードを多重化部１６０４およびスケーリング係数復号化部１６０２に出力する。 Subtractor 1606 generates a residual spectrum by subtracting the second decoded spectrum from the original spectrum, and outputs the residual spectrum to scaling coefficient encoding section 1601 and normalization section 1605. Scaling coefficient encoding section 1601 calculates a scaling coefficient that represents the spectral outline of the residual spectrum, encodes the scaling coefficient, and outputs the scaling coefficient encoded code to multiplexing section 1604 and scaling coefficient decoding section 1602. .

ここで、聴覚マスキングを用いてスケーリング係数の符号化の効率化を図っても良い。例えば、聴覚マスキングを用いてスケーリング係数の符号化に必要なビット配分を決定し、そのビット配分情報に基づき符号化を行う。このとき、全くビットが配分されない帯域がある場合には、その帯域のスケーリング係数は符号化されないことになる。 Here, the efficiency of encoding the scaling coefficient may be improved by using auditory masking. For example, bit allocation necessary for encoding the scaling coefficient is determined using auditory masking, and encoding is performed based on the bit allocation information. At this time, if there is a band to which no bits are allocated, the scaling coefficient of that band is not encoded.

スケーリング係数復号化部１６０２は、入力されたスケーリング係数符号化コードからスケーリング係数を復号し、復号されたスケーリング係数を正規化部１６０５および微細スペクトル符号化部１６０３に出力する。 The scaling coefficient decoding unit 1602 decodes the scaling coefficient from the input scaling coefficient encoded code, and outputs the decoded scaling coefficient to the normalization unit 1605 and the fine spectrum encoding unit 1603.

正規化部１６０５は、スケーリング係数復号化部１６０２より与えられるスケーリング係数を用いて、減算器１６０６より与えられる残差スペクトルの正規化を行い、正規化後の残差スペクトルを微細スペクトル符号化部１６０３に出力する。 The normalization unit 1605 normalizes the residual spectrum provided from the subtractor 1606 using the scaling coefficient provided from the scaling coefficient decoding unit 1602, and converts the residual spectrum after normalization into a fine spectrum encoding unit 1603. Output to.

微細スペクトル符号化部１６０３は、スケーリング係数復号化部１６０２から入力された復号スケーリング係数を用いて各帯域の聴覚的重要度を算出し、各帯域に割り当てられ
たビット数を求め、このビット数の条件のもとで正規化後の残差スペクトル（微細スペクトル）の符号化を行う。そして、この符号化によって得られた符号化コードを多重化部１６０４に出力する。 The fine spectrum encoding unit 1603 calculates the aural importance of each band using the decoding scaling coefficient input from the scaling coefficient decoding unit 1602, obtains the number of bits allocated to each band, and calculates the number of bits. The residual spectrum (normal spectrum) after normalization is encoded under the conditions. Then, the encoded code obtained by this encoding is output to multiplexing section 1604.

なお、正規化後の残差スペクトルの符号化の際には、聴覚マスキングを用いて聴感的な歪を小さくするように符号化しても良い。また、聴覚的重要度の算出に第２レイヤ復号スペクトルの情報を用いるようにしても良い。その場合、第２レイヤ復号スペクトルを微細スペクトル符号化部１６０３へ入力するように構成する。 In addition, when encoding the residual spectrum after normalization, it may be encoded using auditory masking so as to reduce auditory distortion. Moreover, you may make it use the information of a 2nd layer decoding spectrum for calculation of auditory importance. In this case, the second layer decoded spectrum is configured to be input to the fine spectrum encoding unit 1603.

スケーリング係数符号化部１６０１および微細スペクトル符号化部１６０３より出力される符号化コードは多重化部１６０４にて多重化され、第２のスペクトル符号化コードとして出力される。 The encoded codes output from the scaling coefficient encoding unit 1601 and the fine spectrum encoding unit 1603 are multiplexed by the multiplexing unit 1604 and output as the second spectrum encoded code.

図１７は、第２スペクトル符号化部１５０２の構成の変形例を示している。図１７において図１６と同一名称のブロックは同一機能を有するため、ここではその詳細な説明を省略する。 FIG. 17 illustrates a modification of the configuration of the second spectrum encoding unit 1502. In FIG. 17, since the block having the same name as that of FIG. 16 has the same function, detailed description thereof is omitted here.

この構成では、第２スペクトル符号化部１５０２は、減算器１６０６より与えられる残差スペクトルを直接符号化する。つまり、残差スペクトルの正規化は行われない。そのため本構成では、図１６に示されたスケーリング係数符号化部１６０１、スケーリング係数復号化部１６０２および正規化部１６０５が設けられていない。この構成によれば、第２スペクトル符号化部１５０２でスケーリング係数にビットを配分する必要が無くなるため、ビットレートを低減させることができる。 In this configuration, the second spectrum encoding unit 1502 directly encodes the residual spectrum given from the subtracter 1606. That is, normalization of the residual spectrum is not performed. Therefore, in this configuration, the scaling coefficient encoding unit 1601, the scaling coefficient decoding unit 1602, and the normalization unit 1605 shown in FIG. 16 are not provided. According to this configuration, it is not necessary for the second spectrum encoding unit 1502 to allocate bits to the scaling coefficient, so that the bit rate can be reduced.

聴覚重要度およびビット配分算出部１７０１は、第２の復号スペクトルから各帯域の聴覚重要度を求め、聴覚重要度に応じて決定される各帯域へのビット配分を求める。求められた聴覚重要度およびビット配分は、微細スペクトル符号化部１６０３へ出力される。 Auditory importance and bit allocation calculation section 1701 calculates the auditory importance of each band from the second decoded spectrum, and calculates the bit allocation to each band determined according to the auditory importance. The obtained auditory importance and bit allocation are output to the fine spectrum encoding unit 1603.

微細スペクトル符号化部１６０３は、聴覚重要度およびビット配分算出部１７０１から入力された聴覚重要度およびビット配分に基づいて、残差スペクトルを符号化する。そして、この符号化によって得られた符号化コードを第２のスペクトル符号化コードとして多重化部１０４に出力する。なお、残差スペクトルの符号化の際には、聴覚マスキングを用いて聴感的な歪を小さくするように符号化しても良い。 The fine spectrum encoding unit 1603 encodes the residual spectrum based on the auditory importance and bit allocation input from the auditory importance and bit allocation calculating unit 1701. Then, the encoded code obtained by this encoding is output to the multiplexing unit 104 as the second spectrum encoded code. When encoding the residual spectrum, it may be encoded using auditory masking to reduce auditory distortion.

本実施の形態の第２レイヤ復号化部６０３の構成を図１８に示す。第２レイヤ復号化部６０３は、拡張帯域復号化部７０１、周波数領域変換部７０２、時間領域変換部７０３、分離部１１０１、第１スペクトル復号化部１１０２および第２スペクトル復号化部１８０１を有する。図１８において図１１と同一名称のブロックは同一の機能を有するので、ここではその詳細な説明を省略する。 The configuration of second layer decoding section 603 in the present embodiment is shown in FIG. Second layer decoding section 603 includes extension band decoding section 701, frequency domain transform section 702, time domain transform section 703, separation section 1101, first spectrum decoding section 1102 and second spectrum decoding section 1801. In FIG. 18, since the block having the same name as that of FIG. 11 has the same function, the detailed description thereof is omitted here.

第２スペクトル復号化部１８０１は、分離部１１０１から入力された第２のスペクトル符号化コードを復号して得られる第２の復号スペクトルの符号化誤差を量子化したスペクトルを、拡張帯域復号化部７０１から入力された第２の復号スペクトルに加える。そして、この加算結果を第３の復号スペクトルとして時間領域変換部７０３へ出力する。 The second spectrum decoding unit 1801 quantizes the spectrum obtained by quantizing the coding error of the second decoded spectrum obtained by decoding the second spectrum encoded code input from the demultiplexing unit 1101, and the extended band decoding unit It adds to the 2nd decoding spectrum input from 701. FIG. Then, the addition result is output to the time domain conversion unit 703 as a third decoded spectrum.

第２スペクトル復号化部１８０１は、第２スペクトル符号化部１５０２が図１６に示す構成を採る場合、図１２と同様の構成を採る。ただし、図１２における第１のスペクトル符号化コード、第１レイヤ復号スペクトルおよび第１の復号スペクトルは、それぞれ、第２のスペクトル符号化コード、第２の復号スペクトルおよび第３の復号スペクトルに置き換わる。 Second spectrum decoding section 1801 adopts the same configuration as that in FIG. 12 when second spectrum encoding section 1502 adopts the configuration shown in FIG. However, the first spectrum encoded code, the first layer decoded spectrum, and the first decoded spectrum in FIG. 12 are replaced with the second spectrum encoded code, the second decoded spectrum, and the third decoded spectrum, respectively.

また、本実施の形態では、第２スペクトル復号化部１８０１の構成について、第２スペクトル符号化部１５０２が図１６に示す構成を採る場合を例に挙げて説明したが、第２スペクトル符号化部１５０２が図１７に示す構成を採る場合、第２スペクトル復号化部１８０１の構成は、図１９のようになる。 Further, in the present embodiment, the configuration of second spectrum decoding section 1801 has been described by taking the case where second spectrum encoding section 1502 adopts the configuration shown in FIG. 16 as an example, but second spectrum encoding section When 1502 adopts the configuration shown in FIG. 17, the configuration of second spectrum decoding section 1801 is as shown in FIG.

つまり図１９は、スケーリング係数を用いない第２スペクトル符号化部１５０２に対応する第２スペクトル復号化部１８０１の構成を示している。第２スペクトル復号化部１８０１は、聴覚重要度およびビット配分算出部１９０１と微細スペクトル復号化部１９０２とスペクトル復号部１９０３とを有する。 That is, FIG. 19 shows the configuration of second spectrum decoding section 1801 corresponding to second spectrum encoding section 1502 that does not use a scaling coefficient. Second spectrum decoding section 1801 has auditory importance and bit allocation calculation section 1901, fine spectrum decoding section 1902, and spectrum decoding section 1903.

図１９において、聴覚重要度およびビット配分算出部１９０１は、拡張帯域復号化部７０１から入力された第２の復号スペクトルから各帯域の聴覚重要度を求め、聴覚重要度に応じて決定される各帯域へのビット配分を求める。求められた聴覚重要度とビット配分は、微細スペクトル復号化部１９０２へ出力される。 In FIG. 19, the auditory importance and bit allocation calculating unit 1901 obtains the auditory importance of each band from the second decoded spectrum input from the extended band decoding unit 701, and is determined according to the auditory importance. Obtain the bit allocation to the band. The obtained auditory importance and bit allocation are output to the fine spectrum decoding unit 1902.

微細スペクトル復号化部１９０２は、聴覚重要度およびビット配分算出部１９０１から入力された聴覚重要度およびビット配分に基づいて、分離部１１０１から第２のスペクトル符号化コードとして入力される微細スペクトル符号化コードを復号し、その復号結果（各帯域の微細スペクトル）をスペクトル復号部１９０３に出力する。 The fine spectrum decoding unit 1902 performs fine spectrum encoding input from the separation unit 1101 as the second spectral encoding code based on the auditory importance and bit allocation input from the auditory importance and bit allocation calculation unit 1901. The code is decoded, and the decoding result (fine spectrum of each band) is output to the spectrum decoding unit 1903.

微細スペクトル復号化部１９０３は、拡張帯域復号化部７０１から入力された第２の復号スペクトルに、微細スペクトル復号化部１９０２から入力された微細スペクトルを加えて、その加算結果を第３の復号スペクトルとして外部へ出力する。 The fine spectrum decoding unit 1903 adds the fine spectrum input from the fine spectrum decoding unit 1902 to the second decoded spectrum input from the extended band decoding unit 701, and adds the addition result to the third decoded spectrum. Output to the outside.

なお、本実施の形態では、第１スペクトル符号化部９０１および第１スペクトル復号化部１１０２を含む構成を例に挙げて説明したが、第１スペクトル符号化部９０１および第１スペクトル復号化部１１０２が無くても本実施の形態の作用効果を実現することができる。その場合の第２レイヤ符号化部１０５の構成を図２０に、第２レイヤ復号化部６０３の構成を図２１に、それぞれ示す。 In the present embodiment, the configuration including first spectrum encoding section 901 and first spectrum decoding section 1102 has been described as an example. However, first spectrum encoding section 901 and first spectrum decoding section 1102 are described. Even if there is no, the effect of this Embodiment is realizable. In this case, the configuration of second layer encoding section 105 is shown in FIG. 20, and the configuration of second layer decoding section 603 is shown in FIG.

以上、本発明によるスケーラブル復号化装置およびスケーラブル符号化装置の実施の形態について説明した。 The embodiments of the scalable decoding device and the scalable encoding device according to the present invention have been described above.

なお、上記実施の形態においては、変換方式としてＭＤＣＴを使って説明したがこれに限定されず、他の変換方式、例えばフーリエ変換やコサイン変換、Wavelet変換などを使用したときにも本発明は適用できる。 In the above-described embodiment, MDCT is used as the conversion method. However, the present invention is not limited to this, and the present invention is also applicable when other conversion methods such as Fourier transform, cosine transform, and Wavelet transform are used. it can.

また、上記実施の形態においては、階層数２を基に説明したがこれに限定されず、２以上の階層を持つスケーラブル符号化／復号化にも適用できる。 In the above-described embodiment, the description has been made based on the number of layers 2. However, the present invention is not limited to this.

また、本発明に係る符号化装置および復号化装置は、上記の実施の形態１〜３に限定されず、種々変更して実施することが可能である。例えば、各実施の形態は、適宜組み合わせて実施することが可能である。 The encoding device and decoding device according to the present invention are not limited to the above-described first to third embodiments, and can be implemented with various modifications. For example, each embodiment can be implemented in combination as appropriate.

本発明に係る符号化装置および復号化装置は、移動体通信システムにおける通信端末装置および基地局装置に搭載することも可能であり、これにより上記と同様の作用効果を有する通信端末装置および基地局装置を提供することができる。 The encoding device and the decoding device according to the present invention can also be mounted on a communication terminal device and a base station device in a mobile communication system, whereby the communication terminal device and the base station having the same operational effects as described above An apparatus can be provided.

また、ここでは、本発明をハードウェアで構成する場合を例にとって説明したが、本発
明はソフトウェアで実現することも可能である。 Further, here, a case has been described as an example where the present invention is configured with hardware, but the present invention can also be implemented with software.

なお、上記各実施の形態の説明に用いた各機能ブロックは、典型的には集積回路であるＬＳＩとして実現される。これらは個別に１チップ化されてもよいし、一部又は全てを含むように１チップ化されてもよい。 Each functional block used in the description of each of the above embodiments is typically realized as an LSI that is an integrated circuit. These may be individually made into one chip, or may be made into one chip so as to include a part or all of them.

ここでは、ＬＳＩとしたが、集積度の違いにより、ＩＣ、システムＬＳＩ、スーパーＬＳＩ、ウルトラＬＳＩと呼称されることもある。 The name used here is LSI, but it may also be called IC, system LSI, super LSI, or ultra LSI depending on the degree of integration.

また、集積回路化の手法はＬＳＩに限るものではなく、専用回路又は汎用プロセッサで実現しても良い。ＬＳＩ製造後に、プログラムすることが可能なＦＰＧＡ（Field Programmable Gate Array）や、ＬＳＩ内部の回路セルの接続や設定を再構成可能なリコンフィギュラブル・プロセッサーを利用しても良い。 Further, the method of circuit integration is not limited to LSI's, and implementation using dedicated circuitry or general purpose processors is also possible. An FPGA (Field Programmable Gate Array) that can be programmed after manufacturing the LSI or a reconfigurable processor that can reconfigure the connection and setting of the circuit cells inside the LSI may be used.

さらには、半導体技術の進歩又は派生する別技術によりＬＳＩに置き換わる集積回路化の技術が登場すれば、当然、その技術を用いて機能ブロックの集積化を行ってもよい。バイオ技術の適応等が可能性としてありえる。 Further, if integrated circuit technology comes out to replace LSI's as a result of the advancement of semiconductor technology or a derivative other technology, it is naturally also possible to carry out function block integration using this technology. Biotechnology can be applied.

すなわち、上記実施の形態に係るスケーラブル符号化装置は、原信号から、低周波帯域の符号化情報と高周波帯域の符号化情報とを生成するスケーラブル符号化装置であって、前記低周波帯域の符号化情報の復号信号から低周波帯域の第１スペクトルを算出する第１スペクトル算出手段と、前記原信号から第２スペクトルを算出する第２スペクトル算出手段と、前記第１スペクトルと前記第２スペクトルの高周波帯域部との類似具合を示す第１パラメータを算出する第１パラメータ算出手段と、前記第１スペクトルと前記第２スペクトルの高周波帯域部との変動成分を示す第２パラメータを算出する第２パラメータ算出手段と、算出された第１パラメータと第２パラメータとを前記高周波帯域の符号化情報として符号化する符号化手段と、を有する構成を採る。 That is, the scalable encoding device according to the above embodiment is a scalable encoding device that generates low frequency band encoded information and high frequency band encoded information from an original signal, and includes the low frequency band code. First spectrum calculating means for calculating a first spectrum in a low frequency band from the decoded signal of the digitized information, second spectrum calculating means for calculating a second spectrum from the original signal, the first spectrum and the second spectrum A first parameter calculating means for calculating a first parameter indicating the degree of similarity with the high frequency band, and a second parameter for calculating a second parameter indicating a fluctuation component between the high frequency band of the first spectrum and the second spectrum. Calculating means, and encoding means for encoding the calculated first parameter and second parameter as encoding information of the high frequency band, A configuration that.

また、上記実施の形態に係るスケーラブル符号化装置は、上記構成において、前記第１パラメータ算出手段は、前記第１スペクトルを内部状態として有するフィルタを用いて、前記フィルタの特性を示すパラメータを前記第１パラメータとして出力する構成を採る。 In the scalable encoding device according to the above-described embodiment, in the above configuration, the first parameter calculation unit uses a filter having the first spectrum as an internal state to set a parameter indicating a characteristic of the filter in the first configuration. A configuration for outputting as one parameter is adopted.

また、上記実施の形態に係るスケーラブル符号化装置は、上記構成において、前記第２パラメータ算出手段は、スペクトル残差の候補を複数記録しているスペクトル残差形状符号帳を有し、前記スペクトル残差の符号を前記第２パラメータとして出力する構成を採る。 In the scalable encoding device according to the above-described embodiment, the second parameter calculation unit includes a spectrum residual shape codebook in which a plurality of spectrum residual candidates are recorded, and the spectrum residual code has the configuration described above. A configuration is adopted in which the sign of the difference is output as the second parameter.

また、上記実施の形態に係るスケーラブル符号化装置は、上記構成において、前記第１スペクトルと前記第２スペクトルの低周波帯域部との残差成分を符号化する残差成分符号化手段をさらに有し、前記第１パラメータ算出手段および前記第２パラメータ算出手段は、前記残差成分符号化手段によって符号化された残差成分を用いて前記第１スペクトルの品質を向上させた後に、前記第１パラメータおよび第２パラメータを算出する構成を採る。 The scalable coding apparatus according to the above embodiment further includes residual component encoding means for encoding the residual component between the first spectrum and the low frequency band portion of the second spectrum in the above configuration. The first parameter calculating means and the second parameter calculating means improve the quality of the first spectrum using the residual component encoded by the residual component encoding means, and then A configuration for calculating the parameter and the second parameter is adopted.

また、上記実施の形態に係るスケーラブル符号化装置は、上記構成において、前記残差成分符号化手段は、前記第１スペクトルの低周波帯域部の品質と、前記符号化手段によって符号化された第１パラメータと第２パラメータとから得られる復号スペクトルの高周波帯域部の品質と、の両方を向上させる構成を採る。 In the scalable encoding device according to the above-described embodiment, in the above configuration, the residual component encoding unit includes the quality of the low frequency band portion of the first spectrum and the encoding unit encoded by the encoding unit. A configuration is adopted in which both the quality of the high frequency band portion of the decoded spectrum obtained from the first parameter and the second parameter are improved.

また、上記実施の形態に係るスケーラブル符号化装置は、上記構成において、前記第１
パラメータは、ラグを含み、前記第２パラメータは、スペクトル残差を含み、前記ラグ、前記スペクトル残差の順に配置されたビットストリームを構成する構成手段をさらに有する構成を採る。 Further, the scalable coding apparatus according to the above embodiment has the above configuration in the first configuration.
The parameter includes a lag, the second parameter includes a spectrum residual, and further includes configuration means for configuring a bitstream arranged in the order of the lag and the spectrum residual.

また、上記実施の形態に係るスケーラブル符号化装置は、原信号から、低周波帯域の符号化情報と高周波帯域の符号化情報とを生成する符号化装置であって、前記低周波帯域の符号化情報の復号信号から低周波帯域の第１スペクトルを算出する第１スペクトル算出手段と、前記原信号から第２スペクトルを算出する第２スペクトル算出手段と、前記第１スペクトルと前記第２スペクトルの高周波帯域部との類似具合を示すパラメータを算出するパラメータ算出手段と、算出されたパラメータを前記高周波帯域の符号化情報として符号化するパラメータ符号化手段と、前記第１スペクトルと前記第２スペクトルの低周波帯域部との残差成分を符号化する残差成分符号化手段と、を有し、前記パラメータ算出手段は、前記残差成分符号化手段によって符号化された残差成分を用いて前記第１スペクトルの品質を向上させた後に、前記パラメータを算出する構成を採る。 The scalable encoding device according to the above embodiment is an encoding device that generates low frequency band encoding information and high frequency band encoding information from an original signal, and the low frequency band encoding A first spectrum calculating means for calculating a first spectrum in a low frequency band from a decoded signal of information; a second spectrum calculating means for calculating a second spectrum from the original signal; and a high frequency of the first spectrum and the second spectrum. Parameter calculating means for calculating a parameter indicating the degree of similarity with the band part, parameter encoding means for encoding the calculated parameter as the encoding information of the high frequency band, low values of the first spectrum and the second spectrum A residual component encoding unit that encodes a residual component with the frequency band unit, and the parameter calculation unit includes the residual component encoding unit. After improving the quality of the first spectrum using the coded residual components Te, a configuration to calculate the parameters.

また、上記実施の形態に係るスケーラブル復号化装置は、低周波帯域に対応する第１スペクトルを取得するスペクトル取得手段と、高周波帯域の符号化情報として符号化された第１パラメータであって、前記第１スペクトルと原信号に対応する第２スペクトルの高周波帯域部との類似具合を示す第１パラメータと、高周波帯域の符号化情報として符号化された第２パラメータであって、前記第１スペクトルと前記高周波帯域部との変動成分を示す第２パラメータと、をそれぞれ取得するパラメータ取得手段と、取得された第１パラメータおよび第２パラメータを用いて前記第２スペクトルを復号する復号手段と、を有する構成を採る。 Further, the scalable decoding device according to the embodiment includes a spectrum acquisition unit that acquires a first spectrum corresponding to a low frequency band, and a first parameter that is encoded as encoded information of a high frequency band, A first parameter indicating the degree of similarity between the first spectrum and the high-frequency band portion of the second spectrum corresponding to the original signal; and a second parameter encoded as high-frequency band encoding information, Parameter acquisition means for acquiring a second parameter indicating a fluctuation component with respect to the high-frequency band section; and decoding means for decoding the second spectrum using the acquired first parameter and second parameter. Take the configuration.

また、上記実施の形態に係るスケーラブル符号化方法は、原信号から、低周波帯域の符号化情報と高周波帯域の符号化情報とを生成するスケーラブル符号化方法であって、前記低周波帯域の符号化情報の復号信号から低周波帯域の第１スペクトルを算出する第１スペクトル算出ステップと、前記原信号から第２スペクトルを算出する第２スペクトル算出ステップと、前記第１スペクトルと前記第２スペクトルの高周波帯域部との類似具合を示す第１パラメータを算出する第１パラメータ算出ステップと、前記第１スペクトルと前記第２スペクトルの高周波帯域部との変動成分を示す第２パラメータを算出する第２パラメータ算出ステップと、算出された第１パラメータと第２パラメータとを前記高周波帯域の符号化情報として符号化する符号化ステップと、
を有するようにした。 The scalable encoding method according to the above embodiment is a scalable encoding method for generating low frequency band encoded information and high frequency band encoded information from an original signal, and the low frequency band code A first spectrum calculating step for calculating a first spectrum in a low frequency band from the decoded signal of the conversion information, a second spectrum calculating step for calculating a second spectrum from the original signal, and the first spectrum and the second spectrum A first parameter calculating step for calculating a first parameter indicating the degree of similarity with the high frequency band part; and a second parameter for calculating a second parameter indicating a fluctuation component between the high frequency band part of the first spectrum and the second spectrum. A code for encoding the calculation step and the calculated first parameter and second parameter as encoding information of the high frequency band And the step,
It was made to have.

また、上記実施の形態に係るスケーラブル復号化方法は、低周波帯域に対応する第１スペクトルを取得するスペクトル取得ステップと、高周波帯域の符号化情報として符号化された第１パラメータであって、前記第１スペクトルと原信号に対応する第２スペクトルの高周波帯域部との類似具合を示す第１パラメータと、高周波帯域の符号化情報として符号化された第２パラメータであって、前記第１スペクトルと前記高周波帯域部との変動成分を示す第２パラメータと、をそれぞれ取得するパラメータ取得ステップと、取得された第１パラメータおよび第２パラメータを用いて前記第２スペクトルを復号する復号ステップと、を有するようにした。 The scalable decoding method according to the embodiment includes a spectrum acquisition step of acquiring a first spectrum corresponding to a low frequency band, and a first parameter encoded as encoded information of a high frequency band, A first parameter indicating the degree of similarity between the first spectrum and the high-frequency band portion of the second spectrum corresponding to the original signal; and a second parameter encoded as high-frequency band encoding information, A parameter acquisition step for acquiring a second parameter indicating a fluctuation component with respect to the high-frequency band section; and a decoding step for decoding the second spectrum using the acquired first parameter and second parameter. I did it.

特に、本発明による第１のスケーラブル符号化装置は、第１スペクトルを内部状態として持つフィルタを用いて第２スペクトルの高域部を推定し、フィルタ情報を符号化して送るスペクトル符号化装置において、スペクトル残差の候補が複数記録されているスペクトル残差形状符号帳を有し、前記フィルタの入力信号としてスペクトル残差を与えフィルタリングを行い第２スペクトルの高域部を推定するもので、スペクトル残差を用いることにより、第１スペクトルの変形では表せない第２スペクトルの高域部の成分を符号化するこ
とができるようになるため、第２スペクトルの高域部の推定性能が向上し高品質化が為される。 In particular, a first scalable coding apparatus according to the present invention uses a filter having a first spectrum as an internal state to estimate a high-frequency part of a second spectrum, and encodes and sends filter information. It has a spectrum residual shape codebook in which a plurality of spectral residual candidates are recorded, and applies a spectral residual as an input signal of the filter to perform filtering to estimate a high frequency part of the second spectrum. By using the difference, it becomes possible to encode the high-frequency component of the second spectrum that cannot be expressed by the deformation of the first spectrum, so that the estimation performance of the high-frequency part of the second spectrum is improved and high quality is achieved. Is made.

また、本発明による第２のスケーラブル符号化装置は、第２スペクトルの低域部と第１スペクトルの間の誤差成分を符号化して第１スペクトルの高品質化を図った後に、この第１スペクトルを内部状態として持つフィルタを用いて第２スペクトルの高域部を推定するもので、第２スペクトルの低域部に対する第１スペクトルの品質を改善させた後に、品質改善後の第１スペクトルを用いて第２スペクトルの高域部を推定することにより、推定性能が向上し高品質化が為される。 The second scalable encoding device according to the present invention encodes an error component between the low-frequency part of the second spectrum and the first spectrum to improve the quality of the first spectrum, and then the first spectrum. Is used to estimate the high-frequency part of the second spectrum using a filter having an internal state, and after improving the quality of the first spectrum with respect to the low-frequency part of the second spectrum, the first spectrum after quality improvement is used. By estimating the high frequency part of the second spectrum, the estimation performance is improved and the quality is improved.

また、本発明による第３のスケーラブル符号化装置は、第１スペクトルを内部状態として持つフィルタを用いて第２スペクトルの高域部を推定して生成される推定スペクトルと第２スペクトルの高域部の間の誤差成分と、第２スペクトルの低域部と第１スペクトルの間の誤差成分の両誤差成分を小さくするように、第２スペクトルの低域部と第１スペクトルの間の誤差成分を符号化するもので、第１スペクトルと第２スペクトルの低域部の間の誤差成分を符号化する際に、第１スペクトルおよび第２スペクトルの高域部の推定スペクトルの両品質が同時に向上する第１スペクトルの符号化が為されるため、高品質化が実現できる。 The third scalable coding apparatus according to the present invention also includes an estimated spectrum generated by estimating a high frequency part of the second spectrum using a filter having the first spectrum as an internal state, and a high frequency part of the second spectrum. The error component between the low-frequency part of the second spectrum and the first spectrum is reduced so as to reduce both the error component between the low-frequency part of the second spectrum and the error component between the low-frequency part of the second spectrum and the first spectrum. When encoding an error component between the first spectrum and the low-frequency part of the second spectrum, both qualities of the estimated spectrum of the first spectrum and the high-frequency part of the second spectrum are improved at the same time. Since the first spectrum is encoded, high quality can be realized.

また、上記第１〜３のスケーラブル符号化装置においては、符号化装置にて復号化装置に伝送されるビットストリームを生成する際に、当該ビットストリームは少なくとも、スケールファクタ、ダイナミックレンジ調整係数、ラグ、を含み、この順番でビットストリームを構成するようにしてもよい。これにより、ビットストリームの構成は復号信号の品質に与える影響が大きいパラメータほどビットストリームのＭＳＢ（Most Significant Bit）の近くに配置されているため、ビットストリームのＬＳＢ（Least Significant Bit）から任意のビット位置でビットが削除されても品質劣化が生じ難いという効果が得られる。 In the first to third scalable encoding devices, when the bit stream to be transmitted to the decoding device is generated by the encoding device, the bit stream includes at least a scale factor, a dynamic range adjustment coefficient, a lag. , And the bit stream may be configured in this order. As a result, the bit stream configuration is arranged closer to the MSB (Most Significant Bit) of the bit stream as the parameter having a greater influence on the quality of the decoded signal, and therefore any bit from the LSB (Least Significant Bit) of the bit stream Even if a bit is deleted at a position, an effect that quality degradation hardly occurs is obtained.

本明細書は、２００４年１１月５日出願の特願２００４−３２２９５９に基づく。この内容はすべてここに含めておく。 This specification is based on Japanese Patent Application No. 2004-322959 filed on November 5, 2004. All this content is included here.

本発明に係る符号化装置、復号化装置、符号化方法及び復号化方法は、スケーラブル符号化／復号化等に適用できる。 The encoding apparatus, decoding apparatus, encoding method, and decoding method according to the present invention can be applied to scalable encoding / decoding and the like.

本発明の実施の形態１に係る符号化装置の構成を示すブロック図FIG. 1 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 1 of the present invention. 本発明の実施の形態１に係る第２レイヤ符号化部の構成を示すブロック図The block diagram which shows the structure of the 2nd layer encoding part which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る拡張帯域符号化部の構成を示すブロック図FIG. 3 is a block diagram showing a configuration of an extension band encoding unit according to Embodiment 1 of the present invention. 本発明の実施の形態１に係る拡張帯域符号化部のフィルタリング部で処理される生成スペクトルバッファを示す模式図Schematic diagram showing a generated spectrum buffer processed by the filtering unit of the extension band encoding unit according to Embodiment 1 of the present invention 本発明の実施の形態１に係る符号化装置の多重化部から出力されるビットストリームの内容を示す模式図Schematic diagram showing the contents of the bitstream output from the multiplexing unit of the encoding apparatus according to Embodiment 1 of the present invention. 本発明の実施の形態１に係る復号化装置の構成を示すブロック図The block diagram which shows the structure of the decoding apparatus which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る第２レイヤ復号化部の構成を示すブロック図The block diagram which shows the structure of the 2nd layer decoding part which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係る拡張帯域復号化部の構成を示すブロック図FIG. 2 is a block diagram showing a configuration of an extended band decoding unit according to Embodiment 1 of the present invention. 本発明の実施の形態２に係る第２レイヤ符号化部の構成を示すブロック図The block diagram which shows the structure of the 2nd layer encoding part which concerns on Embodiment 2 of this invention. 本発明の実施の形態２に係る第１スペクトル符号化部の構成を示すブロック図The block diagram which shows the structure of the 1st spectrum encoding part which concerns on Embodiment 2 of this invention. 本発明の実施の形態２に係る第２レイヤ復号化部の構成を示すブロック図The block diagram which shows the structure of the 2nd layer decoding part which concerns on Embodiment 2 of this invention. 本発明の実施の形態２に係る第１スペクトル復号化部の構成を示すブロック図The block diagram which shows the structure of the 1st spectrum decoding part which concerns on Embodiment 2 of this invention. 本発明の実施の形態２に係る拡張帯域符号化部の構成を示すブロック図FIG. 9 is a block diagram showing a configuration of an extension band encoding unit according to Embodiment 2 of the present invention. 本発明の実施の形態２に係る拡張帯域復号化部の構成を示すブロック図FIG. 9 is a block diagram showing a configuration of an extended band decoding unit according to Embodiment 2 of the present invention. 本発明の実施の形態３に係る第２レイヤ符号化部の構成を示すブロック図The block diagram which shows the structure of the 2nd layer encoding part which concerns on Embodiment 3 of this invention. 本発明の実施の形態３に係る第２スペクトル符号化部の構成を示すブロック図The block diagram which shows the structure of the 2nd spectrum encoding part which concerns on Embodiment 3 of this invention. 本発明の実施の形態３に係る第２スペクトル符号化部の構成の変形例を示すブロック図The block diagram which shows the modification of a structure of the 2nd spectrum encoding part which concerns on Embodiment 3 of this invention. 本発明の実施の形態３に係る第２レイヤ復号化部の構成を示すブロック図The block diagram which shows the structure of the 2nd layer decoding part which concerns on Embodiment 3 of this invention. 本発明の実施の形態３に係る第２スペクトル復号化部の構成の変形例を示すブロック図The block diagram which shows the modification of a structure of the 2nd spectrum decoding part which concerns on Embodiment 3 of this invention. 本発明の実施の形態３に係る第２レイヤ符号化部の構成の変形例を示すブロック図Block diagram showing a modification of the configuration of the second layer encoding section according to Embodiment 3 of the present invention 本発明の実施の形態３に係る第２レイヤ復号化部の構成の変形例を示すブロック図Block diagram showing a modification of the configuration of the second layer decoding section according to Embodiment 3 of the present invention

Claims

An encoding device for generating low frequency band encoded information and high frequency band encoded information from an original signal,
First spectrum calculating means for calculating a first spectrum of a low frequency band from a decoded signal of the encoded information of the low frequency band;
Second spectrum calculating means for calculating a second spectrum from the original signal;
Using a filter having the first spectrum as an internal state, first outputs a parameter indicating a characteristic of the filter, as a first parameter indicating a similar degree of high-frequency band portion of the second spectrum and the first spectrum Parameter calculation means;
One spectral residual candidate code from a spectral residual shape codebook in which a plurality of spectral residual candidates are recorded indicates a fluctuation component between the high frequency band portion of the first spectrum and the second spectrum. Second parameter calculating means for outputting as a second parameter;
Determining means for simultaneously determining the first parameter and the second parameter for generating an estimated value most similar to the high-frequency band portion of the second spectrum from the output first parameter and second parameter;
Encoding means for encoding the determined first parameter and second parameter as encoding information of the high frequency band;
An encoding device.

A residual component encoding means for encoding a residual component between the first spectrum and the low frequency band of the second spectrum;
The first parameter calculation means and the second parameter calculation means are:
Calculating the first parameter and the second parameter after improving the quality of the first spectrum using the residual component encoded by the residual component encoding means;
The encoding device according to claim 1.

The residual component encoding means includes:
Improving both the quality of the low frequency band portion of the first spectrum and the quality of the high frequency band portion of the decoded spectrum obtained from the first parameter and the second parameter encoded by the encoding means;
The encoding device according to claim 2 .

The first parameter includes a lag, the second parameter includes a spectral residual;
And further comprising means for configuring a bitstream arranged in the order of the lag and the spectral residuals.
The encoding device according to claim 1.

An encoding method for generating low frequency band encoded information and high frequency band encoded information from an original signal,
A first spectrum calculating step of calculating a first spectrum of a low frequency band from a decoded signal of the encoded information of the low frequency band;
A second spectrum calculating step of calculating a second spectrum from the original signal;
Using a filter having the first spectrum as an internal state, first to calculate the parameter indicating the characteristic of the filter, as a first parameter indicating a similar degree of high-frequency band portion of the second spectrum and the first spectrum A parameter calculation step;
One spectral residual candidate code from a spectral residual shape codebook in which a plurality of spectral residual candidates are recorded indicates a fluctuation component between the high frequency band portion of the first spectrum and the second spectrum. A second parameter calculating step for calculating as a second parameter;
A determination step of simultaneously determining the first parameter and the second parameter that generate an estimated value that is most similar to the high frequency band portion of the second spectrum from the calculated first parameter and second parameter;
An encoding step of encoding the determined first parameter and second parameter as encoding information of the high frequency band;
An encoding method comprising: