JP2789585B2

JP2789585B2 - High efficiency coding device

Info

Publication number: JP2789585B2
Application number: JP27056487A
Authority: JP
Inventors: 哲二郎近藤
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1987-10-27
Filing date: 1987-10-27
Publication date: 1998-08-20
Anticipated expiration: 2013-08-20
Also published as: JPH01114179A

Description

【発明の詳細な説明】〔産業上の利用分野〕この発明は、コサイン変換（discrete cosine transf
orm）等の変換符号化に適用される高能率符号化装置に
関する。〔従来の技術〕画像信号の冗長度を抑圧するために、所定数の画素か
らなるブロックに画面を分割し、ブロック毎に原画像信
号の特徴と合った変換軸で線形変換を行う変換符号化が
知られている。変換符号化としては、アダマール変換，
コサイン変換等が知られている。例えば「“IEEE TRANS
ACTIONS ON COMMUNICATIONS"VOL,COM−32,NO.3,MARCH,1
984,ページ225〜231」には、第９図に示すような構成の
コサイン変換符号化装置が記載されている。第９図において、31で示す入力端子には、標本化され
た離散的な画像信号ｆ（j,k）が供給され、この入力信
号がコサイン変換回路32に供給される。コサイン変換回
路32では、２次元コサイン変換がなされる。２次元コサ
イン変換では、次式で示される処理がなされる。但し、
原データは、１ブロックが（Ｎ×Ｎ）サンプルの２次元
データｆ（j,k）（j,k＝0,1,...,N−１）とする。コサイン変換回路32からの係数値Ｆ（u,v）がスレッ
ショルディング回路33に供給され、情報量の削減がなさ
れる。スレッショルディング回路33からの係数値F_T（u,
v）が量子化回路34に供給され、量子化回路34の出力が
コーディング回路35に供給され、ランレングス符号化及
びハフマン符号化により、所定ビット数のコード信号に
変換される。コーディング回路35からのコード信号がバッファメモ
リ36に供給される。バッファメモリ36は、コーディング
回路35からのコード信号の伝送レートを伝送路のレート
を超えない範囲のレートに変換するために設けられてい
る。バッファメモリ36の入力側のデータレートは、可変
のものであるが、バッファメモリ36の出力側のデータレ
ートが略々一定となる。バッファメモリ36からの出力デ
ータが端子37に取り出される。上述のように、コサイン変換回路32から得られる係数
値に対してスレッショルディングで情報量の削減がなさ
れ、バッファリングが実行される。スレッショルディン
グは、しきい値をＴとする時に、次式の実行であるが、
ブロックの直流成分を示すＦ（0,0）は、スレッショル
ディングの対象から除かれる。〔発明が解決しようとする問題点〕上述のようなフィードバック型のバッファリングは、
バッファメモリがオーバーフローしそうになると、バッ
ファメモリへの入力データのレートを低下させ、逆に、
バッファメモリがアンダーフローしそうになると、バッ
ファメモリへの入力データのレートを上昇させるよう
に、制御を行う。帰還量に対する感度を上げ過ぎると、
目標値付近で発振し、逆に感度を下げ過ぎると、収束に
時間がかかる問題が生じる。収束に時間がかかる時に
は、バッファメモリ36の容量を増やす必要がある。この
ように、従来のバッファリング処理は、実用に当たって
は、相当のノウハウが必要な欠点があった。また、従来のフィードバック型のバッファリング装置
は、出力データのレートを完全に一定レートにすること
が困難である。ディジタルVTRへの変換符号化の適用を
考えると、変速再生例えば静止画再生を良好に行うため
には、１本のトラックに正確に１フレーム或いは１フィ
ールドのデータが記録される必要がある。従って、従来
の変換符号化装置をそのままディジタルVTRに適用する
ことは、困難であった。従って、この発明の目的は、フィードフォワード型の
バッファリングにより、フレーム単位でデータレートを
一定レートすることができ、また、効率良くデータを圧
縮することができる高能率符号化装置を提供することに
ある。〔問題点を解決するための手段〕この発明は、画面を所定数の画素で構成されるブロッ
クに分割し、ブロック毎に画素データを直交変換し、係
数値データを得る変換回路と、任意の数のブロック毎の係数値データを再ブロック化
するブロック化回路と、再ブロック化により形成された各ブロックの量子化幅
を変化させた時の所定期間の発生情報量を求め、発生情
報量が目標値を越えない範囲で、且つ量子化歪が最小と
なるように、各ブロックの量子化幅を決定するバッファ
リング回路と、再ブロック化により形成された各ブロックの直流成分
以外の係数値データを、元のビット数より少ないビット
数で、且つバッファリング回路により決定された量子化
幅で量子化する量子化回路とよりなり、量子化回路の出力とバッファリング回路により決定さ
れた量子化幅を示す情報とを伝送するようにしたことを
特徴とする高能率符号化装置である。〔作用〕変換符号化からの係数値データは、元の画像データが
圧縮されたものとなる。また、ダイナミックレンジに適
応した符号化は、フィードフォワード型のバッファリン
グを行うことができ、出力データを例えばフレーム単位
で一定レートとすることができる。従って、この発明
は、ディジタルVTRに使用して好適である。〔実施例〕この発明が適用されたディジタルVTRについて図面を
参照して詳細に説明する。この説明は、下記の項目に従
ってなされる。 a.送信側及び受信側の構成 b.可変長量子化とバッファリング c.変形例なお、ディジタルVTRの場合では、送信側が記録側に
対応し、送信側が再生側に対応する。 a.送信側及び受信側の構成第１図において、１で示す入力端子に標本化された離
散的な画像信号が供給され、入力ディジタル信号がコサ
イン変換回路２に供給される。コサイン変換回路２で
は、従来と同様の処理により、２次元コサイン変換がな
される。コサイン変換回路２からブロックサイズ（例え
ば８×８）と対応する（８×８）の係数テーブルが得ら
れる。この係数テーブルにおいて、直流（平均値）成分
が９ビットで残りの交流成分の係数が８ビットとされ
る。コサイン変換回路２からの係数テーブルがブロック
化回路３に供給される。ブロック化回路３により、再ブロック化がなされ、係
数テーブルがADRC符号化の単位であるブロック毎に連続
する信号に変換される。ブロック化回路３では、例えば
（８×８）の係数テーブルが第２図に示すように、４等
分される。この場合、直流成分は、ADRC符号化されず
に、そのまま伝送される。また、第３図Ａに示すよう
に、係数テーブルがジグザグ走査に応じたデータ系列に
変換される時には、このジグザグ走査と対応して、第３
図Ｂに示すような領域に再ブロック化を行うようにして
も良い。ブロック化回路３の出力信号が最大値MAXをブロック
毎に検出する最大値検出回路4,最小値MINをブロック毎
に検出する最小値検出回路５及び遅延回路６に供給され
る。検出された最大値MAX及び最小値MINが減算回路７に
供給され、（MAX−MIN＝DR）で表されるダイナミックレ
ンジDRが減算回路７から得られる。遅延回路６は、最大
値MAX及び最小値MINを検出するために必要の時間、デー
タを遅延させる。遅延回路６からのビデオデータから最
小値MINが減算回路８において減算され、減算回路８か
らは、最小値除去後のデータPDIが得られる。最小値除去後のデータPDIが遅延回路９を介して量子
化回路10に供給されると共に、ROM12から量子化幅Δｉ
のデータが供給される。量子化回路10は、量子化幅Δｉ
を用いてデータPDIを量子化する可変長のADRC符号化を
行う。即ち、量子化回路10では、ブロック内の画素デー
タが共有する最小値MINが除去された画素データPDIが量
子化幅Δｉで除算され、ブロックのダイナミックレンジ
DRに応じた可変のビット数（0,1,2,3,又は４ビット）の
量子化がなされる。ブロック内のビデオ信号は、変換符号化により、ダイ
ナミックレンジDRが元のデータの値に比して小さくな
り、８ビットより少ない０ビット,1ビット,2ビット,3ビ
ット又は４ビットのビット数で量子化しても、量子化歪
が目立たない。量子化回路10は、例えばROMで構成され
る。量子化回路10からは、最大のビット数である４ビッ
トのコード信号が発生し、この量子化回路10の出力信号
の中で有効ビットが次段のフレーム化回路15において選
択される。このため、ROM12では、量子化幅Δｉと共
に、そのブロックのビット数を示すデータNbが形成さ
れ、データNbがフレーム化回路15に供給される。ディジタルVTRでは、記録されるデータの伝送レート
が一定であるため、伝送データ量を制限しないと、一部
のデータを記録できなかったり、必要以上に圧縮率を高
くして再生画像の質が劣化したりする。そこで、バッフ
ァリング回路11が設けられ、ADRC符号化されようとする
１画面の全ブロックのダイナミックレンジDRの度数分布
が調べられ、最適な可変長符号化がなされる。バッファ
リング回路11には、演算回路７からダイナミックレンジ
DRが供給される。バッファリング回路11では、伝送デー
タのレートが一定となるようなしきい値T1,T2,T3,T4が
求められ、このしきい値と対応するパラメータコードPi
が出力される。このパラメータコードPiとブロックのダ
イナミックレンジDRで定まる量子化幅ΔｉがROM12から
読み出される。遅延回路13及び14は、最適なしきい値がバッファリン
グ回路11で求まり、可変長量子化がされる迄の時間、ダ
イナミックレンジDR及び最小値MINを遅延させる。バッ
ファリング回路11からのパラメータコードPiと遅延回路
13,14からのダイナミックレンジDR及び最小値MINと量子
化回路10からのコード信号DTと遅延回路16を介された直
流成分の９ビットのデータとがフレーム化回路15に供給
される。フレーム化回路15は、エラー訂正用の符号化を
施したり、同期信号の付加を行う。フレーム化回路15の
出力端子17に送信データが得られる。１画面で一個のパ
ラメータコードPiが伝送され、１ブロック毎にDR,MINの
データが伝送され、１画素毎にコード信号DTが伝送され
る。また、フレーム化回路15では、前述のように、ビッ
ト数を示すデータNbを用いて、量子化回路10からのコー
ド信号DTの有効なビットの選択がなされる。受信されたデータは、第４図において21で示す入力端
子に供給され、フレーム分解回路22により、パラメータ
コードPi、ダイナミックレンジDR、コード信号DT、最小
値MIN、直流成分のデータの夫々に分解される。復号化
回路23は、ADRCエンコーダの量子化回路10と逆にコード
信号DTを復元レベルに変換する。復号化回路23からの復
元レベルが加算回路25に供給され、最小値MINが復元レ
ベルに加算され、加算回路25からの復元データがブロッ
ク分解回路26に供給される。ブロック分解回路26の出力
信号として、係数テーブルの（７×８）の出力データが
得られる。このブロック分解回路26の出力信号と遅延回路28を介
された直流成分と対応するデータとが逆コサイン変換回
路27に供給され、従来と同様のコサイン変換の逆の処理
がされる。逆コサイン変換回路27の出力端子29にディジ
タルテレビジョン信号の復号出力が得られる。 b.可変長量子化とバッファリング第５図は、量子化回路10においてなされる可変長量子
化を説明するもので、T1,T2,T3,T4が夫々割り当てビッ
ト数を決定するしきい値である。これらのしきい値は、
（T4＜T3＜T2＜T1）の関係にある。ダイナミックレンジDR（＝MAX−MIN）が（DR＝T4−
１）の時には、第５図Ａに示すように、最大値MAXと最
小値MINのみが伝送され、受信側では、両者の中間レベ
ルL0が復元レベルとされる。従って、第５図Ａに示すよ
うに、ダイナミックレンジDRが（T4−１）の時には、量
子化幅がΔ０となる。ダイナミックレンジDRが（０≦DR
≦T4−１）の場合には、割り当てビット数が０ビットで
ある。第５図Ｂは、ダイナミックレンジDRが（T3−１）の場
合を示す。ダイナミックレンジDRが（T4≦DR≦T3−１）
の時には、割り当てビット数が１ビットとされる。従っ
て検出されたダイナミックレンジDRが２つのレベル範囲
に分割され、ブロックの最小値除去後の画素データPDI
が属するレベル範囲が量子化幅Δ１を用いて求められ、
レベル範囲と対応する“0"又は“1"の一方のコード信号
が割り当てられ、復元レベルがL0又はL1とされる。第５図に示される可変長符号化は、ダイナミックレン
ジが大きくなるほど、量子化幅Δｉが（Δ０＜Δ１＜Δ
２＜Δ３＜Δ４）と大きくされる非直線量子化が行われ
る。非直線量子化は、量子化歪が目立ち易いダイナミッ
クレンジが小さいブロックでは、最大歪を小さくし、逆
に、ダイナミックレンジが大きいブロックでは、最大歪
を大きくするもので、圧縮率がより高くされる。ダイナミックレンジDRが（T2−１）の場合には、第５
図Ｃに示すように、検出されたダイナミックレンジDRが
４個のレベル範囲に分割され、レベル範囲の夫々に対し
て、２ビット（00）（01）（10）（11）が割り当てら
れ、各レベル範囲の中央のレベルが復元レベルL0,L1,L
2,L3とされる。従って、量子化幅Δ２を用いてデータPD
Iの属するレベル範囲が求められる。ダイナミックレン
ジDRが（T3≦DR≦T2−１）の場合では、割り当てビット
数が２ビットとされる。また、ダイナミックレンジDRが（T1−１）の場合で
は、第５図Ｄに示すように、検出されたダイナミックレ
ンジDRが８個のレベル範囲に分割され、レベル範囲の夫
々に対して、３ビット（000）（001）・・・（111）が
割り当てられ、各レベル範囲の中央のレベルが復元レベ
ルL0,L1・・・L7とされる。従って量子化幅がΔ３とな
る。ダイナミックレンジDRが（T2≦DR≦T1−１）の場合
では、割り当てビット数が３ビットとされる。更に、ダイナミックレンジが最大の255の場合には、
第５図Ｅに示すように、検出されたダイナミックレンジ
DRが16個のレベル範囲に分割され、レベル範囲の夫々に
対して、４ビット（0000）（0001）・・・（1111）が割
り当てられ、各レベル範囲の中央のレベルが復元レベル
L0,L1・・・L15とされる。従って、量子化幅がΔ４とな
る。ダイナミックレンジDRが（T1≦DR＜256）の場合で
は、割り当てビット数が４ビットとされる。第６図は、（０〜255）の範囲のダイナミックレンジD
Rを横軸とし、発生度数を縦軸とした度数分布の一例で
ある。x₁,x₂,x₃,x₄,x₅の夫々は、前述のように、しきい
値T1〜T4によって分けられたダイナミックレンジDRの五
個の範囲に含まれるブロック数を表している。（T4−
１）以下のダイナミックレンジDRを持つブロックは、０
ビットが割り当てられるので、ブロック数x₅は、発生情
報量に寄与しない。従って、発生情報量は、 4x₁＋3x₂＋2x₃＋x₄ で求まる。この発生情報量がデータしきい値と比較さ
れ、データしきい値を超える時には、より大きいしきい
値のセットが適用され、同様にして発生情報量が算出さ
れる。上式の演算を行うには、設定されたしきい値のセ
ット毎に各範囲で度数分布の和を求め、この和に割り当
てビット数を乗じて加算する処理が必要である。しかし
ながら、しきい値のセットを変更する都度、上記の処理
を行うと、最適なしきい値のセットが求まる迄に時間が
かかる問題が生じる。この一実施例は、第６図に示す度数分布を第７図に示
す積算型の度数分布に変換し、異なるしきい値のセット
と対応する発生情報量をより高速に算出でき、従って、
最適なしきい値のセットが得られる迄の収束時間を短縮
できる。第７図から理解されるように、ダイナミックレンジDR
が最大の発生度数からスタートして、より小さいダイナ
ミックレンジDRの発生度数が順次積算されて積算型の度
数分布グラフが得られる。従って、しきい値T1迄の積算
度数がx₁となり、しきい値T2迄の積算度数が（x₁＋x₂）
となり、しきい値T3迄の積算度数が（x₁＋x₂＋x₃）とな
り、しきい値T4迄の積算度数が（x₁＋x₂＋x₃＋x₄）とな
る。しきい値T1〜T4に対する発生情報量は、４（x₁−０）
＋３〔（x₁＋x₂）−x₁〕＋２〔（x₁＋x₂＋x₃）−（x₁＋
x₂）〕＋１〔（x₁＋x₂＋x₃＋x₄）−（x₁＋x₂＋x₃）＝4x
₁＋3x₂＋2x₃＋1x₄ と求まる。第７図に示される積算型の度数分布グラフ
（積算型度数分布表）を一旦、作成すれば、しきい値の
セットを更新した時に、四個の数の和により直ちに発生
情報量を求めることができる。第８図は、バッファリング回路10の動作を示すフロー
チャートである。最初に、１画面例えば１フレームの全
てのブロックのダイナミックレンジDRが検出される（ス
テップ）。次に、１フレームのダイナミックレンジDR
の度数分布表（第６図参照）が作成される（ステップ
）。この度数分布表が積算型の度数分布表（第７図参
照）に変換される（ステップ）。積算型の度数分布表を用いて、しきい値テーブルのし
きい値のセット（複数しきい値）に対する発生情報量、
即ち、選択されたしきい値のセットを適用してADRC符号
化を行った場合のコード信号DTの全ビット数が算出され
る（ステップ）。この場合、量子化歪が最小となるし
きい値のセット（パラメータコードP0で指定されるしき
い値のセット）から発生情報量の算出がスタートされ
る。求められた発生情報量と目標値（データしきい値）と
が比較される（ステップ）。目標値は、送信データの
伝送レートの最大値であり、例えば（２ビット/1画素）
である。この比較の結果がステップで判定される。発
生情報量が目標値以下の場合には、当該しきい値のセッ
トを用いてADRCの量子化がされる（ステップ）。若
し、発生情報量が目標値を超える場合には、しきい値の
セットの更新がされ（ステップ）。次に、発生情報量
を少なくできる新たなしきい値のセットに関してステッ
プ，，の処理が繰り返される。しきい値T1〜T4のテーブルとしては、例えばパラメー
タコードP0〜P31で特定される32通りのしきい値のセッ
トを使用することができる。パラメータコードPiがP0ら
P31まで変化する時に、何れのしきい値T1〜T4も減少す
ることがないように、しきい値T1〜T4の大きさが設定さ
れている。このしきい値T1〜T4の変化の仕方や大きさ
は、画質を見ながら設定される。最初のパラメータコー
ドP0のしきい値は、可逆符号化を意図して、非常に小さ
い値となる。また、パラメータコードP31のしきい値の
場合には、全画面が１ビット割り当てとなる。このようにしきい値T1〜T4が設定されていると、ステ
ップにおいて、発生情報量を算出する場合に、パラメ
ータコードをP0からP31に向かって順次変化させた時
に、発生情報量が単調減少することになる。従って、パ
ラメータコードをP0からスタートしてP31まで変化させ
ると、ステップにおいて、最初に発生情報量が目標値
以下となるしきい値のセットが必ず得られ、このしきい
値のセットが適用されてADRCの符号化がなされる。なお、コード信号DT以外にダイナミックレンジDR、最
小値MIN、パラメータコードPi、直流成分に対応するデ
ータ及び誤り訂正コードの冗長コードが伝送されるが、
これらのデータは、固定長であるため、伝送データのレ
ートを検査する際に、目標値にオフセットを持たせるこ
とで無視することができる。 c.変形例この発明は、変換符号化で得られた一つの係数テーブ
ルを複数ブロックに再ブロック化するのに限らず、複数
の係数テーブルを集めて、再ブロック化を行うようにし
ても良い。また、変換符号化としては、コサイン変換に限らず、
アダマール変換等の直交変換を使用しても良い。〔発明の効果〕この発明では、フィードフォワード型のバッファリン
グを適用することにより、フレーム単位で一定レートの
出力データを発生させることができ、ディジタルVTRに
適用して好適である。また、この発明では、係数値デー
タを再ブロック化し、再ブロックで形成された各ブロッ
クの係数値データの特性に応じて量子化幅を設定するよ
うにしたので、量子化歪が増大することなく、バッファ
リング制御を行うことができる。DETAILED DESCRIPTION OF THE INVENTION [Industrial Application Field] The present invention relates to a cosine transform (discrete cosine transf
orm) or the like, which relates to a high-efficiency encoding device applied to transform encoding. [Prior Art] In order to suppress the redundancy of an image signal, transform coding in which a screen is divided into blocks each having a predetermined number of pixels and a linear transformation is performed for each block using a transformation axis that matches the characteristics of the original image signal. It has been known. Transform coding includes Hadamard transform,
Cosine transform and the like are known. For example, "IEEE TRANS
ACTIONS ON COMMUNICATIONS "VOL, COM-32, NO.3, MARCH, 1
984, pages 225 to 231 ", a cosine transform coding apparatus having a configuration as shown in FIG. 9 is described. In FIG. 9, a sampled discrete image signal f (j, k) is supplied to an input terminal indicated by 31, and this input signal is supplied to a cosine transform circuit 32. In the cosine conversion circuit 32, two-dimensional cosine conversion is performed. In the two-dimensional cosine transform, a process represented by the following equation is performed. However,
The original data is two-dimensional data f (j, k) (j, k = 0, 1,..., N−1) in which one block is (N × N) samples. The coefficient value F (u, v) from the cosine transform circuit 32 is supplied to the thresholding circuit 33 to reduce the amount of information. The coefficient value F _T (u,
v) is supplied to the quantization circuit 34, the output of the quantization circuit 34 is supplied to the coding circuit 35, and is converted into a code signal having a predetermined number of bits by run-length coding and Huffman coding. The code signal from the coding circuit 35 is supplied to the buffer memory 36. The buffer memory 36 is provided to convert the transmission rate of the code signal from the coding circuit 35 to a rate that does not exceed the rate of the transmission path. The data rate on the input side of the buffer memory 36 is variable, but the data rate on the output side of the buffer memory 36 is substantially constant. Output data from the buffer memory 36 is taken out to a terminal 37. As described above, the amount of information is reduced by thresholding for the coefficient value obtained from the cosine transform circuit 32, and buffering is performed. Thresholding is the execution of the following equation when the threshold value is T,
F (0,0) indicating the DC component of the block is excluded from the thresholding target. [Problems to be Solved by the Invention] The feedback type buffering as described above,
When the buffer memory is about to overflow, the rate of input data to the buffer memory is reduced, and conversely,
When the buffer memory is about to underflow, control is performed so as to increase the rate of input data to the buffer memory. If you raise the sensitivity to the amount of return too much,
Oscillation occurs around the target value, and conversely if the sensitivity is excessively lowered, a problem occurs that it takes time to converge. When it takes time to converge, it is necessary to increase the capacity of the buffer memory 36. As described above, the conventional buffering processing has a drawback that requires considerable know-how in practical use. Further, it is difficult for the conventional feedback type buffering device to make the output data rate completely constant. Considering the application of conversion coding to a digital VTR, data of one frame or one field must be accurately recorded on one track in order to perform variable speed reproduction, for example, still image reproduction. Therefore, it is difficult to apply the conventional transform coding apparatus to a digital VTR as it is. Accordingly, it is an object of the present invention to provide a high-efficiency coding apparatus capable of maintaining a constant data rate in frame units by feed-forward type buffering and efficiently compressing data. is there. [Means for Solving the Problems] The present invention provides a conversion circuit that divides a screen into blocks each including a predetermined number of pixels, performs orthogonal transform on pixel data for each block, and obtains coefficient value data. A block-forming circuit for re-blocking the coefficient value data for each block of the number, and obtaining the amount of information generated during a predetermined period when the quantization width of each block formed by re-blocking is changed. A buffering circuit that determines the quantization width of each block so as not to exceed the target value and minimize the quantization distortion, and coefficient value data other than the DC component of each block formed by reblocking. And a quantization circuit that quantizes the number of bits with a smaller number of bits than the original number of bits and with the quantization width determined by the buffering circuit. The output of the quantization circuit and the buffering circuit Is a high-efficiency encoding apparatus characterized by and information indicating the determined quantization width was set to be transmitted. [Operation] The coefficient value data from the transform encoding is obtained by compressing the original image data. The encoding adapted to the dynamic range can perform feedforward type buffering, and can output data at a constant rate in a frame unit, for example. Therefore, the present invention is suitable for use in a digital VTR. Embodiment A digital VTR to which the present invention is applied will be described in detail with reference to the drawings. This description is made in accordance with the following items. a. Configuration of transmission side and reception side b. Variable length quantization and buffering c. Modification In the case of digital VTR, the transmission side corresponds to the recording side, and the transmission side corresponds to the reproduction side. a. Configuration of the transmitting side and the receiving side In FIG. 1, a sampled discrete image signal is supplied to an input terminal indicated by 1, and an input digital signal is supplied to a cosine transform circuit 2. In the cosine transform circuit 2, two-dimensional cosine transform is performed by the same processing as in the related art. From the cosine transform circuit 2, a (8 × 8) coefficient table corresponding to the block size (for example, 8 × 8) is obtained. In this coefficient table, the DC (average value) component has 9 bits, and the coefficients of the remaining AC components have 8 bits. The coefficient table from the cosine transform circuit 2 is supplied to the blocking circuit 3. Re-blocking is performed by the blocking circuit 3, and the coefficient table is converted into a continuous signal for each block that is a unit of ADRC encoding. In the blocking circuit 3, for example, an (8 × 8) coefficient table is divided into four equal parts as shown in FIG. In this case, the DC component is transmitted as it is without ADRC encoding. Also, as shown in FIG. 3A, when the coefficient table is converted into a data sequence corresponding to the zigzag scanning, the third
Reblocking may be performed on an area as shown in FIG. The output signal of the blocking circuit 3 is supplied to a maximum value detection circuit 4 for detecting the maximum value MAX for each block, a minimum value detection circuit 5 for detecting the minimum value MIN for each block, and a delay circuit 6. The detected maximum value MAX and minimum value MIN are supplied to the subtraction circuit 7, and the dynamic range DR represented by (MAX−MIN = DR) is obtained from the subtraction circuit 7. The delay circuit 6 delays data for a time required to detect the maximum value MAX and the minimum value MIN. The minimum value MIN is subtracted from the video data from the delay circuit 6 in the subtraction circuit 8, and the data PDI from which the minimum value has been removed is obtained from the subtraction circuit 8. The data PDI from which the minimum value has been removed is supplied to the quantization circuit 10 via the delay circuit 9, and the quantization width Δi
Is supplied. The quantization circuit 10 calculates the quantization width Δi
Is used to perform variable-length ADRC encoding for quantizing the data PDI. That is, in the quantization circuit 10, the pixel data PDI from which the minimum value MIN shared by the pixel data in the block has been removed is divided by the quantization width Δi to obtain the dynamic range of the block.
A variable number of bits (0, 1, 2, 3, or 4 bits) are quantized according to the DR. The video signal in the block has a dynamic range DR smaller than that of the original data due to the transform coding, and has a bit number of 0 bits, 1 bit, 2 bits, 3 bits or 4 bits less than 8 bits. Even if quantization is performed, quantization distortion is inconspicuous. The quantization circuit 10 is configured by, for example, a ROM. A 4-bit code signal, which is the maximum number of bits, is generated from the quantization circuit 10, and valid bits are selected from the output signal of the quantization circuit 10 in the framing circuit 15 at the next stage. Therefore, in the ROM 12, together with the quantization width Δi, data Nb indicating the number of bits of the block is formed, and the data Nb is supplied to the framing circuit 15. With a digital VTR, the transmission rate of the recorded data is constant, so if the amount of transmitted data is not limited, some data cannot be recorded or the compression rate will be higher than necessary and the quality of the reproduced image will deteriorate. Or Therefore, a buffering circuit 11 is provided, and the frequency distribution of the dynamic range DR of all the blocks of one screen to be ADRC-coded is examined, and optimal variable-length coding is performed. The buffering circuit 11 has a dynamic range from the arithmetic circuit 7.
DR is supplied. In the buffering circuit 11, threshold values T1, T2, T3, and T4 for obtaining a constant transmission data rate are obtained, and the threshold value and the corresponding parameter code Pi are determined.
Is output. The quantization width Δi determined by the parameter code Pi and the dynamic range DR of the block is read from the ROM 12. The delay circuits 13 and 14 delay the dynamic range DR and the minimum value MIN until the optimum threshold value is obtained by the buffering circuit 11 and variable length quantization is performed. Parameter code Pi from buffering circuit 11 and delay circuit
The dynamic range DR and the minimum value MIN from 13 and 14, the code signal DT from the quantization circuit 10 and the 9-bit data of the DC component passed through the delay circuit 16 are supplied to the framing circuit 15. The framing circuit 15 performs encoding for error correction and adds a synchronization signal. Transmission data is obtained at the output terminal 17 of the framing circuit 15. One parameter code Pi is transmitted on one screen, DR and MIN data are transmitted for each block, and a code signal DT is transmitted for each pixel. Further, as described above, the framing circuit 15 selects valid bits of the code signal DT from the quantization circuit 10 using the data Nb indicating the number of bits. The received data is supplied to an input terminal indicated by 21 in FIG. 4, and is decomposed into a parameter code Pi, a dynamic range DR, a code signal DT, a minimum value MIN, and data of a DC component by a frame decomposition circuit 22. You. The decoding circuit 23 converts the code signal DT into a restoration level, as opposed to the quantization circuit 10 of the ADRC encoder. The restoration level from the decoding circuit 23 is supplied to the addition circuit 25, the minimum value MIN is added to the restoration level, and the restoration data from the addition circuit 25 is supplied to the block decomposition circuit 26. As the output signal of the block decomposition circuit 26, (7 × 8) output data of the coefficient table is obtained. The output signal of the block decomposition circuit 26 and the data corresponding to the DC component passed through the delay circuit 28 are supplied to the inverse cosine transform circuit 27, and the inverse process of the cosine transform as in the conventional case is performed. A decoded output of the digital television signal is obtained at an output terminal 29 of the inverse cosine transform circuit 27. b. Variable-Length Quantization and Buffering FIG. 5 illustrates variable-length quantization performed in the quantization circuit 10. T1, T2, T3, and T4 are threshold values that determine the number of allocated bits. is there. These thresholds are
(T4 <T3 <T2 <T1). When the dynamic range DR (= MAX−MIN) is (DR = T4−
In the case of 1), as shown in FIG. 5A, only the maximum value MAX and the minimum value MIN are transmitted, and the intermediate level L0 between them is set as the restoration level on the receiving side. Therefore, as shown in FIG. 5A, when the dynamic range DR is (T4-1), the quantization width is Δ0. When the dynamic range DR is (0 ≦ DR
In the case of ≤T4-1), the number of allocated bits is 0. FIG. 5B shows a case where the dynamic range DR is (T3-1). Dynamic range DR is (T4 ≦ DR ≦ T3-1)
In the case of, the number of allocated bits is one bit. Therefore, the detected dynamic range DR is divided into two level ranges, and the pixel data PDI after removing the minimum value of the block.
Is determined using the quantization width Δ1,
One of the code signals “0” or “1” corresponding to the level range is assigned, and the restoration level is set to L0 or L1. In the variable length coding shown in FIG. 5, as the dynamic range becomes larger, the quantization width Δi becomes (Δ0 <Δ1 <Δ
Non-linear quantization that is increased to 2 <Δ3 <Δ4) is performed. Non-linear quantization reduces the maximum distortion in a block having a small dynamic range in which quantization distortion is conspicuous, and increases the maximum distortion in a block having a large dynamic range. On the other hand, the compression ratio is increased. When the dynamic range DR is (T2-1), the fifth
As shown in FIG. C, the detected dynamic range DR is divided into four level ranges, and two bits (00) (01) (10) (11) are assigned to each of the level ranges. The level in the middle of the level range is the restoration level L0, L1, L
2, L3. Therefore, the data PD is calculated using the quantization width Δ2.
The level range to which I belongs is required. When the dynamic range DR is (T3 ≦ DR ≦ T2-1), the number of allocated bits is 2 bits. In the case where the dynamic range DR is (T1-1), as shown in FIG. 5D, the detected dynamic range DR is divided into eight level ranges, and each of the level ranges has 3 bits. (000), (001),... (111) are assigned, and the center level of each level range is set as a restoration level L0, L1,. Therefore, the quantization width becomes Δ3. When the dynamic range DR is (T2 ≦ DR ≦ T1-1), the number of allocated bits is 3 bits. Furthermore, if the dynamic range is the maximum of 255,
As shown in FIG. 5E, the detected dynamic range
The DR is divided into 16 level ranges, and 4 bits (0000) (0001)... (1111) are assigned to each of the level ranges, and the center level of each level range is the restoration level.
L0, L1,..., L15. Therefore, the quantization width is Δ4. When the dynamic range DR is (T1 ≦ DR <256), the number of allocated bits is 4 bits. FIG. 6 shows a dynamic range D in the range of (0 to 255).
It is an example of a frequency distribution in which R is the horizontal axis and the frequency of occurrence is the vertical axis. _{_{_{x 1, x 2, x 3}}} , x 4, each of the x _5, as described above, represents the number of blocks included five pieces of the dynamic range DR separated by threshold T1~T4 . (T4−
1) A block having the following dynamic range DR is 0
Since bits are allocated, the number of blocks x ₅ does not contribute to the amount of information generated. Thus, generation amount of information, obtained in _{_{_{4x 1 + 3x 2 + 2x 3}}} + x 4. This amount of generated information is compared with the data threshold, and if it exceeds the data threshold, a larger set of thresholds is applied, and the amount of generated information is calculated in a similar manner. In order to perform the calculation of the above equation, it is necessary to perform a process of obtaining the sum of the frequency distributions in each range for each set of the set thresholds, multiplying the sum by the number of allocated bits, and adding the sum. However, if the above processing is performed every time the set of thresholds is changed, it takes a long time until an optimal set of thresholds is determined. In this embodiment, the frequency distribution shown in FIG. 6 can be converted to the integrated frequency distribution shown in FIG. 7, and the set of different thresholds and the amount of generated information corresponding thereto can be calculated more quickly.
The convergence time until an optimal set of thresholds is obtained can be reduced. As can be understood from FIG. 7, the dynamic range DR
Starts from the maximum occurrence frequency, the occurrence frequencies of the smaller dynamic range DR are sequentially integrated, and an integrated frequency distribution graph is obtained. Therefore, the integration degree is x ₁ next up threshold T1, the accumulated power up threshold T2 (x ₁ + x ₂₎
Next, the accumulated power up threshold T3 is _{_{(x 1 + x 2 + x}} 3) , and the the accumulated power up threshold _{_{T4 (x 1 + x 2 +}} x 3 + x 4). Generated information quantity for threshold T1~T4 is, 4 (x ₁ -0)
+3 _{_{[(x 1 + x 2) -x}} 1 ] + 2 _{_{[(x 1 + x 2 + x}} 3) - (x 1 +
x _2)] + 1 _{_{[(x 1 + x 2 + x}} 3 + x 4) - (x 1 + x 2 + x 3) = 4x
Obtained a _{_{_{1 + 3x 2 + 2x 3 +}}} 1x 4. Once the cumulative frequency distribution graph (cumulative frequency distribution table) shown in FIG. 7 is created, when the set of thresholds is updated, the amount of generated information can be immediately obtained by summing the four numbers. Can be. FIG. 8 is a flowchart showing the operation of the buffering circuit 10. First, the dynamic ranges DR of all the blocks in one screen, for example, one frame are detected (step). Next, the dynamic range DR of one frame
Is created (see FIG. 6) (step). This frequency distribution table is converted into an integration type frequency distribution table (see FIG. 7) (step). The amount of generated information for a set of thresholds (multiple thresholds) in the threshold table,
That is, the total number of bits of the code signal DT when ADRC encoding is performed by applying the selected set of thresholds is calculated (step). In this case, the calculation of the amount of generated information is started from a set of threshold values that minimize the quantization distortion (a set of threshold values specified by the parameter code P0). The obtained amount of generated information is compared with a target value (data threshold) (step). The target value is the maximum value of the transmission rate of transmission data, for example, (2 bits / 1 pixel)
It is. The result of this comparison is determined in steps. When the amount of generated information is equal to or smaller than the target value, the quantization of ADRC is performed using the set of thresholds (step). If the amount of generated information exceeds the target value, the set of thresholds is updated (step). Next, the processes of steps 1 and 2 are repeated for a new set of threshold values that can reduce the amount of generated information. As a table of threshold values T1 to T4, for example, a set of 32 threshold values specified by parameter codes P0 to P31 can be used. Parameter code Pi is P0
The magnitudes of the thresholds T1 to T4 are set so that none of the thresholds T1 to T4 will decrease when changing to P31. The manner and magnitude of the change of the thresholds T1 to T4 are set while observing the image quality. The threshold value of the first parameter code P0 is a very small value for lossless encoding. In the case of the threshold value of the parameter code P31, one bit is allocated to the entire screen. When the thresholds T1 to T4 are set as described above, when calculating the generated information amount in the step, when the parameter code is sequentially changed from P0 to P31, the generated information amount monotonously decreases. become. Therefore, when the parameter code is changed from P0 to P31, a set of thresholds at which the amount of generated information is first equal to or less than the target value is always obtained in the step, and this set of thresholds is applied. ADRC encoding is performed. In addition to the code signal DT, a dynamic range DR, a minimum value MIN, a parameter code Pi, data corresponding to a DC component, and a redundant code of an error correction code are transmitted.
Since these data have a fixed length, they can be ignored by giving an offset to the target value when checking the rate of transmission data. c. Modifications The present invention is not limited to reblocking one coefficient table obtained by transform coding into a plurality of blocks, but may collect a plurality of coefficient tables and perform reblocking. . Also, the transform coding is not limited to cosine transform,
Orthogonal transform such as Hadamard transform may be used. [Effects of the Invention] In the present invention, by applying feedforward type buffering, output data at a constant rate can be generated in frame units, which is suitable for application to a digital VTR. Further, in the present invention, the coefficient value data is re-blocked, and the quantization width is set according to the characteristic of the coefficient value data of each block formed by the re-block, so that the quantization distortion does not increase. , Buffering control can be performed.

【図面の簡単な説明】第１図はこの発明の一実施例の送信側の構成を示すブロ
ック図、第２図及び第３図はブロックの説明のための略
線図、第４図は受信側のブロック図、第５図は可変長量
子化の説明のための略線図、第６図及び第７図は度数分
布表の説明のためのブロック図、第８図はバッファリン
グの説明のためのフローチャート、第９図は従来のバッ
ファリングの説明のためのブロック図である。図面における主要な符号の説明 1:ディジタルビデオ信号の入力端子、 2:コサイン変換回路、3:ブロック化回路、 4:最大値検出回路、5:最小値検出回路、 7,8:減算回路、10:量子化回路、 11:バッファリング回路。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram showing a configuration of a transmitting side according to an embodiment of the present invention, FIGS. 2 and 3 are schematic diagrams for explaining blocks, and FIG. 5 is a schematic diagram for explaining variable-length quantization, FIGS. 6 and 7 are block diagrams for explaining a frequency distribution table, and FIG. 8 is a diagram for explaining buffering. FIG. 9 is a block diagram for explaining conventional buffering. Explanation of main symbols in the drawings 1: digital video signal input terminal, 2: cosine conversion circuit, 3: blocking circuit, 4: maximum value detection circuit, 5: minimum value detection circuit, 7, 8: subtraction circuit, 10 : Quantization circuit, 11: Buffering circuit.

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁶，ＤＢ名) H04N 7/24 - 7/68────────────────────────────────────────────────── ─── Continued on the front page (58) Field surveyed (Int.Cl. ⁶ , DB name) H04N ^7/ 24-7/68

Claims

(57) [Claims] A screen is divided into blocks each having a predetermined number of pixels, and a conversion circuit for orthogonally transforming pixel data for each block to obtain coefficient value data, and reblocking the coefficient value data for each arbitrary number of blocks A block generating circuit that calculates the amount of information generated during a predetermined period when the quantization width of each block formed by re-blocking is changed, and calculates the amount of generated information within a range that does not exceed a target value. A buffering circuit that determines the quantization width of each block so that distortion is minimized, and the coefficient value data other than the DC component of each block formed by reblocking is converted into a bit number smaller than the original bit number. And a quantization circuit that quantizes with the quantization width determined by the buffering circuit. The output of the quantization circuit and the quantization determined by the buffering circuit A high-efficiency coding apparatus characterized by transmitting information indicating an encoding width.