JP2006086707A

JP2006086707A - Image processing method and image processing apparatus, program and program recording medium, and data structure and data recording medium

Info

Publication number: JP2006086707A
Application number: JP2004268305A
Authority: JP
Inventors: Mitsuharu Oki; 光晴大木
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2004-09-15
Filing date: 2004-09-15
Publication date: 2006-03-30

Abstract

PROBLEM TO BE SOLVED: To compress high-frame-rate moving picture data at a high compression rate and to decompress the compressed moving picture data. SOLUTION: For a target block, each of a transmitter and a receiver detects a highly correlated positional relation between a past reference image V_P, the frame immediately preceding the target block, and a future reference image V_T, the frame immediately following the target block, and obtains an estimate of the target block from the image data in a region R_8201 of the past reference image and the image data in a region R_8202 of the future reference image that stand in that positional relation. The transmitter compresses the target block by using the estimate, and the receiver decompresses the target block by using the estimate. The present invention can be applied where moving pictures are compressed and decompressed. COPYRIGHT: (C)2006,JPO&NCIPI

Description

The present invention relates to an image processing apparatus and an image processing method, a program and a program recording medium, and a data structure and a data recording medium, and in particular to ones that make it possible, for example, to highly compress and to restore high-frame-rate moving image data.

Conventionally, moving image data is recorded and played back at a frame rate of about 30 to 60 frames per second (30 to 60 fps). At such frame rates, however, human vision perceives a moving subject as a blurred image, so the image quality cannot be called good for human viewers.

It is known, on the other hand, that recording and playing back moving image data at a frame rate of about 240 fps yields image quality that is good for human viewers.

Moving image data with a frame rate of 240 fps can be displayed on a display device that supports the high frame rate of 240 fps. It can also be displayed on a display device with a low frame rate such as 30 fps or 60 fps by, for example, thinning out frames to lower the frame rate.

However, when frames of 240 fps moving image data are simply thinned out, the smoothness of motion in the image is lost.

Patent Document 1 therefore proposes a method in which, instead of simply thinning out the high-frame-rate moving image data, the average of a plurality of its frames is used as the low-frame-rate moving image data. Patent Document 1 also proposes a method of hierarchically encoding high-frame-rate moving image data into a plurality of layers corresponding to a plurality of frame rates.
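
The difference between simple thinning and the averaging attributed to Patent Document 1 can be sketched as follows. This is a minimal illustration, not the method of either document: frames are modeled as flat lists of pixel values, and the function names `thin` and `average_groups` and the sample clip are assumptions introduced here.

```python
def thin(frames, factor):
    """Simple thinning: keep every `factor`-th frame and drop the rest."""
    return frames[::factor]


def average_groups(frames, factor):
    """Averaging: replace each run of `factor` frames with its per-pixel mean."""
    averaged = []
    for i in range(0, len(frames) - factor + 1, factor):
        group = frames[i:i + factor]
        averaged.append([sum(pixels) / factor for pixels in zip(*group)])
    return averaged


# Hypothetical 240 fps clip: 8 frames of 2 pixels each.
frames_240 = [[float(t), float(t) + 10.0] for t in range(8)]
frames_60_thinned = thin(frames_240, 4)             # frames 0 and 4 only
frames_60_averaged = average_groups(frames_240, 4)  # means of frames 0-3 and 4-7
```

Thinning discards three of every four frames, whereas averaging lets every input frame contribute to the low-frame-rate output.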

Here, when hierarchical encoding into two layers, a 120 fps layer and a 60 fps layer, is performed, for example, the average (D₁+D₂)/2 of the data D₁ of a certain frame of the 120 fps moving image data and the data D₂ of the next frame is used as the moving image data of the 60 fps layer, and only one of D₁ and D₂, for example D₁, is used as the moving image data of the 120 fps layer. In this case, D₂ can be obtained from the 60 fps layer data (D₁+D₂)/2 and the 120 fps layer data D₁.
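
The recovery of D₂ follows from rearranging the average: D₂ = 2·(D₁+D₂)/2 − D₁. A minimal sketch of this two-layer scheme, assuming frames are flat lists of pixel values and using hypothetical function names:

```python
def encode_two_layers(d1, d2):
    """Return (60 fps layer, 120 fps layer) for one pair of 120 fps frames."""
    base = [(a + b) / 2 for a, b in zip(d1, d2)]  # (D1 + D2) / 2
    enhancement = list(d1)                        # D1 only
    return base, enhancement


def recover_d2(base, enhancement):
    """Recover D2 as 2 * (D1 + D2) / 2 - D1, pixel by pixel."""
    return [2 * m - a for m, a in zip(base, enhancement)]


# Hypothetical pixel data for two consecutive 120 fps frames.
d1 = [10, 20, 30]
d2 = [14, 18, 40]
base, enhancement = encode_two_layers(d1, d2)
d2_recovered = recover_d2(base, enhancement)
```

A 60 fps display needs only `base`, while a 120 fps display uses both layers to obtain D₁ and D₂.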

As described above, high-frame-rate moving image data such as 240 fps data provides good image quality, but the data amount of 240 fps moving image data is, simply calculated, 8 times or 4 times that of conventional 30 fps or 60 fps moving image data. This data amount does not change even when the hierarchical encoding described above is performed. Such an enormous data amount increases the load of processing the moving image data and is undesirable for transmitting or recording it.
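
The factors of 8 and 4 are simply frame-rate ratios, and two-layer hierarchical encoding keeps the frame count unchanged because each input pair yields one average frame plus one original frame. A small sketch of that arithmetic, assuming every frame carries the same amount of data (the function names are illustrative):

```python
def data_volume_ratio(high_fps, low_fps):
    """Ratio of data amounts, assuming equal data per frame."""
    return high_fps / low_fps


def frames_after_two_layer_coding(num_input_frames):
    """Each pair (D1, D2) becomes an average frame plus D1: still two frames."""
    pairs = num_input_frames // 2
    return pairs * 2


ratio_vs_30 = data_volume_ratio(240, 30)
ratio_vs_60 = data_volume_ratio(240, 60)
frames_per_second_coded = frames_after_two_layer_coding(240)
```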

Patent Document 1: JP 2004-088244 A.

The present invention has been made in view of such circumstances and makes it possible, for example, to highly compress and to restore high-frame-rate moving image data.

A first image processing apparatus of the present invention comprises: separating means for separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed; detecting means for detecting a positional relationship with high correlation in a plurality of frames of the second moving image data; estimating means for obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected by the detecting means, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and compressing means for compressing the third moving image data block by block using the estimated value of the block.
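
The four means can be illustrated with a toy encoder. This is only a sketch under several assumptions that the text does not fix: frames are 1-D lists of pixels, correlation is measured by the sum of absolute differences (SAD), the search tries displacements placed symmetrically around the block position in the preceding and following frames, the estimate is the mean of the two matched regions, and "compression" is reduced to taking the residual. All function names are hypothetical.

```python
def separate(first, step):
    """Split frames into a low-rate stream (every `step`-th frame) and the rest."""
    second = {i: f for i, f in enumerate(first) if i % step == 0}
    third = {i: f for i, f in enumerate(first) if i % step != 0}
    return second, third


def sad(a, b):
    """Sum of absolute differences: lower means higher correlation."""
    return sum(abs(x - y) for x, y in zip(a, b))


def detect(past, future, start, size, search):
    """Find d so that past[start-d:] and future[start+d:] match best (no target used)."""
    best_d, best_cost = 0, None
    for d in range(-search, search + 1):
        p, f = start - d, start + d
        if min(p, f) < 0 or max(p, f) + size > len(past):
            continue  # region would fall outside the frame
        cost = sad(past[p:p + size], future[f:f + size])
        if best_cost is None or cost < best_cost:
            best_d, best_cost = d, cost
    return best_d


def estimate(past, future, start, size, d):
    """Estimate the block as the mean of the two matched regions."""
    p, f = start - d, start + d
    return [(a + b) / 2 for a, b in zip(past[p:p + size], future[f:f + size])]


def compress_block(past, future, target, start, size, search):
    """Return the residual between the target block and its estimate."""
    d = detect(past, future, start, size, search)
    est = estimate(past, future, start, size, d)
    return [t - e for t, e in zip(target[start:start + size], est)]


# Hypothetical clip: an object two pixels wide moving right by one pixel per frame.
first = [
    [0, 0, 9, 9, 0, 0, 0, 0],  # frame 0 (second moving image data)
    [0, 0, 0, 9, 9, 0, 0, 0],  # frame 1 (third moving image data)
    [0, 0, 0, 0, 9, 9, 0, 0],  # frame 2 (second moving image data)
]
second, third = separate(first, 2)
displacement = detect(second[0], second[2], start=2, size=4, search=1)
residual = compress_block(second[0], second[2], third[1], start=2, size=4, search=1)
```

Because detection uses only frames of the second moving image data, a decoder holding the same frames can repeat it, and in this sketch only the residual would need to be stored for the block.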

A first image processing method of the present invention comprises: a separation step of separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed; a detection step of detecting a positional relationship with high correlation in a plurality of frames of the second moving image data; an estimation step of obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and a compression step of compressing the third moving image data block by block using the estimated value of the block.

A first program of the present invention comprises: a separation step of separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed; a detection step of detecting a positional relationship with high correlation in a plurality of frames of the second moving image data; an estimation step of obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and a compression step of compressing the third moving image data block by block using the estimated value of the block.

A program recorded on a first program recording medium of the present invention comprises: a separation step of separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed; a detection step of detecting a positional relationship with high correlation in a plurality of frames of the second moving image data; an estimation step of obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and a compression step of compressing the third moving image data block by block using the estimated value of the block.

A first data structure of the present invention comprises: compressed data obtained by separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting a positional relationship with high correlation in a plurality of frames of the second moving image data, obtaining, from the image data of the plurality of frames of the second moving image data that are in the highly correlated positional relationship, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, and compressing the block using the estimated value of the block; and the second moving image data.

Data recorded on a first data recording medium of the present invention comprises: compressed data obtained by separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting a positional relationship with high correlation in a plurality of frames of the second moving image data, obtaining, from the image data of the plurality of frames of the second moving image data that are in the highly correlated positional relationship, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, and compressing the block using the estimated value of the block; and the second moving image data.

A second image processing apparatus of the present invention comprises: first restoring means for restoring compressed data to third moving image data using second moving image data, the compressed data having been obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and the third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting a positional relationship with high correlation in a plurality of frames of the second moving image data, obtaining, from the image data of the plurality of frames of the second moving image data that are in the highly correlated positional relationship, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, and compressing the block using the estimated value of the block; and second restoring means for combining the second and third moving image data to restore the first moving image data. The first restoring means includes: detecting means for detecting a positional relationship with high correlation in a plurality of frames of the second moving image data; estimating means for obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected by the detecting means, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and block restoring means for restoring the third moving image data block by block using the estimated value of the block and the compressed data.
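
The restoring side can be sketched in the same toy setting. The decoder repeats the detection on the second moving image data, forms the same estimate, and adds the stored residual; the restored frames are then interleaved with the second moving image data. As assumptions made only for this sketch, frames are 1-D pixel lists, correlation is SAD, displacements are searched symmetrically, and the compressed data is a plain residual. The function names are hypothetical.

```python
def sad(a, b):
    """Sum of absolute differences: lower means higher correlation."""
    return sum(abs(x - y) for x, y in zip(a, b))


def detect(past, future, start, size, search):
    """Same search as on the compressing side; uses only the two reference frames."""
    best_d, best_cost = 0, None
    for d in range(-search, search + 1):
        p, f = start - d, start + d
        if min(p, f) < 0 or max(p, f) + size > len(past):
            continue  # region would fall outside the frame
        cost = sad(past[p:p + size], future[f:f + size])
        if best_cost is None or cost < best_cost:
            best_d, best_cost = d, cost
    return best_d


def restore_block(past, future, residual, start, search):
    """Re-derive the estimate from the reference frames and add the residual."""
    size = len(residual)
    d = detect(past, future, start, size, search)
    p, f = start - d, start + d
    est = [(a + b) / 2 for a, b in zip(past[p:p + size], future[f:f + size])]
    return [e + r for e, r in zip(est, residual)]


def merge_streams(second, third):
    """Interleave the two streams (dicts keyed by frame index) in frame order."""
    merged = {**second, **third}
    return [merged[i] for i in sorted(merged)]


past = [0, 0, 9, 9, 0, 0, 0, 0]    # preceding frame of the second moving image data
future = [0, 0, 0, 0, 9, 9, 0, 0]  # following frame of the second moving image data
exact = restore_block(past, future, [0, 0, 0, 0], start=2, search=1)
corrected = restore_block(past, future, [1, -1, 0, 2], start=2, search=1)
order = merge_streams({0: "frame0", 2: "frame2"}, {1: "frame1"})
```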

A second image processing method of the present invention comprises: a first restoration step of restoring compressed data to third moving image data using second moving image data, the compressed data having been obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and the third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting a positional relationship with high correlation in a plurality of frames of the second moving image data, obtaining, from the image data of the plurality of frames of the second moving image data that are in the highly correlated positional relationship, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, and compressing the block using the estimated value of the block; and a second restoration step of combining the second and third moving image data to restore the first moving image data. The first restoration step includes: a detection step of detecting a positional relationship with high correlation in a plurality of frames of the second moving image data; an estimation step of obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and a block restoration step of restoring the third moving image data block by block using the estimated value of the block and the compressed data.

A second program of the present invention comprises: a first restoration step of restoring compressed data to third moving image data using second moving image data, the compressed data having been obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and the third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting a positional relationship with high correlation in a plurality of frames of the second moving image data, obtaining, from the image data of the plurality of frames of the second moving image data that are in the highly correlated positional relationship, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, and compressing the block using the estimated value of the block; and a second restoration step of combining the second and third moving image data to restore the first moving image data. The first restoration step includes: a detection step of detecting a positional relationship with high correlation in a plurality of frames of the second moving image data; an estimation step of obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and a block restoration step of restoring the third moving image data block by block using the estimated value of the block and the compressed data.

A program recorded on a second program recording medium of the present invention comprises: a first restoration step of restoring compressed data to third moving image data using second moving image data, the compressed data having been obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and the third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting a positional relationship with high correlation in a plurality of frames of the second moving image data, obtaining, from the image data of the plurality of frames of the second moving image data that are in the highly correlated positional relationship, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, and compressing the block using the estimated value of the block; and a second restoration step of combining the second and third moving image data to restore the first moving image data. The first restoration step includes: a detection step of detecting a positional relationship with high correlation in a plurality of frames of the second moving image data; an estimation step of obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and a block restoration step of restoring the third moving image data block by block using the estimated value of the block and the compressed data.

A third image processing apparatus of the present invention comprises: separating means for separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed; detecting means for detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship with high correlation with the block in a plurality of frames of the second moving image data; estimating means for obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and compressing means for compressing the third moving image data block by block using the estimated value of the block.
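
In contrast to the first apparatus, detection here uses the block itself and yields a single motion vector from which the positions in both reference frames follow. A minimal sketch, assuming 1-D frames, SAD as the correlation measure, regions placed at `start - mv` in the preceding frame and `start + mv` in the following frame, and a residual as the compressed form; these choices and all names are illustrative assumptions, not details given in the text.

```python
def sad(a, b):
    """Sum of absolute differences: lower means higher correlation."""
    return sum(abs(x - y) for x, y in zip(a, b))


def detect_motion_vector(past, future, block, start, search):
    """Pick the one vector whose two derived regions best match the block."""
    size = len(block)
    best_mv, best_cost = 0, None
    for mv in range(-search, search + 1):
        p, f = start - mv, start + mv
        if min(p, f) < 0 or max(p, f) + size > len(past):
            continue  # region would fall outside the frame
        cost = sad(block, past[p:p + size]) + sad(block, future[f:f + size])
        if best_cost is None or cost < best_cost:
            best_mv, best_cost = mv, cost
    return best_mv


def compress_block(past, future, target, start, size, search):
    """Return the single motion vector and the residual for one block."""
    block = target[start:start + size]
    mv = detect_motion_vector(past, future, block, start, search)
    p, f = start - mv, start + mv
    est = [(a + b) / 2 for a, b in zip(past[p:p + size], future[f:f + size])]
    return {"mv": mv, "residual": [t - e for t, e in zip(block, est)]}


past = [0, 0, 9, 9, 0, 0, 0, 0]    # preceding frame of the second moving image data
target = [0, 0, 0, 9, 9, 0, 0, 0]  # frame of the third moving image data
future = [0, 0, 0, 0, 9, 9, 0, 0]  # following frame of the second moving image data
coded = compress_block(past, future, target, start=2, size=4, search=1)
```

One vector per block covers both reference frames, so the side information is smaller than it would be with a separate vector for each reference frame.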

A third image processing method of the present invention comprises: a separation step of separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed; a detection step of detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship with high correlation with the block in a plurality of frames of the second moving image data; an estimation step of obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and a compression step of compressing the third moving image data block by block using the estimated value of the block.

A third program of the present invention comprises: a separation step of separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed; a detection step of detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship with high correlation with the block in a plurality of frames of the second moving image data; an estimation step of obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and a compression step of compressing the third moving image data block by block using the estimated value of the block.

A program recorded on a third program recording medium of the present invention comprises: a separation step of separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed; a detection step of detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship with high correlation with the block in a plurality of frames of the second moving image data; an estimation step of obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and a compression step of compressing the third moving image data block by block using the estimated value of the block.

A second data structure of the present invention comprises: compressed data obtained by separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship with high correlation with the block in a plurality of frames of the second moving image data, obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector, and compressing the block using the estimated value of the block; and the second moving image data. The compressed data includes the one motion vector.
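
One possible in-memory layout for such a data structure is sketched below. The text only requires that the structure hold the second moving image data and compressed data that includes the one motion vector; the class names, the `frame_index` and `block_start` fields, the 1-D integer motion vector, and the residual representation are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CompressedBlock:
    frame_index: int       # which frame of the third moving image data (assumed field)
    block_start: int       # block position within that frame (assumed field)
    motion_vector: int     # the one motion vector carried in the compressed data
    residual: List[float]  # block minus its estimate (assumed compressed form)


@dataclass
class EncodedStream:
    second_frames: List[List[float]]  # the second moving image data, kept as is
    compressed_blocks: List[CompressedBlock] = field(default_factory=list)


# Hypothetical content: two low-rate frames and one coded block between them.
stream = EncodedStream(second_frames=[[0, 0, 9, 9], [0, 0, 0, 0]])
stream.compressed_blocks.append(
    CompressedBlock(frame_index=1, block_start=2, motion_vector=1, residual=[0.0] * 4)
)
```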

Data recorded on a second data recording medium of the present invention comprises: compressed data obtained by separating first moving image data into second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship with high correlation with the block in a plurality of frames of the second moving image data, obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector, and compressing the block using the estimated value of the block; and the second moving image data. The compressed data includes the one motion vector.

A fourth image processing apparatus of the present invention comprises: first restoring means for restoring compressed data to third moving image data using second moving image data, the compressed data having been obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and the third moving image data, which is what remains of the first moving image data after the second moving image data is removed, detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship with high correlation with the block in a plurality of frames of the second moving image data, obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector, and compressing the block using the estimated value of the block; and second restoring means for combining the second and third moving image data to restore the first moving image data. The first restoring means includes: detecting means for obtaining, from the one motion vector, the positional relationship with high correlation with the block in the plurality of frames of the second moving image data; estimating means for obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and block restoring means for restoring the block using the estimated value of the block and the compressed data.
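
On this restoring side no search is needed, because the positional relationship is derived directly from the motion vector carried in the compressed data. A minimal sketch under the same illustrative assumptions as before (1-D frames, regions at `start - mv` and `start + mv`, mean of the two regions as the estimate, residual as the compressed form; names are hypothetical):

```python
def estimate(past, future, start, size, mv):
    """Derive both region positions from the one motion vector and average them."""
    p, f = start - mv, start + mv
    return [(a + b) / 2 for a, b in zip(past[p:p + size], future[f:f + size])]


def restore_block(past, future, start, mv, residual):
    """Restore one block from its estimate and the stored residual."""
    est = estimate(past, future, start, len(residual), mv)
    return [e + r for e, r in zip(est, residual)]


past = [0, 0, 9, 9, 0, 0, 0, 0]    # preceding frame of the second moving image data
future = [0, 0, 0, 0, 9, 9, 0, 0]  # following frame of the second moving image data
restored = restore_block(past, future, start=2, mv=1, residual=[0.0, 0.0, 0.0, 0.0])
```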

The fourth image processing method of the present invention includes: a first restoration step of restoring compressed data to third moving image data by using second moving image data, the compressed data being obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and the third moving image data, which remains after the second moving image data is removed from the first moving image data, detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data, obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and compressing the block using the estimated value of the block; and a second restoration step of combining the second and third moving image data to restore the first moving image data. The first restoration step includes: a detection step of obtaining, from the one motion vector, the positional relationship having a high correlation with the block in the plurality of frames of the second moving image data; an estimation step of obtaining the estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector; and a block restoration step of restoring the block using the estimated value of the block and the compressed data.

The fourth program of the present invention includes: a first restoration step of restoring compressed data to third moving image data by using second moving image data, the compressed data being obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and the third moving image data, which remains after the second moving image data is removed from the first moving image data, detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data, obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and compressing the block using the estimated value of the block; and a second restoration step of combining the second and third moving image data to restore the first moving image data. The first restoration step includes: a detection step of obtaining, from the one motion vector, the positional relationship having a high correlation with the block in the plurality of frames of the second moving image data; an estimation step of obtaining the estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector; and a block restoration step of restoring the block using the estimated value of the block and the compressed data.

The program recorded on the fourth program recording medium of the present invention includes: a first restoration step of restoring compressed data to third moving image data by using second moving image data, the compressed data being obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and the third moving image data, which remains after the second moving image data is removed from the first moving image data, detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data, obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and compressing the block using the estimated value of the block; and a second restoration step of combining the second and third moving image data to restore the first moving image data. The first restoration step includes: a detection step of obtaining, from the one motion vector, the positional relationship having a high correlation with the block in the plurality of frames of the second moving image data; an estimation step of obtaining the estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector; and a block restoration step of restoring the block using the estimated value of the block and the compressed data.

In the first image processing apparatus, the first image processing method, the first program, and the program recorded on the first program recording medium of the present invention, first moving image data is separated into second moving image data having a frame rate lower than that of the first moving image data and third moving image data that remains after the second moving image data is removed from the first moving image data, and a positional relationship having a high correlation is detected in a plurality of frames of the second moving image data. Further, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from image data of the plurality of frames of the second moving image data at the detected positional relationship, and the third moving image data is compressed in units of blocks using the estimated value of the block.
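
As an illustration only, the separation of the first moving image data into second and third moving image data, and the later recombination, can be sketched as follows. The sketch assumes that every fourth frame is kept as the low-frame-rate data (as would turn a 240 fps sequence into a 60 fps one); the text does not state which frames are kept, and the names `split_frames`, `merge_frames`, and `ratio` are hypothetical.

```python
def split_frames(frames, ratio=4):
    """Split a sequence into every `ratio`-th frame and the remaining frames.

    Assumption: the low-frame-rate ("second") data is formed by keeping
    frames 0, ratio, 2*ratio, ...; the rest is the "third" data.
    """
    second = [f for i, f in enumerate(frames) if i % ratio == 0]
    third = [f for i, f in enumerate(frames) if i % ratio != 0]
    return second, third


def merge_frames(second, third, ratio=4):
    """Inverse of split_frames: interleave both parts in the original order."""
    it2, it3 = iter(second), iter(third)
    total = len(second) + len(third)
    return [next(it2) if i % ratio == 0 else next(it3) for i in range(total)]


# Frames are represented by their indices here for readability.
frames = list(range(12))
second, third = split_frames(frames)
restored = merge_frames(second, third)
print(second, third, restored == frames)
```

Only the third moving image data is compressed block by block; the second moving image data is carried as is and serves as the reference on both sides.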

The first data structure of the present invention and the data recorded on the first data recording medium include second moving image data and compressed data. The compressed data is obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which remains after the second moving image data is removed from the first moving image data, detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data, obtaining, from image data of the plurality of frames of the second moving image data at the positional relationship having a high correlation, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, and compressing the block using the estimated value of the block.

In the second image processing apparatus, the second image processing method, the second program, and the program recorded on the second program recording medium of the present invention, the third moving image data is restored, and the second and third moving image data are then combined to restore the first moving image data. To restore the third moving image data, a positional relationship having a high correlation is detected in a plurality of frames of the second moving image data, and an estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from image data of the plurality of frames of the second moving image data at that positional relationship. The third moving image data is then restored in units of blocks using the estimated value of the block and the compressed data.
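
The following is a minimal sketch of this scheme, not the implementation of the embodiments. It assumes that the candidate positions in the preceding and following reference frames are symmetric about the target block, that the smallest sum of absolute differences marks the highest correlation, that the estimate is the average of the two matched regions, and that compression is a plain subtraction of the estimate. All function names and the synthetic test scene are hypothetical. Because the search uses only the reference frames, the restoring side can repeat it without receiving a motion vector.

```python
def make_frame(t, h=6, w=8):
    """Synthetic scene that shifts one pixel to the right per frame."""
    return [[3 * y * y + (x - t) ** 2 for x in range(w)] for y in range(h)]


def get_block(frame, y, x, size):
    """Return a size x size block as a flat list, or None if out of bounds."""
    h, w = len(frame), len(frame[0])
    if y < 0 or x < 0 or y + size > h or x + size > w:
        return None
    return [frame[y + j][x + i] for j in range(size) for i in range(size)]


def estimate_block(past, future, y, x, size=2, search=1):
    """Estimate the block at (y, x) from the two reference frames only."""
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            a = get_block(past, y - dy, x - dx, size)
            b = get_block(future, y + dy, x + dx, size)
            if a is None or b is None:
                continue
            sad = sum(abs(p - q) for p, q in zip(a, b))
            if best is None or sad < best[0]:
                best = (sad, a, b)
    _, a, b = best  # (dy, dx) = (0, 0) is always valid for an in-frame block
    return [(p + q) // 2 for p, q in zip(a, b)]


def compress_block(target, estimate):
    """Compressing side: keep only the difference from the estimate."""
    return [t - e for t, e in zip(target, estimate)]


def restore_block(residual, estimate):
    """Restoring side: add the same estimate back."""
    return [r + e for r, e in zip(residual, estimate)]


past, current, future = make_frame(0), make_frame(1), make_frame(2)
target = get_block(current, 2, 3, 2)
residual = compress_block(target, estimate_block(past, future, 2, 3))
restored = restore_block(residual, estimate_block(past, future, 2, 3))
print(residual, restored == target)
```

In this synthetic scene the motion is uniform, so the residual is all zeros; real content would leave a small residual for a subsequent entropy coder.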

In the third image processing apparatus, the third image processing method, the third program, and the program recorded on the third program recording medium of the present invention, first moving image data is separated into second moving image data having a frame rate lower than that of the first moving image data and third moving image data that remains after the second moving image data is removed from the first moving image data, and, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected. An estimated value of the block is then obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and the third moving image data is compressed in units of blocks using the estimated value of the block.

The second data structure of the present invention and the data recorded on the second data recording medium include second moving image data and compressed data. The compressed data is obtained by separating first moving image data into the second moving image data, which has a frame rate lower than that of the first moving image data, and third moving image data, which remains after the second moving image data is removed from the first moving image data, detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data, obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and compressing the block using the estimated value of the block.

In the fourth image processing apparatus, the fourth image processing method, the fourth program, and the program recorded on the fourth program recording medium of the present invention, the third moving image data is restored, and the second and third moving image data are then combined to restore the first moving image data. To restore the third moving image data, the positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is obtained for the block from the one motion vector, and the estimated value of the block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector. The block is then restored using the estimated value of the block and the compressed data.
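
The single-motion-vector variant can be sketched in the same spirit. This is an assumption-laden illustration: the one vector `(dy, dx)` is taken to point to `(y - dy, x - dx)` in the preceding reference frame and to `(y + dy, x + dx)` in the following one, the compressing side chooses the vector that minimizes the residual against the actual block, and the restoring side reuses the transmitted vector instead of searching. The function names and the test scene are hypothetical.

```python
def make_frame(t, h=6, w=8):
    """Synthetic scene that shifts one pixel to the right per frame."""
    return [[3 * y * y + (x - t) ** 2 for x in range(w)] for y in range(h)]


def get_block(frame, y, x, size):
    """Return a size x size block as a flat list, or None if out of bounds."""
    h, w = len(frame), len(frame[0])
    if y < 0 or x < 0 or y + size > h or x + size > w:
        return None
    return [frame[y + j][x + i] for j in range(size) for i in range(size)]


def predict(past, future, y, x, mv, size):
    """Average the two reference regions addressed by one motion vector."""
    dy, dx = mv
    a = get_block(past, y - dy, x - dx, size)
    b = get_block(future, y + dy, x + dx, size)
    if a is None or b is None:
        return None
    return [(p + q) // 2 for p, q in zip(a, b)]


def encode_block(target, past, future, y, x, size=2, search=1):
    """Pick the vector whose prediction leaves the smallest residual."""
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            est = predict(past, future, y, x, (dy, dx), size)
            if est is None:
                continue
            residual = [t - e for t, e in zip(target, est)]
            cost = sum(abs(r) for r in residual)
            if best is None or cost < best[0]:
                best = (cost, (dy, dx), residual)
    return best[1], best[2]


def decode_block(mv, residual, past, future, y, x, size=2):
    """Rebuild the block from the transmitted vector and residual."""
    est = predict(past, future, y, x, mv, size)
    return [r + e for r, e in zip(residual, est)]


past, current, future = make_frame(0), make_frame(1), make_frame(2)
target = get_block(current, 2, 3, 2)
mv, residual = encode_block(target, past, future, 2, 3)
print(mv, residual, decode_block(mv, residual, past, future, 2, 3) == target)
```

Compared with searching on the reference frames alone, this variant spends bits on the vector but can follow the actual block when the reference frames by themselves are ambiguous.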

According to the present invention, high-frame-rate moving image data can be compressed at a high compression rate. Further, according to the present invention, moving image data compressed in this way can be restored.

Embodiments of the present invention are described below. The correspondence between the constituent features recited in the claims and the specific examples in the embodiments is illustrated as follows. This description is intended to confirm that specific examples supporting the invention recited in the claims are described in the embodiments. Accordingly, even if a specific example is described in the embodiments but is not listed here as corresponding to a constituent feature, that does not mean that the specific example does not correspond to that constituent feature. Conversely, even if a specific example is listed here as corresponding to a constituent feature, that does not mean that the specific example does not correspond to constituent features other than that one.

Furthermore, this description does not mean that every invention corresponding to the specific examples described in the embodiments is recited in the claims. In other words, this description does not deny the existence of inventions that correspond to specific examples described in the embodiments but are not recited in the claims of this application, that is, inventions that may in the future be filed as divisional applications or added by amendment.

The image processing apparatus according to claim 1 is
an image processing apparatus that processes moving image data (for example, the transmission apparatus 1 in FIG. 79), comprising:
separation means (for example, the separation circuit 213 in FIG. 79) for separating first moving image data (for example, 240 fps moving image data) into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data;
detection means (for example, the maximum correlation position detection unit 254 in FIG. 94) for detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
estimation means (for example, the average value calculation unit 255 in FIG. 94) for obtaining, from image data of the plurality of frames of the second moving image data at the positional relationship detected by the detection means, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and
compression means (for example, the subtraction unit 256 in FIG. 94) for compressing the third moving image data in units of blocks using the estimated value of the block.

The image processing apparatus according to claim 3 further comprises:
other detection means (for example, the maximum correlation position detection unit 274 in FIG. 96) for detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
other estimation means (for example, the average value calculation unit 275 in FIG. 96) for obtaining another estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector;
other compression means (for example, the subtraction unit 276 in FIG. 96) for compressing the third moving image data in units of blocks using the other estimated value of the block; and
selection means (for example, the selection circuit 240 in FIG. 81) for selecting either the compressed data obtained by compression of the third moving image data by the compression means or the compressed data obtained by compression of the third moving image data by the other compression means, adding identification information for identifying the selected compressed data, and outputting the result.
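
The selection with identification information can be pictured with the short sketch below. The selection criterion is not stated in this passage, so the sketch assumes the candidate with the smaller sum of absolute residual values is chosen; the `method_id` field and the function name are likewise hypothetical.

```python
def select_compressed(residual_a, residual_b):
    """Choose one of two candidate residuals and tag which one was chosen.

    Assumption: a smaller sum of absolute values means cheaper compressed
    data; 0 tags the first compression path, 1 the other one.
    """
    cost_a = sum(abs(r) for r in residual_a)
    cost_b = sum(abs(r) for r in residual_b)
    if cost_a <= cost_b:
        return {"method_id": 0, "data": residual_a}
    return {"method_id": 1, "data": residual_b}


# The second candidate is cheaper here, so it is selected and tagged with 1.
print(select_compressed([1, -2, 0, 3], [0, 1, 0, 0]))
```

The restoring side would read the tag first and then apply the matching estimation path to the accompanying data.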

The image processing apparatus according to claim 7 further comprises
filter means (for example, the band limiting filter unit 212 in FIG. 79) for filtering the first moving image data using, as a pass band, a region in the frequency domain defined by a frequency axis in the time direction and a frequency axis in the spatial direction, the region extending in a principal component direction, which is the direction of the principal component of the first moving image data, and having a specific width in the direction of the frequency axis in the time direction,
wherein the separation means separates the first moving image data filtered by the filter means into the second and third moving image data.

The image processing method according to claim 8 is
an image processing method for processing moving image data, including:
a separation step (for example, step S282 in FIG. 108) of separating first moving image data (for example, 240 fps moving image data) into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data;
a detection step (for example, step S212 in FIG. 95) of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
an estimation step (for example, step S213 in FIG. 95) of obtaining, from image data of the plurality of frames of the second moving image data at the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and
a compression step (for example, step S214 in FIG. 95) of compressing the third moving image data in units of blocks using the estimated value of the block.

The image processing method according to claim 9 further includes:
another detection step (for example, step S222 in FIG. 97) of detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
another estimation step (for example, step S223 in FIG. 97) of obtaining another estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector;
another compression step (for example, step S224 in FIG. 97) of compressing the third moving image data in units of blocks using the other estimated value of the block; and
a selection step (for example, steps S292 and S293 in FIG. 109) of selecting either the compressed data obtained by compression of the third moving image data in the compression step or the compressed data obtained by compression of the third moving image data in the other compression step, adding identification information for identifying the selected compressed data, and outputting the result.

The program according to claim 10 and the program recorded on the program recording medium according to claim 12 are each
a program that causes a computer to process moving image data, including:
a separation step (for example, step S282 in FIG. 108) of separating first moving image data (for example, 240 fps moving image data) into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data;
a detection step (for example, step S212 in FIG. 95) of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
an estimation step (for example, step S213 in FIG. 95) of obtaining, from image data of the plurality of frames of the second moving image data at the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and
a compression step (for example, step S214 in FIG. 95) of compressing the third moving image data in units of blocks using the estimated value of the block.

The program according to claim 11 and the program recorded on the program recording medium according to claim 13 each further include:
another detection step (for example, step S222 in FIG. 97) of detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
another estimation step (for example, step S223 in FIG. 97) of obtaining another estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector;
another compression step (for example, step S224 in FIG. 97) of compressing the third moving image data in units of blocks using the other estimated value of the block; and
a selection step (for example, steps S292 and S293 in FIG. 109) of selecting either the compressed data obtained by compression of the third moving image data in the compression step or the compressed data obtained by compression of the third moving image data in the other compression step, adding identification information for identifying the selected compressed data, and outputting the result.

The image processing apparatus according to claim 18 is
an image processing apparatus that processes moving image data (for example, the receiving apparatus 2 in FIG. 110), comprising:
first restoration means (for example, the difference information restoration unit 364 in FIG. 110) for restoring compressed data to third moving image data by using second moving image data, the compressed data being obtained by
separating first moving image data (for example, 240 fps moving image data) into the second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and the third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data,
detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data,
obtaining, from image data of the plurality of frames of the second moving image data at the positional relationship having a high correlation, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, and
compressing the block using the estimated value of the block; and
second restoration means (for example, the synthesis unit 365 in FIG. 110) for combining the second and third moving image data to restore the first moving image data,
wherein the first restoration means includes:
detection means (for example, the maximum correlation position detection unit 377 in FIG. 113) for detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
estimation means (for example, the estimation unit 378 in FIG. 113) for obtaining, from image data of the plurality of frames of the second moving image data at the positional relationship detected by the detection means, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and
block restoration means (for example, the addition unit 379 in FIG. 113) for restoring the third moving image data in units of blocks using the estimated value of the block and the compressed data.

請求項２１に記載の画像処理方法は、
動画データを処理する画像処理方法において、
第１の動画データ（例えば、240fps動画データ）を、その第１の動画データのフレームレートよりも低いフレームレートの第２の動画データ（例えば、60fps動画データ）と、前記第１の動画データから第２の動画データを除いた残りの第３の動画データ（例えば、240-60fps動画データ）とに分離し、
前記第２の動画データの複数フレームにおいて、相関が高い位置関係を検出し、
前記相関が高い位置関係にある、前記第２の動画データの複数フレームの画像データから、前記第３の動画データのフレームをブロック分割して得られるブロックの推測値を求め、
前記ブロックの推測値を用いて、前記ブロックを圧縮する
ことにより得られる圧縮データを、前記第２の動画データを用いて、前記第３の動画データに復元する第１の復元ステップ（例えば、図１１１のステップＳ３０２）と、
前記第２と第３の動画データを合成し、前記第１の動画データを復元する第２の復元ステップ（例えば、図１１１のステップＳ３０３）と
を含み、
前記第１の復元ステップは、
前記第２の動画データの複数フレームにおいて、相関が高い位置関係を検出する検出ステップ（例えば、図１１６のステップＳ３５１や、図１１７のステップＳ３６１）と、
前記検出ステップで検出された前記位置関係にある、前記第２の動画データの複数フレームの画像データから、前記第３の動画データのフレームをブロック分割して得られるブロックの推測値を求める推測ステップ（例えば、図１１６のステップＳ３５２や、図１１７のＳ３６２）と、
前記ブロックの推測値と、前記圧縮データとを用いて、前記第３の動画データをブロック単位で復元するブロック復元ステップ（例えば、図１１６のステップＳ３５３や、図１１７のステップＳ３６３）と
を含む
ことを特徴とする。 The image processing method according to claim 21,
In an image processing method for processing video data,
First moving image data (for example, 240 fps moving image data) is separated into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data,
A positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
An estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data that are in the highly correlated positional relationship, and
The block is compressed using the estimated value of the block; the method includes a first restoration step (for example, step S302 in FIG. 111) of restoring the compressed data thus obtained into the third moving image data using the second moving image data, and
A second restoration step (for example, step S303 in FIG. 111) of combining the second and third moving image data to restore the first moving image data.
The first restoration step includes:
A detection step (for example, step S351 in FIG. 116 or step S361 in FIG. 117) of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
An estimation step (for example, step S352 in FIG. 116 or step S362 in FIG. 117) of obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and
A block restoration step (for example, step S353 in FIG. 116 or step S363 in FIG. 117) of restoring the third moving image data in units of blocks using the estimated value of the block and the compressed data.
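The separation into second and third moving image data, and the second restoration step that combines them again, can be sketched as follows. This is an illustrative assumption rather than the patent's stated rule: every fourth frame is taken as the low-rate sequence (240 fps to 60 fps), and the function names `separate` and `synthesize` are hypothetical.

```python
def separate(frames, ratio=4):
    """Split a frame sequence into a low-rate part and the remainder.

    Assumption: frames at indices 0, ratio, 2*ratio, ... form the low-rate
    sequence, so ratio=4 turns 240 fps into 60 fps.
    """
    low = [f for i, f in enumerate(frames) if i % ratio == 0]
    rest = [f for i, f in enumerate(frames) if i % ratio != 0]
    return low, rest


def synthesize(low, rest, ratio=4):
    """Interleave the two sequences back into the original frame order."""
    low_it, rest_it = iter(low), iter(rest)
    total = len(low) + len(rest)
    return [next(low_it) if i % ratio == 0 else next(rest_it) for i in range(total)]
```

With this layout, three of every four frames end up in the third moving image data, which is the part compressed block by block against the low-rate frames.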

請求項２３に記載のプログラムおよび請求項２５に記載のプログラム記録媒体に記録されているプログラムは、
動画データの処理を、コンピュータに実行させるプログラムにおいて、
第１の動画データ（例えば、240fps動画データ）を、その第１の動画データのフレームレートよりも低いフレームレートの第２の動画データ（例えば、60fps動画データ）と、前記第１の動画データから第２の動画データを除いた残りの第３の動画データ（例えば、240-60fps動画データ）とに分離し、
前記第２の動画データの複数フレームにおいて、相関が高い位置関係を検出し、
前記相関が高い位置関係にある、前記第２の動画データの複数フレームの画像データから、前記第３の動画データのフレームをブロック分割して得られるブロックの推測値を求め、
前記ブロックの推測値を用いて、前記ブロックを圧縮する
ことにより得られる圧縮データを、前記第２の動画データを用いて、前記第３の動画データに復元する第１の復元ステップ（例えば、図１１１のステップＳ３０２）と、
前記第２と第３の動画データを合成し、前記第１の動画データを復元する第２の復元ステップ（例えば、図１１１のステップＳ３０３）と
を含み、
前記第１の復元ステップは、
前記第２の動画データの複数フレームにおいて、相関が高い位置関係を検出する検出ステップ（例えば、図１１６のステップＳ３５１や、図１１７のステップＳ３６１）と、
前記検出ステップで検出された前記位置関係にある、前記第２の動画データの複数フレームの画像データから、前記第３の動画データのフレームをブロック分割して得られるブロックの推測値を求める推測ステップ（例えば、図１１６のステップＳ３５２や、図１１７のＳ３６２）と、
前記ブロックの推測値と、前記圧縮データとを用いて、前記第３の動画データをブロック単位で復元するブロック復元ステップ（例えば、図１１６のステップＳ３５３や、図１１７のステップＳ３６３）と
を含む
ことを特徴とする。 The program according to claim 23 and the program recorded on the program recording medium according to claim 25 are:
In a program that causes a computer to process video data,
First moving image data (for example, 240 fps moving image data) is separated into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data,
A positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
An estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data that are in the highly correlated positional relationship, and
The block is compressed using the estimated value of the block; the program includes a first restoration step (for example, step S302 in FIG. 111) of restoring the compressed data thus obtained into the third moving image data using the second moving image data, and
A second restoration step (for example, step S303 in FIG. 111) of combining the second and third moving image data to restore the first moving image data.
The first restoration step includes:
A detection step (for example, step S351 in FIG. 116 or step S361 in FIG. 117) of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
An estimation step (for example, step S352 in FIG. 116 or step S362 in FIG. 117) of obtaining, from the image data of the plurality of frames of the second moving image data that are in the positional relationship detected in the detection step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks; and
A block restoration step (for example, step S353 in FIG. 116 or step S363 in FIG. 117) of restoring the third moving image data in units of blocks using the estimated value of the block and the compressed data.

請求項２７に記載の画像処理装置は、
動画データを処理する画像処理装置（例えば、図７９の送信装置１）において、
第１の動画データ（例えば、240fps動画データ）を、その第１の動画データのフレームレートよりも低いフレームレートの第２の動画データ（例えば、60fps動画データ）と、前記第１の動画データから第２の動画データを除いた残りの第３の動画データ（例えば、240-60fps動画データ）とに分離する分離手段（例えば、図７９の分離回路２１３）と、
前記第３の動画データのフレームをブロック分割して得られるブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を表す１つの動きベクトルを検出する検出手段（例えば、図９６の相関最大位置検出部２７４）と、
前記ブロックとの位置関係が前記１つの動きベクトルから求められる位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求める推測手段（例えば、図９６の平均値計算部２７５）と、
前記ブロックの推測値を用いて、前記第３の動画データをブロック単位で圧縮する圧縮手段（例えば、図９６の減算部２７６）と
を備えることを特徴とする。 The image processing apparatus according to claim 27,
In an image processing apparatus (for example, the transmission apparatus 1 in FIG. 79) that processes moving image data,
Separating means (for example, the separation circuit 213 in FIG. 79) for separating first moving image data (for example, 240 fps moving image data) into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data;
Detecting means (for example, the maximum correlation position detection unit 274 in FIG. 96) for detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
Estimating means (for example, the average value calculation unit 275 in FIG. 96) for obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and
Compression means (for example, the subtraction unit 276 in FIG. 96) for compressing the third moving image data in units of blocks using the estimated value of the block.
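A minimal sketch of the detect, estimate, and compress chain is given below, together with the matching restoration on the receiving side. The assumptions are loud ones: the target frame is taken to lie midway between one past and one future frame of the second moving image data, the single motion vector is chosen by minimizing the sum of absolute differences between the block and the averaged prediction, and "compression" is reduced to taking the difference (entropy coding is omitted). The names `encode_block` and `decode_block` are hypothetical.

```python
def crop(frame, top, left, size):
    """Return the size x size block whose top-left corner is (top, left)."""
    return [row[left:left + size] for row in frame[top:top + size]]


def predict(past, future, top, left, size, mv):
    """Average the past block at -mv and the future block at +mv."""
    dy, dx = mv
    p = crop(past, top - dy, left - dx, size)
    f = crop(future, top + dy, left + dx, size)
    return [[(a + b) / 2 for a, b in zip(rp, rf)] for rp, rf in zip(p, f)]


def encode_block(target, past, future, top, left, size, search=2):
    """Return (motion vector, residual) for one block of the target frame."""
    block = crop(target, top, left, size)
    h, w = len(past), len(past[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            rows = (top - dy, top + dy)
            cols = (left - dx, left + dx)
            if min(rows + cols) < 0 or max(rows) + size > h or max(cols) + size > w:
                continue  # candidate falls outside the frame
            est = predict(past, future, top, left, size, (dy, dx))
            cost = sum(abs(b - e) for rb, re_ in zip(block, est)
                       for b, e in zip(rb, re_))
            if best is None or cost < best[0]:
                best = (cost, (dy, dx), est)
    _, mv, est = best
    # The subtraction plays the role of the compression means in this sketch.
    residual = [[b - e for b, e in zip(rb, re_)] for rb, re_ in zip(block, est)]
    return mv, residual


def decode_block(past, future, top, left, size, mv, residual):
    """Rebuild the block from the reference frames, the vector and the residual."""
    est = predict(past, future, top, left, size, mv)
    return [[e + r for e, r in zip(re_, rr)] for re_, rr in zip(est, residual)]
```

When the motion is close to uniform, the residual is near zero and therefore cheap to code, which is the motivation for estimating the block from the low-rate frames.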

請求項３０に記載の画像処理装置は、
時間方向の周波数軸と空間方向の周波数軸とで定義される周波数ドメインにおいて、前記第１の動画データの主成分の方向である主成分方向に延びる領域であって、前記時間方向の周波数軸の方向に特定の幅を有する領域を通過帯域として、前記第１の動画データをフィルタリングするフィルタ手段（例えば、図７９の帯域制限フィルタ部２１２）をさらに備え、
前記分離手段は、前記フィルタ手段によるフィルタリング後の前記第１の動画データを、前記第２と第３の動画データに分離する
ことを特徴とする。 An image processing apparatus according to claim 30 is provided.
Filter means (for example, the band limiting filter unit 212 in FIG. 79) is further provided for filtering the first moving image data, using as a pass band a region that, in the frequency domain defined by the frequency axis in the time direction and the frequency axes in the spatial directions, extends in the principal component direction, which is the direction of the principal component of the first moving image data, and has a specific width in the direction of the frequency axis in the time direction,
And the separating means separates the first moving image data filtered by the filter means into the second and third moving image data.

請求項３１に記載の画像処理方法は、
動画データを処理する画像処理方法において、
第１の動画データ（例えば、240fps動画データ）を、その第１の動画データのフレームレートよりも低いフレームレートの第２の動画データ（例えば、60fps動画データ）と、前記第１の動画データから第２の動画データを除いた残りの第３の動画データ（例えば、240-60fps動画データ）とに分離する分離ステップ（例えば、図１０８のステップＳ２８２）と、
前記第３の動画データのフレームをブロック分割して得られるブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を表す１つの動きベクトルを検出する検出ステップ（例えば、図９７のステップＳ２２２）と、
前記ブロックとの位置関係が前記１つの動きベクトルから求められる位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求める推測ステップ（例えば、図９７のステップＳ２２３）と、
前記ブロックの推測値を用いて、前記第３の動画データをブロック単位で圧縮する圧縮ステップ（例えば、図９７のステップＳ２２４）と
を含むことを特徴とする。 The image processing method according to claim 31,
In an image processing method for processing video data,
A separation step (for example, step S282 in FIG. 108) of separating first moving image data (for example, 240 fps moving image data) into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data;
A detection step (for example, step S222 in FIG. 97) of detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
An estimation step (for example, step S223 in FIG. 97) of obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and
A compression step (for example, step S224 in FIG. 97) of compressing the third moving image data in units of blocks using the estimated value of the block.

請求項３２に記載のプログラム、および請求項３３に記載のプログラム記録媒体に記録されているプログラムは、
動画データの処理を、コンピュータに実行させるプログラムにおいて、
第１の動画データ（例えば、240fps動画データ）を、その第１の動画データのフレームレートよりも低いフレームレートの第２の動画データ（例えば、60fps動画データ）と、前記第１の動画データから第２の動画データを除いた残りの第３の動画データ（例えば、240-60fps動画データ）とに分離する分離ステップ（例えば、図１０８のステップＳ２８２）と、
前記第３の動画データのフレームをブロック分割して得られるブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を表す１つの動きベクトルを検出する検出ステップ（例えば、図９７のステップＳ２２２）と、
前記ブロックとの位置関係が前記１つの動きベクトルから求められる位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求める推測ステップ（例えば、図９７のステップＳ２２３）と、
前記ブロックの推測値を用いて、前記第３の動画データをブロック単位で圧縮する圧縮ステップ（例えば、図９７のステップＳ２２４）と
を含むことを特徴とする。 The program according to claim 32 and the program recorded in the program recording medium according to claim 33 are:
In a program that causes a computer to process video data,
A separation step (for example, step S282 in FIG. 108) of separating first moving image data (for example, 240 fps moving image data) into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data;
A detection step (for example, step S222 in FIG. 97) of detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
An estimation step (for example, step S223 in FIG. 97) of obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and
A compression step (for example, step S224 in FIG. 97) of compressing the third moving image data in units of blocks using the estimated value of the block.

請求項３６に記載の画像処理装置は、
動画データを処理する画像処理装置（例えば、図１１０の受信装置２）において、
第１の動画データ（例えば、240fps動画データ）を、その第１の動画データのフレームレートよりも低いフレームレートの第２の動画データ（例えば、60fps動画データ）と、前記第１の動画データから第２の動画データを除いた残りの第３の動画データ（例えば、240-60fps動画データ）とに分離し、
前記第３の動画データのフレームをブロック分割して得られるブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を表す１つの動きベクトルを検出し、
前記ブロックとの位置関係が前記１つの動きベクトルから求められる位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求め、
前記ブロックの推測値を用いて、前記ブロックを圧縮する
ことにより得られる圧縮データを、前記第２の動画データを用いて、前記第３の動画データに復元する第１の復元手段（例えば、図１１０の差分情報復元部３６４）と、
前記第２と第３の動画データを合成し、前記第１の動画データを復元する第２の復元手段（例えば、図１１０の合成部３６５）と
を備え、
前記第１の復元手段は、
前記１つの動きベクトルから、前記ブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を求める検出手段（例えば、図１１３の相関最大位置検出部３７７）と、
前記ブロックとの位置関係が前記１つの動きベクトルから求められた位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求める推測手段（例えば、図１１３の推測部３７８）と、
前記ブロックの推測値と、前記圧縮データとを用いて、前記ブロックを復元するブロック復元手段（例えば、図１１３の加算部３７９）と
を有する
ことを特徴とする。 The image processing apparatus according to claim 36,
In an image processing device (for example, the receiving device 2 in FIG. 110) that processes moving image data,
First moving image data (for example, 240 fps moving image data) is separated into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector, and
The block is compressed using the estimated value of the block; the apparatus includes first restoration means (for example, the difference information restoration unit 364 in FIG. 110) for restoring the compressed data thus obtained into the third moving image data using the second moving image data, and
Second restoration means (for example, the synthesizing unit 365 in FIG. 110) for combining the second and third moving image data to restore the first moving image data.
The first restoration means includes:
Detecting means (for example, the maximum correlation position detection unit 377 in FIG. 113) for obtaining, from the one motion vector, the positional relationship having a high correlation with the block in the plurality of frames of the second moving image data;
Estimating means (for example, the estimation unit 378 in FIG. 113) for obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and
Block reconstruction means (for example, the adding unit 379 in FIG. 113) for restoring the block using the estimated value of the block and the compressed data.

請求項３７に記載の画像処理方法は、
動画データを処理する画像処理方法において、
第１の動画データ（例えば、240fps動画データ）を、その第１の動画データのフレームレートよりも低いフレームレートの第２の動画データ（例えば、60fps動画データ）と、前記第１の動画データから第２の動画データを除いた残りの第３の動画データ（例えば、240-60fps動画データ）とに分離し、
前記第３の動画データのフレームをブロック分割して得られるブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を表す１つの動きベクトルを検出し、
前記ブロックとの位置関係が前記１つの動きベクトルから求められる位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求め、
前記ブロックの推測値を用いて、前記ブロックを圧縮する
ことにより得られる圧縮データを、前記第２の動画データを用いて、前記第３の動画データに復元する第１の復元ステップ（例えば、図１１１のステップＳ３０２）と、
前記第２と第３の動画データを合成し、前記第１の動画データを復元する第２の復元ステップ（例えば、図１１１のステップＳ３０３）と
を含み、
前記第１の復元ステップは、
前記１つの動きベクトルから、前記ブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を求める検出ステップ（例えば、図１１７のステップＳ３６１）と、
前記ブロックとの位置関係が前記１つの動きベクトルから求められた位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求める推測ステップ（例えば、図１１７のステップＳ３６２）と、
前記ブロックの推測値と、前記圧縮データとを用いて、前記ブロックを復元するブロック復元ステップ（例えば、図１１７のステップＳ３６３）と
を含む
ことを特徴とする。 The image processing method according to claim 37,
In an image processing method for processing video data,
First moving image data (for example, 240 fps moving image data) is separated into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector, and
The block is compressed using the estimated value of the block; the method includes a first restoration step (for example, step S302 in FIG. 111) of restoring the compressed data thus obtained into the third moving image data using the second moving image data, and
A second restoration step (for example, step S303 in FIG. 111) of combining the second and third moving image data to restore the first moving image data.
The first restoration step includes:
A detection step (for example, step S361 in FIG. 117) of obtaining, from the one motion vector, the positional relationship having a high correlation with the block in the plurality of frames of the second moving image data;
An estimation step (for example, step S362 in FIG. 117) of obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and
A block restoration step (for example, step S363 in FIG. 117) of restoring the block using the estimated value of the block and the compressed data.

請求項３８に記載のプログラム、および請求項３９に記載のプログラム記録媒体に記録されているプログラムは、
動画データの処理を、コンピュータに実行させるプログラムにおいて、
第１の動画データ（例えば、240fps動画データ）を、その第１の動画データのフレームレートよりも低いフレームレートの第２の動画データ（例えば、60fps動画データ）と、前記第１の動画データから第２の動画データを除いた残りの第３の動画データ（例えば、240-60fps動画データ）とに分離し、
前記第３の動画データのフレームをブロック分割して得られるブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を表す１つの動きベクトルを検出し、
前記ブロックとの位置関係が前記１つの動きベクトルから求められる位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求め、
前記ブロックの推測値を用いて、前記ブロックを圧縮する
ことにより得られる圧縮データを、前記第２の動画データを用いて、前記第３の動画データに復元する第１の復元ステップ（例えば、図１１１のステップＳ３０２）と、
前記第２と第３の動画データを合成し、前記第１の動画データを復元する第２の復元ステップ（例えば、図１１１のステップＳ３０３）と
を含み、
前記第１の復元ステップは、
前記１つの動きベクトルから、前記ブロックについて、前記第２の動画データの複数フレームにおいて前記ブロックとの相関が高い位置関係を求める検出ステップ（例えば、図１１７のステップＳ３６１）と、
前記ブロックとの位置関係が前記１つの動きベクトルから求められた位置関係にある、前記第２の動画データの複数フレームの画像データから、前記ブロックの推測値を求める推測ステップ（例えば、図１１７のステップＳ３６２）と、
前記ブロックの推測値と、前記圧縮データとを用いて、前記ブロックを復元するブロック復元ステップ（例えば、図１１７のステップＳ３６３）と
を含む
ことを特徴とする。 The program according to claim 38 and the program recorded in the program recording medium according to claim 39 are:
In a program that causes a computer to process video data,
First moving image data (for example, 240 fps moving image data) is separated into second moving image data (for example, 60 fps moving image data) having a frame rate lower than that of the first moving image data and third moving image data (for example, 240-60 fps moving image data) that remains after the second moving image data is removed from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector, and
The block is compressed using the estimated value of the block; the program includes a first restoration step (for example, step S302 in FIG. 111) of restoring the compressed data thus obtained into the third moving image data using the second moving image data, and
A second restoration step (for example, step S303 in FIG. 111) of combining the second and third moving image data to restore the first moving image data.
The first restoration step includes:
A detection step (for example, step S361 in FIG. 117) of obtaining, from the one motion vector, the positional relationship having a high correlation with the block in the plurality of frames of the second moving image data;
An estimation step (for example, step S362 in FIG. 117) of obtaining an estimated value of the block from the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the one obtained from the one motion vector; and
A block restoration step (for example, step S363 in FIG. 117) of restoring the block using the estimated value of the block and the compressed data.

次に、本発明の実施の形態について説明する前に、動画について、理論的な事柄を述べておく。 Next, before the embodiments of the present invention are described, some theoretical background on moving images is given.

図１は、２次元平面（ｘ，ｙ）に時間方向（ｔ）を加えた３次元空間における動画データを示している。 FIG. 1 shows moving image data in a three-dimensional space in which a time direction (t) is added to a two-dimensional plane (x, y).

図１において、画像P₁₀₁は、ある時刻（フレーム）の画像であり、画像P₁₀₂は、その次の時刻の画像であり、画像P₁₀₃は、さらにその次の時刻の画像である。画像P₁₀₄は、さらにその次の時刻の画像である。図示を省略したが、画像P₁₀₁の前の時刻と、画像P₁₀₄の後の時刻にも、画像が存在する。 In FIG. 1, image P₁₀₁ is the image at a certain time (frame), image P₁₀₂ is the image at the next time, image P₁₀₃ is the image at the time after that, and image P₁₀₄ is the image at the time after that again. Although not shown, images also exist at times before image P₁₀₁ and after image P₁₀₄.

図１では、画像P₁₀₁乃至P₁₀₄には、時間の経過とともに、y方向に移動する物体（被写体）が写っている。 In FIG. 1, images P₁₀₁ to P₁₀₄ show an object (subject) that moves in the y direction over time.

以上のような動画（動画データ）を、人間が見た場合、動画のフレームレートが、あるフレームレート以上であるときには、動画の隣接する２つのフレーム（の画像）の違いを認識することは出来ない。実際に被験者に視覚実験を行った結果、人間が、隣接する２つのフレーム（の画像）の違いを認識することができないフレームレートは、240fps程度以上であることが分かっている。 When a human views a moving image (moving image data) such as the above, the difference between two adjacent frames (their images) cannot be recognized if the frame rate of the moving image is at or above a certain frame rate. Visual experiments actually conducted on test subjects have shown that the frame rate at which a human can no longer recognize the difference between two adjacent frames (their images) is about 240 fps or higher.

ここで、隣接する２つのフレームどうしの間の時間（フレーム周期）を、t₀と表すこととすると、フレームレートは、１／t₀と表すことができる。 Here, if the time (frame period) between two adjacent frames is expressed as t ₀ , the frame rate can be expressed as 1 / t ₀ .

なお、以下では、フレーム周期t₀は、１／２４０秒程度として説明を行うが、フレーム周期t₀は、１／２４０秒程度でなくても良く、例えば、１／１２０秒程度であっても良い。但し、フレーム周期t₀が、例えば、１／１２０秒のように、人間が、隣接する２つのフレーム（の画像）の違いを認識することができないフレーム周期（１／２４０秒程度）より長い場合には、後述する表示装置３（図２４）で表示される動画について、その画質の劣化が、多少、知覚される。 In the following description, the frame period t₀ is assumed to be about 1/240 second, but it need not be; for example, it may be about 1/120 second. However, when the frame period t₀ is longer than the frame period at which a human cannot recognize the difference between two adjacent frames (about 1/240 second), for example 1/120 second, some degradation in image quality is perceived in the moving image displayed on the display device 3 (FIG. 24) described later.

次に、人間の空間方向の認識能力、即ち、近接した２つの点を１つの点ではなく２つの点であると認識（知覚）することができる限界の距離を、r₀と表す。 Next, r ₀ represents a human's recognition ability in the spatial direction, that is, a limit distance at which two adjacent points can be recognized (perceived) as two points instead of one point.

この場合、人間の視覚によって認識（知覚）することができる動画の範囲は、図２のように表すことができる。 In this case, the range of moving images that can be recognized (perceived) by human vision can be expressed as shown in FIG.

即ち、図２は、時間t方向の周波数軸Tと空間方向x,yの周波数軸X,Yとで定義される周波数ドメインを示している。 That is, FIG. 2 shows a frequency domain defined by the frequency axis T in the time t direction and the frequency axes X and Y in the spatial directions x and y.

なお、図が煩雑になるのを避けるため、図２では、空間方向xの周波数軸Xと空間方向yの周波数軸Yとを、１軸で表してある。即ち、図２では、左から右方向に、周波数軸XとYをとってある。また、図２では、上から下方向に、周波数軸Tをとってある。さらに、図２では、周波数軸T（縦方向）は、２π／（４t₀）単位で区切ってあり、周波数軸X,Y方向（横方向）は、２π／r₀単位で区切ってある。 To keep the figure from becoming cluttered, FIG. 2 represents the frequency axis X for spatial direction x and the frequency axis Y for spatial direction y as a single axis. That is, in FIG. 2 the frequency axes X and Y run from left to right, and the frequency axis T runs from top to bottom. Furthermore, the frequency axis T (vertical direction) is graduated in units of 2π/(4t₀), and the frequency axes X and Y (horizontal direction) are graduated in units of 2π/r₀.

ここで、周波数ドメインの、後述する他の図も、図２と同様になっている。 The other frequency-domain figures described later are drawn in the same way as FIG. 2.

人間は、図２に示す、横が２×２π／（２r₀）で、縦が２×２π／（２t₀）の、原点を中心とする領域R₂₀₁の範囲外にある高周波数成分を認識することができない。そこで、すべての動画データは、領域R₂₀₁内に存在することとして、以下、説明を行う。 A human cannot recognize high-frequency components outside region R₂₀₁ shown in FIG. 2, which is centered at the origin and measures 2 × 2π/(2r₀) horizontally and 2 × 2π/(2t₀) vertically. The following description therefore assumes that all moving image data lies within region R₂₀₁.

なお、図２では、領域R₂₀₁を、長方形状の領域として図示してあるが、領域R₂₀₁は、実際には、直方体状の領域である。即ち、領域R₂₀₁は、X方向が、−（π／r₀）乃至＋（π／r₀）で、Y方向が、−（π／r₀）乃至＋（π／r₀）で、T方向が、−（π／t₀）乃至＋（π／t₀）の範囲の領域である。 Although FIG. 2 shows region R₂₀₁ as a rectangle, it is actually a rectangular parallelepiped. That is, region R₂₀₁ spans −(π/r₀) to +(π/r₀) in the X direction, −(π/r₀) to +(π/r₀) in the Y direction, and −(π/t₀) to +(π/t₀) in the T direction.

次に、動画の各部分のデータの周波数ドメイン上の位置（分布）について説明する。 Next, the position (distribution) on the frequency domain of the data of each part of the moving image will be described.

動画において、ある部分では、そこに投影されている被写体は静止しており、別の部分では、そこに投影されている被写体は動いている。 In a moving image, the subject shown in one portion is stationary, while the subject shown in another portion is moving.

まず、静止している動画の部分（静止している被写体が投影されている動画の部分）は、時間方向に対して変化しない。 First, the stationary moving image portion (the moving image portion on which the stationary subject is projected) does not change with respect to the time direction.

図３は、そのような静止している動画の部分のデータが分布する周波数ドメイン上の領域R₃₀₁を示している。 FIG. 3 shows region R₃₀₁ in the frequency domain, over which the data of such a stationary portion of the moving image is distributed.

領域R₃₀₁は、X,Y方向が、−２π／（２r₀）乃至＋２π／（２r₀）で、T方向が−２π／（４t₀）乃至＋２π／（４t₀）の領域になっている。 Region R₃₀₁ spans −2π/(2r₀) to +2π/(2r₀) in the X and Y directions and −2π/(4t₀) to +2π/(4t₀) in the T direction.

なお、動画において完全に静止している部分のデータであれば、T=0であるが、つまり、領域R₃₀₁はT方向の幅が０であるが、ここでは、被写体が多少動いている場合も考慮して、領域R₃₀₁は、T方向の幅が０ではなく、２π／（２t₀）になっている。また、X,Y方向が、−２π／（２r₀）乃至＋２π／（２r₀）の範囲に制限されるのは、図２で説明したように、動画データの周波数成分は、X,Y方向については、−２π／（２r₀）乃至＋２π／（２r₀）の範囲を超える範囲（図２の領域R₂₀₁の範囲外）には、存在しないからである。 For data of a completely stationary portion of the moving image, T = 0, that is, region R₃₀₁ would have zero width in the T direction. Here, however, to allow for slight movement of the subject, region R₃₀₁ is given a T-direction width of 2π/(2t₀) rather than 0. The X and Y directions are limited to the range −2π/(2r₀) to +2π/(2r₀) because, as described with reference to FIG. 2, the moving image data has no frequency components in the X and Y directions beyond that range (outside region R₂₀₁ in FIG. 2).

ここで、周波数ドメインにおいて、静止している部分のデータは、T=0の直線（平面）の方向に分布するから、この方向は、静止している部分のデータの主成分（例えば、第１主成分）の方向である。 In the frequency domain, the data of a stationary portion is distributed along the line (plane) T = 0, so this direction is the direction of the principal component (for example, the first principal component) of the data of the stationary portion.

次に、図４は、動画に投影されている被写体が速度（r₀／t₀）／２程度で動いている部分のデータが分布する周波数ドメイン上の領域R₄₀₁を示している。 Next, FIG. 4 shows region R₄₀₁ in the frequency domain, over which the data of a portion in which the subject moves at a speed of about (r₀/t₀)/2 is distributed.

ここで、以下、適宜、説明を簡単にするために、周波数ドメイン上の点を、X,Y方向の座標Aと、T方向の座標Bとの２つだけを用いて、(A,B)と表す。 In the following, to simplify the description where appropriate, a point in the frequency domain is written (A, B), using only two coordinates: coordinate A in the X and Y directions and coordinate B in the T direction.

領域R₄₀₁は、原点(0,0)から点（π／r₀，-π／(2t₀)）の方向に延びる領域であって、T方向に2π／(2t₀)の幅を有する領域になっている。 Region R₄₀₁ extends in the direction from the origin (0, 0) to the point (π/r₀, −π/(2t₀)) and has a width of 2π/(2t₀) in the T direction.

なお、被写体が変形することなく、速度（r₀／t₀）／２で正確に等速直線運動していれば、領域R₄₀₁は、T方向の幅が０となるが、ここでは、被写体の多少の変形や、移動速度が（r₀／t₀）／２から多少ぶれること等を考慮して、T方向の幅が０ではなく、2π／(2t₀)になっている。 If the subject were in exactly uniform linear motion at speed (r₀/t₀)/2 without deforming, region R₄₀₁ would have zero width in the T direction. Here, however, to allow for slight deformation of the subject and slight deviation of its speed from (r₀/t₀)/2, the T-direction width is 2π/(2t₀) rather than 0.

また、領域R₄₀₁は、X,Y方向が、−２π／（２r₀）乃至＋２π／（２r₀）の範囲に制限されているが、これは、領域R₃₀₁のX,Y方向の範囲が制限されているのと同一の理由による。 Region R₄₀₁ is also limited to the range −2π/(2r₀) to +2π/(2r₀) in the X and Y directions, for the same reason that the X and Y range of region R₃₀₁ is limited.
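The two regions described so far can be expressed as a single membership test. The sketch below is an illustration only: it uses the simplified (A, B) notation with one spatial frequency coordinate, treats the stationary region as the case of speed 0 and the moving region as the case of speed (r₀/t₀)/2, and the function name `in_region` and its parameters are hypothetical.

```python
import math


def in_region(X, T, r0, t0, speed):
    """Return True if frequency point (X, T) lies in the band for `speed`.

    speed = 0              -> the stationary band (band centered on T = 0)
    speed = (r0 / t0) / 2  -> the band along (0, 0) -> (pi/r0, -pi/(2*t0))
    The band is limited to |X| <= pi/r0 and has a total width of
    2*pi/(2*t0) in the T direction, centered on the line T = -speed * X.
    """
    if abs(X) > math.pi / r0:
        return False
    center = -speed * X
    return abs(T - center) <= math.pi / (2 * t0)
```

For example, with r0 = t0 = 1 the point (π, −π/2) is the center of the moving band at its right edge, so it passes the test for speed 0.5.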

ここで、動画に投影されている被写体が速度（r₀／t₀）／２程度で動いている部分のデータが、原点(0,0)から点（π／r₀，-π／(2t₀)）の方向上に分布するのは、次の理由による。 The data of a portion in which the subject moves at a speed of about (r₀/t₀)/2 is distributed along the direction from the origin (0, 0) to the point (π/r₀, −π/(2t₀)) for the following reason.

即ち、図５の波形R₅₀₁は、ある時刻t₁における被写体の空間方向xの分布を示している。 That is, waveform R₅₀₁ in FIG. 5 shows the distribution of the subject in the spatial direction x at a certain time t₁.

被写体が、例えば、速度（r₀／t₀）／２で、例えば、空間方向xに移動しているとき、その移動している被写体は、波形R₅₀₁，R₅₀₂，R₅₀₃，R₅₀₄，R₅₀₅，R₅₀₆，・・・で表すことができる。 When the subject is moving, for example, in the spatial direction x at speed (r₀/t₀)/2, the moving subject can be represented by waveforms R₅₀₁, R₅₀₂, R₅₀₃, R₅₀₄, R₅₀₅, R₅₀₆, and so on.

ここで、波形R₅₀₁は、ある時刻（フレーム）t₁における被写体を表している。波形R₅₀₂は、波形R₅₀₁の時刻t₁の次の時刻（フレーム）t₁+t₀における被写体を表しており、波形R₅₀₁の位置から、r₀／２だけ、x方向に移動している。以下、同様に、波形R₅₀₃，R₅₀₄，R₅₀₅，R₅₀₆は、それぞれ、時刻t₁+2t₀，t₁+3t₀，t₁+4t₀，t₁+5t₀における被写体を表している。 Here, waveform R₅₀₁ represents the subject at a certain time (frame) t₁. Waveform R₅₀₂ represents the subject at the next time (frame) t₁ + t₀ and is displaced from the position of waveform R₅₀₁ by r₀/2 in the x direction. Similarly, waveforms R₅₀₃, R₅₀₄, R₅₀₅, and R₅₀₆ represent the subject at times t₁ + 2t₀, t₁ + 3t₀, t₁ + 4t₀, and t₁ + 5t₀, respectively.

図５における波形を空間位置xと時間tの関数と考え、この波形をFunc(x,t)と定義する。図５の波形R₅₀₁，R₅₀₂，R₅₀₃，R₅₀₄，R₅₀₅，R₅₀₆，・・・から明らかなように、Func(x,t)=Func(x-(（r₀／t₀）／２)×t,０)という関係がある。 Regard the waveform in FIG. 5 as a function of spatial position x and time t, and define it as Func(x, t). As is clear from waveforms R₅₀₁, R₅₀₂, R₅₀₃, R₅₀₄, R₅₀₅, R₅₀₆, and so on in FIG. 5, the relation Func(x, t) = Func(x − ((r₀/t₀)/2) × t, 0) holds.

さて、xとtという２つの変数による関数Funcを２次元フーリエ変換した周波数ドメイン上のデータを(X,T)とすると、この(X,T)の成分は、フーリエ変換の定義から明らかなように、空間方向xの周期がXであり、時間方向tの周期がTである成分を表している。 Now, let (X, T) denote data in the frequency domain obtained by the two-dimensional Fourier transform of the function Func of the two variables x and t. As is clear from the definition of the Fourier transform, the (X, T) component represents the component whose period is X in the spatial direction x and T in the time direction t.

Func(x,t)=Func(x-(（r₀／t₀）／２)×t,０)という関係より、(X,T)は、２次元ベクトル(r₀／２, t₀)に直交することが言える。 From the relation Func(x, t) = Func(x − ((r₀/t₀)/2) × t, 0), it follows that (X, T) is orthogonal to the two-dimensional vector (r₀/2, t₀).
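The orthogonality can be checked by writing out the transform. The following is a sketch under stated assumptions rather than a derivation taken from the source: X and T are treated as angular frequencies, the transform convention with kernel e^{−i(Xx+Tt)} is chosen for illustration, v stands for the speed (r₀/t₀)/2, g(x) stands for Func(x, 0), and G is the one-dimensional Fourier transform of g.

```latex
% Assumed notation: v = (r_0/t_0)/2, g(x) = Func(x, 0), G = 1-D transform of g.
\begin{aligned}
F(X,T) &= \iint \mathrm{Func}(x,t)\, e^{-i(Xx+Tt)}\,dx\,dt
        = \iint g(x - vt)\, e^{-i(Xx+Tt)}\,dx\,dt \\
       &= \int g(u)\, e^{-iXu}\,du \int e^{-i(T+vX)t}\,dt
        \qquad (u = x - vt) \\
       &= 2\pi\, G(X)\, \delta(T + vX).
\end{aligned}
```

The transform is therefore nonzero only where T + vX = 0. Multiplying by t₀ gives X·(r₀/2) + T·t₀ = 0, which is the statement that (X, T) is orthogonal to (r₀/2, t₀). The point (π/r₀, −π/(2t₀)) satisfies this equation, since (π/r₀)(r₀/2) − (π/(2t₀))t₀ = 0.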

今、図５を用いて説明したが、実際には、空間方向は２次元であり、時間軸も加えると３次元となり、３次元フーリエ変換を考えないといけない。そこで、図６を用いて再度説明する。図６では、図１と同様に、３次元上での説明を行うための図である。 The description so far used FIG. 5, but in reality the spatial direction is two-dimensional, which becomes three-dimensional once the time axis is added, so a three-dimensional Fourier transform must be considered. The explanation is therefore repeated with reference to FIG. 6, which, like FIG. 1, illustrates the three-dimensional case.

図６の２次元空間(x,y)上の波形R₆₀₁は、ある時刻t₁における被写体の空間方向(x,y)の分布を示している。 A waveform R ₆₀₁ on the two-dimensional space (x, y) in FIG. 6 shows the distribution in the spatial direction (x, y) of the subject at a certain time t ₁ .

When the subject is moving at, for example, speed (r₀/t₀)/2, the moving subject can be represented by waveforms R₆₀₁, R₆₀₂, R₆₀₃, R₆₀₄, .... Note that, unlike in the description of FIG. 5, r₀ here is a two-dimensional vector in the two-dimensional space (x,y).

Here, waveform R₆₀₁ represents the subject at a certain time (frame) t₁. Waveform R₆₀₂ represents the subject at time (frame) t₁+t₀, the frame following time t₁ of waveform R₆₀₁, and is displaced from the position of waveform R₆₀₁ by r₀/2. Similarly, waveforms R₆₀₃ and R₆₀₄ represent the subject at times t₁+2t₀ and t₁+3t₀, respectively.

The waveform in FIG. 6 is regarded as a function of the spatial position (x,y) and the time t, and is defined as Func((x,y),t). As is clear from waveforms R₆₀₁, R₆₀₂, R₆₀₃, R₆₀₄, ... in FIG. 6, the relation Func((x,y),t) = Func((x,y) − ((r₀/t₀)/2)×t, 0) holds.

Now, let (X,Y,T) denote a point in the frequency domain obtained by the three-dimensional Fourier transform of the function Func of the three variables x, y, and t. As is clear from the definition of the Fourier transform, the component at (X,Y,T) is the component whose (angular) frequency in the spatial direction x is X, whose (angular) frequency in the spatial direction y is Y, and whose (angular) frequency in the time direction t is T.

From the relation Func((x,y),t) = Func((x,y) − ((r₀/t₀)/2)×t, 0), it follows that any (X,Y,T) at which data exist is orthogonal to the three-dimensional vector (r₀/2, t₀).
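Written out in components, the three-dimensional condition becomes the equation of a plane through the origin. The component names r_{0x} and r_{0y} for the two-dimensional vector r₀ are introduced here only for illustration and do not appear in the original text.

```latex
X\,\frac{r_{0x}}{2} + Y\,\frac{r_{0y}}{2} + T\,t_0 = 0,
\qquad r_0 = (r_{0x},\, r_{0y}).
```

Setting Y = 0 and r_{0y} = 0 recovers the two-dimensional case, in which the plane reduces to a straight line in the (X,T) plane.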

As shown in FIGS. 5 and 6, (X,T) or (X,Y,T) is orthogonal to the vector (r₀/2, t₀). Therefore, as shown in FIG. 7 for example, data exist at the point (2π/(2r₀), −2π/(4t₀)) in the frequency domain (the position marked ○ in FIG. 7) and at the point (2π/(4r₀), −2π/(8t₀)) (the position marked △ in FIG. 7).
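The two example points can be checked numerically against the orthogonality condition. The following is a minimal sketch; the function name `is_on_motion_line` and the numeric values r₀ = 1 and t₀ = 1/240 s are assumptions chosen only for illustration.

```python
import math

def is_on_motion_line(X, T, shift, t0, rel_tol=1e-9):
    """Return True if the frequency point (X, T) is orthogonal to (shift, t0).

    shift is the displacement per frame (r0/2 in the text), t0 the frame period.
    """
    dot = X * shift + T * t0
    scale = abs(X * shift) + abs(T * t0)
    return abs(dot) <= rel_tol * max(scale, 1.0)

# Illustrative values (assumptions, not taken from the text).
r0 = 1.0
t0 = 1.0 / 240.0
shift = r0 / 2.0

circle = (2 * math.pi / (2 * r0), -2 * math.pi / (4 * t0))    # point marked with a circle
triangle = (2 * math.pi / (4 * r0), -2 * math.pi / (8 * t0))  # point marked with a triangle
other = (2 * math.pi / (2 * r0), -2 * math.pi / (2 * t0))     # point on the line for speed r0/t0

print(is_on_motion_line(*circle, shift, t0))
print(is_on_motion_line(*triangle, shift, t0))
print(is_on_motion_line(*other, shift, t0))
```

The first two points satisfy the condition, while the third, which belongs to a subject moving at r₀/t₀, does not.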

That is, every waveform moving at speed (r₀/t₀)/2 is distributed on the straight line (plane) R₇₀₁ passing through the origin (0,0) and the point (π/r₀, −π/(2t₀)).

The data (waveforms) of the other frequency components X of the portion moving at speed (r₀/t₀)/2 are likewise distributed on the straight line R₇₀₁.

A portion moving at speed (r₀/t₀)/2 shifts by r₀/2 in the spatial direction each time the time advances by t₀, so, as described above, its data are distributed in the frequency domain on the straight line R₇₀₁ passing through the origin (0,0) and the point (π/r₀, −π/(2t₀)).

In the frequency domain, the data of the portion moving at speed (r₀/t₀)/2 are distributed along the direction (π/r₀, −π/(2t₀)) of the straight line (plane) R₇₀₁, so this direction (π/r₀, −π/(2t₀)) is the direction of the principal component of the data of that portion.
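The concentration of the spectrum on a single line can be reproduced with a small discrete experiment. The sketch below makes assumptions that are not in the text: the waveform is sampled on a cyclic grid fine enough that the per-frame displacement is a whole number of samples (`shift`), the number of frames equals the number of spatial samples, and a naive discrete Fourier transform stands in for the continuous transform. The modulo in the line condition reflects the wrap-around (aliasing) of a discrete spectrum.

```python
import cmath

def moving_waveform(profile, shift, frames):
    """Frame t holds the profile cyclically displaced by shift*t samples."""
    n = len(profile)
    return [[profile[(x - shift * t) % n] for x in range(n)] for t in range(frames)]

def dft2(frames_data):
    """Naive 2D DFT; returns a dict mapping (kx, kt) to the coefficient."""
    n_t = len(frames_data)
    n_x = len(frames_data[0])
    spectrum = {}
    for kt in range(n_t):
        for kx in range(n_x):
            acc = 0j
            for t in range(n_t):
                for x in range(n_x):
                    phase = -2j * cmath.pi * (kx * x / n_x + kt * t / n_t)
                    acc += frames_data[t][x] * cmath.exp(phase)
            spectrum[(kx, kt)] = acc
    return spectrum

n = 8                       # spatial samples and frames (assumed equal)
shift = 1                   # displacement per frame, in samples
profile = [0.0, 1.0, 3.0, 1.0, 0.0, 0.0, 0.0, 0.0]   # arbitrary test waveform
spectrum = dft2(moving_waveform(profile, shift, frames=n))

# Discrete counterpart of X*shift + T*t0 = 0.
on_line = [abs(c) for (kx, kt), c in spectrum.items() if (kx * shift + kt) % n == 0]
off_line = [abs(c) for (kx, kt), c in spectrum.items() if (kx * shift + kt) % n != 0]
print(max(off_line) < 1e-9, max(on_line) > 1.0)
```

All coefficients off the line vanish to rounding error, so the whole spectrum of the translating waveform lies on the line through the origin, as the text states for R₇₀₁.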

Next, FIG. 8 shows region R₈₀₁ in the frequency domain in which the data of a portion of the moving image where the projected subject is moving at a speed of about r₀/t₀ are distributed.

Region R₈₀₁ extends from the origin (0,0) in the direction of the point (π/r₀, −π/t₀) and has a width of 2π/(2t₀) in the T direction.

That is, a portion moving at speed r₀/t₀ shifts by r₀ in the spatial direction each time the time advances by t₀, so its data are distributed in the frequency domain on the straight line passing through the origin (0,0) and the point (π/r₀, −π/t₀).

If the subject moved in exact uniform linear motion at speed r₀/t₀ without deforming, region R₈₀₁ would have zero width in the T direction. Here, however, the width in the T direction is set to 2π/(2t₀) rather than 0 to allow for slight deformation of the subject, slight deviation of the moving speed from r₀/t₀, and the like.
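The width of 2π/(2t₀) can be related to how much speed deviation the region tolerates. The sketch below rests on assumptions that are not in the text: the region is read as a strip of constant T-width centred on the motion line, the X range is |X| ≤ π/r₀, and the function name and numeric values are hypothetical.

```python
import math

def max_shift_deviation(r0, t0, t_width):
    """Largest per-frame displacement error `dev` for which the line
    T = -X * (shift + dev) / t0 stays inside a strip of T-width `t_width`
    centred on T = -X * shift / t0, for all |X| <= pi / r0."""
    x_max = math.pi / r0
    return (t_width / 2.0) * t0 / x_max

r0, t0 = 1.0, 1.0 / 240.0             # illustrative values (assumptions)
t_width = 2 * math.pi / (2 * t0)      # T-direction width stated for the region
dev = max_shift_deviation(r0, t0, t_width)
print(dev / r0)                       # tolerated deviation as a fraction of r0
```

Under this reading, the stated width accommodates a displacement error of up to half of r₀ per frame at the highest spatial frequency, and proportionally more at lower spatial frequencies.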

Region R₈₀₁ is limited in the X and Y directions to the range −2π/(2r₀) to +2π/(2r₀), for the same reason that the X and Y range of region R₃₀₁ in FIG. 3 is limited.

Furthermore, region R₈₀₁ is limited in the T direction to the range −2π/(2t₀) to +2π/(2t₀). This is because, as described with reference to FIG. 2, the frequency components of the moving image data do not exist in the T direction outside the range −2π/(2t₀) to +2π/(2t₀) (outside region R₂₀₁ in FIG. 2).

In the frequency domain, the data of the portion moving at speed r₀/t₀ are distributed along the direction (π/r₀, −π/t₀) of the straight line passing through the origin (0,0) and the point (π/r₀, −π/t₀), so this direction (π/r₀, −π/t₀) is the direction of the principal component of the data of that portion.

Next, FIG. 9 shows region R₉₀₁ in the frequency domain in which the data of a portion of the moving image where the projected subject is moving at a speed of about 2r₀/t₀ are distributed.

Region R₉₀₁ extends from the origin (0,0) in the direction of the point (π/r₀, −2π/t₀) and has a width of 2π/(2t₀) in the T direction.

That is, a portion moving at speed 2r₀/t₀ shifts by 2r₀ in the spatial direction each time the time advances by t₀, so its data are distributed in the frequency domain on the straight line passing through the origin (0,0) and the point (π/r₀, −2π/t₀).

If the subject moved in exact uniform linear motion at speed 2r₀/t₀ without deforming, region R₉₀₁ would have zero width in the T direction. Here, however, the width in the T direction is set to 2π/(2t₀) rather than 0 to allow for slight deformation of the subject, slight deviation of the moving speed from 2r₀/t₀, and the like.

Region R₉₀₁ is limited in the T direction to the range −2π/(2t₀) to +2π/(2t₀). This is because, as described with reference to FIG. 2, the frequency components of the moving image data do not exist in the T direction outside the range −2π/(2t₀) to +2π/(2t₀) (outside region R₂₀₁ in FIG. 2).

In the frequency domain, the data of the portion moving at speed 2r₀/t₀ are distributed along the direction (π/r₀, −2π/t₀) of the straight line passing through the origin (0,0) and the point (π/r₀, −2π/t₀), so this direction (π/r₀, −2π/t₀) is the direction of the principal component of the data of that portion.

The above has described, for the case where the subject is stationary and for the cases where the subject is moving at speeds of about (r₀/t₀)/2, r₀/t₀, and 2r₀/t₀, how the data of the portion of the moving image in which such a subject is projected are distributed in the frequency domain. The data of portions moving at other speeds are distributed in the frequency domain in the same manner.

Next, human visual effects will be described.

The human eye perceives the light emitted from an object by integrating it in the time direction. This integration function of the human eye (the visual integration function) is known as Bloch's law. The integration time of light in the human eye varies with the environment but is roughly 1/60 second.

Here, if t₀ is about 1/240 second as described above, the integration time of light in the human eye can be expressed as 4×t₀ seconds.

When a human gazes, without moving the line of sight, at a portion of the moving image in which a stationary subject is projected, the image perceived is, because of the visual integration function, equivalent to the image obtained by filtering the data of the stationary portion in the time direction with a low-pass filter of about 4×t₀ seconds.

The passband of this low-pass filter can be represented in the frequency domain as region R₁₀₀₁ shown in FIG. 10.

Region R₁₀₀₁ in FIG. 10 spans, in the X and Y directions, the range −2π/(2r₀) to +2π/(2r₀), the same as region R₂₀₁ in FIG. 2, and, in the T direction, the range −(2π/(4t₀))/2 to (2π/(4t₀))/2. The width of region R₁₀₀₁ in the T direction is 2π/(4t₀), which represents integration over a time of 4t₀.
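The link between integration over 4t₀ and a passband of width 2π/(4t₀) can be illustrated by modelling the integration as a plain average of four consecutive frames. This box-average model and the name `box_average_gain` are assumptions made for illustration; the text itself treats the passband as an ideal band.

```python
import cmath
import math

def box_average_gain(omega, t0, taps=4):
    """Magnitude response of averaging `taps` consecutive frames spaced t0 apart."""
    total = sum(cmath.exp(-1j * omega * k * t0) for k in range(taps))
    return abs(total) / taps

t0 = 1.0 / 240.0                       # frame period used in the text
band_width = 2 * math.pi / (4 * t0)    # T-direction width of the passband

for label, omega in [("centre", 0.0),
                     ("band edge", band_width / 2),
                     ("first null", band_width)]:
    print(label, round(box_average_gain(omega, t0), 3))
```

In this model the gain is 1 at T = 0, falls to about 0.65 at the band edge ±(2π/(4t₀))/2, and reaches zero at T = 2π/(4t₀), so the ideal band in FIG. 10 roughly corresponds to the main lobe of the averaging filter.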

As shown in FIG. 3, the data of the stationary portion of the moving image exist within region R₃₀₁, but a human can perceive only the information (data) of region R₃₀₁ that lies within region R₁₀₀₁. Therefore, the information of region R₃₀₁ outside region R₁₀₀₁ is useless to a human.

Next, consider a portion of the moving image in which the projected subject is moving at a speed of about (r₀/t₀)/2. The image that a human perceives by gazing at this portion while moving the line of sight in the direction in which the subject moves, that is, the image perceived by tracking vision, is, because of the visual integration function, equivalent to the image obtained by filtering the data of the portion moving at about (r₀/t₀)/2 in the time direction with a low-pass filter of about 4×t₀ seconds.

In this tracking vision, however, the gaze point moves at a speed of about (r₀/t₀)/2 as time passes, so the passband of the low-pass filter is represented in the frequency domain by region R₁₁₀₁ shown in FIG. 11.

Like region R₄₀₁ in FIG. 4, region R₁₁₀₁ in FIG. 11 extends from the origin (0,0) in the direction of the point (π/r₀, −π/(2t₀)). Region R₁₁₀₁ has a width of 2π/(4t₀) in the T direction, which represents integration over a time of 4t₀. Region R₁₁₀₁ is confined within region R₂₀₁, shown in FIG. 2, in which the moving image data exist.
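Why the passband tilts when the gaze follows the subject can be seen from a change of coordinates. This is a sketch under notation that is not in the original text: x' is the position relative to the moving gaze point, g is the moving image as seen in gaze coordinates, v is the gaze speed, W = 2π/(4t₀) is the passband width, and hats denote Fourier transforms.

```latex
\begin{aligned}
g(x', t) &= \mathrm{Func}(x' + vt,\; t), \\
\hat{G}(X, T') &= \iint \mathrm{Func}(x' + vt,\, t)\, e^{-i(Xx' + T't)}\, dx'\, dt
               = \hat{F}(X,\; T' - vX), \\
|T'| \le \tfrac{W}{2} \;&\Longleftrightarrow\; |T + vX| \le \tfrac{W}{2},
\qquad T = T' - vX .
\end{aligned}
```

The temporal low-pass filter, which is centred on T' = 0 in gaze coordinates, is therefore centred on the line T = −vX in the coordinates of the moving image. For v = (r₀/t₀)/2 this line passes through (π/r₀, −π/(2t₀)), the direction stated for region R₁₁₀₁.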

As shown in FIG. 4, the data of the portion moving at about (r₀/t₀)/2 exist within region R₄₀₁, but a human can perceive only the information of region R₄₀₁ that lies within region R₁₁₀₁. Therefore, the information of region R₄₀₁ outside region R₁₁₀₁ is useless to a human.

Next, for a portion of the moving image in which the projected subject is moving at a speed of about r₀/t₀, the image that a human perceives by tracking vision is likewise, because of the visual integration function, equivalent to the image obtained by filtering the data of the portion moving at about r₀/t₀ in the time direction with a low-pass filter of about 4×t₀ seconds.

In this tracking vision, however, the gaze point moves at a speed of about r₀/t₀ as time passes, so the passband of the low-pass filter is represented in the frequency domain by region R₁₂₀₁ shown in FIG. 12.

Like region R₈₀₁ in FIG. 8, region R₁₂₀₁ in FIG. 12 extends from the origin (0,0) in the direction of the point (π/r₀, −π/t₀). Region R₁₂₀₁ has a width of 2π/(4t₀) in the T direction, which represents integration over a time of 4t₀. Region R₁₂₀₁ is confined within region R₂₀₁, shown in FIG. 2, in which the moving image data exist.

As shown in FIG. 8, the data of the portion moving at about r₀/t₀ exist within region R₈₀₁, but a human can perceive only the information of region R₈₀₁ that lies within region R₁₂₀₁. Therefore, the information of region R₈₀₁ outside region R₁₂₀₁ is useless to a human.

Next, for a portion of the moving image in which the projected subject is moving at a speed of about 2r₀/t₀, the image that a human perceives by tracking vision is likewise, because of the visual integration function, equivalent to the image obtained by filtering the data of the portion moving at about 2r₀/t₀ in the time direction with a low-pass filter of about 4×t₀ seconds.

In this tracking vision, however, the gaze point moves at a speed of about 2r₀/t₀ as time passes, so the passband of the low-pass filter is represented in the frequency domain by region R₁₃₀₁ shown in FIG. 13.

Like region R₉₀₁ in FIG. 9, region R₁₃₀₁ in FIG. 13 extends from the origin (0,0) in the direction of the point (π/r₀, −2π/t₀). Region R₁₃₀₁ has a width of 2π/(4t₀) in the T direction, which represents integration over a time of 4t₀. Region R₁₃₀₁ is confined within region R₂₀₁, shown in FIG. 2, in which the moving image data exist.

As shown in FIG. 9, the data of the portion moving at about 2r₀/t₀ exist within region R₉₀₁, but a human can perceive only the information of region R₉₀₁ that lies within region R₁₃₀₁. Therefore, the information of region R₉₀₁ outside region R₁₃₀₁ is useless to a human.

Next, the case where a human performs fixed vision will be described. Here, fixed vision, unlike tracking vision, refers to the case where the line of sight moves differently from the subject; for example, it applies when a human watches the moving image without moving the line of sight (always keeping the viewing direction constant).

For a portion of the moving image in which the projected subject is moving at a speed of about (r₀/t₀)/2, the image that a human perceives by fixed vision is, because of the visual integration function, equivalent to the image obtained by filtering the data of the portion moving at about (r₀/t₀)/2 in the time direction with a low-pass filter of about 4×t₀ seconds.

The passband of this low-pass filter is represented in the frequency domain by the same region as when gazing at a stationary portion, that is, region R₁₀₀₁ in FIG. 10.

FIG. 14 shows, in the frequency domain, region R₄₀₁ (FIG. 4) in which the data of the portion moving at about (r₀/t₀)/2 are distributed, and region R₁₀₀₁ (FIG. 10), the passband of the low-pass filter when a human performs fixed vision.

A human can perceive only the information of region R₄₀₁ that lies within region R₁₀₀₁, that is, the information within region R₁₄₀₁ where regions R₄₀₁ and R₁₀₀₁ overlap. Therefore, the information of region R₄₀₁ outside region R₁₀₀₁ is useless to a human.

In FIG. 14, region R₁₀₀₁, the passband of the low-pass filter when a human performs fixed vision, contains regions R₁₄₀₂ and R₁₄₀₃ (hatched in the figure) that do not overlap region R₄₀₁ in which the moving image data exist; in the present case, no moving image data exist in regions R₁₄₀₂ and R₁₄₀₃.

As described above, when a human views with fixed vision a portion moving at about (r₀/t₀)/2, the line of sight moves differently from the moving subject, so even a somewhat blurred image is perceived as a good moving image.

Therefore, even if the data within region R₁₄₀₁ in FIG. 14 are filtered with the low-pass filter for a gaze point moving at about (r₀/t₀)/2, that is, a low-pass filter whose passband is region R₁₁₀₁ in FIG. 11, the result is perceived as a moving image of good quality.

From the above, as shown in FIG. 15, as long as the data within region R₁₅₀₁ (filled in black in the figure), the part of region R₁₄₀₁ that overlaps region R₁₁₀₁, are available, the portion moving at about (r₀/t₀)/2 is perceived as a moving image of good quality.

Therefore, when a human views with fixed vision a portion of the moving image moving at about (r₀/t₀)/2, the data of that portion, that is, the data within region R₄₀₁, that lie outside region R₁₅₀₁ are useless (their absence hardly affects the image quality perceived by a human). Region R₁₅₀₁ is the overlap of region R₁₀₀₁, the passband of the low-pass filter for fixed vision, and region R₁₁₀₁, the passband of the low-pass filter for a gaze point moving at about (r₀/t₀)/2.
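The chain of overlaps R₄₀₁ → R₁₄₀₁ → R₁₅₀₁ can be reproduced with set operations on a sampled frequency grid. The sketch below uses normalized coordinates a = X·r₀/π and b = T·t₀/π so that exact fractions can be used. It assumes that each region is a strip of constant T-width centred on its motion line, and that R₄₀₁ has the T-width 2π/(2t₀) stated for R₈₀₁ and R₉₀₁ (FIG. 4 is described outside this passage); the helper `in_band` is hypothetical.

```python
from fractions import Fraction

def in_band(a, b, sigma, half_width):
    """(a, b) are X and T in units of pi/r0 and pi/t0.

    The strip is centred on the motion line b = -sigma * a, where sigma is the
    displacement per frame in units of r0, and is clipped to |a|, |b| <= 1.
    """
    return abs(a) <= 1 and abs(b) <= 1 and abs(b + sigma * a) <= half_width

sigma = Fraction(1, 2)                            # subject moves r0/2 per frame
grid = [Fraction(k, 8) for k in range(-8, 9)]
points = [(a, b) for a in grid for b in grid]

r401 = {p for p in points if in_band(*p, sigma, Fraction(1, 2))}          # data (assumed width)
r1001 = {p for p in points if in_band(*p, Fraction(0), Fraction(1, 4))}   # fixed-vision passband
r1101 = {p for p in points if in_band(*p, sigma, Fraction(1, 4))}         # tracking passband

r1401 = r401 & r1001          # what fixed vision can perceive
r1501 = r1401 & r1101         # what is still needed for good quality
print(len(r401), len(r1401), len(r1501))
print(r1501 == (r1001 & r1101), r1501 <= r1101)
```

Under these assumptions R₁₅₀₁ coincides with the overlap of the two passbands R₁₀₀₁ and R₁₁₀₁ and is contained in R₁₁₀₁, which matches the description of FIG. 15.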

Next, for a portion of the moving image in which the projected subject is moving at a speed of about r₀/t₀, the image that a human perceives by fixed vision is, because of the visual integration function, equivalent to the image obtained by filtering the data of the portion moving at about r₀/t₀ in the time direction with a low-pass filter of about 4×t₀ seconds.

FIG. 16 shows, in the frequency domain, region R₈₀₁ (FIG. 8) in which the data of the portion moving at about r₀/t₀ are distributed, and region R₁₀₀₁ (FIG. 10), the passband of the low-pass filter when a human performs fixed vision.

A human can perceive only the information of region R₈₀₁ that lies within region R₁₀₀₁, that is, the information within region R₁₆₀₁ where regions R₈₀₁ and R₁₀₀₁ overlap. Therefore, the information of region R₈₀₁ outside region R₁₀₀₁ is useless to a human.

In FIG. 16, region R₁₀₀₁, the passband of the low-pass filter when a human performs fixed vision, contains regions R₁₆₀₂ and R₁₆₀₃ (hatched in the figure) that do not overlap region R₈₀₁ in which the moving image data exist; in the present case, no moving image data exist in regions R₁₆₀₂ and R₁₆₀₃.

As described above, when a human views with fixed vision a portion moving at about r₀/t₀, the line of sight moves differently from the moving subject, so even a somewhat blurred image is perceived as a good moving image.

Therefore, even if the data within region R₁₆₀₁ in FIG. 16 are filtered with the low-pass filter for a gaze point moving at about r₀/t₀, that is, a low-pass filter whose passband is region R₁₂₀₁ in FIG. 12, the result is perceived as a moving image of good quality.

From the above, as shown in FIG. 17, as long as the data within region R₁₇₀₁ (filled in black in the figure), the part of region R₁₆₀₁ that overlaps region R₁₂₀₁, are available, the portion moving at about r₀/t₀ is perceived as a moving image of good quality.

Therefore, when a human views with fixed vision a portion of the moving image moving at about r₀/t₀, the data of that portion, that is, the data within region R₈₀₁, that lie outside region R₁₇₀₁ are useless. Region R₁₇₀₁ is the overlap of region R₁₀₀₁, the passband of the low-pass filter for fixed vision, and region R₁₂₀₁, the passband of the low-pass filter for a gaze point moving at about r₀/t₀.

Next, for a portion of the moving image in which the projected subject is moving at a speed of about 2r₀/t₀, the image that a human perceives by fixed vision is, because of the visual integration function, equivalent to the image obtained by filtering the data of the portion moving at about 2r₀/t₀ in the time direction with a low-pass filter of about 4×t₀ seconds.

FIG. 18 shows, in the frequency domain, region R₉₀₁ (FIG. 9) in which the data of the portion moving at about 2r₀/t₀ are distributed, and region R₁₀₀₁ (FIG. 10), the passband of the low-pass filter when a human performs fixed vision.

A human can perceive only the information of region R₉₀₁ that lies within region R₁₀₀₁, that is, the information within region R₁₈₀₁ where regions R₉₀₁ and R₁₀₀₁ overlap. Therefore, the information of region R₉₀₁ outside region R₁₀₀₁ is useless to a human.

In FIG. 18, region R₁₀₀₁, the passband of the low-pass filter when a human performs fixed vision, contains regions R₁₈₀₂ and R₁₈₀₃ (hatched in the figure) that do not overlap region R₉₀₁ in which the moving image data exist; in the present case, no moving image data exist in regions R₁₈₀₂ and R₁₈₀₃.

As described above, when a human views with fixed vision a portion moving at about 2r₀/t₀, the line of sight moves differently from the moving subject, so even a somewhat blurred image is perceived as a good moving image.

Therefore, even if the data within region R₁₈₀₁ in FIG. 18 are filtered with the low-pass filter for a gaze point moving at about 2r₀/t₀, that is, a low-pass filter whose passband is region R₁₃₀₁ in FIG. 13, the result is perceived as a moving image of good quality.

From the above, as shown in FIG. 19, as long as the data within region R₁₉₀₁ (filled in black in the figure), the part of region R₁₈₀₁ that overlaps region R₁₃₀₁, are available, the portion moving at about 2r₀/t₀ is perceived as a moving image of good quality.

Therefore, when a human views with fixed vision a portion of the moving image moving at about 2r₀/t₀, the data of that portion, that is, the data within region R₉₀₁, that lie outside region R₁₉₀₁ are useless. Region R₁₉₀₁ is the overlap of region R₁₀₀₁, the passband of the low-pass filter for fixed vision, and region R₁₃₀₁, the passband of the low-pass filter for a gaze point moving at about 2r₀/t₀.

The ranges (regions) in the frequency domain that a human can perceive in fixed vision and in tracking vision, as described above, are summarized in FIGS. 20 to 23.

That is, FIG. 20 shows the region containing the data that a human can perceive for a portion of the moving image in which a stationary object (subject) is projected.

For a portion of the moving image in which a stationary object is projected, the region in the frequency domain of the data that a human can perceive is, as shown in FIG. 20, region R₁₀₀₁ shown in FIG. 10, and data outside region R₁₀₀₁ are useless.

Next, FIG. 21 shows the region containing the data that a human can perceive for a portion of the moving image in which the projected subject is moving at a speed of about (r₀/t₀)/2.

For a portion of the moving image moving at about (r₀/t₀)/2, the region in the frequency domain of the data that a human can perceive by tracking vision is region R₁₁₀₁ shown in FIG. 11. The region in the frequency domain of the data that can be perceived by fixed vision is region R₁₅₀₁ shown in FIG. 15, and this region R₁₅₀₁ is contained in region R₁₁₀₁ shown in FIG. 11.

Therefore, for a portion moving at about (r₀/t₀)/2, the region in the frequency domain of the data that a human can perceive is, as shown in FIG. 21, region R₁₁₀₁ shown in FIG. 11, and data outside region R₁₁₀₁ are useless.

Next, FIG. 22 shows the region containing the data that a human can perceive for a portion of the moving image in which the projected subject is moving at a speed of about r₀/t₀.

For a portion of the moving image moving at about r₀/t₀, the region in the frequency domain of the data that a human can perceive by tracking vision is region R₁₂₀₁ shown in FIG. 12. The region in the frequency domain of the data that can be perceived by fixed vision is region R₁₇₀₁ shown in FIG. 17, and this region R₁₇₀₁ is contained in region R₁₂₀₁ shown in FIG. 12.

従って、速度「r₀／t₀」程度で動いている部分について、人間が認識することができるデータの周波数ドメイン上の領域は、図２２に示すように、図１２に示した領域R₁₂₀₁であり、領域R₁₂₀₁以外にあるデータは、無駄である。 Therefore, as shown in FIG. 22, the region on the frequency domain of the data that can be recognized by humans for the portion moving at the speed “r ₀ / t ₀ ” is a region R ₁₂₀₁ shown in FIG. _Yes , the data outside the area R ₁₂₀₁ is useless.

Next, FIG. 23 shows the region containing the data that a human can recognize for a portion of a moving image in which the projected subject moves at a speed of about "2r₀/t₀".

For a portion of the moving image moving at about "2r₀/t₀", the frequency-domain region of the data that a human can recognize by following vision is the region R₁₃₀₁ shown in FIG. 13. The frequency-domain region of the data that can be recognized by fixed vision is the region R₁₉₀₁ shown in FIG. 19, and R₁₉₀₁ is contained in the region R₁₃₀₁ shown in FIG. 13.

Therefore, for a portion moving at about "2r₀/t₀", the frequency-domain region of the data that a human can recognize is, as shown in FIG. 23, the region R₁₃₀₁ of FIG. 13, and data outside R₁₃₀₁ is useless.

In the cases above, the T-direction width of the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, and the region R₁₃₀₁ of FIG. 23 was set to 2π/(4t₀). This width only needs to correspond to the integration time of the visual integration function, and it is not limited to 2π/(4t₀).

The same holds for the data of portions of the moving image that move at other speeds.

As described above, by using a human visual effect (visual characteristic), namely the integration function of vision, moving image data can be classified in the frequency domain into data in the region that a human can recognize and other, useless data.

If only the data in the region that a human can recognize is extracted from the moving image data in the frequency domain, that is, if the other useless data is discarded, the moving image data can be compressed without degrading the image quality as recognized by a human viewing the moving image.

FIG. 24 shows a configuration example of an image processing system that compresses moving image data using the integration function of vision and decodes (decompresses) the moving image data so compressed.

In FIG. 24, the transmission device 1 is supplied with moving image data having a high frame rate, for example 1/t₀ = 240 fps. The transmission device 1 filters the supplied moving image data with a filter (a band-pass filter or band-limiting filter) whose passband is, for example, the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, or the region R₁₃₀₁ of FIG. 23, and outputs the resulting data in the frequency-domain region that a human can recognize. The data output by the transmission device 1 is recorded on a recording medium 11 such as a flexible disk, CD-ROM (Compact Disc Read Only Memory), MO (Magneto Optical) disk, DVD (Digital Versatile Disc), magnetic disk, or semiconductor memory, or is transmitted via a transmission medium 12 such as a telephone line, terrestrial broadcast, satellite link, the Internet, or a wired or wireless LAN (Local Area Network).

The receiving device 2 is supplied with data reproduced from the recording medium 11 or data transmitted via the transmission medium 12. The receiving device 2 receives this data, applies predetermined processing to it, and supplies the resulting moving image data to the display device 3, which is composed of, for example, a CRT (Cathode Ray Tube) or an LCD (Liquid Crystal Display), for display.

In FIG. 24, the transmission device 1, the receiving device 2, and the display device 3 can each be configured as a physically independent device. Alternatively, the transmission device 1 and the receiving device 2 can be configured as physically one device, as indicated by the dotted outline in FIG. 24. Furthermore, the transmission device 1, the receiving device 2, and the display device 3 together, or the receiving device 2 and the display device 3, can be configured as physically one device.

Next, FIG. 25 shows a first configuration example of the transmission device 1 of FIG. 24.

The buffer unit 21 receives the high frame rate moving image data supplied to the transmission device 1 and stores it sequentially.

The filter unit 22 reads the moving image data stored in the buffer unit 21 as needed, filters it according to the filter information supplied from the filter generation unit 23 described later, and supplies the filtered moving image data to the encoding unit 24.

The filter generation unit 23 reads the moving image data stored in the buffer unit 21 as needed, generates filter information, which is information on the filter used to filter each portion of the moving image data, and supplies it to the filter unit 22.

Specifically, the filter generation unit 23 is composed of a principal component direction acquisition unit 31 and a filter information supply unit 32.

For each portion of the moving image data read from the buffer unit 21, the principal component direction acquisition unit 31 acquires the principal component direction, that is, the direction of the principal component in the frequency domain, and supplies information on that direction to the filter information supply unit 32.

According to the principal component direction information from the principal component direction acquisition unit 31, the filter information supply unit 32 determines as the filter passband a region in the frequency domain that extends in the principal component direction and has a specific width, for example 2π/(4t₀), in the direction of the temporal frequency axis T. Examples are the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, and the region R₁₃₀₁ of FIG. 23. The unit outputs this passband to the filter unit 22 as filter information.

The encoding unit 24 encodes the moving image data supplied from the filter unit 22 by a predetermined encoding method such as MPEG (Moving Picture Experts Group) 1 or 2, and outputs the resulting encoded data. This encoded data is recorded on the recording medium 11 of FIG. 24 or transmitted via the transmission medium 12.

Next, the processing of the transmission device 1 of FIG. 25 is described with reference to the flowchart of FIG. 26.

The buffer unit 21 is supplied with, and sequentially stores, moving image data whose frame rate 1/t₀ is a high frame rate of, for example, 240 fps.

In step S1, the filter generation unit 23 reads in the moving image data stored in the buffer unit 21, and the process proceeds to step S2.

In step S2, the filter generation unit 23 (its principal component direction acquisition unit 31 and filter information supply unit 32) performs the "process for obtaining the passband of necessary information", described later with reference to FIG. 31. In this process, for each portion of the moving image data input in step S1, the region in the frequency domain that a human can recognize, for example the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, or the region R₁₃₀₁ of FIG. 23, is obtained as the filter passband, and that passband is supplied to the filter unit 22 as filter information. The process then proceeds to step S3.

In step S3, the filter unit 22 reads from the buffer unit 21 the data needed for filtering with the filter described by the filter information supplied from the filter generation unit 23 in step S2. It applies the filter with the passband represented by that filter information to the data while downsampling, thinning the number of samples in the time direction to 1/4.

That is, while filtering the data read from the buffer unit 21 with the filter whose passband is represented by the filter information supplied in step S2, the filter unit 22 obtains low frame rate moving image data whose temporal sampling interval, that is, frame period, is 4t₀, four times that of the original moving image data. It outputs this data to the encoding unit 24, and the process proceeds to step S4.

In step S4, the encoding unit 24 encodes the moving image data from the filter unit 22 and outputs the resulting encoded data.

Steps S1 through S4 are performed for the data of every portion of the moving image stored in the buffer unit 21.

The downsampling in step S3 outputs a filtering result for one frame out of every four frames of the moving image data stored in the buffer unit 21. It is therefore sufficient, for every four frames of the stored moving image data, to filter only one of those four frames with the filter whose passband is represented by the filter information supplied from the filter generation unit 23.
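The filter-and-decimate behavior of step S3 can be sketched for the simplest case, a stationary portion. The following is a minimal illustration, not the patent's filter: the function name `box_filter_and_decimate` is hypothetical, and the plain average over four frames is an assumed, crude temporal low-pass standing in for a passband of T-direction width 2π/(4t₀). A moving portion would need a filter oriented along its principal component direction instead.

```python
from typing import List

Frame = List[List[float]]  # one frame as rows of pixel values


def box_filter_and_decimate(frames: List[Frame], factor: int = 4) -> List[Frame]:
    """Average each group of `factor` consecutive frames and emit one frame per group.

    Only one output frame is computed per group, which mirrors the remark
    that filtering is needed for one frame out of every four.
    """
    out: List[Frame] = []
    for start in range(0, len(frames) - factor + 1, factor):
        group = frames[start:start + factor]
        height, width = len(group[0]), len(group[0][0])
        averaged = [
            [sum(f[y][x] for f in group) / factor for x in range(width)]
            for y in range(height)
        ]
        out.append(averaged)
    return out


# Eight 1x2-pixel frames at 240 fps become two frames at 60 fps.
frames_240 = [[[float(i), 10.0]] for i in range(8)]
frames_60 = box_filter_and_decimate(frames_240)
```

With 1/t₀ = 240 fps, the output frame period is 4t₀, that is, 60 fps, and the number of frames drops to one quarter.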

As described above, the moving image data is filtered with a filter whose passband is the region in the frequency domain that a human can recognize, for example the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, or the region R₁₃₀₁ of FIG. 23, and is then downsampled. The moving image data can therefore be reduced in a way that takes the visual effect of the human eye into account.

That is, suppose that the frequency-domain region a human can visually recognize is, as shown in FIG. 2, the region R₂₀₁ centered on the origin that measures 2 × 2π/(2r₀) in the X and Y directions and 2 × 2π/(2t₀) in the T direction. Then the highest-quality moving image a human can recognize has a frame rate of 1/t₀ and a spatial pixel pitch of r₀.

If the visual effects described with reference to FIGS. 10 to 23 are not taken into account and the frame rate is lowered below 1/t₀, for example by downsampling the moving image in the time direction, a person viewing the displayed moving image recognizes a deterioration in image quality.

In contrast, the transmission device 1 performs processing that takes the visual effects described with reference to FIGS. 10 to 23 into account. That is, it filters the moving image data with a filter whose passband is a region that a human can recognize in the frequency domain, for example the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, or the region R₁₃₀₁ of FIG. 23. As a result, even if the frame rate of the moving image is lowered to 1/(4t₀), that is, even if the amount of moving image data is reduced to 1/4, a moving image can be displayed in which a human perceives no deterioration in image quality.

FIGS. 27 to 30 show the frequency-domain distribution of the low frame rate moving image data (frame rate 1/(4t₀)) obtained by filtering high frame rate moving image data (frame rate 1/t₀) with a filter whose passband is the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, and the region R₁₃₀₁ of FIG. 23, respectively, while downsampling to 1/4 in the time direction.

First, the data of a stationary portion of the moving image is filtered in the filter unit 22 by a filter whose passband is the region R₁₀₀₁ of FIG. 20. The filtered moving image data therefore exists only within R₁₀₀₁.

When the moving image data in R₁₀₀₁ is downsampled to 1/4 in the time direction, it becomes data sampled at intervals of r₀ in the spatial directions x and y and at intervals of 4 × t₀ in the time direction t.

Consequently, as shown in FIG. 27, aliasing components arise in the frequency domain at intervals of 2π/r₀ in the X and Y directions and at intervals of 2π/(4t₀) in the T direction.

However, as described with reference to FIG. 10, R₁₀₀₁ measures 2π/r₀ in the X and Y directions and 2π/(4t₀) in the T direction, so the aliasing components do not overlap one another.

In FIG. 27, the shaded portions are where the downsampled moving image data exists.

Next, the data of a portion of the moving image in which the projected subject moves at about "(r₀/t₀)/2" is filtered in the filter unit 22 by a filter whose passband is the region R₁₁₀₁ of FIG. 21. The filtered moving image data therefore exists only within R₁₁₀₁.

When the moving image data in R₁₁₀₁ is downsampled to 1/4 in the time direction, it becomes data sampled at intervals of r₀ in the spatial directions x and y and at intervals of 4 × t₀ in the time direction t.

Consequently, as shown in FIG. 28, aliasing components arise in the frequency domain at intervals of 2π/r₀ in the X and Y directions and at intervals of 2π/(4t₀) in the T direction.

However, as described with reference to FIG. 11, R₁₁₀₁ is the region that has a width of 2π/(4t₀) in the T direction along the straight line connecting the origin (0, 0) and the point (π/r₀, -π/(2t₀)), and that lies within -(π/r₀) to +(π/r₀) in the X and Y directions and -(π/t₀) to +(π/t₀) in the T direction. The aliasing components therefore do not overlap one another.

In FIG. 28, the shaded portions are where the downsampled moving image data exists.

Next, the data of a portion of the moving image in which the projected subject moves at about "r₀/t₀" is filtered in the filter unit 22 by a filter whose passband is the region R₁₂₀₁ of FIG. 22. The filtered moving image data therefore exists only within R₁₂₀₁.

When the moving image data in R₁₂₀₁ is downsampled to 1/4 in the time direction, it becomes data sampled at intervals of r₀ in the spatial directions x and y and at intervals of 4 × t₀ in the time direction t.

Consequently, as shown in FIG. 29, aliasing components arise in the frequency domain at intervals of 2π/r₀ in the X and Y directions and at intervals of 2π/(4t₀) in the T direction.

However, as described with reference to FIG. 12, R₁₂₀₁ is the region that has a width of 2π/(4t₀) in the T direction along the straight line connecting the origin (0, 0) and the point (π/r₀, -2π/(2t₀)), and that lies within -(π/r₀) to +(π/r₀) in the X and Y directions and -(π/t₀) to +(π/t₀) in the T direction. The aliasing components therefore do not overlap one another.

In FIG. 29, the shaded portions are where the downsampled moving image data exists.

Next, the data of a portion of the moving image in which the projected subject moves at about "2r₀/t₀" is filtered in the filter unit 22 by a filter whose passband is the region R₁₃₀₁ of FIG. 23. The filtered moving image data therefore exists only within R₁₃₀₁.

When the moving image data in R₁₃₀₁ is downsampled to 1/4 in the time direction, it becomes data sampled at intervals of r₀ in the spatial directions x and y and at intervals of 4 × t₀ in the time direction t.

Consequently, as shown in FIG. 30, aliasing components arise in the frequency domain at intervals of 2π/r₀ in the X and Y directions and at intervals of 2π/(4t₀) in the T direction.

However, as described with reference to FIG. 13, R₁₃₀₁ is the region that has a width of 2π/(4t₀) in the T direction along the straight line connecting the origin (0, 0) and the point (π/r₀, -2π/t₀), and that lies within -(π/r₀) to +(π/r₀) in the X and Y directions and -(π/t₀) to +(π/t₀) in the T direction. The aliasing components therefore do not overlap one another.

In FIG. 30, the shaded portions are where the downsampled moving image data exists.

As shown in FIGS. 27 to 30, the aliasing components of the downsampled moving image data do not overlap one another in the frequency domain. This means that the original data, that is, the data within the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, or the region R₁₃₀₁ of FIG. 23, can be extracted from the downsampled moving image data. In other words, the downsampled moving image data accurately retains the necessary information (the information a human can recognize) given human visual characteristics.
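The non-overlap argument can be checked numerically on a reduced model with one spatial frequency axis X. This is a sketch under stated assumptions: r₀ = t₀ = 1 in arbitrary units, the band is taken to be centered on the line T = -speed · X (the text only gives its T-direction width), band edges are treated as half-open so that touching edges do not count as overlap, and all function names are hypothetical.

```python
import math

R0 = 1.0  # pixel pitch r0, arbitrary units (assumed value)
T0 = 1.0  # frame period t0, arbitrary units (assumed value)
HALF_WIDTH = math.pi / (4 * T0)  # half of the T-direction width 2*pi/(4*t0)


def in_region(X, T, speed):
    """True if (X, T) is in the band around the line T = -speed * X.

    Half-open bounds are used so that touching band edges are not
    counted as an overlap.
    """
    if not -math.pi / R0 <= X < math.pi / R0:
        return False
    if not -math.pi / T0 <= T < math.pi / T0:
        return False
    return -HALF_WIDTH <= T + speed * X < HALF_WIDTH


def replica_count(X, T, speed, max_shift=8):
    """Count copies of the band, shifted by 2*pi/r0 in X and 2*pi/(4*t0) in T, covering (X, T)."""
    dX = 2 * math.pi / R0
    dT = 2 * math.pi / (4 * T0)
    return sum(
        in_region(X - m * dX, T - k * dT, speed)
        for m in (-1, 0, 1)
        for k in range(-max_shift, max_shift + 1)
    )


def max_overlap(speed, steps=41):
    """Largest number of replicas covering any sampled point of one spectral period."""
    xs = [-math.pi / R0 + (i + 0.5) * (2 * math.pi / R0) / steps for i in range(steps)]
    ts = [-math.pi / T0 + (j + 0.5) * (2 * math.pi / T0) / steps for j in range(steps)]
    return max(replica_count(x, t, speed) for x in xs for t in ts)


# Speeds in units of r0/t0: stationary, (r0/t0)/2, r0/t0 and 2*r0/t0.
overlaps = {speed: max_overlap(speed) for speed in (0.0, 0.5, 1.0, 2.0)}
```

For each of the four speeds, no sampled point is covered by more than one replica, which is the condition under which the band can be separated again from the downsampled data.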

To decode (restore) the downsampled moving image data into the moving image data before downsampling, information is needed on which filter was applied to the data of each portion of the moving image.

For this purpose, in the transmission device 1 of FIG. 25, the filter information supply unit 32 can supply the encoding unit 24 with information on the passband used as the filter information of the filter applied to the data of each portion, for example the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, or the region R₁₃₀₁ of FIG. 23, or with information on the principal component directions of those regions R₁₀₀₁, R₁₁₀₁, R₁₂₀₁, and R₁₃₀₁. The encoding unit 24 can then multiplex that information into the encoded data and output it.

Next, the "process for obtaining the passband of necessary information" performed by the filter generation unit 23 in step S2 of FIG. 26 is described with reference to the flowchart of FIG. 31.

First, in step S11, the principal component direction acquisition unit 31 of the filter generation unit 23 (FIG. 25) acquires, for each portion of the moving image data input in step S1 of FIG. 26, the principal component direction in the frequency domain defined by the frequency axes X and Y of the spatial directions x and y and the frequency axis T of the time direction t. It supplies the direction to the filter information supply unit 32, and the process proceeds to step S12. The principal component direction can be obtained, for example, by performing a three-dimensional Fourier transform over the spatial directions x and y and the time direction t.

In step S12, the filter information supply unit 32 determines as the filter passband the region in the frequency domain that extends from the origin (0, 0) in the principal component direction supplied by the principal component direction acquisition unit 31, has a width of 2π/(4 × t₀) in the T direction, and lies within -(π/r₀) to +(π/r₀) in the X and Y directions and -(π/t₀) to +(π/t₀) in the T direction. Examples are the region R₁₀₀₁ of FIG. 20, the region R₁₁₀₁ of FIG. 21, the region R₁₂₀₁ of FIG. 22, and the region R₁₃₀₁ of FIG. 23. The process then proceeds to step S13.
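The passband determined in step S12 can be written as a membership test on a frequency point (X, Y, T). The sketch below is illustrative only: `make_passband` is a hypothetical name, the principal component direction is parametrized by an assumed apparent velocity (vx, vy) of the portion so that the principal plane is T = -(vx·X + vy·Y), and the band is assumed to be centered on that plane.

```python
import math
from typing import Callable


def make_passband(vx: float, vy: float, r0: float, t0: float) -> Callable[[float, float, float], bool]:
    """Return a predicate telling whether (X, Y, T) lies in the passband.

    vx, vy: assumed apparent velocity of the portion (pixels per unit time).
    r0, t0: pixel pitch and frame period of the original moving image.
    """
    half_width = math.pi / (4 * t0)  # half of the T-direction width 2*pi/(4*t0)

    def contains(X: float, Y: float, T: float) -> bool:
        # Limits of the frequency domain given in step S12.
        inside_box = (
            abs(X) <= math.pi / r0
            and abs(Y) <= math.pi / r0
            and abs(T) <= math.pi / t0
        )
        # T-direction distance from the plane T = -(vx*X + vy*Y).
        return inside_box and abs(T + vx * X + vy * Y) <= half_width

    return contains


# r0 = t0 = 1 in arbitrary units.
still = make_passband(0.0, 0.0, 1.0, 1.0)   # stationary portion
moving = make_passband(1.0, 0.0, 1.0, 1.0)  # speed r0/t0 along x
```

With these assumptions, `still` accepts low temporal frequencies at any spatial frequency, while `moving` accepts points near the line from the origin toward (π/r₀, -π/t₀).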

In step S13, the filter information supply unit 32 supplies (information representing) the filter passband obtained in step S12 to the filter unit 22 as filter information, and the "process for obtaining the passband of necessary information" ends.

With the "process for obtaining the passband of necessary information" described above, the region R₁₀₀₁ of FIG. 20 can be obtained as filter information for, for example, a portion of the moving image in which a stationary subject is projected. For a portion in which the projected subject moves at about "(r₀/t₀)/2", the region R₁₁₀₁ of FIG. 21 can be obtained as filter information. For a portion in which the projected subject moves at about "r₀/t₀", the region R₁₂₀₁ of FIG. 22 can be obtained as filter information. For a portion in which the projected subject moves at about "2r₀/t₀", the region R₁₃₀₁ of FIG. 23 can be obtained as filter information. For portions in which the subject moves at other speeds as well, the frequency-domain region that a human can recognize can be obtained as filter information.

In the present embodiment, the filter information supply unit 32 of the filter generation unit 23 (FIG. 25) obtains, as the filter passband, the frequency-domain region extending in the principal component direction supplied from the principal component direction acquisition unit 31, and supplies that passband to the filter unit 22 as filter information. Alternatively, the filter information supply unit 32 may obtain and store in advance filter information, that is, passbands of the kind described above, for each of a plurality of principal component directions. It then selects from the stored filter information the entry corresponding to the principal component direction supplied from the principal component direction acquisition unit 31 and supplies it to the filter unit 22.
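The precomputed variant amounts to a table lookup. The sketch below is only one possible reading: the table keys (speeds in units of r₀/t₀ used as a stand-in for principal component directions), the nearest-entry selection rule, and the name `select_filter` are all assumptions made for illustration, and the stored values are just labels for the regions named in the text.

```python
# Hypothetical table: speed in units of r0/t0 -> label of the stored passband.
PRECOMPUTED = {
    0.0: "R1001",  # stationary portion (FIG. 20)
    0.5: "R1101",  # about (r0/t0)/2 (FIG. 21)
    1.0: "R1201",  # about r0/t0 (FIG. 22)
    2.0: "R1301",  # about 2*r0/t0 (FIG. 23)
}


def select_filter(speed: float) -> str:
    """Pick the stored passband whose speed is closest to the measured one."""
    nearest = min(PRECOMPUTED, key=lambda s: abs(s - speed))
    return PRECOMPUTED[nearest]
```

A lookup of this kind trades passband accuracy for speed, since the passband no longer has to be constructed for every portion.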

The filter generation unit 23 can also be configured as an independent device, separate from the transmission device 1, that outputs filter information.

In step S11 of FIG. 31, the principal component direction of each portion of the moving image data is obtained by a three-dimensional Fourier transform. Because the computational cost of a three-dimensional Fourier transform is enormous, the cost of acquiring the principal component direction in step S11 is also enormous.

A method of acquiring the principal component direction in step S11 of FIG. 31 with less computation than a three-dimensional Fourier transform is therefore described with reference to the flowchart of FIG. 32.

まず最初に、ステップＳ２１において、フィルタ生成部２３（図２５）の主成分方向取得部３１は、バッファ部２１に記憶された動画データの各フレームのデータを、例えば、横×縦が１６×１６画素などのブロックに分割し、ステップＳ２２に進む。なお、ここでは、各ブロックの画像データが、上述の動画データの各部分のデータとなる。 First, in step S21, the principal component direction acquisition unit 31 of the filter generation unit 23 (FIG. 25) stores the data of each frame of the moving image data stored in the buffer unit 21, for example, horizontal × vertical is 16 × 16. Dividing into blocks such as pixels, the process proceeds to step S22. Here, the image data of each block is the data of each part of the above-described moving image data.

ここで、ブロックは、複数の画素で構成されている必要はなく、１画素であっても良い。 Here, the block does not need to be composed of a plurality of pixels, and may be one pixel.

ステップＳ２２では、主成分方向取得部３１は、ステップＳ２１で得られた各ブロックを、順次、注目ブロックとし、注目ブロックについて、その注目ブロックのフレーム（以下、適宜、注目フレームという）の次のフレームとの相関を表す相関情報を求める。さらに、ステップＳ２２では、主成分方向取得部３１は、注目ブロックの相関情報が表す相関が最も高くなる、注目フレームの次のフレーム上の空間方向の位置を求める。即ち、ステップＳ２２では、主成分方向取得部３１は、いわゆるブロックマッチング等により、注目ブロックの空間方向x,yの２次元の動きベクトルを検出する。 In step S22, the principal component direction acquisition unit 31 sequentially sets each block obtained in step S21 as a target block, and for the target block, a frame subsequent to the frame of the target block (hereinafter, referred to as a target frame as appropriate). Correlation information representing the correlation with is obtained. Further, in step S22, the principal component direction acquisition unit 31 obtains the position in the spatial direction on the frame next to the frame of interest where the correlation represented by the correlation information of the block of interest is the highest. That is, in step S22, the principal component direction acquisition unit 31 detects a two-dimensional motion vector in the spatial direction x, y of the block of interest by so-called block matching or the like.

ここで、相関情報としては、いわゆる相関係数を採用することができるが、ここでは、計算コストを考慮して、例えば、注目ブロックを、動きベクトルとの探索範囲内で空間方向x,yに、それぞれ、u,vだけずらした位置における、注目ブロックの各画素と、その画素と同一位置にある、注目フレームの次のフレームの画素との画素値の自乗誤差や差分絶対値の総和などを採用する。この場合、相関情報の「値」が最小になる空間方向の位置u,vが、相関情報が表す相関が最も高くなる、注目フレームの次のフレーム上の空間方向の位置となる。 Here, a so-called correlation coefficient could be adopted as the correlation information. In view of the computational cost, however, this example uses, for instance, the sum of squared errors or the sum of absolute differences between the pixel value of each pixel of the block of interest, shifted by u and v in the spatial directions x and y within the motion vector search range, and the pixel value of the pixel at the same position in the frame following the frame of interest. In this case, the spatial position (u, v) at which the value of the correlation information is smallest is the spatial position on the frame following the frame of interest at which the correlation represented by the correlation information is highest.

なお、相関情報が表す相関が最大（相関情報の値が最小）になる空間方向の位置u,vを、それぞれ、u₀,v₀と表し、ステップＳ２２で検出される空間方向x,yの動きベクトルを、(u₀,v₀)と表す。 Note that the spatial positions u and v at which the correlation represented by the correlation information is maximal (the value of the correlation information is minimal) are denoted u₀ and v₀, respectively, and the motion vector in the spatial directions x and y detected in step S22 is denoted (u₀, v₀).
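The block matching of step S22 can be sketched as follows. This is a minimal illustration, not the implementation of the principal component direction acquisition unit 31: it assumes frames held as lists of rows of integer pixel values, uses the sum of absolute differences as the correlation information, and the names `sad`, `block_match`, `size` and `search` are hypothetical.

```python
def sad(cur, nxt, bx, by, u, v, size):
    """Sum of absolute differences between the block at (bx, by) in `cur`
    and the block displaced by (u, v) in the next frame `nxt`."""
    total = 0
    for y in range(size):
        for x in range(size):
            total += abs(cur[by + y][bx + x] - nxt[by + v + y][bx + u + x])
    return total


def block_match(cur, nxt, bx, by, size, search):
    """Return the displacement (u0, v0) that minimises the SAD
    within a search range of +/- `search` pixels (assumed parameter)."""
    height, width = len(nxt), len(nxt[0])
    best = None
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            # Skip candidates whose displaced block would leave the frame.
            if not (0 <= bx + u and bx + u + size <= width
                    and 0 <= by + v and by + v + size <= height):
                continue
            cost = sad(cur, nxt, bx, by, u, v, size)
            # Smallest value of the correlation information = highest correlation.
            if best is None or cost < best[0]:
                best = (cost, u, v)
    return best[1], best[2]
```

An exhaustive search is used here only for clarity; any faster search strategy that returns the minimising displacement would serve the same role.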

その後、ステップＳ２２からＳ２３に進み、主成分方向取得部３１は、ステップＳ２２で検出された動きベクトル(u₀,v₀)に、元の動画データのフレーム周期t₀を、時間方向tのコンポーネントとして加えた３次元の動きベクトル(u₀,v₀,t₀)の方向と直交する方向を、主成分方向として検出し、フィルタ情報供給部３２に供給して、処理を終了する。 Thereafter, the process proceeds from step S22 to step S23, where the principal component direction acquisition unit 31 appends the frame period t₀ of the original moving image data to the motion vector (u₀, v₀) detected in step S22 as the component in the time direction t, detects the direction orthogonal to the resulting three-dimensional motion vector (u₀, v₀, t₀) as the principal component direction, supplies it to the filter information supply unit 32, and ends the process.

以上の処理によれば、ブロックの主成分方向が、その動きベクトル(u₀,v₀,t₀)に垂直な平面（の拡がり方向）であるとして検出される。 According to the above processing, the principal component direction of the block is detected as a plane (expansion direction) perpendicular to the motion vector (u ₀ , v ₀ , t ₀ ).
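As a rough illustration of step S23, the plane perpendicular to (u₀, v₀, t₀) can be represented by its unit normal and two orthonormal vectors spanning it. The sketch below is an assumption about one convenient representation; the source does not say how the principal component direction is encoded, and the function names are hypothetical.

```python
import math


def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def normalize(a):
    length = math.sqrt(dot(a, a))
    return (a[0] / length, a[1] / length, a[2] / length)


def principal_plane(u0, v0, t0):
    """Return (n, e1, e2): the unit normal along (u0, v0, t0) and two
    orthonormal vectors spanning the plane perpendicular to it.
    Assumes t0 > 0, so the motion vector is never the zero vector."""
    n = normalize((float(u0), float(v0), float(t0)))
    axes = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
    # Start from the axis least aligned with n so the cross product is well conditioned.
    helper = min(axes, key=lambda a: abs(dot(a, n)))
    e1 = normalize(cross(n, helper))
    e2 = cross(n, e1)
    return n, e1, e2
```

For a stationary block, (u₀, v₀) = (0, 0), the returned plane is spanned by the X and Y axes, that is, it has no component along the temporal frequency axis T.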

即ち、注目ブロックについて、動きベクトル(u₀,v₀,t₀)が検出された場合、理想的には、空間方向x,yおよび時間方向tで定義される３次元空間において、動きベクトル(u₀,v₀,t₀)の方向に、注目ブロックの画素値が続いているということ、つまり、注目ブロックのある点(x,y,t)における動画データの値は、mを整数として表される、x,y,tの３次元空間上の点(x+mu₀,y+mv₀,t+mt₀)における動画データの値と同一であることになる。 That is, when the motion vector (u₀, v₀, t₀) is detected for the block of interest, ideally the pixel values of the block of interest continue along the direction of the motion vector (u₀, v₀, t₀) in the three-dimensional space defined by the spatial directions x, y and the time direction t. In other words, the value of the moving image data at a point (x, y, t) in the block of interest is identical to its value at the point (x + mu₀, y + mv₀, t + mt₀) in that space, where m is an integer.

従って、時空間の注目ブロックのデータの周波数成分、即ち、時空間の注目ブロックのデータを周波数ドメイン上のデータに変換して得られるデータは、動きベクトル(u₀,v₀,t₀)に垂直な平面上にのみ存在することになる。 Therefore, the frequency components of the spatio-temporal data of the block of interest, that is, the data obtained by transforming the spatio-temporal data of the block of interest into the frequency domain, exist only on the plane perpendicular to the motion vector (u₀, v₀, t₀).
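The reason the spectrum collapses onto that plane can be sketched with a short derivation. It assumes an idealised continuous model, which the source does not state explicitly: the block translates at a constant velocity (a, b) = (u₀/t₀, v₀/t₀), so the data can be written as a fixed image g shifted over time, with G its two-dimensional Fourier transform.

```latex
% Assumption: continuous-domain model, constant velocity (a, b) = (u_0/t_0, v_0/t_0).
\begin{align*}
f(x, y, t) &= g(x - a t,\; y - b t) \\
F(X, Y, T) &= \iiint g(x - a t,\, y - b t)\, e^{-i (X x + Y y + T t)}\, dx\, dy\, dt \\
           &= G(X, Y) \int e^{-i (T + a X + b Y)\, t}\, dt \\
           &= 2\pi\, G(X, Y)\, \delta(T + a X + b Y) \\
T + a X + b Y = 0 \;&\Longleftrightarrow\; u_0 X + v_0 Y + t_0 T = 0
\end{align*}
```

The last line is the equation of the plane through the origin of the frequency domain whose normal is (u₀, v₀, t₀), which matches the plane described above.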

図３２のフローチャートにしたがった方法では、このことを利用して、動きベクトル(u₀,v₀,t₀)に垂直な平面を、主成分方向として検出している。従って、この場合、主成分方向を取得するためには、動きベクトルを検出すれば良く、３次元フーリエ変換を行う必要がないので、少ない計算量で、主成分方向を検出することができる。 In the method according to the flowchart of FIG. 32, this is used to detect a plane perpendicular to the motion vector (u ₀ , v ₀ , t ₀ ) as the principal component direction. Therefore, in this case, in order to acquire the principal component direction, it is only necessary to detect a motion vector, and it is not necessary to perform a three-dimensional Fourier transform. Therefore, the principal component direction can be detected with a small amount of calculation.

なお、相関情報としては、例えば、相関係数を採用することができる。この場合、相関情報の値（相関係数）が最大の場合が、その相関情報が表す相関が最大の場合となる。 As the correlation information, for example, a correlation coefficient can be adopted. In this case, the correlation information value (correlation coefficient) is the maximum when the correlation represented by the correlation information is the maximum.

また、送信装置１（図２５）のエンコード部２４では、フィルタ部２２からの動画データを、特にエンコードせずに、そのまま出力することもできる。 Further, the encoding unit 24 of the transmission device 1 (FIG. 25) can output the moving image data from the filter unit 22 as it is without encoding.

次に、図３３は、図２４の受信装置２の構成例を示している。 Next, FIG. 33 shows a configuration example of the receiving device 2 of FIG. 24.

図２５の送信装置１が出力するエンコードデータは、フレームレートが1/(4t₀)の低フレームレートの動画データをエンコードしたものであり、この低フレームレートの動画データを、そのまま、その低フレームレートと同一のフレームレートを有する表示装置に供給して表示させたのでは、本来表示されるべき周波数成分の他、図２８乃至図３０に示した折り返し成分のうちの一部も表示され、その結果、表示装置で表示される動画の画質が劣化する。 The encoded data output from the transmission apparatus 1 in FIG. 25 is encoded low-frame-rate video data with a frame rate of 1 / (4t ₀ ). When the image is supplied and displayed on a display device having the same frame rate as the rate, in addition to the frequency component that should be displayed, a part of the aliasing component shown in FIGS. 28 to 30 is also displayed. As a result, the image quality of the moving image displayed on the display device deteriorates.

そこで、受信装置２では、フレームレートが1/(4t₀)の低フレームレートの動画データを、元のフレームレート1/t₀にアップサンプリングし、その結果得られる、フレームレートが1/t₀の高フレームレートの動画データを、表示装置３（図２４）に供給することで、表示装置３に、高画質の（画質の劣化が認識されない）動画データを表示させるようになっている。なお、ここでは、表示装置３は、1/t₀のフレームレートで動画を表示するようになっているものとする。 Therefore, the receiving device 2 upsamples the low-frame-rate moving image data with a frame rate of 1/(4t₀) to the original frame rate 1/t₀ and supplies the resulting high-frame-rate moving image data with a frame rate of 1/t₀ to the display device 3 (FIG. 24), so that the display device 3 displays high-quality moving image data (with no perceptible degradation in image quality). Here, the display device 3 is assumed to display moving images at a frame rate of 1/t₀.

受信装置２では、上述のように、表示装置３において高画質の動画を表示するために、送信装置１から供給される、フレームレートが1/(4t₀)の低フレームレートの動画データに対して、所定の通過帯域の周波数成分を通過させるフィルタを適用しながら、時間方向のアップサンプリングを行う。 In the receiving device 2, as described above, in order to display a high-quality moving image on the display device 3, low-frame-rate moving image data with a frame rate of 1 / (4t ₀ ) supplied from the transmitting device 1 is used. Thus, upsampling in the time direction is performed while applying a filter that allows a frequency component in a predetermined passband to pass.

即ち、デコード部５０には、送信装置１（図２４）から記録媒体１１または伝送媒体１２を介して、エンコードデータが供給される。デコード部５０は、エンコードデータをデコードし、その結果得られる低フレームレートの動画データを、バッファ部５１に供給する。 That is, the decoding unit 50 is supplied with encoded data from the transmission device 1 (FIG. 24) via the recording medium 11 or the transmission medium 12. The decoding unit 50 decodes the encoded data, and supplies the low frame rate moving image data obtained as a result to the buffer unit 51.

バッファ部５１は、デコード部５０から供給される低フレームレートの動画データを順次記憶する。 The buffer unit 51 sequentially stores the low frame rate moving image data supplied from the decoding unit 50.

フィルタ部５２は、バッファ部５１に記憶された動画データを適宜読み出し、その動画データを、後述するフィルタ生成部５３から供給されるフィルタ情報にしたがってフィルタリングしながらアップサンプリングし、そのフィルタリングおよびアップサンプリングにより得られる高フレームレートの動画データを出力する。フィルタ部５２が出力する高フレームレートの動画データは、表示装置３（図２４）に供給されて表示される。 The filter unit 52 appropriately reads out the moving image data stored in the buffer unit 51, up-samples the moving image data while filtering according to the filter information supplied from the filter generation unit 53 described later, and performs the filtering and up-sampling. The resulting high frame rate video data is output. The high frame rate moving image data output from the filter unit 52 is supplied to the display device 3 (FIG. 24) and displayed.

フィルタ生成部５３は、バッファ部５１に記憶された動画データを適宜読み出し、その動画データの各部分を用いてフィルタリングするフィルタの情報であるフィルタ情報を生成して、フィルタ部５２に供給する。 The filter generation unit 53 appropriately reads out the moving image data stored in the buffer unit 51, generates filter information that is filter information to be filtered using each portion of the moving image data, and supplies the filter information to the filter unit 52.

即ち、フィルタ生成部５３は、主成分方向取得部６１とフィルタ情報供給部６２とから構成される。 That is, the filter generation unit 53 includes a principal component direction acquisition unit 61 and a filter information supply unit 62.

主成分方向取得部６１は、バッファ部５１から読み出された動画データの各部分について、周波数ドメイン上での主成分の方向である主成分方向を取得し、その主成分方向の情報を、フィルタ情報供給部６２に供給する。 The principal component direction acquisition unit 61 acquires, for each part of the moving image data read from the buffer unit 51, a principal component direction that is the direction of the principal component on the frequency domain, and filters the information on the principal component direction. The information is supplied to the information supply unit 62.

フィルタ情報供給部６２は、主成分方向取得部６１からの主成分方向の情報にしたがい、周波数ドメインにおいて、主成分方向に延びる領域であって、時間方向の周波数軸Tの方向に特定の幅としての、例えば、2π/(4t₀)を有する領域、即ち、例えば、図２０の領域R₁₀₀₁や、図２１の領域R₁₁₀₁、図２２の領域R₁₂₀₁、図２３の領域R₁₃₀₁を、フィルタの通過帯域として決定し、その通過帯域を、フィルタ情報として、フィルタ部５２に出力する。 In accordance with the principal component direction information from the principal component direction acquisition unit 61, the filter information supply unit 62 determines, as the pass band of the filter, a region in the frequency domain that extends in the principal component direction and has a specific width of, for example, 2π/(4t₀) in the direction of the temporal frequency axis T, that is, for example, the region R₁₀₀₁ in FIG. 20, the region R₁₁₀₁ in FIG. 21, the region R₁₂₀₁ in FIG. 22, or the region R₁₃₀₁ in FIG. 23, and outputs that pass band to the filter unit 52 as filter information.

次に、図３４のフローチャートを参照して、図３３の受信装置２の処理について説明する。 Next, the processing of the receiving device 2 in FIG. 33 will be described with reference to the flowchart in FIG. 34.

デコード部５０には、エンコードデータが供給される。デコード部５０は、そのエンコードデータを、フレームレートが1/(4t₀)の低フレームレートの動画データにデコードし、バッファ部５１に供給する。バッファ部５１では、デコード部５０から供給される1/(4t₀)のフレームレートの低フレームレートの動画データが順次記憶される。 The decoding unit 50 is supplied with encoded data. The decoding unit 50 decodes the encoded data into moving image data with a frame rate of 1 / (4t ₀ ) and a low frame rate, and supplies it to the buffer unit 51. The buffer unit 51 sequentially stores moving image data having a low frame rate of 1 / (4t ₀ ) supplied from the decoding unit 50.

そして、ステップＳ３１において、フィルタ生成部５３は、バッファ部５１に記憶された動画データのうちのある部分のデータ、即ち、例えば、図３２のステップＳ２１で説明したような１６×１６画素のブロックのデータを読み出して、自身（フィルタ生成部５３）に入力し、ステップＳ３２乃至Ｓ３４に順次進む。 In step S31, the filter generation unit 53 generates a certain part of the moving image data stored in the buffer unit 51, that is, for example, a block of 16 × 16 pixels as described in step S21 of FIG. Data is read out and input to itself (filter generation unit 53), and the process proceeds to steps S32 to S34 in sequence.

ステップＳ３２乃至Ｓ３４では、フィルタ生成部５３（の主成分方向取得部６１およびフィルタ情報供給部６２）が、ステップＳ３１で入力されたデータについて、周波数ドメイン上で人間が認識することができる領域を、フィルタの通過帯域として求め、その通過帯域をフィルタ情報として、フィルタ部５２に供給する処理、即ち、上述した図３１の「必要な情報の通過帯域を求める処理」と同様の処理を行う。 In steps S32 to S34, the filter generation unit 53 (the principal component direction acquisition unit 61 and the filter information supply unit 62) can recognize a region that can be recognized by humans on the frequency domain for the data input in step S31. A process for obtaining the pass band of the filter and supplying the pass band as filter information to the filter unit 52, that is, a process similar to the above-described "process for obtaining the pass band of necessary information" in FIG. 31 is performed.

即ち、ステップＳ３２では、フィルタ生成部５３の主成分方向取得部６１は、ステップＳ３１で入力された動画データについて、空間方向x,yの周波数軸X,Y、および時間方向tの周波数軸Tで定義される周波数ドメインでの主成分方向を取得し、フィルタ情報供給部６２に供給して、ステップＳ３３に進む。 That is, in step S32, the principal component direction acquisition unit 61 of the filter generation unit 53 uses the frequency axes X and Y in the spatial directions x and y and the frequency axis T in the time direction t for the moving image data input in step S31. The principal component direction in the defined frequency domain is acquired and supplied to the filter information supply unit 62, and the process proceeds to step S33.

なお、ステップＳ３２において主成分方向は、図３１で説明したように、３次元フーリエ変換を行うことにより求めることもできるし、図３２で説明したように、動きベクトルを検出することにより求めることもできる。また、上述したように、送信装置１からのエンコードデータに、主成分方向（の情報）が多重化されている場合には、デコード部５０において、エンコードデータから主成分方向を分離し、主成分方向取得部６１において、デコード部５０から、その主成分方向（の情報）の供給を受けることにより、主成分方向を取得することもできる。 In step S32, the principal component direction can be obtained by performing a three-dimensional Fourier transform as described in FIG. 31, or can be obtained by detecting a motion vector as described in FIG. it can. Further, as described above, when the principal component direction (information thereof) is multiplexed in the encoded data from the transmission device 1, the decoding unit 50 separates the principal component direction from the encoded data, In the direction acquisition unit 61, the principal component direction can be acquired by receiving the supply of the principal component direction (information thereof) from the decoding unit 50.

さらに、エンコードデータに空間方向x,yの動きベクトル(u₀,v₀)が含まれる場合には、その動きベクトル(u₀,v₀)に、エンコードデータに含まれる動画データのフレーム同期4t₀を、時間方向tのコンポーネントとして加えて、空間方向x,yおよび時間方向tの３次元の動きベクトル(u₀,v₀,4t₀)とし、図３２で説明したように、その３次元の動きベクトル(u₀,v₀,4t₀)に直交する平面を主成分方向とすることもできる。 Furthermore, when the encoded data contains a motion vector (u₀, v₀) in the spatial directions x and y, the frame period 4t₀ of the moving image data contained in the encoded data can be appended to that motion vector (u₀, v₀) as the component in the time direction t to form a three-dimensional motion vector (u₀, v₀, 4t₀) in the spatial directions x, y and the time direction t, and, as described with reference to FIG. 32, the plane orthogonal to that three-dimensional motion vector (u₀, v₀, 4t₀) can be taken as the principal component direction.

即ち、図２５の送信装置１のエンコード部２４において、動画データが、少なくとも動き補償を利用する、例えば、MPEGなどのエンコード方法によってエンコードされる場合には、エンコードデータには、動き補償に用いられる動きベクトルが含まれる。そこで、デコード部５０においてエンコードデータから動きベクトル（の情報）を抽出し、バッファ部５１を介して、主成分方向取得部６１に、エンコードデータに含まれる動きベクトルを供給するようにし、主成分方向取得部６１では、その動きベクトルに直交する平面を、主成分方向として求める（検出する）ようにすることができる。 That is, in the encoding unit 24 of the transmission apparatus 1 of FIG. 25, when moving image data is encoded by an encoding method such as MPEG that uses at least motion compensation, the encoded data is used for motion compensation. Contains motion vectors. Therefore, the decoding unit 50 extracts the motion vector (information thereof) from the encoded data, and supplies the motion vector included in the encoded data to the principal component direction acquisition unit 61 via the buffer unit 51, thereby the principal component direction. The acquisition unit 61 can obtain (detect) a plane orthogonal to the motion vector as the principal component direction.

なお、MPEGでは、Iピクチャのブロック（マクロブロック）には、動きベクトルが存在しないが、Iピクチャの動きベクトルは、例えば、PまたはBピクチャのブロックの動きベクトルから推定するようにすればよい。 In MPEG, a motion vector does not exist in an I picture block (macroblock), but an I picture motion vector may be estimated from a motion vector of a P or B picture block, for example.

ステップＳ３３では、フィルタ情報供給部６２は、周波数ドメインにおいて、原点(0,0)から、主成分方向取得部６１からの主成分方向に延びる領域であって、T方向に２π／（４×t₀）の幅を有し、X,Y方向が、−（π／r₀）乃至＋（π／r₀）で、T方向が、−（π／t₀）乃至＋（π／t₀）の範囲の領域、即ち、図２０の領域R₁₀₀₁や、図２１の領域R₁₁₀₁、図２２の領域R₁₂₀₁、図２３の領域R₁₃₀₁を、フィルタの通過帯域として決定し、ステップＳ３４に進む。 In step S33, the filter information supply unit 62 determines, as the pass band of the filter, a region in the frequency domain that extends from the origin (0, 0) in the principal component direction supplied by the principal component direction acquisition unit 61, has a width of 2π/(4t₀) in the T direction, and lies within the range −(π/r₀) to +(π/r₀) in the X and Y directions and −(π/t₀) to +(π/t₀) in the T direction, that is, the region R₁₀₀₁ in FIG. 20, the region R₁₁₀₁ in FIG. 21, the region R₁₂₀₁ in FIG. 22, or the region R₁₃₀₁ in FIG. 23, and the process proceeds to step S34.
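The pass band determined in step S33 can be illustrated as a membership test on a frequency-domain point (X, Y, T). The sketch below is an interpretation, not the filter of the source: it assumes the band of total width 2π/(4t₀) in T is centred on the plane perpendicular to a three-dimensional motion vector (u, v, dt), where dt would be 4t₀ at the receiving device, and the function name and parameters are hypothetical. The exact shapes of the regions in FIGS. 20 to 23 are not reproduced here.

```python
import math


def in_passband(X, Y, T, u, v, dt, r0, t0):
    """True if the angular frequency (X, Y, T) lies in the assumed pass band.

    (u, v, dt): three-dimensional motion vector of the block (dt > 0).
    r0, t0:     spatial and temporal sampling intervals of the
                high-frame-rate moving image.
    """
    # Stay inside the range X, Y in [-pi/r0, pi/r0], T in [-pi/t0, pi/t0].
    if abs(X) > math.pi / r0 or abs(Y) > math.pi / r0 or abs(T) > math.pi / t0:
        return False
    # T coordinate of the principal-component plane u*X + v*Y + dt*T = 0.
    t_on_plane = -(u * X + v * Y) / dt
    # Total width 2*pi/(4*t0) in T, assumed to be centred on the plane.
    half_width = math.pi / (4.0 * t0)
    return abs(T - t_on_plane) <= half_width
```

With u = v = 0 the test reduces to a slab around T = 0, which corresponds to the stationary case; a nonzero motion vector tilts the slab along the plane.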

ステップＳ３４では、フィルタ情報供給部６２は、ステップＳ３３で求めたフィルタの通過帯域（を表す情報）を、フィルタ情報として、フィルタ部５２に供給し、ステップＳ３５に進む。 In step S34, the filter information supply unit 62 supplies the filter passband (information representing) obtained in step S33 to the filter unit 52 as filter information, and the process proceeds to step S35.

ステップＳ３５では、フィルタ部５２が、バッファ部５１から動画データを読み出し、その動画データに対して、ステップＳ３４でフィルタ生成部５３から供給されたフィルタ情報が表す通過帯域のフィルタを適用しながら、時間方向のサンプル数を4倍にするアップサンプリングを行う。 In step S35, the filter unit 52 reads the moving image data from the buffer unit 51, and applies the filter of the passband represented by the filter information supplied from the filter generation unit 53 in step S34 to the moving image data. Upsampling is performed to quadruple the number of samples in the direction.

即ち、例えば、いま、ステップＳ３１で入力された動画データが、低フレームレートの動画データのあるフレームのブロック（のデータ）であるとして、そのブロックを注目ブロックとするとともに、注目ブロックのフレームを注目フレームとする。 That is, suppose, for example, that the moving image data input in step S31 is (the data of) a block of a certain frame of the low-frame-rate moving image data; that block is taken as the block of interest, and the frame containing it as the frame of interest.

フィルタ部５２は、バッファ部５１から読み出した動画データを、フィルタ生成部５３から供給されたフィルタ情報が表す通過帯域のフィルタ、即ち、例えば、図２０の領域R₁₀₀₁や、図２１の領域R₁₁₀₁、図２２の領域R₁₂₀₁、図２３の領域R₁₃₀₁を通過帯域とするフィルタによってフィルタリングすることにより、注目ブロックの各画素のデータ（画素値）を求める。 The filter unit 52 obtains the data (pixel value) of each pixel of the block of interest by filtering the moving image data read from the buffer unit 51 with the filter whose pass band is represented by the filter information supplied from the filter generation unit 53, that is, for example, a filter whose pass band is the region R₁₀₀₁ in FIG. 20, the region R₁₁₀₁ in FIG. 21, the region R₁₂₀₁ in FIG. 22, or the region R₁₃₀₁ in FIG. 23.

また、フィルタ部５２は、バッファ部５１から読み出した動画データを、フィルタ生成部５３から供給されたフィルタ情報が表す通過帯域のフィルタ、即ち、例えば、図２０の領域R₁₀₀₁や、図２１の領域R₁₁₀₁、図２２の領域R₁₂₀₁、図２３の領域R₁₃₀₁を通過帯域とするフィルタによってフィルタリングすることにより、注目フレームから次のフレームまでの時間4t₀を、時間t₀に等分する３つの時刻それぞれにおける画素であって、注目ブロックの各画素を通り、注目ブロックの３次元の動きベクトルの方向に延びる直線上にある画素のデータを求める。なお、注目ブロックの３次元の動きベクトルは、例えば、主成分方向取得部６１で求め、フィルタ情報供給部６２を介して、フィルタ情報とともに、フィルタ部５２に供給することができる。 The filter unit 52 also filters the moving image data read from the buffer unit 51 with the filter whose pass band is represented by the filter information supplied from the filter generation unit 53, that is, for example, a filter whose pass band is the region R₁₀₀₁ in FIG. 20, the region R₁₁₀₁ in FIG. 21, the region R₁₂₀₁ in FIG. 22, or the region R₁₃₀₁ in FIG. 23. It thereby obtains the data of the pixels at each of the three time instants that divide the interval 4t₀ from the frame of interest to the next frame into equal parts of duration t₀, these pixels lying on the straight lines that pass through each pixel of the block of interest and extend in the direction of the three-dimensional motion vector of the block of interest. The three-dimensional motion vector of the block of interest can, for example, be obtained by the principal component direction acquisition unit 61 and supplied to the filter unit 52 together with the filter information via the filter information supply unit 62.
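Where the three additional samples lie can be illustrated with a small helper. It only computes the space-time positions on the line through a pixel of the block of interest along the three-dimensional motion vector (u₀, v₀, 4t₀); the pixel values at those positions would come from the pass-band filter, which is not modelled here. The function name and the `factor` parameter are hypothetical.

```python
def intermediate_samples(x, y, t, u0, v0, t0, factor=4):
    """Return (time, x, y) for the factor - 1 instants between a low-rate
    frame at time t and the next one at t + factor * t0, following the
    straight line along the motion vector (u0, v0, factor * t0)."""
    samples = []
    for k in range(1, factor):
        frac = k / factor  # fraction of the way to the next low-rate frame
        samples.append((t + k * t0, x + frac * u0, y + frac * v0))
    return samples
```

For example, a pixel at (8, 4) in a block that moves by (4, −8) between two low-rate frames passes through (9, 2), (10, 0) and (11, −2) at the three intermediate instants.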

フィルタ部５２は、以上のようなフィルタリングおよびアップサンプリングを行うことにより、フレームレートが1/t₀の高フレームレートの動画データを得て、ステップＳ３５からＳ３６に進む。 The filter unit 52 performs the above filtering and upsampling to obtain high frame rate moving image data with a frame rate of 1 / t ₀ , and proceeds from step S35 to step S36.

ステップＳ３６では、フィルタ部５２は、ステップＳ３５で得た高フレームレートの動画データを、表示装置３（図２４）に出力して、処理を終了する。 In step S36, the filter unit 52 outputs the high frame rate moving image data obtained in step S35 to the display device 3 (FIG. 24), and ends the process.

なお、ステップＳ３１乃至Ｓ３４の処理は、バッファ部５１に記憶された動画データの各部分すべてについて行われる。 Note that the processing in steps S31 to S34 is performed for all portions of the moving image data stored in the buffer unit 51.

以上のように、周波数ドメイン上で人間が認識することができる領域、即ち、例えば、図２０の領域R₁₀₀₁や、図２１の領域R₁₁₀₁、図２２の領域R₁₂₀₁、図２３の領域R₁₃₀₁を、通過帯域とするフィルタによって、動画データをフィルタリングし、さらに、アップサンプリングを行うようにしたので、人間の目の視覚効果を考慮したデータ量の削減が行われた低フレームレートの動画データから、高画質の動画データ、つまり、元の高フレームレートの動画データと同様の画質の動画データを得て（元の高フレームレートの動画データと同様の画質を認識することができる動画データを復元して）表示することができる。 As described above, the moving image data is filtered by a filter whose pass band is the region that humans can perceive in the frequency domain, that is, for example, the region R₁₀₀₁ in FIG. 20, the region R₁₁₀₁ in FIG. 21, the region R₁₂₀₁ in FIG. 22, or the region R₁₃₀₁ in FIG. 23, and is then upsampled. Consequently, from low-frame-rate moving image data whose data amount has been reduced in consideration of the visual characteristics of the human eye, it is possible to obtain and display high-quality moving image data, that is, moving image data whose image quality is comparable to that of the original high-frame-rate moving image data (to restore moving image data that is perceived as having the same image quality as the original high-frame-rate moving image data).

即ち、図３５乃至図３８は、それぞれ、図２０の領域R₁₀₀₁、図２１の領域R₁₁₀₁、図２２の領域R₁₂₀₁、図２３の領域R₁₃₀₁を、通過帯域とするフィルタによって、図２７乃至図３０の低フレームレートの動画（フレームレートが1/(4t₀)の動画）のデータをフィルタリングしながら、時間方向に4倍のアップサンプリングを行って得られる高フレームレートの動画（フレームレートが1/t₀の動画）のデータの周波数ドメイン上の分布を示している。 That is, FIGS. 35 to 38 show the frequency-domain distributions of the data of the high-frame-rate moving images (frame rate 1/t₀) obtained by upsampling the data of the low-frame-rate moving images (frame rate 1/(4t₀)) of FIGS. 27 to 30 by a factor of four in the time direction while filtering them with filters whose pass bands are the region R₁₀₀₁ in FIG. 20, the region R₁₁₀₁ in FIG. 21, the region R₁₂₀₁ in FIG. 22, and the region R₁₃₀₁ in FIG. 23, respectively.

まず、動画の中の静止している部分のデータに対しては、フィルタ部５２において、図２０の領域R₁₀₀₁を通過帯域とするフィルタによるフィルタリングを行いながらアップサンプリングを行う。従って、そのフィルタリングおよびアップサンプリングによって得られる動画データは、図２０の領域R₁₀₀₁内の周波数成分を有し、さらに、空間方向x,yに対してはr₀間隔でサンプリングされ、時間方向tに対してはt₀間隔でサンプリングされたデータとなる。 First, the data of a stationary part of the moving image is upsampled in the filter unit 52 while being filtered by a filter whose pass band is the region R₁₀₀₁ in FIG. 20. Accordingly, the moving image data obtained by this filtering and upsampling has frequency components within the region R₁₀₀₁ in FIG. 20 and is sampled at intervals of r₀ in the spatial directions x and y and at intervals of t₀ in the time direction t.

このため、図３５に示すように、周波数ドメイン上では、X,Y方向には、2π/r₀間隔で、折り返し成分が生じ、T方向には、2π/t₀間隔で、折り返し成分が生じる。即ち、折り返し成分が生じるT方向の周期が、図２７に示した場合の４倍になる。 Therefore, as shown in FIG. 35, in the frequency domain aliasing components occur at intervals of 2π/r₀ in the X and Y directions and at intervals of 2π/t₀ in the T direction. That is, the period in the T direction at which aliasing components occur is four times that in the case shown in FIG. 27.
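The factor of four follows from the standard relation between a sampling interval and the spacing of spectral replicas, sketched here for the temporal axis only. The symbol Δ for the sampling interval and the 1/Δ normalisation are assumptions of this sketch (the constant depends on the Fourier transform convention); F denotes the spectrum of the unsampled signal along T. The same relation applies to the moving cases shown in FIGS. 36 to 38.

```latex
% Assumption: ideal sampling of a continuous signal at interval \Delta along t.
\begin{align*}
F_{\Delta}(T) &= \frac{1}{\Delta} \sum_{k=-\infty}^{\infty} F\!\left(T - \frac{2\pi k}{\Delta}\right) \\
\Delta = 4 t_0 &\;\Rightarrow\; \text{replica spacing } \frac{2\pi}{4 t_0} \\
\Delta = t_0 &\;\Rightarrow\; \text{replica spacing } \frac{2\pi}{t_0} = 4 \cdot \frac{2\pi}{4 t_0}
\end{align*}
```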

なお、図３５において、影を付してある部分が、動画データが存在する部分である。 In FIG. 35, the shaded portion is a portion where moving image data exists.

次に、動画の中で、そこに投影されている被写体が速度「（r₀／t₀）／２」程度で動いている部分のデータに対しては、フィルタ部５２において、図２１の領域R₁₁₀₁を通過帯域とするフィルタによるフィルタリングを行いながらアップサンプリングを行う。従って、そのフィルタリングおよびアップサンプリングによって得られる動画データは、図２１の領域R₁₁₀₁内の周波数成分を有し、さらに、空間方向x,yに対してはr₀間隔でサンプリングされ、時間方向tに対してはt₀間隔でサンプリングされたデータとなる。 Next, the data of a part of the moving image in which the projected subject moves at a speed of about (r₀/t₀)/2 is upsampled in the filter unit 52 while being filtered by a filter whose pass band is the region R₁₁₀₁ in FIG. 21. Accordingly, the moving image data obtained by this filtering and upsampling has frequency components within the region R₁₁₀₁ in FIG. 21 and is sampled at intervals of r₀ in the spatial directions x and y and at intervals of t₀ in the time direction t.

このため、図３６に示すように、周波数ドメイン上では、X,Y方向には、2π/r₀間隔で、折り返し成分が生じ、T方向には、2π/t₀間隔で、折り返し成分が生じる。即ち、折り返し成分が生じるT方向の周期が、図２８に示した場合の４倍になる。 Therefore, as shown in FIG. 36, in the frequency domain aliasing components occur at intervals of 2π/r₀ in the X and Y directions and at intervals of 2π/t₀ in the T direction. That is, the period in the T direction at which aliasing components occur is four times that in the case shown in FIG. 28.

なお、図３６において、影を付してある部分が、動画データが存在する部分である。 In FIG. 36, the shaded portion is a portion where moving image data exists.

次に、動画の中で、そこに投影されている被写体が速度「r₀／t₀」程度で動いている部分のデータに対しては、フィルタ部５２において、図２２の領域R₁₂₀₁を通過帯域とするフィルタによるフィルタリングを行いながらアップサンプリングを行う。従って、そのフィルタリングおよびアップサンプリングによって得られる動画データは、図２２の領域R₁₂₀₁内の周波数成分を有し、さらに、空間方向x,yに対してはr₀間隔でサンプリングされ、時間方向tに対してはt₀間隔でサンプリングされたデータとなる。 Next, the data of a part of the moving image in which the projected subject moves at a speed of about r₀/t₀ is upsampled in the filter unit 52 while being filtered by a filter whose pass band is the region R₁₂₀₁ in FIG. 22. Accordingly, the moving image data obtained by this filtering and upsampling has frequency components within the region R₁₂₀₁ in FIG. 22 and is sampled at intervals of r₀ in the spatial directions x and y and at intervals of t₀ in the time direction t.

このため、図３７に示すように、周波数ドメイン上では、X,Y方向には、2π/r₀間隔で、折り返し成分が生じ、T方向には、2π/t₀間隔で、折り返し成分が生じる。即ち、折り返し成分が生じるT方向の周期が、図２９に示した場合の４倍になる。 Therefore, as shown in FIG. 37, in the frequency domain aliasing components occur at intervals of 2π/r₀ in the X and Y directions and at intervals of 2π/t₀ in the T direction. That is, the period in the T direction at which aliasing components occur is four times that in the case shown in FIG. 29.

なお、図３７において、影を付してある部分が、動画データが存在する部分である。 In FIG. 37, the shaded portion is a portion where moving image data exists.

次に、動画の中で、そこに投影されている被写体が速度「2r₀／t₀」程度で動いている部分のデータに対しては、フィルタ部５２において、図２３の領域R₁₃₀₁を通過帯域とするフィルタによるフィルタリングを行いながらアップサンプリングを行う。従って、そのフィルタリングおよびアップサンプリングによって得られる動画データは、図２３の領域R₁₃₀₁内の周波数成分を有し、さらに、空間方向x,yに対してはr₀間隔でサンプリングされ、時間方向tに対してはt₀間隔でサンプリングされたデータとなる。 Next, the data of a part of the moving image in which the projected subject moves at a speed of about 2r₀/t₀ is upsampled in the filter unit 52 while being filtered by a filter whose pass band is the region R₁₃₀₁ in FIG. 23. Accordingly, the moving image data obtained by this filtering and upsampling has frequency components within the region R₁₃₀₁ in FIG. 23 and is sampled at intervals of r₀ in the spatial directions x and y and at intervals of t₀ in the time direction t.

このため、図３８に示すように、周波数ドメイン上では、X,Y方向には、2π/r₀間隔で、折り返し成分が生じ、T方向には、2π/t₀間隔で、折り返し成分が生じる。即ち、折り返し成分が生じるT方向の周期が、図３０に示した場合の４倍になる。 Therefore, as shown in FIG. 38, in the frequency domain aliasing components occur at intervals of 2π/r₀ in the X and Y directions and at intervals of 2π/t₀ in the T direction. That is, the period in the T direction at which aliasing components occur is four times that in the case shown in FIG. 30.

なお、図３８において、影を付してある部分が、動画データが存在する部分である。 In FIG. 38, the shaded portion is a portion where moving image data exists.

図２で説明したように、人間が認識することができるのは、X,Y方向が、−（π／r₀）乃至＋（π／r₀）で、T方向が、−（π／t₀）乃至＋（π／t₀）の範囲の領域R₂₀₁であり、図３５乃至図３８において、この領域R₂₀₁内には、折り返し成分は存在しない。 As described in FIG. 2, humans can recognize that the X and Y directions are − (π / r ₀ ) to + (π / r ₀ ) and the T direction is − (π / t ₀ ) to + (π / t ₀ ) range R ₂₀₁ , and in FIG. 35 to FIG. 38, there is no aliasing component in this region R ₂₀₁ .

従って、フィルタ部５２におけるフィルタリングおよびアップサンプリングにより得られる、1/t₀の高フレームレートの動画データが、表示装置３に表示された場合に、その表示された動画に対して、人間は、元の動画（送信装置１に供給された動画）と同様の画質を認識することができる。 Therefore, when moving image data with a high frame rate of 1 / t ₀ obtained by filtering and upsampling in the filter unit 52 is displayed on the display device 3, The same image quality as that of the moving image (the moving image supplied to the transmission device 1) can be recognized.

なお、上述したように、図２５の送信装置１のエンコード部２４では、エンコードデータに、フィルタ部２２で用いられたフィルタのフィルタ情報を多重化することができるが、エンコードデータにフィルタ情報が多重化されている場合には、受信装置２（図３３）は、フィルタ生成部５３なしで構成することができる。この場合、受信装置２は、デコード部５０において、エンコードデータからフィルタ情報を分離し、フィルタ部５２に供給するように構成すればよい。 As described above, the encoding unit 24 of the transmission device 1 in FIG. 25 can multiplex the filter information of the filter used in the filter unit 22 into the encoded data, but the filter information is multiplexed into the encoded data. If it is configured, the receiving device 2 (FIG. 33) can be configured without the filter generation unit 53. In this case, the receiving device 2 may be configured such that the decoding unit 50 separates the filter information from the encoded data and supplies it to the filter unit 52.

また、上述の場合には、図２４の送信装置１において1/t₀の高フレームレートの動画データを処理して得られる1/(4t₀)の低フレームレートの動画データを、受信装置２に入力し、受信装置２において、その低フレームレートの動画データを、1/t₀の高フレームレートの動画データに変換することにより、表示装置３において、高画質（送信装置１に入力される動画と同様の画質）の動画を表示するようにしたが、受信装置２には、元々、1/(4t₀)の低フレームレートの動画データを入力しても、表示装置３において、ある程度高画質の動画を表示することが可能である。 In the above case, 1 / (4t ₀ ) low frame rate moving image data obtained by processing the high frame rate moving image data of 1 / t ₀ in the transmitting device 1 of FIG. And the receiving device 2 converts the low-frame-rate moving image data into high-frame-rate moving image data of 1 / t ₀ , so that the display device 3 has high image quality (input to the transmitting device 1). Although a moving image having the same image quality as the moving image is displayed, even if moving image data with a low frame rate of 1 / (4t ₀ ) is originally input to the receiving device 2, the display device 3 has a certain amount of high quality. It is possible to display a moving image of image quality.

即ち、動画において、（ほぼ）静止している被写体が投影されている部分では、時間方向の周波数成分はほとんど存在しないから、上述の図１０に示した領域R₃₀₁から領域R₁₀₀₁を除いた領域における周波数成分はほとんど存在しない。 That is, in the moving image, in a portion where a (substantially) stationary subject is projected, there is almost no frequency component in the time direction, and thus the region obtained by removing the region R ₁₀₀₁ from the region R ₃₀₁ shown in FIG. There are almost no frequency components in.

また、動画において、そこに投影されている被写体が速度「（r₀／t₀）／２」程度で動いている部分では、その動きの方向（空間方向x,yおよび時間方向tの３次元の動きの方向）に垂直な方向の周波数成分以外はほとんど存在しないから、上述の図１１に示した領域R₄₀₁から領域R₁₁₀₁を除いた領域における周波数成分はほとんど存在しない。 In the moving image, in the part where the subject projected there is moving at a speed of “(r ₀ / t ₀ ) / 2”, the direction of the movement (spatial direction x, y and time direction t three-dimensional Since there is almost no frequency component other than the frequency component in the direction perpendicular to the direction of the movement of (1), there is almost no frequency component in the region excluding the region R ₁₁₀₁ from the region R ₄₀₁ shown in FIG.

さらに、動画において、そこに投影されている被写体が速度「r₀／t₀」程度で動いている部分では、その動きの方向に垂直な方向の周波数成分以外はほとんど存在しないから、上述の図１２に示した領域R₈₀₁から領域R₁₂₀₁を除いた領域における周波数成分はほとんど存在しない。 Further, in the moving image, there is almost no frequency component other than the frequency component in the direction perpendicular to the direction of the movement in the portion where the object projected there is moving at a speed of about “r ₀ / t ₀ ”. There are almost no frequency components in the region excluding the region R ₁₂₀₁ from the region R ₈₀₁ shown in FIG.

また、動画において、そこに投影されている被写体が速度「2r₀／t₀」程度で動いている部分では、その動きの方向に垂直な方向の周波数成分以外はほとんど存在しないから、上述の図１３に示した領域R₉₀₁から領域R₁₃₀₁を除いた領域における周波数成分はほとんど存在しない。 Further, in the moving image, in the portion where the subject projected there is moving at a speed of about “2r ₀ / t ₀ ”, there is almost no frequency component in the direction perpendicular to the direction of the movement. There are almost no frequency components in the region excluding the region R ₁₃₀₁ from the region R ₉₀₁ shown in FIG.

但し、実際には、動画においては、被写体の投影像は、時間の経過に伴って複雑に変化するため、動画において静止している部分であっても、図１０に示した領域R₃₀₁から領域R₁₀₀₁を除いた領域に、多少のデータ（周波数成分）が存在する。 In practice, however, the projected image of a subject in a moving image changes in a complex manner over time, so that even for a stationary part of the moving image some data (frequency components) exists in the region obtained by excluding the region R₁₀₀₁ from the region R₃₀₁ shown in FIG. 10.

同様に、動画において速度「（r₀／t₀）／２」程度で動いている部分には、図１１に示した領域R₄₀₁から領域R₁₁₀₁を除いた領域に、多少のデータが存在し、また、動画において速度「r₀／t₀」程度で動いている部分には、図１２に示した領域R₈₀₁から領域R₁₂₀₁を除いた領域に、多少のデータが存在する。さらに、動画において速度「2r₀／t₀」程度で動いている部分には、図１３に示した領域R₉₀₁から領域R₁₃₀₁を除いた領域に、多少のデータが存在する。 Similarly, in the moving part of the moving image at a speed of “(r ₀ / t ₀ ) / 2”, there is some data in the area excluding the area R ₁₁₀₁ from the area R ₄₀₁ shown in FIG. Further, in the moving part of the moving image at the speed “r ₀ / t ₀ ”, there is some data in the area excluding the area R ₁₂₀₁ from the area R ₈₀₁ shown in FIG. Furthermore, in the moving part of the moving image at a speed of about “2r ₀ / t ₀ ”, there is some data in the area excluding the area R ₁₃₀₁ from the area R ₉₀₁ shown in FIG.

To remove such data lying outside regions R₁₀₀₁, R₁₁₀₁, R₁₂₀₁, and R₁₃₀₁, the filter unit 22 of the transmitting device 1 (FIG. 25) performs filtering with filters whose passbands are regions R₁₀₀₁, R₁₁₀₁, R₁₂₀₁, and R₁₃₀₁.

Therefore, when the low-frame-rate moving image data input to the receiving device 2 was not obtained through the processing of the transmitting device 1, some data may exist outside regions R₁₀₀₁, R₁₁₀₁, R₁₂₀₁, and R₁₃₀₁. When data exists outside these regions, aliasing components are mixed into region R₂₀₁ shown in FIG. 2, which humans can perceive, in the moving image data obtained by the filtering and upsampling in the filter unit 52 of the receiving device 2 (FIG. 33), and these aliasing components degrade the image quality of the moving image.

However, only a small amount of data exists outside regions R₁₀₀₁, R₁₁₀₁, R₁₂₀₁, and R₁₃₀₁, and the image quality degradation caused by the aliasing components of such a small amount of data is also slight. Therefore, when moving image data that originally has a low frame rate of 1/(4t₀) is input to the receiving device 2, the image quality is somewhat degraded compared with the case where the low-frame-rate moving image data obtained by the transmitting device 1 is input. Even so, a moving image with improved image quality can be displayed compared with the case where the original 1/(4t₀) low-frame-rate moving image data is displayed as is.

Note that when the 1/(4t₀) low-frame-rate moving image data input to the receiving device 2 was captured by an imaging device that employs a so-called electronic shutter, the filter information supply unit 62 of the receiving device 2 (FIG. 33) can determine, as the passband of the filter, a region whose width in the T direction is limited according to the exposure time used when the low-frame-rate moving image data was captured.

That is, when the exposure time used to capture each frame of the 1/(4t₀) low-frame-rate moving image data is t₀', which is greater than t₀ and less than 4t₀, the image of the subject projected in each frame equals the value (amount of received light) obtained by integrating the light from the subject over the time t₀'. Therefore, when the subject is moving, blur (motion blur) corresponding to the time t₀' occurs in the image of each frame. When t₀ is, for example, about 1/240 second, t₀' is, for example, about 1/120 second.

Integration over the time t₀' is equivalent to filtering with a low-pass filter whose passband is region R₃₉₀₁, which has a width of 2π/t₀' in the T direction of the frequency domain, as shown in FIG. 39. Therefore, the data of each frame of a moving image captured by an electronic shutter with the exposure time t₀' can be regarded as data that has passed through a low-pass filter whose passband is region R₃₉₀₁.

Region R₃₉₀₁ has a width of 2π/t₀' in the T direction, ranging from -π/t₀' to π/t₀', and a width of 2π/r₀ in the X and Y directions, ranging from -π/r₀ to π/r₀, which is the range humans can perceive. As described above, when, for example, t₀ is 1/240 second and t₀' is 1/120 second, so that 2t₀ = t₀', region R₃₉₀₁ has a width of π/t₀ in the T direction, ranging from -π/(2t₀) to π/(2t₀).

As described above, the data of each frame of a moving image captured with the exposure time t₀' is data that has passed through a low-pass filter whose passband is region R₃₉₀₁, so in the frequency domain no data exists outside region R₃₉₀₁.

Therefore, even data that lies within a region humans can perceive, for example region R₁₀₀₁ in FIG. 20, region R₁₁₀₁ in FIG. 21, region R₁₂₀₁ in FIG. 22, or region R₁₃₀₁ in FIG. 23, is not genuine moving image data but noise or the like if it lies outside region R₃₉₀₁ in FIG. 39, and it is desirable to remove it as unnecessary data.

Accordingly, the filter information supply unit 62 of the receiving device 2 (FIG. 33) can determine, as the passband of the filter, the region obtained by limiting region R₁₀₀₁ in FIG. 20, region R₁₁₀₁ in FIG. 21, region R₁₂₀₁ in FIG. 22, or region R₁₃₀₁ in FIG. 23 in the T direction by region R₃₉₀₁ in FIG. 39.

For example, assuming that 2t₀ = t₀' as described above, region R₁₀₀₁ in FIG. 20 is contained in region R₃₉₀₁ in FIG. 39. Therefore, for a portion of the moving image where the subject is stationary, the filter information supply unit 62 can determine region R₁₀₀₁ in FIG. 20, as is, as the passband of the filter.

Region R₁₁₀₁ in FIG. 21 overlaps region R₃₉₀₁ in FIG. 39 in region R₄₀₀₁, shown in FIG. 40, which is the part of region R₁₁₀₁ ranging from -π/(2t₀) to π/(2t₀) in the T direction and from -π/r₀ to π/r₀ in the X and Y directions. Therefore, for a portion of the moving image where the projected subject is moving at a speed of about "(r₀/t₀)/2", the filter information supply unit 62 can determine region R₄₀₀₁ in FIG. 40 as the passband of the filter.

Further, region R₁₂₀₁ in FIG. 22 overlaps region R₃₉₀₁ in FIG. 39 in region R₄₁₀₁, shown in FIG. 41, which is the part of region R₁₂₀₁ ranging from -π/(2t₀) to π/(2t₀) in the T direction and from -π/r₀ to π/r₀ in the X and Y directions. Therefore, for a portion of the moving image where the projected subject is moving at a speed of about "r₀/t₀", the filter information supply unit 62 can determine region R₄₁₀₁ in FIG. 41 as the passband of the filter.

Likewise, region R₁₃₀₁ in FIG. 23 overlaps region R₃₉₀₁ in FIG. 39 in region R₄₂₀₁, shown in FIG. 42, which is the part of region R₁₃₀₁ ranging from -π/(2t₀) to π/(2t₀) in the T direction and from -π/r₀ to π/r₀ in the X and Y directions. Therefore, for a portion of the moving image where the projected subject is moving at a speed of about "2r₀/t₀", the filter information supply unit 62 can determine region R₄₂₀₁ in FIG. 42 as the passband of the filter.
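
The clipping of a motion-dependent passband by the exposure-time band can be illustrated with a minimal Python sketch. It is not taken from the patent: the function name in_passband, its parameters, and the sign convention of the frequency axes are assumptions made for illustration. The passband is modelled as a slab of total T-direction width 2π/(4t₀) around the plane orthogonal to the three-dimensional motion vector (u, v, t₀), optionally intersected with the band |ω_T| ≤ π/t₀' that corresponds to region R₃₉₀₁.

```python
import math


def in_passband(wx, wy, wt, u, v, t0, r0, exposure=None):
    """Return True if the angular frequency (wx, wy, wt) lies in the assumed passband.

    (u, v) is the displacement of the subject per frame period t0 and r0 is the
    pixel pitch. `exposure` is the exposure time t0'; None disables the clip.
    All names here are hypothetical and only serve this illustration.
    """
    # Spatial limits perceivable by humans: |wx|, |wy| <= pi / r0.
    if abs(wx) > math.pi / r0 or abs(wy) > math.pi / r0:
        return False
    # Slab around the plane u*wx + v*wy + t0*wt = 0 with T-width 2*pi/(4*t0).
    centre = -(u * wx + v * wy) / t0
    if abs(wt - centre) > math.pi / (4 * t0):
        return False
    # Optional clip by the exposure-time band |wt| <= pi / exposure.
    if exposure is not None and abs(wt) > math.pi / exposure:
        return False
    return True
```

With t₀' = 2t₀, a point in the slab for a stationary subject always passes the exposure clip, which matches the statement that region R₁₀₀₁ is contained in region R₃₉₀₁, whereas a point in the slab for a subject moving at "r₀/t₀" can be rejected once its T-direction frequency exceeds π/(2t₀).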

When the filter information supply unit 62 of the receiving device 2 (FIG. 33) determines the passband of the filter as described above and the filter unit 52 performs the filtering, unnecessary data that lies outside region R₃₉₀₁ and is not genuine moving image data, such as noise, is removed, so moving image data with an improved S/N (signal-to-noise ratio) and the like can be obtained.

When the receiving device 2 limits the passband of the filter by region R₃₉₀₁ in FIG. 39, it needs the exposure time t₀' used when the low-frame-rate moving image data input to the receiving device 2 was captured. This exposure time t₀' can be supplied to the receiving device 2 by, for example, multiplexing it with the moving image data input to the receiving device 2.

When the data input to the receiving device 2 is not encoded, the decoding unit 50 supplies the data to the buffer unit 51 without decoding it.

Incidentally, as described above, the principal component direction acquisition unit 31 of the transmitting device 1 (FIG. 25) can, for example, detect the motion vector (u₀,v₀) of the block of interest to the frame following the frame of interest and obtain the principal component direction of the block of interest in the frequency domain from that motion vector (u₀,v₀). That is, the principal component direction acquisition unit 31 can obtain, as the principal component direction of the block of interest in the frequency domain, the plane orthogonal to the three-dimensional motion vector (u₀,v₀,t₀) formed by adding the component t₀ in the time direction t to the motion vector (u₀,v₀) of the block of interest.

In this case, if the vectors obtained by multiplying the motion vector (u₀,v₀) of the block of interest to the frame following the frame of interest by, for example, -2, -1, 2, and 3 coincide with the motion vectors of the block of interest to the frame two frames before, the frame immediately before, the frame two frames after, and the frame three frames after the frame of interest, respectively, the principal component direction of the block of interest obtained from the motion vector (u₀,v₀) is accurate.

However, if the vectors obtained by multiplying the motion vector (u₀,v₀) by, for example, -2, -1, 2, and 3 deviate from the motion vectors of the block of interest to the frame two frames before, the frame immediately before, the frame two frames after, and the frame three frames after the frame of interest, the principal component direction of the block of interest obtained from the motion vector (u₀,v₀) contains an error corresponding to the deviation and is inaccurate.

FIG. 43 shows high-frame-rate moving image data supplied to the transmitting device 1.

In FIG. 43, the moving image data is shown with the horizontal axis representing the spatial direction x. A moving subject is projected in the moving image, and it is assumed that the moving subject moves only in the spatial direction x (it does not move in the spatial direction y). In FIG. 43, the moving subject moves from left to right as time passes.

FIG. 43 shows, in order from the top, the image data at times (frames) t-4t₀, t-3t₀, t-2t₀, t-t₀, t, t+t₀, t+2t₀, t+3t₀, and t+4t₀.

Taking the image data at time t as a reference, the image data at time t-4t₀ is shifted by K₄, the image data at time t-3t₀ by K₃, the image data at time t-2t₀ by K₂, the image data at time t-t₀ by K₁, the image data at time t+t₀ by H₁, the image data at time t+2t₀ by H₂, the image data at time t+3t₀ by H₃, and the image data at time t+4t₀ by H₄, each in the spatial direction x, relative to the image data at time t.

Therefore, when a certain block R₄₃₀₀ in the image data at time t is taken as the block of interest, image data identical to the block of interest R₄₃₀₀ (image data corresponding to the block of interest R₄₃₀₀) exists, in the image data at time t-4t₀, in region R₄₃₁₄ at a position shifted by K₄ in the spatial direction x from the block of interest R₄₃₀₀ and, in the image data at time t-3t₀, in region R₄₃₁₃ at a position shifted by K₃ in the spatial direction x from the block of interest R₄₃₀₀. Similarly, image data identical to the block of interest R₄₃₀₀ exists in region R₄₃₁₂, shifted by K₂, in the image data at time t-2t₀; in region R₄₃₁₁, shifted by K₁, in the image data at time t-t₀; in region R₄₃₀₁, shifted by H₁, in the image data at time t+t₀; in region R₄₃₀₂, shifted by H₂, in the image data at time t+2t₀; in region R₄₃₀₃, shifted by H₃, in the image data at time t+3t₀; and in region R₄₃₀₄, shifted by H₄, in the image data at time t+4t₀, each shift being in the spatial direction x from the block of interest R₄₃₀₀.

In FIG. 43, the magnitude of the motion of the moving subject is not constant. Accordingly, the rates of increase from K₄ to K₃, from K₃ to K₂, from K₂ to K₁, from K₁ to 0, from 0 to H₁, from H₁ to H₂, from H₂ to H₃, and from H₃ to H₄ are not constant either.

When the subject is moving as shown in FIG. 43, the vector (H₁,0) is obtained as the motion vector (u₀,v₀) of the block of interest R₄₃₀₀, as shown in FIG. 44. Therefore, the principal component direction acquisition unit 31 of the transmitting device 1 (FIG. 25) obtains the principal component direction of the block of interest R₄₃₀₀ from this motion vector (H₁,0).

Then, from that principal component direction, the filter information supply unit 32 of the transmitting device 1 determines, for example, region R₁₀₀₁ in FIG. 20, region R₁₁₀₁ in FIG. 21, region R₁₂₀₁ in FIG. 22, or region R₁₃₀₁ in FIG. 23 as the passband of the filter used for filtering in the filter unit 22.

As described above, the regions used as the passband of the filter, for example region R₁₀₀₁ in FIG. 20, region R₁₁₀₁ in FIG. 21, region R₁₂₀₁ in FIG. 22, and region R₁₃₀₁ in FIG. 23, have a width of 2π/(4t₀) in the T direction. Therefore, in filtering with a filter having such a passband, the image data of the block of interest R₄₃₀₀ as the filtering result is obtained by an operation (performed in the filter unit 22 of the transmitting device 1) that uses image data of about four frames before and after the frame of the block of interest R₄₃₀₀ (the frame of interest), that is, for example, image data in the nine-frame range of times t-4t₀, t-3t₀, t-2t₀, t-t₀, t, t+t₀, t+2t₀, t+3t₀, and t+4t₀, or image data in the seven-frame range of times t-3t₀, t-2t₀, t-t₀, t, t+t₀, t+2t₀, and t+3t₀.

That is, the pixel value, as the filtering result, of the pixel at a position (x,y) in the block of interest R₄₃₀₀ is obtained using, for example, the following pixel values: the pixel values near the position shifted from the position (x,y) by the motion vector (H₁,0) in the image data at time t+t₀; near the position shifted by the vector (2H₁,0), which is twice the motion vector (H₁,0), in the image data at time t+2t₀; near the position shifted by the vector (3H₁,0), which is three times the motion vector, in the image data at time t+3t₀; near the position shifted by the vector (4H₁,0), which is four times the motion vector, in the image data at time t+4t₀; near the position shifted by the vector (-H₁,0), which is -1 times the motion vector, in the image data at time t-t₀; near the position shifted by the vector (-2H₁,0), which is -2 times the motion vector, in the image data at time t-2t₀; near the position shifted by the vector (-3H₁,0), which is -3 times the motion vector, in the image data at time t-3t₀; and near the position shifted by the vector (-4H₁,0), which is -4 times the motion vector, in the image data at time t-4t₀.

Here, as shown in FIG. 44, the motion vector (H₁,0) is the vector from the block of interest R₄₃₀₀ to the position of region R₄₃₀₁, which corresponds to the block of interest R₄₃₀₀ (that is, contains image data identical to the block of interest R₄₃₀₀) in the image data at time t+t₀. Therefore, the position on the image data at time t+t₀ shifted from the block of interest R₄₃₀₀ by the motion vector (H₁,0) coincides with the position of region R₄₃₀₁ corresponding to the block of interest R₄₃₀₀.

However, as shown in FIG. 45, the vector (2H₁,0) is merely the motion vector (H₁,0) multiplied by 2, so the position on the image data at time t+2t₀ shifted from the block of interest R₄₃₀₀ by the vector (2H₁,0) does not necessarily coincide with the position of region R₄₃₀₂ corresponding to the block of interest R₄₃₀₀.

Likewise, as shown in FIG. 46, the vector (3H₁,0) is the motion vector (H₁,0) multiplied by 3, so the position on the image data at time t+3t₀ shifted from the block of interest R₄₃₀₀ by the vector (3H₁,0) does not necessarily coincide with the position of region R₄₃₀₃ corresponding to the block of interest R₄₃₀₀.

Further, as shown in FIG. 47, the vector (4H₁,0) is the motion vector (H₁,0) multiplied by 4, so the position on the image data at time t+4t₀ shifted from the block of interest R₄₃₀₀ by the vector (4H₁,0) does not necessarily coincide with the position of region R₄₃₀₄ corresponding to the block of interest R₄₃₀₀.

Likewise, as shown in FIG. 48, the vector (-H₁,0) is the motion vector (H₁,0) multiplied by -1, so the position on the image data at time t-t₀ shifted from the block of interest R₄₃₀₀ by the vector (-H₁,0) does not necessarily coincide with the position of region R₄₃₁₁ corresponding to the block of interest R₄₃₀₀.

Further, as shown in FIG. 49, the vector (-2H₁,0) is the motion vector (H₁,0) multiplied by -2, so the position on the image data at time t-2t₀ shifted from the block of interest R₄₃₀₀ by the vector (-2H₁,0) does not necessarily coincide with the position of region R₄₃₁₂ corresponding to the block of interest R₄₃₀₀.

Likewise, as shown in FIG. 50, the vector (-3H₁,0) is the motion vector (H₁,0) multiplied by -3, so the position on the image data at time t-3t₀ shifted from the block of interest R₄₃₀₀ by the vector (-3H₁,0) does not necessarily coincide with the position of region R₄₃₁₃ corresponding to the block of interest R₄₃₀₀.

Further, as shown in FIG. 51, the vector (-4H₁,0) is the motion vector (H₁,0) multiplied by -4, so the position on the image data at time t-4t₀ shifted from the block of interest R₄₃₀₀ by the vector (-4H₁,0) does not necessarily coincide with the position of region R₄₃₁₄ corresponding to the block of interest R₄₃₀₀.

For this reason, in the filtering for the block of interest R₄₃₀₀, the pixel values used from the image data at times t-4t₀, t-3t₀, t-2t₀, t-t₀, t+2t₀, t+3t₀, and t+4t₀ are those of regions at positions that deviate from regions R₄₃₁₄, R₄₃₁₃, R₄₃₁₂, R₄₃₁₁, R₄₃₀₂, R₄₃₀₃, and R₄₃₀₄ corresponding to the block of interest R₄₃₀₀. As a result, it can be difficult to accurately obtain data consisting only of the frequency components within the region of the frequency domain that extends in the true principal component direction (for example, region R₁₀₀₁ in FIG. 20, region R₁₁₀₁ in FIG. 21, region R₁₂₀₁ in FIG. 22, or region R₁₃₀₁ in FIG. 23).

This is because obtaining the principal component direction from the motion vector from the block of interest to the next frame amounts to approximating the motion vectors from the block of interest to the other frames by simple multiples of the motion vector from the block of interest to the next frame. Consequently, if the principal component direction is obtained from the motion vector from the block of interest to the next frame even though the subject is not moving at a constant speed, an inaccurate principal component direction results.
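
The size of this approximation error can be illustrated with a short Python sketch. The displacement values below are hypothetical numbers chosen to mimic a subject that accelerates along x. They are not values from FIG. 43, and the helper name extrapolation_errors is likewise an assumption.

```python
def extrapolation_errors(actual_shift, h1):
    """Return, per frame offset k, the gap between k * h1 and the true shift.

    `actual_shift` maps the frame offset k (time t + k*t0) to the true
    x-displacement of the block of interest (K_k for k < 0, H_k for k > 0).
    """
    return {k: k * h1 - d for k, d in actual_shift.items()}


# Hypothetical displacements for a subject that speeds up while moving right.
actual = {-2: -3.0, -1: -1.0, 1: 2.0, 2: 5.0, 3: 9.0}
errors = extrapolation_errors(actual, h1=actual[1])
```

In this sketch the error is zero only for k = 1, the frame from which the motion vector was measured, and it grows with the distance from the frame of interest when the speed is not constant.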

Accordingly, as described above, the principal component direction acquisition unit 31 of the transmitting device 1 (FIG. 25) can use not only the correlation information representing the correlation between the block of interest and the next frame but a plurality of pieces of correlation information representing the correlation between the block of interest and each of a plurality of frames before and after it. In this way it can obtain a vector that represents, so to speak, the average motion from the block of interest to each of the plurality of frames (hereinafter referred to as the average motion vector where appropriate) and, from that average motion vector, obtain a highly accurate principal component direction for the block of interest.

FIG. 52 shows a configuration example of the principal component direction acquisition unit 31 of FIG. 25 for the case where it obtains the principal component direction from the average motion vector.

The buffer unit 101 is supplied with the moving image data having the high frame rate of 1/t₀ read from the buffer unit 21 (FIG. 25), and temporarily stores that moving image data.

The block extraction unit 102 divides the moving image data stored in the buffer unit 101 into blocks of, for example, the 16×16 pixels described above, selects the blocks one after another as the block of interest, and supplies the block of interest to the correlation calculation unit 103.

The correlation calculation unit 103 reads from the buffer unit 101 (the data of) a plurality of frames that temporally precede and follow the frame of the block of interest (the frame of interest) supplied from the block extraction unit 102, calculates correlation information representing the correlation between the block of interest and each of the plurality of frames, and supplies it to the scaling synthesis unit 104.

Here, as in the case described for step S22 of FIG. 32, the correlation information is taken to be, for example, the sum of the absolute differences between the pixel value of each pixel of the block of interest, with the block of interest shifted by u and v in the spatial directions x and y, respectively, and the pixel value of the pixel of the other frame at the same position. In this case, as described above, the spatial position (u,v) at which the "value" of the correlation information is smallest is the spatial position on the other frame at which the correlation represented by the correlation information is highest.
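
The sum of absolute differences described here can be sketched in plain Python as follows. This is a minimal illustration rather than the implementation of the correlation calculation unit 103: the function names, the list-of-rows image representation, and the `search` range parameter are assumptions.

```python
def sad(block, frame, top, left, u, v):
    """Sum of absolute differences between `block` and `frame` at shift (u, v).

    `block` is a list of pixel rows located at (top, left) in the frame of
    interest; `frame` is another frame given as a list of pixel rows.
    """
    total = 0
    for j, row in enumerate(block):
        for i, value in enumerate(row):
            total += abs(value - frame[top + v + j][left + u + i])
    return total


def correlation_map(block, frame, top, left, search):
    """Return {(u, v): SAD} for every shift within +/-search that fits the frame."""
    h, w = len(block), len(block[0])
    result = {}
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            y, x = top + v, left + u
            if 0 <= y and y + h <= len(frame) and 0 <= x and x + w <= len(frame[0]):
                result[(u, v)] = sad(block, frame, top, left, u, v)
    return result
```

The shift with the smallest value in the returned map, for example `min(cmap, key=cmap.get)`, is the position of highest correlation in the sense used above.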

In the correlation calculation unit 103, the range of frames for which correlation information with the block of interest is calculated can be set to a range corresponding to the range of frames used for the filtering in the filter unit 22 (FIG. 25).

That is, as described with reference to FIG. 43, when the image data at time t is taken as a reference (when the frame at time t is the frame of interest), the filtering in the filter unit 22 uses, for example, the nine frames of image data at times t-4t₀, t-3t₀, t-2t₀, t-t₀, t, t+t₀, t+2t₀, t+3t₀, and t+4t₀, or the seven frames of image data at times t-3t₀, t-2t₀, t-t₀, t, t+t₀, t+2t₀, and t+3t₀. The frames for which the correlation calculation unit 103 calculates correlation information can be image data in roughly the same range as these nine or seven frames, that is, for example, the image data at each of times t-4t₀, t-3t₀, t-2t₀, t-t₀, t, t+t₀, t+2t₀, t+3t₀, and t+4t₀.

Since the image data at time t is the frame of the block of interest (the frame of interest), there is no need to calculate correlation information for time t. That is, the correlation calculation unit 103 calculates correlation information with the block of interest for the image data at each of times t-4t₀, t-3t₀, t-2t₀, t-t₀, t+t₀, t+2t₀, t+3t₀, and t+4t₀.

A frame for which correlation information with the block of interest is calculated is hereinafter referred to as a correlation calculation target frame where appropriate.

The scaling synthesis unit 104 scales the correlation information supplied from the correlation calculation unit 103 for each of the plurality of correlation calculation target frames in the spatial directions x and y, as described later, and then combines the scaled correlation information to obtain combined correlation information. The scaling synthesis unit 104 then supplies the combined correlation information to the minimum value detection unit 105.

The minimum value detection unit 105 detects the maximum correlation position, which is the position in the spatial direction at which the correlation represented by the combined correlation information from the scaling synthesis unit 104 is highest, and obtains the vector to that maximum correlation position as the average motion vector of the block of interest. The minimum value detection unit 105 then adds the frame period t₀ of the moving image data read from the buffer unit 21 (FIG. 25) to the average motion vector as the component in the time direction t, and detects and outputs, as the principal component direction, the direction orthogonal to the direction of the resulting three-dimensional motion vector.

Next, FIG. 53 shows the correlation information between the block of interest R₄₃₀₀ shown in FIG. 43 and the image data (frames) at each of times t-4t₀, t-3t₀, t-2t₀, t-t₀, t, t+t₀, t+2t₀, t+3t₀, and t+4t₀.

Note that the correlation information shown in FIG. 53 is obtained by the correlation calculation unit 103 of FIG. 52. However, as described above, the correlation information between the block of interest R₄₃₀₀ and the image data at time t, which is the frame of interest, need not be calculated.

In FIG. 53, the sum of absolute differences of pixel values is used as the correlation information, as described above. The correlation information is therefore a downward-convex function, and the position in the spatial direction at which the "value" of the correlation information is smallest is the position at which the correlation it represents is highest. The vector from the block of interest to that position represents the motion vector.

As described with reference to FIG. 43, image data identical to the block of interest R₄₃₀₀ exists, at time t-4t₀, in the region R₄₃₁₄ at a position shifted by K₄ in the spatial direction x from the block of interest R₄₃₀₀, and, at time t-3t₀, in the region R₄₃₁₃ at a position shifted by K₃ in the spatial direction x from the block of interest R₄₃₀₀. Similarly, image data identical to the block of interest R₄₃₀₀ exists at time t-2t₀ in the region R₄₃₁₂ shifted by K₂ in the spatial direction x, at time t-t₀ in the region R₄₃₁₁ shifted by K₁, at time t+t₀ in the region R₄₃₀₁ shifted by H₁, at time t+2t₀ in the region R₄₃₀₂ shifted by H₂, at time t+3t₀ in the region R₄₃₀₃ shifted by H₃, and at time t+4t₀ in the region R₄₃₀₄ shifted by H₄, each shift being measured from the block of interest R₄₃₀₀.

Accordingly, the correlation information at time t-4t₀ is smallest at position x=K₄, and the correlation information at time t-3t₀ is smallest at position x=K₃. Similarly, the correlation information is smallest at position x=K₂ for time t-2t₀, at x=K₁ for time t-t₀, at x=H₁ for time t+t₀, at x=H₂ for time t+2t₀, at x=H₃ for time t+3t₀, and at x=H₄ for time t+4t₀.

The principal component direction acquisition unit 31 of FIG. 52 approximates the motion of the block of interest R₄₃₀₀ as constant over the period from time t-4t₀ to t+4t₀, and obtains the vector representing that constant motion as the average motion vector.

For this purpose, the scaling synthesis unit 104 of the principal component direction acquisition unit 31 scales, in the spatial directions x and y, the correlation information at each of times t-4t₀, t-3t₀, t-2t₀, t-t₀, t+t₀, t+2t₀, t+3t₀, and t+4t₀, and then combines the scaled correlation information to obtain the combined correlation information.

That is, if the motion of the block of interest R₄₃₀₀ is approximated as constant over the period from time t-4t₀ to t+4t₀ and, for example, the correlation information at time t+t₀ is taken as the reference, then the correlation information at time t-4t₀ is displaced in position (spatial directions x, y) by a factor of -4 relative to the correlation information at time t+t₀. The correlation information at time t-3t₀ is displaced in position by a factor of -3 relative to the correlation information at time t+t₀. Likewise, relative to the correlation information at time t+t₀, the correlation information is displaced in position by a factor of -2 at time t-2t₀, by a factor of -1 at time t-t₀, by a factor of 2 at time t+2t₀, by a factor of 3 at time t+3t₀, and by a factor of 4 at time t+4t₀.

The scaling synthesis unit 104 therefore scales the position axis of the correlation information at time t-4t₀ by a factor of -1/4 (inverting and shrinking it). Similarly, the scaling synthesis unit 104 scales the position axis of the correlation information by a factor of -1/3 for time t-3t₀, by a factor of -1/2 for time t-2t₀, by a factor of -1 (inverting it) for time t-t₀, by a factor of 1/2 (shrinking it) for time t+2t₀, by a factor of 1/3 for time t+3t₀, and by a factor of 1/4 for time t+4t₀.

For the correlation information at time t+t₀, the scaling synthesis unit 104 performs no scaling at all; performing no scaling can also be regarded as scaling the position axis by a factor of 1.

FIG. 54 shows the correlation information after scaling, obtained when the correlation information at each of times t-4t₀, t-3t₀, t-2t₀, t-t₀, t+2t₀, t+3t₀, and t+4t₀ is scaled with the correlation information at time t+t₀ as the reference.

As a result of the scaling, the correlation information at time t-4t₀ is smallest at position x=-K₄/4, and the correlation information at time t-3t₀ is smallest at position x=-K₃/3. Similarly, the correlation information is smallest at position x=-K₂/2 for time t-2t₀, at x=-K₁ for time t-t₀, at x=H₂/2 for time t+2t₀, at x=H₃/3 for time t+3t₀, and at x=H₄/4 for time t+4t₀.

Note that because the scaling is performed with the correlation information at time t+t₀ as the reference, the correlation information at time t+t₀ is smallest at position x=H₁ both before and after scaling.
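As a worked check of why the scaled minima can be expected to line up, assume (as the approximation above does) that the block moves by a constant amount (L, 0) per frame period t₀. The symbol L is introduced here only for illustration. The displacements of FIG. 43 then satisfy:

```latex
\[
K_n = -nL, \qquad H_n = nL \qquad (n = 1, 2, 3, 4),
\]
\[
-\frac{K_n}{n} = L, \qquad \frac{H_n}{n} = L .
\]
```

Under that assumption every scaled curve has its minimum at the same position x = L, which is also where the unscaled curve for time t+t₀ has its minimum (H₁ = L). When the motion is only approximately constant, the scaled minima cluster around a common value rather than coinciding exactly.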

In the principal component direction acquisition unit 31 of FIG. 52, the scaling synthesis unit 104 combines the scaled correlation information at each of times t-4t₀, t-3t₀, t-2t₀, t-t₀, t+t₀, t+2t₀, t+3t₀, and t+4t₀ to obtain the combined correlation information.

FIG. 55 shows the combined correlation information obtained by combining the correlation information of FIG. 54.

The combined correlation information of FIG. 55 can be obtained by adding together the scaled correlation information at each of times t-4t₀, t-3t₀, t-2t₀, t-t₀, t+t₀, t+2t₀, t+3t₀, and t+4t₀ shown in FIG. 54.

The minimum value detection unit 105 of the principal component direction acquisition unit 31 of FIG. 52 obtains the position x=L₁ (that is, (x,y)=(L₁,0)) at which the combined correlation information of FIG. 55 is smallest as the average motion vector (L₁,0).

Note that the combined correlation information can be obtained not only by simply adding the scaled correlation information but also by weighted addition. That is, the combined correlation information can be obtained by adding the correlation information with larger weights for times closer to the time t of the block of interest. In that case, the resulting average motion vector gives greater importance to images at times close to the time t of the block of interest.

The minimum value detection unit 105 obtains the principal component direction of the block of interest from the average motion vector (L₁,0) obtained as described above. A relatively accurate principal component direction can therefore be obtained for the block of interest.

That is, with the principal component direction obtained from the average motion vector (L₁,0) as described above (in other words, the plane perpendicular to (L₁,0,t₀)), the filter unit 22 of FIG. 25 obtains the pixel value that is the filtering result for a pixel at a position (x,y) in the block of interest R₄₃₀₀ by using, for example, the following pixel values: in the image data at time t+t₀, those near the position shifted from (x,y) by the motion vector (L₁,0); in the image data at time t+2t₀, those near the position shifted from (x,y) by the vector (2L₁,0), twice the motion vector (L₁,0); in the image data at time t+3t₀, those near the position shifted by the vector (3L₁,0), three times the motion vector; in the image data at time t+4t₀, those near the position shifted by the vector (4L₁,0), four times the motion vector; in the image data at time t-t₀, those near the position shifted by the vector (-L₁,0), -1 times the motion vector; in the image data at time t-2t₀, those near the position shifted by the vector (-2L₁,0), -2 times the motion vector; in the image data at time t-3t₀, those near the position shifted by the vector (-3L₁,0), -3 times the motion vector; and in the image data at time t-4t₀, those near the position shifted by the vector (-4L₁,0), -4 times the motion vector.

FIG. 56 shows the position, in the image data at time t+t₀, shifted from the position (x,y) of the block of interest R₄₃₀₀ by the motion vector (L₁,0). Similarly, FIGS. 57 to 63 respectively show the position in the image data at time t+2t₀ shifted from the position (x,y) of the block of interest R₄₃₀₀ by the vector (2L₁,0), twice the motion vector (L₁,0); the position in the image data at time t+3t₀ shifted by the vector (3L₁,0), three times the motion vector; the position in the image data at time t+4t₀ shifted by the vector (4L₁,0), four times the motion vector; the position in the image data at time t-t₀ shifted by the vector (-L₁,0), -1 times the motion vector; the position in the image data at time t-2t₀ shifted by the vector (-2L₁,0), -2 times the motion vector; the position in the image data at time t-3t₀ shifted by the vector (-3L₁,0), -3 times the motion vector; and the position in the image data at time t-4t₀ shifted by the vector (-4L₁,0), -4 times the motion vector.

Comparing FIGS. 56 to 63 with FIGS. 44 to 51 described above shows that, overall, the filtering of the block of interest R₄₃₀₀ in the case of FIGS. 56 to 63 uses pixel values at positions closer to the regions R₄₃₀₁, R₄₃₀₂, R₄₃₀₃, R₄₃₀₄, R₄₃₁₁, R₄₃₁₂, R₄₃₁₃, and R₄₃₁₄ corresponding to the block of interest R₄₃₀₀ than in the case of FIGS. 44 to 51.

Next, the processing performed by the principal component direction acquisition unit 31 of FIG. 52 will be described with reference to the flowcharts of FIGS. 64 to 67.

Note that the processing according to the flowcharts of FIGS. 64 to 67 corresponds to the processing of step S11 of FIG. 31.

In the principal component direction acquisition unit 31 of FIG. 52, the moving image data read from the buffer unit 21 (FIG. 25) is supplied to the buffer unit 101, and the buffer unit 101 temporarily stores that moving image data.

Then, in step S101 of FIG. 64, the block extraction unit 102 divides the moving image data stored in the buffer unit 101 into blocks of, for example, 16×16 pixels, as in step S21 of FIG. 32, and supplies the blocks to the correlation calculation unit 103. The subsequent processing is performed with each block obtained by the block extraction unit 102 taken in turn as the block of interest.

Here, the position of the block of interest (for example, the position of its upper-left pixel) is denoted (x₀,y₀). The frame of the block of interest (the frame of interest) is assumed to be the frame at time t.

After the processing of step S101, the process proceeds to step S102 of FIG. 65. The correlation calculation unit 103 reads from the buffer unit 101 the image data at time t+t₀, which is the image data of the frame following the frame of interest, and obtains the function E₁(u₁,v₁), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t+t₀, while varying the position (x₀+u₁,y₀+v₁) in the image data at time t+t₀ with which the block of interest is matched (that is, while varying (u₁,v₁)). The correlation calculation unit 103 supplies the function E₁(u₁,v₁) to the scaling synthesis unit 104 as the correlation information at time t+t₀, and the process proceeds to step S103.
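The search in step S102 can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of the correlation calculation unit 103: the function name `sad_surface`, the `search` range limit, the choice to skip displacements that fall outside the frame, and the toy 2×2 data are all hypothetical.

```python
def sad_surface(block, frame, x0, y0, search):
    """Return {(u, v): SAD} for displacements within +/-search (hypothetical helper).

    block[j][i] is the block of interest, whose upper-left pixel is at (x0, y0)
    in its own frame; frame[y][x] is the correlation calculation target frame.
    """
    h, w = len(block), len(block[0])
    surface = {}
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            top, left = y0 + v, x0 + u
            if top < 0 or left < 0 or top + h > len(frame) or left + w > len(frame[0]):
                continue  # assumption: skip displacements that leave the frame
            surface[(u, v)] = sum(
                abs(block[j][i] - frame[top + j][left + i])
                for j in range(h)
                for i in range(w)
            )
    return surface


# Toy data: a 2x2 pattern located at (x, y) = (3, 1) in the next frame.
block = [[10, 20], [30, 40]]
frame = [[0] * 8 for _ in range(5)]
for j in range(2):
    for i in range(2):
        frame[1 + j][3 + i] = block[j][i]

# The block of interest sits at (x0, y0) = (1, 1), so the true displacement is (2, 0).
surface = sad_surface(block, frame, x0=1, y0=1, search=2)
best = min(surface, key=surface.get)
```

Here `surface` plays the role of E₁(u₁,v₁), and `best` is the displacement at which its value is smallest, that is, where the correlation is highest.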

In step S103, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t+2t₀, which is the image data of the frame two frames after the frame of interest, and obtains the function E₂(u₂,v₂), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t+2t₀, while varying the position (x₀+u₂,y₀+v₂) in the image data at time t+2t₀ with which the block of interest is matched. The correlation calculation unit 103 supplies the function E₂(u₂,v₂) to the scaling synthesis unit 104 as the correlation information at time t+2t₀, and the process proceeds to step S104.

In step S104, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t+3t₀, which is the image data three frames after the frame of interest, and obtains the function E₃(u₃,v₃), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t+3t₀, while varying the position (x₀+u₃,y₀+v₃) in the image data at time t+3t₀ with which the block of interest is matched. The correlation calculation unit 103 supplies the function E₃(u₃,v₃) to the scaling synthesis unit 104 as the correlation information at time t+3t₀, and the process proceeds to step S105.

In step S105, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t+4t₀, which is the image data four frames after the frame of interest, and obtains the function E₄(u₄,v₄), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t+4t₀, while varying the position (x₀+u₄,y₀+v₄) in the image data at time t+4t₀ with which the block of interest is matched. The correlation calculation unit 103 supplies the function E₄(u₄,v₄) to the scaling synthesis unit 104 as the correlation information at time t+4t₀, and the process proceeds to step S106 of FIG. 66.

In step S106, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t-t₀, which is the image data of the frame preceding the frame of interest, and obtains the function F₁(r₁,s₁), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t-t₀, while varying the position (x₀+r₁,y₀+s₁) in the image data at time t-t₀ with which the block of interest is matched. The correlation calculation unit 103 supplies the function F₁(r₁,s₁) to the scaling synthesis unit 104 as the correlation information at time t-t₀, and the process proceeds to step S107.

In step S107, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t-2t₀, which is the image data of the frame two frames before the frame of interest, and obtains the function F₂(r₂,s₂), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t-2t₀, while varying the position (x₀+r₂,y₀+s₂) in the image data at time t-2t₀ with which the block of interest is matched. The correlation calculation unit 103 supplies the function F₂(r₂,s₂) to the scaling synthesis unit 104 as the correlation information at time t-2t₀, and the process proceeds to step S108.

In step S108, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t-3t₀, which is the image data three frames before the frame of interest, and obtains the function F₃(r₃,s₃), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t-3t₀, while varying the position (x₀+r₃,y₀+s₃) in the image data at time t-3t₀ with which the block of interest is matched. The correlation calculation unit 103 supplies the function F₃(r₃,s₃) to the scaling synthesis unit 104 as the correlation information at time t-3t₀, and the process proceeds to step S109.

In step S109, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t-4t₀, which is the image data four frames before the frame of interest, and obtains the function F₄(r₄,s₄), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t-4t₀, while varying the position (x₀+r₄,y₀+s₄) in the image data at time t-4t₀ with which the block of interest is matched. The correlation calculation unit 103 supplies the function F₄(r₄,s₄) to the scaling synthesis unit 104 as the correlation information at time t-4t₀, and the process proceeds to step S110 of FIG. 67.

In step S110, the scaling synthesis unit 104 scales the correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), E₄(u₄,v₄), F₁(r₁,s₁), F₂(r₂,s₂), F₃(r₃,s₃), and F₄(r₄,s₄) supplied from the correlation calculation unit 103 as described above, with the scale of the position (u₁,v₁) of the correlation information E₁(u₁,v₁) as the reference, and then combines the scaled correlation information to obtain the combined correlation information E(p,q). That is, the scaling synthesis unit 104 performs the scaling by setting the position (p,q) = (u₁,v₁) = (u₂/2,v₂/2) = (u₃/3,v₃/3) = (u₄/4,v₄/4) = (-r₁,-s₁) = (-r₂/2,-s₂/2) = (-r₃/3,-s₃/3) = (-r₄/4,-s₄/4), and then adds the scaled correlation information E₁(p,q), E₂(2p,2q), E₃(3p,3q), E₄(4p,4q), F₁(-p,-q), F₂(-2p,-2q), F₃(-3p,-3q), and F₄(-4p,-4q) to obtain the combined correlation information E(p,q) = E₁(p,q) + E₂(2p,2q) + E₃(3p,3q) + E₄(4p,4q) + F₁(-p,-q) + F₂(-2p,-2q) + F₃(-3p,-3q) + F₄(-4p,-4q).
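The scaling and addition of step S110 can be sketched as follows for correlation information sampled on an integer grid. This is an illustration under stated assumptions only: the names `combine_surfaces` and `synthetic_surface`, the dictionary representation keyed by frame offset n (n > 0 standing for E_n and n < 0 for F_|n|), and the synthetic surfaces that replace real sums of absolute differences are all hypothetical.

```python
OFFSETS = [1, 2, 3, 4, -1, -2, -3, -4]  # frame offsets n for times t + n*t0


def combine_surfaces(surfaces, search):
    """Sum surfaces[n] sampled at (n*p, n*q) over all frame offsets n.

    surfaces[n] maps a displacement (u, v) to a correlation value. For n > 0
    it plays the role of E_n, and for n < 0 the role of F_|n|, so that
    surfaces[-2][(-2*p, -2*q)] corresponds to F_2(-2p, -2q).
    """
    combined = {}
    for q in range(-search, search + 1):
        for p in range(-search, search + 1):
            combined[(p, q)] = sum(surfaces[n][(n * p, n * q)] for n in OFFSETS)
    return combined


def synthetic_surface(n, motion, search):
    """Toy SAD-like surface whose minimum lies at n * motion (assumed shape)."""
    reach = 4 * search  # the offsets +/-4 are sampled out to 4 * search
    return {
        (u, v): abs(u - n * motion[0]) + abs(v - n * motion[1])
        for v in range(-reach, reach + 1)
        for u in range(-reach, reach + 1)
    }


search = 3
surfaces = {n: synthetic_surface(n, motion=(2, -1), search=search) for n in OFFSETS}
combined = combine_surfaces(surfaces, search)
p0, q0 = min(combined, key=combined.get)  # position of the smallest combined value
```

One consequence visible in the sketch is that evaluating E₄(4p,4q) and F₄(-4p,-4q) on integer positions requires the correlation information for times t+4t₀ and t-4t₀ to cover four times the displacement range of the combined correlation information.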

Note that the combined correlation information E(p,q) can be obtained not only by simply adding the scaled correlation information but also, as described above, by weighted addition. That is, the combined correlation information E(p,q) can be obtained, for example, by calculating the expression 8×E₁(p,q) + 4×E₂(2p,2q) + 2×E₃(3p,3q) + 1×E₄(4p,4q) + 8×F₁(-p,-q) + 4×F₂(-2p,-2q) + 2×F₃(-3p,-3q) + 1×F₄(-4p,-4q), which adds the correlation information with larger weights for times closer to the time t of the block of interest.
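The effect of the weights 8, 4, 2, 1 can be illustrated with a one-dimensional sketch. Everything below is assumed for illustration: the helper names `combine` and `toy_curve`, and in particular the toy data, in which the block moves by 2 per frame near time t and by 3 per frame further away, so that the motion is not exactly constant.

```python
WEIGHTS = {1: 8, 2: 4, 3: 2, 4: 1}  # weight for frame offset |n|, as in the expression above
OFFSETS = [-4, -3, -2, -1, 1, 2, 3, 4]


def combine(curves, candidates, weighted):
    """Sum curves[n](n * p) over frame offsets n, optionally weighted by |n|."""
    return {
        p: sum(
            (WEIGHTS[abs(n)] if weighted else 1) * curves[n](n * p)
            for n in OFFSETS
        )
        for p in candidates
    }


def toy_curve(n):
    # Assumed toy data: 2 pixels per frame for |n| <= 2, 3 pixels per frame beyond.
    per_frame = 2 if abs(n) <= 2 else 3
    return lambda x: abs(x - n * per_frame)


curves = {n: toy_curve(n) for n in OFFSETS}
candidates = range(-5, 6)

plain_sum = combine(curves, candidates, weighted=False)
weighted_sum = combine(curves, candidates, weighted=True)
plain_motion = min(plain_sum, key=plain_sum.get)
weighted_motion = min(weighted_sum, key=weighted_sum.get)
```

With this toy data the simple sum settles on the motion suggested by the distant frames (3), whereas the weighted sum settles on the motion suggested by the frames nearest time t (2), which matches the stated intent of emphasizing images close to the time t of the block of interest.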

After obtaining the combined correlation information E(p,q) in step S110, the scaling synthesis unit 104 supplies the combined correlation information E(p,q) to the minimum value detection unit 105, and the process proceeds to step S111.

In step S111, the minimum value detection unit 105 detects, as the maximum correlation position (p₀,q₀), the position (p,q) in the spatial direction (a position relative to the block of interest) at which the correlation represented by the combined correlation information E(p,q) from the scaling synthesis unit 104 is highest; that is, it detects the position (p,q) that minimizes the "value" of the combined correlation information E(p,q) as the maximum correlation position (p₀,q₀). The minimum value detection unit 105 takes the vector to the maximum correlation position (p₀,q₀) as the average motion vector (p₀,q₀), and the process proceeds to step S112.

In step S112, the minimum value detection unit 105 adds the frame period t₀ of the original moving image data to the average motion vector (p₀,q₀) detected in step S111 as the component in the time direction t, and regards the direction orthogonal to the direction of the resulting three-dimensional motion vector (p₀,q₀,t₀) as the principal component direction. The minimum value detection unit 105 supplies the three-dimensional motion vector (p₀,q₀,t₀) to the filter information supply unit 32 (FIG. 25), and the processing ends.
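The directions orthogonal to the three-dimensional motion vector (p₀,q₀,t₀) form a plane. As a small sketch of that geometry, the following constructs two vectors spanning the plane and checks their orthogonality. The function name `orthogonal_plane_basis` and the particular choice of basis vectors are assumptions made for illustration; the sample vector uses t₀ = 1 as an arbitrary unit.

```python
def dot(a, b):
    """Dot product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Cross product of two 3-component vectors."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def orthogonal_plane_basis(p0, q0, t0):
    """Return two vectors spanning the plane orthogonal to (p0, q0, t0).

    Requires t0 != 0, which holds for a frame period.
    """
    m = (p0, q0, t0)
    a = (t0, 0, -p0)  # dot(m, a) = p0*t0 - t0*p0 = 0
    b = cross(m, a)   # orthogonal to both m and a
    return a, b


motion = (3, 0, 1)  # sample (p0, q0, t0), chosen only for illustration
a, b = orthogonal_plane_basis(*motion)
```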

In this case, the filter information supply unit 32 supplies, for example, the three-dimensional motion vector (p₀,q₀,t₀) supplied from the minimum value detection unit 105 of the principal component direction acquisition unit 31 to the filter unit 22 (FIG. 25) as the filter information without modification.

In the filter unit 22, processing according to the flowchart shown in FIG. 68 is performed as the processing of step S3 of FIG. 26.

That is, in step S131, the filter unit 22 acquires the three-dimensional motion vector (p₀,q₀,t₀) by receiving it as the filter information supplied from the filter information supply unit 32, and the process proceeds to step S132.

In step S132, the filter unit 22 performs filtering that obtains the pixel value C(t,x,y) of the pixel at the position (x,y) in the block of interest of the frame at time t by calculating, for example, the expression (1/16)×D(t-3t₀,x-3p₀,y-3q₀) + (2/16)×D(t-2t₀,x-2p₀,y-2q₀) + (3/16)×D(t-t₀,x-p₀,y-q₀) + (4/16)×D(t,x,y) + (3/16)×D(t+t₀,x+p₀,y+q₀) + (2/16)×D(t+2t₀,x+2p₀,y+2q₀) + (1/16)×D(t+3t₀,x+3p₀,y+3q₀). Here, D(t,x,y) represents the pixel value at the position (x,y) at time t of the moving image data used for the filtering.

Here, computing the expression (1/16)×D(t-3t₀,x-3p₀,y-3q₀)+(2/16)×D(t-2t₀,x-2p₀,y-2q₀)+(3/16)×D(t-t₀,x-p₀,y-q₀)+(4/16)×D(t,x,y)+(3/16)×D(t+t₀,x+p₀,y+q₀)+(2/16)×D(t+2t₀,x+2p₀,y+2q₀)+(1/16)×D(t+3t₀,x+3p₀,y+3q₀) is equivalent to filtering with an FIR (Finite Impulse Response) filter that has seven taps in the time direction (that is, filtering that uses the image data of seven frames), with the so-called tap coefficients 1/16, 2/16, 3/16, 4/16, 3/16, 2/16, and 1/16. An FIR filter with these tap coefficients is a low-pass filter.
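
The seven-tap weighted sum along the motion vector can be sketched as follows. This is a minimal illustration, not the implementation of the filter unit 22: the array layout D[frame][y][x], the use of frame indices in units of t₀, the restriction to integer p₀ and q₀, and the absence of any boundary handling are all assumptions made for the sketch, and the function name is hypothetical.

```python
from fractions import Fraction

# Tap coefficients for temporal offsets k = -3 .. +3, as listed in the text.
TAPS = [Fraction(n, 16) for n in (1, 2, 3, 4, 3, 2, 1)]

def filter_pixel(D, t, x, y, p0, q0):
    """Return C(t, x, y) for video D indexed as D[frame][y][x].

    Frame index t counts in units of the frame period t0, so the sample
    D(t + k*t0, x + k*p0, y + k*q0) is read as D[t + k][y + k*q0][x + k*p0].
    Assumes integer p0, q0 and that all seven samples lie inside the data.
    """
    total = Fraction(0)
    for k, w in zip(range(-3, 4), TAPS):
        total += w * D[t + k][y + k * q0][x + k * p0]
    return total

# Hypothetical test video: one row whose pattern shifts by 1 pixel per frame.
WIDTH = 12
pattern = [7, 1, 9, 4, 0, 8, 3, 6, 2, 5, 11, 10]
video = [[[pattern[(x - f) % WIDTH] for x in range(WIDTH)]] for f in range(7)]

# Filtering along the true motion (p0, q0) = (1, 0) averages seven copies of
# the same scene point, so the pixel value is unchanged.
along_motion = filter_pixel(video, 3, 5, 0, 1, 0)
# Filtering with the wrong vector (0, 0) mixes different scene points.
no_motion = filter_pixel(video, 3, 5, 0, 0, 0)
print(along_motion, no_motion)
```

Because the coefficients sum to 1, a pixel that is tracked correctly through all seven frames passes through the filter unchanged, whereas a wrong motion vector blends unrelated pixels.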

Note that, in step S132, the filtering that computes the pixel value C(t,x,y) is performed while downsampling the number of samples in the time direction to 1/4, that is, at a rate of one frame out of every four frames. In other words, t is, for example, a multiple of 4×t₀.
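
The tap coefficients fit naturally with 1/4 temporal downsampling, which the following sketch checks numerically. It evaluates the frequency response of the seven taps at a few angular frequencies, measured in radians per frame (a unit chosen here for the illustration). The coefficients 1, 2, 3, 4, 3, 2, 1 over 16 are a four-frame box average convolved with itself, so the response is zero at π/2 and π, which are the temporal frequencies that would alias onto the constant component when only one frame in four is kept.

```python
import math

# Tap coefficients for temporal offsets k = -3 .. +3.
TAPS = [n / 16 for n in (1, 2, 3, 4, 3, 2, 1)]

def gain(omega):
    """Frequency response at omega radians per frame.

    The taps are symmetric about k = 0, so the response is real-valued.
    """
    return sum(w * math.cos(k * omega) for k, w in zip(range(-3, 4), TAPS))

for label, omega in (("0", 0.0), ("pi/4", math.pi / 4),
                     ("pi/2", math.pi / 2), ("pi", math.pi)):
    print(label, gain(omega))
```

The printed gains are 1 at zero frequency and, up to floating-point rounding, 0 at π/2 and π, with an intermediate value at π/4.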

After step S132, the processing proceeds to step S133, where the filter unit 22 outputs the filtering result, that is, the pixel value C(t,x,y) obtained by computing the expression given for step S132, to the encoding unit 24 as the pixel value at position (x,y) at time t.

Although FIG. 68 performs filtering with an FIR filter having seven taps in the time direction, filtering may instead be performed with, for example, an FIR filter having nine taps in the time direction.

The computation of the expression for C(t,x,y) given for step S132 is equivalent to filtering with a low-pass filter, but it can also be regarded as a weighted addition in which frames closer to the frame of the target block are given larger weights.

When the pixel value C(t,x,y) is obtained as the filtering result by such a weighted addition, in which frames closer to the frame of the target block are given larger weights, it is desirable that the combination for obtaining the combined correlation information E(p,q) also be performed, as described above, by computing the expression 8×E₁(p,q)+4×E₂(2p,2q)+2×E₃(3p,3q)+1×E₄(4p,4q)+8×F₁(-p,-q)+4×F₂(-2p,-2q)+2×F₃(-3p,-3q)+1×F₄(-4p,-4q), which adds the pieces of correlation information with larger weights for times closer to the time t of the target block.
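
The weighted combination and the search for its minimum can be sketched as follows. The representation of each piece of correlation information as a Python callable, the search range, the function names, and the synthetic quadratic correlation maps are all assumptions made for illustration; only the weights 8, 4, 2, 1 and the scaled arguments (kp, kq) and (-kp, -kq) come from the expression in the text.

```python
WEIGHTS = (8, 4, 2, 1)   # weights for frames at distance 1, 2, 3, 4 from time t

def combined_correlation(E, F, p, q):
    """Weighted sum of scaled correlation maps.

    E[k-1](u, v) is the correlation information for the frame at t + k*t0,
    F[k-1](r, s) the one for the frame at t - k*t0 (smaller means more similar).
    """
    total = 0
    for k, w in enumerate(WEIGHTS, start=1):
        total += w * (E[k - 1](k * p, k * q) + F[k - 1](-k * p, -k * q))
    return total

def average_motion_vector(E, F, search=range(-4, 5)):
    """Return the (p, q) in the search window that minimises E(p, q)."""
    candidates = [(p, q) for p in search for q in search]
    return min(candidates,
               key=lambda pq: combined_correlation(E, F, pq[0], pq[1]))

# Hypothetical correlation maps for content moving by (2, -1) per frame:
# each map is smallest at the displacement the content really has in that frame.
TRUE = (2, -1)
E = [lambda u, v, k=k: (u - k * TRUE[0]) ** 2 + (v - k * TRUE[1]) ** 2
     for k in range(1, 5)]
F = [lambda r, s, k=k: (r + k * TRUE[0]) ** 2 + (s + k * TRUE[1]) ** 2
     for k in range(1, 5)]

print(average_motion_vector(E, F))  # (2, -1)
```

With these synthetic maps every term vanishes at (p,q)=(2,-1), so the combined value is minimised exactly at the per-frame motion.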

As described above, an average motion vector, which represents the average motion, so to speak, from the target block to each of a plurality of frames, is obtained by using not only the correlation information representing the correlation between the target block and the next frame but also a plurality of pieces of correlation information representing the correlation between the target block and each of a plurality of frames before and after it. A highly accurate principal component direction can therefore be obtained for the target block from that average motion vector.

Furthermore, the range of frames used to obtain the average motion vector corresponds to the range of frames used for filtering in the filter unit 22 (FIG. 25). That is, it is a range (in FIGS. 64 to 67, nine frames centered on the frame of the target block) that is almost the same as the frames used for filtering (in FIG. 68, seven frames centered on the frame of the target block). The filter unit 22 (FIG. 25) can therefore obtain an accurate filtering result (a filtering result containing only the frequency components that should be passed).

In FIG. 68, filtering uses seven frames centered on the frame of the target block, and in FIGS. 64 to 67 the average motion vector is obtained using nine frames centered on the frame of the target block. Alternatively, for example, both the filtering and the calculation of the average motion vector may use seven frames, or both may use nine frames, centered on the frame of the target block. It is also possible to perform filtering using nine frames centered on the frame of the target block and to calculate the average motion vector using seven such frames. In other words, the range of frames used for filtering and the range of frames used for calculating the average motion vector may coincide exactly, or may differ as long as they almost coincide.

As described above, by obtaining the average motion vector using a plurality of frames in a range almost the same as the range of frames used for filtering in the filter unit 22 (FIG. 25), a relatively accurate principal component direction over that range of frames can be obtained from the average motion vector.

As described above, the average motion vector basically makes it possible to obtain a relatively accurate principal component direction.

However, if the average motion vector is always obtained using all of the plurality of frames, an average motion vector with a large error may be obtained. In that case, the accuracy of the principal component direction also deteriorates, and as a result the filtering result of the filter unit 22 becomes an inappropriate moving image (an image that the receiving device 2 cannot restore to a moving image in which a human perceives no deterioration in image quality).

That is, suppose, for example, that the moving image data contains a stationary subject as the background and a moving subject as the foreground in front of it. When the target block contains only the background, the background portion corresponding to the target block may be hidden behind the foreground and invisible in another frame (a frame other than the target frame). In such a frame, the value of the correlation information with the target block is not necessarily minimized at the position of the background portion corresponding to the target block. If the combined correlation information is calculated using such correlation information and the average motion vector is then obtained, an average motion vector with a large error may result.

Specifically, assume that the moving image data input to the transmission device 1 is, for example, moving image data onto which the subjects shown in FIG. 69 are projected.

Here, the horizontal axis in FIG. 69 represents the position x in the spatial direction. The upper part of FIG. 69 shows the waveform of the subject P₆₉₀₁, which is the background, and the lower part of FIG. 69 shows the waveform of the subject P₆₉₀₂, which is the foreground.

The subject P₆₉₀₁ shown in the upper part of FIG. 69 is stationary, and the subject P₆₉₀₂ shown in the lower part of FIG. 69 is moving in the spatial direction x at a speed of g/t₀ [m/s].

In the case described above, image data at nine times (nine frames), from time t-4t₀ to time t+4t₀, was used to calculate the average motion vector. In the following, image data at seven times (seven frames), from time t-3t₀ to time t+3t₀, is used. Because the interval from time t-3t₀ to time t+3t₀ is as short as 6t₀, the speed of the subject P₆₉₀₂ during that interval can be regarded as (approximated as) constant, and the g/t₀ mentioned above represents that constant speed.

FIGS. 70 and 71 show the image data at each of the seven times (seven frames) from time t-3t₀ to time t+3t₀, onto which the subjects P₆₉₀₁ and P₆₉₀₂ shown in FIG. 69 are projected.

FIGS. 70 and 71 show, in order from the top, the image data at each of the seven times from time t-3t₀ to time t+3t₀. FIGS. 70 and 71 are identical except for the portions to which reference signs are attached.

In the processing according to the flowcharts of FIGS. 64 to 67 described above (hereinafter referred to as the first motion detection processing as appropriate), the image data at time t, for example, is divided into blocks (step S101 in FIG. 64). In FIGS. 70 and 71, the image data at time t is divided into 16 blocks, B₁ to B₁₆.

Suppose now, for example, that the first motion detection processing is performed with block B₃, which contains only the background subject P₆₉₀₁, as the target block. As shown in FIG. 70, the region corresponding to the target block B₃ in the image data at time t+t₀ is the region B₁₀₁ at the same position as the target block B₃. The value of the correlation information E₁(u₁,v₁) at time t+t₀, obtained in step S102 of FIG. 65, is therefore minimized at the position (u₁,v₁)=(0,0).

Similarly, in the image data at times t+2t₀, t+3t₀, t-t₀, t-2t₀, and t-3t₀, the regions corresponding to the target block B₃ are the regions B₂₀₁, B₃₀₁, B₋₁₀₁, B₋₂₀₁, and B₋₃₀₁, respectively, each at the same position as the target block B₃. Accordingly, the values of the correlation information E₂(u₂,v₂) at time t+2t₀, E₃(u₃,v₃) at time t+3t₀, F₁(r₁,s₁) at time t-t₀, F₂(r₂,s₂) at time t-2t₀, and F₃(r₃,s₃) at time t-3t₀ are all minimized when the positions (u₂,v₂), (u₃,v₃), (r₁,s₁), (r₂,s₂), and (r₃,s₃) are (0,0).

Therefore, in the first motion detection processing, the (p,q) that minimizes the combined correlation information E(p,q), which is obtained in step S111 of FIG. 67 by combining the scaled correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃), that is, the average motion vector (p₀,q₀), is (0,0). As a result, (0,0,t₀) is obtained as the three-dimensional motion vector for the target block B₃.

In this case, the filtering in FIG. 68 performs, for each pixel of the target block B₃, a weighted addition of the pixel values of the corresponding pixels in the regions B₋₃₀₁, B₋₂₀₁, B₋₁₀₁, B₃, B₁₀₁, B₂₀₁, and B₃₀₁, with weights 1/16, 2/16, 3/16, 4/16, 3/16, 2/16, and 1/16, respectively. These regions all correspond to the target block B₃. Filtering that uses them therefore yields appropriate data containing only the frequency components that human vision can perceive, that is, in this case, only the frequency components within the region R₁₀₀₁ (FIG. 20), which extends in the direction orthogonal to (0,0,t₀) (the principal component direction of the data of the target block B₃) and has a width of 2π/(4t₀) in the T direction.

Next, suppose, for example, that the first motion detection processing is performed with block B₁₀, which contains only the foreground subject P₆₉₀₂, as the target block. As shown in FIG. 70, the region corresponding to the target block B₁₀ in the image data at time t+t₀ is the region B₁₀₂, displaced by g in the x direction from the target block B₁₀. The value of the correlation information E₁(u₁,v₁) at time t+t₀, obtained in step S102 of FIG. 65, is therefore minimized at the position (u₁,v₁)=(g,0).

Similarly, in the image data at times t+2t₀, t+3t₀, t-t₀, t-2t₀, and t-3t₀, the regions corresponding to the target block B₁₀ are, respectively, the region B₂₀₂ displaced by 2g in the x direction from the target block B₁₀, the region B₃₀₂ displaced by 3g, the region B₋₁₀₂ displaced by -g, the region B₋₂₀₂ displaced by -2g, and the region B₋₃₀₂ displaced by -3g.

Accordingly, the value of the correlation information E₂(u₂,v₂) at time t+2t₀ is minimized when the position (u₂,v₂) is (2g,0), and the value of the correlation information E₃(u₃,v₃) at time t+3t₀ is minimized when the position (u₃,v₃) is (3g,0). Likewise, the value of the correlation information F₁(r₁,s₁) at time t-t₀ is minimized when the position (r₁,s₁) is (-g,0), the value of the correlation information F₂(r₂,s₂) at time t-2t₀ is minimized when the position (r₂,s₂) is (-2g,0), and the value of the correlation information F₃(r₃,s₃) at time t-3t₀ is minimized when the position (r₃,s₃) is (-3g,0).

From the above, in the first motion detection processing, the (p,q) that minimizes the combined correlation information E(p,q), which is obtained in step S111 of FIG. 67 by combining the scaled correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃), that is, the average motion vector (p₀,q₀), is (g,0). As a result, (g,0,t₀) is obtained as the three-dimensional motion vector for the target block B₁₀.
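
The foreground case can be reproduced with a small one-dimensional simulation. The sketch below is illustrative only: it assumes the sum of absolute differences as the correlation measure (the text requires only that smaller values mean higher correlation), a hypothetical non-repeating texture, g = 2 pixels per frame, and weights 8, 4, 2 for frames at distance 1, 2, 3, which this part of the text does not specify for the seven-frame case.

```python
G = 2          # displacement of the foreground per frame (the text's g); value is hypothetical
T = 3          # index of the target frame (time t) among frames 0..6
WIDTH = 40

def texture(z):
    # Hypothetical non-repeating foreground texture.
    return z * z

# Frame f shows the texture shifted by G * f pixels in the +x direction.
frames = [[texture(x - G * f) for x in range(WIDTH)] for f in range(7)]

X0, N = 16, 4                      # target block covers x in [X0, X0 + N) at time t
block = frames[T][X0:X0 + N]

def sad(f, u):
    """Sum of absolute differences between the target block and frame f shifted by u."""
    return sum(abs(frames[f][X0 + u + i] - block[i]) for i in range(N))

def best_shift(f, search=range(-9, 10)):
    """Displacement u that minimises the correlation value in frame f."""
    return min(search, key=lambda u: sad(f, u))

# Per-frame minima: expected at k * G for the frame at t + k*t0.
per_frame = {k: best_shift(T + k) for k in (-3, -2, -1, 1, 2, 3)}

def combined(p, weights=(8, 4, 2)):
    """Combine the scaled correlation values of the six neighbouring frames."""
    return sum(w * (sad(T + k, k * p) + sad(T - k, -k * p))
               for k, w in zip((1, 2, 3), weights))

p0 = min(range(-3, 4), key=combined)
print(per_frame, p0)
```

In this setup each per-frame minimum lies at k×g, and after scaling the combined value is minimised at p₀ = g, mirroring the result (g,0) derived above.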

In this case, the filtering in FIG. 68 performs, for each pixel of the target block B₁₀, a weighted addition of the pixel values of the corresponding pixels in the regions B₋₃₀₂, B₋₂₀₂, B₋₁₀₂, B₁₀, B₁₀₂, B₂₀₂, and B₃₀₂, with weights 1/16, 2/16, 3/16, 4/16, 3/16, 2/16, and 1/16, respectively. These regions all correspond to the target block B₁₀. Filtering that uses them therefore yields appropriate data containing only the frequency components that human vision can perceive, that is, in this case, only the frequency components within a region that extends in the direction orthogonal to (g,0,t₀) (the principal component direction of the data of the target block B₁₀) and has a width of 2π/(4t₀) in the T direction.

As described above, when the target block and the regions corresponding to the target block in the other frames all contain only the background subject P₆₉₀₁, or all contain only the foreground subject P₆₉₀₂, the principal component direction of the data of the target block is obtained accurately, and filtering based on that principal component direction yields appropriate data containing only the frequency components that human vision can perceive.

In contrast, when the target block, or a region corresponding to the target block in another frame, contains a plurality of subjects with different speeds, that is, when it contains a boundary between subjects with different speeds, an average motion vector with a large error may be obtained. In that case, the principal component direction of the data of the target block cannot be obtained accurately from such an average motion vector, and as a result filtering based on the principal component direction may fail to yield appropriate data containing only the frequency components that human vision can perceive.

That is, suppose, for example, that in FIG. 71 the first motion detection processing is performed with block B₇, which contains only the background subject P₆₉₀₁, as the target block. The region corresponding to the target block B₇ in the image data at time t+t₀ is the region B₁₁₁ at the same position as the target block B₇, so the value of the correlation information E₁(u₁,v₁) at time t+t₀, obtained in step S102 of FIG. 65, is minimized at the position (u₁,v₁)=(0,0).

Similarly, in the image data at times t+2t₀, t+3t₀, t-t₀, and t-2t₀, the regions corresponding to the target block B₇ are the regions B₂₁₁, B₃₁₁, B₋₁₁₁, and B₋₂₁₁, respectively, each at the same position as the target block B₇. Accordingly, the values of the correlation information E₂(u₂,v₂) at time t+2t₀, E₃(u₃,v₃) at time t+3t₀, F₁(r₁,s₁) at time t-t₀, and F₂(r₂,s₂) at time t-2t₀ are all minimized when the positions (u₂,v₂), (u₃,v₃), (r₁,s₁), and (r₂,s₂) are (0,0).

However, in the image data at time t-3t₀, the region corresponding to the target block B₇ should be the region B₋₃₁₁ at the same position as the target block B₇, but in FIG. 71 it is hidden behind the moving subject P₆₉₀₂ and does not appear in the image. Consequently, although the value of the correlation information F₃(r₃,s₃) at time t-3t₀ should ideally be minimized when the position (r₃,s₃) is (0,0), it is not necessarily minimized there.
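
The effect of occlusion on a single frame's correlation information can be illustrated in one dimension. Everything in the sketch below is a hypothetical setup chosen for the illustration: a static background ramp, a uniform foreground of the same width as the block moving 4 pixels per frame, and the sum of absolute differences as the correlation measure. Frame 0 plays the role of time t-3t₀, where the foreground covers the background block.

```python
WIDTH, T = 48, 3
X0, N = 12, 4            # background target block covers x in [X0, X0 + N)
G = 4                    # foreground moves G pixels per frame
FG_VALUE = 300           # uniform foreground intensity (hypothetical)

def make_frame(f):
    row = [10 * x for x in range(WIDTH)]        # static background ramp
    left = X0 + G * f                           # foreground hides the block at f = 0
    for x in range(left, left + N):
        row[x] = FG_VALUE
    return row

frames = [make_frame(f) for f in range(7)]
block = frames[T][X0:X0 + N]                    # target block at time t (frame 3)

def sad(f, u):
    """Sum of absolute differences between the target block and frame f shifted by u."""
    return sum(abs(frames[f][X0 + u + i] - block[i]) for i in range(N))

def best_shift(f, search=range(-8, 9)):
    return min(search, key=lambda u: sad(f, u))

# Frames in which the background block is visible: the minimum is at 0.
visible = {f: best_shift(f) for f in range(1, 7) if f != T}
# Frame 0 (time t - 3*t0): the block is hidden behind the foreground.
occluded = best_shift(0)
print(visible, occluded, sad(0, 0), sad(0, occluded))
```

In the five frames where the block is visible the minimum sits at displacement 0, whereas in the occluded frame the value at 0 is large and the minimum moves to neighbouring background that merely resembles the block.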

In the first motion detection processing, step S111 of FIG. 67 then obtains the (p,q) that minimizes the combined correlation information E(p,q), which is the result of combining the scaled correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃), that is, the average motion vector (p₀,q₀), and (p₀,q₀,t₀) is further obtained as the three-dimensional motion vector.

This three-dimensional motion vector (p₀,q₀,t₀) is derived from the average motion vector (p₀,q₀), which is the (p,q) that minimizes the combined correlation information E(p,q), and that E(p,q) was computed using the correlation information F₃(r₃,s₃), whose value is not necessarily minimized when the position (r₃,s₃) is (0,0). The direction orthogonal to the three-dimensional motion vector (p₀,q₀,t₀) therefore does not necessarily represent the principal component direction of the data of the target block B₇ accurately.

Filtering based on the direction orthogonal to such a three-dimensional motion vector (p₀,q₀,t₀) makes it difficult to obtain appropriate data containing only the frequency components that human vision can perceive.

Next, suppose, for example, that in FIG. 71 the first motion detection processing is performed with block B₁₄, which contains only the background subject P₆₉₀₁, as the target block. The region corresponding to the target block B₁₄ in the image data at time t+t₀ is the region B₁₁₂ at the same position as the target block B₁₄, so the value of the correlation information E₁(u₁,v₁) at time t+t₀, obtained in step S102 of FIG. 65, is minimized at the position (u₁,v₁)=(0,0).

Similarly, in the image data at times t-t₀, t-2t₀, and t-3t₀, the regions corresponding to the target block B₁₄ are the regions B₋₁₁₂, B₋₂₁₂, and B₋₃₁₂, respectively, each at the same position as the target block B₁₄. Accordingly, the values of the correlation information F₁(r₁,s₁) at time t-t₀, F₂(r₂,s₂) at time t-2t₀, and F₃(r₃,s₃) at time t-3t₀ are all minimized when the positions (r₁,s₁), (r₂,s₂), and (r₃,s₃) are (0,0).

However, in the image data at time t+2t₀, the region corresponding to the target block B₁₄ should be the region B₂₁₂ at the same position as the target block B₁₄, but it is hidden behind the moving subject P₆₉₀₂ and does not appear in the image. Consequently, although the value of the correlation information E₂(u₂,v₂) at time t+2t₀ should ideally be minimized when the position (u₂,v₂) is (0,0), it is not necessarily minimized there.

Similarly, in the image data at time t+3t₀, the region corresponding to the target block B₁₄ should be the region B₃₁₂ at the same position as the target block B₁₄, but it is hidden behind the moving subject P₆₉₀₂ and does not appear in the image. Consequently, although the value of the correlation information E₃(u₃,v₃) at time t+3t₀ should ideally be minimized when the position (u₃,v₃) is (0,0), it is not necessarily minimized there.

The three-dimensional motion vector (p₀,q₀,t₀) obtained for the target block B₁₄ is derived from the average motion vector (p₀,q₀), which is the (p,q) that minimizes the combined correlation information E(p,q), and that E(p,q) was computed using the correlation information E₂(u₂,v₂), whose value is not necessarily minimized when the position (u₂,v₂) is (0,0), and the correlation information E₃(u₃,v₃), whose value is not necessarily minimized when the position (u₃,v₃) is (0,0). The direction orthogonal to the three-dimensional motion vector (p₀,q₀,t₀) therefore does not necessarily represent the principal component direction of the data of the target block B₁₄ accurately.

Next, suppose, for example, that in FIG. 71 the first motion detection processing is performed with block B₉, which contains both the background subject P₆₉₀₁ and the foreground subject P₆₉₀₂, as the target block. The target block B₉ contains both the stationary subject P₆₉₀₁ and the moving subject P₆₉₀₂; that is, at a certain position the moving subject P₆₉₀₂ hides the stationary subject P₆₉₀₁. A region corresponding to such a target block B₉ therefore exists in none of the image data at times t+t₀, t+2t₀, t+3t₀, t-t₀, t-2t₀, and t-3t₀.

Consequently, the values of the correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃) at times t+t₀, t+2t₀, t+3t₀, t-t₀, t-2t₀, and t-3t₀ are each minimized at the positions (u₁,v₁), (u₂,v₂), (u₃,v₃), (r₁,s₁), (r₂,s₂), and (r₃,s₃) that are most similar to the target block B₉ in the image data at those times.

The three-dimensional motion vector (p₀,q₀,t₀) obtained for the target block B₉ is derived from the average motion vector (p₀,q₀), which is the (p,q) that minimizes the combined correlation information E(p,q), and that E(p,q) was computed using the correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃), each of which is merely minimized at the position most similar to the target block B₉. The direction orthogonal to the three-dimensional motion vector (p₀,q₀,t₀) therefore does not necessarily represent the principal component direction of the data of the target block B₉ accurately.

As described above, when the target block, or a region corresponding to the target block in another frame, contains a plurality of subjects with different speeds, that is, when it contains a boundary between subjects with different speeds, the first motion detection processing cannot accurately obtain the average motion vector, and hence the principal component direction of the data of the target block. Filtering based on such a principal component direction can then make it difficult to obtain appropriate data containing only the frequency components that human vision can perceive.

FIG. 72 therefore shows a configuration example of the principal component direction acquisition unit 31 of FIG. 25 that performs processing (hereinafter referred to as the second motion detection process, as appropriate) for accurately obtaining the average motion vector, and hence the principal component direction of the data of the block of interest, even when the block of interest, or a region in another frame corresponding to the block of interest, contains a plurality of subjects with different speeds.

In the figure, parts corresponding to those in FIG. 52 are denoted by the same reference numerals, and their description is omitted below as appropriate. That is, the principal component direction acquisition unit 31 of FIG. 72 is configured in the same way as in FIG. 52, except that a scaling synthesis unit 124 and a minimum value detection unit 125 are provided in place of the scaling synthesis unit 104 and the minimum value detection unit 105, respectively.

Like the scaling synthesis unit 104 of FIG. 52, the scaling synthesis unit 124 scales, in the spatial directions x and y, the correlation information supplied from the correlation calculation unit 103 for each of the plurality of correlation calculation target frames, and then combines the scaled correlation information to obtain combined correlation information. However, the scaling synthesis unit 124 obtains a plurality of pieces of combined correlation information by varying the number of pieces of scaled correlation information that are combined. The scaling synthesis unit 124 then supplies the plurality of pieces of combined correlation information to the minimum value detection unit 125.

The minimum value detection unit 125 obtains the maximum combined correlation information, that is, the combined correlation information with the highest correlation among the plurality of pieces of combined correlation information from the scaling synthesis unit 124. Specifically, the minimum value detection unit 125 obtains the minimum of the "value" of each piece of combined correlation information from the scaling synthesis unit 124, and then takes, as the maximum combined correlation information, the piece of combined correlation information whose minimum value is the smallest.

The minimum value detection unit 125 further detects, as the maximum correlation position, the spatial position at which the correlation represented by the maximum combined correlation information is maximum, that is, the spatial position that minimizes the "value" of the maximum combined correlation information, and obtains the vector to that maximum correlation position as the average motion vector. The minimum value detection unit 125 then obtains a three-dimensional motion vector by adding to the average motion vector, as the component in the time direction t, the frame period t₀ of the moving image data stored in the buffer unit 21 (FIG. 25), and detects and outputs the direction orthogonal to that three-dimensional motion vector as the principal component direction of the block of interest.
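The selection performed by the minimum value detection unit 125 can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the function names `pick_motion` and `is_orthogonal`, the dictionary layout of the combined correlation information, the candidate labels, and the toy values are all assumptions introduced here, and the frame period is set to 1 in arbitrary units.

```python
def pick_motion(candidates, t0):
    """candidates: {label: {(p, q): value}}; a smaller value means a higher correlation."""
    # Minimum value (and its position) of each piece of combined correlation information.
    minima = {label: min(e.items(), key=lambda kv: kv[1])
              for label, e in candidates.items()}
    # The candidate whose minimum is smallest plays the role of the
    # "maximum combined correlation information".
    best_label = min(minima, key=lambda label: minima[label][1])
    (p0, q0), _ = minima[best_label]
    # Append the frame period as the time component of the 3-D motion vector.
    return best_label, (p0, q0, t0)


def is_orthogonal(a, b):
    """True if the dot product of the two 3-D vectors is zero."""
    return sum(x * y for x, y in zip(a, b)) == 0


# Toy combined correlation maps (hypothetical values).
candidates = {
    "past3_future3": {(0, 0): 4.0, (1, 0): 2.5},
    "past0_future3": {(0, 0): 3.0, (2, 0): 0.5},
}
label, motion = pick_motion(candidates, t0=1)
```

In this toy case the candidate built from future frames only has the smallest minimum, so the three-dimensional motion vector is (2, 0, 1); vectors such as (1, 0, -2) and (0, 1, 0) are orthogonal to it and therefore lie in the principal component direction.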

The minimum value detection unit 125 also obtains a kernel size, described later, which represents the range of those frames, among the plurality of frames used to obtain the combined correlation information, whose correlation information was combined to obtain the maximum combined correlation information. It obtains this kernel size as the effective range of the motion vector (average motion vector) of the block of interest, that is, as the effective range of the principal component direction of the block of interest, and outputs it together with the principal component direction of the block of interest.

Next, the processing of the filter generation unit 23 of FIG. 25 when the principal component direction acquisition unit 31 is configured as shown in FIG. 72 will be described with reference to the flowcharts of FIGS. 73 to 77.

Of the processing of steps S151 to S176 according to the flowcharts of FIGS. 73 to 77, the processing of steps S151 to S172 is the second motion detection process and corresponds to the processing of step S11 of FIG. 31. The processing of steps S173 to S175 corresponds to the processing of step S12 of FIG. 31, and the processing of step S176 corresponds to the processing of step S13 of FIG. 31.

The second motion detection process exploits a property, so to speak, of moving images onto which a plurality of subjects with different speeds are projected. Of the plurality of frames for which correlation information with the block of interest is calculated, only the correlation information obtained for frames that contain a region corresponding to the block of interest is combined into combined correlation information, and that combined correlation information is used to obtain an average motion vector from which an accurate principal component direction of the block of interest can be derived.

For example, as shown in FIGS. 69 to 71, the seven frames from time t-3t₀ to time t+3t₀, during which the moving subject P₆₉₀₂ is in front of the stationary subject P₆₉₀₁, span a time as short as 6t₀, so the speed of the moving subject P₆₉₀₂ can be approximated as constant.

When a block of the image data at time t is taken as the block of interest and the moving subject P₆₉₀₂ is moving away from that block, a region corresponding to the block of interest exists in the image data at times later than time t (future times). For example, when block B₇ in FIG. 71 is the block of interest, the subject P₆₉₀₂ moves away from the block of interest B₇, and regions B₁₁₁, B₂₁₁, and B₃₁₁ corresponding to the block of interest B₇ exist at the later times t+t₀, t+2t₀, and t+3t₀, respectively.

When the moving subject P₆₉₀₂ is approaching the block of interest, a region corresponding to the block of interest exists in the image data at times earlier than time t (past times). For example, when block B₁₄ in FIG. 71 is the block of interest, the subject P₆₉₀₂ approaches the block of interest B₁₄, and regions B₋₁₁₂, B₋₂₁₂, and B₋₃₁₂ corresponding to the block of interest B₁₄ exist at the earlier times t-t₀, t-2t₀, and t-3t₀, respectively.

When the block of interest contains both the stationary subject P₆₉₀₁ and the moving subject P₆₉₀₂, no region corresponding to the block of interest exists in the image data at the times before or after time t. For example, when block B₉ in FIG. 71 is the block of interest, the block of interest B₉ contains both the stationary subject P₆₉₀₁ and the moving subject P₆₉₀₂, and no region corresponding to the block of interest B₉ exists at any of the earlier times t-t₀, t-2t₀, t-3t₀ or the later times t+t₀, t+2t₀, t+3t₀.

The second motion detection process performs processing that exploits this property of moving images onto which a plurality of subjects with different speeds are projected.

That is, in the principal component direction acquisition unit 31 of FIG. 72, the moving image data read from the buffer unit 21 (FIG. 25) is supplied to the buffer unit 101, and the buffer unit 101 temporarily stores that moving image data.

In step S151 of FIG. 73, the block extraction unit 102 divides the moving image data stored in the buffer unit 101 into blocks of, for example, 16 × 16 pixels, as in step S101 of FIG. 64, and supplies the blocks to the correlation calculation unit 103. The subsequent processing is performed with the blocks obtained by the block extraction unit 102 taken, one after another, as the block of interest.

Here, as in FIGS. 64 to 67, the position of the block of interest (for example, the position of its upper-left pixel) is denoted (x₀,y₀). The frame of the block of interest (the frame of interest) is assumed to be the frame at time t.

After step S151, the process proceeds to step S152. The correlation calculation unit 103 reads from the buffer unit 101 the image data at time t+t₀, which is the image data of the frame following the frame of interest, and obtains the function E₁(u₁,v₁), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t+t₀, while varying the position (x₀+u₁,y₀+v₁) in the image data at time t+t₀ with which the block of interest is aligned. It supplies the function E₁(u₁,v₁) to the scaling synthesis unit 124 as the correlation information at time t+t₀, and the process proceeds to step S153.
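The correlation information of step S152 is a sum of absolute differences evaluated over a range of displacements. The following is a minimal sketch under stated assumptions: the function name `sad_map`, the list-of-rows image layout, the search range parameter `search`, and the handling of displacements that fall outside the frame (they are skipped) are choices made here for illustration and are not taken from the embodiment.

```python
def sad_map(block, frame, x0, y0, search):
    """Return {(u, v): SAD} for displacements within +/-search pixels.

    block and frame are lists of rows, indexed [y][x]; (x0, y0) is the
    upper-left pixel of the block of interest in the frame of interest.
    """
    h, w = len(block), len(block[0])
    result = {}
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            top, left = y0 + v, x0 + u
            # Skip displacements that push the block outside the frame.
            if top < 0 or left < 0 or top + h > len(frame) or left + w > len(frame[0]):
                continue
            result[(u, v)] = sum(
                abs(block[j][i] - frame[top + j][left + i])
                for j in range(h) for i in range(w)
            )
    return result


# A 2x2 block located at (1, 1) in the frame of interest reappears
# one pixel to the right in the next frame.
block = [[9, 8], [7, 6]]
next_frame = [
    [0, 0, 0, 0, 0],
    [0, 0, 9, 8, 0],
    [0, 0, 7, 6, 0],
    [0, 0, 0, 0, 0],
]
e1 = sad_map(block, next_frame, x0=1, y0=1, search=1)
best = min(e1, key=e1.get)
```

Here `e1` plays the role of E₁(u₁,v₁), and its minimum lies at the displacement (1, 0), where the block matches exactly.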

In step S153, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t+2t₀, which is the image data of the second frame after the frame of interest, and obtains the function E₂(u₂,v₂), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t+2t₀, while varying the position (x₀+u₂,y₀+v₂) in the image data at time t+2t₀ with which the block of interest is aligned. It supplies the function E₂(u₂,v₂) to the scaling synthesis unit 124 as the correlation information at time t+2t₀, and the process proceeds to step S154.

In step S154, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t+3t₀, which is the image data three frames later than the frame of interest, and obtains the function E₃(u₃,v₃), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t+3t₀, while varying the position (x₀+u₃,y₀+v₃) in the image data at time t+3t₀ with which the block of interest is aligned. It supplies the function E₃(u₃,v₃) to the scaling synthesis unit 124 as the correlation information at time t+3t₀, and the process proceeds to step S155 of FIG. 74.

In step S155, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t-t₀, which is the image data of the frame preceding the frame of interest, and obtains the function F₁(r₁,s₁), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t-t₀, while varying the position (x₀+r₁,y₀+s₁) in the image data at time t-t₀ with which the block of interest is aligned. It supplies the function F₁(r₁,s₁) to the scaling synthesis unit 124 as the correlation information at time t-t₀, and the process proceeds to step S156.

In step S156, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t-2t₀, which is the image data of the second frame before the frame of interest, and obtains the function F₂(r₂,s₂), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t-2t₀, while varying the position (x₀+r₂,y₀+s₂) in the image data at time t-2t₀ with which the block of interest is aligned. It supplies the function F₂(r₂,s₂) to the scaling synthesis unit 124 as the correlation information at time t-2t₀, and the process proceeds to step S157.

In step S157, the correlation calculation unit 103 reads from the buffer unit 101 the image data at time t-3t₀, which is the image data three frames earlier than the frame of interest, and obtains the function F₃(r₃,s₃), which represents the sum of absolute differences between corresponding pixels of the block of interest and the image data at time t-3t₀, while varying the position (x₀+r₃,y₀+s₃) in the image data at time t-3t₀ with which the block of interest is aligned. It supplies the function F₃(r₃,s₃) to the scaling synthesis unit 124 as the correlation information at time t-3t₀, and the process proceeds to step S158.

In step S158, the scaling synthesis unit 124 scales all six pieces of correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), F₃(r₃,s₃) supplied from the correlation calculation unit 103, using the scale of the position (u₁,v₁) of the correlation information E₁(u₁,v₁) as the reference, and combines the scaled correlation information to obtain the combined correlation information E(p,q). That is, the scaling synthesis unit 124 performs the scaling by setting (p,q) = (u₁,v₁) = (u₂/2,v₂/2) = (u₃/3,v₃/3) = (-r₁,-s₁) = (-r₂/2,-s₂/2) = (-r₃/3,-s₃/3), and then adds the scaled correlation information E₁(p,q), E₂(2p,2q), E₃(3p,3q), F₁(-p,-q), F₂(-2p,-2q), F₃(-3p,-3q) to obtain the combined correlation information E(p,q) = E₁(p,q) + E₂(2p,2q) + E₃(3p,3q) + F₁(-p,-q) + F₂(-2p,-2q) + F₃(-3p,-3q).

The combined correlation information E(p,q) can be obtained not only by simply adding the scaled correlation information but also, as described above, by weighted addition.

In this way, in step S158, the scaling synthesis unit 124 obtains the combined correlation information E(p,q) that combines all of the correlation information E₁(u₁,v₁) at time t+t₀, E₂(u₂,v₂) at time t+2t₀, E₃(u₃,v₃) at time t+3t₀, F₁(r₁,s₁) at time t-t₀, F₂(r₂,s₂) at time t-2t₀, and F₃(r₃,s₃) at time t-3t₀, that is, a total of six pieces of correlation information for the three frames later than and the three frames earlier than time t of the frame of interest. It then normalizes the combined correlation information E(p,q) by dividing it by 6, the number of pieces of correlation information combined, and adds a predetermined offset value L₃₃ to the normalized result to obtain the final combined correlation information E(p,q) = {E₁(p,q) + E₂(2p,2q) + E₃(3p,3q) + F₁(-p,-q) + F₂(-2p,-2q) + F₃(-3p,-3q)}/6 + L₃₃. The scaling synthesis unit 124 supplies the final combined correlation information E(p,q) to the minimum value detection unit 125, and the process proceeds from step S158 to step S159.
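The scaling, addition, normalization, and offset of step S158 can be written compactly if each piece of correlation information is indexed by its signed frame offset k (+1, +2, +3 for E₁, E₂, E₃ and -1, -2, -3 for F₁, F₂, F₃), because every term then has the form E_k(kp, kq). The sketch below assumes this indexing; the function names `combine` and `toy_map`, the dictionary representation, the toy correlation maps, and the offset value 0.5 standing in for L₃₃ are illustrative assumptions only.

```python
def combine(maps, offset, positions):
    """Combine scaled correlation maps into E(p, q).

    maps: {k: E_k}, where k is the signed frame offset and E_k maps a
    displacement (u, v) to a sum of absolute differences.
    """
    n = len(maps)
    combined = {}
    for (p, q) in positions:
        # Under constant velocity, the displacement in frame t + k*t0 is (k*p, k*q).
        total = sum(e[(k * p, k * q)] for k, e in maps.items())
        # Normalize by the number of maps combined, then add the offset.
        combined[(p, q)] = total / n + offset
    return combined


def toy_map(k, velocity=(1, 0), reach=3):
    """Toy correlation map whose minimum (zero) lies at k times the velocity."""
    return {(u, v): abs(u - k * velocity[0]) + abs(v - k * velocity[1])
            for u in range(-reach, reach + 1) for v in range(-reach, reach + 1)}


maps = {k: toy_map(k) for k in (1, 2, 3, -1, -2, -3)}
positions = [(p, q) for p in (-1, 0, 1) for q in (-1, 0, 1)]
e = combine(maps, offset=0.5, positions=positions)
best = min(e, key=e.get)
```

With all six toy maps consistent with a velocity of one pixel per frame in x, the combined value is smallest at (p, q) = (1, 0). Dropping entries from `maps` gives the reduced combinations used in the later steps.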

In step S159, the minimum value detection unit 125 detects the spatial position (p,q) = (p₃₃,q₃₃) at which the correlation represented by the combined correlation information E(p,q) supplied from the scaling synthesis unit 124 in step S158, which combines the correlation information for the three frames later than and the three frames earlier than time t of the frame of interest, is maximum, that is, the position that minimizes the "value" of the combined correlation information E(p,q). It stores (p₃₃,q₃₃) together with the "value" of the combined correlation information E(p₃₃,q₃₃) at the maximum correlation position, that is, the minimum value E₃₃, and the process proceeds to step S160 of FIG. 75.

In step S160, the scaling synthesis unit 124 scales five of the six pieces of correlation information supplied from the correlation calculation unit 103, namely E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), and F₂(r₂,s₂), using the scale of the position (u₁,v₁) of the correlation information E₁(u₁,v₁) as the reference, and combines the scaled correlation information to obtain the combined correlation information E(p,q). That is, the scaling synthesis unit 124 performs the scaling by setting (p,q) = (u₁,v₁) = (u₂/2,v₂/2) = (u₃/3,v₃/3) = (-r₁,-s₁) = (-r₂/2,-s₂/2), and then adds the scaled correlation information E₁(p,q), E₂(2p,2q), E₃(3p,3q), F₁(-p,-q), F₂(-2p,-2q) to obtain the combined correlation information E(p,q) = E₁(p,q) + E₂(2p,2q) + E₃(3p,3q) + F₁(-p,-q) + F₂(-2p,-2q).

In this way, in step S160, the scaling synthesis unit 124 obtains the combined correlation information E(p,q) that combines the five pieces of correlation information E₁(u₁,v₁) at time t+t₀, E₂(u₂,v₂) at time t+2t₀, E₃(u₃,v₃) at time t+3t₀, F₁(r₁,s₁) at time t-t₀, and F₂(r₂,s₂) at time t-2t₀, that is, the correlation information for the three frames later than and the two frames earlier than time t of the frame of interest. It then normalizes the combined correlation information E(p,q) by dividing it by 5, the number of pieces of correlation information combined, and adds a predetermined offset value L₂₃ to the normalized result to obtain the final combined correlation information E(p,q) = {E₁(p,q) + E₂(2p,2q) + E₃(3p,3q) + F₁(-p,-q) + F₂(-2p,-2q)}/5 + L₂₃. The scaling synthesis unit 124 supplies the final combined correlation information E(p,q) to the minimum value detection unit 125, and the process proceeds from step S160 to step S161.

In step S161, the minimum value detection unit 125 detects the spatial position (p,q) = (p₂₃,q₂₃) at which the correlation represented by the combined correlation information E(p,q) supplied from the scaling synthesis unit 124 in step S160, which combines the correlation information for the three frames later than and the two frames earlier than time t of the frame of interest, is maximum, that is, the position that minimizes the "value" of the combined correlation information E(p,q). It stores (p₂₃,q₂₃) together with the "value" of the combined correlation information E(p₂₃,q₂₃) at the maximum correlation position, that is, the minimum value E₂₃, and the process proceeds to step S162.

In step S162, the scaling synthesis unit 124 scales four of the six pieces of correlation information supplied from the correlation calculation unit 103, namely E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), and F₁(r₁,s₁), using the scale of the position (u₁,v₁) of the correlation information E₁(u₁,v₁) as the reference, and combines the scaled correlation information to obtain the combined correlation information E(p,q). That is, the scaling synthesis unit 124 performs the scaling by setting (p,q) = (u₁,v₁) = (u₂/2,v₂/2) = (u₃/3,v₃/3) = (-r₁,-s₁), and then adds the scaled correlation information E₁(p,q), E₂(2p,2q), E₃(3p,3q), F₁(-p,-q) to obtain the combined correlation information E(p,q) = E₁(p,q) + E₂(2p,2q) + E₃(3p,3q) + F₁(-p,-q).

In this way, in step S162, the scaling synthesis unit 124 obtains the combined correlation information E(p,q) that combines the four pieces of correlation information E₁(u₁,v₁) at time t+t₀, E₂(u₂,v₂) at time t+2t₀, E₃(u₃,v₃) at time t+3t₀, and F₁(r₁,s₁) at time t-t₀, that is, the correlation information for the three frames later than and the one frame earlier than time t of the frame of interest. It then normalizes the combined correlation information E(p,q) by dividing it by 4, the number of pieces of correlation information combined, and adds a predetermined offset value L₁₃ to the normalized result to obtain the final combined correlation information E(p,q) = {E₁(p,q) + E₂(2p,2q) + E₃(3p,3q) + F₁(-p,-q)}/4 + L₁₃. The scaling synthesis unit 124 supplies the final combined correlation information E(p,q) to the minimum value detection unit 125, and the process proceeds from step S162 to step S163.

In step S163, the minimum value detection unit 125 detects the spatial position (p,q) = (p₁₃,q₁₃) at which the correlation represented by the combined correlation information E(p,q) supplied from the scaling synthesis unit 124 in step S162, which combines the correlation information for the three frames later than and the one frame earlier than time t of the frame of interest, is maximum, that is, the position that minimizes the "value" of the combined correlation information E(p,q). It stores (p₁₃,q₁₃) together with the "value" of the combined correlation information E(p₁₃,q₁₃) at the maximum correlation position, that is, the minimum value E₁₃, and the process proceeds to step S164.

In step S164, the scaling synthesis unit 124 scales three of the six pieces of correlation information supplied from the correlation calculation unit 103, namely E₁(u₁,v₁), E₂(u₂,v₂), and E₃(u₃,v₃), using the scale of the position (u₁,v₁) of the correlation information E₁(u₁,v₁) as the reference, and combines the scaled correlation information to obtain the combined correlation information E(p,q). That is, the scaling synthesis unit 124 performs the scaling by setting (p,q) = (u₁,v₁) = (u₂/2,v₂/2) = (u₃/3,v₃/3), and then adds the scaled correlation information E₁(p,q), E₂(2p,2q), E₃(3p,3q) to obtain the combined correlation information E(p,q) = E₁(p,q) + E₂(2p,2q) + E₃(3p,3q).

In this way, in step S164, the scaling synthesis unit 124 obtains the combined correlation information E(p,q) that combines the three pieces of correlation information E₁(u₁,v₁) at time t+t₀, E₂(u₂,v₂) at time t+2t₀, and E₃(u₃,v₃) at time t+3t₀, that is, the correlation information for the three frames later than time t of the frame of interest. It then normalizes the combined correlation information E(p,q) by dividing it by 3, the number of pieces of correlation information combined, and adds a predetermined offset value L₀₃ to the normalized result to obtain the final combined correlation information E(p,q) = {E₁(p,q) + E₂(2p,2q) + E₃(3p,3q)}/3 + L₀₃. The scaling synthesis unit 124 supplies the final combined correlation information E(p,q) to the minimum value detection unit 125, and the process proceeds from step S164 to step S165.

In step S165, the minimum value detection unit 125 receives the combined correlation information E(p,q) supplied from the scaling synthesis unit 124 in step S164, which combines the correlation information for each of the three frames temporally after time t of the frame of interest. It detects the spatial position (p,q)=(p₀₃,q₀₃) at which the correlation represented by E(p,q) is maximized, that is, the position (p₀₃,q₀₃) that minimizes the "value" of the combined correlation information E(p,q). It stores this maximum correlation position (p₀₃,q₀₃) together with the "value" of the combined correlation information at that position, E(p₀₃,q₀₃), that is, the minimum value E₀₃, and the process proceeds to step S166 in FIG. 76.
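
The scaling, normalization, offset addition, and minimum search of steps S164 and S165 can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation described in the source: each piece of correlation information is modelled as a Python function that returns a dissimilarity (a smaller value means a higher correlation), and the names `combine_forward` and `argmin_position`, the search range, the test motion, and the offset value 0.3 are all hypothetical.

```python
from itertools import product

def combine_forward(E, offset):
    """Return E(p, q) = (E1(p,q) + E2(2p,2q) + E3(3p,3q)) / 3 + offset.

    E is a list [E1, E2, E3] of callables; E[i - 1](u, v) is the correlation
    information for the frame i frame periods after the frame of interest.
    """
    n = len(E)

    def combined(p, q):
        # Scaling: the i-th later frame is evaluated at (i*p, i*q).
        total = sum(E[i - 1](i * p, i * q) for i in range(1, n + 1))
        # Normalization by the number of combined pieces, then the offset.
        return total / n + offset

    return combined

def argmin_position(f, search):
    """Exhaustively search integer positions with |p|, |q| <= search."""
    positions = product(range(-search, search + 1), repeat=2)
    return min(positions, key=lambda pq: f(*pq))

# Hypothetical data: a block that moves 2 pixels per frame period in x.
g = 2
E = [lambda u, v, i=i: abs(u - i * g) + abs(v) for i in (1, 2, 3)]

E_combined = combine_forward(E, offset=0.3)   # 0.3 stands in for L03
p03, q03 = argmin_position(E_combined, search=4)
E03 = E_combined(p03, q03)
```

With this synthetic input, the minimum lies at the per-frame displacement of the block, and the stored minimum value equals the offset because every individual correlation value is 0 there.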

In step S166, the scaling synthesis unit 124 takes five pieces of correlation information, E₁(u₁,v₁), E₂(u₂,v₂), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃), out of the six pieces of correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃) supplied from the correlation calculation unit 103, scales them with reference to the scale of the position (u₁,v₁) of the correlation information E₁(u₁,v₁), and combines the scaled correlation information to obtain combined correlation information E(p,q). That is, the scaling synthesis unit 124 performs the scaling by setting the position (p,q)=(u₁,v₁)=(u₂/2,v₂/2)=(-r₁,-s₁)=(-r₂/2,-s₂/2)=(-r₃/3,-s₃/3), and then adds the scaled correlation information E₁(p,q), E₂(2p,2q), F₁(-p,-q), F₂(-2p,-2q), and F₃(-3p,-3q) to obtain the combined correlation information E(p,q) (=E₁(p,q)+E₂(2p,2q)+F₁(-p,-q)+F₂(-2p,-2q)+F₃(-3p,-3q)).

As described above, in step S166 the scaling synthesis unit 124 obtains combined correlation information E(p,q) by combining the correlation information E₁(u₁,v₁) at time t+t₀, E₂(u₂,v₂) at time t+2t₀, F₁(r₁,s₁) at time t-t₀, F₂(r₂,s₂) at time t-2t₀, and F₃(r₃,s₃) at time t-3t₀, that is, a total of five pieces of correlation information for the two frames temporally after and the three frames temporally before time t of the frame of interest. It then normalizes the combined correlation information E(p,q) by dividing it by 5, the number of pieces of correlation information combined, and adds a predetermined offset value L₃₂ to the normalized result to obtain the final combined correlation information E(p,q) (={E₁(p,q)+E₂(2p,2q)+F₁(-p,-q)+F₂(-2p,-2q)+F₃(-3p,-3q)}/5+L₃₂). The scaling synthesis unit 124 supplies the final combined correlation information E(p,q) to the minimum value detection unit 125, and the process proceeds from step S166 to S167.

In step S167, the minimum value detection unit 125 receives the combined correlation information E(p,q) supplied from the scaling synthesis unit 124 in step S166, which combines the correlation information for each of the two frames temporally after and the three frames temporally before time t of the frame of interest. It detects the spatial position (p,q)=(p₃₂,q₃₂) at which the correlation represented by E(p,q) is maximized, that is, the position (p₃₂,q₃₂) that minimizes the "value" of the combined correlation information E(p,q). It stores this maximum correlation position (p₃₂,q₃₂) together with the "value" of the combined correlation information at that position, E(p₃₂,q₃₂), that is, the minimum value E₃₂, and the process proceeds to step S168.

In step S168, the scaling synthesis unit 124 takes four pieces of correlation information, E₁(u₁,v₁), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃), out of the six pieces of correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃) supplied from the correlation calculation unit 103, scales them with reference to the scale of the position (u₁,v₁) of the correlation information E₁(u₁,v₁), and combines the scaled correlation information to obtain combined correlation information E(p,q). That is, the scaling synthesis unit 124 performs the scaling by setting the position (p,q)=(u₁,v₁)=(-r₁,-s₁)=(-r₂/2,-s₂/2)=(-r₃/3,-s₃/3), and then adds the scaled correlation information E₁(p,q), F₁(-p,-q), F₂(-2p,-2q), and F₃(-3p,-3q) to obtain the combined correlation information E(p,q) (=E₁(p,q)+F₁(-p,-q)+F₂(-2p,-2q)+F₃(-3p,-3q)).

As described above, in step S168 the scaling synthesis unit 124 obtains combined correlation information E(p,q) by combining the correlation information E₁(u₁,v₁) at time t+t₀, F₁(r₁,s₁) at time t-t₀, F₂(r₂,s₂) at time t-2t₀, and F₃(r₃,s₃) at time t-3t₀, that is, a total of four pieces of correlation information for the one frame temporally after and the three frames temporally before time t of the frame of interest. It then normalizes the combined correlation information E(p,q) by dividing it by 4, the number of pieces of correlation information combined, and adds a predetermined offset value L₃₁ to the normalized result to obtain the final combined correlation information E(p,q) (={E₁(p,q)+F₁(-p,-q)+F₂(-2p,-2q)+F₃(-3p,-3q)}/4+L₃₁). The scaling synthesis unit 124 supplies the final combined correlation information E(p,q) to the minimum value detection unit 125, and the process proceeds from step S168 to S169.

In step S169, the minimum value detection unit 125 receives the combined correlation information E(p,q) supplied from the scaling synthesis unit 124 in step S168, which combines the correlation information for each of the one frame temporally after and the three frames temporally before time t of the frame of interest. It detects the spatial position (p,q)=(p₃₁,q₃₁) at which the correlation represented by E(p,q) is maximized, that is, the position (p₃₁,q₃₁) that minimizes the "value" of the combined correlation information E(p,q). It stores this maximum correlation position (p₃₁,q₃₁) together with the "value" of the combined correlation information at that position, E(p₃₁,q₃₁), that is, the minimum value E₃₁, and the process proceeds to step S170.

In step S170, the scaling synthesis unit 124 takes three pieces of correlation information, F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃), out of the six pieces of correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃) supplied from the correlation calculation unit 103, scales them with reference to the scale of the position (u₁,v₁) of the correlation information E₁(u₁,v₁), and combines the scaled correlation information to obtain combined correlation information E(p,q). That is, the scaling synthesis unit 124 performs the scaling by setting the position (p,q)=(-r₁,-s₁)=(-r₂/2,-s₂/2)=(-r₃/3,-s₃/3), and then adds the scaled correlation information F₁(-p,-q), F₂(-2p,-2q), and F₃(-3p,-3q) to obtain the combined correlation information E(p,q) (=F₁(-p,-q)+F₂(-2p,-2q)+F₃(-3p,-3q)).

As described above, in step S170 the scaling synthesis unit 124 obtains combined correlation information E(p,q) by combining the correlation information F₁(r₁,s₁) at time t-t₀, F₂(r₂,s₂) at time t-2t₀, and F₃(r₃,s₃) at time t-3t₀, that is, a total of three pieces of correlation information for the three frames temporally before time t of the frame of interest. It then normalizes the combined correlation information E(p,q) by dividing it by 3, the number of pieces of correlation information combined, and adds a predetermined offset value L₃₀ to the normalized result to obtain the final combined correlation information E(p,q) (={F₁(-p,-q)+F₂(-2p,-2q)+F₃(-3p,-3q)}/3+L₃₀). The scaling synthesis unit 124 supplies the final combined correlation information E(p,q) to the minimum value detection unit 125, and the process proceeds from step S170 to S171.

In step S171, the minimum value detection unit 125 receives the combined correlation information E(p,q) supplied from the scaling synthesis unit 124 in step S170, which combines the correlation information for each of the three frames temporally before time t of the frame of interest. It detects the spatial position (p,q)=(p₃₀,q₃₀) at which the correlation represented by E(p,q) is maximized, that is, the position (p₃₀,q₃₀) that minimizes the "value" of the combined correlation information E(p,q). It stores this maximum correlation position (p₃₀,q₃₀) together with the "value" of the combined correlation information at that position, E(p₃₀,q₃₀), that is, the minimum value E₃₀, and the process proceeds to step S172 in FIG. 77.
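
The combinations computed in steps S164 through S171 share one pattern, which can be restated compactly for the pairs (h,k) = (0,3), (3,2), (3,1), and (3,0) handled in these steps, where k is the number of frames used after time t and h the number of frames used before it. The superscript notation for the combined function is introduced here only for illustration and does not appear in the source; the subscripted minimum value, position, and offset follow the text.

```latex
E^{(h,k)}(p,q) \;=\; \frac{1}{k+h}\left[\,\sum_{i=1}^{k} E_i(ip,\,iq) \;+\; \sum_{j=1}^{h} F_j(-jp,\,-jq)\right] + L_{hk},
\qquad k+h \ge 1,
\\[4pt]
(p_{hk},\,q_{hk}) \;=\; \arg\min_{(p,q)} E^{(h,k)}(p,q),
\qquad
E_{hk} \;=\; E^{(h,k)}(p_{hk},\,q_{hk}).
```

For example, setting h=0 and k=3 gives {E₁(p,q)+E₂(2p,2q)+E₃(3p,3q)}/3+L₀₃, the final combined correlation information of step S164.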

In step S172, the minimum value detection unit 125 obtains the maximum combined correlation information, that is, the combined correlation information with the highest correlation among the plural pieces of combined correlation information from the scaling synthesis unit 124. Specifically, the minimum value detection unit 125 considers the following minimum values of the combined correlation information E(p,q): the minimum value E₃₃ (step S159), for the six pieces of correlation information from the three frames temporally after and the three frames temporally before time t of the frame of interest; the minimum value E₂₃ (step S161), for the five pieces from the three frames after and the two frames before; the minimum value E₁₃ (step S163), for the four pieces from the three frames after and the one frame before; the minimum value E₀₃ (step S165), for the three pieces from the three frames after; the minimum value E₃₂ (step S167), for the five pieces from the two frames after and the three frames before; the minimum value E₃₁ (step S169), for the four pieces from the one frame after and the three frames before; and the minimum value E₃₀ (step S171), for the three pieces from the three frames before. It obtains, as the maximum combined correlation information, the combined correlation information that gives the smallest of these minimum values.

Here, suppose that the combined correlation information that became the maximum combined correlation information was obtained by combining a total of k+h pieces of correlation information, from the k frames temporally after and the h frames temporally before time t of the frame of interest (here, k,h=0,1,2,3). Then the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀ can be written as E_hk.

In step S172, the minimum value detection unit 125 further detects the spatial position that minimizes the "value" of the maximum combined correlation information, that is, the spatial position (p_hk,q_hk) that gives the minimum value E_hk of the maximum combined correlation information. It does so by selecting from among the positions (p₃₃,q₃₃), (p₂₃,q₂₃), (p₁₃,q₁₃), (p₀₃,q₀₃), (p₃₂,q₃₂), (p₃₁,q₃₁), and (p₃₀,q₃₀) stored together with the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, respectively, and sets the selected position as the maximum correlation position (p₀,q₀).

The minimum value detection unit 125 then takes the maximum correlation position (p₀,q₀) as the average motion vector (p₀,q₀) and obtains a three-dimensional motion vector (p₀,q₀,t₀) by adding the frame period t₀ of the original moving image data to it as the component in the time direction t. Treating the direction orthogonal to the direction of the three-dimensional motion vector (p₀,q₀,t₀) as the principal component direction, it supplies the three-dimensional motion vector (p₀,q₀,t₀) to the filter information supply unit 32 (FIG. 25), together with the minimum value E_hk of the maximum combined correlation information. Furthermore, the minimum value detection unit 125 supplies (h,k), the suffix of the minimum value E_hk, to the filter information supply unit 32 as the kernel size, which represents the range of frames used in calculating the correlation information that was combined to obtain the maximum combined correlation information. The process then proceeds from step S172 to S173.
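
The selection performed in step S172 can be sketched as follows. This is an illustrative sketch only: the function name `select_maximum_correlation`, the dictionary layout, the frame period, and the numeric values are assumptions, chosen to mimic a block that can be tracked through three later frames but only one earlier frame.

```python
def select_maximum_correlation(candidates, t0):
    """Pick the candidate with the smallest stored minimum value E_hk.

    candidates maps (h, k) -> (E_hk, (p_hk, q_hk)), i.e. the values and
    positions stored in steps S159 to S171. Returns the three-dimensional
    motion vector (p0, q0, t0), the minimum value E_hk, and the kernel
    size (h, k).
    """
    (h, k), (e_min, (p0, q0)) = min(
        candidates.items(), key=lambda item: item[1][0]
    )
    return (p0, q0, t0), e_min, (h, k)

# Hypothetical stored results; keys are (h, k) = (frames before, frames after).
candidates = {
    (3, 3): (5.1, (1, 0)),
    (2, 3): (3.2, (2, 1)),
    (1, 3): (0.3, (2, 0)),
    (0, 3): (0.4, (2, 0)),
    (3, 2): (6.0, (0, 0)),
    (3, 1): (7.5, (0, 1)),
    (3, 0): (9.9, (-1, 0)),
}

T0 = 1 / 240  # assumed frame period in seconds
motion_vector, e_hk, kernel_size = select_maximum_correlation(candidates, T0)
```

In this made-up case the (h,k) = (1,3) candidate wins, so the kernel size reports that one earlier frame and three later frames supported the estimate.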

Here, the offset values L₃₃ (step S158), L₂₃ (step S160), L₁₃ (step S162), L₀₃ (step S164), L₃₂ (step S166), L₃₁ (step S168), and L₃₀ (step S170) described above are small constants that satisfy L₃₃≤L₃₂≤L₃₁≤L₃₀ and L₃₃≤L₂₃≤L₁₃≤L₀₃.

In step S173, the filter information supply unit 32 (FIG. 25) determines whether the minimum value E_hk of the maximum combined correlation information from the principal component direction acquisition unit 31 in FIG. 72 (specifically, from its minimum value detection unit 125) is less than or equal to (or less than) a predetermined threshold ε. The threshold ε used in step S173 is a value larger than the offset values L₃₃, L₂₃, L₁₃, L₀₃, L₃₂, L₃₁, and L₃₀, and can be determined by, for example, simulation. The threshold ε can also be set in response to a user operation.

If it is determined in step S173 that the minimum value E_hk of the maximum combined correlation information is less than or equal to the predetermined threshold ε, that is, if the minimum value E_hk is reasonably small and a region corresponding to the block of interest therefore exists in a frame before or after time t, the process proceeds to step S174. In step S174, the filter information supply unit 32 determines, as the pass band of the filter, the region in the frequency domain that extends from the origin (0,0) in the principal component direction supplied from the principal component direction acquisition unit 31, has a width of 2π/(4×t₀) in the T direction, and lies within the range of −(π/r₀) to +(π/r₀) in the X and Y directions and −(π/t₀) to +(π/t₀) in the T direction, that is, the region R₁₀₀₁ in FIG. 20, the region R₁₁₀₁ in FIG. 21, the region R₁₂₀₁ in FIG. 22, or the region R₁₃₀₁ in FIG. 23. It also determines (h,k) from the principal component direction acquisition unit 31 in FIG. 72 (from its minimum value detection unit 125) as the kernel size without change, and the process proceeds to step S176.

On the other hand, if it is determined in step S173 that the minimum value E_hk of the maximum combined correlation information is not less than or equal to the predetermined threshold ε, that is, if the minimum value E_hk is large and a region corresponding to the block of interest therefore exists in neither the frames before nor the frames after time t, the process proceeds to step S175. In step S175, as in step S174, the filter information supply unit 32 determines, as the pass band of the filter, the region in the frequency domain that extends from the origin (0,0) in the principal component direction supplied from the principal component direction acquisition unit 31, has a width of 2π/(4×t₀) in the T direction, and lies within the range of −(π/r₀) to +(π/r₀) in the X and Y directions and −(π/t₀) to +(π/t₀) in the T direction. Furthermore, in step S175, the filter information supply unit 32 determines (h,k)=(0,0) as the kernel size, in place of (h,k) from the principal component direction acquisition unit 31 in FIG. 72 (from its minimum value detection unit 125), and the process proceeds to step S176.

In step S176, the filter information supply unit 32 supplies, for example, the three-dimensional motion vector (p₀,q₀,t₀) to the filter unit 22 (FIG. 25) as filter information representing the pass band of the filter determined in step S174 or S175, supplies the kernel size (h,k) to the filter unit 22 as well, and ends the processing.
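
The branch in steps S173 through S176 can be sketched as follows. This is a minimal sketch under stated assumptions: the function name `decide_filter_information` and the numeric values are hypothetical, and the pass band is represented only by the three-dimensional motion vector, as the text says it can be.

```python
def decide_filter_information(motion_vector, e_hk, kernel_size, epsilon):
    """Return the filter information (motion vector, kernel size).

    Step S173 compares the minimum value E_hk with the threshold epsilon.
    Step S174 keeps the supplied kernel size (h, k); step S175 replaces it
    with (0, 0) because no corresponding region was found in other frames.
    """
    if e_hk <= epsilon:
        decided_kernel = kernel_size   # step S174
    else:
        decided_kernel = (0, 0)        # step S175
    # Step S176: both branches pass the same motion vector on to the filter.
    return motion_vector, decided_kernel

EPSILON = 1.0  # assumed threshold, larger than every offset value

tracked = decide_filter_information((2, 0, 1.0), 0.3, (1, 3), EPSILON)
untracked = decide_filter_information((2, 0, 1.0), 5.0, (1, 3), EPSILON)
```

Both branches determine the same pass band. Only the kernel size differs, and it collapses to (0,0) when the match is too poor.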

The processing according to the flowcharts shown in FIGS. 73 to 77 yields, for the block of interest, the following three-dimensional motion vector (p₀,q₀,t₀) and kernel size (h,k) as filter information.

For example, when block B₃ is the block of interest in the moving image data shown in FIG. 70, regions B₁₀₁, B₂₀₁, B₃₀₁, B_-101, B_-201, and B_-301 corresponding to the block of interest B₃ exist, as described above, in the image data at times t+t₀, t+2t₀, t+3t₀, t-t₀, t-2t₀, and t-3t₀, respectively. The values of the correlation information E₁(u₁,v₁) at time t+t₀, E₂(u₂,v₂) at time t+2t₀, E₃(u₃,v₃) at time t+3t₀, F₁(r₁,s₁) at time t-t₀, F₂(r₂,s₂) at time t-2t₀, and F₃(r₃,s₃) at time t-3t₀ are all minimal, and approximately 0, when the positions (u₂,v₂), (u₃,v₃), (r₁,s₁), (r₂,s₂), and (r₃,s₃) are (0,0).

In this case, the position (p₃₃,q₃₃) that gives the minimum value E₃₃ of the combined correlation function E(p,q), obtained in step S159 of FIG. 74, is (0,0), and the minimum value E₃₃ is approximately equal to the offset value L₃₃. Similarly, the position (p₂₃,q₂₃) that gives the minimum value E₂₃ of the combined correlation function E(p,q), obtained in step S161 of FIG. 75, is (0,0), and the minimum value E₂₃ is approximately equal to the offset value L₂₃. The position (p₁₃,q₁₃) that gives the minimum value E₁₃, obtained in step S163 of FIG. 75, is (0,0), and the minimum value E₁₃ is approximately equal to the offset value L₁₃. The position (p₀₃,q₀₃) that gives the minimum value E₀₃, obtained in step S165 of FIG. 75, is (0,0), and the minimum value E₀₃ is approximately equal to the offset value L₀₃. The position (p₃₂,q₃₂) that gives the minimum value E₃₂, obtained in step S167 of FIG. 76, is (0,0), and the minimum value E₃₂ is approximately equal to the offset value L₃₂. The position (p₃₁,q₃₁) that gives the minimum value E₃₁, obtained in step S169 of FIG. 76, is (0,0), and the minimum value E₃₁ is approximately equal to the offset value L₃₁. Finally, the position (p₃₀,q₃₀) that gives the minimum value E₃₀, obtained in step S171 of FIG. 76, is (0,0), and the minimum value E₃₀ is approximately equal to the offset value L₃₀.

As described above, the offset values L₃₃, L₂₃, L₁₃, L₀₃, L₃₂, L₃₁, and L₃₀ satisfy L₃₃≤L₃₂≤L₃₁≤L₃₀ and L₃₃≤L₂₃≤L₁₃≤L₀₃. For the block of interest B₃, therefore, in step S172 of FIG. 77 the minimum value E₃₃, which is approximately the offset value L₃₃, is basically detected as the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, and (0,0) is obtained as the average motion vector (p₀,q₀). Furthermore, in step S172, (0,0,t₀) is obtained as the three-dimensional motion vector (p₀,q₀,t₀), and (3,3) is obtained as the kernel size (h,k).

Furthermore, as described above, the offset value L₃₃ is smaller than (or equal to) the threshold ε in step S173. Therefore, for the block of interest B₃, for which the minimum value E₃₃ (approximately the offset value L₃₃) is detected as the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, (0,0,t₀) is obtained in step S174 of FIG. 77 as the three-dimensional motion vector (p₀,q₀,t₀) serving as the filter information, and (3,3) is obtained as the kernel size (h,k).

This indicates that, from time t-ht₀ to t+kt₀, that is, from time t-3t₀ to t+3t₀, the block of interest B₃ is moving at a velocity of (0,0,t₀) (a velocity at which it moves by (0,0) in the spatial direction during the time t₀).

Accordingly, the kernel size (h,k) can be said to represent the effective range of the three-dimensional motion vector (p₀,q₀,t₀) and, by extension, of the principal component direction of the block of interest obtained from that three-dimensional motion vector (p₀,q₀,t₀).

Next, for example, when block B₁₀ is the block of interest in the moving image data shown in FIG. 70, regions B₁₀₂, B₂₀₂, B₃₀₂, B_-102, B_-202, and B_-302 corresponding to the block of interest B₁₀ exist, as described above, in the image data at times t+t₀, t+2t₀, t+3t₀, t-t₀, t-2t₀, and t-3t₀, respectively. The value of the correlation information E₁(u₁,v₁) at time t+t₀ is minimal when the position (u₁,v₁) is (g,0); the value of E₂(u₂,v₂) at time t+2t₀ is minimal when the position (u₂,v₂) is (2g,0); the value of E₃(u₃,v₃) at time t+3t₀ is minimal when the position (u₃,v₃) is (3g,0); the value of F₁(r₁,s₁) at time t-t₀ is minimal when the position (r₁,s₁) is (-g,0); the value of F₂(r₂,s₂) at time t-2t₀ is minimal when the position (r₂,s₂) is (-2g,0); and the value of F₃(r₃,s₃) at time t-3t₀ is minimal when the position (r₃,s₃) is (-3g,0). Each of these minimum values is approximately 0.

In this case, the position (p₃₃,q₃₃) that gives the minimum value E₃₃ of the combined correlation function E(p,q), obtained in step S159 of FIG. 74, is (g,0), and the minimum value E₃₃ is almost equal to the offset value L₃₃. Similarly, the positions (p₂₃,q₂₃), (p₁₃,q₁₃), and (p₀₃,q₀₃) that give the minimum values E₂₃, E₁₃, and E₀₃ of the combined correlation function E(p,q), obtained in steps S161, S163, and S165 of FIG. 75, are each (g,0), and the minimum values E₂₃, E₁₃, and E₀₃ are almost equal to the offset values L₂₃, L₁₃, and L₀₃, respectively. Likewise, the positions (p₃₂,q₃₂), (p₃₁,q₃₁), and (p₃₀,q₃₀) that give the minimum values E₃₂, E₃₁, and E₃₀ of the combined correlation function E(p,q), obtained in steps S167, S169, and S171 of FIG. 76, are each (g,0), and the minimum values E₃₂, E₃₁, and E₃₀ are almost equal to the offset values L₃₂, L₃₁, and L₃₀, respectively.
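To illustrate why every minimum falls at (g,0) for a block in uniform motion, the following Python sketch evaluates one combined correlation function. It assumes, because this passage does not restate the definition, that E(p,q) is the sum of the per-frame correlation values sampled at (ip,iq) for the future frames and at (-jp,-jq) for the past frames, plus the offset value. The function names, the toy correlation surfaces, the value of g, and the search range are all hypothetical.

```python
# Hypothetical sketch. Assumption: E(p, q) = sum_i E_i(i*p, i*q)
#                                          + sum_j F_j(-j*p, -j*q) + offset.

def combined_correlation(p, q, future, past, offset):
    """future[i-1] is E_i(u, v); past[j-1] is F_j(r, s)."""
    total = offset
    for i, e_i in enumerate(future, start=1):
        total += e_i(i * p, i * q)      # frame at time t + i*t0
    for j, f_j in enumerate(past, start=1):
        total += f_j(-j * p, -j * q)    # frame at time t - j*t0
    return total


def minimize(future, past, offset, search=range(-4, 5)):
    """Exhaustive search for the (p, q) that minimizes E(p, q)."""
    best_value, best_position = None, None
    for p in search:
        for q in search:
            value = combined_correlation(p, q, future, past, offset)
            if best_value is None or value < best_value:
                best_value, best_position = value, (p, q)
    return best_value, best_position


g = 2  # toy displacement per frame interval t0


def make_future(i):
    # Toy E_i: zero at (i*g, 0), growing with distance from it.
    return lambda u, v: abs(u - i * g) + abs(v)


def make_past(j):
    # Toy F_j: zero at (-j*g, 0), growing with distance from it.
    return lambda r, s: abs(r + j * g) + abs(s)


future = [make_future(i) for i in (1, 2, 3)]
past = [make_past(j) for j in (1, 2, 3)]
L33 = 0.5  # made-up offset value

min_value, position = minimize(future, past, L33)
print(min_value, position)
```

With these toy surfaces every per-frame term vanishes at (p,q) = (g,0), so the minimum equals the offset, mirroring the statement that E₃₃ is almost equal to L₃₃.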

As described above, the offset values L₃₃, L₂₃, L₁₃, L₀₃, L₃₂, L₃₁, and L₃₀ satisfy L₃₃ ≤ L₃₂ ≤ L₃₁ ≤ L₃₀ and L₃₃ ≤ L₂₃ ≤ L₁₃ ≤ L₀₃. For the target block B₁₀, therefore, in step S172 of FIG. 77, the minimum value E₃₃, which is almost equal to the offset value L₃₃, is basically detected as the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, and (g,0) is obtained as the average motion vector (p₀,q₀). Furthermore, in step S172, (g,0,t₀) is obtained as the three-dimensional motion vector (p₀,q₀,t₀), and (3,3) is obtained as the kernel size (h,k).

Furthermore, as described above, the offset value L₃₃ is smaller than the threshold ε used in step S173. Therefore, for the target block B₁₀, for which the minimum value E₃₃, almost equal to the offset value L₃₃, is detected as the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, (g,0,t₀) is obtained as the three-dimensional motion vector (p₀,q₀,t₀) serving as filter information, and (3,3) is obtained as the kernel size (h,k), in step S174 of FIG. 77.

This indicates that from time t-ht₀ to t+kt₀, that is, from time t-3t₀ to t+3t₀, the target block B₁₀ is moving at a velocity of (g,0,t₀), i.e., a velocity at which it moves by (g,0) in the spatial direction during time t₀.

Next, for example, when block B₇ is the target block in the moving image data shown in FIG. 71, then, as described above, regions B₁₁₁, B₂₁₁, B₃₁₁, B₋₁₁₁, and B₋₂₁₁ corresponding to the target block B₇ exist in the image data at times t+t₀, t+2t₀, t+3t₀, t-t₀, and t-2t₀, respectively. The value of the correlation information E₁(u₁,v₁) at time t+t₀ is minimized when the position (u₁,v₁) is (0,0); that of E₂(u₂,v₂) at time t+2t₀ when (u₂,v₂) is (0,0); that of E₃(u₃,v₃) at time t+3t₀ when (u₃,v₃) is (0,0); that of F₁(r₁,s₁) at time t-t₀ when (r₁,s₁) is (0,0); and that of F₂(r₂,s₂) at time t-2t₀ when (r₂,s₂) is (0,0). Each of these minimum values is almost 0.

However, in the image data at time t-3t₀, the region corresponding to the target block B₇ should be the region B₋₃₁₁ at the same position as the target block B₇, but that region is hidden behind the moving subject P₆₉₀₂ and does not exist. Ideally, the value of the correlation information F₃(r₃,s₃) at time t-3t₀ would be minimized when the position (r₃,s₃) is (0,0), but this is not necessarily the case. That is, the value of the correlation information F₃(r₃,s₃) is minimized at some position (r₃,s₃) that is not necessarily (0,0), and that minimum value is fairly large.

In this case, for the combined correlation function E(p,q) obtained in step S158 of FIG. 74 using, among others, the correlation information F₃(r₃,s₃) at time t-3t₀, the position (p₃₃,q₃₃) that gives its minimum value E₃₃ is some position that is not necessarily (0,0), and the minimum value E₃₃ is considerably larger than the offset value L₃₃. Similarly, for the combined correlation functions E(p,q) obtained in steps S166, S168, and S170 of FIG. 76, each of which also uses the correlation information F₃(r₃,s₃) at time t-3t₀, the positions (p₃₂,q₃₂), (p₃₁,q₃₁), and (p₃₀,q₃₀) that give the minimum values E₃₂, E₃₁, and E₃₀ are positions that are not necessarily (0,0), and the minimum values E₃₂, E₃₁, and E₃₀ are considerably larger than the offset values L₃₂, L₃₁, and L₃₀, respectively.

On the other hand, for the combined correlation function E(p,q) obtained in step S160 of FIG. 75 without using the correlation information F₃(r₃,s₃) at time t-3t₀, the position (p₂₃,q₂₃) that gives its minimum value E₂₃ is (0,0), and the minimum value E₂₃ is almost equal to the offset value L₂₃. Similarly, for the combined correlation functions E(p,q) obtained in steps S162 and S164 of FIG. 75, likewise without using the correlation information F₃(r₃,s₃) at time t-3t₀, the positions (p₁₃,q₁₃) and (p₀₃,q₀₃) that give the minimum values E₁₃ and E₀₃ are (0,0), and the minimum values E₁₃ and E₀₃ are almost equal to the offset values L₁₃ and L₀₃, respectively.

As described above, the offset values L₂₃, L₁₃, and L₀₃ satisfy L₂₃ ≤ L₁₃ ≤ L₀₃. For the target block B₇, therefore, in step S172 of FIG. 77, the minimum value E₂₃, which is almost equal to the offset value L₂₃, is basically detected as the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, and (0,0) is obtained as the average motion vector (p₀,q₀). Furthermore, in step S172, (0,0,t₀) is obtained as the three-dimensional motion vector (p₀,q₀,t₀), and (2,3) is obtained as the kernel size (h,k).

Furthermore, as described above, the offset value L₂₃ is smaller than the threshold ε used in step S173. Therefore, for the target block B₇, for which the minimum value E₂₃, almost equal to the offset value L₂₃, is detected as the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, (0,0,t₀) is obtained as the three-dimensional motion vector (p₀,q₀,t₀) serving as filter information, and (2,3) is obtained as the kernel size (h,k), in step S174 of FIG. 77.

This indicates that from time t-ht₀ to t+kt₀, that is, from time t-2t₀ to t+3t₀, the target block B₇ is moving at a velocity of (0,0,t₀).
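The relationship between a kernel size (h,k) and the time span over which the motion vector is valid can be written down directly. The short Python sketch below does only that; the function names are hypothetical and times are expressed in arbitrary units.

```python
# Hypothetical helpers: map a kernel size (h, k) to the frames it covers.

def effective_frame_offsets(h, k):
    """Offsets i such that time t + i*t0 lies in [t - h*t0, t + k*t0]."""
    if h < 0 or k < 0:
        raise ValueError("h and k must be non-negative")
    return list(range(-h, k + 1))


def effective_time_span(t, t0, h, k):
    """Start and end times of the span in which the motion vector holds."""
    return (t - h * t0, t + k * t0)


print(effective_frame_offsets(3, 3))    # full range, as for a block in uniform motion
print(effective_frame_offsets(2, 3))    # frame at t - 3*t0 excluded
print(effective_time_span(10, 1, 2, 3))
```

For the kernel size (2,3) the frame at time t-3t₀ drops out of the range, which matches the case above in which the corresponding region is hidden at that time.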

Next, for example, when block B₁₄ is the target block in the moving image data shown in FIG. 71, then, as described above, regions B₁₁₂, B₋₁₁₂, B₋₂₁₂, and B₋₃₁₂ corresponding to the target block B₁₄ exist in the image data at times t+t₀, t-t₀, t-2t₀, and t-3t₀, respectively. The value of the correlation information E₁(u₁,v₁) at time t+t₀ is minimized when the position (u₁,v₁) is (0,0); that of F₁(r₁,s₁) at time t-t₀ when (r₁,s₁) is (0,0); that of F₂(r₂,s₂) at time t-2t₀ when (r₂,s₂) is (0,0); and that of F₃(r₃,s₃) at time t-3t₀ when (r₃,s₃) is (0,0). Each of these minimum values is almost 0.

However, in the image data at time t+2t₀, the region corresponding to the target block B₁₄ should be the region B₂₁₂ at the same position as the target block B₁₄, but that region is hidden behind the moving subject P₆₉₀₂ and does not exist. Ideally, the value of the correlation information E₂(u₂,v₂) at time t+2t₀ would be minimized when the position (u₂,v₂) is (0,0), but this is not necessarily the case. That is, the value of the correlation information E₂(u₂,v₂) is minimized at some position (u₂,v₂) that is not necessarily (0,0), and that minimum value is fairly large.

Furthermore, in the image data at time t+3t₀, the region corresponding to the target block B₁₄ should be the region B₃₁₂ at the same position as the target block B₁₄, but that region is hidden behind the moving subject P₆₉₀₂ and does not exist. Ideally, the value of the correlation information E₃(u₃,v₃) at time t+3t₀ would be minimized when the position (u₃,v₃) is (0,0), but this is not necessarily the case. That is, the value of the correlation information E₃(u₃,v₃) is minimized at some position (u₃,v₃) that is not necessarily (0,0), and that minimum value is fairly large.

In this case, for the combined correlation function E(p,q) obtained in step S158 of FIG. 74 using, among others, the correlation information E₂(u₂,v₂) at time t+2t₀ and the correlation information E₃(u₃,v₃) at time t+3t₀, the position (p₃₃,q₃₃) that gives its minimum value E₃₃ is some position that is not necessarily (0,0), and the minimum value E₃₃ is considerably larger than the offset value L₃₃. Similarly, for the combined correlation functions E(p,q) obtained in steps S160, S162, and S164 of FIG. 75, each of which also uses the correlation information E₂(u₂,v₂) at time t+2t₀ and the correlation information E₃(u₃,v₃) at time t+3t₀, the positions (p₂₃,q₂₃), (p₁₃,q₁₃), and (p₀₃,q₀₃) that give the minimum values E₂₃, E₁₃, and E₀₃ are positions that are not necessarily (0,0), and the minimum values E₂₃, E₁₃, and E₀₃ are considerably larger than the offset values L₂₃, L₁₃, and L₀₃, respectively.

In addition, for the combined correlation function E(p,q) obtained in step S166 of FIG. 76, which does not use the correlation information E₃(u₃,v₃) at time t+3t₀ but does use the correlation information E₂(u₂,v₂) at time t+2t₀, the position (p₃₂,q₃₂) that gives its minimum value E₃₂ is likewise some position that is not necessarily (0,0), and the minimum value E₃₂ is considerably larger than the offset value L₃₂.

On the other hand, for the combined correlation function E(p,q) obtained in step S168 of FIG. 76 using neither the correlation information E₂(u₂,v₂) at time t+2t₀ nor the correlation information E₃(u₃,v₃) at time t+3t₀, the position (p₃₁,q₃₁) that gives its minimum value E₃₁ is (0,0), and the minimum value E₃₁ is almost equal to the offset value L₃₁. Similarly, for the combined correlation function E(p,q) obtained in step S170 of FIG. 76, again using neither E₂(u₂,v₂) nor E₃(u₃,v₃), the position (p₃₀,q₃₀) that gives its minimum value E₃₀ is (0,0), and the minimum value E₃₀ is almost equal to the offset value L₃₀.

As described above, the offset values L₃₁ and L₃₀ satisfy L₃₁ ≤ L₃₀. For the target block B₁₄, therefore, in step S172 of FIG. 77, the minimum value E₃₁, which is almost equal to the offset value L₃₁, is basically detected as the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, and (0,0) is obtained as the average motion vector (p₀,q₀). Furthermore, in step S172, (0,0,t₀) is obtained as the three-dimensional motion vector (p₀,q₀,t₀), and (3,1) is obtained as the kernel size (h,k).

Furthermore, as described above, the offset value L₃₁ is smaller than the threshold ε used in step S173. Therefore, for the target block B₁₄, for which the minimum value E₃₁, almost equal to the offset value L₃₁, is detected as the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀, (0,0,t₀) is obtained as the three-dimensional motion vector (p₀,q₀,t₀) serving as filter information, and (3,1) is obtained as the kernel size (h,k), in step S174 of FIG. 77.

This indicates that from time t-ht₀ to t+kt₀, that is, from time t-3t₀ to t+t₀, the target block B₁₄ is moving at a velocity of (0,0,t₀).

Next, for example, when block B₉ is the target block in the moving image data shown in FIG. 71, then, as described above, in the target block B₉ the moving subject P₆₉₀₂ hides the stationary subject P₆₉₀₁ at a position where the stationary subject P₆₉₀₁ is present. Consequently, no region corresponding to the target block B₉ exists in any of the image data at times t+t₀, t+2t₀, t+3t₀, t-t₀, t-2t₀, and t-3t₀.

For this reason, the values of the correlation information E₁(u₁,v₁), E₂(u₂,v₂), E₃(u₃,v₃), F₁(r₁,s₁), F₂(r₂,s₂), and F₃(r₃,s₃) at times t+t₀, t+2t₀, t+3t₀, t-t₀, t-2t₀, and t-3t₀ are minimized at the positions (u₁,v₁), (u₂,v₂), (u₃,v₃), (r₁,s₁), (r₂,s₂), and (r₃,s₃) of the image data at the respective times that are most similar to the target block B₉, but each of these minimum values is fairly large.

In this case, for the combined correlation function E(p,q) obtained in step S158 of FIG. 74, the position (p₃₃,q₃₃) that gives its minimum value E₃₃ is some position, and the minimum value E₃₃ is considerably larger than the offset value L₃₃. Similarly, for the combined correlation functions E(p,q) obtained in steps S160, S162, and S164 of FIG. 75, the positions (p₂₃,q₂₃), (p₁₃,q₁₃), and (p₀₃,q₀₃) that give the minimum values E₂₃, E₁₃, and E₀₃ are some positions, and the minimum values E₂₃, E₁₃, and E₀₃ are considerably larger than the offset values L₂₃, L₁₃, and L₀₃, respectively. Likewise, for the combined correlation functions E(p,q) obtained in steps S166, S168, and S170 of FIG. 76, the positions (p₃₂,q₃₂), (p₃₁,q₃₁), and (p₃₀,q₃₀) that give the minimum values E₃₂, E₃₁, and E₃₀ are some positions, and the minimum values E₃₂, E₃₁, and E₃₀ are considerably larger than the offset values L₃₂, L₃₁, and L₃₀, respectively.

Accordingly, the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀ are all fairly large, and whichever of them is selected as the smallest in step S172 of FIG. 77, it is greater than (or equal to) the threshold ε used in step S173. For the target block B₉, therefore, in step S175 of FIG. 77, (p₀',q₀',t₀) is obtained as the three-dimensional motion vector (p₀,q₀,t₀) serving as filter information, and (0,0) is obtained as the kernel size (h,k).

This indicates that from time t-ht₀ to t+kt₀, that is, only at time t, the target block B₉ is moving at a velocity of (p₀',q₀',t₀). In other words, a kernel size (h,k) of (0,0) means that no region corresponding to the target block B₉ exists in the image data at any other time.

Note that (p₀',q₀') denotes the position (p,q) that gives the smallest of the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀. When the kernel size (h,k) is (0,0), (p,q) is dummy data and is never used.
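The selection in steps S172 to S175 can be summarized in a few lines of Python. This is only a sketch of the decision as described in the text: the function name, the dictionary layout, and all numeric values are hypothetical, and ties between equal minima are resolved arbitrarily (by dictionary order), which the text does not specify.

```python
# Hypothetical sketch of steps S172 to S175.
# minima maps a kernel size (h, k) to (minimum value E_hk, position (p, q)).

def select_filter_info(minima, epsilon):
    # Step S172: pick the smallest of the seven minimum values.
    (h, k), (value, position) = min(minima.items(), key=lambda item: item[1][0])
    # Step S173: compare with the threshold epsilon.
    if value < epsilon:
        return position, (h, k)     # step S174: vector and kernel size are valid
    return position, (0, 0)         # step S175: position is dummy data


EPSILON = 1.0  # made-up threshold

# Made-up minima resembling the case where the frame at t - 3*t0 is occluded.
occluded_past = {
    (3, 3): (9.0, (1, -1)), (2, 3): (0.2, (0, 0)), (1, 3): (0.3, (0, 0)),
    (0, 3): (0.4, (0, 0)), (3, 2): (9.1, (1, -1)), (3, 1): (9.2, (2, 0)),
    (3, 0): (9.3, (2, 0)),
}
# Made-up minima resembling the case where no corresponding region exists.
no_match = {hk: (8.0 + n, (1, 1)) for n, hk in enumerate(occluded_past)}

print(select_filter_info(occluded_past, EPSILON))
print(select_filter_info(no_match, EPSILON))
```

In the first case the kernel size (2,3) is returned with the position (0,0); in the second case every minimum is at or above the threshold, so the kernel size becomes (0,0) and the returned position carries no meaning.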

As described above, the processing according to the flowcharts shown in FIGS. 73 to 77 yields, for the target block, the three-dimensional motion vector (p₀,q₀,t₀) and the kernel size (h,k) as filter information. This gives, for the target block, the result that "from time t-ht₀ to t+kt₀, the target block is moving at a velocity of (p₀,q₀,t₀)".

Because the offset values L₃₃, L₂₃, L₁₃, L₀₃, L₃₂, L₃₁, and L₃₀ satisfy L₃₃ ≤ L₃₂ ≤ L₃₁ ≤ L₃₀ and L₃₃ ≤ L₂₃ ≤ L₁₃ ≤ L₀₃, a minimum value E_hk with larger h or k is basically more likely to be selected from among the minimum values E₃₃, E₂₃, E₁₃, E₀₃, E₃₂, E₃₁, and E₃₀ in step S172 of FIG. 77. As a result, even if the moving image data contains noise and there are slight differences in pixel values between the target block and the regions corresponding to the target block in other frames, it is possible to avoid underestimating the range (time) over which the target block is moving at a velocity of (p₀,q₀,t₀).
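A rough numerical illustration of this bias follows. It rests on an assumption that this passage does not confirm: that each E_hk is an unnormalized sum of per-frame correlation values plus L_hk, so that noise contributes roughly in proportion to the number of frames h+k used. The linear offset schedule, the noise figure, and all names are hypothetical.

```python
# Hypothetical illustration. Assumption: E_hk ~ noise * (h + k) + L_hk for a
# block in uniform motion whose frames differ only by noise.

KERNELS = [(3, 3), (2, 3), (1, 3), (0, 3), (3, 2), (3, 1), (3, 0)]


def pick_kernel(noise_per_frame, offset_step):
    """Return the kernel size with the smallest modeled E_hk."""
    def modeled_e(hk):
        h, k = hk
        frames_used = h + k
        # L_33 = 0 and the offset grows by offset_step per dropped frame, so
        # L_33 <= L_32 <= L_31 <= L_30 and L_33 <= L_23 <= L_13 <= L_03 hold.
        offset = offset_step * (6 - frames_used)
        return noise_per_frame * frames_used + offset
    return min(KERNELS, key=modeled_e)


without_offsets = pick_kernel(noise_per_frame=1.0, offset_step=0.0)
with_offsets = pick_kernel(noise_per_frame=1.0, offset_step=2.0)
print(without_offsets, with_offsets)
```

Under this model, with no offsets the selection falls on a kernel that uses only three frames, because fewer noisy terms give a smaller sum; once the offset step exceeds the per-frame noise, the full kernel (3,3) wins and the valid time range is no longer underestimated.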

Next, suppose that the filter generation unit 23, in which the principal component direction acquisition unit 31 of FIG. 25 is configured as shown in FIG. 72, performs the processing according to FIGS. 73 to 77, so that the three-dimensional motion vector (p₀,q₀,t₀) and the kernel size (h,k) are supplied as filter information from the filter generation unit 23 (specifically, its filter information supply unit 32) to the filter unit 22. In that case, the filter unit 22 performs the processing according to the flowchart shown in FIG. 78 as the processing of step S3 in FIG. 26.

That is, in step S191, the filter unit 22 acquires, by receiving them, the three-dimensional motion vector (p₀,q₀,t₀) and the kernel size (h,k) supplied as filter information from the filter information supply unit 32, and the processing proceeds to step S192.

In step S192, the filter unit 22 performs filtering in which the pixel value C(t,x,y) of the pixel at position (x,y) in the target block of the frame at time t is obtained by a calculation using, from among, for example, the pixel values D(t-3t₀,x-3p₀,y-3q₀), D(t-2t₀,x-2p₀,y-2q₀), D(t-t₀,x-p₀,y-q₀), D(t,x,y), D(t+t₀,x+p₀,y+q₀), D(t+2t₀,x+2p₀,y+2q₀), and D(t+3t₀,x+3p₀,y+3q₀) of the pixels at times t-3t₀ to t+3t₀, the pixel values in the range of times (frames) corresponding to the kernel size (h,k). Here, D(t,x,y) denotes the pixel value at position (x,y) at time t of the moving image data used for the filtering.

That is, in step S192, the filter unit 22 performs the filtering, namely the calculation of the pixel value C(t,x,y) of the pixel at position (x,y) in the target block, using only the pixel values of the frames within the effective range of the motion vector (p₀,q₀,t₀) indicated by the kernel size (h,k).
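The following Python sketch shows the idea of restricting the filter taps to the effective range. The actual filter weights are given by equations that are not reproduced here, so the sketch simply assumes a uniform average over the taps; that weighting, the function name, the frame-index convention (frame n+i stands for time t+it₀), and the toy video are all hypothetical.

```python
# Hypothetical sketch of step S192 with ASSUMED uniform weights.
# D(n, x, y) returns the pixel value of frame n at position (x, y).

def filter_pixel(D, n, x, y, p0, q0, h, k):
    """Average D along the motion trajectory over frames n-h .. n+k."""
    taps = [D(n + i, x + i * p0, y + i * q0) for i in range(-h, k + 1)]
    return sum(taps) / len(taps)


def toy_video(n, x, y):
    """Pattern moving by 2 pixels per frame in x; frame -3 is occluded."""
    if n == -3:
        return 255                   # made-up value of the occluding subject
    return 10 * (x - 2 * n) + y      # constant along the motion trajectory


restricted = filter_pixel(toy_video, 0, 5, 1, p0=2, q0=0, h=2, k=3)
full = filter_pixel(toy_video, 0, 5, 1, p0=2, q0=0, h=3, k=3)
print(restricted, full)
```

With the kernel size (2,3) the occluded frame is excluded and the filtered value equals the unoccluded pixel value; with (3,3) the occluding subject leaks into the result, which is what restricting the calculation to the effective range avoids.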

Specifically, in step S192, when the kernel size (h,k) is (3,3), the filter unit 22 calculates the pixel value C(t,x,y) of the target block using the pixel values D(t-3t₀,x-3p₀,y-3q₀), D(t-2t₀,x-2p₀,y-2q₀), D(t-t₀,x-p₀,y-q₀), D(t,x,y), D(t+t₀,x+p₀,y+q₀), D(t+2t₀,x+2p₀,y+2q₀), and D(t+3t₀,x+3p₀,y+3q₀) of the pixels at times t-3t₀ to t+3t₀, for example, according to equation (1). In this case, as in step S132 of FIG. 68, filtering is performed by a filter whose passband is a region that extends in the principal component direction of the target block and has a width of 2π/(4t₀) in the T direction.

... (1)

When the kernel size (h,k) is (2,3), the filter unit 22 does not use the pixel value D(t-3t₀,x-3p₀,y-3q₀) of the pixel at time t-3t₀, and calculates the pixel value C(t,x,y) of the target block using the pixel values D(t-2t₀,x-2p₀,y-2q₀), D(t-t₀,x-p₀,y-q₀), D(t,x,y), D(t+t₀,x+p₀,y+q₀), D(t+2t₀,x+2p₀,y+2q₀), and D(t+3t₀,x+3p₀,y+3q₀) of the pixels at times t-2t₀ to t+3t₀, for example, according to equation (2).

・・・（２）

... (2)

さらに、フィルタ部２２は、カーネルサイズ(h,k)が(1,3)である場合、時刻t-3t₀における画素の画素値D(t-3t₀,x-3p₀,y-3q₀)、および時刻t-2t₀における画素の画素値D(t-2t₀,x-2p₀,y-2q₀)は用いずに、時刻t-t₀乃至t+3t₀における画素の画素値D(t-t₀,x-p₀,y-q₀)，D(t,x,y)，D(t+t₀,x+p₀,y+q₀)，D(t+2t₀,x+2p₀,y+2q₀)，D(t+3t₀,x+3p₀,y+3q₀)を用い、例えば、式（３）にしたがって、注目ブロックの画素値C(t,x,y)の演算を行う。 Further, when the kernel size (h,k) is (1,3), the filter unit 22 does not use the pixel value D(t-3t₀,x-3p₀,y-3q₀) of the pixel at time t-3t₀ or the pixel value D(t-2t₀,x-2p₀,y-2q₀) of the pixel at time t-2t₀, but uses the pixel values D(t-t₀,x-p₀,y-q₀), D(t,x,y), D(t+t₀,x+p₀,y+q₀), D(t+2t₀,x+2p₀,y+2q₀), and D(t+3t₀,x+3p₀,y+3q₀) of the pixels at times t-t₀ to t+3t₀ to calculate the pixel value C(t,x,y) of the block of interest according to, for example, equation (3).

・・・（３）

... (3)

また、フィルタ部２２は、カーネルサイズ(h,k)が(0,3)である場合、時刻t-3t₀における画素の画素値D(t-3t₀,x-3p₀,y-3q₀)、時刻t-2t₀における画素の画素値D(t-2t₀,x-2p₀,y-2q₀)、および時刻t-t₀における画素の画素値D(t-t₀,x-p₀,y-q₀)は用いずに、時刻t乃至t+3t₀における画素の画素値D(t,x,y)，D(t+t₀,x+p₀,y+q₀)，D(t+2t₀,x+2p₀,y+2q₀)，D(t+3t₀,x+3p₀,y+3q₀)を用い、例えば、式（４）にしたがって、注目ブロックの画素値C(t,x,y)の演算を行う。 When the kernel size (h,k) is (0,3), the filter unit 22 does not use the pixel value D(t-3t₀,x-3p₀,y-3q₀) of the pixel at time t-3t₀, the pixel value D(t-2t₀,x-2p₀,y-2q₀) of the pixel at time t-2t₀, or the pixel value D(t-t₀,x-p₀,y-q₀) of the pixel at time t-t₀, but uses the pixel values D(t,x,y), D(t+t₀,x+p₀,y+q₀), D(t+2t₀,x+2p₀,y+2q₀), and D(t+3t₀,x+3p₀,y+3q₀) of the pixels at times t to t+3t₀ to calculate the pixel value C(t,x,y) of the block of interest according to, for example, equation (4).

・・・（４）

... (4)

同様に、フィルタ部２２は、カーネルサイズ(h,k)が(3,2)である場合、時刻t+3t₀における画素の画素値D(t+3t₀,x+3p₀,y+3q₀)は用いずに、時刻t-3t₀乃至t+2t₀における画素の画素値D(t-3t₀,x-3p₀,y-3q₀)，D(t-2t₀,x-2p₀,y-2q₀)，D(t-t₀,x-p₀,y-q₀)，D(t,x,y)，D(t+t₀,x+p₀,y+q₀)，D(t+2t₀,x+2p₀,y+2q₀)を用い、例えば、式（５）にしたがって、注目ブロックの画素値C(t,x,y)の演算を行う。 Similarly, when the kernel size (h,k) is (3,2), the filter unit 22 does not use the pixel value D(t+3t₀,x+3p₀,y+3q₀) of the pixel at time t+3t₀, but uses the pixel values D(t-3t₀,x-3p₀,y-3q₀), D(t-2t₀,x-2p₀,y-2q₀), D(t-t₀,x-p₀,y-q₀), D(t,x,y), D(t+t₀,x+p₀,y+q₀), and D(t+2t₀,x+2p₀,y+2q₀) of the pixels at times t-3t₀ to t+2t₀ to calculate the pixel value C(t,x,y) of the block of interest according to, for example, equation (5).

・・・（５）

... (5)

さらに、フィルタ部２２は、カーネルサイズ(h,k)が(3,1)である場合、時刻t+3t₀における画素の画素値D(t+3t₀,x+3p₀,y+3q₀)、および時刻t+2t₀における画素の画素値D(t+2t₀,x+2p₀,y+2q₀)は用いずに、時刻t-3t₀乃至t+t₀における画素の画素値D(t-3t₀,x-3p₀,y-3q₀)，D(t-2t₀,x-2p₀,y-2q₀)，D(t-t₀,x-p₀,y-q₀)，D(t,x,y)，D(t+t₀,x+p₀,y+q₀)を用い、例えば、式（６）にしたがって、注目ブロックの画素値C(t,x,y)の演算を行う。 Further, when the kernel size (h,k) is (3,1), the filter unit 22 does not use the pixel value D(t+3t₀,x+3p₀,y+3q₀) of the pixel at time t+3t₀ or the pixel value D(t+2t₀,x+2p₀,y+2q₀) of the pixel at time t+2t₀, but uses the pixel values D(t-3t₀,x-3p₀,y-3q₀), D(t-2t₀,x-2p₀,y-2q₀), D(t-t₀,x-p₀,y-q₀), D(t,x,y), and D(t+t₀,x+p₀,y+q₀) of the pixels at times t-3t₀ to t+t₀ to calculate the pixel value C(t,x,y) of the block of interest according to, for example, equation (6).

・・・（６）

... (6)

また、フィルタ部２２は、カーネルサイズ(h,k)が(3,0)である場合、時刻t+3t₀における画素の画素値D(t+3t₀,x+3p₀,y+3q₀)、時刻t+2t₀における画素の画素値D(t+2t₀,x+2p₀,y+2q₀)、および時刻t+t₀における画素の画素値D(t+t₀,x+p₀,y+q₀)は用いずに、時刻t-3t₀乃至tにおける画素の画素値D(t-3t₀,x-3p₀,y-3q₀)，D(t-2t₀,x-2p₀,y-2q₀)，D(t-t₀,x-p₀,y-q₀)，D(t,x,y)を用い、例えば、式（７）にしたがって、注目ブロックの画素値C(t,x,y)の演算を行う。 When the kernel size (h,k) is (3,0), the filter unit 22 does not use the pixel value D(t+3t₀,x+3p₀,y+3q₀) of the pixel at time t+3t₀, the pixel value D(t+2t₀,x+2p₀,y+2q₀) of the pixel at time t+2t₀, or the pixel value D(t+t₀,x+p₀,y+q₀) of the pixel at time t+t₀, but uses the pixel values D(t-3t₀,x-3p₀,y-3q₀), D(t-2t₀,x-2p₀,y-2q₀), D(t-t₀,x-p₀,y-q₀), and D(t,x,y) of the pixels at times t-3t₀ to t to calculate the pixel value C(t,x,y) of the block of interest according to, for example, equation (7).

・・・（７）

... (7)

一方、カーネルサイズ(h,k)が(0,0)である場合は、注目ブロックと同一の（画像の）領域（注目ブロックに対応する領域）が、他の時刻t-3t₀，t-2t₀，t-t₀，t+t₀，t+2t₀，t+3t₀の画像に存在しない場合であり、この場合、時間方向のフィルタリングは、画質の劣化をもたらすので、注目ブロックのフレーム（注目フレーム）、即ち、時刻tにおける画素の画素値D(t,x,y)のみを用い、例えば、式（８）にしたがって、注目ブロックの画素値C(t,x,y)の演算を行う。なお、式（８）は、フィルタリングを行わないことと等価である。 On the other hand, a kernel size (h,k) of (0,0) means that the same image region as the block of interest (the region corresponding to the block of interest) does not exist in the images at the other times t-3t₀, t-2t₀, t-t₀, t+t₀, t+2t₀, and t+3t₀. In this case, filtering in the time direction would degrade the image quality, so the pixel value C(t,x,y) of the block of interest is calculated according to, for example, equation (8), using only the pixel value D(t,x,y) of the pixel in the frame of the block of interest (the frame of interest), that is, at time t. Note that equation (8) is equivalent to performing no filtering.

・・・（８）

... (8)
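The kernel-size cases above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: equations (1) to (8) are not reproduced in this text, so the equal-weight average used here is an assumption, as are the function name `temporal_filter_pixel`, the array layout `D[frame, y, x]`, and the use of one frame index per period t₀. The bounds check is also an added safety guard, not something the source describes.

```python
import numpy as np

def temporal_filter_pixel(D, t, x, y, p0, q0, h, k):
    """Average D along the motion trajectory over frames t-h .. t+k.

    D is indexed as D[frame, y, x]; one frame index step stands for t0.
    (p0, q0) is the spatial displacement per frame step.
    Equal weights are an ASSUMPTION; the source's equations (1)-(8)
    may use different filter coefficients.
    """
    n_frames, height, width = D.shape
    samples = []
    for i in range(-h, k + 1):
        ti, xi, yi = t + i, x + i * p0, y + i * q0
        # Hypothetical guard: skip samples that fall outside the data.
        if 0 <= ti < n_frames and 0 <= xi < width and 0 <= yi < height:
            samples.append(float(D[ti, yi, xi]))
    # i = 0 is always in range for a valid (t, x, y), so samples is non-empty.
    return sum(samples) / len(samples)
```

With (h,k) = (3,3) all seven frames contribute, with (0,3) only the current and three following frames are used, and with (0,0) the function returns D(t,x,y) unchanged, which matches the statement that equation (8) is equivalent to no filtering.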

ここで、注目ブロックに対応する領域が、他の時刻t-3t₀，t-2t₀，t-t₀，t+t₀，t+2t₀，t+3t₀の画像に存在しない場合としては、例えば、注目ブロック内に投影されている被写体の移動速度が、人間が認識することができないほど高速である場合や、注目ブロック内に投影されている被写体が、時刻tのフレーム（注目フレーム）にだけ突然現れた場合などがある。 Here, as a case where the region corresponding to the block of interest does not exist in the images at other times t-3t ₀ , t-2t ₀ , tt ₀ , t + t ₀ , t + 2t ₀ , t + 3t ₀ , For example, when the moving speed of the subject projected in the block of interest is so high that humans cannot recognize it, or the subject projected in the block of interest is in the frame at time t (frame of interest). There are cases where it only appears suddenly.

また、相関情報や合成相関情報の最小値を求めることは、いわゆるブロックマッチングにより動きベクトルを検出することに相当するが、注目ブロック内に投影されている被写体が、ブロックマッチングにおける探索範囲を超えて移動してしまっている場合も、注目ブロックに対応する領域が、他の時刻t-3t₀，t-2t₀，t-t₀，t+t₀，t+2t₀，t+3t₀の画像に存在しない場合に該当する。 Finding the minimum value of correlation information or composite correlation information is equivalent to detecting a motion vector by so-called block matching, but the subject projected in the target block exceeds the search range in block matching. Even if it has moved, the area corresponding to the block of interest is displayed in the images at other times t-3t ₀ , t-2t ₀ , tt ₀ , t + t ₀ , t + 2t ₀ , t + 3t ₀ . Applicable when it does not exist.

なお、ステップＳ１９２において、画素値C(t,x,y)の演算としてのフィルタリングは、時間方向のサンプル数を1/4に間引くダウンサンプリングを行いながら、即ち、４フレームごとに１フレームの割合で行われる。 Note that in step S192, the filtering as the calculation of the pixel value C(t,x,y) is performed together with downsampling that thins out the number of samples in the time direction to 1/4, that is, at a rate of one frame out of every four frames.

ステップＳ１９２の処理後は、ステップＳ１９３に進み、フィルタ部２２は、フィルタリング結果、即ち、ステップＳ１９２におけるフィルタリング結果（式（１）乃至（８）のうちのいずれかの演算結果）である画素値C(t,x,y)を、時刻tの位置(x,y)における画素値として、エンコード部２４に出力する。 After the process of step S192, the process proceeds to step S193, and the filter unit 22 outputs the pixel value C that is the filtering result, that is, the filtering result in step S192 (the calculation result of any one of the expressions (1) to (8)). (t, x, y) is output to the encoding unit 24 as a pixel value at the position (x, y) at time t.

以上のように、第２の動き検出処理（図７３乃至図７７のステップＳ１５１乃至Ｓ１７６の処理のうちの、ステップＳ１５１乃至Ｓ１７２の処理）では、第１の動き検出処理と同様に、フィルタの通過帯域のT方向の幅である2π/(4t₀)に相当する複数のフレームそれぞれとの相関を表す複数の相関情報を用いることによって、注目ブロックから複数のフレームへの、いわば平均的な動きを表す平均動きベクトルを求めるが、その際に、注目ブロックに対応する領域が存在しないフレームについての相関情報を用いないようにしたので、注目ブロックの平均的な動きを正確に表す平均動きベクトルを求めることができ、さらに、注目ブロックについて、正確な主成分方向を求めることができる。 As described above, in the second motion detection process (steps S151 to S172 among steps S151 to S176 of FIGS. 73 to 77), as in the first motion detection process, an average motion vector representing the average motion, so to speak, from the block of interest to a plurality of frames is obtained by using a plurality of pieces of correlation information, each representing the correlation with one of the plurality of frames corresponding to 2π/(4t₀), the width of the filter passband in the T direction. Because correlation information for frames in which no region corresponding to the block of interest exists is not used in doing so, an average motion vector that accurately represents the average motion of the block of interest can be obtained, and an accurate principal component direction can also be obtained for the block of interest.

また、第２の動き検出処理では、カーネルサイズ(h,k)を出力し、フィルタ部２２でのフィルタリングにおいては、そのカーネルサイズ(h,k)に対応した範囲のフレームの画素だけを用いるようにしたので、即ち、注目ブロックに対応する領域が存在しないフレームの画素を用いないようにしたので、人間の視覚で認識することができる周波数成分のみの適切なデータを得ることができる。 In the second motion detection process, the kernel size (h, k) is output, and only the pixels of the frame in the range corresponding to the kernel size (h, k) are used for filtering in the filter unit 22. In other words, since the pixel of the frame in which the region corresponding to the block of interest does not exist is not used, appropriate data of only frequency components that can be recognized by human vision can be obtained.

次に、例えば、図２５に示した送信装置１では、上述したように、フレームレート1/t₀の動画データを対象に、人間の視覚特性を考慮した必要な情報のみを残すフィルタリングを行うとともに、ダウンサンプリングを行うことで、データ量を削減することができる。 Next, for example, in the transmission device 1 shown in FIG. 25, as described above, filtering is performed on moving image data with a frame rate of 1 / t ₀ so as to leave only necessary information in consideration of human visual characteristics. By performing downsampling, the amount of data can be reduced.

さらに、図３３に示した受信装置２では、送信装置１が出力するフレームレート1/(4t₀)の動画データを対象に、フィルタリングとアップサンプリングを行うことで、理想的には、人間が画質の劣化を感じないフレームレート1/t₀の動画データを得ることができる。 Furthermore, the receiving device 2 shown in FIG. 33 performs filtering and upsampling on the moving image data with a frame rate of 1/(4t₀) output from the transmission device 1, and can thereby ideally obtain moving image data with a frame rate of 1/t₀ in which humans perceive no degradation in image quality.

ところで、受信装置２では、フレームレート1/(4t₀)の動画データをアップサンプリングすることによって、その動画データの２つのフレームの間に、３フレームが補間され、これにより、フレームレート1/t₀の動画データが得られる。このアップサンプリングにより補間されるフレーム（以下、適宜、補間フレームという）の画像データは、フレームレート1/(4t₀)の動画データのみから求められるため、実際には、多少の画質の劣化が生じることがある。 In the receiving device 2, upsampling the moving image data with a frame rate of 1/(4t₀) interpolates three frames between every two frames of that moving image data, which yields moving image data with a frame rate of 1/t₀. Because the image data of the frames interpolated by this upsampling (hereinafter referred to as interpolated frames as appropriate) is obtained only from the moving image data with a frame rate of 1/(4t₀), some degradation in image quality may occur in practice.

そこで、送信装置１では、フレームレート1/(4t₀)の動画データの他に、その動画データの２つのフレームの間に補間される補間フレームの画質の劣化を補う、いわば補足データを得て出力するようにすることができる。 Therefore, in the transmission apparatus 1, in addition to the moving image data at the frame rate 1 / (4t ₀ ), the supplemental data that compensates for the deterioration of the image quality of the interpolation frame interpolated between the two frames of the moving image data is obtained. Can be output.

図７９は、フレームレート1/t₀の動画データから、フレームレート1/(4t₀)の動画データと、補足データとを得て出力する図２４の送信装置１の構成例を示している。 FIG. 79 shows a configuration example of the transmission device 1 of FIG. 24 that obtains and outputs moving image data with a frame rate of 1/(4t₀) and supplementary data from moving image data with a frame rate of 1/t₀.

なお、以下においては、フレーム周期t₀を、例えば、1/240秒とする。この場合、送信装置１に供給（入力）されるフレームレート1/t₀の動画データは、フレームレートが240fpsの動画データである。また、フレームレートが240fpsの動画データなどを、以下、適宜、240fps動画データなどと記載する。 In the following, the frame period t ₀ is, for example, 1/240 seconds. In this case, the moving image data with the frame rate 1 / t ₀ supplied (input) to the transmission device 1 is moving image data with a frame rate of 240 fps. In addition, moving image data with a frame rate of 240 fps is hereinafter referred to as 240 fps moving image data as appropriate.

図７９の送信装置１において、入力端子２１１には、240fps動画データが入力される。入力端子２１１は、帯域制限フィルタ部２１２に接続されており、入力端子２１１に入力された240fps動画データは、帯域制限フィルタ部２１２に供給される。 In the transmission device 1 of FIG. 79, 240 fps moving image data is input to the input terminal 211. The input terminal 211 is connected to the band limiting filter unit 212, and the 240 fps moving image data input to the input terminal 211 is supplied to the band limiting filter unit 212.

帯域制限フィルタ部２１２は、例えば、図２５に示したバッファ部２１、フィルタ部２２、およびフィルタ生成部２３で構成されている。帯域制限フィルタ部２１２では、入力端子２１１からの240fps動画データを対象に、上述したような人間の視覚特性を考慮した必要な情報のみを残すフィルタリングが行われる。但し、帯域制限フィルタ部２１２を構成するフィルタ部２２（図２５）では、1/4のダウンサンプリングは行われない。従って、帯域制限フィルタ部２１２から出力される動画データは、240fps動画データのフレームレートが1/4になったフレームレート1/(4t₀)の動画データ（60fps動画データ）ではなく、240fps動画データである。 The band limiting filter unit 212 includes, for example, the buffer unit 21, the filter unit 22, and the filter generation unit 23 illustrated in FIG. In the band limiting filter unit 212, filtering is performed on the 240 fps moving image data from the input terminal 211 so as to leave only necessary information in consideration of human visual characteristics as described above. However, 1/4 downsampling is not performed in the filter unit 22 (FIG. 25) constituting the band limiting filter unit 212. Therefore, the moving image data output from the band limiting filter unit 212 is not the moving image data (60 fps moving image data) of the frame rate 1 / (4t ₀ ) in which the frame rate of the 240 fps moving image data is 1/4, but the 240 fps moving image data. It is.

なお、帯域制限フィルタ部２１２を構成するフィルタ生成部２３（図２５）の主成分方向取得部３１は、例えば、図７２に示したように構成することができる。この場合、フィルタ生成部２３では、図７３乃至図７７で説明した処理が行われ、帯域制限フィルタ部２１２を構成するフィルタ部２２（図２５）に対して、フィルタ情報としての３次元の動きベクトル(p₀,q₀,t₀)と、カーネルサイズ(h,k)とが出力される（図７７のステップＳ１７６）。そして、帯域制限フィルタ部２１２を構成するフィルタ部２２では、図７８で説明したように、カーネルサイズ(h,k)に応じたフィルタリングが行われる。但し、帯域制限フィルタ部２１２を構成するフィルタ部２２では、上述したように、ダウンサンプリングは行われない。即ち、240fps動画データの４フレームごとに１フレームのフィルタリングが行われるのではなく、240fps動画データの各フレーム（の各画素）についてフィルタリングが行われる。 Note that the principal component direction acquisition unit 31 of the filter generation unit 23 (FIG. 25) constituting the band limiting filter unit 212 can be configured as shown in FIG. 72, for example. In this case, the filter generation unit 23 performs the processing described with reference to FIGS. 73 to 77, and provides the filter unit 22 (FIG. 25) constituting the band-limiting filter unit 212 with a three-dimensional motion vector as filter information. (p ₀ , q ₀ , t ₀ ) and the kernel size (h, k) are output (step S176 in FIG. 77). Then, as described with reference to FIG. 78, filtering according to the kernel size (h, k) is performed in the filter unit 22 constituting the band limiting filter unit 212. However, as described above, downsampling is not performed in the filter unit 22 constituting the band limiting filter unit 212. That is, one frame is not filtered for every four frames of 240 fps moving image data, but filtering is performed for each frame (each pixel) of 240 fps moving image data.

ここで、帯域制限フィルタ部２１２での処理は、入力端子２１１からの240fps動画データから、人間には認識することができない周波数成分を除去するものであり、いわば240fps動画データを圧縮するための前段階の処理（プリプロセス）と考えることも出来る。帯域制限フィルタ部２１２は、図７９の送信装置１において必須のブロックというわけではなく、従って、図７９の送信装置１は、帯域制限フィルタ部２１２を設けずに構成することができる。この場合、入力端子２１１からの240fps動画データは、帯域制限フィルタ部２１２の後段に設けられている分離回路２１３に対して、直接入力される。 Here, the processing in the band limiting filter unit 212 is to remove frequency components that cannot be recognized by humans from the 240 fps moving image data from the input terminal 211, so to speak, before the 240 fps moving image data is compressed. It can also be thought of as stage processing (preprocessing). The band limiting filter unit 212 is not an indispensable block in the transmission device 1 of FIG. 79, and therefore the transmission device 1 of FIG. 79 can be configured without providing the band limiting filter unit 212. In this case, the 240 fps moving image data from the input terminal 211 is directly input to the separation circuit 213 provided at the subsequent stage of the band limiting filter unit 212.

図７９では、帯域制限フィルタ部２１２が出力する240fps動画データは、その後段の分離回路２１３に供給される。 In FIG. 79, the 240 fps moving image data output from the band limiting filter unit 212 is supplied to the subsequent separation circuit 213.

分離回路２１３は、帯域制限フィルタ部２１２からの240fps動画データ（第１の動画データ）を、その240fps動画データのフレームレートよりも低いフレームレートの、例えば、60fps動画データ（第２の動画データ）と、240fps動画データから60fps動画データを除いた残りの動画データ（第３の動画データ）とに分離する。即ち、分離回路２１３は、帯域制限フィルタ部２１２からの240fps動画データの４フレームごとに１フレームを選択することで、その240fps動画データから、60fps動画データを分離する。つまり、分離回路２１３は、図２５のフィルタ部２２における1/4のダウンサンプリングの結果得られるのと同様の60fps動画データを得る。さらに、分離回路２１３は、帯域制限フィルタ部２１２からの240fps動画データから、60fps動画データを分離した残りの動画データを得る。 The separation circuit 213 converts the 240 fps moving image data (first moving image data) from the band limiting filter unit 212 to a frame rate lower than the frame rate of the 240 fps moving image data, for example, 60 fps moving image data (second moving image data). And the remaining video data (third video data) obtained by removing the 60 fps video data from the 240 fps video data. That is, the separation circuit 213 selects one frame for every four frames of 240 fps moving image data from the band limiting filter unit 212, thereby separating 60 fps moving image data from the 240 fps moving image data. That is, the separation circuit 213 obtains 60 fps moving image data similar to that obtained as a result of 1/4 downsampling in the filter unit 22 of FIG. Further, the separation circuit 213 obtains the remaining moving image data obtained by separating the 60 fps moving image data from the 240 fps moving image data from the band limiting filter unit 212.

ここで、240fps動画データから、60fps動画データを分離した残りの動画データを、以下、適宜、240-60fps動画データという。 Here, the remaining moving image data obtained by separating 60 fps moving image data from 240 fps moving image data is hereinafter referred to as 240-60 fps moving image data as appropriate.

分離回路２１３において得られた60fps動画データは、圧縮回路２１４に供給される。また、分離回路２１３において得られた240-60fps動画データは、圧縮の対象の画像としてのターゲット(target)画像として、差分情報抽出部２１７に供給される。 The 60 fps moving image data obtained in the separation circuit 213 is supplied to the compression circuit 214. The 240-60 fps moving image data obtained in the separation circuit 213 is supplied to the difference information extraction unit 217 as a target image as an image to be compressed.

圧縮回路２１４は、分離回路２１３から供給される60fps動画データを、例えば、MPEGなどの既知のエンコード方法、その他の任意のエンコード方法によりエンコード（圧縮）し、その結果得られるビットストリームを出力する。このビットストリームは、解凍回路２１６と出力端子２１５に供給され、これにより、出力端子２１５からは、60fps動画データのエンコード（圧縮）結果としてのビットストリームが出力される。ここで、圧縮回路２１４は、図２５のエンコード部２４に相当し、従って、圧縮回路２１４が出力するビットストリームは、エンコードデータに相当する。 The compression circuit 214 encodes (compresses) the 60 fps moving image data supplied from the separation circuit 213 by, for example, a known encoding method such as MPEG, or any other encoding method, and outputs a bit stream obtained as a result. This bit stream is supplied to the decompression circuit 216 and the output terminal 215, whereby a bit stream as a result of encoding (compression) of 60 fps moving image data is output from the output terminal 215. Here, the compression circuit 214 corresponds to the encoding unit 24 in FIG. 25, and therefore the bit stream output from the compression circuit 214 corresponds to encoded data.

解凍回路２１６は、圧縮回路２１４からのビットストリームをデコード（解凍）（伸張）し、即ち、ローカルデコードし、そのローカルデコードの結果得られる60fps動画データを、ターゲット画像の圧縮にあたって参照するリファレンス(reference)画像として、差分情報抽出部２１７に供給する。 The decompression circuit 216 decodes (decompresses, expands) the bit stream from the compression circuit 214, that is, performs local decoding, and supplies the 60 fps moving image data obtained by the local decoding to the difference information extraction unit 217 as a reference image to be referred to when compressing the target image.

なお、圧縮回路２１４が、例えば、MPEGエンコードなどの、データのエンコードにあたってローカルデコードを行うエンコード方法によるエンコードを行う場合には、圧縮回路２１４において行われるローカルデコードの結果得られる60fps動画データを、リファレンス画像として、圧縮回路２１４から差分情報抽出部２１７に供給するようにすることができる。この場合、解凍回路２１６は、設ける必要がない。 When the compression circuit 214 performs encoding by an encoding method that performs local decoding in encoding data, such as MPEG encoding, for example, the 60 fps moving image data obtained as a result of local decoding performed in the compression circuit 214 is used as a reference. An image can be supplied from the compression circuit 214 to the difference information extraction unit 217. In this case, the decompression circuit 216 need not be provided.

差分情報抽出部２１７は、分離回路２１３からのターゲット画像である240-60fps動画データを、解凍回路２１６からのリファレンス画像である60fps動画データを用いて圧縮し、その圧縮によって得られるデータ（以下、適宜、差分圧縮データという）を、出力端子２１８に供給する。即ち、差分情報抽出部２１７は、リファレンス画像から、ターゲット画像の推測値を求め、その推測値を用いて、ターゲット画像を圧縮する。具体的には、差分情報抽出部２１７は、ターゲット画像と、その推測値との差分をとることで、ターゲット画像を圧縮する。そして、ターゲット画像とその推測値との差分をとることで、ターゲット画像を圧縮することにより得られるデータである差分圧縮データは、差分情報抽出部２１７から出力端子２１８に供給される。これにより、出力端子２１８からは、240-60fps動画データの圧縮結果としての差分圧縮データが出力される。 The difference information extraction unit 217 compresses the 240-60 fps moving image data, which is the target image from the separation circuit 213, using the 60 fps moving image data, which is the reference image from the decompression circuit 216, and supplies the data obtained by the compression (hereinafter referred to as differentially compressed data as appropriate) to the output terminal 218. That is, the difference information extraction unit 217 obtains an estimated value of the target image from the reference image and compresses the target image using the estimated value. Specifically, the difference information extraction unit 217 compresses the target image by taking the difference between the target image and its estimated value. The differentially compressed data obtained in this way is supplied from the difference information extraction unit 217 to the output terminal 218, so that the output terminal 218 outputs the differentially compressed data as the compression result of the 240-60 fps moving image data.
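The difference step described above can be illustrated with a minimal Python sketch. It assumes that the estimated value has already been derived from the reference image and is available as an array; how the estimate is formed is not covered here. The function names `difference_block` and `restore_block`, the 8-bit pixel type, and the signed 16-bit residual are assumptions made for illustration only.

```python
import numpy as np

def difference_block(target_block, estimate):
    """Residual between a target block and its estimate (signed values)."""
    # Widen to int16 so that negative differences are representable.
    return target_block.astype(np.int16) - estimate.astype(np.int16)

def restore_block(residual, estimate):
    """Inverse step: add the residual back to the same estimate."""
    return (residual + estimate.astype(np.int16)).astype(np.uint8)
```

If the estimate is close to the target, the residual values cluster around zero and are cheaper to code than the raw pixels. A receiver that can form the same estimate recovers the target by adding the residual back.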

ここで、以上のように、図７９の送信装置１では、60fps動画データのエンコード（圧縮）結果としてのビットストリームと、240-60fps動画データの圧縮結果としての差分圧縮データとが、出力端子２１５と２１８とから、独立に出力される。ビットストリームと差分圧縮データとは、そのまま別々に、あるいは、多重化され、図２４の記録媒体１１に記録され、あるいは、伝送媒体１２を介して伝送される。 As described above, in the transmission device 1 of FIG. 79, the bit stream resulting from encoding (compressing) the 60 fps moving image data and the differentially compressed data resulting from compressing the 240-60 fps moving image data are output independently from the output terminals 215 and 218, respectively. The bit stream and the differentially compressed data are recorded on the recording medium 11 of FIG. 24 or transmitted via the transmission medium 12, either separately as they are or after being multiplexed.

なお、出力端子２１８から出力される、240-60fps動画データの圧縮結果としての差分圧縮データが、上述した補間フレームの画質の劣化を補う補足データである。 The differentially compressed data output from the output terminal 218 as the compression result of the 240-60 fps moving image data is supplementary data that compensates for the above-described degradation of the image quality of the interpolation frame.

次に、図８０は、分離回路２１３に入力される240fps動画データと、分離回路２１３から出力される60fps動画データ、および240-60fps動画データを示している。なお、図８０において、横軸は時間を表している。 Next, FIG. 80 shows 240 fps moving image data input to the separation circuit 213, 60 fps moving image data and 240-60 fps moving image data output from the separation circuit 213. In FIG. 80, the horizontal axis represents time.

図８０の一番上のフレームf₁，f₂，・・・，f₁₃は、図７９の分離回路２１３に入力される240fps動画データを示している。240fps動画データにおいて、あるフレームf_iと次のフレームf_i+1との時間間隔は、1/240秒である(iは整数）。 The frames f₁, f₂, ..., f₁₃ at the top of FIG. 80 show the 240 fps moving image data input to the separation circuit 213 of FIG. 79. In the 240 fps moving image data, the time interval between a frame f_i and the next frame f_{i+1} is 1/240 second (i is an integer).

なお、フレームf₁の前と、フレームf₁₃の後にも、フレームは続くが、図８０では、図示を省略してある。 Although the frame continues before the frame f ₁ and after the frame f ₁₃ , the illustration is omitted in FIG.

図７９の分離回路２１３は、図８０の一番上に示した240fps動画データを、図８０の上から２番目に示す60fps動画データと、図８０の一番下に示す240-60fps動画データとに分離する。 The separation circuit 213 of FIG. 79 converts the 240 fps moving image data shown at the top of FIG. 80 into the 60 fps moving image data shown second from the top of FIG. 80 and the 240-60 fps moving image data shown at the bottom of FIG. To separate.

即ち、分離回路２１３は、240fps動画データの４フレームf_4i+1，f_4i+2，f_4i+3，f_4i+4ごとの、例えば、１フレームf_4i+1を選択することにより、図８０の上から２番目に示すようなフレーム・・・，f₁，f₅，f₉，f₁₃，・・・からなる60fps動画データを得る。 That is, the separation circuit 213 selects, for example, the one frame f_{4i+1} out of every four frames f_{4i+1}, f_{4i+2}, f_{4i+3}, f_{4i+4} of the 240 fps moving image data, thereby obtaining the 60 fps moving image data consisting of the frames ..., f₁, f₅, f₉, f₁₃, ... shown second from the top in FIG. 80.

また、分離回路２１３は、240fps動画データから、60fps動画データを除いた残りの、図８０の一番下に示すようなフレーム・・・，f₂，f₃，f₄，f₆，f₇，f₈，f₁₀，f₁₁，f₁₂，・・・からなる240-60fps動画データを得る。 In addition, the separation circuit 213 obtains the 240-60 fps moving image data consisting of the frames ..., f₂, f₃, f₄, f₆, f₇, f₈, f₁₀, f₁₁, f₁₂, ... shown at the bottom of FIG. 80, which remain after the 60 fps moving image data is removed from the 240 fps moving image data.

60fps動画データにおいては、あるフレームと次のフレームとの時間間隔は、1/60秒である。また、240-60fps動画データは、１秒あたりのフレーム数が、240-60=180フレームの動画データである。 In 60 fps video data, the time interval between one frame and the next is 1/60 second. Further, 240-60fps moving image data is moving image data with 240-60 = 180 frames per second.
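The frame separation described above amounts to taking every fourth frame for the 60 fps stream and keeping the rest as the 240-60 fps stream. A minimal Python sketch follows; the function name `separate_frames` and the list-of-frames representation are assumptions for illustration, and the list is assumed to start at a frame of the form f_{4i+1}.

```python
def separate_frames(frames, ratio=4):
    """Split a frame sequence into a low-rate stream and the remainder.

    frames[0] is assumed to be a frame f_{4i+1}; with ratio=4 the first
    stream keeps f1, f5, f9, ... and the second keeps all other frames.
    """
    low_rate = frames[0::ratio]
    remainder = [f for i, f in enumerate(frames) if i % ratio != 0]
    return low_rate, remainder
```

Applied to the thirteen frames f₁ to f₁₃ of FIG. 80, the first list holds f₁, f₅, f₉, f₁₃ and the second holds the nine frames f₂, f₃, f₄, f₆, f₇, f₈, f₁₀, f₁₁, f₁₂, that is, three out of every four frames, consistent with 240 - 60 = 180 frames per second.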

図７９の差分情報抽出部２１７には、図８０の上から２番目に示す60fps動画データがリファレンス画像として供給されるとともに、図８０の一番下に示す240-60fps動画データがターゲット画像として供給される。差分情報抽出部２１７では、ターゲット画像である240-60fps動画データのフレーム・・・，f₂，f₃，f₄，f₆，f₇，f₈，f₁₀，f₁₁，f₁₂，・・・（の画像データ）が、リファレンス画像である60fps動画データのフレーム・・・，f₁，f₅，f₉，f₁₃，・・・（の画像データ）を用いて圧縮される。 The difference information extraction unit 217 of FIG. 79 is supplied with the 60 fps moving image data shown second from the top in FIG. 80 as the reference image and with the 240-60 fps moving image data shown at the bottom of FIG. 80 as the target image. In the difference information extraction unit 217, the frames ..., f₂, f₃, f₄, f₆, f₇, f₈, f₁₀, f₁₁, f₁₂, ... (their image data) of the 240-60 fps moving image data, which is the target image, are compressed using the frames ..., f₁, f₅, f₉, f₁₃, ... (their image data) of the 60 fps moving image data, which is the reference image.

なお、図７９の圧縮回路２１４において、分離回路２１３からの60fps動画データに対して、完全に可逆のエンコードが行われる場合には、分離回路２１３で得られる60fps動画データと、解凍回路２１６から差分情報抽出部２１７に供給されるリファレンス画像としての60fps動画データとは一致する。しかしながら、圧縮回路２１４において、分離回路２１３からの60fps動画データに対して行われるエンコードが完全に可逆でない場合には、分離回路２１３で得られる60fps動画データと、解凍回路２１６から差分情報抽出部２１７に供給されるリファレンス画像としての60fps動画データとには、厳密には違いがある。但し、ここでは、その違いを問題とする必要はないので、以下では、分離回路２１３で得られる60fps動画データと、解凍回路２１６から差分情報抽出部２１７に供給される60fps動画データとは、一致するものとして説明を行う。即ち、厳密には、・・・，f₁，f₅，f₉，f₁₃，・・・（の画像データ）に対してエンコード（図７９の圧縮回路２１４によるエンコード）とデコード（解凍回路２１６によるデコード）を行ったものが、リファレンス画像である。しかし、ここでは、単に、リファレンス画像は図８０の上から２番目に示す60fps動画データのフレーム・・・，f₁，f₅，f₉，f₁₃，・・・（の画像データ）であるとして説明を行う。 Note that when the compression circuit 214 of FIG. 79 encodes the 60 fps moving image data from the separation circuit 213 completely losslessly, the 60 fps moving image data obtained by the separation circuit 213 matches the 60 fps moving image data supplied from the decompression circuit 216 to the difference information extraction unit 217 as the reference image. When the encoding performed by the compression circuit 214 is not completely lossless, however, the two differ, strictly speaking. Since that difference does not matter here, the following description assumes that the 60 fps moving image data obtained by the separation circuit 213 and the 60 fps moving image data supplied from the decompression circuit 216 to the difference information extraction unit 217 match. That is, strictly speaking, the reference image is the result of encoding (by the compression circuit 214 of FIG. 79) and decoding (by the decompression circuit 216) the frames ..., f₁, f₅, f₉, f₁₃, ... (their image data). Here, however, the reference image is simply described as the frames ..., f₁, f₅, f₉, f₁₃, ... (their image data) of the 60 fps moving image data shown second from the top in FIG. 80.

次に、図８１は、図７９の差分情報抽出部２１７の構成例を示している。 Next, FIG. 81 shows a configuration example of the difference information extraction unit 217 of FIG.

差分情報抽出部２１７は、大きく分けて、ターゲット記憶部２２１、リファレンス記憶部２２２、およびデータ処理部２２３から構成されている。 The difference information extraction unit 217 is roughly composed of a target storage unit 221, a reference storage unit 222, and a data processing unit 223.

ターゲット記憶部２２１には、分離回路２１３（図７９）からターゲット画像としての240-60fps動画データが供給される。ターゲット記憶部２２１は、分離回路２１３からのターゲット画像としての240-60fps動画データを一時記憶する。 The target storage unit 221 is supplied with 240-60 fps moving image data as a target image from the separation circuit 213 (FIG. 79). The target storage unit 221 temporarily stores 240-60 fps moving image data as a target image from the separation circuit 213.

ターゲット記憶部２２１に記憶されたターゲット画像の各フレーム（の画像データ）は、ブロック分割される。そして、そのブロック分割によって得られる各ブロックが、順次、注目ブロックとされ、その注目ブロック（の画像データ）は、ターゲット記憶部２２１から読み出され、入力端子２３３からデータ処理部２２３に入力される。なお、ブロックのサイズは、例えば、上述した場合と同様に、１６×１６画素などを採用することができる。また、ブロックを構成する画素数は、複数画素であってもよいし、１画素であってもよい。 Each frame (image data) of the target image stored in the target storage unit 221 is divided into blocks. Each block obtained by the block division is sequentially set as a target block, and the target block (image data thereof) is read from the target storage unit 221 and input to the data processing unit 223 from the input terminal 233. . Note that, for example, 16 × 16 pixels can be adopted as the block size, as in the case described above. In addition, the number of pixels constituting the block may be a plurality of pixels or one pixel.
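The block division described above can be sketched as a simple Python generator. This is a hedged illustration only: the name `split_into_blocks`, the single-channel 2-D array layout, and the raster scan order are assumptions, and the handling of frames whose size is not a multiple of the block size (edge blocks simply come out smaller here) is not specified by the source.

```python
import numpy as np

def split_into_blocks(frame, block_size=16):
    """Yield (top, left, block) for each block of a 2-D frame in raster order."""
    height, width = frame.shape
    for top in range(0, height, block_size):
        for left in range(0, width, block_size):
            # Slicing clips at the frame border, so edge blocks may be smaller.
            yield top, left, frame[top:top + block_size, left:left + block_size]
```

Each yielded block would in turn play the role of the block of interest. Setting `block_size=1` corresponds to the single-pixel block mentioned in the text.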

リファレンス記憶部２２２には、解凍回路２１６（図７９）からリファレンス画像としての60fps動画データが供給される。リファレンス記憶部２２２は、解凍回路２１６からのリファレンス画像としての60fps動画データを一時記憶する。 The reference storage unit 222 is supplied with 60 fps moving image data as a reference image from the decompression circuit 216 (FIG. 79). The reference storage unit 222 temporarily stores 60 fps moving image data as a reference image from the decompression circuit 216.

リファレンス記憶部２２２に記憶されたリファレンス画像のフレームのうちの、注目ブロックのフレームの直前のフレーム（の画像データ）は、パストリファレンス(past reference)画像として読み出され、入力端子２３１からデータ処理部２２３に入力される。また、リファレンス記憶部２２２に記憶されたリファレンス画像のフレームのうちの、注目ブロックのフレームの直後のフレーム（の画像データ）は、フューチャリファレンス(future reference)画像として読み出され、入力端子２３２からデータ処理部２２３に入力される。 Of the frames of the reference image stored in the reference storage unit 222, the frame (its image data) immediately before the frame of the block of interest is read out as a past reference image and input to the data processing unit 223 from the input terminal 231. Likewise, the frame (its image data) immediately after the frame of the block of interest is read out as a future reference image and input to the data processing unit 223 from the input terminal 232.

ここで、ターゲット記憶部２２１には、図８０の一番下に示したターゲット画像である240-60fps動画データのフレームf₂，f₃，f₄，f₆，f₇，f₈，f₁₀，f₁₁，f₁₂等が一時記憶される。また、リファレンス記憶部２２２には、図８０の上から２番目に示したリファレンス画像である60fps動画データのフレームf₁，f₅，f₉，f₁₃等が一時記憶される。 Here, the target storage unit 221 temporarily stores the frames f₂, f₃, f₄, f₆, f₇, f₈, f₁₀, f₁₁, f₁₂, and so on of the 240-60 fps moving image data, which is the target image shown at the bottom of FIG. 80. The reference storage unit 222 temporarily stores the frames f₁, f₅, f₉, f₁₃, and so on of the 60 fps moving image data, which is the reference image shown second from the top in FIG. 80.

そして、例えば、ターゲット記憶部２２１に記憶されたターゲット画像のフレームf₂，f₃，f₄のうちのいずれかのフレームのブロックが注目ブロックである場合には、リファレンス画像のフレームf₁，f₅，f₉，f₁₃のうちの、ターゲット画像のフレームf₂，f₃，f₄の直前と直後のフレームf₁とf₅が、それぞれパストリファレンス画像とフューチャリファレンス画像として、リファレンス記憶部２２２から読み出される。 For example, when the block of interest belongs to one of the frames f₂, f₃, and f₄ of the target image stored in the target storage unit 221, the frames f₁ and f₅, which among the reference image frames f₁, f₅, f₉, and f₁₃ come immediately before and after the target image frames f₂, f₃, and f₄, are read out from the reference storage unit 222 as the past reference image and the future reference image, respectively.

また、例えば、ターゲット記憶部２２１に記憶されたターゲット画像のフレームf₆，f₇，f₈のうちのいずれかのフレームのブロックが注目ブロックである場合には、リファレンス画像のフレームf₁，f₅，f₉，f₁₃のうちの、ターゲット画像のフレームf₆，f₇，f₈の直前と直後のフレームf₅とf₉が、それぞれパストリファレンス画像とフューチャリファレンス画像として、リファレンス記憶部２２２から読み出される。 Similarly, when the block of interest belongs to one of the frames f₆, f₇, and f₈ of the target image stored in the target storage unit 221, the frames f₅ and f₉, which among the reference image frames f₁, f₅, f₉, and f₁₃ come immediately before and after the target image frames f₆, f₇, and f₈, are read out from the reference storage unit 222 as the past reference image and the future reference image, respectively.

Further, when the block of interest belongs to any of frames f₁₀, f₁₁, and f₁₂ of the target image stored in the target storage unit 221, frames f₉ and f₁₃ of the reference image, which immediately precede and follow frames f₁₀, f₁₁, and f₁₂, are read from the reference storage unit 222 as the past reference image and the future reference image, respectively.
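The frame pairing described above can be sketched as follows. This is only an illustration: the function name `reference_frames` and the parameter `period` are hypothetical, and the sketch assumes frames are numbered from 1 with every fourth frame (f₁, f₅, f₉, f₁₃, ...) being a reference frame, as in the example.

```python
def reference_frames(target_index, period=4):
    """Return (past, future) reference frame numbers for a target frame.

    Assumption for illustration: frames are numbered from 1 and every
    `period`-th frame (1, 5, 9, 13, ...) is a reference frame.
    """
    if (target_index - 1) % period == 0:
        raise ValueError("frame %d is itself a reference frame" % target_index)
    # Reference frame at or before the target frame.
    past = ((target_index - 1) // period) * period + 1
    return past, past + period


for n in (2, 3, 4, 6, 7, 8, 10, 11, 12):
    print("f%d -> past f%d, future f%d" % ((n,) + reference_frames(n)))
```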

The data processing unit 223 comprises input terminals 231 to 233, six difference data calculation units 234 to 239, a selection circuit 240, and an output terminal 241. It compresses the block of interest input from the input terminal 233 using the past reference image input from the input terminal 231 or the future reference image input from the input terminal 232, and outputs the resulting differential compressed data from the output terminal 241 to the output terminal 218 of FIG. 79.

That is, the data processing unit 223 compresses the target image block by block.

Specifically, in the data processing unit 223, the past reference image input from the input terminal 231 is supplied to four of the six difference data calculation units 234 to 239, namely units 234 to 237. The future reference image input from the input terminal 232 is also supplied to four of them, namely units 234 to 236 and unit 238. The block of interest input from the input terminal 233 is supplied to all six difference data calculation units 234 to 239.

The difference data calculation unit 234 compresses the block of interest from the input terminal 233 using the past reference image from the input terminal 231 and the future reference image from the input terminal 232, and supplies the result, first difference data (compressed data) described later, to the selection circuit 240.

The difference data calculation unit 235 also compresses the block of interest from the input terminal 233 using the past reference image from the input terminal 231 and the future reference image from the input terminal 232, and supplies the result, second difference data (other compressed data) described later, to the selection circuit 240.

The difference data calculation unit 236 likewise compresses the block of interest from the input terminal 233 using the past reference image from the input terminal 231 and the future reference image from the input terminal 232, and supplies the result, third difference data described later, to the selection circuit 240.

The difference data calculation unit 237 compresses the block of interest from the input terminal 233 using only the past reference image from the input terminal 231, and supplies the result, fourth difference data described later, to the selection circuit 240.

The difference data calculation unit 238 compresses the block of interest from the input terminal 233 using only the future reference image from the input terminal 232, and supplies the result, fifth difference data described later, to the selection circuit 240.

The difference data calculation unit 239 compresses the block of interest from the input terminal 233 and supplies the result, sixth difference data described later, to the selection circuit 240.

The selection circuit 240 selects, from the first to sixth difference data supplied by the six difference data calculation units 234 to 239, the one best suited as the compression result of the block of interest. The difference data selected by the selection circuit 240 is hereinafter referred to, as appropriate, as the selected difference data.

The selection circuit 240 attaches to the selected difference data a case ID (identification), described later, as identification information indicating which of the first to sixth difference data it is, and outputs the result from the output terminal 241 as the differential compressed data, that is, the compression result of the block of interest.
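As a rough sketch of the selection step, the code below picks one candidate and tags it with an identifier. The criterion for "best suited" is not given in this passage, so the sketch assumes, purely for illustration, that the candidate with the smallest sum of absolute values is chosen. The function name `select_difference_data` and the numbering of case IDs from 1 are likewise hypothetical.

```python
def select_difference_data(candidates):
    """Pick one candidate residual block and tag it with a case ID.

    candidates: residual blocks (lists of rows of ints), ordered as the
    first, second, ... difference data.
    Assumption: "best" means the smallest sum of absolute values; the actual
    criterion used by the selection circuit is not stated in this passage.
    Case IDs are numbered from 1 here, also an assumption.
    """
    def cost(block):
        return sum(abs(v) for row in block for v in row)

    best = min(range(len(candidates)), key=lambda i: cost(candidates[i]))
    return best + 1, candidates[best]
```

A decoder would read the case ID first to decide which of the six restoration methods to apply to the accompanying difference data.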

Next, the methods by which the difference data calculation units 234 to 239 compress and restore the block of interest, and the first to sixth difference data obtained by that compression, are described.

FIG. 82 shows (frames of) a target image V_T, a past reference image V_P, and a future reference image V_F.

Let s be a variable that takes one of the values 1, 2, and 3. If the time interval between the target image V_T and the future reference image V_F is written as s/240 seconds, the time interval between the target image V_T and the past reference image V_P can be written as (4-s)/240 seconds.

For example, if the target image V_T is frame f₂ of the 240-60 fps moving image data shown at the bottom of FIG. 80, the past reference image V_P and the future reference image V_F are frames f₁ and f₅, respectively, of the 60 fps moving image data shown second from the top in FIG. 80, and the variable s is 3.

If the target image V_T is frame f₃ of the 240-60 fps moving image data shown at the bottom of FIG. 80, the past reference image V_P and the future reference image V_F are again frames f₁ and f₅ of the 60 fps moving image data, and the variable s is 2.

If the target image V_T is frame f₄ of the 240-60 fps moving image data shown at the bottom of FIG. 80, the past reference image V_P and the future reference image V_F are again frames f₁ and f₅ of the 60 fps moving image data, and the variable s is 1. In all other cases as well, the variable s is one of 1, 2, and 3.
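The relationship between the frame number and s can be sketched as below. The function name `frame_offsets` is hypothetical, and the sketch assumes 1-based frame numbers with reference frames at f₁, f₅, f₉, and so on.

```python
def frame_offsets(target_index, period=4):
    """Return (s, period - s) for a target frame, in units of 1/240 second.

    s is the distance to the future reference frame and period - s the
    distance from the past reference frame. Assumes 1-based frame numbers
    with reference frames at 1, 5, 9, ... (an illustrative convention).
    """
    phase = (target_index - 1) % period  # 0 for reference frames f1, f5, ...
    if phase == 0:
        raise ValueError("reference frames have no s value")
    s = period - phase
    return s, period - s


for n in (2, 3, 4):
    s, back = frame_offsets(n)
    print("f%d: s = %d, future gap = %d/240 s, past gap = %d/240 s" % (n, s, s, back))
```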

When the past reference image V_P and the future reference image V_F described above exist for the target image V_T, the block of interest of the target image V_T can be compressed using V_P and V_F as hints, so to speak, and the compression result can likewise be restored using V_P and V_F as hints.

That is, suppose that in the past reference image V_P a region R₈₂₀₁, which has the same size as the block of interest and is displaced by the vector mv₈₂₀₁ from the position of the block of interest in the target image V_T (for example, its upper-left corner), is similar to the block of interest (its image data). Taking the difference between the pixel value (image data) of each pixel of the block of interest and the pixel value of the corresponding pixel of region R₈₂₀₁ then yields differences that are almost all 0. By adopting these near-zero differences as the image data of the block of interest, the amount of image data for the block of interest can be reduced, that is, the block of interest can be compressed.

Similarly, suppose that in the future reference image V_F a region R₈₂₀₂, which has the same size as the block of interest and is displaced by the vector mv₈₂₀₂ from the position of the block of interest in the target image V_T, is similar to the block of interest. Taking the difference between the pixel value of each pixel of the block of interest and the pixel value of the corresponding pixel of region R₈₂₀₂ again yields differences that are almost all 0, and adopting them as the image data of the block of interest reduces its data amount, that is, compresses the block of interest.
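A minimal sketch of this difference-based compression against a single reference image follows. Images are plain lists of rows, and the function name `block_difference` and its parameter layout are hypothetical. The displacement `mv` plays the role of mv₈₂₀₁ or mv₈₂₀₂.

```python
def block_difference(target, reference, x0, y0, size, mv):
    """Pixel-wise difference between a block and a displaced reference region.

    target, reference: images as lists of rows of ints.
    (x0, y0): upper-left corner of the block of interest in `target`.
    mv: (dx, dy) displacement of the reference region relative to the block.
    The caller must keep the displaced region inside `reference`.
    """
    dx, dy = mv
    return [[target[y0 + j][x0 + i] - reference[y0 + j + dy][x0 + i + dx]
             for i in range(size)]
            for j in range(size)]
```

When the displaced region matches the block well, the returned values cluster around 0, which is what makes them cheaper to code than the raw pixel values.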

The difference data calculation units 234 to 239 (FIG. 81) basically compress the block of interest by taking the difference between the block of interest and the past reference image V_P or the future reference image V_F, as described above.

In the following, moving image data onto which the subjects shown in FIG. 83 are projected is considered as an example of 240 fps moving image data.

The horizontal axis in FIG. 83 represents the position x, y in the spatial directions. That is, to keep the figure from becoming cluttered, the two spatial directions x and y are represented by a single axis in FIG. 83. The same applies to FIGS. 84 to 93 described later.

In FIG. 83, the upper part shows the waveform of subject P₈₃₀₁, which is the background, and the lower part shows the waveform of subject P₈₃₀₂, which is the foreground.

Subject P₈₃₀₁ shown in the upper part of FIG. 83 is stationary, and subject P₈₃₀₂ shown in the lower part of FIG. 83 is moving in a certain spatial direction.

FIG. 84 shows (frames of) the target image V_T, the past reference image V_P, and the future reference image V_F onto which subjects P₈₃₀₁ and P₈₃₀₂ of FIG. 83 are projected.

In FIG. 84, suppose, for example, that subject P₈₃₀₂ is moving slowly at a constant speed, and that its motion vector, that is, the vector representing the amount by which P₈₃₀₂ moves in the spatial directions x and y per 1/240 second (the frame period of 240 fps moving image data), is (U,V).

As described above, the time interval between the past reference image V_P and the target image V_T is (4-s)/240 seconds, and the time interval between the target image V_T and the future reference image V_F is s/240 seconds.

When subject P₈₃₀₂ is moving slowly at a constant speed, it appears in all of the target image V_T, the past reference image V_P, and the future reference image V_F. Moreover, subject P₈₃₀₂ on the past reference image V_P is displaced from subject P₈₃₀₂ on the target image V_T by the vector (-(4-s)U,-(4-s)V), and subject P₈₃₀₂ on the future reference image V_F is displaced from subject P₈₃₀₂ on the target image V_T by the vector (sU,sV).
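The two displacement vectors follow directly from the per-frame motion (U,V) and the variable s. A small sketch, with the hypothetical name `reference_offsets`, makes the arithmetic concrete:

```python
def reference_offsets(u, v, s, period=4):
    """Displacements of a uniformly moving subject relative to the target frame.

    (u, v): motion per 1/240 second; s: frames from target to future reference.
    Returns (offset_in_past_reference, offset_in_future_reference).
    """
    past = (-(period - s) * u, -(period - s) * v)
    future = (s * u, s * v)
    return past, future


# With (U, V) = (2, 1) and s = 3, the past reference is one frame back
# and the future reference is three frames ahead.
print(reference_offsets(2, 1, 3))
```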

Accordingly, if a block B₈₄₀₁ of the target image V_T that contains only subject P₈₃₀₂ is taken as the block of interest, the image data of region R₈₄₀₁ in the past reference image V_P, which has the same size as block B₈₄₀₁ and is displaced from it by the vector (-(4-s)U,-(4-s)V), is (ideally) identical to that of block B₈₄₀₁. Likewise, the image data of region R₈₄₀₂ in the future reference image V_F, which has the same size as block B₈₄₀₁ and is displaced from it by the vector (sU,sV), is (ideally) identical to that of block B₈₄₀₁.

That is, the image data of region R₈₄₀₁ of the past reference image V_P, of the block of interest B₈₄₀₁, and of region R₈₄₀₂ of the future reference image V_F are (ideally) identical.

The difference data calculation unit 234 (FIG. 81) obtains first difference data suited to the case shown in FIG. 84.

That is, the difference data calculation unit 234 (FIG. 81) computes, for example, the average of the image data of region R₈₄₀₁ of the past reference image V_P and region R₈₄₀₂ of the future reference image V_F (the average of the pixel value of each pixel in region R₈₄₀₁ and the pixel value of the corresponding pixel in region R₈₄₀₂). Using this average as the estimate of the block of interest B₈₄₀₁, it computes the difference between block B₈₄₀₁ and the estimate (the difference between the pixel value of each pixel of block B₈₄₀₁ and the estimate of that pixel value). Because this difference is almost 0, the data amount of block B₈₄₀₁ can be reduced. The difference data calculation unit 234 outputs this difference between block B₈₄₀₁ and its estimate as the first difference data, which is the compression result of block B₈₄₀₁.
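The averaging and differencing just described can be sketched as follows. The names `region` and `first_difference` are hypothetical, and the use of integer floor division for the average is an assumption of this sketch, since the rounding rule is not stated here. Any rule works as long as the compressing and restoring sides apply the same one.

```python
def region(image, x0, y0, size, offset):
    """Cut a size x size region whose corner is (x0, y0) shifted by offset.

    The caller must keep the shifted region inside the image.
    """
    dx, dy = offset
    return [row[x0 + dx:x0 + dx + size] for row in image[y0 + dy:y0 + dy + size]]


def first_difference(target, past, future, x0, y0, size, off_past, off_future):
    """Return (estimate, residual) for the block of interest.

    estimate: per-pixel average of the past and future reference regions
    (floor division is an assumed rounding rule).
    residual: block of interest minus estimate, i.e. the first difference data.
    """
    block = region(target, x0, y0, size, (0, 0))
    rp = region(past, x0, y0, size, off_past)
    rf = region(future, x0, y0, size, off_future)
    estimate = [[(a + b) // 2 for a, b in zip(ra, rb)] for ra, rb in zip(rp, rf)]
    residual = [[t - e for t, e in zip(rt, re)] for rt, re in zip(block, estimate)]
    return estimate, residual
```

Here `off_past` and `off_future` correspond to (-(4-s)U,-(4-s)V) and (sU,sV); how they can be found without being transmitted is the subject of the following paragraphs.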

Restoring the first difference data to the original block of interest B₈₄₀₁ would normally require two motion vectors: the vector (-(4-s)U,-(4-s)V), which is the motion vector from block B₈₄₀₁ to the past reference image V_P, and the vector (sU,sV), which is the motion vector from block B₈₄₀₁ to the future reference image V_F.

However, if the two motion vectors (-(4-s)U,-(4-s)V) and (sU,sV) are attached to the first difference data, the data amount increases by the size of those two motion vectors.

The difference data calculation unit 234 therefore determines the regions R₈₄₀₁ and R₈₄₀₂ (that is, the motion vectors to them) used to compute the estimate of the block of interest B₈₄₀₁ in the following manner, and thereby obtains first difference data that can be restored without motion vectors.

In the present case, subject P₈₃₀₂ is assumed to be moving at a constant speed (including a speed of 0). Therefore region R₈₄₀₁ on the past reference image V_P, displaced from the block of interest B₈₄₀₁ by the motion vector (-(4-s)U,-(4-s)V), region R₈₄₀₂ on the future reference image V_F, displaced from block B₈₄₀₁ by the motion vector (sU,sV), and block B₈₄₀₁ itself (their corresponding pixels) lie on a single straight line in space-time, as shown in FIG. 84.

Accordingly, region R₈₄₀₁ on the past reference image V_P, displaced from the block of interest B₈₄₀₁ by the motion vector (-(4-s)U,-(4-s)V), and region R₈₄₀₂ on the future reference image V_F, displaced from block B₈₄₀₁ by the motion vector (sU,sV), both lie on a straight line in space-time that passes through block B₈₄₀₁.

Since the image data of region R₈₄₀₁ on the past reference image V_P and of region R₈₄₀₂ on the future reference image V_F are (ideally) identical, as described above, the correlation between regions R₈₄₀₁ and R₈₄₀₂ (their image data) is the highest among the correlations between a region on V_P and a region on V_F that lie on a straight line in space-time passing through the block of interest B₈₄₀₁.

Accordingly, if the positional relationship of the pair of regions with the highest correlation is detected from among the regions of the past reference image V_P and the future reference image V_F that lie on a straight line in space-time passing through the block of interest B₈₄₀₁, the region of V_P and the region of V_F in that positional relationship are regions R₈₄₀₁ and R₈₄₀₂, respectively.

That is, by detecting the positional relationship of the regions with the highest correlation in the past reference image V_P and the future reference image V_F, regions R₈₄₀₁ and R₈₄₀₂ can be obtained without the motion vector (-(4-s)U,-(4-s)V) from the block of interest B₈₄₀₁ to region R₈₄₀₁ and without the motion vector (sU,sV) from block B₈₄₀₁ to region R₈₄₀₂.

The difference data calculation unit 234 (FIG. 81) uses this principle to obtain first difference data that can be restored without motion vectors.

Specifically, the difference data calculation unit 234 obtains correlation information e₁(U',V'), which represents the correlation between a region Ip of the past reference image V_P and a region If of the future reference image V_F lying on a straight line in space-time passing through the block of interest, for example according to equation (9).

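$$e_1(U',V') = \sum_{(x,y)} \bigl|\, I_p\bigl(x-(4-s)U',\ y-(4-s)V'\bigr) - I_f\bigl(x+sU',\ y+sV'\bigr) \bigr| \qquad \cdots (9)$$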

In equation (9), (x,y) represents the position of a pixel of the block of interest in the target image V_T. Ip(x-(4-s)U', y-(4-s)V') represents the pixel value of the pixel at position (x-(4-s)U', y-(4-s)V') in the past reference image V_P, and If(x+sU', y+sV') represents the pixel value of the pixel at position (x+sU', y+sV') in the future reference image V_F. The Σ in equation (9) denotes summation over all pixels constituting the block of interest.

The correlation information e₁(U',V') of equation (9) is the sum of the absolute differences between the pixel values of region Ip of the past reference image V_P and region If of the future reference image V_F. Therefore, the higher the correlation between region Ip and region If, the smaller the value of e₁(U',V').

The difference data calculation unit 234 computes the correlation information e₁(U',V') of equation (9) for every conceivable (U',V'). Here, every conceivable (U',V') means, for example, every (U',V') for which the position (x-(4-s)U', y-(4-s)V') lies on the past reference image V_P and the position (x+sU', y+sV') lies on the future reference image V_F. Alternatively, e₁(U',V') may be computed only for (U',V') within a predetermined range.

The difference data calculation unit 234 detects the minimum of the correlation information e₁(U',V') computed for each value of (U',V'), and takes the (U',V') that gives this minimum as the vector (U,V) representing the positional relationship of the regions with the highest correlation in the past reference image V_P and the future reference image V_F.
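The search for the minimizing (U',V') can be sketched as an exhaustive scan. This is an illustration rather than the apparatus itself: the function names, the `search` half-width (standing in for the "predetermined range"), and the row-by-row scan order are all assumptions of the sketch.

```python
def e1(past, future, x0, y0, size, s, u, v, period=4):
    """Sum of absolute differences of equation (9) for candidate (u, v)."""
    total = 0
    for y in range(y0, y0 + size):
        for x in range(x0, x0 + size):
            ip = past[y - (period - s) * v][x - (period - s) * u]
            i_f = future[y + s * v][x + s * u]
            total += abs(ip - i_f)
    return total


def fits(image, x0, y0, size, dx, dy):
    """True if the size x size region displaced by (dx, dy) lies inside image."""
    height, width = len(image), len(image[0])
    return (0 <= x0 + dx and x0 + dx + size <= width and
            0 <= y0 + dy and y0 + dy + size <= height)


def positional_vector(past, future, x0, y0, size, s, search=2, period=4):
    """Return the (U, V) that minimises e1 over a square search window.

    `search` is a hypothetical half-width standing in for the
    "predetermined range" of candidates; ties keep the first candidate found.
    """
    best = None
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            if not fits(past, x0, y0, size, -(period - s) * u, -(period - s) * v):
                continue
            if not fits(future, x0, y0, size, s * u, s * v):
                continue
            cost = e1(past, future, x0, y0, size, s, u, v, period)
            if best is None or cost < best[0]:
                best = (cost, u, v)
    if best is None:
        raise ValueError("no candidate keeps both regions inside the images")
    return best[1], best[2]
```

Note that the block of interest itself never enters the computation: only its position (x0, y0), its size, and s are used, which is why the same search can be repeated on the restoring side.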

The vector (U,V) representing the positional relationship of the regions with the highest correlation in the past reference image V_P and the future reference image V_F is hereinafter referred to, as appropriate, as the positional relationship vector (U,V).

After detecting the positional relationship vector (U,V), the difference data calculation unit 234 obtains the estimate of the block of interest from the image data of the past reference image V_P and the future reference image V_F in the positional relationship that the vector (U,V) represents.

That is, the difference data calculation unit 234 finds the region on the past reference image V_P that has the same size as the block of interest and is displaced from it by the vector (-(4-s)U,-(4-s)V), and the region on the future reference image V_F that has the same size as the block of interest and is displaced from it by the vector (sU,sV), and takes the average of the image data of these two regions as the estimate of the block of interest.

The difference data calculation unit 234 then computes the difference between the block of interest and its estimate and outputs it as the first difference data, which is the compression result of the block of interest.

Such first difference data can be restored to the original block of interest without the motion vector (-(4-s)U,-(4-s)V) from the block of interest to the past reference image V_P and without the motion vector (sU,sV) from the block of interest to the future reference image V_F (that is, even though no motion vector is attached to the first difference data).

For example, as shown in FIG. 85, when a block B₈₅₀₁ on the target image V_T is taken as the block of interest and is to be restored, the correlation information e₁(U',V') of equation (9) is computed for block B₈₅₀₁ for every conceivable (U',V').

Specifically, as shown in FIG. 85, the correlation information e₁(U₁',V₁') is computed with (U',V')=(U₁',V₁'). Further, as shown in FIG. 86, the correlation information e₁(U₂',V₂') is computed with (U',V')=(U₂',V₂').

In FIG. 85, setting (U',V')=(U₁',V₁') makes the region Ip of the past reference image V_P used in computing e₁(U₁',V₁') the region R₈₅₀₁, which contains a certain part of the moving subject P₈₃₀₂. Likewise, the region If of the future reference image V_F used in the computation is the region R₈₅₀₂, which contains the same part of the moving subject P₈₃₀₂. Because the correlation between regions R₈₅₀₁ and R₈₅₀₂ is high, the value of e₁(U₁',V₁') is small. In FIG. 85, region R₈₅₀₁ of V_P is displaced from the block of interest B₈₅₀₁ by the vector (-(4-s)U₁',-(4-s)V₁'), and the pixel values Ip(x-(4-s)U₁', y-(4-s)V₁') of the pixels in region R₈₅₀₁ are used to compute e₁(U₁',V₁') of equation (9). Region R₈₅₀₂ of V_F is displaced from block B₈₅₀₁ by the vector (sU₁',sV₁'), and the pixel values If(x+sU₁', y+sV₁') in region R₈₅₀₂ are used in the same computation.

In FIG. 86, on the other hand, setting (U',V')=(U₂',V₂') makes the region Ip of the past reference image V_P used in computing e₁(U₂',V₂') the region R₈₆₀₁, which contains a certain part of the stationary subject P₈₃₀₁, while the region If of the future reference image V_F used in the computation is the region R₈₆₀₂, which contains a different part of the stationary subject P₈₃₀₁. The correlation between regions R₈₆₀₁ and R₈₆₀₂ is not very high, so the value of e₁(U₂',V₂') is not very small. In FIG. 86, region R₈₆₀₁ of V_P is displaced from the block of interest B₈₅₀₁ by the vector (-(4-s)U₂',-(4-s)V₂'), and the pixel values Ip(x-(4-s)U₂', y-(4-s)V₂') of the pixels in region R₈₆₀₁ are used to compute e₁(U₂',V₂') of equation (9). Region R₈₆₀₂ of V_F is displaced from block B₈₅₀₁ by the vector (sU₂',sV₂'), and the pixel values If(x+sU₂', y+sV₂') in region R₈₆₀₂ are used in the same computation.

Now suppose that the (U',V') minimizing the correlation information e₁(U',V') computed for the block of interest B₈₅₀₁ is (U₁',V₁') shown in FIG. 85. In this case, if block B₈₅₀₁ is the block B₈₄₀₁ shown in FIG. 84, regions R₈₅₀₁ and R₈₅₀₂ of FIG. 85 coincide with regions R₈₄₀₁ and R₈₄₀₂ of FIG. 84, respectively.

Therefore, the estimated value that was used when obtaining the first difference data can be obtained from the regions R₈₅₀₁ and R₈₅₀₂ in FIG. 85, which are identical to the regions R₈₄₀₁ and R₈₄₀₂ in FIG. 84, respectively, and the block of interest B₈₅₀₁ (B₈₄₀₁) can be restored by adding that estimated value to the first difference data.

The method described above for restoring the first difference data to the original block is an entirely new method not previously found in the field of image compression. It can be used, for example, when restoring high-frame-rate moving image data such as 240 fps moving image data from low-frame-rate moving image data such as 60 fps moving image data. The restoration method is summarized as follows.

That is, attention is paid to one image (frame) in the high-frame-rate moving image data to be restored. A plurality of (for example, two) "images constituting the low-frame-rate moving image data" that are temporally close to this "image to be restored" are extracted, the portions of the extracted images that are in a highly correlated positional relationship with one another are found, and the "image to be restored" is restored from those portions.
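
The restoration summarized above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the implementation of the apparatus: images are lists of rows indexed as `image[y][x]`, Expression (9) is assumed to be a sum of absolute differences between the past and future regions (its exact form is not reproduced in this passage), the estimate is assumed to be an integer average rounded down, and the function names and the `search` range are hypothetical.

```python
def e1(past, future, x0, y0, size, u, v, s):
    """Assumed form of Expression (9): sum of absolute differences between the
    past region and the future region for one candidate (u, v).
    Returns None when either region leaves the image."""
    h, w = len(past), len(past[0])
    px, py = x0 - (4 - s) * u, y0 - (4 - s) * v   # region in the past reference
    fx, fy = x0 + s * u, y0 + s * v               # region in the future reference
    for rx, ry in ((px, py), (fx, fy)):
        if rx < 0 or ry < 0 or rx + size > w or ry + size > h:
            return None
    return sum(abs(past[py + dy][px + dx] - future[fy + dy][fx + dx])
               for dy in range(size) for dx in range(size))


def restore_first_difference(past, future, residual, x0, y0, s, search=2):
    """Restore one block from its first difference data. No motion vector is
    received: the decoder searches for the candidate that minimizes e1."""
    size = len(residual)
    best_cost, best_vec = None, None
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            cost = e1(past, future, x0, y0, size, u, v, s)
            if cost is not None and (best_cost is None or cost < best_cost):
                best_cost, best_vec = cost, (u, v)
    u, v = best_vec
    px, py = x0 - (4 - s) * u, y0 - (4 - s) * v
    fx, fy = x0 + s * u, y0 + s * v
    # Estimate = average of the two regions (floor rounding is an assumption),
    # then add the received difference data.
    return [[residual[dy][dx]
             + (past[py + dy][px + dx] + future[fy + dy][fx + dx]) // 2
             for dx in range(size)] for dy in range(size)]
```

Because the search uses only the two reference images, the compressing side can run the same search and arrive at the same estimate, provided both sides see identical reference data.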

Note that the method of restoring the first difference data differs from, for example, the decoding of B pictures used in MPEG2 and the like.

That is, if moving image data composed only of I pictures and P pictures is regarded as "low-frame-rate moving image data", and moving image data to which B pictures are added is regarded as "high-frame-rate moving image data", the method of restoring the first difference data may appear, at first glance, to be the same as the method of decoding B pictures.

However, a B picture in MPEG2 explicitly holds, for each block (macroblock) in the B picture, information called a "motion vector" that indicates in which part of an I picture or P picture a texture similar to that block exists.

In contrast, the first difference data holds no such "motion vector" information. Moreover, the first difference data is restored by searching for (detecting) the positional relationship of highly correlated regions in a plurality of frames of the low-frame-rate moving image data, for example, the past reference image V_P and the future reference image V_F, and using the regions in that positional relationship.

Thus, the method of restoring the first difference data differs greatly from the B-picture decoding method in that no "motion vector" information is held, and in that restoration is performed by detecting the positional relationship of highly correlated regions in a plurality of frames of the low-frame-rate moving image data and using the regions in that positional relationship.

Next, the second difference data obtained by the difference data calculation unit 235 in FIG. 81 will be described.

For the block of interest, the difference data calculation unit 235 detects one motion vector representing a positional relationship in which the past reference image V_P and the future reference image V_F are highly correlated with the block of interest, and obtains an estimated value of the block of interest from the image data of the regions of V_P and V_F whose positional relationship with the block of interest is the one derived from that single motion vector. The difference data calculation unit 235 then outputs the difference between the block of interest and its estimated value as the second difference data.

That is, FIG. 87 shows (frames of) the target image V_T, the past reference image V_P, and the future reference image V_F onto which the subjects P₈₃₀₁ and P₈₃₀₂ shown in FIG. 83 are projected.

In FIG. 87, for example, the subject P₈₃₀₂ is moving at a constant speed, as in FIG. 84. In FIG. 87, however, the subject P₈₃₀₂ is moving slightly faster than in FIG. 84. In FIG. 87 as well, the motion vector of the subject P₈₃₀₂, that is, the vector representing the amount of movement of the subject P₈₃₀₂ in the spatial directions x and y per 1/240 second, which is the frame period of the 240 fps moving image data, is assumed to be (U,V).

Even when the subject P₈₃₀₂ is moving at a constant speed slightly faster than in FIG. 84, the subject P₈₃₀₂ is present in all of the target image V_T, the past reference image V_P, and the future reference image V_F. Furthermore, the subject P₈₃₀₂ on the past reference image V_P is at a position shifted from the subject P₈₃₀₂ on the target image V_T by the vector (-(4-s)U,-(4-s)V), and the subject P₈₃₀₂ on the future reference image V_F is at a position shifted from the subject P₈₃₀₂ on the target image V_T by the vector (sU,sV).

Accordingly, when a block B₈₇₀₁ in the target image V_T that contains, for example, only the subject P₈₃₀₂ is taken as the block of interest, the image data of the region R₈₇₀₁ of the past reference image V_P, which has the same size as the block of interest B₈₇₀₁ and is shifted from it by the vector (-(4-s)U,-(4-s)V), is (ideally) identical to the block of interest B₈₇₀₁. Likewise, in the future reference image V_F, the image data of the region R₈₇₀₂, which has the same size as the block of interest B₈₇₀₁ and is shifted from it by the vector (sU,sV), is (ideally) identical to the block of interest B₈₇₀₁.

That is, the image data of the region R₈₇₀₁ of the past reference image V_P, the block of interest B₈₇₀₁, and the region R₈₇₀₂ of the future reference image V_F are (ideally) identical.

The difference data calculation unit 235 (FIG. 81) obtains second difference data suited to the case shown in FIG. 87 as described above.

That is, the difference data calculation unit 235 (FIG. 81) also computes, for example, as in the difference data calculation unit 234, the average of the image data of the region R₈₇₀₁ of the past reference image V_P and the region R₈₇₀₂ of the future reference image V_F, takes that average as the estimated value of the block of interest B₈₇₀₁, and computes the difference between the block of interest B₈₇₀₁ and the estimated value. Because this difference is almost 0, the data amount of the block of interest B₈₇₀₁ can be reduced. The difference data calculation unit 235 outputs this difference between the block of interest B₈₇₀₁ and its estimated value as the second difference data, which is the compression result of the block of interest B₈₇₀₁.

Incidentally, to restore the second difference data to the original block of interest B₈₇₀₁, two motion vectors are needed, as in the case of restoring the first difference data described above: the vector (-(4-s)U,-(4-s)V), which is the motion vector from the block of interest B₈₇₀₁ to the past reference image V_P, and the vector (sU,sV), which is the motion vector from the block of interest B₈₇₀₁ to the future reference image V_F.

However, if the two motion vectors (-(4-s)U,-(4-s)V) and (sU,sV) are appended to the second difference data, the data amount increases by the amount of those two motion vectors.

Therefore, the difference data calculation unit 235 obtains second difference data that can be restored with only one motion vector, by finding the regions R₈₇₀₁ and R₈₇₀₂ (the motion vectors to them) used to obtain the estimated value of the block of interest B₈₇₀₁ as follows.

That is, as described above, it is assumed here that the subject P₈₃₀₂ is moving at a constant speed. Consequently, the region R₈₇₀₁ on the past reference image V_P shifted from the block of interest B₈₇₀₁ by the motion vector (-(4-s)U,-(4-s)V), the region R₈₇₀₂ on the future reference image V_F shifted from the block of interest B₈₇₀₁ by the motion vector (sU,sV), and the block of interest B₈₇₀₁ itself (their corresponding pixels) lie on a single straight line in space-time, as shown in FIG. 87.

Accordingly, the region R₈₇₀₁ on the past reference image V_P shifted from the block of interest B₈₇₀₁ by the motion vector (-(4-s)U,-(4-s)V) and the region R₈₇₀₂ on the future reference image V_F shifted from the block of interest B₈₇₀₁ by the motion vector (sU,sV) both lie, in space-time, on a straight line passing through the block of interest B₈₇₀₁.

Further, since the image data of the block of interest B₈₇₀₁ and of the region R₈₇₀₁ on the past reference image V_P are (ideally) identical as described above, the correlation between the block of interest B₈₇₀₁ and the region R₈₇₀₁ is the highest among the correlations between the block of interest B₈₇₀₁ and regions on the past reference image V_P. Similarly, since the image data of the block of interest B₈₇₀₁ and of the region R₈₇₀₂ on the future reference image V_F are also (ideally) identical, the correlation between the block of interest B₈₇₀₁ and the region R₈₇₀₂ is the highest among the correlations between the block of interest B₈₇₀₁ and regions on the future reference image V_F.

Therefore, if the region having the highest correlation with the block of interest B₈₇₀₁ is detected in each of the past reference image V_P and the future reference image V_F, the detected region of the past reference image V_P and the detected region of the future reference image V_F are the regions R₈₇₀₁ and R₈₇₀₂, respectively.

Now, the region R₈₇₀₁ on the past reference image V_P is at a position shifted from the block of interest B₈₇₀₁ by the motion vector (-(4-s)U,-(4-s)V), and the region R₈₇₀₂ on the future reference image V_F is at a position shifted from the block of interest B₈₇₀₁ by the motion vector (sU,sV).

Therefore, both the position of the region R₈₇₀₁ on the past reference image V_P and the position of the region R₈₇₀₂ on the future reference image V_F can be obtained from the single motion vector (U,V).

That is, even without a total of two motion vectors, namely the motion vector (-(4-s)U,-(4-s)V) from the block of interest B₈₇₀₁ to the region R₈₇₀₁ and the motion vector (sU,sV) from the block of interest B₈₇₀₁ to the region R₈₇₀₂, the regions R₈₇₀₁ and R₈₇₀₂ can be obtained from the single motion vector (U,V).
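
As a small illustration of this point, the two displacements can be derived from one vector as in the following Python sketch. The function name `reference_offsets` is hypothetical; the formulas are the ones given in the text, with s being the parameter that appears in the vectors (-(4-s)U,-(4-s)V) and (sU,sV).

```python
def reference_offsets(u, v, s):
    """Derive both region displacements from the single motion vector (u, v).

    Returns (past_offset, future_offset), each an (x, y) displacement
    relative to the block of interest.
    """
    past_offset = (-(4 - s) * u, -(4 - s) * v)   # toward the past reference image
    future_offset = (s * u, s * v)               # toward the future reference image
    return past_offset, future_offset
```

For example, with (U,V) = (2,-1) and s = 1, the past region is displaced by (-6, 3) and the future region by (2, -1), so only (U,V) needs to be stored.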

Using the principle described above, the difference data calculation unit 235 (FIG. 81) obtains second difference data that can be restored from one motion vector (U,V).

Specifically, the difference data calculation unit 235 obtains correlation information e₂(U',V') representing the correlation between the block of interest and each of the region Ip of the past reference image V_P and the region If of the future reference image V_F that lie on a straight line in space-time passing through the block of interest, for example, according to Expression (10).

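$$e_2(U',V') = \sum_{(x,y)} \Bigl\{ \bigl| \mathit{Ip}(x-(4-s)U',\, y-(4-s)V') - \mathit{Ic}(x,y) \bigr| + \bigl| \mathit{If}(x+sU',\, y+sV') - \mathit{Ic}(x,y) \bigr| \Bigr\} \qquad \cdots (10)$$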

Here, in Expression (10), (x,y) represents the position of a pixel of the block of interest in the target image V_T. Ip(x-(4-s)U', y-(4-s)V') represents the pixel value of the pixel at position (x-(4-s)U', y-(4-s)V') in the past reference image V_P, and If(x+sU', y+sV') represents the pixel value of the pixel at position (x+sU', y+sV') in the future reference image V_F. Ic(x,y) represents the pixel value of the pixel at position (x,y) in the target image V_T (the pixel value of a pixel of the block of interest). Σ in Expression (10) represents summation over all the pixels constituting the block of interest.

Note that the correlation information e₂(U',V') of Expression (10) is the total, over the block, of the sum of the absolute difference between the pixel values of the region Ip of the past reference image V_P and the block of interest and the absolute difference between the pixel values of the region If of the future reference image V_F and the block of interest. Accordingly, the higher the correlation between the region Ip and the block of interest and between the region If and the block of interest, the smaller the value of e₂(U',V').

The difference data calculation unit 235 calculates the correlation information e₂(U',V') of Expression (10) for all conceivable (U',V'). Here, all conceivable (U',V') are, for example, those (U',V') for which the position (x-(4-s)U', y-(4-s)V') lies on the past reference image V_P and the position (x+sU', y+sV') lies on the future reference image V_F. Alternatively, the calculation of e₂(U',V') of Expression (10) may be performed only for (U',V') within a predetermined range.
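
The calculation of e₂(U',V') and the search over candidates can be sketched in Python as below. This is a minimal sketch under assumptions: images are lists of rows indexed as `image[y][x]`, candidates are integer vectors within a hypothetical `search` range, and candidates whose regions fall outside either reference image are skipped, following the condition stated above. The function names are illustrative only.

```python
def e2(past, future, target, x0, y0, size, u, v, s):
    """Correlation information of Expression (10) for one candidate (u, v):
    the sum over the block of |Ip - Ic| + |If - Ic|.
    Returns None if either region leaves the image."""
    h, w = len(target), len(target[0])
    px, py = x0 - (4 - s) * u, y0 - (4 - s) * v   # top-left of region Ip
    fx, fy = x0 + s * u, y0 + s * v               # top-left of region If
    for rx, ry in ((px, py), (fx, fy)):
        if rx < 0 or ry < 0 or rx + size > w or ry + size > h:
            return None
    total = 0
    for dy in range(size):
        for dx in range(size):
            ic = target[y0 + dy][x0 + dx]
            total += abs(past[py + dy][px + dx] - ic)
            total += abs(future[fy + dy][fx + dx] - ic)
    return total


def max_correlation_vector(past, future, target, x0, y0, size, s, search=2):
    """Return ((u, v), cost) for the candidate that minimizes e2."""
    best_cost, best_vec = None, None
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            cost = e2(past, future, target, x0, y0, size, u, v, s)
            if cost is not None and (best_cost is None or cost < best_cost):
                best_cost, best_vec = cost, (u, v)
    return best_vec, best_cost
```

Unlike the search for the first difference data, this search uses the block of interest itself, so it can only be run on the compressing side; the resulting vector therefore has to accompany the difference data.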

That is, for example, as shown in FIG. 88, the difference data calculation unit 235 sets (U',V') = (U₁',V₁') and calculates the correlation information e₂(U₁',V₁') of Expression (10). Further, for example, as shown in FIG. 89, the difference data calculation unit 235 sets (U',V') = (U₂',V₂') and calculates the correlation information e₂(U₂',V₂') of Expression (10).

Here, in FIG. 88, setting (U',V') = (U₁',V₁') makes the region Ip of the past reference image V_P used in calculating the correlation information e₂(U₁',V₁') the region R₈₈₀₁, which is identical to the part of the moving subject P₈₃₀₂ contained in the block of interest B₈₇₀₁. Further, in FIG. 88, the region If of the future reference image V_F used in calculating e₂(U₁',V₁') is the region R₈₈₀₂, which is likewise identical to the part of the moving subject P₈₃₀₂ contained in the block of interest. Accordingly, the correlation between the block of interest B₈₇₀₁ and each of the regions R₈₈₀₁ and R₈₈₀₂ is high, so the value of e₂(U₁',V₁') is small. Note that, in FIG. 88, the region R₈₈₀₁ of the past reference image V_P is located at a position shifted from the block of interest B₈₇₀₁ by the vector (-(4-s)U₁',-(4-s)V₁'), and the pixel values Ip(x-(4-s)U₁', y-(4-s)V₁') of the pixels in R₈₈₀₁ are used to calculate e₂(U₁',V₁') of Expression (10). Likewise, the region R₈₈₀₂ of the future reference image V_F is located at a position shifted from the block of interest B₈₇₀₁ by the vector (sU₁',sV₁'), and the pixel values If(x+sU₁', y+sV₁') in R₈₈₀₂ are used to calculate e₂(U₁',V₁') of Expression (10).

On the other hand, in FIG. 89, setting (U',V') = (U₂',V₂') makes the region Ip of the past reference image V_P used in calculating the correlation information e₂(U₂',V₂') the region R₈₉₀₁, which is one part of the stationary subject P₈₃₀₁. Further, in FIG. 89, the region If of the future reference image V_F used in calculating e₂(U₂',V₂') is the region R₈₉₀₂, which is the same part of the stationary subject P₈₃₀₁. The block of interest B₈₇₀₁ is a part of the moving subject P₈₃₀₂; therefore, the correlation between the block of interest B₈₇₀₁ and each of the regions R₈₉₀₁ and R₈₉₀₂ is not very high, and the value of e₂(U₂',V₂') does not become very small. Note that, in FIG. 89, the region R₈₉₀₁ of the past reference image V_P is located at a position shifted from the block of interest B₈₇₀₁ by the vector (-(4-s)U₂',-(4-s)V₂'), and the pixel values Ip(x-(4-s)U₂', y-(4-s)V₂') of the pixels in R₈₉₀₁ are used to calculate e₂(U₂',V₂') of Expression (10). Likewise, the region R₈₉₀₂ of the future reference image V_F is located at a position shifted from the block of interest B₈₇₀₁ by the vector (sU₂',sV₂'), and the pixel values If(x+sU₂', y+sV₂') in R₈₉₀₂ are used to calculate e₂(U₂',V₂') of Expression (10). In FIG. 89, however, (U₂',V₂') = (0,0).

Now, if the (U',V') that minimizes the correlation information e₂(U',V') calculated for the block of interest B₈₇₀₁ is (U₁',V₁') shown in FIG. 88, the regions R₈₈₀₁ and R₈₈₀₂ in FIG. 88 coincide with the regions R₈₇₀₁ and R₈₇₀₂ in FIG. 87, respectively.

Then, as described above, the positions of the regions R₈₇₀₁ and R₈₇₀₂ can be obtained, even without a total of two motion vectors, namely the motion vector (-(4-s)U,-(4-s)V) from the block of interest B₈₇₀₁ to the region R₈₇₀₁ and the motion vector (sU,sV) from the block of interest B₈₇₀₁ to the region R₈₇₀₂, from the single motion vector (U,V) that is the (U',V') minimizing the correlation information e₂(U',V') calculated for the block of interest B₈₇₀₁.

If this motion vector (U,V) is called the maximum-correlation vector (U,V), the difference data calculation unit 235 detects the maximum-correlation vector (U,V), which is the (U',V') that minimizes the correlation information e₂(U',V') of Expression (10), and then obtains the estimated value of the block of interest from the image data of the past reference image V_P and the future reference image V_F whose positional relationship with the block of interest is the one derived from the maximum-correlation vector (U,V).

That is, the difference data calculation unit 235 finds the region on the past reference image V_P that has the same size as the block of interest and is shifted from it by the vector (-(4-s)U,-(4-s)V), and the region on the future reference image V_F that has the same size as the block of interest and is shifted from it by the vector (sU,sV), and obtains the average of the image data of the region of V_P and the region of V_F as the estimated value of the block of interest.

Then, the difference data calculation unit 235 computes the difference between the block of interest and its estimated value, takes the result as the second difference data, and outputs the second difference data with the maximum-correlation vector (U,V) appended to it.
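
The estimation and differencing steps just described can be sketched in Python as follows. This is an illustrative sketch, not the actual unit 235: it assumes the maximum-correlation vector has already been detected and is passed in as `vec`, images are lists of rows indexed as `image[y][x]`, the average is an integer average rounded down (the rounding rule is not stated in the text), and the dictionary layout of the output is hypothetical.

```python
def estimate_block(past, future, x0, y0, size, vec, s):
    """Average of the past and future regions derived from one vector (u, v)."""
    u, v = vec
    px, py = x0 - (4 - s) * u, y0 - (4 - s) * v   # region on the past reference
    fx, fy = x0 + s * u, y0 + s * v               # region on the future reference
    return [[(past[py + dy][px + dx] + future[fy + dy][fx + dx]) // 2
             for dx in range(size)] for dy in range(size)]


def encode_second_difference(past, future, target, x0, y0, size, vec, s):
    """Second difference data: block minus estimate, with the vector appended."""
    est = estimate_block(past, future, x0, y0, size, vec, s)
    residual = [[target[y0 + dy][x0 + dx] - est[dy][dx]
                 for dx in range(size)] for dy in range(size)]
    return {"vector": vec, "residual": residual}
```

When the two regions closely match the block of interest, the residual entries are at or near 0, which is what makes the block cheap to represent.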

Such second difference data can be restored to the original block of interest using the maximum-correlation vector (U,V), which is the single motion vector appended to it, even without the two motion vectors, namely the motion vector (-(4-s)U,-(4-s)V) from the block of interest to the past reference image V_P and the motion vector (sU,sV) from the block of interest to the future reference image V_F (that is, even if the two motion vectors are not appended to the second difference data).

That is, as when obtaining the second difference data, the estimated value of the block of interest is first obtained from the image data of the past reference image V_P and the future reference image V_F whose positional relationship with the block of interest is the one derived from the maximum-correlation vector (U,V) appended to the second difference data. Specifically, the region on the past reference image V_P that has the same size as the block of interest and is shifted from it by the vector (-(4-s)U,-(4-s)V), and the region on the future reference image V_F that has the same size as the block of interest and is shifted from it by the vector (sU,sV), are found, and the average of the image data of the region of V_P and the region of V_F is obtained as the estimated value of the block of interest.

Then, the block of interest can be restored by adding the second difference data and the estimated value of the block of interest.
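
The restoration of the second difference data can be sketched in Python as below. As before, this is a sketch under assumptions: images are lists of rows indexed as `image[y][x]`, the estimate is an integer average rounded down (which must match whatever rule was used at compression), and the `coded` dictionary with keys `"vector"` and `"residual"` is a hypothetical container for the difference data and its appended vector.

```python
def decode_second_difference(past, future, coded, x0, y0, s):
    """Restore one block from its second difference data and appended vector."""
    u, v = coded["vector"]
    residual = coded["residual"]
    size = len(residual)
    px, py = x0 - (4 - s) * u, y0 - (4 - s) * v   # region on the past reference
    fx, fy = x0 + s * u, y0 + s * v               # region on the future reference
    # No search is needed here: the appended vector fixes both regions.
    return [[residual[dy][dx]
             + (past[py + dy][px + dx] + future[fy + dy][fx + dx]) // 2
             for dx in range(size)] for dy in range(size)]
```

Since the vector is read from the data rather than searched for, the restoring side always uses the same two regions as the compressing side.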

As described above, since the second difference data can be restored as long as one maximum-correlation vector (U,V) is available, the increase in the data amount can be suppressed compared with the case of appending the two motion vectors, namely the motion vector (-(4-s)U,-(4-s)V) from the block of interest to the past reference image V_P and the motion vector (sU,sV) from the block of interest to the future reference image V_F.

For the first and second difference data, only one of them may be obtained and used as the compression result of a block. However, obtaining both the first and second difference data and using the more appropriate one as the compression result of the block has the following advantages.

That is, because the maximum-correlation vector (U,V) is appended to the second difference data, the first difference data, to which no motion vector is appended, is superior in terms of that portion of the data amount.

However, the data amount of the first difference data may become large, and in that case the data amount of the second difference data, even including the data amount of the maximum-correlation vector (U,V), may be smaller than the data amount of the first difference data.

That is, for example, the region R₈₈₀₁ on the past reference image V_P and the region R₈₈₀₂ on the future reference image V_F shown in FIG. 88 are images of parts of the moving subject P₈₃₀₂. If the regions R₈₈₀₁ and R₈₈₀₂ are images of the same part of the moving subject P₈₃₀₂, the correlation between R₈₈₀₁ and R₈₈₀₂ represented by the correlation information e₁(U₁',V₁') of Expression (9) calculated for the block of interest B₈₇₀₁ is high.

On the other hand, for example, the region R₈₉₀₁ of the past reference image V_P and the region R₈₉₀₂ of the future reference image V_F shown in FIG. 89 are images of parts of the stationary subject P₈₃₀₁. If the regions R₈₉₀₁ and R₈₉₀₂ are images of the same part of the stationary subject P₈₃₀₁, the correlation between R₈₉₀₁ and R₈₉₀₂ represented by the correlation information e₁(U₂',V₂') of Expression (9) is also high.

In other words, the value of the correlation information e₁(U₁',V₁') of Expression (9) representing the correlation between the regions R₈₈₀₁ and R₈₈₀₂ in FIG. 88 and the value of the correlation information e₁(U₂',V₂') of Expression (9) representing the correlation between the regions R₈₉₀₁ and R₈₉₀₂ in FIG. 89 are both ideally 0, and hence equal.
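
This tie can be reproduced with a toy scene, as in the following Python sketch. Everything here is an assumption made for illustration: Expression (9) is taken to be a sum of absolute differences between the past and future regions, the scene is a flat background with a small textured object moving one pixel per 1/240 second, s = 2, and the helper names `e1` and `frame` are hypothetical. No bounds checking is done because all indices used stay inside the images.

```python
def e1(past, future, x0, y0, size, u, v, s):
    """Assumed form of Expression (9): SAD between the past and future regions."""
    px, py = x0 - (4 - s) * u, y0 - (4 - s) * v
    fx, fy = x0 + s * u, y0 + s * v
    return sum(abs(past[py + dy][px + dx] - future[fy + dy][fx + dx])
               for dy in range(size) for dx in range(size))


def frame(x_obj, width=16, height=8):
    """Flat background (value 0) with a 2x2 object whose left edge is at x_obj."""
    img = [[0] * width for _ in range(height)]
    img[2][x_obj], img[2][x_obj + 1] = 10, 20
    img[3][x_obj], img[3][x_obj + 1] = 30, 40
    return img


s = 2
past, future = frame(4), frame(8)   # in the target frame the object would sit at x = 6

# Candidate (1, 0): both regions lie on the moving object.
moving = e1(past, future, 6, 2, 2, 1, 0, s)
# Candidate (0, 0): both regions lie on the stationary background.
stationary = e1(past, future, 6, 2, 2, 0, 0, s)
```

In this noise-free scene both `moving` and `stationary` evaluate to 0, so a search based only on the two reference images has no basis for preferring the correct candidate.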

In practice, however, due to the influence of noise and the like, the value of the correlation information e₁(U₁',V₁') for the regions R₈₈₀₁ and R₈₈₀₂ in FIG. 88 may be the smaller of the two, or conversely the value of the correlation information e₁(U₂',V₂') for the regions R₈₉₀₁ and R₈₉₀₂ in FIG. 89 may be the smaller.

When the value of the correlation information e₁(U₂',V₂') for the regions R₈₉₀₁ and R₈₉₀₂ in FIG. 89 is the smaller, the estimated value of the block of interest B₈₇₀₁ is obtained from the regions R₈₉₀₁ and R₈₉₀₂.

That is, the estimated value of the block of interest B₈₇₀₁, which is a part of the moving subject P₈₃₀₂, is obtained from the regions R₈₉₀₁ and R₈₉₀₂, which are parts of the stationary subject P₈₃₀₁.

The estimated value of the block of interest B₈₇₀₁ obtained from the regions R₈₉₀₁ and R₈₉₀₂ of the stationary subject P₈₃₀₁ differs greatly from (the image data of) the block of interest B₈₇₀₁. As a result, the difference between the block of interest B₈₇₀₁ and its estimated value, that is, the first difference data, is not 0 or close to 0 but a large value, and the data amount is hardly reduced.

In this case, the data amount of the second difference data, even including the data amount of the maximum-correlation vector (U,V), is smaller than that of the first difference data.

Furthermore, because of noise and similar effects, one of the two values, e₁(U₁',V₁') for regions R₈₈₀₁ and R₈₈₀₂ in FIG. 88 or e₁(U₂',V₂') for regions R₈₉₀₁ and R₈₉₀₂ in FIG. 89, may be the smaller when the block of interest B₈₇₀₁ is compressed into the first difference data, while the other is the smaller when that first difference data is restored.

In this case, the first difference data is restored using an estimated value different from the one used when the block of interest B₈₇₀₁ was compressed into the first difference data, so the original block of interest B₈₇₀₁ cannot be restored.

Accordingly, if, of the first and second difference data, only the second difference data, to which the maximum-correlation vector (U,V) is attached, is adopted for the block of interest B₈₇₀₁, both problems described above can be prevented: the data amount being hardly reduced, and the original block of interest B₈₇₀₁ becoming unrestorable.

However, when only the second difference data is adopted, the data amount always increases by the size of the maximum-correlation vector (U,V).

Therefore, both the first and second difference data are obtained for the block of interest B₈₇₀₁ and, for example, the one with the smaller data amount is selected as the final compression result of the block of interest B₈₇₀₁. This reduces the data amount while preventing the block of interest B₈₇₀₁ from becoming unrestorable.

In addition, for example, when the difference between the minimum value and the second-smallest value of the correlation information e₁(U',V') of Expression (9), calculated when the first difference data is obtained, is negligible (for example, at or below a certain threshold), the second difference data is selected as the final compression result of the block of interest B₈₇₀₁ even if the first difference data has the smaller data amount. This more reliably prevents the original block of interest B₈₇₀₁ from becoming unrestorable.
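The two selection rules above can be sketched as follows. This is a minimal illustration rather than the implementation described in this document: the function name `choose_difference_data`, its parameters, and the use of bit counts as the measure of data amount are all assumptions introduced here.

```python
def choose_difference_data(first_bits, second_bits, vector_bits,
                           e1_values, ambiguity_threshold):
    """Pick "first" or "second" as the final compression result of a block.

    All names are hypothetical. first_bits and second_bits are the sizes of
    the first and second difference data, vector_bits is the size of the
    maximum-correlation vector (U, V) attached to the second difference data,
    and e1_values holds e1(U', V') for every candidate (U', V').
    """
    ordered = sorted(e1_values)
    # If the smallest and second-smallest e1 are nearly tied, noise could make
    # the restoring side pick a different candidate, so use the second data.
    if len(ordered) >= 2 and ordered[1] - ordered[0] <= ambiguity_threshold:
        return "second"
    # Otherwise keep whichever representation is smaller overall.
    if first_bits < second_bits + vector_bits:
        return "first"
    return "second"
```

Checking the ambiguity rule first means a small first difference data never overrides the safety condition, which matches the order of priority stated in the text.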

Next, the third difference data obtained by the difference data calculation unit 236 in FIG. 81 will be described.

For the block of interest, the difference data calculation unit 236 obtains two motion vectors in total: a motion vector representing the positional relationship that has a high correlation with the block of interest in the past reference image V_P, and a motion vector representing the positional relationship that has a high correlation with the block of interest in the future reference image V_F.

Hereinafter, where appropriate, the motion vector representing the highly correlated positional relationship in the past reference image V_P is referred to as the forward motion vector, and the motion vector representing the highly correlated positional relationship in the future reference image V_F is referred to as the backward motion vector.

The difference data calculation unit 236 obtains the estimated value of the block of interest from the image data of the region of the past reference image V_P whose position relative to the block of interest is given by the forward motion vector and the image data of the region of the future reference image V_F whose position relative to the block of interest is given by the backward motion vector. It then outputs the difference between the block of interest and its estimated value as the third difference data.

FIG. 90 shows (frames of) the target image V_T, the past reference image V_P, and the future reference image V_F onto which the subjects P₈₃₀₁ and P₈₃₀₂ shown in FIG. 83 are projected.

In FIG. 90, for example, the subject P₈₃₀₂ moves at a non-constant speed. Specifically, it moves slowly between the time of the past reference image V_P and the time of the target image V_T, and quickly between the time of the target image V_T and the time of the future reference image V_F.

Now consider, in the target image V_T, a block B₉₀₀₁ that contains only the subject P₈₃₀₂ as the block of interest. In the past reference image V_P of FIG. 90, the image data of the region R₉₀₀₁, which has the same size as the block of interest B₉₀₀₁ and is displaced from it by the vector (-U₁,-V₁), is (ideally) identical to that of the block of interest B₉₀₀₁. Likewise, in the future reference image V_F, the image data of the region R₉₀₀₂, which has the same size as the block of interest B₉₀₀₁ and is displaced from it by the vector (U₂,V₂), is (ideally) identical to that of the block of interest B₉₀₀₁. The vector (-U₁,-V₁) is therefore the forward motion vector, and the vector (U₂,V₂) is the backward motion vector.

The difference data calculation unit 236 (FIG. 81) obtains third difference data suited to the case shown in FIG. 90.

Specifically, the difference data calculation unit 236 (FIG. 81) computes, for example, the average of the image data of region R₉₀₀₁ of the past reference image V_P and region R₉₀₀₂ of the future reference image V_F, takes that average as the estimated value of the block of interest B₉₀₀₁, and computes the difference between the block of interest B₉₀₀₁ and the estimated value. Because this difference is almost 0, the data amount of the block of interest B₉₀₀₁ can be reduced. The difference data calculation unit 236 outputs this difference as the third difference data, which is the compression result of the block of interest B₉₀₀₁.

The difference data calculation unit 236 obtains the forward motion vector (-U₁,-V₁), which represents the position of the region R₉₀₀₁ in the past reference image V_P whose image data is identical (most similar) to the block of interest B₉₀₀₁, and the backward motion vector (U₂,V₂), which represents the position of the region R₉₀₀₂ in the future reference image V_F whose image data is identical (most similar) to the block of interest B₉₀₀₁, as follows.

The difference data calculation unit 236 obtains correlation information e₃(U',V'), which represents the correlation between the block of interest and a region Ip of the past reference image V_P, according to, for example, Expression (11).

$$e_3(U',V') = \sum_{(x,y)} \left| \mathrm{Ic}(x,y) - \mathrm{Ip}(x+U',\,y+V') \right| \qquad \cdots (11)$$

In Expression (11), (x,y) denotes the position of a pixel of the block of interest in the target image V_T, and Ic(x,y) denotes the pixel value at position (x,y) in the target image V_T (the pixel value of a pixel of the block of interest). Ip(x+U',y+V') denotes the pixel value at position (x+U',y+V') in the past reference image V_P. The summation Σ in Expression (11) is taken over all pixels that make up the block of interest.

The correlation information e₃(U',V') of Expression (11) is the sum of the absolute differences between the pixel values of the block of interest and those of the region Ip of the past reference image V_P. Consequently, the higher the correlation between the block of interest and the region Ip of the past reference image V_P, the smaller the value of e₃(U',V').
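As an illustration of the sum of absolute differences described for Expression (11), the following sketch computes e₃(U',V') for a single displacement. The function name `correlation_e3`, the square block shape, the top-left coordinate (x0, y0), and the [y, x] array indexing are assumptions made for this example and do not come from this document.

```python
import numpy as np

def correlation_e3(target, past, x0, y0, size, du, dv):
    """Sum of absolute differences between a block and a displaced region.

    Hypothetical helper: the block of interest is the size-by-size square
    whose top-left pixel is (x0, y0) in `target`; (du, dv) plays the role of
    (U', V'). Arrays are indexed [y, x].
    """
    height, width = past.shape
    x, y = x0 + du, y0 + dv
    if x < 0 or y < 0 or x + size > width or y + size > height:
        raise ValueError("displaced region lies outside the reference image")
    # Widen to a signed type so that 8-bit pixel differences cannot wrap.
    block = target[y0:y0 + size, x0:x0 + size].astype(np.int64)
    region = past[y:y + size, x:x + size].astype(np.int64)
    return int(np.abs(block - region).sum())
```

A value of 0 means the displaced region is pixel-for-pixel identical to the block, which corresponds to the highest possible correlation.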

Further, the difference data calculation unit 236 obtains correlation information e₄(U',V'), which represents the correlation between the block of interest and a region If of the future reference image V_F, according to, for example, Expression (12).

$$e_4(U',V') = \sum_{(x,y)} \left| \mathrm{Ic}(x,y) - \mathrm{If}(x+U',\,y+V') \right| \qquad \cdots (12)$$

In Expression (12), (x,y) denotes the position of a pixel of the block of interest in the target image V_T, and Ic(x,y) denotes the pixel value at position (x,y) in the target image V_T (the pixel value of a pixel of the block of interest). If(x+U',y+V') denotes the pixel value at position (x+U',y+V') in the future reference image V_F. The summation Σ in Expression (12) is taken over all pixels that make up the block of interest.

The correlation information e₄(U',V') of Expression (12) is the sum of the absolute differences between the pixel values of the block of interest and those of the region If of the future reference image V_F. Consequently, the higher the correlation between the block of interest and the region If of the future reference image V_F, the smaller the value of e₄(U',V').

The difference data calculation unit 236 calculates the correlation information e₃(U',V') of Expression (11) and the correlation information e₄(U',V') of Expression (12) for all possible (U',V'). Here, all possible (U',V') means, for e₃(U',V') of Expression (11), every (U',V') for which the position (x+U',y+V') lies on the past reference image V_P, and, for e₄(U',V') of Expression (12), every (U',V') for which the position (x+U',y+V') lies on the future reference image V_F. Alternatively, e₃(U',V') of Expression (11) and e₄(U',V') of Expression (12) may be calculated only for (U',V') within a predetermined range.

The difference data calculation unit 236 detects the minimum value of the correlation information e₃(U',V') calculated for each (U',V'), and takes the (U',V') that gives this minimum as the forward motion vector (-U₁,-V₁). Likewise, it detects the minimum value of the correlation information e₄(U',V') calculated for each (U',V'), and takes the (U',V') that gives this minimum as the backward motion vector (U₂,V₂).
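The search just described can be sketched as an exhaustive scan over a bounded window, which corresponds to the "predetermined range" variant. The name `find_motion_vector`, the `search` radius, and the tie-breaking rule (first minimum found) are assumptions introduced for this example.

```python
import numpy as np

def find_motion_vector(target, reference, x0, y0, size, search):
    """Return ((du, dv), cost) minimising the sum of absolute differences.

    Hypothetical helper. The block of interest is the size-by-size square at
    top-left (x0, y0) in `target`; candidates (du, dv) are limited to
    [-search, search] and to regions that lie fully inside `reference`.
    """
    height, width = reference.shape
    block = target[y0:y0 + size, x0:x0 + size].astype(np.int64)
    best_vector, best_cost = None, None
    for dv in range(-search, search + 1):
        for du in range(-search, search + 1):
            x, y = x0 + du, y0 + dv
            if x < 0 or y < 0 or x + size > width or y + size > height:
                continue  # the displaced region would leave the image
            region = reference[y:y + size, x:x + size].astype(np.int64)
            cost = int(np.abs(block - region).sum())
            if best_cost is None or cost < best_cost:
                best_vector, best_cost = (du, dv), cost
    return best_vector, best_cost
```

Running the same function once against the past reference image and once against the future reference image yields the forward and backward motion vectors, respectively.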

The difference data calculation unit 236 obtains the estimated value of the block of interest from the image data of the region of the past reference image V_P whose position relative to the block of interest is given by the forward motion vector (-U₁,-V₁) and the image data of the region of the future reference image V_F whose position relative to the block of interest is given by the backward motion vector (U₂,V₂). It then takes the difference between the block of interest and its estimated value as the third difference data, attaches the forward motion vector (-U₁,-V₁) and the backward motion vector (U₂,V₂) to it, and outputs the result.

Such third difference data can be restored to the original block of interest by using the forward motion vector (-U₁,-V₁) and the backward motion vector (U₂,V₂) attached to it.

That is, as when the third difference data is obtained, the estimated value of the block of interest is first obtained from the image data of the past reference image V_P and the future reference image V_F at the positions that the forward motion vector (-U₁,-V₁) and the backward motion vector (U₂,V₂) attached to the third difference data indicate relative to the block of interest. Specifically, a region of the same size as the block of interest is located on the past reference image V_P at the position displaced from the block of interest by the forward motion vector (-U₁,-V₁), and another on the future reference image V_F at the position displaced from the block of interest by the backward motion vector (U₂,V₂). The average of the image data of these two regions is taken as the estimated value of the block of interest.

The block of interest is then restored by adding the third difference data to the estimated value of the block of interest.
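The compression and restoration of the third difference data can be sketched as a round trip. The function names, the tuple representation of the motion vectors, and the use of floor division for the average are assumptions of this example. The text only specifies "the average"; what matters for exact restoration is that both sides round the same way.

```python
import numpy as np

def displaced_region(image, x0, y0, size, vector):
    """Region of `image` displaced from the block at (x0, y0) by `vector`."""
    du, dv = vector
    return image[y0 + dv:y0 + dv + size, x0 + du:x0 + du + size].astype(np.int32)

def estimate_from_both(past, future, x0, y0, size, forward, backward):
    p = displaced_region(past, x0, y0, size, forward)
    f = displaced_region(future, x0, y0, size, backward)
    # Floor division is an assumption: any rounding rule works as long as the
    # compressing and restoring sides use the same one.
    return (p + f) // 2

def compress_third(target, past, future, x0, y0, size, forward, backward):
    """Return the third difference data with both motion vectors attached."""
    block = target[y0:y0 + size, x0:x0 + size].astype(np.int32)
    estimate = estimate_from_both(past, future, x0, y0, size, forward, backward)
    return block - estimate, forward, backward

def restore_third(difference, past, future, x0, y0, size, forward, backward):
    """Rebuild the block by adding the same estimate back to the difference."""
    estimate = estimate_from_both(past, future, x0, y0, size, forward, backward)
    return difference + estimate
```

Because the restoring side recomputes the estimate from the attached vectors instead of repeating the search, it cannot land on a different region than the compressing side did.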

Next, the fourth difference data obtained by the difference data calculation unit 237 in FIG. 81 will be described.

For the block of interest, the difference data calculation unit 237 obtains only the motion vector representing the positional relationship that has a high correlation with the block of interest in the past reference image V_P, that is, the forward motion vector (-U₁,-V₁). (The motion vector representing the positional relationship that has a high correlation with the block of interest in the future reference image V_F, that is, the backward motion vector (U₂,V₂), is not obtained.)

The difference data calculation unit 237 obtains the estimated value of the block of interest from the image data of the region of the past reference image V_P whose position relative to the block of interest is given by the forward motion vector (-U₁,-V₁). It then outputs the difference between the block of interest and its estimated value as the fourth difference data.

FIG. 91 shows (frames of) the target image V_T, the past reference image V_P, and the future reference image V_F onto which the subjects P₈₃₀₁ and P₈₃₀₂ shown in FIG. 83 are projected.

In FIG. 91, for example, the moving subject P₈₃₀₂ suddenly disappears. Specifically, the moving subject P₈₃₀₂ is present from the time of the past reference image V_P through the time of the target image V_T, but disappears between the time of the target image V_T and the time of the future reference image V_F.

Now consider, in the target image V_T, a block B₉₁₀₁ that contains only the subject P₈₃₀₂ as the block of interest. In the past reference image V_P of FIG. 91, the image data of the region R₉₁₀₁, which has the same size as the block of interest B₉₁₀₁ and is displaced from it by the vector (-U₁,-V₁), is (ideally) identical to that of the block of interest B₉₁₀₁. In the future reference image V_F, by contrast, no region identical (or similar) to the block of interest B₉₁₀₁ exists.

The difference data calculation unit 237 (FIG. 81) obtains fourth difference data suited to the case shown in FIG. 91.

Specifically, the difference data calculation unit 237 (FIG. 81) uses, for example, the image data of region R₉₁₀₁ of the past reference image V_P as is as the estimated value of the block of interest B₉₁₀₁, and computes the difference between the block of interest B₉₁₀₁ and the estimated value. Because this difference is almost 0, the data amount of the block of interest B₉₁₀₁ can be reduced. The difference data calculation unit 237 outputs this difference as the fourth difference data, which is the compression result of the block of interest B₉₁₀₁.

The difference data calculation unit 237 obtains the forward motion vector (-U₁,-V₁), which represents the position of the region R₉₁₀₁ in the past reference image V_P whose image data is identical (most similar) to the block of interest B₉₁₀₁, as follows.

The difference data calculation unit 237 obtains the correlation information e₃(U',V'), which represents the correlation between the block of interest and a region Ip of the past reference image V_P, according to, for example, Expression (11) above.

The difference data calculation unit 237 calculates the correlation information e₃(U',V') of Expression (11) for all possible (U',V'). It then detects the minimum value of the correlation information e₃(U',V') calculated for each (U',V'), and takes the (U',V') that gives this minimum as the forward motion vector (-U₁,-V₁).

The difference data calculation unit 237 obtains the estimated value of the block of interest from the image data of the region of the past reference image V_P whose position relative to the block of interest is given by the forward motion vector (-U₁,-V₁). It then takes the difference between the block of interest and its estimated value as the fourth difference data, attaches the forward motion vector (-U₁,-V₁) to it, and outputs the result.

Such fourth difference data can be restored to the original block of interest by using the forward motion vector (-U₁,-V₁) attached to it.

That is, as when the fourth difference data is obtained, the estimated value of the block of interest is first obtained from the image data of the past reference image V_P at the position that the forward motion vector (-U₁,-V₁) attached to the fourth difference data indicates relative to the block of interest. Specifically, a region of the same size as the block of interest is located on the past reference image V_P at the position displaced from the block of interest by the forward motion vector (-U₁,-V₁), and the image data of that region is used as is as the estimated value of the block of interest.

The block of interest is then restored by adding the fourth difference data to the estimated value of the block of interest.
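A minimal sketch of the fourth difference data follows, in which the estimate is simply the displaced region of the past reference image. The names `compress_fourth` and `restore_fourth`, the 8-bit pixel type, and the coordinate conventions are assumptions made for illustration.

```python
import numpy as np

def compress_fourth(target, past, x0, y0, size, forward):
    """Return the fourth difference data with the forward vector attached.

    Hypothetical helper: `forward` is the (du, dv) displacement that plays
    the role of the forward motion vector; arrays are indexed [y, x].
    """
    du, dv = forward
    block = target[y0:y0 + size, x0:x0 + size].astype(np.int32)
    # The displaced past region is used as is as the estimated value.
    estimate = past[y0 + dv:y0 + dv + size, x0 + du:x0 + du + size].astype(np.int32)
    return block - estimate, forward

def restore_fourth(difference, past, x0, y0, size, forward):
    """Rebuild the block from the difference and the attached vector."""
    du, dv = forward
    estimate = past[y0 + dv:y0 + dv + size, x0 + du:x0 + du + size].astype(np.int32)
    # Assumes 8-bit pixels, so the sum fits back into uint8.
    return (difference + estimate).astype(np.uint8)
```

The future reference image never appears in either function, so this mode keeps working when the subject is absent from that frame.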

Next, the fifth difference data obtained by the difference data calculation unit 238 in FIG. 81 will be described.

For the block of interest, the difference data calculation unit 238 obtains only the motion vector representing the positional relationship that has a high correlation with the block of interest in the future reference image V_F, that is, the backward motion vector (U₂,V₂). (The motion vector representing the positional relationship that has a high correlation with the block of interest in the past reference image V_P, that is, the forward motion vector (-U₁,-V₁), is not obtained.)

The difference data calculation unit 238 obtains the estimated value of the block of interest from the image data of the region of the future reference image V_F whose position relative to the block of interest is given by the backward motion vector (U₂,V₂). It then outputs the difference between the block of interest and its estimated value as the fifth difference data.

FIG. 92 shows (frames of) the target image V_T, the past reference image V_P, and the future reference image V_F onto which the subjects P₈₃₀₁ and P₈₃₀₂ shown in FIG. 83 are projected.

In FIG. 92, for example, the moving subject P₈₃₀₂ suddenly appears. Specifically, the moving subject P₈₃₀₂ is absent at the time of the past reference image V_P, appears at some point before the time of the target image V_T, and is still present at the time of the future reference image V_F.

Now consider, in the target image V_T, a block B₉₂₀₁ that contains only the subject P₈₃₀₂ as the block of interest. In the future reference image V_F of FIG. 92, the image data of the region R₉₂₀₂, which has the same size as the block of interest B₉₂₀₁ and is displaced from it by the vector (U₂,V₂), is (ideally) identical to that of the block of interest B₉₂₀₁. In the past reference image V_P, by contrast, no region identical (or similar) to the block of interest B₉₂₀₁ exists.

The difference data calculation unit 238 (FIG. 81) obtains fifth difference data suited to the case shown in FIG. 92.

Specifically, the difference data calculation unit 238 (FIG. 81) uses, for example, the image data of region R₉₂₀₂ of the future reference image V_F as is as the estimated value of the block of interest B₉₂₀₁, and computes the difference between the block of interest B₉₂₀₁ and the estimated value. Because this difference is almost 0, the data amount of the block of interest B₉₂₀₁ can be reduced. The difference data calculation unit 238 outputs this difference as the fifth difference data, which is the compression result of the block of interest B₉₂₀₁.

The difference data calculation unit 238 obtains the backward motion vector (U₂,V₂), which represents the position of the region R₉₂₀₂ in the future reference image V_F whose image data is identical (most similar) to the block of interest B₉₂₀₁, as follows.

The difference data calculation unit 238 obtains the correlation information e₄(U',V'), which represents the correlation between the block of interest and a region If of the future reference image V_F, according to, for example, Expression (12) above.

The difference data calculation unit 238 calculates the correlation information e₄(U',V') of Expression (12) for all possible (U',V'). It then detects the minimum value of the correlation information e₄(U',V') calculated for each (U',V'), and takes the (U',V') that gives this minimum as the backward motion vector (U₂,V₂).

The difference data calculation unit 238 obtains the estimated value of the block of interest from the image data of the region of the future reference image V_F whose position relative to the block of interest is given by the backward motion vector (U₂,V₂). It then takes the difference between the block of interest and its estimated value as the fifth difference data, attaches the backward motion vector (U₂,V₂) to it, and outputs the result.

Such fifth difference data can be restored to the original block of interest by using the backward motion vector (U₂,V₂) attached to it.

That is, as when the fifth difference data is obtained, the estimated value of the block of interest is first obtained from the image data of the future reference image V_F at the position that the backward motion vector (U₂,V₂) attached to the fifth difference data indicates relative to the block of interest. Specifically, a region of the same size as the block of interest is located on the future reference image V_F at the position displaced from the block of interest by the backward motion vector (U₂,V₂), and the image data of that region is used as is as the estimated value of the block of interest.

The block of interest is then restored by adding the fifth difference data to the estimated value of the block of interest.
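The situation that motivates the fifth difference data, a subject that is absent from the past reference image but present in the future one, can be reproduced with toy frames. Everything in this sketch is an assumption made for illustration: the helper `best_match`, the frame sizes, the pixel values 10 and 200, and the search radius.

```python
import numpy as np

def best_match(block, reference, x0, y0, search):
    """Return (cost, du, dv) with the smallest sum of absolute differences."""
    size = block.shape[0]
    height, width = reference.shape
    wide_block = block.astype(np.int64)
    candidates = []
    for dv in range(-search, search + 1):
        for du in range(-search, search + 1):
            x, y = x0 + du, y0 + dv
            if 0 <= x and 0 <= y and x + size <= width and y + size <= height:
                region = reference[y:y + size, x:x + size].astype(np.int64)
                candidates.append((int(np.abs(wide_block - region).sum()), du, dv))
    return min(candidates)

# Toy frames: a flat background of value 10 and a 3x3 subject of value 200.
past = np.full((8, 12), 10, dtype=np.uint8)   # subject not yet present
target = past.copy()
target[2:5, 4:7] = 200                        # subject has appeared
future = past.copy()
future[2:5, 6:9] = 200                        # subject moved 2 pixels right

block = target[2:5, 4:7]
past_cost, _, _ = best_match(block, past, 4, 2, 3)
future_cost, du, dv = best_match(block, future, 4, 2, 3)
# Fifth difference data: block minus the displaced future region, used as is.
estimate = future[2 + dv:5 + dv, 4 + du:7 + du].astype(np.int32)
fifth_difference = block.astype(np.int32) - estimate
```

In this toy setup no displacement into the past frame gives a small cost, while the future frame contains an exact match, so predicting from the future reference image alone leaves an all-zero difference.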

Next, the sixth difference data obtained by the difference data calculation unit 239 in FIG. 81 will be described.

The difference data calculation unit 239 takes the estimated value of the block of interest to be 0 and outputs the difference between the block of interest and that estimated value as the sixth difference data.

That is, FIG. 93 shows (frames of) the target image V_T, the past reference image V_P, and the future reference image V_F onto which the subjects P₈₃₀₁ and P₈₃₀₂ shown in FIG. 83 are projected.

In FIG. 93, for example, the moving subject P₈₃₀₂ suddenly appears and then suddenly disappears. That is, in FIG. 93, the moving subject P₈₃₀₂ does not exist at the time of the past reference image V_P, appears at some point before the time of the target image V_T, and disappears again before the time of the future reference image V_F.

Now, if a block B₉₃₀₁ in the target image V_T that contains, for example, only the subject P₈₃₀₂ is taken as the target block, neither the past reference image V_P nor the future reference image V_F in FIG. 93 contains a region identical (or similar) to the target block B₉₃₀₁.

The difference data calculation unit 239 (FIG. 81) obtains sixth difference data suited to the case shown in FIG. 93.

That is, when neither the past reference image V_P nor the future reference image V_F contains a region identical (or similar) to the target block B₉₃₀₁, any estimated value of the target block B₉₃₀₁ obtained from the past reference image V_P or the future reference image V_F will be far from the image data of the target block B₉₃₀₁. Consequently, the difference between the target block B₉₃₀₁ and such an estimated value will seldom be close to 0 and will instead be large, so the data amount of the target block B₉₃₀₁ cannot be reduced.

Therefore, the difference data calculation unit 239 outputs the target block B₉₃₀₁ itself as the sixth difference data, without obtaining an estimated value of the target block B₉₃₀₁. As described above, the sixth difference data can also be regarded as the result of taking the difference between the target block and its estimated value with the estimated value set to 0.

Of the first to sixth difference data described above, the third to sixth difference data are conceptually similar to B pictures in MPEG.

In the above description, the sum of absolute differences is used as the correlation information representing the correlation in order to reduce the amount of computation; however, a correlation coefficient or the like can also be used as the correlation information.
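The trade-off between the two measures can be illustrated with a short sketch. The function names and the flattened pixel lists are assumptions for the example. The sum of absolute differences needs only subtractions and additions, whereas a (Pearson) correlation coefficient needs means, products, and a square root; note also that a smaller sum indicates higher correlation, while a coefficient closer to 1 does.

```python
import math


def sad(a, b):
    """Sum of absolute differences between two equal-length pixel lists.
    Smaller values mean the two regions are more alike."""
    return sum(abs(p - q) for p, q in zip(a, b))


def correlation_coefficient(a, b):
    """Pearson correlation coefficient of two equal-length pixel lists.
    Returns 0.0 when either list is constant (the coefficient is undefined)."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b))
    var_a = sum((p - mean_a) ** 2 for p in a)
    var_b = sum((q - mean_b) ** 2 for q in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)
```

Unlike the sum of absolute differences, the correlation coefficient is insensitive to a uniform gain or offset between the two regions, which is one reason it might be preferred when computation is not the limiting factor.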

Next, FIG. 94 shows a configuration example of the difference data calculation unit 234 in FIG. 81, which obtains the first difference data.

In FIG. 94, the target block (its image data) from the input terminal 233 in FIG. 81 is input to the input terminal 251. The past reference image V_P from the input terminal 231 in FIG. 81 is input to the input terminal 252, and the future reference image V_F from the input terminal 232 in FIG. 81 is input to the input terminal 253.

The target block input from the input terminal 251 is supplied to the subtraction unit 256.

The past reference image V_P input from the input terminal 252 is supplied to the maximum correlation position detection unit 254 and the average value calculation unit 255. The future reference image V_F input from the input terminal 253 is likewise supplied to the maximum correlation position detection unit 254 and the average value calculation unit 255.

For the target block from the input terminal 251, the maximum correlation position detection unit 254 detects a highly correlated positional relationship between the past reference image V_P from the input terminal 252 and the future reference image V_F from the input terminal 253. That is, for the target block, the maximum correlation position detection unit 254 calculates the correlation information e₁(U',V') of Equation (9) and detects the (U',V') that maximizes the correlation represented by e₁(U',V'), that is, the (U',V') that minimizes the value of e₁(U',V'). The maximum correlation position detection unit 254 then supplies that (U',V') to the average value calculation unit 255 as the positional relationship vector (U,V), which represents the positional relationship between the most highly correlated regions of the past reference image V_P and the future reference image V_F.

The average value calculation unit 255 obtains an estimated value of the target block from the image data of the regions of the past reference image V_P and the future reference image V_F that are in the positional relationship represented by the positional relationship vector (U,V) from the maximum correlation position detection unit 254, and supplies the estimated value to the subtraction unit 256.

That is, the average value calculation unit 255 calculates, according to Equation (13), for example a weighted average of the image data of the region of the past reference image V_P shifted from the target block by the vector (-(4-s)U,-(4-s)V) and the image data of the region of the future reference image V_F shifted from the target block by the vector (sU,sV), and supplies that weighted average to the subtraction unit 256 as the estimated value Pre(x,y) of the target block.

Pre(x,y) = W × Ip(x-(4-s)U, y-(4-s)V) + (1-W) × If(x+sU, y+sV)   ... (13)

Here, in Equation (13), (x,y) represents the position of each pixel of the target block in the target image V_T. Ip(x-(4-s)U,y-(4-s)V) represents the pixel value of the pixel at position (x-(4-s)U,y-(4-s)V) in the past reference image V_P, and If(x+sU,y+sV) represents the pixel value of the pixel at position (x+sU,y+sV) in the future reference image V_F. W is a weight, for which a value such as 1/2 can be used.

The weight W can be set, for example, according to a user's operation input. Alternatively, the weight W may be determined in consideration of the ratio between (4-s)/240 seconds, which is the time difference between the target image V_T and the past reference image V_P shown in FIG. 82, and s/240 seconds, which is the time difference between the target image V_T and the future reference image V_F.

That is, when (4-s)/240 seconds, the time difference between the target image V_T and the past reference image V_P, is shorter than s/240 seconds, the time difference between the target image V_T and the future reference image V_F, a more accurate estimated value of the target block is obtained by giving greater weight to the past reference image V_P. Conversely, when s/240 seconds, the time difference between the target image V_T and the future reference image V_F, is shorter than (4-s)/240 seconds, the time difference between the target image V_T and the past reference image V_P, a more accurate estimated value is obtained by giving greater weight to the future reference image V_F. The weight W can therefore be a value that depends on the variable s representing the time difference, for example, s/4.
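The weighted estimate can be sketched as below, assuming that Equation (13) has the form Pre = W × Ip + (1-W) × If. That form is inferred from the description of W (with W = s/4, a larger s puts more weight on the past reference, which is then the closer frame in time); it is not quoted from the equation itself. The function name, the `image[y][x]` layout, and the block origin `(x0, y0)` are likewise assumptions for the example.

```python
def predict_block(past_ref, future_ref, x0, y0, size, u, v, s, weight=None):
    """Estimate Pre(x, y) for the size x size block whose top-left pixel is
    (x0, y0), given the positional relationship vector (u, v).

    The past region is shifted by (-(4-s)u, -(4-s)v) and the future region by
    (s*u, s*v). The caller must keep every shifted index inside the images.
    """
    if weight is None:
        weight = s / 4  # time-difference-dependent weight suggested in the text
    pred = []
    for y in range(y0, y0 + size):
        row = []
        for x in range(x0, x0 + size):
            ip = past_ref[y - (4 - s) * v][x - (4 - s) * u]
            i_f = future_ref[y + s * v][x + s * u]
            row.append(weight * ip + (1 - weight) * i_f)
        pred.append(row)
    return pred
```

With s = 1 the target frame is closest to the future reference, so the default weight 1/4 lets the future reference contribute three quarters of the estimate.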

The subtraction unit 256 compresses the target block (its image data) Ic(x,y) supplied from the input terminal 251 by using the estimated value Pre(x,y) of the target block supplied from the average value calculation unit 255. That is, the subtraction unit 256 calculates the differences between the pixel values of corresponding pixels of the target block Ic(x,y) and the estimated value Pre(x,y), and supplies the resulting difference values Sub(x,y)=Ic(x,y)-Pre(x,y) to the transform unit 257.

The transform unit 257 converts the difference values Sub(x,y) of the target block from the subtraction unit 256 into data in the frequency domain. That is, the transform unit 257 applies, for example, a DCT (Discrete Cosine Transform) to the difference values Sub(x,y) of the target block and supplies the resulting DCT coefficients to the quantization unit 258.

The quantization unit 258 quantizes the DCT coefficients of the target block from the transform unit 257 and supplies the resulting quantized data to the variable length coding unit 259.

The variable length coding unit 259 variable-length codes the quantized data of the target block from the quantization unit 258 and outputs the resulting variable length code, as the first difference data, from the output terminal 260 to the selection circuit 240 (FIG. 81). When the target block (its image data) Ic(x,y) and its estimated value Pre(x,y) match exactly, the variable length code obtained by variable-length coding the quantized data of the target block in the variable length coding unit 259 represents NULL.
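The subtraction, transform, and quantization stages can be sketched as follows. This is an illustration under stated assumptions: an orthonormal two-dimensional DCT-II written out directly, and a single uniform quantization step `step`, which the text does not specify. The function names are hypothetical, and variable-length coding is left out.

```python
import math


def dct_2d(block):
    """Orthonormal 2-D DCT-II of a square block given as a list of rows."""
    n = len(block)

    def alpha(k):
        return math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)

    out = [[0.0] * n for _ in range(n)]
    for v in range(n):
        for u in range(n):
            total = 0.0
            for y in range(n):
                for x in range(n):
                    total += (block[y][x]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            out[v][u] = alpha(u) * alpha(v) * total
    return out


def encode_residual(block, estimate, step=8):
    """Sub = Ic - Pre, then DCT, then uniform quantization with the given step."""
    sub = [[c - p for c, p in zip(crow, prow)]
           for crow, prow in zip(block, estimate)]
    return [[round(c / step) for c in row] for row in dct_2d(sub)]
```

When the estimate matches the block exactly, every quantized coefficient is 0, which corresponds to the case in which the resulting code carries no residual information.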

Next, the operation of the difference data calculation unit 234 in FIG. 94 will be described with reference to the flowchart in FIG. 95.

First, in step S211, the target block, the past reference image V_P, and the future reference image V_F are input. That is, the target block is input from the input terminal 251 to the subtraction unit 256, and the past reference image V_P is input from the input terminal 252 to the maximum correlation position detection unit 254 and the average value calculation unit 255. Further, the future reference image V_F is input from the input terminal 253 to the maximum correlation position detection unit 254 and the average value calculation unit 255.

The process then proceeds from step S211 to step S212, where, for the target block from the input terminal 251, the maximum correlation position detection unit 254 detects a highly correlated positional relationship between the past reference image V_P from the input terminal 252 and the future reference image V_F from the input terminal 253. That is, for the target block, the maximum correlation position detection unit 254 calculates the correlation information e₁(U',V') of Equation (9) for every possible value of (U',V') and detects the (U',V') that maximizes the correlation represented by e₁(U',V'), that is, the (U',V') that minimizes the value of e₁(U',V'). The maximum correlation position detection unit 254 then supplies that (U',V') to the average value calculation unit 255 as the positional relationship vector (U,V), which represents the positional relationship between the most highly correlated regions of the past reference image V_P and the future reference image V_F, and the process proceeds from step S212 to step S213.
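The exhaustive search in step S212 can be sketched as follows. Equation (9) is not reproduced in this passage, so the cost used here is an assumption: the sum of absolute differences between the past-reference region shifted by (-(4-s)U',-(4-s)V') and the future-reference region shifted by (sU',sV'). The function name, the `search_range` bound on the candidates, and the `image[y][x]` layout are also assumptions for the example.

```python
def find_position_vector(past_ref, future_ref, x0, y0, size, s, search_range):
    """Try every (u, v) with |u|, |v| <= search_range and return the one whose
    shifted past and future regions differ least (smallest assumed e1).

    The caller must keep every shifted index inside the images.
    """
    best = None
    for v in range(-search_range, search_range + 1):
        for u in range(-search_range, search_range + 1):
            cost = 0
            for y in range(y0, y0 + size):
                for x in range(x0, x0 + size):
                    ip = past_ref[y - (4 - s) * v][x - (4 - s) * u]
                    i_f = future_ref[y + s * v][x + s * u]
                    cost += abs(ip - i_f)
            if best is None or cost < best[0]:
                best = (cost, u, v)
    return best[1], best[2]
```

Because this cost uses only the two reference images and not the target block, a receiver holding the same references could repeat the search, which is consistent with the first difference data carrying no vector.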

In step S213, the average value calculation unit 255 obtains, according to Equation (13), the estimated value Pre(x,y) of the target block from the image data of the regions of the past reference image V_P and the future reference image V_F that are in the positional relationship represented by the positional relationship vector (U,V) from the maximum correlation position detection unit 254, supplies the estimated value to the subtraction unit 256, and the process proceeds to step S214.

In step S214, the subtraction unit 256 calculates the differences between the pixel values of corresponding pixels of the target block Ic(x,y) supplied from the input terminal 251 and the estimated value Pre(x,y) of the target block supplied from the average value calculation unit 255, supplies the resulting difference values Sub(x,y) to the transform unit 257, and the process proceeds to step S215.

In step S215, the transform unit 257 applies a DCT to the difference values Sub(x,y) of the target block from the subtraction unit 256, supplies the resulting DCT coefficients to the quantization unit 258, and the process proceeds to step S216. In step S216, the quantization unit 258 quantizes the DCT coefficients of the target block from the transform unit 257, supplies the resulting quantized data to the variable length coding unit 259, and the process proceeds to step S217. In step S217, the variable length coding unit 259 variable-length codes the quantized data of the target block from the quantization unit 258, and the process proceeds to step S218, where the resulting variable length code is output, as the first difference data, from the output terminal 260 to the selection circuit 240 (FIG. 81). Accordingly, the first difference data does not include the positional relationship vector (U,V).

In this way, the difference data calculation unit 234 compresses the target image V_T, block by block, into the first difference data.

Next, FIG. 96 shows a configuration example of the difference data calculation unit 235 in FIG. 81, which obtains the second difference data.

In FIG. 96, the target block (its image data) from the input terminal 233 in FIG. 81 is input to the input terminal 271. The past reference image V_P from the input terminal 231 in FIG. 81 is input to the input terminal 272, and the future reference image V_F from the input terminal 232 in FIG. 81 is input to the input terminal 273.

The target block input from the input terminal 271 is supplied to the maximum correlation position detection unit 274 and the subtraction unit 276.

The past reference image V_P input from the input terminal 272 is supplied to the maximum correlation position detection unit 274 and the average value calculation unit 275. The future reference image V_F input from the input terminal 273 is likewise supplied to the maximum correlation position detection unit 274 and the average value calculation unit 275.

For the target block from the input terminal 271, the maximum correlation position detection unit 274 detects, in the past reference image V_P from the input terminal 272 and the future reference image V_F from the input terminal 273, a positional relationship that is highly correlated with the target block. That is, for the target block, the maximum correlation position detection unit 274 calculates the correlation information e₂(U',V') of Equation (10) and detects the (U',V') that maximizes the correlation represented by e₂(U',V'), that is, the (U',V') that minimizes the value of e₂(U',V'). The maximum correlation position detection unit 274 then supplies that (U',V') to the average value calculation unit 275 and the variable length coding unit 279 as the maximum correlation vector (U,V), which represents the positional relationship of the regions of the past reference image V_P and the future reference image V_F that are most highly correlated with the target block.

The average value calculation unit 275 obtains an estimated value of the target block from the image data of the regions of the past reference image V_P and the future reference image V_F whose positional relationship with the target block is the one obtained from the maximum correlation vector (U,V) from the maximum correlation position detection unit 274, and supplies the estimated value to the subtraction unit 276.

That is, the average value calculation unit 275 calculates, according to Equation (13) above, for example a weighted average of the image data of the region of the past reference image V_P shifted from the target block by the vector (-(4-s)U,-(4-s)V) and the image data of the region of the future reference image V_F shifted from the target block by the vector (sU,sV), and supplies that weighted average to the subtraction unit 276 as the estimated value Pre(x,y) of the target block.

The subtraction unit 276 compresses the target block (its image data) Ic(x,y) supplied from the input terminal 271 by using the estimated value Pre(x,y) of the target block supplied from the average value calculation unit 275. That is, the subtraction unit 276 calculates the differences between the pixel values of corresponding pixels of the target block Ic(x,y) and the estimated value Pre(x,y), and supplies the resulting difference values Sub(x,y)=Ic(x,y)-Pre(x,y) to the transform unit 277.

The transform unit 277 converts the difference values Sub(x,y) of the target block from the subtraction unit 276 into data in the frequency domain. That is, the transform unit 277 applies, for example, a DCT to the difference values Sub(x,y) of the target block and supplies the resulting DCT coefficients to the quantization unit 278.

The quantization unit 278 quantizes the DCT coefficients of the target block from the transform unit 277 and supplies the resulting quantized data to the variable length coding unit 279.

The variable length coding unit 279 variable-length codes the quantized data of the target block from the quantization unit 278 and also variable-length codes the maximum correlation vector (U,V) from the maximum correlation position detection unit 274. The variable length coding unit 279 then appends the variable length code of the maximum correlation vector (U,V) to the variable length code of the quantized data of the target block and outputs the result, as the second difference data, from the output terminal 280 to the selection circuit 240 (FIG. 81). When the target block (its image data) Ic(x,y) and its estimated value Pre(x,y) match exactly, the variable length code obtained by variable-length coding the quantized data of the target block in the variable length coding unit 279 represents NULL, and the second difference data consists only of the variable length code of the maximum correlation vector (U,V).

Next, the operation of the difference data calculation unit 235 in FIG. 96 will be described with reference to the flowchart in FIG. 97.

First, in step S221, the target block, the past reference image V_P, and the future reference image V_F are input. That is, the target block is input from the input terminal 271 to the maximum correlation position detection unit 274 and the subtraction unit 276, and the past reference image V_P is input from the input terminal 272 to the maximum correlation position detection unit 274 and the average value calculation unit 275. Further, the future reference image V_F is input from the input terminal 273 to the maximum correlation position detection unit 274 and the average value calculation unit 275.

The process then proceeds from step S221 to step S222, where, for the target block from the input terminal 271, the maximum correlation position detection unit 274 detects, in the past reference image V_P from the input terminal 272 and the future reference image V_F from the input terminal 273, a positional relationship that is highly correlated with the target block. That is, for the target block, the maximum correlation position detection unit 274 calculates the correlation information e₂(U',V') of Equation (10) for every possible value of (U',V') and detects the (U',V') that maximizes the correlation represented by e₂(U',V'), that is, the (U',V') that minimizes the value of e₂(U',V'). The maximum correlation position detection unit 274 then supplies that (U',V') to the average value calculation unit 275 and the variable length coding unit 279 as the maximum correlation vector (U,V), which represents the positional relationship of the regions of the past reference image V_P and the future reference image V_F that are most highly correlated with the target block, and the process proceeds from step S222 to step S223.

In step S223, the average value calculation unit 275 obtains, according to Equation (13), the estimated value Pre(x,y) of the target block from the image data of the regions of the past reference image V_P and the future reference image V_F whose positional relationship with the target block is the one obtained from the maximum correlation vector (U,V) from the maximum correlation position detection unit 274, supplies the estimated value to the subtraction unit 276, and the process proceeds to step S224.

In step S224, the subtraction unit 276 calculates the differences between the pixel values of corresponding pixels of the target block Ic(x,y) supplied from the input terminal 271 and the estimated value Pre(x,y) of the target block supplied from the average value calculation unit 275, supplies the resulting difference values Sub(x,y) to the transform unit 277, and the process proceeds to step S225.

In step S225, the transform unit 277 applies a DCT to the difference values Sub(x,y) of the target block from the subtraction unit 276, supplies the resulting DCT coefficients to the quantization unit 278, and the process proceeds to step S226. In step S226, the quantization unit 278 quantizes the DCT coefficients of the target block from the transform unit 277, supplies the resulting quantized data to the variable length coding unit 279, and the process proceeds to step S227. In step S227, the variable length coding unit 279 variable-length codes the quantized data of the target block from the quantization unit 278 and also variable-length codes the maximum correlation vector (U,V) from the maximum correlation position detection unit 274. Further, in step S227, the variable length coding unit 279 forms the second difference data by appending the variable length code of the maximum correlation vector (U,V) to the variable length code of the quantized data of the target block. The process then proceeds to step S228, where the second difference data is output from the output terminal 280 to the selection circuit 240 (FIG. 81).

In this way, the difference data calculation unit 235 compresses the target image V_T, block by block, into the second difference data.

Next, FIG. 98 shows a configuration example of the difference data calculation unit 236 in FIG. 81, which obtains the third difference data.

In FIG. 98, the target block (its image data) from the input terminal 233 in FIG. 81 is input to the input terminal 291. The past reference image V_P from the input terminal 231 in FIG. 81 is input to the input terminal 292, and the future reference image V_F from the input terminal 232 in FIG. 81 is input to the input terminal 293.

The target block input from the input terminal 291 is supplied to the maximum correlation position detection units 294 and 295 and to the subtraction unit 297.

The past reference image V_P input from the input terminal 292 is supplied to the maximum correlation position detection unit 294 and the average value calculation unit 296. The future reference image V_F input from the input terminal 293 is supplied to the maximum correlation position detection unit 295 and the average value calculation unit 296.

相関最大位置検出部２９４は、入力端子２９２からのパストリファレンス画像V_Pにおいて、入力端子２９１からの注目ブロックとの相関が高い位置関係を検出する相関最大位置検出部２９５は、入力端子２９３からのフューチャリファレンス画像V_Fにおいて、入力端子２９１からの注目ブロックとの相関が高い位置関係を検出する。 Maximum correlation position detection section 294, in Pasto reference image V _P from the input terminal 292, maximum correlation position detection section 295 for detecting a correlation is high positional relationship between the target block from the input terminal 291, from the input terminal 293 in Futuresse reference image V _F, it detects the correlation is high positional relationship between the target block from the input terminal 291.

即ち、相関最大位置検出部２９４は、注目ブロックについて、式（１１）の相関情報e₃(U',V')を計算し、その相関情報e₃(U',V')が表す相関を最大にする(U',V')、つまり、相関情報e₃(U',V')の値を最小にする(U',V')を検出する。そして、相関最大位置検出部２９４は、相関情報e₃(U',V')の値を最小にする(U',V')を、パストリファレンス画像V_Pにおいて、注目ブロックとの相関が最も高い領域の位置関係を表す前動きベクトル(-U₁,-V₁)として、平均値計算部２９６および可変長符号化部３００に供給する。 That is, the maximum correlation position detection unit 294 calculates the correlation information e ₃ (U ′, V ′) of Expression (11) for the block of interest, and calculates the correlation represented by the correlation information e ₃ (U ′, V ′). The maximum (U ′, V ′), that is, the minimum (U ′, V ′) of the correlation information e ₃ (U ′, V ′) is detected. Then, the maximum correlation position detection section 294, correlation information _{e 3 (U ', V'} ) value to smallest (U ', V'), in the PAST reference image V _P, the correlation between the target block is the most This is supplied to the average value calculation unit 296 and the variable length coding unit 300 as a previous motion vector (−U ₁ , −V ₁ ) representing the positional relationship of a high region.

また、相関最大位置検出部２９５は、注目ブロックについて、式（１２）の相関情報e₄(U',V')を計算し、その相関情報e₄(U',V')が表す相関を最大にする(U',V')、つまり、相関情報e₄(U',V')の値を最小にする(U',V')を検出する。そして、相関最大位置検出部２９５は、相関情報e₄(U',V')の値を最小にする(U',V')を、フューチャリファレンス画像V_Fにおいて、注目ブロックとの相関が最も高い領域の位置関係を表す後動きベクトル(U₂,V₂)として、平均値計算部２９６および可変長符号化部３００に供給する。 In addition, the maximum correlation position detection unit 295 calculates the correlation information e ₄ (U ′, V ′) of Expression (12) for the block of interest, and calculates the correlation represented by the correlation information e ₄ (U ′, V ′). It is detected to maximize (U ′, V ′), that is, to minimize the value of correlation information e ₄ (U ′, V ′). Then, the maximum correlation position detection section 295, correlation information _{e 4 (U ', V'} ) to minimize the value of the (U ', V'), in Futuresse reference image V _F, the correlation between the target block is the most This is supplied to the average value calculation unit 296 and the variable length coding unit 300 as a post motion vector (U ₂ , V ₂ ) representing the positional relationship of the high region.
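As an illustration of the search performed by the maximum correlation position detection units 294 and 295, the following Python sketch tries every candidate offset in a small window and keeps the one with the smallest error. Expressions (11) and (12) are not reproduced in this passage, so the sum of absolute differences used as the error measure, the function names `sad` and `best_offset`, and the `search` window size are all assumptions made for illustration only.

```python
def sad(block, ref, top, left):
    """Sum of absolute differences between `block` and the same-sized
    region of `ref` whose top-left corner is (top, left).
    (Assumed error measure; a smaller value means a higher correlation.)"""
    return sum(
        abs(block[y][x] - ref[top + y][left + x])
        for y in range(len(block))
        for x in range(len(block[0]))
    )


def best_offset(block, ref, by, bx, search=2):
    """Return the (dx, dy) offset within +/-search that minimizes the SAD
    for a block whose top-left corner in the target image is (bx, by)."""
    h, w = len(block), len(block[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            top, left = by + dy, bx + dx
            # Skip candidates that fall outside the reference image.
            if top < 0 or left < 0 or top + h > len(ref) or left + w > len(ref[0]):
                continue
            cost = sad(block, ref, top, left)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best[1], best[2]
```

Run against the past reference image, the returned offset plays the role of (-U₁,-V₁); run against the future reference image, it plays the role of (U₂,V₂).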

The average value calculation unit 296 obtains an estimated value of the block of interest from two sets of image data: the image data of the region of the past reference image V_P whose positional relationship with the block of interest is that represented by the previous motion vector (-U₁,-V₁) from the maximum correlation position detection unit 294, and the image data of the region of the future reference image V_F whose positional relationship with the block of interest is that represented by the back motion vector (U₂,V₂) from the maximum correlation position detection unit 295. It supplies the estimated value to the subtraction unit 297.

That is, the average value calculation unit 296 calculates, for example, a weighted average of the image data of the region in the past reference image V_P shifted from the block of interest by the previous motion vector (-U₁,-V₁) and the image data of the region in the future reference image V_F shifted from the block of interest by the back motion vector (U₂,V₂), in accordance with Expression (14). It obtains this weighted average as the estimated value Pre(x,y) of the block of interest and supplies it to the subtraction unit 297.

... (14)

Here, in Expression (14), (x,y) represents the position of each pixel of the block of interest in the target image V_T. Ip(x-U₁,y-V₁) represents the pixel value of the pixel at position (x-U₁,y-V₁) in the past reference image V_P, and If(x+U₂,y+V₂) represents the pixel value of the pixel at position (x+U₂,y+V₂) in the future reference image V_F. W is a weight, for which a value such as 1/2 can be used.
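Using the symbols defined above, a weighted average of this kind can be written as below. This is a sketch consistent with those definitions rather than a verbatim copy of the expression; in particular, assigning W to the past term and 1-W to the future term is an assumption.

```latex
\mathrm{Pre}(x,y) = W \cdot I_p(x-U_1,\; y-V_1) + (1-W) \cdot I_f(x+U_2,\; y+V_2)
```

With W = 1/2 the two terms are weighted equally, so Pre(x,y) is the plain mean of the two motion-compensated pixel values.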

The subtraction unit 297 compresses the block of interest (its image data) Ic(x,y) supplied from the input terminal 291 using the estimated value Pre(x,y) of the block of interest supplied from the average value calculation unit 296. That is, the subtraction unit 297 calculates the differences between the pixel values of corresponding pixels of the block of interest Ic(x,y) supplied from the input terminal 291 and the estimated value Pre(x,y) supplied from the average value calculation unit 296, and supplies the resulting difference value Sub(x,y)=Ic(x,y)-Pre(x,y) to the conversion unit 298.
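The prediction and subtraction just described can be sketched in Python as follows. The function names, the list-of-rows image layout, and the convention that W weights the past reference (with 1-W on the future reference) are assumptions for illustration; only Sub(x,y)=Ic(x,y)-Pre(x,y) is taken directly from the text.

```python
def predict_bidirectional(past, future, by, bx, h, w, mv_past, mv_future, weight=0.5):
    """Weighted average of two motion-compensated regions.

    mv_past is (-U1, -V1) and mv_future is (U2, V2), each given as (dx, dy).
    (by, bx) is the top-left corner of the block in the target image.
    """
    px, py = mv_past
    fx, fy = mv_future
    return [
        [
            weight * past[by + y + py][bx + x + px]
            + (1 - weight) * future[by + y + fy][bx + x + fx]
            for x in range(w)
        ]
        for y in range(h)
    ]


def residual(block, pred):
    """Sub(x, y) = Ic(x, y) - Pre(x, y) for every pixel of the block."""
    return [[c - p for c, p in zip(brow, prow)] for brow, prow in zip(block, pred)]
```

When the prediction is good, most entries of the residual are zero or small, which is what makes the later transform and coding stages effective.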

変換部２９８は、減算部２９７からの注目ブロックの差分値Sub(x,y)を、周波数空間上のデータに変換する。即ち、変換部２９８は、注目ブロックの差分値Sub(x,y)を、例えば、DCT変換し、その結果得られるDCT係数を、量子化部２９９に供給する。 The conversion unit 298 converts the difference value Sub (x, y) of the target block from the subtraction unit 297 into data on the frequency space. That is, the transform unit 298 performs, for example, DCT transform on the difference value Sub (x, y) of the block of interest, and supplies the resulting DCT coefficient to the quantization unit 299.

量子化部２９９は、変換部２９８からの注目ブロックのDCT係数を量子化し、その結果得られる量子化データを、可変長符号化部３００に供給する。 The quantization unit 299 quantizes the DCT coefficient of the block of interest from the transform unit 298 and supplies the quantized data obtained as a result to the variable length coding unit 300.
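The transform and quantization stages can be illustrated with a small pure-Python sketch. The text names the DCT only as an example and does not specify the quantizer, so the orthonormal DCT-II normalization, the single uniform `step`, and the function names below are assumptions.

```python
import math


def dct_2d(block):
    """Orthonormal 2-D DCT-II of a square block given as a list of rows."""
    n = len(block)

    def alpha(k):
        # Normalization factor that makes the transform orthonormal.
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    return [
        [
            alpha(u) * alpha(v) * sum(
                block[y][x]
                * math.cos((2 * y + 1) * u * math.pi / (2 * n))
                * math.cos((2 * x + 1) * v * math.pi / (2 * n))
                for y in range(n)
                for x in range(n)
            )
            for v in range(n)
        ]
        for u in range(n)
    ]


def quantize(coeffs, step=8):
    """Uniform quantization: divide by the step size and round to an integer."""
    return [[round(c / step) for c in row] for row in coeffs]
```

A residual block that is zero everywhere yields all-zero quantized data, which corresponds to the case in which the block of interest and its estimated value match completely.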

The variable length coding unit 300 variable-length-codes the quantized data of the block of interest from the quantization unit 299, and also variable-length-codes the previous motion vector (-U₁,-V₁) from the maximum correlation position detection unit 294 and the back motion vector (U₂,V₂) from the maximum correlation position detection unit 295. Further, the variable length coding unit 300 appends the variable length codes of the previous motion vector (-U₁,-V₁) and the back motion vector (U₂,V₂) to the variable length code of the quantized data of the block of interest, and outputs the result as the third difference data from the output terminal 301 to the selection circuit 240 (FIG. 81). When the block of interest (its image data) Ic(x,y) and its estimated value Pre(x,y) match completely, the variable length code obtained by variable-length-coding the quantized data of the block of interest in the variable length coding unit 300 represents NULL, and the third difference data consists only of the variable length codes of the previous motion vector (-U₁,-V₁) and the back motion vector (U₂,V₂).
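The way the third difference data is assembled can be sketched as follows. The text does not say which variable length code is used, so the signed order-0 Exp-Golomb code here is a stand-in chosen purely for illustration, and the function names are hypothetical.

```python
def exp_golomb_unsigned(n):
    """Order-0 Exp-Golomb code for n >= 0, returned as a bit string."""
    bits = bin(n + 1)[2:]
    return "0" * (len(bits) - 1) + bits


def exp_golomb_signed(v):
    """Map v to a non-negative index (0, 1, -1, 2, -2, ...) and encode it."""
    index = 2 * v - 1 if v > 0 else -2 * v
    return exp_golomb_unsigned(index)


def encode_third_difference(quantized, mv_past, mv_future):
    """Code of the quantized data followed by the codes of both motion vectors."""
    mv_bits = "".join(exp_golomb_signed(c) for c in (*mv_past, *mv_future))
    if all(q == 0 for row in quantized for q in row):
        # Perfect prediction: the residual code is empty (the "NULL" case),
        # so only the motion vector codes remain.
        return mv_bits
    coeff_bits = "".join(exp_golomb_signed(q) for row in quantized for q in row)
    return coeff_bits + mv_bits
```

A practical bitstream would also need some way for the decoder to tell the empty-residual case apart from the general one, such as a flag or an end-of-block marker; that detail is outside this sketch.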

Next, the operation of the difference data calculation unit 236 in FIG. 98 will be described with reference to the flowchart in FIG. 99.

まず最初に、ステップＳ２３１において、注目ブロック、パストリファレンス画像V_P、およびフューチャリファレンス画像V_Fが入力される。即ち、注目ブロックが、入力端子２９１から相関最大位置検出部２９４，２９５、および減算部２９７に入力され、パストリファレンス画像V_Pが、入力端子２９２から、相関最大位置検出部２９４および平均値計算部２９６に入力される。さらに、フューチャリファレンス画像V_Fが、入力端子２９３から、相関最大位置検出部２９５および平均値計算部２９６に入力される。 First, in step S231, the target block, past reference image V _P , and future reference image V _F are input. That is, the block of interest is input from the input terminal 291 to the correlation maximum position detection units 294 and 295 and the subtraction unit 297, and the past reference image V _P is input from the input terminal 292 to the correlation maximum position detection unit 294 and the average value calculation unit. 296. Further, the future reference image V _F is input from the input terminal 293 to the maximum correlation position detection unit 295 and the average value calculation unit 296.

The process then proceeds from step S231 to S232, where the maximum correlation position detection unit 294 detects, for the block of interest from the input terminal 291, the positional relationship having a high correlation with the block of interest in the past reference image V_P from the input terminal 292. That is, the maximum correlation position detection unit 294 calculates the correlation information e₃(U',V') of Expression (11) for every possible value of (U',V') for the block of interest, and detects the (U',V') that maximizes the correlation represented by e₃(U',V'), that is, the (U',V') that minimizes the value of e₃(U',V'). The maximum correlation position detection unit 294 then supplies that (U',V') to the average value calculation unit 296 and the variable length coding unit 300 as the previous motion vector (-U₁,-V₁), which represents the positional relationship of the region in the past reference image V_P having the highest correlation with the block of interest, and the process proceeds from step S232 to S233.

In step S233, the maximum correlation position detection unit 295 detects, for the block of interest from the input terminal 291, the positional relationship having a high correlation with the block of interest in the future reference image V_F from the input terminal 293. That is, the maximum correlation position detection unit 295 calculates the correlation information e₄(U',V') of Expression (12) for every possible value of (U',V') for the block of interest, and detects the (U',V') that maximizes the correlation represented by e₄(U',V'), that is, the (U',V') that minimizes the value of e₄(U',V'). The maximum correlation position detection unit 295 then supplies that (U',V') to the average value calculation unit 296 and the variable length coding unit 300 as the back motion vector (U₂,V₂), which represents the positional relationship of the region in the future reference image V_F having the highest correlation with the block of interest, and the process proceeds from step S233 to S234.

In step S234, the average value calculation unit 296 obtains the estimated value Pre(x,y) of the block of interest in accordance with Expression (14) from the image data of the region of the past reference image V_P whose positional relationship with the block of interest is that represented by the previous motion vector (-U₁,-V₁) from the maximum correlation position detection unit 294, and the image data of the region of the future reference image V_F whose positional relationship with the block of interest is that represented by the back motion vector (U₂,V₂) from the maximum correlation position detection unit 295. It supplies the estimated value to the subtraction unit 297, and the process proceeds to step S235.

In step S235, the subtraction unit 297 calculates the differences between the pixel values of corresponding pixels of the estimated value Pre(x,y) of the block of interest supplied from the average value calculation unit 296 and the block of interest Ic(x,y) supplied from the input terminal 291, supplies the resulting difference value Sub(x,y) to the conversion unit 298, and the process proceeds to step S236.

In step S236, the conversion unit 298 applies a DCT to the difference value Sub(x,y) of the block of interest from the subtraction unit 297, supplies the resulting DCT coefficients to the quantization unit 299, and the process proceeds to step S237. In step S237, the quantization unit 299 quantizes the DCT coefficients of the block of interest from the conversion unit 298, supplies the resulting quantized data to the variable length coding unit 300, and the process proceeds to step S238. In step S238, the variable length coding unit 300 variable-length-codes the quantized data of the block of interest from the quantization unit 299, and also variable-length-codes the previous motion vector (-U₁,-V₁) from the maximum correlation position detection unit 294 and the back motion vector (U₂,V₂) from the maximum correlation position detection unit 295. Further, in step S238, the variable length coding unit 300 appends the variable length codes of the previous motion vector (-U₁,-V₁) and the back motion vector (U₂,V₂) to the variable length code of the quantized data of the block of interest to form the third difference data. The process then proceeds to step S239, where the third difference data is output from the output terminal 301 to the selection circuit 240 (FIG. 81).

As described above, the difference data calculation unit 236 compresses the target image V_T into the third difference data in units of blocks.

次に、図１００は、第４の差分データを求める図８１の差分データ計算部２３７の構成例を示している。 Next, FIG. 100 shows a configuration example of the difference data calculation unit 237 of FIG. 81 for obtaining the fourth difference data.

図１００において、入力端子３１１には、図８１の入力端子２３３からの注目ブロック（の画像データ）が入力され、入力端子３１２には、図８１の入力端子２３１からのパストリファレンス画像V_Pが入力される。 In FIG. 100, the target block (image data) from the input terminal 233 in FIG. 81 is input to the input terminal 311, and the past reference image V _P from the input terminal 231 in FIG. 81 is input to the input terminal 312. Is done.

入力端子３１１から入力された注目ブロックは、相関最大位置検出部３１３および減算部３１５に供給される。 The block of interest input from the input terminal 311 is supplied to the maximum correlation position detection unit 313 and the subtraction unit 315.

入力端子３１２から入力されたパストリファレンス画像V_Pは、相関最大位置検出部３１３および切り出し部３１４に供給される。 The past reference image V _P input from the input terminal 312 is supplied to the correlation maximum position detection unit 313 and the clipping unit 314.

The maximum correlation position detection unit 313 detects, in the past reference image V_P from the input terminal 312, the positional relationship having a high correlation with the block of interest from the input terminal 311.

That is, the maximum correlation position detection unit 313 calculates the correlation information e₃(U',V') of Expression (11) for the block of interest, and detects the (U',V') that maximizes the correlation represented by e₃(U',V'), that is, the (U',V') that minimizes the value of e₃(U',V'). The maximum correlation position detection unit 313 then supplies that (U',V') to the cutout unit 314 and the variable length coding unit 318 as the previous motion vector (-U₁,-V₁), which represents the positional relationship of the region in the past reference image V_P having the highest correlation with the block of interest.

The cutout unit 314 cuts out the image data of the region of the past reference image V_P whose positional relationship with the block of interest is that represented by the previous motion vector (-U₁,-V₁) from the maximum correlation position detection unit 313, and supplies it to the subtraction unit 315 as the estimated value of the block of interest.

That is, the cutout unit 314 cuts out (obtains), as the estimated value Pre(x,y) of the block of interest, the image data of the region in the past reference image V_P shifted from the block of interest by the previous motion vector (-U₁,-V₁), and supplies it to the subtraction unit 315.

減算部３１５は、切り出し部３１４から供給される注目ブロックの推測値Pre(x,y)を用いて、入力端子３１１から供給される注目ブロック（の画像データ）Ic(x,y)を圧縮する。即ち、減算部３１５は、切り出し部３１４から供給される注目ブロックの推測値Pre(x,y)と、入力端子３１１から供給される注目ブロックIc(x,y)との、対応する画素の画素値どうしの差分を計算し、その結果得られる差分値Sub(x,y)=Ic(x,y)-Pre(x,y)を、変換部３１６に供給する。 The subtraction unit 315 compresses the target block (image data) Ic (x, y) supplied from the input terminal 311 using the estimated value Pre (x, y) of the target block supplied from the cutout unit 314. . That is, the subtraction unit 315 corresponds to the pixel of the corresponding pixel between the estimated value Pre (x, y) of the target block supplied from the clipping unit 314 and the target block Ic (x, y) supplied from the input terminal 311. The difference between the values is calculated, and the difference value Sub (x, y) = Ic (x, y) −Pre (x, y) obtained as a result is supplied to the conversion unit 316.
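The cutout and subtraction for this single-reference case can be sketched in Python as below. The function names and the list-of-rows image layout are assumptions for illustration; the motion vector is passed as (dx, dy), which corresponds to (-U₁,-V₁) for the past reference image.

```python
def cut_out(ref, by, bx, h, w, mv):
    """Return the h-by-w region of `ref` displaced by `mv` = (dx, dy) from
    the block whose top-left corner in the target image is (bx, by)."""
    dx, dy = mv
    return [row[bx + dx : bx + dx + w] for row in ref[by + dy : by + dy + h]]


def residual(block, pred):
    """Sub(x, y) = Ic(x, y) - Pre(x, y) for every pixel of the block."""
    return [[c - p for c, p in zip(brow, prow)] for brow, prow in zip(block, pred)]
```

Unlike the two-reference case, no averaging is involved: the displaced region of the single reference image is used directly as Pre(x,y).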

変換部３１６は、減算部３１５からの注目ブロックの差分値Sub(x,y)を、周波数空間上のデータに変換する。即ち、変換部３１６は、注目ブロックの差分値Sub(x,y)を、例えば、DCT変換し、その結果得られるDCT係数を、量子化部３１７に供給する。 The conversion unit 316 converts the difference value Sub (x, y) of the target block from the subtraction unit 315 into data on the frequency space. That is, the transform unit 316 performs, for example, DCT transform on the difference value Sub (x, y) of the block of interest, and supplies the resulting DCT coefficient to the quantization unit 317.

量子化部３１７は、変換部３１６からの注目ブロックのDCT係数を量子化し、その結果得られる量子化データを、可変長符号化部３１８に供給する。 The quantization unit 317 quantizes the DCT coefficient of the block of interest from the conversion unit 316 and supplies the quantized data obtained as a result to the variable length encoding unit 318.

可変長符号化部３１８は、量子化部３１７からの注目ブロックの量子化データを可変長符号化するとともに、相関最大位置検出部３１３からの前動きベクトル(-U₁,-V₁)を可変長符号化する。さらに、可変長符号化部３１８は、注目ブロックの量子化データの可変長符号に、前動きベクトル(-U₁,-V₁)の可変長符号を付加し、第４の差分データとして、出力端子３１９から、選択回路２４０（図８１）に出力する。なお、注目ブロック（の画像データ）Ic(x,y)とその推測値Pre(x,y)とが完全に一致する場合、可変長符号化部３１８において、注目ブロックの量子化データを可変長符号化することにより得られる可変長符号は、NULLを表すものとなり、第４の差分データは、前動きベクトル(-U₁,-V₁)の可変長符号だけとなる。 The variable length encoding unit 318 performs variable length encoding on the quantized data of the block of interest from the quantization unit 317 and also changes the previous motion vector (−U ₁ , −V ₁ ) from the correlation maximum position detection unit 313. Encode long. Further, the variable length coding unit 318 adds the variable length code of the previous motion vector (−U ₁ , −V ₁ ) to the variable length code of the quantized data of the block of interest, and outputs it as fourth difference data. The signal is output from the terminal 319 to the selection circuit 240 (FIG. 81). When the block of interest (image data) Ic (x, y) and its estimated value Pre (x, y) completely match, the variable length coding unit 318 converts the quantized data of the block of interest into a variable length. The variable length code obtained by encoding represents NULL, and the fourth difference data is only the variable length code of the previous motion vector (−U ₁ , −V ₁ ).

Next, the operation of the difference data calculation unit 237 in FIG. 100 will be described with reference to the flowchart in FIG. 101.

まず最初に、ステップＳ２４１において、注目ブロックおよびパストリファレンス画像V_Pが入力される。即ち、注目ブロックが、入力端子３１１から相関最大位置検出部３１３および減算部３１５に入力され、パストリファレンス画像V_Pが、入力端子３１２から、相関最大位置検出部３１３および切り出し部３１４に入力される。 First, in step S241, the target block and past reference image V _P are input. That is, the block of interest is input from the input terminal 311 to the maximum correlation position detection unit 313 and the subtraction unit 315, and the past reference image V _P is input from the input terminal 312 to the maximum correlation position detection unit 313 and the clipping unit 314. .

The process then proceeds from step S241 to S242, where the maximum correlation position detection unit 313 detects, for the block of interest from the input terminal 311, the positional relationship having a high correlation with the block of interest in the past reference image V_P from the input terminal 312. That is, the maximum correlation position detection unit 313 calculates the correlation information e₃(U',V') of Expression (11) for every possible value of (U',V') for the block of interest, and detects the (U',V') that maximizes the correlation represented by e₃(U',V'), that is, the (U',V') that minimizes the value of e₃(U',V'). The maximum correlation position detection unit 313 then supplies that (U',V') to the cutout unit 314 and the variable length coding unit 318 as the previous motion vector (-U₁,-V₁), which represents the positional relationship of the region in the past reference image V_P having the highest correlation with the block of interest, and the process proceeds from step S242 to S243.

In step S243, the cutout unit 314 cuts out, as the estimated value Pre(x,y) of the block of interest, the image data of the region of the past reference image V_P whose positional relationship with the block of interest is that represented by the previous motion vector (-U₁,-V₁) from the maximum correlation position detection unit 313, supplies it to the subtraction unit 315, and the process proceeds to step S244.

In step S244, the subtraction unit 315 calculates the differences between the pixel values of corresponding pixels of the estimated value Pre(x,y) of the block of interest supplied from the cutout unit 314 and the block of interest Ic(x,y) supplied from the input terminal 311, supplies the resulting difference value Sub(x,y) to the conversion unit 316, and the process proceeds to step S245.

In step S245, the conversion unit 316 applies a DCT to the difference value Sub(x,y) of the block of interest from the subtraction unit 315, supplies the resulting DCT coefficients to the quantization unit 317, and the process proceeds to step S246. In step S246, the quantization unit 317 quantizes the DCT coefficients of the block of interest from the conversion unit 316, supplies the resulting quantized data to the variable length coding unit 318, and the process proceeds to step S247. In step S247, the variable length coding unit 318 variable-length-codes the quantized data of the block of interest from the quantization unit 317, and also variable-length-codes the previous motion vector (-U₁,-V₁) from the maximum correlation position detection unit 313. Further, in step S247, the variable length coding unit 318 appends the variable length code of the previous motion vector (-U₁,-V₁) to the variable length code of the quantized data of the block of interest to form the fourth difference data. The process then proceeds to step S248, where the fourth difference data is output from the output terminal 319 to the selection circuit 240 (FIG. 81).

以上のようにして、差分データ計算部２３７では、ターゲット画像V_Tがブロック単位で、第４の差分データに圧縮される。 As described above, the difference data calculation unit 237 compresses the target image V _T into the fourth difference data in units of blocks.

次に、図１０２は、第５の差分データを求める図８１の差分データ計算部２３８の構成例を示している。 Next, FIG. 102 shows a configuration example of the difference data calculation unit 238 in FIG. 81 for obtaining fifth difference data.

In FIG. 102, the block of interest (its image data) from the input terminal 233 in FIG. 81 is input to the input terminal 331, and the future reference image V_F from the input terminal 232 in FIG. 81 is input to the input terminal 332.

入力端子３３１から入力された注目ブロックは、相関最大位置検出部３３３および減算部３３５に供給される。 The block of interest input from the input terminal 331 is supplied to the correlation maximum position detection unit 333 and the subtraction unit 335.

The future reference image V_F input from the input terminal 332 is supplied to the maximum correlation position detection unit 333 and the cutout unit 334.

The maximum correlation position detection unit 333 detects, in the future reference image V_F from the input terminal 332, the positional relationship having a high correlation with the block of interest from the input terminal 331.

That is, the maximum correlation position detection unit 333 calculates the correlation information e₄(U',V') of Expression (12) for the block of interest, and detects the (U',V') that maximizes the correlation represented by e₄(U',V'), that is, the (U',V') that minimizes the value of e₄(U',V'). The maximum correlation position detection unit 333 then supplies that (U',V') to the cutout unit 334 and the variable length coding unit 338 as the back motion vector (U₂,V₂), which represents the positional relationship of the region in the future reference image V_F having the highest correlation with the block of interest.

The cutout unit 334 cuts out the image data of the region of the future reference image V_F whose positional relationship with the block of interest is that represented by the back motion vector (U₂,V₂) from the maximum correlation position detection unit 333, and supplies it to the subtraction unit 335 as the estimated value of the block of interest.

That is, the cutout unit 334 cuts out (obtains), as the estimated value Pre(x,y) of the block of interest, the image data of the region in the future reference image V_F shifted from the block of interest by the back motion vector (U₂,V₂), and supplies it to the subtraction unit 335.

減算部３３５は、切り出し部３３４から供給される注目ブロックの推測値Pre(x,y)を用いて、入力端子３３１から供給される注目ブロック（の画像データ）Ic(x,y)を圧縮する。即ち、減算部３３５は、切り出し部３３４から供給される注目ブロックの推測値Pre(x,y)と、入力端子３３１から供給される注目ブロックIc(x,y)との、対応する画素の画素値どうしの差分を計算し、その結果得られる差分値Sub(x,y)=Ic(x,y)-Pre(x,y)を、変換部３３６に供給する。 The subtraction unit 335 compresses the target block (image data) Ic (x, y) supplied from the input terminal 331 using the estimated value Pre (x, y) of the target block supplied from the clipping unit 334. . That is, the subtraction unit 335 corresponds to the pixel of the corresponding pixel between the estimated value Pre (x, y) of the target block supplied from the cutout unit 334 and the target block Ic (x, y) supplied from the input terminal 331. The difference between the values is calculated, and the difference value Sub (x, y) = Ic (x, y) −Pre (x, y) obtained as a result is supplied to the conversion unit 336.

変換部３３６は、減算部３３５からの注目ブロックの差分値Sub(x,y)を、周波数空間上のデータに変換する。即ち、変換部３３６は、注目ブロックの差分値Sub(x,y)を、例えば、DCT変換し、その結果得られるDCT係数を、量子化部３３７に供給する。 The conversion unit 336 converts the difference value Sub (x, y) of the target block from the subtraction unit 335 into data on the frequency space. That is, the transform unit 336 performs, for example, DCT transform on the difference value Sub (x, y) of the block of interest, and supplies the resulting DCT coefficient to the quantization unit 337.

量子化部３３７は、変換部３３６からの注目ブロックのDCT係数を量子化し、その結果得られる量子化データを、可変長符号化部３３８に供給する。 The quantization unit 337 quantizes the DCT coefficient of the block of interest from the transform unit 336 and supplies the quantized data obtained as a result to the variable length coding unit 338.

可変長符号化部３３８は、量子化部３３７からの注目ブロックの量子化データを可変長符号化するとともに、相関最大位置検出部３３３からの後動きベクトル(U₂,V₂)を可変長符号化する。さらに、可変長符号化部３３８は、注目ブロックの量子化データの可変長符号に、後動きベクトル(U₂,V₂)の可変長符号を付加し、第５の差分データとして、出力端子３３９から、選択回路２４０（図８１）に出力する。なお、注目ブロック（の画像データ）Ic(x,y)とその推測値Pre(x,y)とが完全に一致する場合、可変長符号化部３３８において、注目ブロックの量子化データを可変長符号化することにより得られる可変長符号は、NULLを表すものとなり、第５の差分データは、後動きベクトル(U₂,V₂)の可変長符号だけとなる。 The variable length coding unit 338 performs variable length coding on the quantized data of the block of interest from the quantization unit 337 and also uses the variable length code for the back motion vector (U ₂ , V ₂ ) from the maximum correlation position detection unit 333. Turn into. Further, the variable length coding unit 338 adds the variable length code of the back motion vector (U ₂ , V ₂ ) to the variable length code of the quantized data of the block of interest, and outputs it as fifth difference data to the output terminal 339. To the selection circuit 240 (FIG. 81). When the block of interest (image data) Ic (x, y) and its estimated value Pre (x, y) completely match, the variable length coding unit 338 converts the quantized data of the block of interest into a variable length. The variable length code obtained by encoding represents NULL, and the fifth difference data is only the variable length code of the back motion vector (U ₂ , V ₂ ).

次に、図１０３のフローチャートを参照して、図１０２の差分データ計算部２３８の動作について説明する。 Next, the operation of the difference data calculation unit 238 in FIG. 102 will be described with reference to the flowchart in FIG. 103.

まず最初に、ステップＳ２５１において、注目ブロックおよびフューチャリファレンス画像V_Fが入力される。即ち、注目ブロックが、入力端子３３１から相関最大位置検出部３３３および減算部３３５に入力され、フューチャリファレンス画像V_Fが、入力端子３３２から、相関最大位置検出部３３３および切り出し部３３４に入力される。 First, in step S251, the block of interest and the future reference image V_F are input. That is, the block of interest is input from the input terminal 331 to the maximum correlation position detection unit 333 and the subtraction unit 335, and the future reference image V_F is input from the input terminal 332 to the maximum correlation position detection unit 333 and the cutout unit 334.

そして、ステップＳ２５１からＳ２５２に進み、相関最大位置検出部３３３は、入力端子３３１からの注目ブロックについて、入力端子３３２からのフューチャリファレンス画像V_Fにおいて、注目ブロックとの相関が高い位置関係を検出する。即ち、相関最大位置検出部３３３は、注目ブロックにつき、考えられるすべての(U',V')の値について、式（１２）の相関情報e₄(U',V')を計算し、その相関情報e₄(U',V')が表す相関を最大にする(U',V')、つまり、相関情報e₄(U',V')の値を最小にする(U',V')を検出する。そして、相関最大位置検出部３３３は、相関情報e₄(U',V')の値を最小にする(U',V')を、フューチャリファレンス画像V_Fにおいて、注目ブロックとの相関が最も高い領域の位置関係を表す後動きベクトル(U₂,V₂)として、切り出し部３３４および可変長符号化部３３８に供給し、ステップＳ２５２からＳ２５３に進む。 The process then proceeds from step S251 to S252, where the maximum correlation position detection unit 333 detects, for the block of interest from the input terminal 331, a positional relationship in the future reference image V_F from the input terminal 332 that has a high correlation with the block of interest. That is, for the block of interest, the maximum correlation position detection unit 333 calculates the correlation information e₄(U', V') of Expression (12) for all possible values of (U', V') and detects the (U', V') that maximizes the correlation represented by e₄(U', V'), that is, the (U', V') that minimizes the value of e₄(U', V'). The maximum correlation position detection unit 333 then supplies the (U', V') that minimizes the value of e₄(U', V') to the cutout unit 334 and the variable length coding unit 338 as the back motion vector (U₂, V₂), which represents the positional relationship of the region of the future reference image V_F having the highest correlation with the block of interest, and the process proceeds from step S252 to S253.

ステップＳ２５３では、切り出し部３３４は、注目ブロックとの位置関係が、相関最大位置検出部３３３からの後動きベクトル(U₂,V₂)が表す位置関係にある、フューチャリファレンス画像V_Fの領域の画像データを、注目ブロックの推測値Pre(x,y)として切り出し、減算部３３５に供給して、ステップＳ２５４に進む。 In step S253, the cutout unit 334 cuts out, as the estimated value Pre(x, y) of the block of interest, the image data of the region of the future reference image V_F whose positional relationship with the block of interest is the one represented by the back motion vector (U₂, V₂) from the maximum correlation position detection unit 333, supplies it to the subtraction unit 335, and the process proceeds to step S254.

ステップＳ２５４では、減算部３３５は、切り出し部３３４から供給される注目ブロックの推測値Pre(x,y)と、入力端子３３１から供給される注目ブロックIc(x,y)との、対応する画素の画素値どうしの差分を計算し、その結果得られる差分値Sub(x,y)を、変換部３３６に供給して、ステップＳ２５５に進む。 In step S254, the subtraction unit 335 calculates the differences between the pixel values of corresponding pixels of the estimated value Pre(x, y) of the block of interest supplied from the cutout unit 334 and the block of interest Ic(x, y) supplied from the input terminal 331, supplies the resulting difference value Sub(x, y) to the transform unit 336, and the process proceeds to step S255.

ステップＳ２５５では、変換部３３６は、減算部３３５からの注目ブロックの差分値Sub(x,y)をDCT変換し、その結果得られるDCT係数を、量子化部３３７に供給して、ステップＳ２５６に進む。ステップＳ２５６では、量子化部３３７は、変換部３３６からの注目ブロックのDCT係数を量子化し、その結果得られる量子化データを、可変長符号化部３３８に供給して、ステップＳ２５７に進む。ステップＳ２５７では、可変長符号化部３３８は、量子化部３３７からの注目ブロックの量子化データを可変長符号化するとともに、相関最大位置検出部３３３からの後動きベクトル(U₂,V₂)を可変長符号化する。さらに、ステップＳ２５７において、可変長符号化部３３８は、注目ブロックの量子化データの可変長符号に、後動きベクトル(U₂,V₂)の可変長符号を付加することにより、第５の差分データとして、ステップＳ２５８に進み、その第５の差分データを、出力端子３３９から、選択回路２４０（図８１）に出力する。 In step S255, the transform unit 336 DCT-transforms the difference value Sub(x, y) of the block of interest from the subtraction unit 335, supplies the resulting DCT coefficients to the quantization unit 337, and the process proceeds to step S256. In step S256, the quantization unit 337 quantizes the DCT coefficients of the block of interest from the transform unit 336, supplies the resulting quantized data to the variable length coding unit 338, and the process proceeds to step S257. In step S257, the variable length coding unit 338 variable-length codes the quantized data of the block of interest from the quantization unit 337 and also variable-length codes the back motion vector (U₂, V₂) from the maximum correlation position detection unit 333. Further, in step S257, the variable length coding unit 338 appends the variable length code of the back motion vector (U₂, V₂) to the variable length code of the quantized data of the block of interest to form the fifth difference data. The process then proceeds to step S258, where the fifth difference data is output from the output terminal 339 to the selection circuit 240 (FIG. 81).

以上のようにして、差分データ計算部２３８では、ターゲット画像V_Tがブロック単位で、第５の差分データに圧縮される。 As described above, the difference data calculation unit 238 compresses the target image V _T into the fifth difference data in units of blocks.
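The front half of this pipeline (steps S252 to S254) can be sketched as follows. This is a minimal illustration, not the patent's implementation: Expression (12) is not reproduced in this passage, so a sum of absolute differences (SAD) is assumed as a stand-in for e₄(U', V'), and the function names, the search range, and the convention that U is horizontal and V is vertical are all hypothetical. The DCT, quantization, and variable length coding stages are omitted.

```python
def sad(block, ref, top, left):
    """Sum of absolute differences between block and the same-sized region of ref at (top, left).

    Assumed stand-in for the correlation information e4(U', V'); smaller means higher correlation.
    """
    return sum(
        abs(block[y][x] - ref[top + y][left + x])
        for y in range(len(block))
        for x in range(len(block[0]))
    )


def find_back_motion_vector(block, ref, by, bx, search=2):
    """Return the (U2, V2) that minimises SAD within +/-search pixels of the block position (by, bx)."""
    h, w = len(block), len(block[0])
    best = None
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            top, left = by + v, bx + u
            if top < 0 or left < 0 or top + h > len(ref) or left + w > len(ref[0]):
                continue  # candidate region falls outside the reference image
            cost = sad(block, ref, top, left)
            if best is None or cost < best[0]:
                best = (cost, u, v)
    return best[1], best[2]


def predict_and_subtract(block, ref, by, bx, search=2):
    """Cut out Pre(x, y) from ref at the best vector and return (vector, Sub = Ic - Pre)."""
    u, v = find_back_motion_vector(block, ref, by, bx, search)
    h, w = len(block), len(block[0])
    pre = [[ref[by + v + y][bx + u + x] for x in range(w)] for y in range(h)]
    sub = [[block[y][x] - pre[y][x] for x in range(w)] for y in range(h)]
    return (u, v), sub
```

When the cut-out region matches the block exactly, `sub` is all zeros, which corresponds to the case described above in which the quantized-data code becomes NULL and only the vector is transmitted.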

次に、図１０４は、第６の差分データを求める図８１の差分データ計算部２３９の構成例を示している。 Next, FIG. 104 shows a configuration example of the difference data calculation unit 239 in FIG. 81 for obtaining sixth difference data.

図１０４において、入力端子３５１には、図８１の入力端子２３３からの注目ブロック（の画像データ）が入力される。入力端子３５１から入力された注目ブロックは、変換部３５２に供給される。 In FIG. 104, the block of interest (its image data) from the input terminal 233 in FIG. 81 is input to the input terminal 351. The block of interest input from the input terminal 351 is supplied to the transform unit 352.

変換部３５２は、入力端子３５１から入力された注目ブロック、即ち、注目ブロック（の画像データ）Ic(x,y)と、その推測値Pre(x,y)としての０との差分値Sub(x,y)=Ic(x,y)-Pre(x,y)=Ic(x,y)を、周波数空間上のデータに変換する。即ち、変換部３５２は、注目ブロックの画像データIc(x,y)に等しい差分値Sub(x,y)を、例えば、DCT変換し、その結果得られるDCT係数を、量子化部３５３に供給する。 The transform unit 352 transforms the block of interest input from the input terminal 351, that is, the difference value Sub(x, y) = Ic(x, y) - Pre(x, y) = Ic(x, y) between the block of interest (its image data) Ic(x, y) and 0 as its estimated value Pre(x, y), into data in the frequency space. That is, the transform unit 352 performs, for example, a DCT on the difference value Sub(x, y), which equals the image data Ic(x, y) of the block of interest, and supplies the resulting DCT coefficients to the quantization unit 353.

量子化部３５３は、変換部３５２からの注目ブロックのDCT係数を量子化し、その結果得られる量子化データを、可変長符号化部３５４に供給する。 The quantization unit 353 quantizes the DCT coefficient of the block of interest from the transform unit 352 and supplies the quantized data obtained as a result to the variable length coding unit 354.

可変長符号化部３５４は、量子化部３５３からの注目ブロックの量子化データを可変長符号化し、第６の差分データとして、出力端子３５５から、選択回路２４０（図８１）に出力する。 The variable length coding unit 354 performs variable length coding on the quantized data of the block of interest from the quantization unit 353, and outputs the result as sixth difference data from the output terminal 355 to the selection circuit 240 (FIG. 81).

次に、図１０５のフローチャートを参照して、図１０４の差分データ計算部２３９の動作について説明する。 Next, the operation of the difference data calculation unit 239 in FIG. 104 will be described with reference to the flowchart in FIG. 105.

まず最初に、ステップＳ２６１において、注目ブロックが入力される。即ち、注目ブロックが、入力端子３５１から変換部３５２に入力される。 First, in step S261, the block of interest is input. That is, the block of interest is input from the input terminal 351 to the transform unit 352.

そして、ステップＳ２６１からＳ２６２に進み、変換部３５２は、入力端子３５１から入力された注目ブロック、即ち、注目ブロック（の画像データ）Ic(x,y)と、その推測値Pre(x,y)としての０との差分値Sub(x,y)=Ic(x,y)-Pre(x,y)=Ic(x,y)をDCT変換し、その結果得られるDCT係数を、量子化部３５３に供給して、ステップＳ２６３に進む。ステップＳ２６３では、量子化部３５３は、変換部３５２からの注目ブロックのDCT係数を量子化し、その結果得られる量子化データを、可変長符号化部３５４に供給して、ステップＳ２６４に進む。ステップＳ２６４では、可変長符号化部３５４は、量子化部３５３からの注目ブロックの量子化データを可変長符号化し、その結果得られる可変長符号を、第６の差分データとして、ステップＳ２６５に進み、その第６の差分データを、出力端子３５５から、選択回路２４０（図８１）に出力する。 The process then proceeds from step S261 to S262, where the transform unit 352 DCT-transforms the block of interest input from the input terminal 351, that is, the difference value Sub(x, y) = Ic(x, y) - Pre(x, y) = Ic(x, y) between the block of interest (its image data) Ic(x, y) and 0 as its estimated value Pre(x, y), supplies the resulting DCT coefficients to the quantization unit 353, and the process proceeds to step S263. In step S263, the quantization unit 353 quantizes the DCT coefficients of the block of interest from the transform unit 352, supplies the resulting quantized data to the variable length coding unit 354, and the process proceeds to step S264. In step S264, the variable length coding unit 354 variable-length codes the quantized data of the block of interest from the quantization unit 353 and takes the resulting variable length code as the sixth difference data. The process then proceeds to step S265, where the sixth difference data is output from the output terminal 355 to the selection circuit 240 (FIG. 81).

以上のようにして、差分データ計算部２３９では、ターゲット画像V_Tがブロック単位で、第６の差分データに圧縮される。 As described above, the difference data calculation unit 239 compresses the target image V _T into the sixth difference data in units of blocks.
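Because the estimated value is fixed at 0, the sixth difference data amounts to transforming and quantizing the block itself. The following is a rough sketch under stated assumptions: the text only says "for example, DCT", so an orthonormal 2D DCT-II on a square block and a single uniform quantization step are assumed here, the function names are hypothetical, and the final variable length coding stage is left out.

```python
import math


def dct_2d(block):
    """Orthonormal 2D DCT-II of a square block (assumed transform; the text only says 'for example, DCT')."""
    n = len(block)

    def scale(k):
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    out = [[0.0] * n for _ in range(n)]
    for v in range(n):
        for u in range(n):
            s = 0.0
            for y in range(n):
                for x in range(n):
                    s += (block[y][x]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            out[v][u] = scale(u) * scale(v) * s
    return out


def quantize(coeffs, step):
    """Uniform quantization with a single step size (an assumption; no quantizer is specified here)."""
    return [[round(c / step) for c in row] for row in coeffs]


def sixth_difference_quantized(block, step=8):
    """Quantized DCT coefficients of Sub = Ic - Pre with Pre = 0, i.e. of the block itself."""
    sub = [[pixel - 0 for pixel in row] for row in block]  # Pre(x, y) = 0
    return quantize(dct_2d(sub), step)
```

A flat block concentrates all of its energy in the DC coefficient, so only one quantized value is non-zero.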

次に、図１０６は、図８１の差分データ計算部２３４乃至２３９が選択回路２４０に対して出力する第１乃至第６の差分データそれぞれのデータ構造を、模式的に示している。 Next, FIG. 106 schematically illustrates a data structure of each of the first to sixth difference data output from the difference data calculation units 234 to 239 of FIG. 81 to the selection circuit 240.

差分データ計算部２３４が選択回路２４０に対して出力する第１の差分データは、図１０６上から１番目に示すように、注目ブロックとその推測値との差分値をDCT処理し、さらに量子化して得られる量子化データの可変長符号からなる。即ち、第１の差分データには、位置関係ベクトル(U,V)は含まれない。 As shown first in FIG. 106, the first difference data output from the difference data calculation unit 234 to the selection circuit 240 is obtained by subjecting the difference value between the target block and its estimated value to DCT processing and further quantizing the difference data. It consists of variable length codes of quantized data obtained in this way. That is, the first difference data does not include the positional relationship vector (U, V).

差分データ計算部２３５が選択回路２４０に対して出力する第２の差分データは、図１０６上から２番目に示すように、注目ブロックとその推測値との差分値をDCT処理し、さらに量子化して得られる量子化データの可変長符号と、相関最大ベクトル(U,V)の可変長符号とからなる。 As shown second from the top in FIG. 106, the second difference data that the difference data calculation unit 235 outputs to the selection circuit 240 consists of the variable length code of the quantized data, obtained by applying DCT processing to the difference value between the block of interest and its estimated value and then quantizing the result, and the variable length code of the maximum correlation vector (U, V).

差分データ計算部２３６が選択回路２４０に対して出力する第３の差分データは、図１０６上から３番目に示すように、注目ブロックとその推測値との差分値をDCT処理し、さらに量子化して得られる量子化データの可変長符号と、前動きベクトル(-U₁,-V₁)および後動きベクトル(U₂,V₂)の可変長符号とからなる。 As shown third from the top in FIG. 106, the third difference data that the difference data calculation unit 236 outputs to the selection circuit 240 consists of the variable length code of the quantized data, obtained by applying DCT processing to the difference value between the block of interest and its estimated value and then quantizing the result, and the variable length codes of the previous motion vector (-U₁, -V₁) and the back motion vector (U₂, V₂).

差分データ計算部２３７が選択回路２４０に対して出力する第４の差分データは、図１０６上から４番目に示すように、注目ブロックとその推測値との差分値をDCT処理し、さらに量子化して得られる量子化データの可変長符号と、前動きベクトル(-U₁,-V₁)の可変長符号とからなる。 As shown fourth from the top in FIG. 106, the fourth difference data that the difference data calculation unit 237 outputs to the selection circuit 240 consists of the variable length code of the quantized data, obtained by applying DCT processing to the difference value between the block of interest and its estimated value and then quantizing the result, and the variable length code of the previous motion vector (-U₁, -V₁).

差分データ計算部２３８が選択回路２４０に対して出力する第５の差分データは、図１０６上から５番目に示すように、注目ブロックとその推測値との差分値をDCT処理し、さらに量子化して得られる量子化データの可変長符号と、後動きベクトル(U₂,V₂)の可変長符号とからなる。 As shown fifth from the top in FIG. 106, the fifth difference data that the difference data calculation unit 238 outputs to the selection circuit 240 consists of the variable length code of the quantized data, obtained by applying DCT processing to the difference value between the block of interest and its estimated value and then quantizing the result, and the variable length code of the back motion vector (U₂, V₂).

差分データ計算部２３９が選択回路２４０に対して出力する第６の差分データは、図１０６上から６番目に示すように、注目ブロックとその推測値としての０との差分値、即ち、注目ブロックをDCT処理し、さらに量子化して得られる量子化データの可変長符号からなる。 The sixth difference data output from the difference data calculation unit 239 to the selection circuit 240 is the difference value between the target block and 0 as its estimated value, that is, the target block, as shown in the sixth from the top in FIG. It consists of variable length codes of quantized data obtained by DCT processing and further quantization.

なお、注目ブロックとその推測値との差分値をDCT処理し、さらに量子化して得られる量子化データの可変長符号は、上述したように、注目ブロックとその推測値とが一致していれば、NULLとなる。 Note that the variable length code of the quantized data obtained by performing DCT processing on the difference value between the target block and its estimated value, and further quantizing it, as described above, if the target block and its estimated value match. , NULL.

次に、図１０７は、図８１の選択回路２４０が出力端子２４１から差分圧縮データとして出力する第１乃至第６の差分データのデータ構造を、模式的に示している。 Next, FIG. 107 schematically shows the data structure of first to sixth difference data output from the output terminal 241 as the differentially compressed data by the selection circuit 240 of FIG.

選択回路２４０は、注目ブロックについて、差分データ計算部２３４乃至２３９が出力する第１乃至第６の差分データのうちの１つを、選択差分データとして選択し、その選択差分データが、第１乃至第６の差分データのうちのいずれであるかを表すケースIDを付加して、出力端子２４１から、差分圧縮データとして出力する。 For the block of interest, the selection circuit 240 selects one of the first to sixth difference data output by the difference data calculation units 234 to 239 as the selected difference data, adds a case ID indicating which of the first to sixth difference data the selected difference data is, and outputs the result from the output terminal 241 as differential compressed data.

図１０７では、第１乃至第６の差分データに対し、ケースIDとして、1,2,3,4,5,6が、それぞれ付加されるようになっている。 In FIG. 107, 1, 2, 3, 4, 5, and 6 are added as case IDs to the first to sixth difference data, respectively.
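The six layouts of FIGS. 106 and 107 differ only in which vectors accompany the quantized-data code, so they can be modelled as one record type keyed by case ID. The sketch below is illustrative only: the class name, the field names, and the string keys for the vectors are hypothetical, and the codes are held as plain bytes rather than as actual variable length codes.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

# Vectors that accompany the quantized-data code for each case ID (per FIG. 106 and FIG. 107).
VECTORS_BY_CASE = {
    1: (),                             # first difference data: no vector
    2: ("max_correlation",),           # maximum correlation vector (U, V)
    3: ("prev_motion", "back_motion"), # (-U1, -V1) and (U2, V2)
    4: ("prev_motion",),               # (-U1, -V1)
    5: ("back_motion",),               # (U2, V2)
    6: (),                             # estimate is 0: no vector
}


@dataclass
class DifferenceCompressedData:
    case_id: int
    quantized_code: bytes  # empty bytes model the NULL code of a perfect prediction
    vectors: Dict[str, Tuple[int, int]] = field(default_factory=dict)

    def __post_init__(self):
        if self.case_id not in VECTORS_BY_CASE:
            raise ValueError(f"unknown case ID {self.case_id}")
        expected = set(VECTORS_BY_CASE[self.case_id])
        if set(self.vectors) != expected:
            raise ValueError(f"case {self.case_id} expects vectors {sorted(expected)}")
```

A decoder reading such a record would first look at `case_id` to learn which vectors, if any, follow the quantized-data code.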

次に、図１０８のフローチャートを参照して、図７９の送信装置１の処理について説明する。 Next, the processing of the transmission device 1 in FIG. 79 will be described with reference to the flowchart in FIG. 108.

図７９の入力端子２１１から帯域制限フィルタ部２１２には、240fps動画データが供給される。 240 fps moving image data is supplied from the input terminal 211 in FIG. 79 to the band limiting filter unit 212.

帯域制限フィルタ部２１２は、ステップＳ２８１において、入力端子２１１からの240fps動画データを対象に、上述したような人間の視覚特性を考慮した必要な情報のみを残すフィルタリングを行い、その結果得られる240fps動画データを、分離回路２１３に供給して、ステップＳ２８２に進む。 In step S281, the band limiting filter unit 212 filters the 240 fps moving image data from the input terminal 211 so as to leave only the information that is necessary in view of the human visual characteristics described above, supplies the resulting 240 fps moving image data to the separation circuit 213, and the process proceeds to step S282.

ステップＳ２８２では、分離回路２１３は、帯域制限フィルタ部２１２からの240fps動画データを、60fps動画データと、240fps動画データから60fps動画データを除いた残りの240-60fps動画データとに分離し、60fps動画データを、圧縮回路２１４に供給するとともに、240-60fps動画データを、ターゲット画像として、差分情報抽出部２１７に供給する。 In step S282, the separation circuit 213 separates the 240 fps moving image data from the band limiting filter unit 212 into 60 fps moving image data and the remaining 240-60 fps moving image data obtained by removing 60 fps moving image data from the 240 fps moving image data. Data is supplied to the compression circuit 214 and 240-60 fps moving image data is supplied to the difference information extraction unit 217 as a target image.
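The separation in step S282 can be sketched as a simple split by frame index. This assumes the frame pattern of FIG. 80, in which every fourth frame (f₁, f₅, f₉, ...) forms the 60 fps moving image data and the remaining frames form the 240-60 fps moving image data; the function name is hypothetical and frames are represented by arbitrary Python objects.

```python
def separate(frames_240):
    """Split a 240 fps frame sequence into (60 fps frames, 240-60 fps frames).

    Assumes the sequence starts on a 60 fps frame, as with f1 in FIG. 80.
    """
    frames_60 = frames_240[0::4]  # f1, f5, f9, ...
    frames_240_60 = [f for i, f in enumerate(frames_240) if i % 4 != 0]
    return frames_60, frames_240_60
```

The first list goes to the compression circuit 214 and the second is handed to the difference information extraction unit 217 as target images.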

そして、ステップＳ２８３に進み、圧縮回路２１４は、分離回路２１３から供給される60fps動画データをエンコード（圧縮）し、その結果得られるビットストリームを出力する。このビットストリームは、解凍回路２１６と出力端子２１５に供給される。 In step S283, the compression circuit 214 encodes (compresses) the 60 fps moving image data supplied from the separation circuit 213, and outputs the resultant bit stream. This bit stream is supplied to the decompression circuit 216 and the output terminal 215.

その後、ステップＳ２８４に進み、解凍回路２１６は、圧縮回路２１４からのビットストリームをローカルデコードし、そのローカルデコードの結果得られる60fps動画データを、リファレンス画像として、差分情報抽出部２１７に供給して、ステップＳ２８５に進む。 Thereafter, the process proceeds to step S284, and the decompression circuit 216 locally decodes the bit stream from the compression circuit 214, and supplies 60 fps moving image data obtained as a result of the local decoding as a reference image to the difference information extraction unit 217, The process proceeds to step S285.

ステップＳ２８５では、差分情報抽出部２１７は、分離回路２１３からのターゲット画像である240-60fps動画データを、解凍回路２１６からのリファレンス画像である60fps動画データを用いて圧縮し、即ち、リファレンス画像（60fps動画データ）に対するターゲット画像（240-60fps動画データ）の差分に関する差分圧縮データを求めることにより、ターゲット画像を圧縮し、その差分圧縮データを出力端子２１８に供給して、ステップＳ２８６に進む。 In step S285, the difference information extraction unit 217 compresses the 240-60 fps moving image data that is the target image from the separation circuit 213 using the 60 fps moving image data that is the reference image from the decompression circuit 216, that is, the reference image ( The difference compressed data relating to the difference between the target image (240-60 fps moving image data) with respect to the 60 fps moving image data) is obtained, the target image is compressed, the difference compressed data is supplied to the output terminal 218, and the process proceeds to step S286.

ステップＳ２８６では、60fps動画データ（リファレンス画像）のエンコード結果としてのビットストリームが、出力端子２１５から出力されるとともに、240-60fps動画データの圧縮結果としての差分圧縮データが、出力端子２１８から出力される。 In step S286, the bit stream obtained by encoding the 60 fps moving image data (reference image) is output from the output terminal 215, and the differential compressed data obtained by compressing the 240-60 fps moving image data is output from the output terminal 218.

以上のように、図７９の送信装置１においては、60fps動画データのエンコード結果としてのビットストリームと、240-60fps動画データの圧縮結果としての差分圧縮データとが、別々に独立に出力されるので、受信装置２（図２４）では、例えば、60fps動画データのエンコード結果としてのビットストリームだけを受信することで、60fps動画データを復元することができる。また、受信装置２では、例えば、60fps動画データのエンコード結果としてのビットストリームと、240-60fps動画データの圧縮結果としての差分圧縮データとを受信することで、60fps動画データを復元することもできるし、240fps動画データも復元することができる。即ち、いわゆる、テンポラルスケーラビリティを持たせることが出来る。 As described above, in the transmission device 1 of FIG. 79, the bit stream as the encoding result of the 60 fps moving image data and the differential compressed data as the compression result of the 240-60 fps moving image data are separately output independently. In the receiving apparatus 2 (FIG. 24), for example, by receiving only a bit stream as a result of encoding 60 fps moving picture data, the 60 fps moving picture data can be restored. The receiving device 2 can also restore 60 fps moving image data by receiving, for example, a bit stream as a result of encoding 60 fps moving image data and differentially compressed data as a compression result of 240-60 fps moving image data. And 240fps video data can be restored. In other words, so-called temporal scalability can be provided.

次に、図１０９のフローチャートを参照して、差分情報抽出部２１７が、図１０８のステップＳ２８５で行うターゲット画像（240-60fps動画データ）の圧縮について説明する。 Next, the compression of the target image (240-60 fps moving image data) that the difference information extraction unit 217 performs in step S285 of FIG. 108 will be described with reference to the flowchart of FIG. 109.

差分情報抽出部２１７（図８１）では、ステップＳ２９１において、差分データ計算部２３４乃至２３９が、注目ブロックについて、図１０６に示した第１乃至第６の差分データを、図９４乃至図１０５で説明したようにして求め、選択回路２４０に供給して、ステップＳ２９２に進む。 In the difference information extraction unit 217 (FIG. 81), in step S291, the difference data calculation units 234 to 239 obtain, for the block of interest, the first to sixth difference data shown in FIG. 106 in the manner described with reference to FIGS. 94 to 105 and supply them to the selection circuit 240, and the process proceeds to step S292.

ステップＳ２９２では、選択回路２４０は、差分データ計算部２３４乃至２３９それぞれからの注目ブロックについての第１乃至第６の差分データのうちの、データ量が最小のものを、選択差分データとして選択し、ステップＳ２９３に進む。 In step S292, the selection circuit 240 selects, as selection difference data, the one with the smallest data amount from among the first to sixth difference data for the target block from the difference data calculation units 234 to 239, Proceed to step S293.

ここで、注目ブロックについての第１乃至第６の差分データのうちの第１の差分データのデータ量が最小である場合には、選択回路２４０において、第１の差分データを求めるときに計算される式（９）の相関情報e₁(U',V')の最小値と、２番目に小さい値との差が、ある閾値以下であるかどうかを判定し、式（９）の相関情報e₁(U',V')の最小値と、２番目に小さい値との差が、ある閾値以下であるときには、第１の差分データに代えて、第２乃至第６の差分データのうちのデータ量が最小のもの（第１乃至第６の差分データのうちのデータ量が２番目に小さいもの）を、選択差分データとして選択することができる。この場合、上述したように、第１の差分データを、元の注目ブロックに復元することができなくなることを、より強固に防止することができる。 Here, when the first difference data has the smallest data amount among the first to sixth difference data for the block of interest, the selection circuit 240 can determine whether the difference between the minimum value and the second smallest value of the correlation information e₁(U', V') of Expression (9), which is calculated when the first difference data is obtained, is equal to or less than a certain threshold. When that difference is equal to or less than the threshold, the selection circuit 240 can select, in place of the first difference data, the one with the smallest data amount among the second to sixth difference data (that is, the one with the second smallest data amount among the first to sixth difference data) as the selected difference data. As described above, this more reliably prevents the situation in which the first difference data can no longer be restored to the original block of interest.

ステップＳ２９３では、選択回路２４０は、選択差分データに対して、対応するケースIDを付加し、これにより、図１０７に示したいずれかのデータ構造の差分圧縮データを、注目ブロックの圧縮結果として、出力端子２４１から出力する。 In step S293, the selection circuit 240 adds a corresponding case ID to the selected difference data, and thereby converts the difference compressed data having one of the data structures shown in FIG. 107 as the compression result of the block of interest. Output from the output terminal 241.

即ち、これにより、出力端子２４１からは、ブロックごとに、第１乃至第６の差分データのうちの最適なものが出力される。 That is, as a result, the optimum one of the first to sixth difference data is output from the output terminal 241 for each block.
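The selection rule of steps S292 and S293, including the optional fallback away from the first difference data, might be sketched as follows. The function name and parameters are hypothetical, the data amount is approximated by the byte length of each candidate, and ties are broken by the lowest case ID, which is an assumption the text does not address.

```python
def select_difference_data(candidates, e1_min=None, e1_second=None, threshold=None):
    """Pick the candidate with the smallest data amount and return (case_id, data).

    candidates maps case ID (1 to 6) to the encoded difference data as bytes.
    If case 1 wins but the two smallest e1 values differ by no more than
    threshold, the smallest of cases 2 to 6 is chosen instead.
    """
    def smallest(case_ids):
        # Tie-break on case ID is an assumption made for determinism.
        return min(case_ids, key=lambda i: (len(candidates[i]), i))

    best = smallest(candidates)
    ambiguous = (
        best == 1
        and threshold is not None
        and e1_min is not None
        and e1_second is not None
        and e1_second - e1_min <= threshold
    )
    if ambiguous:
        best = smallest(i for i in candidates if i != 1)
    return best, candidates[best]
```

The returned case ID is what the selection circuit 240 would attach to the selected difference data before output.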

以上のように、図７９の送信装置１では、入力端子２１１から入力した240fps動画データを圧縮し、２つの圧縮結果（第１と第２の圧縮結果）を出力する。第１の圧縮結果は、圧縮回路２１４により、240fps動画データから分離した60fps動画データを圧縮して得られるビットストリームである。このビットストリームは、出力端子２１５から出力される。第２の圧縮結果は、差分情報抽出部２１７により、240fps動画データから分離した240-60fps動画データを、60fps動画データとの差分をとることにより圧縮した差分圧縮データである。差分圧縮データは、ブロック単位で可変長符号化されており、出力端子２１８から出力される。 As described above, the transmission apparatus 1 in FIG. 79 compresses 240 fps moving image data input from the input terminal 211 and outputs two compression results (first and second compression results). The first compression result is a bit stream obtained by compressing 60 fps moving image data separated from 240 fps moving image data by the compression circuit 214. This bit stream is output from the output terminal 215. The second compression result is differentially compressed data obtained by compressing the 240-60 fps moving image data separated from the 240 fps moving image data by taking the difference from the 60 fps moving image data by the difference information extraction unit 217. The differentially compressed data is variable-length encoded in units of blocks and is output from the output terminal 218.

また、図７９の送信装置１では、帯域制限フィルタ部２１２において、入力端子２１１からの240fps動画データに対して、人間の視覚特性を考慮した通過帯域のフィルタ、即ち、１／６０秒程度のローパスフィルタ（ただし、被写体の動きを考慮して適応的にフィルタ係数（タップ係数）を変更している）をかけている。人間の視覚特性により、１／６０秒程度の範囲内で不規則に移動する物体には人間の目は追従することが出来ずに、１／６０秒の間積分された画素値を認識することから、このようなローパスフィルタ（帯域制限フィルタ）をかけても人間には画質劣化したと感じない。このローパスフィルタによるフィルタリングの結果得られる動画データは、およそ１／６０秒（つまり４／２４０秒）程度の時間内では、被写体が一定速度で動いているような動画データとなる。なぜなら、１／６０秒という間隔よりも短い時間に高速で速度が変化するような不規則な移動はローパスフィルタにより平均化されるからである。このような動画データは、まさに、式（９）の相関情報e₁(U',V')や式（１０）の相関情報e₂(U',V')を最小とする(U',V')により、ターゲット画像内の注目ブロックの推測値を求めることに適している。なぜなら、式（９）の相関情報e₁(U',V')や式（１０）の相関情報e₂(U',V')を最小にする(U',V')を求める場合には、図８４あるいは図８７で説明したが、被写体が一定速度で移動していることを前提としているからである。 Further, in the transmission device 1 of FIG. 79, the band limiting filter unit 212 performs a passband filter considering human visual characteristics with respect to 240 fps moving image data from the input terminal 211, that is, a low pass of about 1/60 seconds. A filter (however, the filter coefficient (tap coefficient) is adaptively changed in consideration of the movement of the subject) is applied. Due to human visual characteristics, the human eye cannot follow an object that moves irregularly within a range of about 1/60 seconds, and recognizes the pixel value integrated for 1/60 seconds. Therefore, even if such a low-pass filter (band limiting filter) is applied, humans do not feel that the image quality has deteriorated. The moving image data obtained as a result of filtering by the low-pass filter is moving image data in which the subject is moving at a constant speed within a time of about 1/60 seconds (that is, 4/240 seconds). This is because an irregular movement whose speed changes at a high speed in a time shorter than an interval of 1/60 seconds is averaged by the low-pass filter. 
Such moving image data is well suited to obtaining the estimated value of the block of interest in the target image from the (U', V') that minimizes the correlation information e₁(U', V') of Expression (9) or the correlation information e₂(U', V') of Expression (10). This is because, as described with reference to FIG. 84 or FIG. 87, finding the (U', V') that minimizes e₁(U', V') of Expression (9) or e₂(U', V') of Expression (10) presupposes that the subject is moving at a constant speed.

次に、図１１０は、送信装置１が図７９に示したように構成される場合の、図２４の受信装置２の構成例を示している。 Next, FIG. 110 illustrates a configuration example of the reception device 2 in FIG. 24 when the transmission device 1 is configured as illustrated in FIG. 79.

受信装置２には、図７９の送信装置１が出力する、60fps動画データのエンコード結果としてのビットストリームと、240-60fps動画データの圧縮結果としての差分圧縮データとが供給され、ビットストリームは入力端子３６１から、差分圧縮データは入力端子３６２から、それぞれ、受信装置２に入力される。そして、入力端子３６１から解凍回路３６３に対して、ビットストリームが供給されるとともに、入力端子３６２から差分情報復元部３６４に対して、差分圧縮データが供給される。 The receiving device 2 is supplied with the bit stream obtained by encoding the 60 fps moving image data and the differential compressed data obtained by compressing the 240-60 fps moving image data, both output by the transmission device 1 of FIG. 79. The bit stream is input to the receiving device 2 from the input terminal 361, and the differential compressed data is input from the input terminal 362. The bit stream is then supplied from the input terminal 361 to the decompression circuit 363, and the differential compressed data is supplied from the input terminal 362 to the difference information restoration unit 364.

なお、60fps動画データのエンコード結果としてのビットストリームと、240-60fps動画データの圧縮結果としての差分圧縮データとが多重化データに多重化されている場合には、その多重化データから、ビットストリームと差分圧縮データとが分離され、入力端子３６１と３６２にそれぞれ入力される。 When the bit stream obtained by encoding the 60 fps moving image data and the differential compressed data obtained by compressing the 240-60 fps moving image data have been multiplexed into multiplexed data, the bit stream and the differential compressed data are separated from the multiplexed data and input to the input terminals 361 and 362, respectively.

解凍回路３６３は、図７９の解凍回路２１６と同様に、入力端子３６１からのビットストリームをデコード（解凍）し、60fps動画データを復元する。この60fps動画データは、リファレンス画像として、差分情報復元部３６４に供給されるとともに、合成部３６５に供給される。 Similar to the decompression circuit 216 in FIG. 79, the decompression circuit 363 decodes (decompresses) the bit stream from the input terminal 361 and restores 60 fps moving image data. The 60 fps moving image data is supplied as a reference image to the difference information restoring unit 364 and also supplied to the synthesizing unit 365.

差分情報復元部３６４は、解凍回路３６３からのリファレンス画像としての60fps動画データを用いて、入力端子３６２からの差分圧縮データを、ブロック単位で、240-60fps動画データに復元する。この240-60fps動画データは、合成部３６５に供給される。 The difference information restoring unit 364 restores the differentially compressed data from the input terminal 362 to 240-60 fps moving image data in units of blocks using the 60 fps moving image data as the reference image from the decompression circuit 363. The 240-60 fps moving image data is supplied to the synthesis unit 365.

合成部３６５は、解凍回路３６３からの60fps動画データと、差分情報復元部３６４からの240-60fps動画データとを合成し、これにより、240fps動画データを復元して、出力端子３６６から表示装置３（図２４）に出力する。 The synthesizing unit 365 synthesizes the 60 fps moving image data from the decompression circuit 363 and the 240-60 fps moving image data from the difference information restoring unit 364, thereby restoring the 240 fps moving image data and from the output terminal 366 to the display device 3. (FIG. 24).

次に、図１１１のフローチャートを参照して、図１１０の受信装置２の処理について説明する。 Next, the processing of the receiving device 2 in FIG. 110 will be described with reference to the flowchart in FIG. 111.

解凍回路３６３には、入力端子３６１から、60fps動画データのエンコード結果としてのビットストリームが供給される。また、差分情報復元部３６４には、入力端子３６２から、240-60fps動画データの圧縮結果としての差分圧縮データが供給される。 A bit stream as an encoding result of 60 fps moving image data is supplied from the input terminal 361 to the decompression circuit 363. Also, the differential information restoring unit 364 is supplied with differential compressed data as a compression result of 240-60 fps moving image data from the input terminal 362.

そして、ステップＳ３０１では、解凍回路３６３は、入力端子３６１からのビットストリームをデコードし、その結果得られる60fps動画データを、リファレンス画像として、差分情報復元部３６４に供給するとともに、合成部３６５に供給して、ステップＳ３０２に進む。 In step S301, the decompression circuit 363 decodes the bit stream from the input terminal 361, and supplies the 60 fps moving image data obtained as a result to the difference information restoration unit 364 as a reference image and also to the synthesis unit 365. Then, the process proceeds to step S302.

ステップＳ３０２では、差分情報復元部３６４は、解凍回路３６３からのリファレンス画像としての60fps動画データを用いて、入力端子３６２からの差分圧縮データを、ブロック単位で、240-60fps動画データに復元し、合成部３６５に供給する。 In step S302, the difference information restoration unit 364 uses the 60fps moving image data as the reference image from the decompression circuit 363 to restore the differential compressed data from the input terminal 362 to 240-60fps moving image data in units of blocks. This is supplied to the synthesis unit 365.

そして、ステップＳ３０２からＳ３０３に進み、合成部３６５は、解凍回路３６３からの60fps動画データと、差分情報復元部３６４からの240-60fps動画データとを合成し、これにより、240fps動画データを復元して、出力端子３６６から表示装置３（図２４）に出力する。 Then, the process proceeds from step S302 to S303, and the combining unit 365 combines the 60 fps moving image data from the decompression circuit 363 and the 240-60 fps moving image data from the difference information restoring unit 364, thereby reconstructing the 240 fps moving image data. Then, the data is output from the output terminal 366 to the display device 3 (FIG. 24).

即ち、解凍回路３６３は、入力端子３６１からのビットストリームから、例えば、図８０の上から２番目に示した60fps動画データのフレーム・・・，f₁，f₅，f₉，f₁₃，・・・（の画像データ）を復元し、差分情報復元部および合成部３６５に供給する。 That is, the decompression circuit 363 restores, from the bit stream from the input terminal 361, for example, the frames ..., f₁, f₅, f₉, f₁₃, ... (their image data) of the 60 fps moving image data shown second from the top in FIG. 80, and supplies them to the difference information restoration unit 364 and the synthesis unit 365.

また、差分情報復元部３６４では、解凍回路３６３からの60fps動画データをリファレンス画像として用い、入力端子３６２からの差分圧縮データを、例えば、図８０の一番下に示した240-60fps動画データのフレーム・・・，f₂，f₃，f₄，f₆，f₇，f₈，f₁₀，f₁₁，f₁₂，・・・（の画像データ）に復元し、合成部３６５に供給する。 The difference information restoration unit 364 uses the 60 fps moving image data from the decompression circuit 363 as a reference image, restores the differential compressed data from the input terminal 362 to, for example, the frames ..., f₂, f₃, f₄, f₆, f₇, f₈, f₁₀, f₁₁, f₁₂, ... (their image data) of the 240-60 fps moving image data shown at the bottom of FIG. 80, and supplies them to the synthesis unit 365.

合成部３６５は、解凍回路３６３からの60fps動画データの１フレームを選択し、次に、差分情報復元部３６４からの240-60fps動画データの３フレームを選択することを繰り返し、選択したフレームを、1/240秒ごとに、出力端子３６６から出力する。即ち、これにより、60fps動画データと240-60fps動画データとを合成した、図８０の一番上に示したような240fps動画データが、出力端子３６６から出力される。 The synthesizing unit 365 repeatedly selects one frame of 60 fps moving image data from the decompression circuit 363, and then repeatedly selects three frames of 240-60 fps moving image data from the difference information restoring unit 364. Output from the output terminal 366 every 1/240 seconds. That is, by this, the 240 fps moving image data as shown at the top of FIG. 80, which is a combination of the 60 fps moving image data and the 240-60 fps moving image data, is output from the output terminal 366.
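The interleaving performed by the synthesis unit 365 (one 60 fps frame followed by three 240-60 fps frames) can be illustrated with a short sketch. The function name is hypothetical, frames are stood in for by arbitrary Python objects, and output timing (one frame every 1/240 second) is not modelled.

```python
def synthesize(frames_60, frames_240_60):
    """Interleave one 60 fps frame with the next three 240-60 fps frames, repeatedly."""
    out = []
    for i, base_frame in enumerate(frames_60):
        out.append(base_frame)                        # f1, f5, f9, ...
        out.extend(frames_240_60[3 * i:3 * i + 3])    # the three frames that follow it
    return out
```

With the frame numbering of FIG. 80, feeding in f₁, f₅, f₉ and f₂, f₃, f₄, f₆, f₇, f₈, f₁₀, f₁₁, f₁₂ yields the frames in their original 240 fps order.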

次に、図１１２のフローチャートを参照して、図１１０の差分情報復元部３６４が図１１１のステップＳ３０２で行う、240-60fps動画データのデコードについて説明する。 Next, with reference to the flowchart of FIG. 112, the decoding of 240-60 fps moving image data performed by the difference information restoring unit 364 of FIG. 110 in step S302 of FIG. 111 will be described.

ステップＳ３１１において、差分情報復元部３６４は、解凍回路３６３（図１１０）からの60fps動画データをリファレンス画像として用い、入力端子３６２（図１１０）からの差分圧縮データを、240-60fps動画データに復元する、後述する処理を行うことで、240-60fps動画データの各フレーム、即ち、ターゲット画像の画素値を、ブロック単位で求め、ステップＳ３１２に進む。 In step S311, the difference information restoration unit 364 uses the 60 fps moving image data from the decompression circuit 363 (FIG. 110) as a reference image, and restores the difference compressed data from the input terminal 362 (FIG. 110) to 240-60 fps moving image data. By performing the processing described later, each frame of the 240-60 fps moving image data, that is, the pixel value of the target image is obtained for each block, and the process proceeds to step S312.

ステップＳ３１２では、差分情報復元部３６４は、ステップＳ３１１で画素値が求められたブロックを集めることで、240-60fps動画データの各フレームの画像データを得て、ステップＳ３１３に進む。ステップＳ３１３では、差分情報復元部３６４は、ステップＳ３１２で得た240-60fps動画データを、合成部３６５（図１１０）に出力する。 In step S312, the difference information restoration unit 364 obtains image data of each frame of the 240-60 fps moving image data by collecting the blocks whose pixel values are obtained in step S311 and proceeds to step S313. In step S313, the difference information restoration unit 364 outputs the 240-60 fps moving image data obtained in step S312 to the synthesis unit 365 (FIG. 110).

次に、図１１３は、図１１０の差分情報復元部３６４の構成例を示している。 Next, FIG. 113 shows a configuration example of the difference information restoration unit 364 of FIG. 110.

差分データ記憶部３７１には、入力端子３６２（図１１０）から差分圧縮データが供給される。差分データ記憶部３７１は、入力端子３６２からの差分圧縮データを一時記憶する。 The differential data storage unit 371 is supplied with differential compressed data from the input terminal 362 (FIG. 110). The difference data storage unit 371 temporarily stores the difference compressed data from the input terminal 362.

リファレンス記憶部３７２には、解凍回路３６３からリファレンス画像としての60fps動画データが供給される。リファレンス記憶部３７２は、解凍回路３６３からのリファレンス画像を一時記憶する。 The reference storage unit 372 is supplied with 60 fps moving image data as a reference image from the decompression circuit 363. The reference storage unit 372 temporarily stores the reference image from the decompression circuit 363.

ケースID判定部３７３は、これから復元しようとするターゲット画像のブロックを、注目ブロックとして、その注目ブロックの差分圧縮データを、差分データ記憶部３７１から読み出し、その差分圧縮データに付加されているケースID（図１０７）を判定する。そして、ケースID判定部３７３は、注目ブロックについてのケースIDの判定結果を、相関最大位置検出部３７７に供給する。 The case ID determination unit 373 takes the block of the target image that is about to be restored as the block of interest, reads the difference compressed data of the block of interest from the difference data storage unit 371, and determines the case ID (FIG. 107) added to that difference compressed data. The case ID determination unit 373 then supplies the case ID determination result for the block of interest to the maximum correlation position detection unit 377.

可変長復号部３７４は、注目ブロックの差分圧縮データを、差分データ記憶部３７１から読み出し、その差分圧縮データを可変長復号することで、量子化データを得て、逆量子化部３７５に供給する。さらに、可変長復号部３７４は、差分圧縮データを可変長復号することで、必要に応じて、相関最大ベクトル(U,V)、前動きベクトル(-U₁,-V₁)、または後動きベクトル(U₂,V₂)を得て、相関最大位置検出部３７７に供給する。 The variable length decoding unit 374 reads the difference compressed data of the block of interest from the difference data storage unit 371, obtains quantized data by variable length decoding the difference compressed data, and supplies the quantized data to the inverse quantization unit 375. Furthermore, by variable length decoding the difference compressed data, the variable length decoding unit 374 obtains, as necessary, the maximum correlation vector (U,V), the front motion vector (-U₁,-V₁), or the back motion vector (U₂,V₂), and supplies it to the maximum correlation position detection unit 377.

即ち、差分圧縮データが第１または第６の差分データである場合、つまり、注目ブロックのケースIDが１または６であった場合、図１０６で説明したように、第１または第６の差分データには、量子化データの可変長符号しか含まれていないので、可変長復号部３７４は、その可変長符号を、量子化データに復号し、逆量子化部３７５に供給する。 That is, when the differentially compressed data is the first or sixth differential data, that is, when the case ID of the target block is 1 or 6, as described with reference to FIG. 106, the first or sixth differential data Since only the variable length code of the quantized data is included, the variable length decoding unit 374 decodes the variable length code into quantized data and supplies it to the inverse quantization unit 375.

また、差分圧縮データが第２の差分データである場合、つまり、注目ブロックのケースIDが２であった場合、図１０６で説明したように、第２の差分データには、量子化データの可変長符号と、相関最大ベクトル(U,V)の可変長符号とが含まれるので、可変長復号部３７４は、その可変長符号を、量子化データと相関最大ベクトル(U,V)に復号する。そして、可変長復号部３７４は、量子化データを、逆量子化部３７５に供給し、相関最大ベクトル(U,V)を、相関最大位置検出部３７７に供給する。 When the difference compressed data is the second difference data, that is, when the case ID of the block of interest is 2, the second difference data includes, as described with reference to FIG. 106, the variable length code of the quantized data and the variable length code of the maximum correlation vector (U,V), so the variable length decoding unit 374 decodes these variable length codes into the quantized data and the maximum correlation vector (U,V). The variable length decoding unit 374 then supplies the quantized data to the inverse quantization unit 375 and the maximum correlation vector (U,V) to the maximum correlation position detection unit 377.

さらに、差分圧縮データが第３の差分データである場合、つまり、注目ブロックのケースIDが３であった場合、図１０６で説明したように、第３の差分データには、量子化データの可変長符号と、前動きベクトル(-U₁,-V₁)および後動きベクトル(U₂,V₂)の可変長符号とが含まれるので、可変長復号部３７４は、その可変長符号を、量子化データと前動きベクトル(-U₁,-V₁)および後動きベクトル(U₂,V₂)とに復号する。そして、可変長復号部３７４は、量子化データを、逆量子化部３７５に供給し、前動きベクトル(-U₁,-V₁)および後動きベクトル(U₂,V₂)を、相関最大位置検出部３７７に供給する。 Furthermore, when the difference compressed data is the third difference data, that is, when the case ID of the block of interest is 3, the third difference data includes, as described with reference to FIG. 106, the variable length code of the quantized data and the variable length codes of the front motion vector (-U₁,-V₁) and the back motion vector (U₂,V₂), so the variable length decoding unit 374 decodes these variable length codes into the quantized data, the front motion vector (-U₁,-V₁), and the back motion vector (U₂,V₂). The variable length decoding unit 374 then supplies the quantized data to the inverse quantization unit 375, and the front motion vector (-U₁,-V₁) and the back motion vector (U₂,V₂) to the maximum correlation position detection unit 377.

また、差分圧縮データが第４の差分データである場合、つまり、注目ブロックのケースIDが４であった場合、図１０６で説明したように、第４の差分データには、量子化データの可変長符号と、前動きベクトル(-U₁,-V₁)の可変長符号とが含まれるので、可変長復号部３７４は、その可変長符号を、量子化データと前動きベクトル(-U₁,-V₁)とに復号する。そして、可変長復号部３７４は、量子化データを、逆量子化部３７５に供給し、前動きベクトル(-U₁,-V₁)を、相関最大位置検出部３７７に供給する。 When the difference compressed data is the fourth difference data, that is, when the case ID of the block of interest is 4, the fourth difference data includes, as described with reference to FIG. 106, the variable length code of the quantized data and the variable length code of the front motion vector (-U₁,-V₁), so the variable length decoding unit 374 decodes these variable length codes into the quantized data and the front motion vector (-U₁,-V₁). The variable length decoding unit 374 then supplies the quantized data to the inverse quantization unit 375 and the front motion vector (-U₁,-V₁) to the maximum correlation position detection unit 377.

さらに、差分圧縮データが第５の差分データである場合、つまり、注目ブロックのケースIDが５であった場合、図１０６で説明したように、第５の差分データには、量子化データの可変長符号と、後動きベクトル(U₂,V₂)の可変長符号とが含まれるので、可変長復号部３７４は、その可変長符号を、量子化データと後動きベクトル(U₂,V₂)とに復号する。そして、可変長復号部３７４は、量子化データを、逆量子化部３７５に供給し、後動きベクトル(U₂,V₂)を、相関最大位置検出部３７７に供給する。 Furthermore, when the difference compressed data is the fifth difference data, that is, when the case ID of the block of interest is 5, the fifth difference data includes, as described with reference to FIG. 106, the variable length code of the quantized data and the variable length code of the back motion vector (U₂,V₂), so the variable length decoding unit 374 decodes these variable length codes into the quantized data and the back motion vector (U₂,V₂). The variable length decoding unit 374 then supplies the quantized data to the inverse quantization unit 375 and the back motion vector (U₂,V₂) to the maximum correlation position detection unit 377.
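
The relationship between the case ID and the vectors carried in the difference compressed data can be summarized as a lookup table. The following Python sketch is illustrative only: the names `VECTORS_BY_CASE` and `route_decoded_block` and the dictionary keys are hypothetical, and the actual bitstream syntax (FIG. 106 and FIG. 107) is not reproduced here.

```python
# Which vectors accompany the quantized data for each case ID (hypothetical key names).
VECTORS_BY_CASE = {
    1: (),                              # quantized data only
    2: ("max_correlation",),            # (U, V)
    3: ("front_motion", "back_motion"), # (-U1, -V1) and (U2, V2)
    4: ("front_motion",),               # (-U1, -V1)
    5: ("back_motion",),                # (U2, V2)
    6: (),                              # quantized data only
}

def route_decoded_block(case_id, quantized, vectors):
    """Split decoded data between the inverse quantizer and the position detector."""
    expected = VECTORS_BY_CASE[case_id]
    if set(vectors) != set(expected):
        raise ValueError("case %d expects vectors %r" % (case_id, expected))
    return {
        "to_inverse_quantizer": quantized,
        "to_position_detector": dict(vectors),
    }
```

A table of this kind keeps the per-case differences in one place, so the decoding path itself does not need a separate branch for each of the six kinds of difference data.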

逆量子化部３７５は、可変長復号部３７４からの量子化データを逆量子化し、その結果得られる、周波数空間上のデータとしての、例えば、DCT係数を、変換部３７６に供給する。 The inverse quantization unit 375 inversely quantizes the quantized data from the variable length decoding unit 374, and supplies, for example, DCT coefficients as data on the frequency space obtained as a result to the conversion unit 376.

変換部３７６は、逆量子化部３７５からのDCT係数を逆DCT変換し、その結果得られる、注目ブロックの画像データ（画素値）Ic(x,y)とその推測値Pre(x,y)との差分値Sub(x,y)を、加算部３７９に供給する。 The transform unit 376 performs an inverse DCT on the DCT coefficients from the inverse quantization unit 375 and supplies the resulting difference value Sub(x,y), which is the difference between the image data (pixel values) Ic(x,y) of the block of interest and its estimated value Pre(x,y), to the adding unit 379.

最大相関位置検出部３７７は、リファレンス記憶部３７２に記憶されたリファレンス画像のうちの、注目ブロックのフレームであるターゲット画像V_Tの直前と直後のフレームを、それぞれ、パストリファレンス画像V_Pとフューチャリファレンス画像V_Fとし、ケースID判定部３７３における注目ブロックのケースIDの判定結果、または可変長復号部３７４から供給される相関最大ベクトル(U,V)、前動きベクトル(-U₁,-V₁)、後動きベクトル(U₂,V₂)に基づき、パストリファレンス画像V_Pとフューチャリファレンス画像V_Fにおいて相関が高い位置関係や、パストリファレンス画像V_Pまたはフューチャリファレンス画像V_Fにおいて、注目ブロックとの相関が高い位置関係を検出して、推測部３７８に供給する。 The maximum correlation position detection unit 377 takes, from among the reference images stored in the reference storage unit 372, the frames immediately before and immediately after the target image V_T, which is the frame containing the block of interest, as the past reference image V_P and the future reference image V_F, respectively. Based on the case ID determination result for the block of interest from the case ID determination unit 373, or on the maximum correlation vector (U,V), the front motion vector (-U₁,-V₁), or the back motion vector (U₂,V₂) supplied from the variable length decoding unit 374, it detects a positional relationship with high correlation between the past reference image V_P and the future reference image V_F, or a positional relationship in the past reference image V_P or the future reference image V_F that has high correlation with the block of interest, and supplies it to the estimation unit 378.

推測部３７８は、相関最大位置検出部３７７からの位置関係に基づき、リファレンス記憶部３７２に記憶されたリファレンス画像のうちのパストリファレンス画像V_Pまたはフューチャリファレンス画像V_Fから、注目ブロックの推測値Pre(x,y)を求め、加算部３７９に供給する。 Based on the positional relationship from the maximum correlation position detection unit 377, the estimation unit 378 obtains the estimated value Pre(x,y) of the block of interest from the past reference image V_P or the future reference image V_F among the reference images stored in the reference storage unit 372, and supplies it to the adding unit 379.

加算部３７９は、変換部３７６からの注目ブロックの差分値Sub(x,y)と、推測部３７８からの注目ブロックの推測値Pre(x,y)とを加算し、これにより、注目ブロック（の画像データ）Ic(x,y)=Sub(x,y)+Pre(x,y)を復元して、合成部３６５（図１１０）に出力する。 The adding unit 379 adds the difference value Sub(x,y) of the block of interest from the transform unit 376 and the estimated value Pre(x,y) of the block of interest from the estimation unit 378, thereby restoring (the image data of) the block of interest, Ic(x,y)=Sub(x,y)+Pre(x,y), and outputs it to the synthesis unit 365 (FIG. 110).
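
The addition performed by the adding unit 379 is a per-pixel sum over the block. As a minimal sketch, assuming blocks are represented as lists of rows (the function name `restore_block` is hypothetical):

```python
def restore_block(sub, pre):
    """Return Ic(x,y) = Sub(x,y) + Pre(x,y) for every pixel of the block."""
    return [
        [s + p for s, p in zip(sub_row, pre_row)]
        for sub_row, pre_row in zip(sub, pre)
    ]

# A 2x2 example: small residuals added to a predicted block.
restored = restore_block([[1, -2], [0, 3]], [[10, 20], [30, 40]])
```

No clipping to a valid pixel range is applied here, because the passage above does not mention one.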

次に、図１１４乃至図１２１のフローチャートを参照して、図１１３の差分情報復元部３６４が、図１１２のステップＳ３１１で行うブロックの画素値（画像データ）を求める処理について説明する。 Next, the processing for obtaining the pixel values (image data) of a block, which the difference information restoration unit 364 of FIG. 113 performs in step S311 of FIG. 112, will be described with reference to the flowcharts of FIGS. 114 to 121.

まず、ステップＳ３２１において、復元しようとする240-60fps動画データのあるフレームをターゲット画像V_Tとし、さらに、ターゲット画像V_Tをブロック分割したあるブロックを注目ブロックとして、差分データ記憶部３７１に記憶された差分圧縮データのうちの、注目ブロックの差分圧縮データが、差分データ記憶部３７１から読み出され、ケースID判定部３７３および可変長復号部３７４に入力されて、ステップＳ３２２に進む。 First, in step S321, a frame of the 240-60 fps moving image data to be restored is taken as the target image V_T, and one of the blocks obtained by dividing the target image V_T into blocks is taken as the block of interest. Of the difference compressed data stored in the difference data storage unit 371, the difference compressed data of the block of interest is read from the difference data storage unit 371 and input to the case ID determination unit 373 and the variable length decoding unit 374, and the process proceeds to step S322.

ステップＳ３２２では、リファレンス記憶部３７２に記憶されているリファレンス画像のうちの、ターゲット画像V_Tの直前と直後のフレームが、それぞれ、パストリファレンス画像V_Pとフューチャリファレンス画像V_Fとされ、相関最大位置検出部３７７および推測部３７８に入力されて、ステップＳ３２３に進む。 In step S322, of the reference images stored in the reference storage unit 372, the frames immediately before and immediately after the target image V_T are taken as the past reference image V_P and the future reference image V_F, respectively, and are input to the maximum correlation position detection unit 377 and the estimation unit 378, and the process proceeds to step S323.

ここで、例えば、図８０の一番下に示したターゲット画像のフレームf₂，f₃，f₄のうちのいずれかのブロックが注目ブロックである場合には、リファレンス画像のフレームf₁，f₅，f₉，f₁₃のうちの、ターゲット画像のフレームf₂，f₃，f₄の直前と直後のフレームf₁とf₅が、それぞれパストリファレンス画像とフューチャリファレンス画像とされる。 Here, for example, when a block in any of the target image frames f₂, f₃, and f₄ shown at the bottom of FIG. 80 is the block of interest, then of the reference image frames f₁, f₅, f₉, and f₁₃, the frames f₁ and f₅ immediately before and immediately after the target image frames f₂, f₃, and f₄ are taken as the past reference image and the future reference image, respectively.

また、例えば、ターゲット画像のフレームf₆，f₇，f₈のうちのいずれかのブロックが注目ブロックである場合には、リファレンス画像のフレームf₁，f₅，f₉，f₁₃のうちの、ターゲット画像のフレームf₆，f₇，f₈の直前と直後のフレームf₅とf₉が、それぞれパストリファレンス画像とフューチャリファレンス画像とされる。 Similarly, when a block in any of the target image frames f₆, f₇, and f₈ is the block of interest, then of the reference image frames f₁, f₅, f₉, and f₁₃, the frames f₅ and f₉ immediately before and immediately after the target image frames f₆, f₇, and f₈ are taken as the past reference image and the future reference image, respectively.

さらに、例えば、ターゲット画像のフレームf₁₀，f₁₁，f₁₂のうちのいずれかのブロックが注目ブロックである場合には、リファレンス画像のフレームf₁，f₅，f₉，f₁₃のうちの、ターゲット画像のフレームf₁₀，f₁₁，f₁₂の直前と直後のフレームf₉とf₁₃が、それぞれパストリファレンス画像とフューチャリファレンス画像とされる。 Furthermore, when a block in any of the target image frames f₁₀, f₁₁, and f₁₂ is the block of interest, then of the reference image frames f₁, f₅, f₉, and f₁₃, the frames f₉ and f₁₃ immediately before and immediately after the target image frames f₁₀, f₁₁, and f₁₂ are taken as the past reference image and the future reference image, respectively.
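
The three examples above follow one rule: the reference frames are every fourth frame starting at f₁, and the past and future reference images are the nearest reference frames on either side of the target frame. A minimal Python sketch, assuming frames are identified by their 1-based index n in fₙ (the function name `reference_frames` is hypothetical):

```python
def reference_frames(target_index):
    """Return (past, future) reference frame indices for a target frame index."""
    offset = (target_index - 1) % 4
    if offset == 0:
        # f1, f5, f9, ... are themselves 60 fps reference frames, not targets.
        raise ValueError("frame %d is a reference frame" % target_index)
    past = target_index - offset
    return past, past + 4
```

For instance, `reference_frames(7)` gives the indices of f₅ and f₉, matching the second example in the text.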

ステップＳ３２３では、ケースID判定部３７３は、注目ブロックの差分圧縮データに付加されているケースIDを解析し、そのケースIDの値を求めて、ステップＳ３２４に進む。 In step S323, the case ID determination unit 373 analyzes the case ID added to the differential compression data of the block of interest, obtains the value of the case ID, and proceeds to step S324.

ステップＳ３２４では、可変長復号部３７４は、注目ブロックの差分圧縮データを解析し、その差分圧縮データにおいて可変長符号となっている量子化データを復号（可変長復号）する。可変長復号部３７４で可変長復号が行われることにより得られた量子化データは、可変長復号部３７４から逆量子化部３７５に供給される。なお、量子化データは、ない場合（NULLの場合）もある。 In step S324, the variable length decoding unit 374 analyzes the differential compression data of the block of interest, and decodes the quantized data that is a variable length code in the differential compression data (variable length decoding). The quantized data obtained by performing variable length decoding in the variable length decoding unit 374 is supplied from the variable length decoding unit 374 to the inverse quantization unit 375. Note that there may be no quantized data (in the case of NULL).

ステップＳ３２４の処理後は、ステップＳ３２５に進み、逆量子化部３７５は、可変長復号部３７４からの量子化データを逆量子化し、その結果得られる、周波数空間上のデータとしての、例えば、DCT係数を、変換部３７６に供給して、ステップＳ３２６に進む。 After step S324, the process proceeds to step S325, where the inverse quantization unit 375 inversely quantizes the quantized data from the variable length decoding unit 374 and supplies the resulting frequency-domain data, for example, DCT coefficients, to the transform unit 376, and the process proceeds to step S326.

ステップＳ３２６では、変換部３７６は、逆量子化部３７５からのDCT係数を、２次元空間上のデータに変換する逆DCT変換を行い、その結果得られる、注目ブロックの画像データ（画素値）とその推測値との差分値Sub(x,y)を、加算部３７９に供給する。なお、ステップＳ３２４での可変長復号において、量子化データがなかった場合（NULLの場合）、変換部３７６は、注目ブロックの差分値Sub(x,y)として０を、加算部３７９に供給する。 In step S326, the transform unit 376 performs an inverse DCT that transforms the DCT coefficients from the inverse quantization unit 375 into data in two-dimensional space, and supplies the resulting difference value Sub(x,y) between the image data (pixel values) of the block of interest and its estimated value to the adding unit 379. Note that when the variable length decoding in step S324 yielded no quantized data (the NULL case), the transform unit 376 supplies 0 to the adding unit 379 as the difference value Sub(x,y) of the block of interest.
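
The path from quantized data to the difference value Sub(x,y), including the NULL case, can be sketched in Python as follows. Several points are assumptions not stated in this passage: a single uniform quantization step `step`, an orthonormal scaling of the inverse DCT, square blocks, and the function names themselves.

```python
import math

def inverse_dct_2d(coeffs):
    """Orthonormal 2D inverse DCT of an n x n coefficient block (assumed scaling)."""
    n = len(coeffs)

    def alpha(k):
        return math.sqrt((1.0 if k == 0 else 2.0) / n)

    block = [[0.0] * n for _ in range(n)]
    for y in range(n):
        for x in range(n):
            total = 0.0
            for v in range(n):
                for u in range(n):
                    total += (alpha(u) * alpha(v) * coeffs[v][u]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            block[y][x] = total
    return block

def difference_block(quantized, step, size):
    """Return Sub(x,y); an all-zero block when there is no quantized data (NULL)."""
    if quantized is None:
        return [[0.0] * size for _ in range(size)]
    # Inverse quantization with a uniform step (an assumption for illustration).
    coeffs = [[q * step for q in row] for row in quantized]
    return inverse_dct_2d(coeffs)
```

The direct four-level loop is chosen for readability; a practical decoder would use a separable or fast transform instead.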

ステップＳ３２６の処理後は、図１１５のステップＳ３２７に進み、ケースID判定部３７３は、注目ブロックの差分圧縮データに付加されているケースIDが１であるかどうか、即ち、注目ブロックの差分圧縮データが第１の差分データであるかどうかを判定する。 After the processing in step S326, the process proceeds to step S327 in FIG. 115, and the case ID determination unit 373 determines whether or not the case ID added to the differential compression data of the block of interest is 1, that is, the differential compression data of the block of interest. Is the first difference data.

ステップＳ３２７に進み、注目ブロックの差分圧縮データに付加されているケースIDが１であると判定された場合、即ち、注目ブロックの差分圧縮データが第１の差分データである場合、ケースID判定部３７３は、その判定結果を、相関最大位置検出部３７７に供給して、図１１６のステップＳ３５１に進み、以下、後述するように、注目ブロックの差分圧縮データである第１の差分データが復元される。 When it is determined in step S327 that the case ID added to the difference compressed data of the block of interest is 1, that is, when the difference compressed data of the block of interest is the first difference data, the case ID determination unit 373 supplies the determination result to the maximum correlation position detection unit 377, and the process proceeds to step S351 in FIG. 116, where, as described later, the first difference data that is the difference compressed data of the block of interest is restored.

また、ステップＳ３２７において、注目ブロックの差分圧縮データに付加されているケースIDが１でないと判定された場合、ステップＳ３２８に進み、ケースID判定部３７３は、注目ブロックの差分圧縮データに付加されているケースIDが２であるかどうか、即ち、注目ブロックの差分圧縮データが第２の差分データであるかどうかを判定する。 If it is determined in step S327 that the case ID added to the differential compression data of the target block is not 1, the process proceeds to step S328, and the case ID determination unit 373 adds the case ID to the differential compression data of the target block. It is determined whether or not the case ID is 2, that is, whether or not the differential compressed data of the block of interest is the second differential data.

ステップＳ３２８において、注目ブロックの差分圧縮データに付加されているケースIDが２であると判定された場合、即ち、注目ブロックの差分圧縮データが第２の差分データである場合、ケースID判定部３７３は、その判定結果を、相関最大位置検出部３７７に供給して、図１１７のステップＳ３６１に進み、以下、後述するように、注目ブロックの差分圧縮データである第２の差分データが復元される。 If it is determined in step S328 that the case ID added to the differential compressed data of the block of interest is 2, that is, if the differential compressed data of the block of interest is the second differential data, the case ID determination unit 373 Supplies the determination result to the maximum correlation position detection unit 377, and proceeds to step S361 in FIG. 117. As will be described later, second differential data that is differentially compressed data of the target block is restored. .

また、ステップＳ３２８において、注目ブロックの差分圧縮データに付加されているケースIDが２でないと判定された場合、ステップＳ３２９に進み、ケースID判定部３７３は、注目ブロックの差分圧縮データに付加されているケースIDが３であるかどうか、即ち、注目ブロックの差分圧縮データが第３の差分データであるかどうかを判定する。 If it is determined in step S328 that the case ID added to the differential compression data of the block of interest is not 2, the process proceeds to step S329, and the case ID determination unit 373 adds the case ID to the differential compression data of the block of interest. It is determined whether or not the case ID is 3, that is, whether or not the differential compressed data of the block of interest is the third differential data.

ステップＳ３２９において、注目ブロックの差分圧縮データに付加されているケースIDが３であると判定された場合、即ち、注目ブロックの差分圧縮データが第３の差分データである場合、ケースID判定部３７３は、その判定結果を、相関最大位置検出部３７７に供給して、図１１８のステップＳ３８１に進み、以下、後述するように、注目ブロックの差分圧縮データである第３の差分データが復元される。 If it is determined in step S329 that the case ID added to the differential compressed data of the block of interest is 3, that is, if the differential compressed data of the block of interest is the third differential data, the case ID determination unit 373 Supplies the determination result to the correlation maximum position detection unit 377 and proceeds to step S381 in FIG. 118, and the third difference data, which is the difference compressed data of the block of interest, is restored as will be described later. .

また、ステップＳ３２９において、注目ブロックの差分圧縮データに付加されているケースIDが３でないと判定された場合、ステップＳ３３０に進み、ケースID判定部３７３は、注目ブロックの差分圧縮データに付加されているケースIDが４であるかどうか、即ち、注目ブロックの差分圧縮データが第４の差分データであるかどうかを判定する。 If it is determined in step S329 that the case ID added to the differential compression data of the target block is not 3, the process proceeds to step S330, and the case ID determination unit 373 adds the case ID to the differential compression data of the target block. It is determined whether or not the case ID is 4, that is, whether or not the differential compressed data of the block of interest is the fourth differential data.

ステップＳ３３０において、注目ブロックの差分圧縮データに付加されているケースIDが４であると判定された場合、即ち、注目ブロックの差分圧縮データが第４の差分データである場合、ケースID判定部３７３は、その判定結果を、相関最大位置検出部３７７に供給して、図１１９のステップＳ３９１に進み、以下、後述するように、注目ブロックの差分圧縮データである第４の差分データが復元される。 If it is determined in step S330 that the case ID added to the differential compressed data of the block of interest is 4, that is, if the differential compressed data of the block of interest is the fourth differential data, the case ID determination unit 373 Supplies the determination result to the correlation maximum position detection unit 377, and proceeds to step S391 in FIG. 119. Hereinafter, as will be described later, fourth difference data that is differentially compressed data of the block of interest is restored. .

また、ステップＳ３３０において、注目ブロックの差分圧縮データに付加されているケースIDが４でないと判定された場合、ステップＳ３３１に進み、ケースID判定部３７３は、注目ブロックの差分圧縮データに付加されているケースIDが５であるかどうか、即ち、注目ブロックの差分圧縮データが第５の差分データであるかどうかを判定する。 If it is determined in step S330 that the case ID added to the differential compression data of the target block is not 4, the process proceeds to step S331, and the case ID determination unit 373 adds the case ID to the differential compression data of the target block. It is determined whether or not the case ID is 5, that is, whether or not the differential compressed data of the block of interest is the fifth differential data.

ステップＳ３３１において、注目ブロックの差分圧縮データに付加されているケースIDが５であると判定された場合、即ち、注目ブロックの差分圧縮データが第５の差分データである場合、ケースID判定部３７３は、その判定結果を、相関最大位置検出部３７７に供給して、図１２０のステップＳ４０１に進み、以下、後述するように、注目ブロックの差分圧縮データである第５の差分データが復元される。 If it is determined in step S331 that the case ID added to the differential compressed data of the block of interest is 5, that is, if the differential compressed data of the block of interest is the fifth differential data, the case ID determination unit 373 Supplies the determination result to the maximum correlation position detection unit 377, and proceeds to step S401 in FIG. 120. The fifth differential data, which is differential compressed data of the block of interest, is restored, as will be described later. .

また、ステップＳ３３１において、注目ブロックの差分圧縮データに付加されているケースIDが５でないと判定された場合、即ち、注目ブロックの差分圧縮データに付加されているケースIDが６であり、注目ブロックの差分圧縮データが第６の差分データである場合、ケースID判定部３７３は、その判定結果を、相関最大位置検出部３７７に供給して、図１２１のステップＳ４１１に進み、以下、後述するように、注目ブロックの差分圧縮データである第６の差分データが復元される。 If it is determined in step S331 that the case ID added to the differential compression data of the block of interest is not 5, that is, the case ID added to the differential compression data of the block of interest is 6, and the block of interest If the difference compressed data is the sixth difference data, the case ID determination unit 373 supplies the determination result to the correlation maximum position detection unit 377 and proceeds to step S411 in FIG. 121, and will be described later. In addition, the sixth differential data that is the differentially compressed data of the block of interest is restored.
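
The chain of determinations in steps S327 to S331 amounts to a dispatch on the case ID. A minimal Python sketch (the function name `first_restoration_step` is hypothetical; the step labels are those given in the text):

```python
def first_restoration_step(case_id):
    """Mirror steps S327 to S331: return the step at which restoration begins."""
    if case_id == 1:   # S327
        return "S351"  # FIG. 116
    if case_id == 2:   # S328
        return "S361"  # FIG. 117
    if case_id == 3:   # S329
        return "S381"  # FIG. 118
    if case_id == 4:   # S330
        return "S391"  # FIG. 119
    if case_id == 5:   # S331
        return "S401"  # FIG. 120
    # As in the flowchart, anything that is not 1 to 5 is handled as case 6.
    return "S411"      # FIG. 121
```

Note that, like the flowchart, this sketch does not validate the case ID: a value other than 1 to 5 falls through to the sixth-difference-data path.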

次に、図１１６のフローチャートを参照して、注目ブロックの差分圧縮データが第１の差分データである場合の、その第１の差分データの復元について説明する。 Next, with reference to the flowchart of FIG. 116, restoration of the first difference data when the difference compressed data of the target block is the first difference data will be described.

相関最大位置検出部３７７は、ケースID判定部３７３から、ケースIDが１である旨の判定結果を受信した場合、ステップＳ３５１において、注目ブロックについて、リファレンス記憶部３７２から読み出されたパストリファレンス画像V_Pとフューチャリファレンス画像V_Fとにおいて、相関が高い位置関係を検出する。即ち、相関最大位置検出部３７７は、注目ブロックにつき、図９４の相関最大位置検出部２５４における場合と同様に、考えられる(U',V')の各値について、式（９）の相関情報e₁(U',V')を計算し、その相関情報e₁(U',V')が表す相関を最大にする(U',V')、つまり、相関情報e₁(U',V')の値を最小にする(U',V')を検出する。そして、相関最大位置検出部３７７は、相関情報e₁(U',V')の値を最小にする(U',V')を、パストリファレンス画像V_Pとフューチャリファレンス画像V_Fにおいて、相関が最も高い領域の位置関係を表す位置関係ベクトル(U,V)として、推測部３７８に供給し、ステップＳ３５１からＳ３５２に進む。 When the maximum correlation position detection unit 377 receives from the case ID determination unit 373 a determination result indicating that the case ID is 1, in step S351 it detects, for the block of interest, a positional relationship with high correlation between the past reference image V_P and the future reference image V_F read from the reference storage unit 372. That is, as in the maximum correlation position detection unit 254 of FIG. 94, the maximum correlation position detection unit 377 calculates, for the block of interest, the correlation information e₁(U',V') of Equation (9) for each possible value of (U',V'), and detects the (U',V') that maximizes the correlation represented by the correlation information e₁(U',V'), that is, the (U',V') that minimizes the value of e₁(U',V'). The maximum correlation position detection unit 377 then supplies the (U',V') that minimizes the value of e₁(U',V') to the estimation unit 378 as the positional relationship vector (U,V) representing the positional relationship between the regions with the highest correlation in the past reference image V_P and the future reference image V_F, and the process proceeds from step S351 to S352.
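
The search over candidate (U',V') values can be illustrated with the following Python sketch. Equation (9) is not reproduced in this passage, so the sketch assumes that e₁(U',V') is a sum of absolute differences between the region of V_P displaced by (-(4-s)U',-(4-s)V') and the region of V_F displaced by (sU',sV'). The search range `search`, the treatment of candidates that fall outside the image, and all function names are likewise assumptions.

```python
def region(image, top, left, size):
    """Extract a size x size region whose top-left corner is (top, left)."""
    return [row[left:left + size] for row in image[top:top + size]]

def sad(a, b):
    """Sum of absolute differences, used here as the assumed e1."""
    return sum(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb))

def find_max_correlation(past, future, top, left, size, s, search):
    """Return the (U, V) minimizing the assumed e1 over a square search range."""
    height, width = len(past), len(past[0])
    best = None
    for v in range(-search, search + 1):
        for u in range(-search, search + 1):
            pt, pl = top - (4 - s) * v, left - (4 - s) * u
            ft, fl = top + s * v, left + s * u
            inside = (0 <= pt <= height - size and 0 <= pl <= width - size
                      and 0 <= ft <= height - size and 0 <= fl <= width - size)
            if not inside:
                continue  # skip candidates that leave the image (an assumption)
            e1 = sad(region(past, pt, pl, size), region(future, ft, fl, size))
            if best is None or e1 < best[0]:
                best = (e1, (u, v))
    return best[1]

# Synthetic example: the same 2x2 pattern appears in both reference images.
past = [[y * 16 + x for x in range(16)] for y in range(16)]
future = [[1000 + y * 16 + x for x in range(16)] for y in range(16)]
for dy, row in enumerate([[9, 8], [7, 6]]):
    for dx, value in enumerate(row):
        past[6 + dy][4 + dx] = value    # block position shifted by -(4-s)U
        future[6 + dy][8 + dx] = value  # block position shifted by +sU
best_uv = find_max_correlation(past, future, top=6, left=6, size=2, s=2, search=1)
```

Because the search uses only the two reference images, a receiver can repeat it without being sent (U,V), which is consistent with the first difference data carrying no vector.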

ステップＳ３５２では、推測部３７８は、相関最大位置検出部３７７からの位置関係ベクトル(U,V)が表す位置関係にある、パストリファレンス画像V_Pとフューチャリファレンス画像V_Fの領域の画像データから、注目ブロックの推測値を求め、加算部３７９に供給する。 In step S352, the estimation unit 378 obtains the estimated value of the block of interest from the image data of the regions of the past reference image V_P and the future reference image V_F that are in the positional relationship represented by the positional relationship vector (U,V) from the maximum correlation position detection unit 377, and supplies it to the adding unit 379.

即ち、推測部３７８は、パストリファレンス画像V_Pにおいて、注目ブロックからベクトル(-(4-s)U,-(4-s)V)だけずれた位置の、注目ブロックと同一サイズの領域と、フューチャリファレンス画像V_Fにおいて、注目ブロックからベクトル(sU,sV)だけずれた位置の、注目ブロックと同一サイズの領域との画像データの、例えば、重み付け平均値を、式（１３）にしたがって計算し、その重み付け平均値を、注目ブロックの推測値Pre(x,y)として求め、加算部３７９に供給して、ステップＳ３５２からＳ３５３に進む。 That is, the estimation unit 378 calculates, according to Equation (13), for example, a weighted average of the image data of a region of the past reference image V_P that has the same size as the block of interest and is displaced from the block of interest by the vector (-(4-s)U,-(4-s)V), and the image data of a region of the future reference image V_F that has the same size as the block of interest and is displaced from the block of interest by the vector (sU,sV). It takes the weighted average as the estimated value Pre(x,y) of the block of interest, supplies it to the adding unit 379, and the process proceeds from step S352 to S353.
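
The weighted average can be sketched in Python as follows. Equation (13) is not reproduced in this passage, so the default weight below is an assumption: each reference is weighted in inverse proportion to its displacement factor, giving s/4 for the past region and (4-s)/4 for the future region. The function name `predict_block` and the parameter `w_past` are hypothetical.

```python
def predict_block(past, future, top, left, size, s, u, v, w_past=None):
    """Estimate Pre(x,y) as a weighted average of two displaced reference regions."""
    if w_past is None:
        w_past = s / 4.0  # assumed weight; Equation (13) may define it differently
    pt, pl = top - (4 - s) * v, left - (4 - s) * u  # region in the past reference
    ft, fl = top + s * v, left + s * u              # region in the future reference
    pre = []
    for dy in range(size):
        row = []
        for dx in range(size):
            p = past[pt + dy][pl + dx]
            f = future[ft + dy][fl + dx]
            row.append(w_past * p + (1.0 - w_past) * f)
        pre.append(row)
    return pre
```

Passing `w_past` explicitly allows a different weighting to be substituted once the actual form of Equation (13) is known.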

ステップＳ３５３では、加算部３７９が、変換部３７６からの注目ブロックの差分値Sub(x,y)と、推測部３７８からの注目ブロックの推測値Pre(x,y)とを加算し、これにより、注目ブロック（の画像データ）Ic(x,y)を復元して、ステップＳ３５４に進み、その注目ブロックIc(x,y)を、合成部３６５（図１１０）に出力する。 In step S353, the addition unit 379 adds the difference value Sub (x, y) of the target block from the conversion unit 376 and the estimated value Pre (x, y) of the target block from the estimation unit 378, thereby The target block (image data) Ic (x, y) is restored, and the process proceeds to step S354, where the target block Ic (x, y) is output to the synthesizing unit 365 (FIG. 110).

次に、図１１７のフローチャートを参照して、注目ブロックの差分圧縮データが第２の差分データである場合の、その第２の差分データの復元について説明する。 Next, with reference to the flowchart of FIG. 117, the restoration of the second differential data when the differential compressed data of the block of interest is the second differential data will be described.

注目ブロックの差分圧縮データが、第２の差分データである場合には、その第２の差分データには、上述したように、相関最大ベクトル(U,V)が含まれる。そこで、可変長復号部３７４は、ステップＳ３６１において、注目ブロックの差分圧縮データである第２の差分データを解析し、その第２の差分データにおいて可変長符号となっている相関最大ベクトル(U,V)を可変長復号する。可変長復号部３７４で可変長復号が行われることにより得られた相関最大ベクトル(U,V)は、可変長復号部３７４から相関最大位置検出部３７７に供給される。 When the differential compressed data of the block of interest is the second differential data, the second differential data includes the correlation maximum vector (U, V) as described above. Therefore, in step S361, the variable length decoding unit 374 analyzes the second difference data that is the difference compressed data of the block of interest, and the correlation maximum vector (U, U) that is a variable length code in the second difference data. V) is variable length decoded. The maximum correlation vector (U, V) obtained by performing variable length decoding in the variable length decoding unit 374 is supplied from the variable length decoding unit 374 to the maximum correlation position detection unit 377.

さらに、ステップＳ３６１では、相関最大位置検出部３７７は、ケースID判定部３７３から、ケースIDが２である旨の判定結果を受信し、この場合、注目ブロックについて、リファレンス記憶部３７２から読み出されたパストリファレンス画像V_Pとフューチャリファレンス画像V_Fとにおいて、注目ブロックとの相関が高い位置関係を検出する。 Furthermore, in step S361, the maximum correlation position detection unit 377 receives from the case ID determination unit 373 a determination result indicating that the case ID is 2 and, in this case, detects for the block of interest a positional relationship having high correlation with the block of interest in the past reference image V_P and the future reference image V_F read from the reference storage unit 372.

即ち、相関最大位置検出部３７７は、可変長復号部３７４からの相関最大ベクトル(U,V)から、パストリファレンス画像V_Pにおいて、注目ブロックとの相関が高い位置が、注目ブロックからベクトル(-(4-s)U,-(4-s)V)だけずれた位置であることを検出するとともに、フューチャリファレンス画像V_Fにおいて、注目ブロックとの相関が高い位置が、注目ブロックからベクトル(sU,sV)だけずれた位置であることを検出し、その位置関係を表すベクトル(-(4-s)U,-(4-s)V)および(sU,sV)を、推測部３７８に供給して、ステップＳ３６１からＳ３６２に進む。 That is, from the maximum correlation vector (U,V) supplied by the variable length decoding unit 374, the maximum correlation position detection unit 377 detects that the position in the past reference image V_P having high correlation with the block of interest is the position displaced from the block of interest by the vector (-(4-s)U,-(4-s)V), and that the position in the future reference image V_F having high correlation with the block of interest is the position displaced from the block of interest by the vector (sU,sV). It supplies the vectors (-(4-s)U,-(4-s)V) and (sU,sV) representing these positional relationships to the estimation unit 378, and the process proceeds from step S361 to S362.
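
In this case no search is needed on the receiving side, because both displacement vectors follow directly from the decoded (U,V). A minimal Python sketch (the function name `reference_offsets` is hypothetical, and s is taken as given by the position of the target frame, as in the vectors above):

```python
def reference_offsets(u, v, s):
    """Derive the displacements into V_P and V_F from the decoded (U, V)."""
    past_offset = (-(4 - s) * u, -(4 - s) * v)  # (-(4-s)U, -(4-s)V)
    future_offset = (s * u, s * v)              # (sU, sV)
    return past_offset, future_offset
```

The two returned vectors correspond to the pair of vectors that the maximum correlation position detection unit 377 supplies to the estimation unit 378.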

ステップＳ３６２では、推測部３７８は、注目ブロックからの位置が、相関最大位置検出部３７７からのベクトル(-(4-s)U,-(4-s)V)と(sU,sV)が表す位置にある、パストリファレンス画像V_Pとフューチャリファレンス画像V_Fの領域の画像データから、注目ブロックの推測値を求め、加算部３７９に供給する。 In step S362, the estimation unit 378 obtains the estimated value of the block of interest from the image data of the regions of the past reference image V_P and the future reference image V_F whose positions relative to the block of interest are given by the vectors (-(4-s)U,-(4-s)V) and (sU,sV) from the maximum correlation position detection unit 377, and supplies it to the adding unit 379.

即ち、推測部３７８は、パストリファレンス画像V_Pにおいて、注目ブロックからベクトル(-(4-s)U,-(4-s)V)だけずれた位置の、注目ブロックと同一サイズの領域と、フューチャリファレンス画像V_Fにおいて、注目ブロックからベクトル(sU,sV)だけずれた位置の、注目ブロックと同一サイズの領域との画像データの、例えば、重み付け平均値を、式（１３）にしたがって計算し、その重み付け平均値を、注目ブロックの推測値Pre(x,y)として求め、加算部３７９に供給して、ステップＳ３６２からＳ３６３に進む。 That is, the estimation unit 378 calculates, according to Equation (13), for example, a weighted average of the image data of a region of the past reference image V_P that has the same size as the block of interest and is displaced from the block of interest by the vector (-(4-s)U,-(4-s)V), and the image data of a region of the future reference image V_F that has the same size as the block of interest and is displaced from the block of interest by the vector (sU,sV). It takes the weighted average as the estimated value Pre(x,y) of the block of interest, supplies it to the adding unit 379, and the process proceeds from step S362 to S363.

ステップＳ３６３では、加算部３７９が、変換部３７６からの注目ブロックの差分値Sub(x,y)と、推測部３７８からの注目ブロックの推測値Pre(x,y)とを加算し、これにより、注目ブロックIc(x,y)を復元して、ステップＳ３６４に進み、その注目ブロックIc(x,y)を、合成部３６５（図１１０）に出力する。 In step S363, the addition unit 379 adds the difference value Sub (x, y) of the target block from the conversion unit 376 and the estimated value Pre (x, y) of the target block from the estimation unit 378, thereby The target block Ic (x, y) is restored, and the process proceeds to step S364, where the target block Ic (x, y) is output to the synthesis unit 365 (FIG. 110).

次に、図１１８のフローチャートを参照して、注目ブロックの差分圧縮データが第３の差分データである場合の、その第３の差分データの復元について説明する。 Next, with reference to the flowchart of FIG. 118, the restoration of the third difference data when the difference compressed data of the block of interest is the third difference data will be described.

When the difference compressed data of the target block is the third difference data, the third difference data includes, as described above, the front motion vector (-U₁, -V₁) and the back motion vector (U₂, V₂). Therefore, in step S381, the variable length decoding unit 374 analyzes the third difference data, which is the difference compressed data of the target block, and performs variable length decoding on the front motion vector (-U₁, -V₁) and the back motion vector (U₂, V₂), which are variable length coded in the third difference data. The front motion vector (-U₁, -V₁) and the back motion vector (U₂, V₂) obtained by this variable length decoding are supplied from the variable length decoding unit 374 to the maximum correlation position detection unit 377.

Furthermore, in step S381, the maximum correlation position detection unit 377 receives from the case ID determination unit 373 the determination result indicating that the case ID is 3. In this case, for the target block, it detects the positional relationship having a high correlation with the target block in each of the past reference image V_P and the future reference image V_F read from the reference storage unit 372.

That is, the maximum correlation position detection unit 377 detects, from the front motion vector (-U₁, -V₁) supplied by the variable length decoding unit 374, that the position in the past reference image V_P having a high correlation with the target block is the position shifted from the target block by the front motion vector (-U₁, -V₁). It also detects, from the back motion vector (U₂, V₂) supplied by the variable length decoding unit 374, that the position in the future reference image V_F having a high correlation with the target block is the position shifted from the target block by the back motion vector (U₂, V₂). It supplies this positional relationship, that is, the front motion vector (-U₁, -V₁) and the back motion vector (U₂, V₂) from the variable length decoding unit 374, to the estimation unit 378, and the process proceeds from step S381 to step S382.

In step S382, the estimation unit 378 obtains an estimated value of the target block from the image data of the regions of the past reference image V_P and the future reference image V_F located, relative to the target block, at the positions represented by the front motion vector (-U₁, -V₁) and the back motion vector (U₂, V₂) from the maximum correlation position detection unit 377, and supplies the estimated value to the addition unit 379.

That is, the estimation unit 378 calculates, according to equation (14), for example, a weighted average of the image data of two regions of the same size as the target block: the region in the past reference image V_P at the position shifted from the target block by the front motion vector (-U₁, -V₁), and the region in the future reference image V_F at the position shifted from the target block by the back motion vector (U₂, V₂). It takes this weighted average as the estimated value Pre(x,y) of the target block and supplies it to the addition unit 379, and the process proceeds from step S382 to step S383.

ステップＳ３８３では、加算部３７９が、変換部３７６からの注目ブロックの差分値Sub(x,y)と、推測部３７８からの注目ブロックの推測値Pre(x,y)とを加算し、これにより、注目ブロックIc(x,y)を復元して、ステップＳ３８４に進み、その注目ブロックIc(x,y)を、合成部３６５（図１１０）に出力する。 In step S383, the addition unit 379 adds the difference value Sub (x, y) of the target block from the conversion unit 376 and the estimated value Pre (x, y) of the target block from the estimation unit 378, thereby The target block Ic (x, y) is restored, and the process proceeds to step S384, where the target block Ic (x, y) is output to the synthesis unit 365 (FIG. 110).

次に、図１１９のフローチャートを参照して、注目ブロックの差分圧縮データが第４の差分データである場合の、その第４の差分データの復元について説明する。 Next, with reference to the flowchart of FIG. 119, description will be given of restoration of the fourth difference data when the difference compressed data of the target block is the fourth difference data.

When the difference compressed data of the target block is the fourth difference data, the fourth difference data includes, as described above, the front motion vector (-U₁, -V₁). Therefore, in step S391, the variable length decoding unit 374 analyzes the fourth difference data, which is the difference compressed data of the target block, and performs variable length decoding on the front motion vector (-U₁, -V₁), which is variable length coded in the fourth difference data. The front motion vector (-U₁, -V₁) obtained by this variable length decoding is supplied from the variable length decoding unit 374 to the maximum correlation position detection unit 377.

Furthermore, in step S391, the maximum correlation position detection unit 377 receives from the case ID determination unit 373 the determination result indicating that the case ID is 4. In this case, for the target block, it detects the positional relationship having a high correlation with the target block in the past reference image V_P read from the reference storage unit 372.

That is, the maximum correlation position detection unit 377 detects, from the front motion vector (-U₁, -V₁) supplied by the variable length decoding unit 374, that the position in the past reference image V_P having a high correlation with the target block is the position shifted from the target block by the front motion vector (-U₁, -V₁). It supplies this positional relationship, that is, the front motion vector (-U₁, -V₁) from the variable length decoding unit 374, to the estimation unit 378, and the process proceeds from step S391 to step S392.

In step S392, the estimation unit 378 obtains an estimated value of the target block from the image data of the region of the past reference image V_P located, relative to the target block, at the position represented by the front motion vector (-U₁, -V₁) from the maximum correlation position detection unit 377, and supplies the estimated value to the addition unit 379.

That is, the estimation unit 378 cuts out, from the past reference image V_P, the image data of the region of the same size as the target block at the position shifted from the target block by the front motion vector (-U₁, -V₁), and supplies it to the addition unit 379 as the estimated value Pre(x,y) of the target block, and the process proceeds from step S392 to step S393.

ステップＳ３９３では、加算部３７９が、変換部３７６からの注目ブロックの差分値Sub(x,y)と、推測部３７８からの注目ブロックの推測値Pre(x,y)とを加算し、これにより、注目ブロックIc(x,y)を復元して、ステップＳ３９４に進み、その注目ブロックIc(x,y)を、合成部３６５（図１１０）に出力する。 In step S393, the addition unit 379 adds the difference value Sub (x, y) of the target block from the conversion unit 376 and the estimated value Pre (x, y) of the target block from the estimation unit 378, thereby The target block Ic (x, y) is restored, and the process proceeds to step S394. The target block Ic (x, y) is output to the synthesis unit 365 (FIG. 110).

次に、図１２０のフローチャートを参照して、注目ブロックの差分圧縮データが第５の差分データである場合の、その第５の差分データの復元について説明する。 Next, with reference to the flowchart of FIG. 120, restoration of the fifth difference data when the difference compressed data of the target block is the fifth difference data will be described.

When the difference compressed data of the target block is the fifth difference data, the fifth difference data includes, as described above, the back motion vector (U₂, V₂). Therefore, in step S401, the variable length decoding unit 374 analyzes the fifth difference data, which is the difference compressed data of the target block, and performs variable length decoding on the back motion vector (U₂, V₂), which is variable length coded in the fifth difference data. The back motion vector (U₂, V₂) obtained by this variable length decoding is supplied from the variable length decoding unit 374 to the maximum correlation position detection unit 377.

Furthermore, in step S401, the maximum correlation position detection unit 377 receives from the case ID determination unit 373 the determination result indicating that the case ID is 5. In this case, for the target block, it detects the positional relationship having a high correlation with the target block in the future reference image V_F read from the reference storage unit 372.

That is, the maximum correlation position detection unit 377 detects, from the back motion vector (U₂, V₂) supplied by the variable length decoding unit 374, that the position in the future reference image V_F having a high correlation with the target block is the position shifted from the target block by the back motion vector (U₂, V₂). It supplies this positional relationship, that is, the back motion vector (U₂, V₂) from the variable length decoding unit 374, to the estimation unit 378, and the process proceeds from step S401 to step S402.

In step S402, the estimation unit 378 obtains an estimated value of the target block from the image data of the region of the future reference image V_F located, relative to the target block, at the position represented by the back motion vector (U₂, V₂) from the maximum correlation position detection unit 377, and supplies the estimated value to the addition unit 379.

That is, the estimation unit 378 cuts out, from the future reference image V_F, the image data of the region of the same size as the target block at the position shifted from the target block by the back motion vector (U₂, V₂), and supplies it to the addition unit 379 as the estimated value Pre(x,y) of the target block, and the process proceeds from step S402 to step S403.

ステップＳ４０３では、加算部３７９が、変換部３７６からの注目ブロックの差分値Sub(x,y)と、推測部３７８からの注目ブロックの推測値Pre(x,y)とを加算し、これにより、注目ブロックIc(x,y)を復元して、ステップＳ４０４に進み、その注目ブロックIc(x,y)を、合成部３６５（図１１０）に出力する。 In step S403, the addition unit 379 adds the difference value Sub (x, y) of the target block from the conversion unit 376 and the estimated value Pre (x, y) of the target block from the estimation unit 378, thereby The target block Ic (x, y) is restored, and the process proceeds to step S404, where the target block Ic (x, y) is output to the synthesis unit 365 (FIG. 110).

次に、図１２１のフローチャートを参照して、注目ブロックの差分圧縮データが第６の差分データである場合の、その第６の差分データの復元について説明する。 Next, with reference to the flowchart in FIG. 121, restoration of the sixth difference data when the difference compressed data of the block of interest is the sixth difference data will be described.

When the difference compressed data of the target block is the sixth difference data, the sixth difference data is the difference between the target block Ic(x,y) and its estimated value Pre(x,y), taken with the estimated value Pre(x,y) of the target block set to 0.

そこで、相関最大位置検出部３７７は、ケースID判定部３７３から、ケースIDが６である旨の判定結果が供給された場合、その判定結果を、推定部３７８に供給し、推定部３７８は、相関最大位置検出部３７７から、ケースIDが６である旨の判定結果が供給された場合には、注目ブロックの推測値Pre(x,y)として０を、加算部３７９に供給する。 Therefore, when the determination result indicating that the case ID is 6 is supplied from the case ID determination unit 373, the maximum correlation position detection unit 377 supplies the determination result to the estimation unit 378, and the estimation unit 378 When the determination result that the case ID is 6 is supplied from the maximum correlation position detection unit 377, 0 is supplied to the addition unit 379 as the estimated value Pre (x, y) of the block of interest.

In this case, in step S411, the addition unit 379 adds the difference value Sub(x,y) of the target block from the conversion unit 376 (which here equals the image data Ic(x,y) of the target block) and the estimated value Pre(x,y) of the target block from the estimation unit 378 (which here is 0), thereby restoring the target block Ic(x,y), and outputs it to the synthesis unit 365 (FIG. 110).
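The way the estimated value Pre(x,y) depends on the case ID for the third to sixth difference data can be summarized in a short sketch. The names below (`estimate_block`, `restore_block`, the `front` and `back` arguments) are hypothetical, the vectors are passed already signed, that is, `front` = (-U₁, -V₁) and `back` = (U₂, V₂), and the equal weights for case 3 are an assumption because equation (14) is not reproduced in this passage.

```python
import numpy as np


def cut(ref, x0, y0, vec, size):
    """Cut the size x size region of ref displaced by vec = (dx, dy) from (x0, y0)."""
    dx, dy = vec
    return ref[y0 + dy:y0 + dy + size, x0 + dx:x0 + dx + size].astype(np.float64)


def estimate_block(case_id, past, future, x0, y0, size, front=None, back=None):
    """Return Pre(x, y) for case IDs 3 to 6."""
    if case_id == 3:
        # Both references; ASSUMPTION: equal weights stand in for equation (14).
        return 0.5 * cut(past, x0, y0, front, size) + 0.5 * cut(future, x0, y0, back, size)
    if case_id == 4:
        # Past reference only, displaced by the front motion vector.
        return cut(past, x0, y0, front, size)
    if case_id == 5:
        # Future reference only, displaced by the back motion vector.
        return cut(future, x0, y0, back, size)
    if case_id == 6:
        # No reference image is used; the estimate is 0.
        return np.zeros((size, size))
    raise ValueError(f"unsupported case ID: {case_id}")


def restore_block(case_id, sub, past, future, x0, y0, size, front=None, back=None):
    """Ic(x, y) = Sub(x, y) + Pre(x, y), as performed by the addition unit."""
    return sub + estimate_block(case_id, past, future, x0, y0, size, front, back)
```

For case ID 6 the restored block equals the difference value itself, which matches the remark that Sub(x,y) is then equal to Ic(x,y).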

Note that when the display device 3 (FIG. 24) can display only at a frame rate of, for example, 60 fps, the receiving device of FIG. 110 need only restore the bit stream from the input terminal 361 in the decompression circuit 363 and output the resulting 60 fps moving image data as is to the display device 3 via the synthesis unit 365.

また、表示装置３（図２４）が、例えば、120fpsのフレームレートでの表示しか行うことができない場合には、図１１０の受信装置では、以下のような処理を行えばよい。 In addition, when the display device 3 (FIG. 24) can only display at a frame rate of 120 fps, for example, the receiving device of FIG. 110 may perform the following processing.

That is, the decompression circuit 363 restores the bit stream from the input terminal 361 to 60 fps moving image data and supplies it to the synthesis unit 365. The difference information restoration unit 364 restores, from among the difference compressed data from the input terminal 362, the difference compressed data of the blocks of the frames for which the variable s described with reference to FIG. 82 is 2, and supplies the result to the synthesis unit 365.

In this case, the decompression circuit 363 supplies the synthesis unit 365 with, for example, the frames ..., f₁, f₅, f₉, f₁₃, ... of the 60 fps moving image data shown second from the top in FIG. 80. The difference information restoration unit 364 supplies the synthesis unit 365 with, for example, the frames ..., f₃, f₇, f₁₁, ... from among the frames ..., f₂, f₃, f₄, f₆, f₇, f₈, f₁₀, f₁₁, f₁₂, ... of the 240-60 fps moving image data shown at the bottom of FIG. 80.

Accordingly, the synthesis unit 365 can obtain moving image data with a frame rate of 120 fps by selecting and outputting the frames ..., f₁, f₅, f₉, f₁₃, ... of the 60 fps moving image data from the decompression circuit 363 and the frames ..., f₃, f₇, f₁₁, ... from the difference information restoration unit 364 in the order ..., f₁, f₃, f₅, f₇, f₉, f₁₁, f₁₃, .... This 120 fps moving image data need only be output to the display device 3.
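The frame selection performed by the synthesis unit 365 for a 120 fps display amounts to interleaving two sequences. A minimal sketch follows; the function name is hypothetical and frames are represented by labels rather than image data.

```python
def interleave_120fps(base_frames, mid_frames):
    """Interleave 60 fps frames (f1, f5, f9, ...) with the restored s = 2 frames (f3, f7, ...)."""
    out = []
    for i, base in enumerate(base_frames):
        out.append(base)
        # A restored intermediate frame follows each base frame while one is available.
        if i < len(mid_frames):
            out.append(mid_frames[i])
    return out
```

Calling `interleave_120fps(["f1", "f5", "f9", "f13"], ["f3", "f7", "f11"])` yields the order f₁, f₃, f₅, f₇, f₉, f₁₁, f₁₃ given in the text.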

このようにして、図１１０の受信装置２では、表示装置３（図２４）の表示能力（フレームレート）に応じて適切なフレームレートの動画データを得ることができる。 In this manner, the receiving device 2 in FIG. 110 can obtain moving image data having an appropriate frame rate according to the display capability (frame rate) of the display device 3 (FIG. 24).

As described above, the transmission device 1 of FIG. 79 can highly compress moving image data with a high frame rate, such as 240 fps moving image data, and the receiving device 2 of FIG. 110 can restore the high frame rate moving image data compressed in this way.

In particular, with the first difference data, when a moving image with a high frame rate (for example, 240 fps) is restored, attention is paid to one image of the high frame rate moving image to be restored. A plurality of (for example, two) images constituting the low frame rate moving image (for example, the past reference image and the future reference image described above) that are temporally close to this image to be restored (the target image) are taken out, the portions of those images that are in a positional relationship with high mutual correlation are found (for example, the (U', V') that minimizes the correlation information e₁(U', V') of equation (9)), and the image to be restored is restored from those portions. This has the advantage that no motion vector needs to be transmitted from the compression side (transmission device 1) to the decompression side (receiving device 2), which is effective in reducing the amount of data.
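Because the search uses only the two reference images, the receiver can repeat it without any transmitted vector. The sketch below illustrates the idea under stated assumptions: equation (9) is not reproduced in this passage, so a sum of absolute differences stands in for the correlation information e₁(U', V'); the displacement convention is the one given above for the maximum correlation vector; and the function name and the `search` range are hypothetical.

```python
import numpy as np


def find_max_correlation_vector(past, future, x0, y0, s, size, search=2):
    """Return the (U, V) whose displaced past and future regions match best."""
    height, width = past.shape
    best, best_cost = (0, 0), float("inf")
    for V in range(-search, search + 1):
        for U in range(-search, search + 1):
            # Past region at -(4-s)(U, V), future region at s(U, V).
            px, py = x0 - (4 - s) * U, y0 - (4 - s) * V
            fx, fy = x0 + s * U, y0 + s * V
            # Skip candidates whose regions fall outside the images.
            if min(px, py, fx, fy) < 0:
                continue
            if max(px, fx) + size > width or max(py, fy) + size > height:
                continue
            p = past[py:py + size, px:px + size].astype(np.int64)
            f = future[fy:fy + size, fx:fx + size].astype(np.int64)
            # ASSUMPTION: SAD is used here in place of e1(U', V') of equation (9).
            cost = int(np.abs(p - f).sum())
            if cost < best_cost:
                best, best_cost = (U, V), cost
    return best
```

As the following paragraph of the text points out, such a search can be ambiguous when several positional relationships correlate equally well, which is why the other kinds of difference data exist.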

Furthermore, restoration of the first difference data, which contains no motion vector, may fail when the past reference image and the future reference image have a plurality of positional relationships with high correlation. To deal with such cases, the transmission device 1 employs, in addition to the first difference data, the second difference data containing the maximum correlation vector (U, V), the third difference data containing both the front motion vector (-U₁, -V₁) and the back motion vector (U₂, V₂), the fourth difference data containing the front motion vector (-U₁, -V₁), the fifth difference data containing the back motion vector (U₂, V₂), and the sixth difference data, which can be restored without referring to a reference image, and selects one of the first to sixth difference data as necessary. A failure to restore the moving image data can therefore be prevented.

Furthermore, the transmission device 1 of FIG. 79 keeps the data relating to the low frame rate moving image (the bit stream described above) separate from the data required to restore the high frame rate moving image from that low frame rate moving image (the difference compressed data described above). As a result, the receiving device 2 of FIG. 110 can restore the low frame rate moving image by receiving only the data relating to the low frame rate moving image, and can restore the high frame rate moving image by receiving both that data and the data required to restore the high frame rate moving image from the low frame rate moving image. In other words, so-called temporal scalability can be provided.

次に、上述した一連の処理は、専用のハードウェアにより行うこともできるし、ソフトウェアにより行うこともできる。一連の処理をソフトウェアによって行う場合には、そのソフトウェアを構成するプログラムが、汎用のコンピュータ等にインストールされる。 Next, the series of processes described above can be performed by dedicated hardware or by software. When a series of processing is performed by software, a program constituting the software is installed in a general-purpose computer or the like.

そこで、図１２２は、上述した一連の処理を実行するプログラムがインストールされるコンピュータの一実施の形態の構成例を示している。 Therefore, FIG. 122 shows a configuration example of an embodiment of a computer in which a program for executing the series of processes described above is installed.

The CPU (Central Processing Unit) 401, which is the main controller of the computer, is connected to each unit via a bus 408 and performs the series of processes described above by executing application programs under the control of an operating system (OS). The application programs executed by the CPU 401 include one for performing the series of processes described above, and the CPU 401 processes, for example, moving image data downloaded from a VTR (Video Tape Recorder) 410 to an HDD (Hard Disk Drive) 414 via a VTR interface 409, the bus 408, and an external device interface 407.

メモリ４０２は、CPU４０１において実行されるプログラム・コードを格納したり、実行中の作業データを一時保管するために使用される記憶装置である。なお、メモリ４０２には、ROM(Read Only Memory)などの不揮発性メモリ及びDRAM(Dynamic Random Access Memory)などの揮発性メモリの双方が含まれる。 The memory 402 is a storage device used for storing program codes executed by the CPU 401 and temporarily storing work data being executed. The memory 402 includes both a nonvolatile memory such as a ROM (Read Only Memory) and a volatile memory such as a DRAM (Dynamic Random Access Memory).

ディスプレイコントローラ４０３は、CPU４０１が発行する描画命令を実際に処理するための専用コントローラである。ディスプレイ・コントローラ４０３において処理された描画データは、例えばフレーム・バッファ（図示しない）に一旦書き込まれた後、ディスプレイ４１１によって画面出力される。例えば、HDD４１４から再生された動画は、上述したように、ディスプレイ４１１で画面表示される。 The display controller 403 is a dedicated controller for actually processing a drawing command issued by the CPU 401. The drawing data processed in the display controller 403 is temporarily written in a frame buffer (not shown), for example, and then output on the screen by the display 411. For example, the moving image reproduced from the HDD 414 is displayed on the screen of the display 411 as described above.

入力機器インターフェース４０４は、キーボード４１２やマウス４１３などのユーザ入力機器をコンピュータに接続するための装置である。ユーザは、キーボード４１２やマウス４１３を介して、動画を再生するためのコマンドなどを入力することができる。 The input device interface 404 is a device for connecting user input devices such as a keyboard 412 and a mouse 413 to a computer. A user can input a command or the like for playing back a moving image via the keyboard 412 or the mouse 413.

ネットワークインターフェース４０５は、Ethernet（登録商標）などの所定の通信プロトコルに従って、コンピュータをLAN（Local Area Network）などの局所的ネットワーク、さらにはインターネットのような広域ネットワークに接続することができる。なお、ネットワーク上では、複数のホスト端末やサーバ（図示しない）がトランスペアレントな状態で接続され、分散コンピューティング環境が構築されている。ネットワーク上では、上述した一連の処理を実行するためのアプリケーションプログラムを含むソフトウェア・プログラムや、上述のビットストリーム（エンコードデータ）、差分圧縮データを含むデータ・コンテンツなどの配信サービスを行うことができる。また、コンピュータでは、例えば、他人が撮影した動画等が保存されているサーバから、動画データや、上述のビットストリーム、差分圧縮データを、ネットワークを経由してネットワークインターフェース４０５で受信し、HDD４１４へダウンロードすることができる。 The network interface 405 can connect the computer to a local network such as a LAN (Local Area Network) and further to a wide area network such as the Internet according to a predetermined communication protocol such as Ethernet (registered trademark). On the network, a plurality of host terminals and servers (not shown) are connected in a transparent state, and a distributed computing environment is constructed. On the network, distribution services such as a software program including an application program for executing the above-described series of processing, a data content including the above-described bit stream (encoded data), and differentially compressed data can be provided. In addition, the computer receives, for example, moving image data, the above-described bit stream, and differentially compressed data from the server storing moving images taken by others by the network interface 405 via the network and downloads them to the HDD 414. can do.

外部機器インターフェース４０７は、HDD４１４やメディア・ドライブ４１５などの外部装置をコンピュータに接続するための装置である。 The external device interface 407 is a device for connecting external devices such as the HDD 414 and the media drive 415 to the computer.

HDD４１４は、記憶媒体としての磁気ディスクを固定的に搭載したランダムアクセス可能な外部記憶装置であり、記憶容量やデータ転送速度などの点で他の外部記憶装置よりも優れている。HDD４１４には、上述した一連の処理を実行するアプリケーションプログラム等がインストールされる。ここで、「インストール」とは、ソフトウェア・プログラムを実行可能な状態でHDD４１４上に置くことを意味する。HDD４１４には、CPU４０１が実行すべきオペレーティング・システムのプログラム・コードや、アプリケーション・プログラム、デバイス・ドライバなどが不揮発的に格納される。なお、アプリケーションプログラムは、可搬形メディア４１６から、あるいは、ネットワークインターフェース４０５を介してダウンロードして、HDD４１４にインストールすることができる。また、アプリケーションプログラムは、あらかじめHDD４１４にインストールしておくことができる。 The HDD 414 is a random-accessible external storage device in which a magnetic disk as a storage medium is fixedly mounted, and is superior to other external storage devices in terms of storage capacity and data transfer speed. The HDD 414 is installed with an application program that executes the above-described series of processing. Here, “install” means placing the software program on the HDD 414 in an executable state. The HDD 414 stores the operating system program code to be executed by the CPU 401, application programs, device drivers, and the like in a nonvolatile manner. The application program can be downloaded from the portable medium 416 or via the network interface 405 and installed in the HDD 414. The application program can be installed in the HDD 414 in advance.

メディア・ドライブ４１５は、CD（Compact Disc）やMO（Magneto-Optical disc）、DVD（Digital Versatile Disc）などの可搬型メディア４１６を装填して、そのデータ記録面にアクセスするための装置である。 The media drive 415 is a device for loading portable media 416 such as CD (Compact Disc), MO (Magneto-Optical disc), DVD (Digital Versatile Disc), etc., and accessing the data recording surface.

可搬型メディア４１６は、主として、ソフトウェア・プログラムやデータ・ファイルなどをコンピュータ可読形式のデータとしてバックアップすることや、これらをシステム間で移動（すなわち販売・流通・配布を含む）する目的で使用される。上述した一連の処理を実行するアプリケーションプログラムや、そのアプリケーションプログラムを実行することにより得られるデータは、可搬型メディア４１６を利用して複数の機器間で物理的に流通・配布することができる。また、データは、ネットワークインターフェース４０５を介して配信することもできる。 The portable media 416 is used mainly for the purpose of backing up software programs, data files, and the like as data in a computer-readable format, and for moving them between systems (that is, including sales, distribution, and distribution). . An application program that executes the above-described series of processing and data obtained by executing the application program can be physically distributed and distributed among a plurality of devices by using the portable medium 416. Data can also be distributed via the network interface 405.

VTRインターフェース４０９は、VTR４１０から再生される動画をコンピュータ内に取り込むための装置である。 The VTR interface 409 is a device for taking a moving image reproduced from the VTR 410 into the computer.

Here, a compatible machine or successor of the IBM personal computer "PC/AT (Personal Computer/Advanced Technology)" can be employed as the computer shown in FIG. 122. Of course, a computer with another architecture may also be employed.

なお、図７９の送信装置１では、240fps動画データを処理するようにしたが、送信装置１で処理する動画データのフレームレートは、240fpsに限定されるものではない。さらに、分離回路２１３における動画データの分離の仕方も、動画データを、60fps動画データと、残りの240-60fps動画データとに分離するものに限定されるものではない。即ち、分離回路２１３では、例えば、240fps動画データを、30fps、あるいは120fpsの動画データと、残りの動画データとに分離することなどが可能である。 Note that although the transmission device 1 in FIG. 79 processes 240 fps moving image data, the frame rate of the moving image data processed by the transmission device 1 is not limited to 240 fps. Further, the method of separating the moving image data in the separation circuit 213 is not limited to the method of separating the moving image data into 60 fps moving image data and the remaining 240-60 fps moving image data. That is, the separation circuit 213 can separate, for example, 240 fps moving image data into 30 fps or 120 fps moving image data and the remaining moving image data.
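The separation into a base sequence and the remaining frames can be illustrated with a small sketch. The function name and the `step` parameter are hypothetical: a step of 4 corresponds to splitting 240 fps data into 60 fps data and the remaining 240-60 fps data, a step of 2 to a 120 fps base, and a step of 8 to a 30 fps base.

```python
def separate_frames(frames, step=4):
    """Split a frame sequence into every step-th frame and all remaining frames."""
    base = frames[::step]
    rest = [frame for i, frame in enumerate(frames) if i % step != 0]
    return base, rest
```

For frames f₁ to f₁₃ and a step of 4, the base sequence is f₁, f₅, f₉, f₁₃ and the remainder starts with f₂, f₃, f₄, f₆.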

また、上述の説明における「フレーム」は、「フィールド」と読み替えることができる。 Further, “frame” in the above description can be read as “field”.

さらに、本実施の形態では、注目ブロックの推測値を、パストリファレンス画像またはフューチャリファレンス画像を用いて求めるようにしたが、注目ブロックの推測値は、さらに、パストリファレンス画像の時間的に前にあるフレームや、フューチャリファレンス画像の時間的に後にあるフレームを用いて求めるようにすることも可能である。 Furthermore, in the present embodiment, the estimated value of the target block is obtained using the past reference image or the future reference image. However, the estimated value of the target block is further in time before the past reference image. It is also possible to use a frame or a frame that is temporally after the future reference image.

In addition, although the transmission device 1 of FIG. 79 obtains the first to sixth difference data for each block and selects from among them the one that becomes the compression result of the block, the transmission device 1 may instead obtain only the first difference data or only the second difference data and use that first or second difference data as is as the compression result of the block (the difference compressed data). Furthermore, the transmission device 1 may obtain the first difference data and one or more of the second to sixth difference data, and select from among them the one that becomes the compression result of the block. The transmission device 1 may also obtain the second difference data and one or more of the first difference data and the third to sixth difference data, and select from among them the one that becomes the compression result of the block.

A diagram showing moving image data in a three-dimensional space obtained by adding a time direction (t) to a two-dimensional plane (x, y).
A diagram showing the range in the frequency domain of a moving image that can be recognized by human vision.
A diagram showing the region in the frequency domain in which data of a stationary portion of a moving image is distributed.
A diagram showing the region in the frequency domain in which data of a portion where the subject is moving at a speed of about (r₀/t₀)/2 is distributed.
A waveform diagram showing a subject moving at a speed of (r₀/t₀)/2.
A diagram showing a subject moving at a speed of (r₀/t₀)/2.
A diagram for explaining the portion in the frequency domain of the data of a portion moving at a speed of about (r₀/t₀)/2.
A diagram showing the region in the frequency domain in which data of a portion where the subject is moving at a speed of about r₀/t₀ is distributed.
A diagram showing the region in the frequency domain in which data of a portion where the subject is moving at a speed of about 2r₀/t₀ is distributed.
A diagram explaining the region in the frequency domain that a human can recognize for a stationary portion of a moving image.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about (r₀/t₀)/2.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about r₀/t₀.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about 2r₀/t₀.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about (r₀/t₀)/2.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about (r₀/t₀)/2.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about r₀/t₀.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about r₀/t₀.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about 2r₀/t₀.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about 2r₀/t₀.
A diagram explaining the region in the frequency domain that a human can recognize for a stationary portion of a moving image.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about (r₀/t₀)/2.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about r₀/t₀.
A diagram explaining the region in the frequency domain that a human can recognize for a portion where the subject is moving at a speed of about 2r₀/t₀.
A block diagram showing a configuration example of an embodiment of an image processing system to which the present invention is applied.
A block diagram showing a first configuration example of the transmission device 1.
A flowchart explaining the processing of the transmission device 1.
A diagram explaining the moving image data output by the transmission device 1.
A diagram explaining the moving image data output by the transmission device 1.
A diagram explaining the moving image data output by the transmission device 1.
A diagram explaining the moving image data output by the transmission device 1.
A flowchart explaining the processing of the filter generation unit 23.
A flowchart explaining a method of acquiring the principal component direction.
A block diagram showing a first configuration example of the reception device 2.
A flowchart explaining the processing of the reception device 2.
A diagram explaining the moving image data after upsampling in the reception device 2.
A diagram explaining the moving image data after upsampling in the reception device 2.
A diagram explaining the moving image data after upsampling in the reception device 2.
A diagram explaining the moving image data after upsampling in the reception device 2.
A diagram explaining integration by an electronic shutter.
A diagram explaining the passband of a filter.
A diagram explaining the passband of a filter.
A diagram explaining the passband of a filter.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A waveform diagram showing the high-frame-rate moving image data supplied to the transmission device 1.
A block diagram showing a configuration example of the principal component direction acquisition unit 31.
A diagram showing correlation information.
A diagram showing correlation information after scaling.
A diagram showing composite correlation information.
A diagram showing motion vectors.
A diagram showing motion vectors.
A diagram showing motion vectors.
A diagram showing motion vectors.
A diagram showing motion vectors.
A diagram showing motion vectors.
A diagram showing motion vectors.
A diagram showing motion vectors.
A flowchart explaining the processing of the principal component direction acquisition unit 31.
A flowchart explaining the processing of the principal component direction acquisition unit 31.
A flowchart explaining the processing of the principal component direction acquisition unit 31.
A flowchart explaining the processing of the principal component direction acquisition unit 31.
A flowchart explaining the processing of the filter unit 22.
A diagram showing a stationary subject and a moving subject.
A waveform diagram showing moving image data onto which a stationary subject and a moving subject are projected.
A waveform diagram showing moving image data onto which a stationary subject and a moving subject are projected.
A block diagram showing another configuration example of the principal component direction acquisition unit 31.
A flowchart explaining the processing of the filter generation unit 23.
A flowchart explaining the processing of the filter generation unit 23.
A flowchart explaining the processing of the filter generation unit 23.
A flowchart explaining the processing of the filter generation unit 23.
A flowchart explaining the processing of the filter generation unit 23.
A flowchart explaining the processing of the filter unit 22.
A block diagram showing a second configuration example of the transmission device 1.
A diagram showing 240 fps moving image data, 60 fps moving image data, and 240-60fps moving image data.
A block diagram showing a configuration example of the difference information extraction unit 217.
A diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A waveform diagram showing a subject.
A waveform diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A waveform diagram showing the past reference image V_P and the future reference image V_F.
A waveform diagram showing the past reference image V_P and the future reference image V_F.
A waveform diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A waveform diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A waveform diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A waveform diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A waveform diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A waveform diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A waveform diagram showing the target image V_T, the past reference image V_P, and the future reference image V_F.
A block diagram showing a configuration example of the difference data calculation unit 234.
A flowchart explaining the processing of the difference data calculation unit 234.
A block diagram showing a configuration example of the difference data calculation unit 235.
A flowchart explaining the processing of the difference data calculation unit 235.
A block diagram showing a configuration example of the difference data calculation unit 236.
A flowchart explaining the processing of the difference data calculation unit 236.
A block diagram showing a configuration example of the difference data calculation unit 237.
A flowchart explaining the processing of the difference data calculation unit 237.
A block diagram showing a configuration example of the difference data calculation unit 238.
A flowchart explaining the processing of the difference data calculation unit 238.
A block diagram showing a configuration example of the difference data calculation unit 239.
A flowchart explaining the processing of the difference data calculation unit 239.
A diagram showing the data structure of the first to sixth difference data.
A diagram showing the data structure of the first to sixth difference data to which a case ID has been added.
A flowchart explaining the processing of the transmission device 1.
A flowchart explaining the compression of a target image performed by the difference information extraction unit 217.
A block diagram showing a second configuration example of the reception device 2.
A flowchart explaining the processing of the reception device 2.
A flowchart explaining the decoding of the 240-60fps moving image data performed by the difference information restoration unit 364.
A block diagram showing a configuration example of the difference information restoration unit 364.
A flowchart explaining the processing, performed by the difference information restoration unit 364, of obtaining the pixel values of a block.
A flowchart explaining the processing, performed by the difference information restoration unit 364, of obtaining the pixel values of a block.
A flowchart explaining the processing, performed by the difference information restoration unit 364, of obtaining the pixel values of a block.
A flowchart explaining the processing, performed by the difference information restoration unit 364, of obtaining the pixel values of a block.
A flowchart explaining the processing, performed by the difference information restoration unit 364, of obtaining the pixel values of a block.
A flowchart explaining the processing, performed by the difference information restoration unit 364, of obtaining the pixel values of a block.
A flowchart explaining the processing, performed by the difference information restoration unit 364, of obtaining the pixel values of a block.
A flowchart explaining the processing, performed by the difference information restoration unit 364, of obtaining the pixel values of a block.
A block diagram showing a configuration example of an embodiment of a computer to which the present invention is applied.

Explanation of symbols

1 transmission device, 2 reception device, 3 display device, 11 recording medium, 12 transmission medium, 21 buffer unit, 22 filter unit, 23 filter generation unit, 24 encoding unit, 31 principal component direction acquisition unit, 32 filter information supply unit, 50 decoding unit, 51 buffer unit, 52 filter unit, 53 filter generation unit, 61 principal component direction acquisition unit, 62 filter information supply unit, 101 buffer unit, 102 block extraction unit, 103 correlation calculation unit, 104 scaling synthesis unit, 105 minimum value detection unit, 124 synthesis unit, 125 minimum value detection unit, 211 input terminal, 212 band-limiting filter unit, 213 separation circuit, 214 compression circuit, 215 output terminal, 216 decompression circuit, 217 difference information extraction unit, 218 output terminal, 221 reference storage unit, 222 target storage unit, 223 data processing unit, 231 to 233 input terminals, 234 to 239 difference data calculation units, 240 selection circuit, 241 output terminal, 251 to 253 input terminals, 254 maximum correlation position detection unit, 255 average value calculation unit, 256 subtraction unit, 257 conversion unit, 258 quantization unit, 259 variable-length coding unit, 260 output terminal, 271 to 273 input terminals, 274 maximum correlation position detection unit, 275 average value calculation unit, 276 subtraction unit, 277 conversion unit, 278 quantization unit, 279 variable-length coding unit, 280 output terminal, 291 to 293 input terminals, 294 and 295 maximum correlation position detection units, 296 average value calculation unit, 297 subtraction unit, 298 conversion unit, 299 quantization unit, 300 variable-length coding unit, 301 output terminal, 311 and 312 input terminals, 313 maximum correlation position detection unit, 314 cutout unit, 315 subtraction unit, 316 conversion unit, 317 quantization unit, 318 variable-length coding unit, 319 output terminal, 331 and 332 input terminals, 333 maximum correlation position detection unit, 334 cutout unit, 335 subtraction unit, 336 conversion unit, 337 quantization unit, 338 variable-length coding unit, 339 output terminal, 351 input terminal, 352 conversion unit, 353 quantization unit, 354 variable-length coding unit, 355 output terminal, 361 and 362 input terminals, 363 decompression circuit, 364 difference information restoration unit, 365 synthesis unit, 366 output terminal, 371 difference data storage unit, 372 reference storage unit, 373 case ID determination unit, 374 variable-length decoding unit, 375 inverse quantization unit, 376 conversion unit, 377 maximum correlation position detection unit, 378 estimation unit, 379 addition unit, 401 CPU, 402 memory, 403 display controller, 404 input device interface, 405 network interface, 407 external device interface, 409 VTR interface, 410 VTR, 411 display, 412 keyboard, 413 mouse, 414 HDD, 415 media drive, 416 portable medium

Claims

An image processing apparatus that processes moving image data, comprising:
separation means for separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data, and remaining third moving image data obtained by removing the second moving image data from the first moving image data;
detection means for detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
estimation means for obtaining an estimated value of a block, obtained by dividing a frame of the third moving image data, from image data of the plurality of frames of the second moving image data in the positional relationship detected by the detection means; and
compression means for compressing the third moving image data in units of blocks using the estimated value of the block.

The image processing apparatus, wherein the second moving image data and the compression result of the third moving image data by the compression means are output independently as the compression result of the first moving image data.

The image processing apparatus according to claim 1, further comprising:
other detection means for detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
other estimation means for obtaining another estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector;
other compression means for compressing the third moving image data in units of blocks using the other estimated value of the block; and
selection means for selecting either the compressed data obtained by compression of the third moving image data by the compression means or the compressed data obtained by compression of the third moving image data by the other compression means, adding identification information identifying the selected compressed data, and outputting the result.

The image processing apparatus according to claim 3, wherein the selection means selects, from the compressed data obtained by compression of the third moving image data by the compression means and the compressed data obtained by compression of the third moving image data by the other compression means, the compressed data with the smaller data amount.

The image processing apparatus according to claim 3, wherein the second moving image data and the compressed data to which the identification information is added are independently output as a compression result of the first moving image data.

The image processing apparatus according to claim 1, wherein the estimation means obtains, as the estimated value of the block, a weighted average of the image data of the plurality of frames of the second moving image data in the positional relationship detected by the detection means.

The image processing apparatus according to claim 1, further comprising filter means for filtering the first moving image data using, as a passband, a region in the frequency domain defined by a frequency axis in the time direction and a frequency axis in the spatial direction, the region extending in a principal component direction, which is the direction of the principal component of the first moving image data, and having a specific width in the direction of the frequency axis in the time direction,
wherein the separation means separates the first moving image data filtered by the filter means into the second and third moving image data.

An image processing method for processing moving image data, comprising:
a separation step of separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data, and remaining third moving image data obtained by removing the second moving image data from the first moving image data;
a detection step of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
an estimation step of obtaining an estimated value of a block, obtained by dividing a frame of the third moving image data, from image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step; and
a compression step of compressing the third moving image data in units of blocks using the estimated value of the block.

The image processing method according to claim 8, further comprising:
another detection step of detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
another estimation step of obtaining another estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector;
another compression step of compressing the third moving image data in units of blocks using the other estimated value of the block; and
a selection step of selecting either the compressed data obtained by compressing the third moving image data in the compression step or the compressed data obtained by compressing the third moving image data in the other compression step, adding identification information identifying the selected compressed data, and outputting the result.

A program that causes a computer to process moving image data, the program comprising:
a separation step of separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data, and remaining third moving image data obtained by removing the second moving image data from the first moving image data;
a detection step of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
an estimation step of obtaining an estimated value of a block, obtained by dividing a frame of the third moving image data, from image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step; and
a compression step of compressing the third moving image data in units of blocks using the estimated value of the block.

The program according to claim 10, further comprising:
another detection step of detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
another estimation step of obtaining another estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector;
another compression step of compressing the third moving image data in units of blocks using the other estimated value of the block; and
a selection step of selecting either the compressed data obtained by compressing the third moving image data in the compression step or the compressed data obtained by compressing the third moving image data in the other compression step, adding identification information identifying the selected compressed data, and outputting the result.

A program recording medium on which is recorded a program that causes a computer to process moving image data, the program comprising:
a separation step of separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data, and remaining third moving image data obtained by removing the second moving image data from the first moving image data;
a detection step of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
an estimation step of obtaining an estimated value of a block, obtained by dividing a frame of the third moving image data, from image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step; and
a compression step of compressing the third moving image data in units of blocks using the estimated value of the block.

The program recording medium according to claim 12, wherein the program further comprises:
another detection step of detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
another estimation step of obtaining another estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector;
another compression step of compressing the third moving image data in units of blocks using the other estimated value of the block; and
a selection step of selecting either the compressed data obtained by compressing the third moving image data in the compression step or the compressed data obtained by compressing the third moving image data in the other compression step, adding identification information identifying the selected compressed data, and outputting the result.

A data structure of data obtained by compressing moving image data, comprising:
compressed data obtained by
separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data, and remaining third moving image data obtained by removing the second moving image data from the first moving image data,
detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data,
obtaining an estimated value of a block, obtained by dividing a frame of the third moving image data, from image data of the plurality of frames of the second moving image data in the positional relationship having the high correlation, and
compressing the block using the estimated value of the block; and
the second moving image data.

For another block of the third moving image data, one motion vector representing a positional relationship having a high correlation with the other block in a plurality of frames of the second moving image data is detected,
An estimated value of the other block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the other block is the positional relationship obtained from the one motion vector,
The data structure further includes other compressed data obtained by compressing the other block using the estimated value of the other block,
The compressed data and the other compressed data each include identification information for distinguishing one from the other, and
The data structure according to claim 14, wherein the other compressed data further includes the one motion vector.
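
One possible in-memory layout for such a data structure is sketched below in Python. The class and field names (`CompressedBlock`, `CompressedStream`, `mode_id`, `base_frames`), the 0/1 coding of the identification information, and the representation of compressed data as a residual list are assumptions made for illustration; the claims do not prescribe a concrete layout.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

Frame = List[List[int]]  # one frame as rows of pixel values


@dataclass
class CompressedBlock:
    """Compressed data for one block of the third moving image data."""
    mode_id: int                                      # identification information (assumed 0 or 1)
    residual: List[int]                               # block minus its estimated value
    motion_vector: Optional[Tuple[int, int]] = None   # carried only by the "other" compressed data

    def __post_init__(self):
        # A motion vector must be present exactly when the block is tagged as mode 1.
        if (self.mode_id == 1) != (self.motion_vector is not None):
            raise ValueError("motion vector must be present if and only if mode_id == 1")


@dataclass
class CompressedStream:
    """Second moving image data plus the compressed blocks of the third."""
    base_frames: List[Frame]                          # second moving image data (lower frame rate)
    blocks: List[CompressedBlock] = field(default_factory=list)
```

Blocks tagged with mode 0 carry no vector because the receiver can repeat the detection on the second moving image data it already holds.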

In a data recording medium on which data obtained by compressing moving image data is recorded,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
A positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
An estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data in the positional relationship having the high correlation,
A data recording medium on which data is recorded, the data comprising: compressed data obtained by compressing the block using the estimated value of the block; and
The second moving image data.

For another block of the third moving image data, one motion vector representing a positional relationship having a high correlation with the other block in a plurality of frames of the second moving image data is detected,
An estimated value of the other block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the other block is the positional relationship obtained from the one motion vector,
The recorded data further includes other compressed data obtained by compressing the other block using the estimated value of the other block,
The compressed data and the other compressed data each include identification information for distinguishing one from the other, and
The data recording medium according to claim 16, wherein the other compressed data further includes the one motion vector.

In an image processing apparatus that processes moving image data,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
A positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
An estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data in the positional relationship having the high correlation,
First restoration means for restoring compressed data obtained by compressing the block using the estimated value of the block to the third moving image data using the second moving image data;
A second restoring means for synthesizing the second and third moving image data and restoring the first moving image data;
The first restoration means includes
Detection means for detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
Estimation means for obtaining an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, from the image data of the plurality of frames of the second moving image data in the positional relationship detected by the detection means; and
An image processing apparatus comprising: block restoration means for restoring the third moving image data in units of blocks using the estimated value of the block and the compressed data.
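
The detection, estimation and block restoration performed on the receiving side can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the search is assumed to be point-symmetric about the target block position (the past region at `position - shift` is compared with the future region at `position + shift`), correlation is measured by the sum of absolute differences, the estimate is the plain average of the two regions, and the compressed data is taken to be a per-pixel residual. All function names are hypothetical.

```python
def region(frame, top, left, size):
    """Return the size x size region of a frame (a list of rows) at (top, left)."""
    return [row[left:left + size] for row in frame[top:top + size]]


def sad(a, b):
    """Sum of absolute differences between two equally sized regions."""
    return sum(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def detect_shift(past, future, top, left, size, search):
    """Find the shift whose past and future regions match best, using reference frames only."""
    height, width = len(past), len(past[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            pt, pl, ft, fl = top - dy, left - dx, top + dy, left + dx
            if min(pt, pl, ft, fl) < 0:
                continue  # region would start outside the frame
            if max(pt, ft) + size > height or max(pl, fl) + size > width:
                continue  # region would end outside the frame
            c = sad(region(past, pt, pl, size), region(future, ft, fl, size))
            if best is None or c < best[0]:
                best = (c, dy, dx)
    return best[1], best[2]


def estimate_block(past, future, top, left, size, shift):
    """Average the two matched regions to estimate the block lying between them."""
    dy, dx = shift
    p = region(past, top - dy, left - dx, size)
    f = region(future, top + dy, left + dx, size)
    return [[(a + b) // 2 for a, b in zip(rp, rf)] for rp, rf in zip(p, f)]


def restore_block(estimate, residual):
    """Add the transmitted residual to the estimate to recover the block."""
    return [[e + r for e, r in zip(re, rr)] for re, rr in zip(estimate, residual)]
```

Because `detect_shift` reads only the two reference frames, a transmitter and a receiver running the same search arrive at the same shift, so no vector has to be transmitted for this kind of block.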

The block has been compressed into either the compressed data or other compressed data,
The other compressed data being obtained by detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data,
Obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and
Compressing the block using the estimated value of the block,
The compressed data and the other compressed data each include identification information for distinguishing one from the other, and
The other compressed data further includes the one motion vector, and in this case,
When the identification information represents the compressed data,
The detection means detects a positional relationship having a high correlation in a plurality of frames of the second moving image data,
The estimation means obtains an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, from the image data of the plurality of frames of the second moving image data in the positional relationship detected by the detection means,
The block restoration means restores the block using the estimated value of the block and the compressed data,
When the identification information represents the other compressed data,
The detection means detects a positional relationship having a high correlation with the block in the plurality of frames of the second moving image data for the block from the one motion vector,
The estimation means determines an estimated value of the block from image data of a plurality of frames of the second moving image data, wherein the positional relationship with the block is a positional relationship determined from the one motion vector,
The image processing apparatus according to claim 18, wherein the block restoration means restores the block using the estimated value of the block and the other compressed data.
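
The two cases distinguished by the identification information can be sketched as a small dispatch function. The dictionary keys (`"id"`, `"motion_vector"`), the 0/1 coding, and the callable `search_reference_frames` are hypothetical stand-ins; the claim only requires that the positional relationship is detected from the second moving image data in one case and obtained from the transmitted motion vector in the other.

```python
def positional_relationship(compressed, search_reference_frames):
    """Return the shift used for estimation, depending on the identification information."""
    if compressed["id"] == 0:
        # No vector was transmitted: repeat the search on the second moving image data.
        return search_reference_frames()
    if compressed["id"] == 1:
        # The other compressed data carries the one motion vector found by the transmitter.
        return tuple(compressed["motion_vector"])
    raise ValueError("unknown identification information")
```

In both cases the subsequent estimation and block restoration proceed the same way; only the source of the positional relationship differs.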

The image processing apparatus according to claim 18, wherein the estimation means obtains, as the estimated value of the block, a weighted average of the image data of the plurality of frames of the second moving image data in the positional relationship detected by the detection means.
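
A weighted average of this kind can be sketched as below. The claim does not specify how the weights are chosen; the helper `temporal_weights`, which gives the temporally nearer reference frame the larger weight, is an assumption, as are both function names and the flat-list representation of a region.

```python
def temporal_weights(dist_past, dist_future):
    """Weights inversely related to temporal distance (an assumed weighting rule)."""
    if dist_past <= 0 or dist_future <= 0:
        raise ValueError("distances must be positive")
    total = dist_past + dist_future
    return dist_future / total, dist_past / total


def weighted_average(regions, weights):
    """Weighted average of co-located pixels across the reference regions."""
    if not regions or len(regions) != len(weights):
        raise ValueError("need one weight per region")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    n = len(regions[0])
    return [sum(w * r[i] for r, w in zip(regions, weights)) / total for i in range(n)]
```

For a block one frame after the past reference and three frames before the future reference, `temporal_weights(1, 3)` yields weights of 0.75 and 0.25, so the past region dominates the estimate.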

In an image processing method for processing moving image data,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
A positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
An estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data in the positional relationship having the high correlation,
A first restoration step of restoring compressed data obtained by compressing the block using the estimated value of the block to the third moving image data using the second moving image data;
A second restoration step of combining the second and third moving image data to restore the first moving image data;
The first restoration step includes
A detection step of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
An estimation step of obtaining an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, from the image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step; and
An image processing method comprising: a block restoration step of restoring the third moving image data in units of blocks using the estimated value of the block and the compressed data.

The block has been compressed into either the compressed data or other compressed data,
The other compressed data being obtained by detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data,
Obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and
Compressing the block using the estimated value of the block,
The compressed data and the other compressed data each include identification information for distinguishing one from the other, and
The other compressed data further includes the one motion vector, and in this case,
When the identification information represents the compressed data,
In the detecting step, a positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
In the estimation step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step,
In the block restoration step, the block is restored using the estimated value of the block and the compressed data,
When the identification information represents the other compressed data,
In the detection step, a positional relationship having a high correlation with the block in the plurality of frames of the second moving image data is detected for the block from the one motion vector,
In the estimating step, an estimated value of the block is obtained from image data of a plurality of frames of the second moving image data, wherein the positional relationship with the block is a positional relationship obtained from the one motion vector,
The image processing method according to claim 21, wherein in the block restoration step, the block is restored using the estimated value of the block and the other compressed data.

In a program that causes a computer to process moving image data,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
A positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
An estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data in the positional relationship having the high correlation,
A first restoration step of restoring compressed data obtained by compressing the block using the estimated value of the block to the third moving image data using the second moving image data;
A second restoration step of combining the second and third moving image data to restore the first moving image data;
The first restoration step includes
A detection step of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
An estimation step of obtaining an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, from the image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step; and
A program comprising: a block restoration step of restoring the third moving image data in units of blocks using the estimated value of the block and the compressed data.

The block has been compressed into either the compressed data or other compressed data,
The other compressed data being obtained by detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data,
Obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and
Compressing the block using the estimated value of the block,
The compressed data and the other compressed data each include identification information for distinguishing one from the other, and
The other compressed data further includes the one motion vector, and in this case,
When the identification information represents the compressed data,
In the detecting step, a positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
In the estimation step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step,
In the block restoration step, the block is restored using the estimated value of the block and the compressed data,
When the identification information represents the other compressed data,
In the detection step, a positional relationship having a high correlation with the block in the plurality of frames of the second moving image data is detected for the block from the one motion vector,
In the estimating step, an estimated value of the block is obtained from image data of a plurality of frames of the second moving image data, wherein the positional relationship with the block is a positional relationship obtained from the one motion vector,
The program according to claim 23, wherein in the block restoration step, the block is restored using the estimated value of the block and the other compressed data.

In a program recording medium on which a program for causing a computer to process moving image data is recorded,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
A positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
An estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data in the positional relationship having the high correlation,
A first restoration step of restoring compressed data obtained by compressing the block using the estimated value of the block to the third moving image data using the second moving image data;
A second restoration step of combining the second and third moving image data to restore the first moving image data;
The first restoration step includes
A detection step of detecting a positional relationship having a high correlation in a plurality of frames of the second moving image data;
An estimation step of obtaining an estimated value of a block obtained by dividing a frame of the third moving image data into blocks, from the image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step; and
A program recording medium on which a program is recorded, comprising: a block restoring step of restoring the third moving image data in units of blocks using the estimated value of the block and the compressed data.

The block has been compressed into either the compressed data or other compressed data,
The other compressed data being obtained by detecting, for the block, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data,
Obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector, and
Compressing the block using the estimated value of the block,
The compressed data and the other compressed data each include identification information for distinguishing one from the other, and
The other compressed data further includes the one motion vector, and in this case,
When the identification information represents the compressed data,
In the detecting step, a positional relationship having a high correlation is detected in a plurality of frames of the second moving image data,
In the estimation step, an estimated value of a block obtained by dividing a frame of the third moving image data into blocks is obtained from the image data of the plurality of frames of the second moving image data in the positional relationship detected in the detection step,
In the block restoration step, the block is restored using the estimated value of the block and the compressed data,
When the identification information represents the other compressed data,
In the detection step, a positional relationship having a high correlation with the block in the plurality of frames of the second moving image data is detected for the block from the one motion vector,
In the estimating step, an estimated value of the block is obtained from image data of a plurality of frames of the second moving image data, wherein the positional relationship with the block is a positional relationship obtained from the one motion vector,
The program recording medium according to claim 25, wherein in the block restoration step, the block is restored using the estimated value of the block and the other compressed data.

In an image processing apparatus that processes moving image data,
Separation means for separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data;
Detection means for detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
Estimation means for obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector; and
An image processing apparatus comprising: compression means for compressing the third moving image data in units of blocks using the estimated value of the block.
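
The separation, detection, estimation and compression stages of the transmitting side can be sketched as follows. This is a simplified illustration under assumptions that the claim does not state: the second moving image data is taken to be every `step`-th frame, the target frame is assumed to lie midway between a past and a future reference frame, the single motion vector is applied with opposite signs to the two references, the estimate is their plain average, and the "compressed data" is a per-pixel residual plus the vector. All function names are hypothetical.

```python
def separate(frames, step):
    """Split frames into a low-rate sequence (every step-th frame) and the remainder."""
    second = frames[::step]
    third = [f for i, f in enumerate(frames) if i % step != 0]
    return second, third


def region(frame, top, left, size):
    """Return the size x size region of a frame (a list of rows) at (top, left)."""
    return [row[left:left + size] for row in frame[top:top + size]]


def estimate(past, future, top, left, size, mv):
    """Average the past region at position - mv and the future region at position + mv."""
    dy, dx = mv
    p = region(past, top - dy, left - dx, size)
    f = region(future, top + dy, left + dx, size)
    return [[(a + b) // 2 for a, b in zip(rp, rf)] for rp, rf in zip(p, f)]


def detect_motion_vector(block, past, future, top, left, size, search):
    """Search for the one vector whose estimate is closest to the actual block."""
    height, width = len(past), len(past[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            tops, lefts = (top - dy, top + dy), (left - dx, left + dx)
            if min(tops + lefts) < 0 or max(tops) + size > height or max(lefts) + size > width:
                continue  # one of the two regions would leave the frame
            est = estimate(past, future, top, left, size, (dy, dx))
            cost = sum(abs(b - e) for rb, re in zip(block, est) for b, e in zip(rb, re))
            if best is None or cost < best[0]:
                best = (cost, (dy, dx))
    return best[1]


def compress_block(block, past, future, top, left, size, search):
    """Return the motion vector and the residual of the block against its estimate."""
    mv = detect_motion_vector(block, past, future, top, left, size, search)
    est = estimate(past, future, top, left, size, mv)
    res = [[b - e for b, e in zip(rb, re)] for rb, re in zip(block, est)]
    return {"motion_vector": mv, "residual": res}
```

Unlike a search that uses only the reference frames, this search looks at the block itself, which is why the resulting vector has to be included in the compressed data for the receiver.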

The image processing apparatus, wherein the second moving image data and the compression result of the third moving image data produced by the compression means are output independently of each other as the compression result of the first moving image data.

The image processing apparatus according to claim 27, wherein the estimation means obtains, as the estimated value of the block, a weighted average of the image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector.

Filter means for filtering the first moving image data with, as a passband, a region that, in a frequency domain defined by a frequency axis in the time direction and a frequency axis in the spatial direction, extends in a principal component direction, which is the direction of the principal component of the first moving image data, and has a specific width in the direction of the frequency axis in the time direction,
The image processing apparatus according to claim 27, further comprising the filter means, wherein the separation means separates the first moving image data filtered by the filter means into the second and third moving image data.

In an image processing method for processing moving image data,
A separation step of separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data;
A detection step of detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
An estimation step of obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector; and
An image processing method comprising: a compression step of compressing the third moving image data in units of blocks using the estimated value of the block.

In a program that causes a computer to process moving image data,
A separation step of separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data;
A detection step of detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
An estimation step of obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector; and
A program comprising: a compression step of compressing the third moving image data in units of blocks using the estimated value of the block.

In a program recording medium on which a program for causing a computer to process moving image data is recorded,
A separation step of separating first moving image data into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data;
A detection step of detecting, for a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data;
An estimation step of obtaining an estimated value of the block from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector; and
A program recording medium on which a program is recorded, comprising: a compression step of compressing the third moving image data in units of blocks using the estimated value of the block.

In a data structure of data obtained by compressing moving image data,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector,
A data structure comprising: compressed data obtained by compressing the block using the estimated value of the block; and
The second moving image data,
Wherein the compressed data includes the one motion vector.

In a data recording medium on which data obtained by compressing moving image data is recorded,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector,
A data recording medium on which data is recorded, the data comprising: compressed data obtained by compressing the block using the estimated value of the block; and
The second moving image data,
Wherein the compressed data includes the one motion vector.

In an image processing apparatus that processes moving image data,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector,
First restoration means for restoring compressed data obtained by compressing the block using the estimated value of the block to the third moving image data using the second moving image data;
A second restoring means for synthesizing the second and third moving image data and restoring the first moving image data;
The first restoration means includes
Detecting means for obtaining a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data from the one motion vector;
An estimation means for obtaining an estimated value of the block from image data of a plurality of frames of the second moving image data, wherein the positional relationship with the block is a positional relationship obtained from the one motion vector;
An image processing apparatus comprising: a block restoration unit that restores the block using the estimated value of the block and the compressed data.

In an image processing method for processing moving image data,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector,
A first restoration step of restoring compressed data obtained by compressing the block using the estimated value of the block to the third moving image data using the second moving image data;
A second restoration step of combining the second and third moving image data to restore the first moving image data;
The first restoration step includes
From the one motion vector, a detection step for obtaining a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data for the block;
An estimation step for obtaining an estimated value of the block from image data of a plurality of frames of the second moving image data, wherein the positional relationship with the block is a positional relationship obtained from the one motion vector;
An image processing method comprising: a block restoration step of restoring the block using the estimated value of the block and the compressed data.

In a program that causes a computer to process moving image data,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector,
A first restoration step of restoring compressed data obtained by compressing the block using the estimated value of the block to the third moving image data using the second moving image data;
A second restoration step of combining the second and third moving image data to restore the first moving image data;
The first restoration step includes
From the one motion vector, a detection step for obtaining a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data for the block;
An estimation step for obtaining an estimated value of the block from image data of a plurality of frames of the second moving image data, wherein the positional relationship with the block is a positional relationship obtained from the one motion vector;
A program comprising: a block restoration step of restoring the block using the estimated value of the block and the compressed data.

In a program recording medium on which a program for causing a computer to process moving image data is recorded,
First moving image data is separated into second moving image data having a frame rate lower than the frame rate of the first moving image data and the remaining third moving image data obtained by removing the second moving image data from the first moving image data,
For a block obtained by dividing a frame of the third moving image data into blocks, one motion vector representing a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data is detected,
An estimated value of the block is obtained from image data of the plurality of frames of the second moving image data whose positional relationship with the block is the positional relationship obtained from the one motion vector,
A first restoration step of restoring compressed data obtained by compressing the block using the estimated value of the block to the third moving image data using the second moving image data;
A second restoration step of combining the second and third moving image data to restore the first moving image data;
The first restoration step includes
From the one motion vector, a detection step for obtaining a positional relationship having a high correlation with the block in a plurality of frames of the second moving image data for the block;
An estimation step for obtaining an estimated value of the block from image data of a plurality of frames of the second moving image data, wherein the positional relationship with the block is a positional relationship obtained from the one motion vector;
A program recording medium on which a program is recorded, comprising: a block restoring step for restoring the block using the estimated value of the block and the compressed data.