JP5742886B2

JP5742886B2 - Stereoscopic video encoding apparatus, stereoscopic video decoding apparatus, stereoscopic video encoding system, stereoscopic video encoding program, and stereoscopic video decoding program

Info

Publication number: JP5742886B2
Application number: JP2013128401A
Authority: JP
Inventors: 貴宏山崎
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2013-06-19
Filing date: 2013-06-19
Publication date: 2015-07-01
Anticipated expiration: 2033-06-19
Also published as: JP2015005807A

Description

本発明は、立体映像符号化装置、立体映像復号装置及び立体映像符号化システム、並びに、立体映像符号化プログラム及び立体映像復号プログラムに関し、例えば、左目用画像、右目用画像（本明細書においては、画像データを単に画像と呼んでいることもある）をそれぞれの目に投影することで立体感が得られる動画像データを送受信する場合に適用し得るものである。 The present invention relates to a stereoscopic video encoding device, a stereoscopic video decoding device, a stereoscopic video encoding system, a stereoscopic video encoding program, and a stereoscopic video decoding program. For example, the left-eye image and the right-eye image (in this specification, The image data may be simply referred to as an image), and can be applied to transmission / reception of moving image data in which a stereoscopic effect can be obtained by projecting each eye.

人の視覚に立体的な動画像を認知させるための方法として、それぞれ異なる方向から撮影した右目用画像（動画像）及び左目用画像（動画像）を用意し、右目用画像を右目に、左目用画像を左目に投影するような表示機器を用いて、観察者に立体感を認識させる方法が広く知られている（例えば、特許文献１参照）。このような視差を持った動画像を伝送する方法として、右目用画像及び左目用画像を符号化かつ多重化したデータを作成して伝送する方法がある。 As a method for recognizing a three-dimensional moving image to human vision, a right eye image (moving image) and a left eye image (moving image) taken from different directions are prepared, and the right eye image is used as the right eye and the left eye. A method of causing an observer to recognize a stereoscopic effect using a display device that projects a commercial image to the left eye is widely known (see, for example, Patent Document 1). As a method of transmitting a moving image having such parallax, there is a method of generating and transmitting data obtained by encoding and multiplexing a right-eye image and a left-eye image.

右目用画像及び左目用画像を多重化かつ符号化する従来方法として、右目用画像及び左目用画像をそれぞれ、横方向（若しくは縦方向）に半分に圧縮した後、横方向（若しくは縦方向）に並置して１枚の画像に合成し、その合成画像を符号化する方法があった。この従来方法では、１枚の符号化データの中に同時刻の右目用画像及び左目用画像の情報が含まれるので、伝送中のエラーによりパケットロスが発生したとしても、左右視点間（右目用画像及び左目用画像間）の同期を確保することができる。 As a conventional method of multiplexing and encoding a right-eye image and a left-eye image, the right-eye image and the left-eye image are each compressed in half in the horizontal direction (or vertical direction) and then in the horizontal direction (or vertical direction). There is a method in which the images are combined and combined into one image, and the combined image is encoded. In this conventional method, information on the right-eye image and the left-eye image at the same time is included in one piece of encoded data, so even if a packet loss occurs due to an error during transmission, the left-right viewpoint (for the right eye) Synchronization between the image and the left-eye image) can be ensured.

特開２００６−１３５７８３号公報JP 2006-135783 A

しかしながら、上述した従来の方法では、合成画像内で、対応する点同士の距離が離れているため、パケットロスが発生した場合、エラーの伝播により、復号画像の対応する領域同士の差が大きくなることが起こり得る。ここで、対応する点又は領域とは、立体視した場合（右目用画像及び左目用画像をそれぞれ、それようの表示機器で見た場合）に同じ位置に知覚される点又は領域のことを指す。合成画像は、右目用画像及び左目用画像を合成しているため、合成画像中に対応する点や領域が存在することになる。 However, in the conventional method described above, the distance between corresponding points in the composite image is far away, so that when a packet loss occurs, the difference between corresponding regions in the decoded image increases due to error propagation. Can happen. Here, the corresponding point or region refers to a point or region that is perceived at the same position when viewed stereoscopically (when the image for the right eye and the image for the left eye are viewed with such a display device, respectively). . Since the composite image combines the right-eye image and the left-eye image, there are corresponding points and regions in the composite image.

図１３を用いて、エラーの伝播により、対応する領域同士に大きな差が生じ得ることを説明する。今、右目用画像の縮小画像と左目用画像の縮小画像とが並置合成された、図１３（Ａ）に示すｎフレーム目の合成画像において、パケットロスにより、領域Ｙが欠落したとする。なお、図１３（Ａ）及び（Ｂ）における領域ＸＬ及びＸＲは左目用画像と右目用画像の対応する領域とする。 With reference to FIG. 13, it will be described that a large difference may occur between corresponding regions due to error propagation. Assume that the region Y is lost due to packet loss in the composite image of the nth frame shown in FIG. 13A, in which the reduced image of the right-eye image and the reduced image of the left-eye image are combined in parallel. Note that regions XL and XR in FIGS. 13A and 13B are regions corresponding to the left-eye image and the right-eye image.

ｎフレーム目において、パケットロスによる領域Ｙの欠落により、この領域Ｙのデータは、補間処理や前フレームのデータ流用などが行われ、その結果、復号で得られる本来のデータとは異なったものとなる。すなわち、ノイズが混入したものとなる。ｎ＋１フレーム目において、復号の際、左目用画像の領域ＸＬは領域Ｙを参照しており、右目用画像の領域ＸＲは領域Ｙを参照していなかった場合、領域ＸＬの復号データにはノイズが混入することになるが、領域ＸＲの復号データにはノイズが混入しないことになる。この場合、対応する領域にも拘わらず、左右の視点間（右目用画像及び左目用画像間）で差が生じることになる。 In the nth frame, due to the lack of the area Y due to packet loss, the data in this area Y is subjected to interpolation processing, data diversion of the previous frame, etc. As a result, it differs from the original data obtained by decoding Become. That is, noise is mixed. In the (n + 1) th frame, when decoding, if the region XL of the left-eye image refers to the region Y and the region XR of the right-eye image does not refer to the region Y, the decoded data in the region XL has noise. Although mixed, noise is not mixed in the decoded data in the region XR. In this case, a difference occurs between the left and right viewpoints (between the right-eye image and the left-eye image) regardless of the corresponding region.

立体映像において、左右視点間で対応する領域間で差が大きい場合、立体感が感じられないものとなってしまう。さらに、視点間での画像の差が大きい場合、観察者の脳内で、視野闘争と呼ばれる現象が起こる。これは、右目に入る映像と左目に入る映像に大きな差がある場合、どちらかの視点の映像のみを脳が認識し、それが短時間で入れ替わって知覚されるという現象である。このような現象が起こると、ちらつきが感じられ、不快な立体映像となってしまう。 In a stereoscopic image, when there is a large difference between corresponding regions between the left and right viewpoints, the stereoscopic effect is not felt. Furthermore, when the image difference between the viewpoints is large, a phenomenon called visual field struggle occurs in the observer's brain. This is a phenomenon in which, when there is a large difference between the video entering the right eye and the video entering the left eye, the brain recognizes only the video from either viewpoint and perceives it by switching in a short time. When such a phenomenon occurs, flickering is felt, resulting in an unpleasant stereoscopic image.

そのため、右目用画像及び左目用画像を多重化かつ符号化した情報の一部が欠落しても、復号された右目用画像及び左目用画像が立体感を感じられ、かつ、視野闘争を引き起こさない立体映像符号化装置、立体映像復号装置及び立体映像符号化システム、並びに、立体映像符号化プログラム及び立体映像復号プログラムが望まれている。 Therefore, even if a part of information obtained by multiplexing and encoding the right-eye image and the left-eye image is lost, the decoded right-eye image and left-eye image can feel a three-dimensional effect and do not cause a visual field conflict. A stereoscopic video encoding apparatus, a stereoscopic video decoding apparatus, a stereoscopic video encoding system, a stereoscopic video encoding program, and a stereoscopic video decoding program are desired.

第１の本発明の立体映像符号化装置は、（１）縦横の画素数が同じ左目用画像及び右目用画像を多重して多重画像を生成する画像多重手段と、（２）生成された多重画像を符号化する符号化手段とを備え、（３）上記画像多重手段は、縦方向が同一の上記左目用画像及び上記右目用画像の横方向ラインの所定範囲の情報を、所定の画素数単位で横方向に交互に混在させて、縦方向が同じ上記多重画像の横方向ラインの情報を生成し、（４）上記横方向ラインの所定範囲が、全ての横方向ラインで同じであって、上記左目用画像及び上記右目用画像の視差量に応じて定まり、上記左目用画像及び上記右目用画像に対応する領域が存在する範囲であることを特徴とする。
第２の本発明の立体映像符号化装置は、（１）縦横の画素数が同じ左目用画像及び右目用画像を多重して多重画像を生成する画像多重手段と、（２）生成された多重画像を符号化する符号化手段とを備え、（３）上記画像多重手段は、縦方向が同一の上記左目用画像及び上記右目用画像の横方向ラインの所定範囲の情報を、所定の画素数単位で横方向に交互に混在させて、縦方向が同じ上記多重画像の横方向ラインの情報を生成し、（４）上記横方向ラインの所定範囲が、横方向ライン毎、若しくは、数ラインでなる横方向ラインの組毎に相違し、処理対象の横方向ライン若しくは横方向ライン組の、上記左目用画像及び上記右目用画像の視差量に応じて定まり、上記左目用画像及び上記右目用画像に対応する領域が存在する範囲であり、（５）上記左目用画像及び上記右目用画像の相関が高い領域の探索を行い、その探索により得られた複数の視差量のうち、最大の視差量若しくは最大頻度の視差量を、上記左目用画像及び上記右目用画像の視差量とする視差パラメータ算出手段を備えることを特徴とする。 The stereoscopic video encoding apparatus according to the first aspect of the present invention includes (1) image multiplexing means for generating a multiplexed image by multiplexing a left-eye image and a right-eye image having the same number of vertical and horizontal pixels, and (2) generated multiplexing And (3) the image multiplexing unit is configured to store information on a predetermined range of horizontal lines of the left-eye image and the right-eye image in the same vertical direction with a predetermined number of pixels. By alternately mixing in the horizontal direction in units, information on the horizontal lines of the multiple images having the same vertical direction is generated . (4) The predetermined range of the horizontal lines is the same for all the horizontal lines. It is determined according to the amount of parallax between the left-eye image and the right-eye image, and is a range in which there are areas corresponding to the left-eye image and the right-eye image .
The stereoscopic video encoding apparatus according to the second aspect of the present invention includes (1) image multiplexing means for generating a multiplexed image by multiplexing a left-eye image and a right-eye image having the same number of vertical and horizontal pixels, and (2) generated multiplexing And (3) the image multiplexing means stores information on a predetermined range of horizontal lines of the left-eye image and the right-eye image having the same vertical direction in a predetermined number of pixels. The horizontal line information of the multiple images having the same vertical direction is generated by alternately mixing in the horizontal direction in units. (4) The predetermined range of the horizontal line is set for each horizontal line or several lines. The left-eye image and the right-eye image, which are different for each set of horizontal lines, are determined according to the amount of parallax between the left-eye image and the right-eye image of the horizontal line or the horizontal line set to be processed. Is a range where there is an area corresponding to ( ) A search is performed for a region having a high correlation between the left-eye image and the right-eye image, and among the plurality of parallax amounts obtained by the search, the maximum parallax amount or the highest-frequency parallax amount is determined as the left-eye image and Disparity parameter calculation means for setting the amount of parallax for the right-eye image is provided.

第３の本発明の立体映像復号装置は、第１の本発明の立体映像符号化装置が形成した多重画像の符号化データが入力されると共に第１の本発明の立体映像符号化装置から視差量の情報が入力されるものであって、（１）入力された多重画像の符号化データを復号する復号手段と、（２）復号により得られた多重画像に対して、入力された視差量の情報を適用して、第１の本発明の立体映像符号化装置における画像多重手段の逆処理の多重分離を行うことにより、左目用画像及び右目用画像を分離抽出する画像多重分離手段とを備えることを特徴とする。
第４の本発明の立体映像復号装置は、第２の本発明の立体映像符号化装置が形成した多重画像の符号化データが入力されると共に第２の本発明の立体映像符号化装置から視差量の情報が入力されるものであって、（１）入力された多重画像の符号化データを復号する復号手段と、（２）復号により得られた多重画像に対して、入力された視差量の情報を適用して、第２の本発明の立体映像符号化装置における画像多重手段の逆処理の多重分離を行うことにより、左目用画像及び右目用画像を分離抽出する画像多重分離手段とを備えることを特徴とする。 The stereoscopic video decoding apparatus according to the third aspect of the present invention receives the encoded data of the multiplexed image formed by the stereoscopic video encoding apparatus according to the first aspect of the present invention and receives the parallax from the stereoscopic video encoding apparatus of the first aspect of the present invention. be one amount of information is input, (1) decoding means for decoding the encoded data of the input multiplexed image, (2) for multiple image obtained by decoding, parallax amount inputted The image demultiplexing means for separating and extracting the left-eye image and the right-eye image by performing demultiplexing of the inverse processing of the image multiplexing means in the stereoscopic video encoding device of the first aspect of the present invention by applying the above information It is characterized by providing.
The stereoscopic video decoding apparatus according to the fourth aspect of the present invention receives the encoded data of the multiplexed image formed by the stereoscopic video encoding apparatus according to the second aspect of the present invention and receives the parallax from the stereoscopic video encoding apparatus of the second aspect of the present invention. (1) decoding means for decoding the encoded data of the input multiple image, and (2) the amount of parallax input to the multiple image obtained by decoding. The image demultiplexing means for separating and extracting the left-eye image and the right-eye image by performing demultiplexing of the inverse processing of the image multiplexing means in the stereoscopic video encoding device of the second aspect of the present invention by applying the above information It is characterized by providing.

第５の本発明の立体映像符号化システムは、第１の本発明の立体映像符号化装置と、第３の本発明の立体映像復号装置とを有することを特徴とする。
第６の本発明の立体映像符号化システムは、第２の本発明の立体映像符号化装置と、第４の本発明の立体映像復号装置とを有することを特徴とする。 Fifth stereoscopic video encoding system of the present invention is characterized by having a three-dimensional video encoding apparatus of the first aspect of the present invention, a stereoscopic video decoding apparatus of the third invention.
The stereoscopic video encoding system of the sixth aspect of the present invention includes the stereoscopic video encoding apparatus of the second aspect of the present invention and the stereoscopic video decoding apparatus of the fourth aspect of the present invention.

第７の本発明の立体映像符号化プログラムは、コンピュータを、（１）縦横の画素数が同じ左目用画像及び右目用画像を多重して多重画像を生成するものであって、縦方向が同一の上記左目用画像及び上記右目用画像の横方向ラインの所定範囲の情報を、所定の画素数単位で横方向に交互に混在させて、縦方向が同じ上記多重画像の横方向ラインの情報を生成する画像多重手段と、（２）生成された多重画像を符号化する符号化手段として機能させる立体映像符号化プログラムであって、（３）上記横方向ラインの所定範囲が、全ての横方向ラインで同じであって、上記左目用画像及び上記右目用画像の視差量に応じて定まり、視差に拘わらず、上記左目用画像及び上記右目用画像に対応する領域が存在する範囲であることを特徴とする。
第８の本発明の立体映像符号化プログラムは、コンピュータを、（１）縦横の画素数が同じ左目用画像及び右目用画像を多重して多重画像を生成するものであって、縦方向が同一の上記左目用画像及び上記右目用画像の横方向ラインの所定範囲の情報を、所定の画素数単位で横方向に交互に混在させて、縦方向が同じ上記多重画像の横方向ラインの情報を生成する画像多重手段と、（２）生成された多重画像を符号化する符号化手段と、（３）上記左目用画像及び上記右目用画像の相関が高い領域の探索を行い、その探索により得られた複数の視差量のうち、最大の視差量若しくは最大頻度の視差量を、上記左目用画像及び上記右目用画像の視差量とする視差パラメータ算出手段として機能させる立体映像符号化プログラムであって、（４）上記横方向ラインの所定範囲が、横方向ライン毎、若しくは、数ラインでなる横方向ラインの組毎に相違し、処理対象の横方向ライン若しくは横方向ライン組の、上記左目用画像及び上記右目用画像の視差量に応じて定まり、視差に拘わらず、上記左目用画像及び上記右目用画像に対応する領域が存在する範囲であることを特徴とする。 According to a seventh aspect of the present invention, there is provided a three-dimensional video encoding program for generating a multiple image by multiplexing a left-eye image and a right-eye image having the same number of vertical and horizontal pixels, and having the same vertical direction. The information of the predetermined range of the horizontal line of the left eye image and the right eye image of the left eye image and the right eye image is alternately mixed in the horizontal direction in units of a predetermined number of pixels, and the horizontal line information of the multiple images having the same vertical direction is obtained. And (2) a stereoscopic video encoding program that functions as an encoding unit that encodes the generated multiplexed image , and (3) a predetermined range of the horizontal line includes all horizontal directions It is the same in the line, and is determined according to the amount of parallax between the left-eye image and the right-eye image, and the range corresponding to the left-eye image and the right-eye image exists regardless of the parallax. Features.
According to an eighth aspect of the present invention, there is provided a stereoscopic video encoding program for generating a multiple image by multiplexing a left-eye image and a right-eye image having the same number of vertical and horizontal pixels and having the same vertical direction. The information of the predetermined range of the horizontal line of the left eye image and the right eye image of the left eye image and the right eye image is alternately mixed in the horizontal direction in units of a predetermined number of pixels, and the horizontal line information of the multiple images having the same vertical direction is obtained. (2) encoding means for encoding the generated multiplexed image; and (3) searching for a region having a high correlation between the left-eye image and the right-eye image, and obtaining by the search. A stereoscopic video encoding program that functions as a parallax parameter calculation unit that uses a parallax amount of the left-eye image and the right-eye image as a parallax amount of the left-eye image and the right-eye image among a plurality of parallax amounts obtained. , (4) The predetermined range of horizontal lines is different for each horizontal line or for each set of horizontal lines consisting of several lines, and the left eye image and the right eye of the horizontal line or horizontal line set to be processed It is determined according to the amount of parallax of the image for use, and is a range in which areas corresponding to the image for the left eye and the image for the right eye exist regardless of the parallax.

第９の本発明の立体映像復号プログラムは、第１の本発明の立体映像符号化装置が形成した多重画像の符号化データが入力されると共に第１の本発明の立体映像符号化装置から視差量の情報が入力される装置に搭載されるコンピュータを、（１）入力された多重画像の符号化データを復号する復号手段と、（２）復号により得られた多重画像に対して、入力された視差量の情報を適用して、第１の本発明の立体映像符号化装置における画像多重手段の逆処理の多重分離を行うことにより、左目用画像及び右目用画像を分離抽出する画像多重分離手段として機能させることを特徴とする。
第１０の本発明の立体映像復号プログラムは、第２の本発明の立体映像符号化装置が形成した多重画像の符号化データが入力されると共に第２の本発明の立体映像符号化装置から視差量の情報が入力される装置に搭載されるコンピュータを、（１）入力された多重画像の符号化データを復号する復号手段と、（２）復号により得られた多重画像に対して、入力された視差量の情報を適用して、第２の本発明の立体映像符号化装置における画像多重手段の逆処理の多重分離を行うことにより、左目用画像及び右目用画像を分離抽出する画像多重分離手段として機能させることを特徴とする。 The stereoscopic video decoding program of the ninth aspect of the present invention receives the encoded data of the multiplexed image formed by the stereoscopic video encoding apparatus of the first aspect of the present invention, and receives the parallax from the stereoscopic video encoding apparatus of the first aspect of the present invention. A computer mounted on a device to which the quantity information is input is input to (1) decoding means for decoding encoded data of the input multiple image, and (2) input to the multiple image obtained by decoding. Image demultiplexing that separates and extracts a left-eye image and a right-eye image by performing demultiplexing by inverse processing of the image multiplexing means in the stereoscopic video encoding device of the first aspect of the present invention by applying the parallax information It is made to function as a means.
The stereoscopic video decoding program according to the tenth aspect of the present invention receives encoded data of a multiplexed image formed by the stereoscopic video encoding apparatus according to the second aspect of the present invention and receives parallax from the stereoscopic video encoding apparatus according to the second aspect of the present invention. A computer mounted on a device to which the quantity information is input is input to (1) decoding means for decoding encoded data of the input multiple image, and (2) input to the multiple image obtained by decoding. Image demultiplexing that separates and extracts a left-eye image and a right-eye image by performing demultiplexing of inverse processing of the image multiplexing means in the stereoscopic video encoding device of the second aspect of the present invention by applying the information on the amount of parallax It is made to function as a means.

本発明によれば、右目用画像及び左目用画像を多重化かつ符号化した情報の一部が欠落しても、復号された右目用画像及び左目用画像が立体感を感じられ、かつ、視野闘争を引き起こさない立体映像符号化装置、立体映像復号装置及び立体映像符号化システム、並びに、立体映像符号化プログラム及び立体映像復号プログラムを提供できる。 According to the present invention, even if a part of information obtained by multiplexing and encoding the right-eye image and the left-eye image is lost, the decoded right-eye image and left-eye image can feel a stereoscopic effect, and the field of view A stereoscopic video encoding device, a stereoscopic video decoding device, a stereoscopic video encoding system, a stereoscopic video encoding program, and a stereoscopic video decoding program that do not cause a conflict can be provided.

第１の実施形態の立体映像符号化システムを含む立体映像通信システムの構成を示すブロック図である。1 is a block diagram illustrating a configuration of a stereoscopic video communication system including a stereoscopic video encoding system according to a first embodiment. 図１の多重部の機能の説明図である。It is explanatory drawing of the function of the multiplexing part of FIG. 図１の立体映像送信装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the stereo image transmission apparatus of FIG. 図１の立体映像受信装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the three-dimensional video receiver of FIG. 第２〜第４の実施形態で視差パラメータを導入した理由と、視差パラメータを利用した多重部の機能との説明図である。It is explanatory drawing of the reason which introduced the parallax parameter in the 2nd-4th embodiment, and the function of the multiplexing part using a parallax parameter. 第２の実施形態の立体映像符号化システムを含む立体映像通信システムの構成を示すブロック図である。It is a block diagram which shows the structure of the stereoscopic video communication system containing the stereoscopic video encoding system of 2nd Embodiment. 図６の立体映像送信装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the stereoscopic video transmission apparatus of FIG. 図６の立体映像受信装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the three-dimensional video receiver of FIG. 第３の実施形態の立体映像符号化システムを含む立体映像通信システムの構成を示すブロック図である。It is a block diagram which shows the structure of the stereoscopic video communication system containing the stereoscopic video encoding system of 3rd Embodiment. 第４の実施形態の立体映像符号化システムを含む立体映像通信システムの構成を示すブロック図である。It is a block diagram which shows the structure of the stereoscopic video communication system containing the stereoscopic video encoding system of 4th Embodiment. 図１０の立体映像送信装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the stereo image transmission apparatus of FIG. 図１０の立体映像受信装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the three-dimensional video receiver of FIG. 右目用画像及び左目用画像を多重化かつ符号化する従来方法の課題の説明図である。It is explanatory drawing of the subject of the conventional method which multiplexes and encodes the image for right eyes, and the image for left eyes.

（Ａ）第１の実施形態
以下、本発明による立体映像符号化装置、立体映像復号装置及び立体映像符号化システム、並びに、立体映像符号化プログラム及び立体映像復号プログラムの第１の実施形態を、図面を参照しながら説明する。 (A) First Embodiment Hereinafter, a first embodiment of a stereoscopic video encoding device, a stereoscopic video decoding device, a stereoscopic video encoding system, a stereoscopic video encoding program, and a stereoscopic video decoding program according to the present invention will be described. This will be described with reference to the drawings.

第１の実施形態の立体映像符号化システムは、立体映像通信システムに組み込まれたものである。 The stereoscopic video encoding system of the first embodiment is incorporated in a stereoscopic video communication system.

（Ａ−１）第１の実施形態の構成
図１は、第１の実施形態の立体映像符号化システムを含む立体映像通信システムの構成を示すブロック図である。 (A-1) Configuration of First Embodiment FIG. 1 is a block diagram showing a configuration of a stereoscopic video communication system including the stereoscopic video encoding system of the first embodiment.

図１において、立体映像通信システム１は、立体映像送信装置１０及び立体映像受信装置２０を有する。 In FIG. 1, the stereoscopic video communication system 1 includes a stereoscopic video transmission device 10 and a stereoscopic video reception device 20.

立体映像送信装置１０は、画像多重部１１、符号化部１２及び送信部１３を有する。ここで、画像多重部１１及び符号化部１２は、第１の実施形態の立体映像符号化装置の構成要素である。少なくとも画像多重部１１及び符号化部１２の部分は、ハードウェア的に各種回路を接続して構築されても良く、また、ＣＰＵ、ＲＯＭ、ＲＡＭなどを有する汎用的な装置若しくはユニットが所定のプログラムを実行することで該当する機能を実現するように構築されても良い。 The stereoscopic video transmission device 10 includes an image multiplexing unit 11, an encoding unit 12, and a transmission unit 13. Here, the image multiplexing unit 11 and the encoding unit 12 are components of the stereoscopic video encoding apparatus of the first embodiment. At least the image multiplexing unit 11 and the encoding unit 12 may be constructed by connecting various circuits in hardware, and a general-purpose device or unit having a CPU, a ROM, a RAM, and the like is used as a predetermined program. It may be constructed to realize the corresponding function by executing.

画像多重部１１は、入力された左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重（合成）し、多重画像を符号化部１２に出力するものである。ここで、一対の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲは同時刻の視点（撮影を行った位置）が異なる立体視用の画像である。 The image multiplexing unit 11 multiplexes (combines) the input left-eye image PICL and right-eye image PICR, and outputs the multiplexed image to the encoding unit 12. Here, the pair of left-eye images PICL and right-eye images PICR are images for stereoscopic viewing that have different viewpoints at the same time (positions at which shooting was performed).

左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲは、撮像装置が得たばかりのものであっても良く、記録媒体等から読み出したものであっても良く、他の装置から送信されて受信したものであっても良い。すなわち、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの入力方法は、問われないものである。 The left-eye image PICL and the right-eye image PICR may be those just obtained by the imaging device, may be read from a recording medium or the like, and are transmitted and received from other devices. Also good. That is, the input method of the left-eye image PICL and the right-eye image PICR is not limited.

また、一対の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲは、同じ種類の画像であれば、輝度信号（Ｙ）の画像であっても良く、いずれかの色差信号（Ｃｂ、Ｃｒ）の画像であっても良く、いずれかの色信号（Ｒ、Ｇ、Ｂ）の画像であっても良い。左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの大きさは限定されるものではない。例えば、縦横１０８０ライン×１９２０画素、縦横５４０ライン×９６０画素、縦横４８０ライン×７２０画素、縦横２８８ライン×３６０画素、縦横２４０ライン×３６０画素、縦横１４４ライン×１８０画素、縦横１２０ライン×１８０画素、縦横７２ライン×９０画素などのいずれの大きさであっても良い。また、１画素当たりのビット数も限定されるものではない。 Further, the pair of left-eye image PICL and right-eye image PICR may be images of luminance signals (Y) as long as they are the same type of image, and may be images of one of the color difference signals (Cb, Cr). It may be an image of any color signal (R, G, B). The sizes of the left-eye image PICL and the right-eye image PICR are not limited. For example, vertical and horizontal 1080 lines × 1920 pixels, vertical and horizontal 540 lines × 960 pixels, vertical and horizontal 480 lines × 720 pixels, vertical and horizontal 288 lines × 360 pixels, vertical and horizontal 240 lines × 360 pixels, vertical and horizontal 144 lines × 180 pixels, vertical and horizontal 120 lines × 180 pixels Any size such as vertical and horizontal 72 lines × 90 pixels may be used. Also, the number of bits per pixel is not limited.

画像多重部１１は、図２（Ａ１）及び（Ａ２）に示すように、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲをそれぞれ、縦方向の分割線に沿って、複数の縦縞画像部分ＳＬ−１〜ＳＬ−Ｎ、ＳＲ−１〜ＳＲ−Ｎ（Ｎは２以上の整数）に分割した後、図２（Ｂ）に示すように、左目用画像ＰＩＣＬの縦縞画像部分ＳＬ−１〜ＳＬ−Ｎと右目用画像ＰＩＣＲの縦縞画像部分ＳＲ−１〜ＳＲ−Ｎとを横方向に交互に配置した多重画像ＣＰＩＣを形成して出力する。 As shown in FIGS. 2A1 and 2A, the image multiplexing unit 11 converts the left-eye image PICL and the right-eye image PICR into a plurality of vertical stripe image portions SL-1 to SL, respectively, along a vertical dividing line. After dividing into SL-N, SR-1 to SR-N (N is an integer of 2 or more), as shown in FIG. 2B, the vertical stripe image portions SL-1 to SL-N of the left eye image PICL A multiple image CPIC in which the vertical stripe image portions SR-1 to SR-N of the right eye image PICR are alternately arranged in the horizontal direction is formed and output.

多重画像ＣＰＩＣは、左目用画像ＰＩＣＬの縦縞画像部分ＳＬ−１〜ＳＬ−Ｎと右目用画像ＰＩＣＲの縦縞画像部分ＳＲ−１〜ＳＲ−Ｎとを横方向に交互に配置している画像であるため、左右視点の対応する領域の画素同士の距離は短いものとなっている（左右視点の対応する領域の画素同士が近付いたものとなっている）。そのため、通信時のパケットロス等が生じた場合であっても、左右視点の対応する領域が共に欠落する可能性が高くなる。 The multiple image CPIC is an image in which the vertical stripe image portions SL-1 to SL-N of the left eye image PICL and the vertical stripe image portions SR-1 to SR-N of the right eye image PICR are alternately arranged in the horizontal direction. Therefore, the distance between the pixels in the corresponding region of the left and right viewpoints is short (the pixels in the corresponding region of the left and right viewpoints are close to each other). For this reason, even when packet loss or the like occurs during communication, there is a high possibility that the corresponding areas of the left and right viewpoints are lost.

左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを、縦方向の分割線に沿って、複数の縦縞画像部分ＳＬ−１〜ＳＬ−Ｎ、ＳＲ−１〜ＳＲ−Ｎに分割するようにしたのは、以下の考え方による。人間の２つの目は横方向（左右方向）に離間しており、その左右の目で立体感を感じるため、立体視画像においては横方向が縦方向より重みが大きく、横縞画像部分を縦方向に交互に配置した場合であれば、通信時のパケットロス等が生じた場合に生じる、左右視点の対応する欠落した領域が縦方向に存在するが、上記のように、縦縞画像部分を横方向に交互に配置した場合であれば、通信時のパケットロス等が生じた場合に生じる、左右視点の対応する欠落した領域が横方向に存在し、後述する効果が有効なものとなる。 The left-eye image PICL and the right-eye image PICR are divided into a plurality of vertical stripe image portions SL-1 to SL-N and SR-1 to SR-N along the vertical dividing line as follows. According to the idea. The two human eyes are separated in the horizontal direction (left and right direction), and the left and right eyes feel a three-dimensional effect. Therefore, in a stereoscopic image, the horizontal direction has a higher weight than the vertical direction, and the horizontal stripe image portion is If there is an alternating arrangement, the missing areas corresponding to the left and right viewpoints exist in the vertical direction, which occurs when packet loss or the like occurs during communication. In the case where they are alternately arranged, there is a corresponding missing region in the left and right viewpoints that occurs when packet loss or the like during communication occurs, and the effects described later are effective.

ここで、各縦縞画像部分ＳＬ−１〜ＳＬ−Ｎ、ＳＲ−１〜ＳＲ−Ｎの幅は限定されるものではない。例えば、縦縞画像部分ＳＬ−１〜ＳＬ−Ｎ、ＳＲ−１〜ＳＲ−Ｎの幅が１画素であっても良く、また、複数画素であっても良い。複数画像の場合、欠落領域の補間処理や符号化などで定まっているブロックの大きさを考慮して具体的な画素数を定めるようにしても良い。例えば、動きベクトルの探索で用いるブロックの一辺の半分の画素数を縦縞画像部分の幅に選定しても良い。後述する動作の項では、縦縞画像部分ＳＬ−１〜ＳＬ−Ｎ、ＳＲ−１〜ＳＲ−Ｎの幅が１画素であるとして説明する。 Here, the widths of the vertical stripe image portions SL-1 to SL-N and SR-1 to SR-N are not limited. For example, the vertical stripe image portions SL-1 to SL-N and SR-1 to SR-N may have a width of one pixel or a plurality of pixels. In the case of a plurality of images, the specific number of pixels may be determined in consideration of the block size determined by interpolation processing or encoding of the missing region. For example, half the number of pixels on one side of the block used for motion vector search may be selected as the width of the vertical stripe image portion. In the section of the operation described later, the description will be made assuming that the vertical stripe image portions SL-1 to SL-N and SR-1 to SR-N have a width of one pixel.

符号化部１２は、画像多重部１１から出力された、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの多重画像ＣＰＩＣを所定の符号化方式に従って符号化するものである。ここで、符号化方式は限定されないが、例えば、Ｈ．２６４／ＭＰＥＧ−４ＡＶＣのような符号化方式を適用できる。 The encoding unit 12 encodes the multiplexed image CPIC of the left-eye image PICL and the right-eye image PICR output from the image multiplexing unit 11 according to a predetermined encoding method. Here, the encoding method is not limited. An encoding scheme such as H.264 / MPEG-4 AVC can be applied.

送信部１３は、符号化部１２から出力された符号化データから、送信データ（例えばパケット）を形成して立体映像受信装置２０に向けて送信するものである。 The transmission unit 13 forms transmission data (for example, a packet) from the encoded data output from the encoding unit 12 and transmits the transmission data to the stereoscopic video reception device 20.

ここで、立体映像送信装置１０及び立体映像受信装置２０間の伝送路は、専用回線であっても良く、また、多くの人が利用できる通信網であっても良い。立体映像送信装置１０及び立体映像受信装置２０間の通信方式（例えば、暗号化の有無、通信速度、伝送路符号化方式、有線・無線通信など）は限定されるものではない。また、送信部１３は、例えば、輝度信号用の符号化データ、色差信号用の２つの符号化データから送信データを形成するものであっても良い。また、送信データに、複数フレームに係る符号化データを挿入するようにしても良い。 Here, the transmission path between the stereoscopic video transmitting apparatus 10 and the stereoscopic video receiving apparatus 20 may be a dedicated line or a communication network that can be used by many people. The communication method (for example, presence / absence of encryption, communication speed, transmission path coding method, wired / wireless communication, etc.) between the stereoscopic video transmission device 10 and the stereoscopic video reception device 20 is not limited. Further, the transmission unit 13 may form transmission data from, for example, encoded data for luminance signals and two encoded data for color difference signals. Further, encoded data related to a plurality of frames may be inserted into transmission data.

立体映像受信装置２０は、受信部２１、復号部２２及び画像多重分離部２３を有する。ここで、復号部２２及び画像多重分離部２３は、第１の実施形態の立体映像復号装置の構成要素である。少なくとも復号部２２及び画像多重分離部２３の部分は、ハードウェア的に各種回路を接続して構築されても良く、また、ＣＰＵ、ＲＯＭ、ＲＡＭなどを有する汎用的な装置若しくはユニットが所定のプログラムを実行することで該当する機能を実現するように構築されても良い。 The stereoscopic video reception device 20 includes a reception unit 21, a decoding unit 22, and an image demultiplexing unit 23. Here, the decoding unit 22 and the image demultiplexing unit 23 are components of the stereoscopic video decoding device according to the first embodiment. At least the decoding unit 22 and the image demultiplexing unit 23 may be constructed by connecting various circuits in hardware, and a general-purpose device or unit having a CPU, a ROM, a RAM, or the like may be a predetermined program. It may be constructed to realize the corresponding function by executing.

受信部２１は、立体映像送信装置１０からの送信データを受信処理して符号化データを取り出して復号部２２に与えるものである。 The receiving unit 21 receives transmission data from the stereoscopic video transmission device 10, extracts encoded data, and provides the encoded data to the decoding unit 22.

復号部２２は、符号化データに対して復号を行って、図２（Ｂ）に示す多重画像ＣＰＩＣを得て画像多重分離部２３に与えるものである。 The decoding unit 22 performs decoding on the encoded data, obtains a multiplexed image CPIC shown in FIG.

画像多重分離部２３は、多重画像ＣＰＩＣを縦縞部分ＳＬ−１、ＳＲ−１、…、ＳＬ−Ｎ、ＳＲ−Ｎに分解しつつ、奇数番目の縦縞部分ＳＬ−１、…、ＳＬ−Ｎを横方向に並置して左目用画像ＰＩＣＬを再生すると共に、偶数番目の縦縞部分ＳＲ−１、…、ＳＲ−Ｎを横方向に並置して右目用画像ＰＩＣＲを再生するものである。 The image demultiplexing unit 23 decomposes the multiplexed image CPIC into vertical stripe portions SL-1, SR-1,..., SL-N, SR-N, and converts the odd-numbered vertical stripe portions SL-1,. The left eye image PICL is reproduced by juxtaposing in the horizontal direction, and the right eye image PICR is reproduced by juxtaposing even-numbered vertical stripe portions SR-1,..., SR-N in the horizontal direction.

再生された左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲは、例えば、表示機器３０に与えられて立体視映像の表示に供される。 The reproduced left-eye image PICL and right-eye image PICR are provided to the display device 30 for display of stereoscopic video, for example.

（Ａ−２）第１の実施形態の動作
次に、第１の実施形態の立体映像符号化システムを含む立体映像通信システム１の動作を説明する。 (A-2) Operation of the First Embodiment Next, the operation of the stereoscopic video communication system 1 including the stereoscopic video encoding system of the first embodiment will be described.

初めに、立体映像送信装置１０の動作を、図３のフローチャートを参照しながら説明する。図３は、同期している１フレーム分の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重して送信する動作を示しており、この図３に示す動作が立体視の動画像を構成しているフレーム数分だけ繰り返される。 First, the operation of the stereoscopic video transmission apparatus 10 will be described with reference to the flowchart of FIG. FIG. 3 shows the operation of multiplexing and transmitting the left-eye image PICL and the right-eye image PICR for one frame in synchronization, and the operation shown in FIG. 3 constitutes a stereoscopic moving image. Repeated for the number of frames.

まず、画像多重部１１は、撮影カメラや記録ファイルといった映像ソースから、同期している一対の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲ（左右の視点画像）を取得する（ステップＳ１０１）。ここで、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの取得は、並行して行っても良く、順次行っても良い。 First, the image multiplexing unit 11 acquires a pair of synchronized left-eye image PICL and right-eye image PICR (left and right viewpoint images) from a video source such as a photographing camera or a recording file (step S101). Here, the acquisition of the left eye image PICL and the right eye image PICR may be performed in parallel or sequentially.

次に、画像多重部１１は、図２に示したように、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの縦方向の画素列（縦縞画像部分）ＳＬ−１〜ＳＬ−Ｎ、ＳＲ−１〜ＳＲ−Ｎが横方向に交互に並ぶような形式で左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重する（ステップＳ１０２）。すなわち、左目用画像ＰＩＣＬから得られた縦方向画素列ＳＬ−１〜ＳＬ−Ｎが多重画像ＣＰＩＣの奇数列に並び、右目用画像ＰＩＣＲから得られた縦方向画素列ＳＲ−１〜ＳＲ−Ｎが多重画像ＣＰＩＣの偶数列に並ぶように多重処理を行う。上述したように、多重画像ＣＰＩＣは、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの対応する領域同士が近い位置に存在するものとなっている。 Next, as illustrated in FIG. 2, the image multiplexing unit 11 includes vertical pixel columns (vertical stripe image portions) SL-1 to SL-N and SR-1 to SR of the left-eye image PICL and the right-eye image PICR. The left-eye image PICL and the right-eye image PICR are multiplexed in a format in which −N is alternately arranged in the horizontal direction (step S102). That is, the vertical pixel rows SL-1 to SL-N obtained from the left-eye image PICL are arranged in odd-numbered rows of the multiple image CPIC, and the vertical pixel rows SR-1 to SR-N obtained from the right-eye image PICR. Are multiplexed so that the images are arranged in even-numbered columns of the multiple image CPIC. As described above, the multiple image CPIC is such that the corresponding areas of the left-eye image PICL and the right-eye image PICR are close to each other.

画像多重部１１は、得られた多重画像ＣＰＩＣを符号化部１２に与える（ステップＳ１０３）。このとき、符号化部１２は、多重画像ＣＰＩＣを所定の符号化方式に従って符号化し（ステップＳ１０４）、得られた符号化データを送信部１３に与える（ステップＳ１０５）。 The image multiplexing unit 11 gives the obtained multiplexed image CPIC to the encoding unit 12 (step S103). At this time, the encoding unit 12 encodes the multiple image CPIC according to a predetermined encoding method (step S104), and gives the obtained encoded data to the transmission unit 13 (step S105).

送信部１３は、与えられた符号化データから送信データを組み立て（ステップＳ１０６）、その送信データを立体映像受信装置２０へ向けて送信する（ステップＳ１０７）。送信部１３は、例えば、ステップＳ１０６では、与えられた符号化データを伝送路へ送信できるサイズへ分割し、ＲＴＰ（Ｒｅａｌ−ｔｉｍｅＴｒａｎｓｐｏｒｔＰｒｏｔｏｃｏｌ）に従ってパケット化し、送信データとなるパケットを形成する。このような場合であれば、１フレーム分の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲから、複数のパケットが形成されて送信されることとなる。 The transmission unit 13 assembles transmission data from the given encoded data (step S106), and transmits the transmission data to the stereoscopic video reception device 20 (step S107). For example, in step S106, the transmission unit 13 divides the given encoded data into sizes that can be transmitted to the transmission path, packetizes them according to RTP (Real-time Transport Protocol), and forms packets that become transmission data. In such a case, a plurality of packets are formed and transmitted from the left-eye image PICL and the right-eye image PICR for one frame.

次に、立体映像受信装置２０の動作を、図４のフローチャートを参照しながら説明する。図４は、受信データに基づき、同期している１フレーム分の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重分離する動作を示しており、この図４に示す動作が立体視の動画像を構成しているフレーム数分だけ繰り返される。以下では、送信データがパケットであるとして説明する。 Next, the operation of the stereoscopic video receiving device 20 will be described with reference to the flowchart of FIG. FIG. 4 shows the operation of demultiplexing the left-eye image PICL and the right-eye image PICR for one frame in synchronization based on the received data, and the operation shown in FIG. 4 forms a stereoscopic moving image. It is repeated as many times as the number of frames. In the following description, it is assumed that the transmission data is a packet.

受信部２１は、送信側より送られたパケットを受信し（ステップＳ２０１）、
受信したパケットから、ヘッダ情報などを取り除いて符号化データを得て（ステップＳ２０２）、得られた符号化データを復号部２２に与える（ステップＳ２０３）。 The receiving unit 21 receives the packet sent from the transmitting side (step S201),
Header data and the like are removed from the received packet to obtain encoded data (step S202), and the obtained encoded data is provided to the decoding unit 22 (step S203).

復号部２２は、入力された符号化データを復号した後（ステップＳ２０４）、今回の復号情報を含めて１フレーム分の復号画像が得られたか否かを判別する（ステップＳ２０５）。１フレーム分の復号画像が得られなかった場合には、ステップＳ２０１へ戻り、１フレーム分の復号画像が得られた場合には、復号部２２は、得られた復号画像を画像多重分離部２３に与える（ステップＳ２０６）。ここで、送信側において、１フレーム分の符号化データを複数のパケットに振り分けて挿入している場合には、ステップＳ２０５からステップＳ２０１へ戻る処理が実行されることとなる。 After decoding the input encoded data (step S204), the decoding unit 22 determines whether a decoded image for one frame including the current decoding information is obtained (step S205). When the decoded image for one frame is not obtained, the process returns to step S201, and when the decoded image for one frame is obtained, the decoding unit 22 converts the obtained decoded image into the image demultiplexing unit 23. (Step S206). Here, when the encoded data for one frame is distributed and inserted into a plurality of packets on the transmission side, the process from step S205 to step S201 is executed.

画像多重分離部２３は、復号画像（復号された多重画像）から、左目用画像及び右目用画像を多重分離する（ステップＳ２０７）。この多重分離処理は、画像多重部１１で行った多重処理の逆の処理である（図２参照）。すなわち、復号された多重画像ＣＰＩＣの奇数列の縦方向画素列のみを集めて左目用画像ＰＩＣＬを得ると共に、多重画像ＣＰＩＣの偶数列の縦方向画素列のみを集めて右目用画像ＰＩＣＲを得る。 The image demultiplexing unit 23 demultiplexes the left-eye image and the right-eye image from the decoded image (decoded multiple image) (step S207). This demultiplexing process is the reverse of the multiplexing process performed by the image multiplexing unit 11 (see FIG. 2). That is, only the odd-numbered vertical pixel columns of the decoded multiple image CPIC are collected to obtain the left-eye image PICL, and only the even-numbered vertical pixel columns of the multiplexed image CPIC are collected to obtain the right-eye image PICR.

画像多重分離部２３は、得られた左目用画像及び右目用画像を表示機器３０に与え（ステップＳ２０８）、立体視表示させる。 The image demultiplexing unit 23 gives the obtained left-eye image and right-eye image to the display device 30 (step S208), and causes stereoscopic display.

（Ａ−３）第１の実施形態の効果
第１の実施形態においては、左目用画像及び右目用画像の縦１列を交互に並べることにより多重画像を形成したので、左目用画像及び右目用画像間の対応する領域同士が多重画像において近傍に位置する。 (A-3) Effects of the First Embodiment In the first embodiment, since the multiple images are formed by alternately arranging the vertical columns of the left-eye image and the right-eye image, the left-eye image and the right-eye image Corresponding regions between images are located in the vicinity of the multiple images.

そのため、この多重画像を符号化した符号化データ中でも近い位置にある。仮に、伝送途中に、パケットロスが発生したとしても、片方の視点映像のみ正しく復号され、もう片方が正しく復号できない、ということが起こりにくくなる。 Therefore, it is in a close position in the encoded data obtained by encoding this multiplexed image. Even if a packet loss occurs in the middle of transmission, it is less likely that only one viewpoint video is correctly decoded and the other cannot be decoded correctly.

また、ノイズの伝播に関しても、対応する領域が近傍に存在するため、片方の視点のある点が存在する領域（例えばマクロブロック）が、復号の際に、誤りのある復号画像を参照していたとしても、もう片方の視点の対応する点が同じ領域（マクロブロック）に存在していれば、同様に誤りを参照することになるため、対応点同士は同じようなノイズとなる。そのため、立体映像受信装置から出力された映像を立体表示した場合、対応する領域には、ノイズが乗っているものの、視点間の差が小さいため、視野闘争が起きて不快な立体映像となることを回避することができる。 Also, with respect to noise propagation, since a corresponding region exists in the vicinity, a region where a point with one viewpoint exists (for example, a macroblock) referred to a decoded image having an error at the time of decoding. However, if the corresponding point of the other viewpoint exists in the same region (macroblock), an error is referred to in the same manner, so that the corresponding points have the same noise. For this reason, when the video output from the stereoscopic video receiver is stereoscopically displayed, although the corresponding area is noisy, the difference between the viewpoints is small, resulting in an uncomfortable stereoscopic video due to a visual field conflict. Can be avoided.

（Ｂ）第２の実施形態
次に、本発明による立体映像符号化装置、立体映像復号装置及び立体映像符号化システム、並びに、立体映像符号化プログラム及び立体映像復号プログラムの第２の実施形態を、図面を参照しながら説明する。 (B) Second Embodiment Next, a stereoscopic video encoding device, a stereoscopic video decoding device, a stereoscopic video encoding system, and a stereoscopic video encoding program and a stereoscopic video decoding program according to a second embodiment of the present invention will be described. This will be described with reference to the drawings.

第２の実施形態の立体映像符号化システムは、立体映像通信システムに組み込まれたものである。 The stereoscopic video encoding system of the second embodiment is incorporated in a stereoscopic video communication system.

（Ｂ−１）第２の実施形態の構成
第１の実施形態の立体映像符号化方式では、符号化前の多重処理において、左目用画像及び右目用画像のそれぞれの所定幅（例えば１画素）の縦縞画像部分を横方向に交互に並べることにより、左目用画像及び右目用画像（左右視点）間の対応する領域同士が近傍となるようにしていた。 (B-1) Configuration of the Second Embodiment In the stereoscopic video encoding system of the first embodiment, the predetermined width (for example, one pixel) of each of the left-eye image and the right-eye image in the multiplex processing before encoding. By alternately arranging the vertical stripe image portions in the horizontal direction, corresponding regions between the left-eye image and the right-eye image (left and right viewpoints) are arranged in the vicinity.

しかしながら、写されている物体の撮影カメラからの距離によっては、第１の実施形態のような多重処理では、対応する領域同士が近傍に存在しないこともあり得る。例えば、撮影カメラから物体までの距離が近付くほど、左目用画像におけるその物品領域と右目用画像におけるその物品領域との横方向の距離が離れ、視差量が大きくなる。図５（Ａ１）及び（Ａ２）に示す左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲは、このような視差領域が大きい画像の対となっている。左目用画像ＰＩＣＬにおいて、二等辺三角形形状の物品は左端から距離ＬＬだけ離れた位置にあり、右目用画像ＰＩＣＲにおいて、二等辺三角形形状の物品は左端から距離ＬＲだけ離れた位置にあり、２つの距離ＬＬ及びＬＲの差ｎ＝ＬＬ−ＬＲはかなり大きい。以上のように、撮影カメラからの距離が近い物体が位置している、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲにおける領域に関しては、第１の実施形態の多重処理を施しても、多重画像ＣＰＩＣにおいて、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲにおける対応領域同士の距離が離れている。図５の例で言えば、差ｎの距離だけ離れている。 However, depending on the distance of the object being photographed from the camera, the corresponding regions may not exist in the vicinity in the multiplex processing as in the first embodiment. For example, as the distance from the photographing camera to the object gets closer, the lateral distance between the article region in the left-eye image and the article region in the right-eye image increases, and the amount of parallax increases. The left-eye image PICL and the right-eye image PICR shown in FIGS. 5A1 and 5A2 are a pair of images having such a large parallax region. In the left-eye image PICL, the isosceles triangle-shaped article is at a position away from the left end by a distance LL, and in the right-eye image PICR, the isosceles triangle-shaped article is at a position away from the left end by a distance LR. The difference n = LL−LR between the distances LL and LR is quite large. As described above, regarding the regions in the left-eye image PICL and the right-eye image PICR where an object that is close to the photographing camera is located, even if the multiple processing of the first embodiment is performed, in the multiple image CPIC The distances between the corresponding regions in the left-eye image PICL and the right-eye image PICR are increased. In the example of FIG. 5, it is separated by a distance n.

そのため、パケットロスによるノイズの影響が、各視点（左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲ）で異なるということが起こり得る。また、左右視点（左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲ）の縦縞画像部分を横方向に交互に並べても、撮影カメラからの距離が近い場合には、対応しない領域の縦縞画像部分が交互に並んで相関の低い領域となるため、符号化効率が悪くなってしまう。 Therefore, it is possible that the influence of noise due to packet loss is different at each viewpoint (left-eye image PICL and right-eye image PICR). Even if the vertical stripe image portions of the left and right viewpoints (left eye image PICL and right eye image PICR) are alternately arranged in the horizontal direction, if the distance from the photographing camera is short, the vertical stripe image portions of the non-corresponding region are alternately arranged. Therefore, the coding efficiency is deteriorated.

第２の実施形態〜後述する第４の実施形態の立体映像符号化システムは、以上の点に鑑み、撮影対象の物品の撮影カメラからの距離が近い場合でも、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲにおける対応領域同士が多重画像ＣＰＩＣで隣接するように多重しようとしたものである。 In view of the above points, the stereoscopic video encoding system according to the second embodiment to the fourth embodiment described later takes the image PICL for the left eye and the image for the right eye even when the article to be photographed is close to the photographing camera. This is an attempt to multiplex so that corresponding areas in the PICR are adjacent to each other in the multiplex image CPIC.

図６は、第２の実施形態の立体映像符号化システムを含む立体映像通信システムの構成を示すブロック図であり、第１の実施形態に係る図１との同一、対応部分には同一、対応符号を付して示している。 FIG. 6 is a block diagram illustrating a configuration of a stereoscopic video communication system including the stereoscopic video encoding system according to the second embodiment. The configuration is the same as that of FIG. 1 according to the first embodiment, and the corresponding parts are the same. A reference numeral is attached.

図６において、立体映像通信システム１Ａは、立体映像送信装置１０Ａ及び立体映像受信装置２０Ａを有する。 In FIG. 6, the stereoscopic video communication system 1A includes a stereoscopic video transmission device 10A and a stereoscopic video reception device 20A.

立体映像送信装置１０Ａは、画像多重部１１Ａ、符号化部１２、送信部１３Ａ及び多重設定情報部１４を有する。符号化部１２は、第１の実施形態のものと同様である。ここで、画像多重部１１Ａ、符号化部１２及び多重設定情報部１４が、第２の実施形態の立体映像符号化装置の構成要素である。 The stereoscopic video transmission apparatus 10A includes an image multiplexing unit 11A, an encoding unit 12, a transmission unit 13A, and a multiplexing setting information unit 14. The encoding unit 12 is the same as that of the first embodiment. Here, the image multiplexing unit 11A, the encoding unit 12, and the multiplexing setting information unit 14 are components of the stereoscopic video encoding device of the second embodiment.

第２の実施形態で新たに設けられた多重設定情報部１４は、ユーザの指定などの形で指定された視差パラメータを保持するものである。視差パラメータは、例えば、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲにおける対応領域同士の横方向の位置ずれ量ｎである（図５参照）。視差パラメータの設定は、フレーム毎に行うものであっても良く、動画像の同一シーン毎に設定するものであっても良く、動画像のフレーム全体に共通するように設定するものであっても良い。後述する動作説明では、視差パラメータの設定はフレーム毎に行うものとして説明する。 The multiple setting information unit 14 newly provided in the second embodiment holds a parallax parameter designated in the form of user designation or the like. The parallax parameter is, for example, a lateral displacement amount n between corresponding regions in the left-eye image PICL and the right-eye image PICR (see FIG. 5). The parallax parameter may be set for each frame, may be set for each same scene of the moving image, or may be set to be common to the entire frame of the moving image. good. In the description of the operation described later, it is assumed that the parallax parameter is set for each frame.

第２の実施形態に係る画像多重部１１Ａは、多重設定情報部１４に保持されている視差パラメータを参照し、視差パラメータに応じて、入力された左目用画像ＰＩＣＬに対して、横方向に、非交互多重領域ＡＲＬ１と交互多重領域ＡＲＬ２とに区分すると共に、入力された右目用画像ＰＩＣＲに対して、横方向に、交互多重領域ＡＲＲ２と非交互多重領域ＡＲＲ１とに区分し、さらに、図５（Ａ１）及び（Ａ２）に示すように、縦方向の分割線に沿って、複数の縦縞画像部分ＡＲＬ２−１〜ＡＲＬ２−Ｍ、ＡＲＲ２−１〜ＡＲＲ２−Ｍ（Ｍは２以上の整数）に分割し、その後、図５（Ｂ）に示すように、左目用画像ＰＩＣＬの非交互多重領域ＡＲＬ１、左目用画像ＰＩＣＬの縦縞画像部分ＡＲＬ２−１〜ＡＲＬ２−Ｍと右目用画像ＰＩＣＲの縦縞画像部分ＡＲＲ２−１〜ＡＲＲ２−Ｍとの横方向の交互配置領域、右目用画像ＰＩＣＲの非交互多重領域ＡＲＲ１とを左側から配置することにより、多重画像ＣＰＩＣＡを形成して出力する。 The image multiplexing unit 11A according to the second embodiment refers to the parallax parameter held in the multiplexing setting information unit 14, and in accordance with the parallax parameter, in the lateral direction with respect to the input left-eye image PICL. The non-alternating multi-region ARL1 and the alternating multi-region ARL2 are divided, and the input right-eye image PICR is divided into the alternating multi-region ARR2 and the non-alternating multi-region ARR1 in the horizontal direction, and further, FIG. As shown in (A1) and (A2), a plurality of vertical stripe image portions ARL2-1 to ARL2-M and ARR2-1 to ARR2-M (M is an integer of 2 or more) along the vertical dividing line. After that, as shown in FIG. 5B, the non-alternate multiple region ARL1 of the left eye image PICL, the vertical stripe image portions ARL2-1 to ARL2-M of the left eye image PICL, and the right eye image PICR. By arranging transverse interleaved region of the fringe image portion ARR2-1~ARR2-M, and a non-alternating multiple regions ARR1 of the right eye image PICR from the left, and outputs to form a multi-image CPICA.

なお、左目用画像ＰＩＣＬの非交互多重領域ＡＲＬ１は、左目用画像ＰＩＣＬでは写っているが、右目用画像ＰＩＣＲでは写っていない領域に相当し、右目用画像ＰＩＣＲの非交互多重領域ＡＲＲ１は、右目用画像ＰＩＣＲでは写っているが、左目用画像ＰＩＣＬでは写っていない領域に相当し、左目用画像ＰＩＣＬの交互多重領域ＡＲＬ２及び右目用画像ＰＩＣＲの交互多重領域ＡＲＲ２は、左目用画像ＰＩＣＬでも右目用画像ＰＩＣＲでも写っている領域に相当する。 Note that the non-alternate multiple region ARL1 of the left-eye image PICL corresponds to a region that is captured in the left-eye image PICL but not in the right-eye image PICR, and the non-alternate multiple region ARR1 of the right-eye image PICR is the right-eye image. It corresponds to a region that is captured in the image PICR for the image but is not captured in the image PICL for the left eye. This corresponds to a region that is also shown in the image PICR.

第２の実施形態に係る送信部１３Ａは、符号化部１２から出力された符号化データと多重設定情報部１４で保持されている視差パラメータとを多重した後、送信データ（例えばパケット）を形成して立体映像受信装置２０Ａに向けて送信するものである。上記では、視差パラメータの情報は全てのフレームの送信データに多重するように説明したが、設定された視差パラメータの有効期間によっては、他の方法によって、視差パラメータの情報を立体映像受信装置２０Ａに与えるようにしても良い。例えば、一連の動画像データの最初のフレームの符号化データのみに視差パラメータを多重して立体映像受信装置２０Ａに与えるようにしても良い。また例えば、立体映像送信装置１０Ａ及び立体映像受信装置２０Ａ間で、画像データを授受し合う動作の開始前に行うネゴシエーション期間で視差パラメータを立体映像送信装置１０Ａから立体映像受信装置２０Ａへ与えるようにしても良い。 13 A of transmission parts which concern on 2nd Embodiment form transmission data (for example, packet), after multiplexing the encoding data output from the encoding part 12, and the parallax parameter currently hold | maintained at the multiplexing setting information part 14 Then, it is transmitted toward the stereoscopic video receiving device 20A. In the above description, the parallax parameter information is described as multiplexed on the transmission data of all frames. However, depending on the set validity period of the parallax parameter, the parallax parameter information may be transferred to the stereoscopic video reception device 20A by another method. You may make it give. For example, the parallax parameter may be multiplexed only on the encoded data of the first frame of a series of moving image data and provided to the stereoscopic video reception device 20A. Further, for example, a parallax parameter is given from the stereoscopic video transmitting apparatus 10A to the stereoscopic video receiving apparatus 20A in a negotiation period performed before the start of the operation of exchanging image data between the stereoscopic video transmitting apparatus 10A and the stereoscopic video receiving apparatus 20A. May be.

立体映像受信装置２０Ａは、受信部２１Ａ、復号部２２、画像多重分離部２３Ａ及び多重分離設定情報部２４を有する。復号部２２は、第１の実施形態のものと同様である。ここで、復号部２２、画像多重分離部２３Ａ及び多重分離設定情報部２４が、第２の実施形態の立体映像復号装置の構成要素である。 The stereoscopic video reception device 20A includes a reception unit 21A, a decoding unit 22, an image demultiplexing unit 23A, and a demultiplexing setting information unit 24. The decoding unit 22 is the same as that of the first embodiment. Here, the decoding unit 22, the image demultiplexing unit 23A, and the demultiplexing setting information unit 24 are components of the stereoscopic video decoding device according to the second embodiment.

第２の実施形態に係る受信部２１Ａは、立体映像送信装置１０Ａが送信したデータを受信し、視差パラメータと符号化データとに多重分離し、視差パラメータを多重分離設定情報部２４に与え、符号化データを復号部２２に与えるものである。 The receiving unit 21A according to the second embodiment receives the data transmitted by the stereoscopic video transmitting apparatus 10A, demultiplexes the parallax parameter into encoded data, and gives the parallax parameter to the demultiplexing setting information unit 24. The converted data is given to the decoding unit 22.

第２の実施形態で新たに設けられた多重分離設定情報部２４は、受信部２１Ａから与えられた視差パラメータを保持するものである。 The demultiplexing setting information unit 24 newly provided in the second embodiment holds the parallax parameter given from the receiving unit 21A.

第２の実施形態に係る画像多重分離部２３Ａは、多重分離設定情報部２４に設定されている視差パラメータを参照し、図５を用いて説明した、上述した画像多重部１１Ａによる多重処理の逆処理（多重分離処理）を行い、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲ（すなわち、左右視点画像）を再生するものである。 The image demultiplexing unit 23A according to the second embodiment refers to the parallax parameter set in the demultiplexing setting information unit 24 and reverses the multiplexing process by the image multiplexing unit 11A described above with reference to FIG. Processing (demultiplexing processing) is performed to reproduce the left-eye image PICL and the right-eye image PICR (that is, the left and right viewpoint images).

（Ｂ−２）第２の実施形態の動作
次に、第２の実施形態の立体映像符号化システムを含む立体映像通信システム１Ａの動作を説明する。 (B-2) Operation of the Second Embodiment Next, the operation of the stereoscopic video communication system 1A including the stereoscopic video encoding system of the second embodiment will be described.

初めに、立体映像送信装置１０Ａの動作を、図７のフローチャートを参照しながら説明する。図７において、第１の実施形態に係る図３との同一ステップには同一符号を付して示している。図７は、同期している１フレーム分の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重して送信する動作を示しており、この図７に示す動作が立体視の動画像を構成しているフレーム数分だけ繰り返される。 First, the operation of the stereoscopic video transmission apparatus 10A will be described with reference to the flowchart of FIG. In FIG. 7, the same steps as those in FIG. 3 according to the first embodiment are denoted by the same reference numerals. FIG. 7 shows an operation of multiplexing and transmitting the synchronized left-eye image PICL and right-eye image PICR for one frame, and the operation shown in FIG. 7 constitutes a stereoscopic moving image. Repeated for the number of frames.

まず、多重設定情報部１４に保持されている視差パラメータの情報が、ユーザからの指示など、外部からの指定によって更新される（ステップＳ３０１）。 First, the parallax parameter information held in the multiple setting information unit 14 is updated by external designation such as an instruction from the user (step S301).

そして、画像多重部１１は、撮影カメラや記録ファイルといった映像ソースから、同期している一対の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲ（左右の視点画像）を取得し（ステップＳ１０１）、図５に示したように、視差パラメータに応じて、入力された左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲに対して、横方向に、非交互多重領域ＡＲＬ１及びＡＲＲ１と、交互多重領域ＡＲＬ２及びＡＲＲ２とに区分し、左目用画像ＰＩＣＬの非交互多重領域ＡＲＬ１、左目用画像ＰＩＣＬの縦縞画像部分ＡＲＬ２−１〜ＡＲＬ２−Ｍと右目用画像ＰＩＣＲの縦縞画像部分ＡＲＲ２−１〜ＡＲＲ２−Ｍとの横方向の交互配置領域、右目用画像ＰＩＣＲの非交互多重領域ＡＲＲ１とを左側から配置するように、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重する（ステップＳ３０２）。 Then, the image multiplexing unit 11 obtains a pair of synchronized left-eye image PICL and right-eye image PICR (left and right viewpoint images) from a video source such as a photographing camera or a recording file (step S101), and FIG. As shown, according to the parallax parameter, the input left-eye image PICL and right-eye image PICR are divided into non-alternating multiple regions ARL1 and ARR1 and alternating multiple regions ARL2 and ARR2 in the horizontal direction. , Non-alternate multiple regions ARL1 of the left eye image PICL, vertical stripe image portions ARL2-1 to ARL2-M of the left eye image PICL, and vertical stripe image portions ARR2-1 to ARR2-M of the right eye image PICR in the horizontal direction The left eye image PICL and the right eye image are arranged so that the region and the non-alternate multiple region ARR1 of the right eye image PICR are arranged from the left side. The ICR multiplexing (step S302).

画像多重部１１は、得られた多重画像ＣＰＩＣＡを符号化部１２に与え（ステップＳ１０３）、符号化部１２は、多重画像ＣＰＩＣＡを所定の符号化方式に従って符号化し（ステップＳ１０４）、得られた符号化データを送信部１３Ａに与える（ステップＳ１０５）。 The image multiplexing unit 11 gives the obtained multiple image CPICA to the encoding unit 12 (step S103), and the encoding unit 12 encodes the multiple image CPICA according to a predetermined encoding method (step S104), and obtained The encoded data is given to the transmission unit 13A (step S105).

送信部１３Ａは、多重設定情報部１４から視差パラメータを取得した後（ステップＳ３０３）、例えば、符号化部１２から出力された符号化データのヘッダ部分の末尾へ、視差パラメータの情報を追加することにより、符号化データ及び視差パラメータの多重化を行う（ステップＳ３０４）。符号化データのヘッダとしては、シーケンス全体の符号化情報（シーケンスヘッダ）、各フレームに関する符号化情報（フレームヘッダ）、動画像の符号化単位であるマクロブロックを何個か合わせた各スライスに関する符号化情報（スライスヘッダ）などがあり、いずれのヘッダの末尾に追加しても良いが、以下では、フレームヘッダの末尾へ視差パラメータを含めるとして説明する。なお、符号化データの先頭に固定長の符号で視差パラメータを追加する、というような他の形式で多重しても構わない。 After acquiring the disparity parameter from the multiplex setting information unit 14 (step S303), the transmission unit 13A adds disparity parameter information to the end of the header portion of the encoded data output from the encoding unit 12, for example. Thus, the encoded data and the parallax parameter are multiplexed (step S304). As the header of the encoded data, the encoding information (sequence header) of the entire sequence, the encoding information (frame header) related to each frame, and the code related to each slice obtained by combining several macroblocks that are encoding units of moving images In the following description, it is assumed that a disparity parameter is included at the end of the frame header, although it may be added to the end of any header. Note that the encoded data may be multiplexed in another format such as adding a parallax parameter with a fixed-length code at the beginning of the encoded data.

送信部１３Ａは、与えられた視差パラメータが追加された符号化データから送信データを組み立て（ステップＳ１０６）、その送信データを立体映像受信装置２０Ａへ向けて送信する（ステップＳ１０７）。 The transmission unit 13A assembles transmission data from the encoded data to which the given parallax parameter is added (step S106), and transmits the transmission data to the stereoscopic video reception device 20A (step S107).

次に、立体映像受信装置２０Ａの動作を、図８のフローチャートを参照しながら説明する。図８において、第１の実施形態に係る図４との同一ステップには同一符号を付して示している。図８は、受信データに基づき、同期している１フレーム分の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重分離する動作を示しており、この図８に示す動作が立体視の動画像を構成しているフレーム数分だけ繰り返される。以下では、送信データがパケットであるとして説明する。 Next, the operation of the stereoscopic video reception device 20A will be described with reference to the flowchart of FIG. In FIG. 8, the same steps as those in FIG. 4 according to the first embodiment are denoted by the same reference numerals. FIG. 8 shows the operation of demultiplexing the left-eye image PICL and the right-eye image PICR for one frame in synchronization based on the received data, and the operation shown in FIG. 8 forms a stereoscopic moving image. It is repeated as many times as the number of frames. In the following description, it is assumed that the transmission data is a packet.

受信部２１Ａは、送信側より送られたパケットを受信し（ステップＳ２０１）、受信したパケットから、視差パラメータと符号化データを多重分離し（ステップＳ４０１）、多重分離設定情報部２４に保持されている視差パラメータの情報を、受信した視差パラメータの値に更新すると共に（ステップＳ４０２）、得られた符号化データを復号部２２に与える（ステップＳ２０３）。 The receiving unit 21A receives the packet sent from the transmitting side (step S201), demultiplexes the parallax parameter and the encoded data from the received packet (step S401), and is held in the demultiplexing setting information unit 24. The information on the existing parallax parameter is updated to the value of the received parallax parameter (step S402), and the obtained encoded data is given to the decoding unit 22 (step S203).

復号部２２は、入力された符号化データを復号した後（ステップＳ２０４）、今回の復号情報を含めて１フレーム分の復号画像が得られたか否かを判別する（ステップＳ２０５）。１フレーム分の復号画像が得られなかった場合には、ステップＳ２０１へ戻り、１フレーム分の復号画像が得られた場合には、復号部２２は、得られた復号画像を画像多重分離部２３Ａに与える（ステップＳ２０６）。 After decoding the input encoded data (step S204), the decoding unit 22 determines whether a decoded image for one frame including the current decoding information is obtained (step S205). When the decoded image for one frame is not obtained, the process returns to step S201, and when the decoded image for one frame is obtained, the decoding unit 22 converts the obtained decoded image into the image demultiplexing unit 23A. (Step S206).

画像多重分離部２３Ａは、多重分離設定情報部２４に保持されている視差パラメータを参照し、復号画像（復号された多重画像）に対して、画像多重部１１Ａの多重処理に対応した多重分離処理を行って（図５参照）、左目用画像及び右目用画像を得る（ステップＳ４０３）。 The image demultiplexing unit 23A refers to the parallax parameters held in the demultiplexing setting information unit 24, and performs demultiplexing processing corresponding to the multiplexing processing of the image multiplexing unit 11A on the decoded image (decoded multiple image) (See FIG. 5) to obtain a left-eye image and a right-eye image (step S403).

画像多重分離部２３Ａは、得られた左目用画像及び右目用画像を表示機器３０に与え（ステップＳ２０８）、立体視表示させる。 The image demultiplexing unit 23A gives the obtained left-eye image and right-eye image to the display device 30 (step S208), and stereoscopically displays them.

（Ｂ−３）第２の実施形態の効果
第２の実施形態によれば、利用者が視差パラメータを指定することにより、指定された視差を持つ対応点が隣接するような、左目用画像及び右目用画像の多重処理を行うので、指定された視差を持つ領域が隣接するため、この領域に対しては、パケットロスによるノイズの視点間の差が生じない。そのため、視差パラメータとして、人が注目する領域の視差量を指定した場合、その注目領域に対する、視点間の差が生じないため、主観的な画質の低下を抑えることが可能となる。また、指定された視差を持つ領域の空間的な相関性が高くなるため、この部分の符号化効率が向上し、主観品質が向上することが期待できる。このように、任意の視差量を持つ領域に関して、左右差に視野闘争を抑制し、符号化効率を向上させる効果がある。 (B-3) Effects of the Second Embodiment According to the second embodiment, the left eye image and the corresponding points having the designated parallax are adjacent to each other when the user designates the parallax parameter. Since the right-eye image is multiplexed, an area having the designated parallax is adjacent to the area, so that no difference between noise viewpoints due to packet loss occurs in this area. For this reason, when the amount of parallax of a region that a person is interested in is specified as a parallax parameter, there is no difference between viewpoints with respect to the region of interest, so that it is possible to suppress a deterioration in subjective image quality. In addition, since the spatial correlation of the area having the designated parallax becomes high, it can be expected that the coding efficiency of this part is improved and the subjective quality is improved. As described above, there is an effect of suppressing the field-of-view struggle in the left-right difference and improving the coding efficiency with respect to the region having an arbitrary amount of parallax.

（Ｃ）第３の実施形態
次に、本発明による立体映像符号化装置、立体映像復号装置及び立体映像符号化システム、並びに、立体映像符号化プログラム及び立体映像復号プログラムの第３の実施形態を、図面を参照しながら説明する。 (C) Third Embodiment Next, a third embodiment of a stereoscopic video encoding device, a stereoscopic video decoding device, a stereoscopic video encoding system, a stereoscopic video encoding program, and a stereoscopic video decoding program according to the present invention will be described. This will be described with reference to the drawings.

第３の実施形態の立体映像符号化システムは、立体映像通信システムに組み込まれたものである。 The stereoscopic video encoding system of the third embodiment is incorporated in a stereoscopic video communication system.

上述した第２の実施形態では、視差パラメータを指定することにより、指定した視差をもつ領域の、パケットロスによるノイズの影響を低減するようにしていた。しかし、視差パラメータを指定する必要がある上、映像は時間と共に変化するため、適切な視差パラメータも変化する。この第３の実施形態では、入力画像から視差パラメータを推定する機能を追加している。 In the second embodiment described above, by specifying a parallax parameter, the influence of noise due to packet loss in an area having the designated parallax is reduced. However, since it is necessary to specify a parallax parameter and the video changes with time, an appropriate parallax parameter also changes. In the third embodiment, a function for estimating a parallax parameter from an input image is added.

図９は、第３の実施形態の立体映像符号化システムを含む立体映像通信システムの構成を示すブロック図であり、第２の実施形態に係る図１との同一、対応部分には同一、対応符号を付して示している。 FIG. 9 is a block diagram illustrating a configuration of a stereoscopic video communication system including the stereoscopic video encoding system according to the third embodiment. The configuration is the same as that of FIG. 1 according to the second embodiment, and the corresponding parts are the same. A reference numeral is attached.

図９において、立体映像通信システム１Ｂは、立体映像送信装置１０Ｂ及び立体映像受信装置２０Ａを有する。立体映像受信装置２０Ａは、第２の実施形態のものと同一であり、その機能説明は省略する。 In FIG. 9, the stereoscopic video communication system 1B includes a stereoscopic video transmission device 10B and a stereoscopic video reception device 20A. The stereoscopic video receiving device 20A is the same as that of the second embodiment, and the functional description thereof is omitted.

立体映像送信装置１０Ｂは、画像多重部１１Ａ、符号化部１２、送信部１３Ａ及び多重設定情報部１４に加え、視差パラメータ計算部１５を有する。画像多重部１１Ａ、符号化部１２、送信部１３Ａ及び多重設定情報部１４は、第２の実施形態のものと同一であり、その機能説明は省略する。 The stereoscopic video transmission device 10B includes a parallax parameter calculation unit 15 in addition to the image multiplexing unit 11A, the encoding unit 12, the transmission unit 13A, and the multiple setting information unit 14. The image multiplexing unit 11A, the encoding unit 12, the transmission unit 13A, and the multiplex setting information unit 14 are the same as those in the second embodiment, and description of their functions is omitted.

視差パラメータ計算部１５は、入力された左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲに基づいて、視差パラメータを計算して多重設定情報部１４に与えるものである。視差パラメータ計算部１５は、例えば、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの対応する点をブロックマッチング法などの方法で推定する。左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲで対応する点の画素位置を比較することで、各点の視差量を推定する。そして、得られた複数の視差量から、例えば、以下のようにして多重設定情報部１４に与える視差量（視差パラメータ）を決定する。第１に、最も大きい視差量を、多重設定情報部１４に与える視差量（視差パラメータ）に決定する。第２に、最大頻度の視差量を、多重設定情報部１４に与える視差量（視差パラメータ）に決定する。 The parallax parameter calculation unit 15, based on the input left-eye image PICL and the right-eye image PICR, is shall give the multiplexing setting information unit 14 calculates the parallax parameters. For example, the parallax parameter calculation unit 15 estimates corresponding points of the left-eye image PICL and the right-eye image PICR by a method such as a block matching method. The parallax amount of each point is estimated by comparing pixel positions of corresponding points in the left-eye image PICL and the right-eye image PICR. Then, from the plurality of obtained parallax amounts, for example, the parallax amount (parallax parameter) to be given to the multiple setting information unit 14 is determined as follows. First, the largest amount of parallax is determined as the amount of parallax (parallax parameter) to be given to the multiple setting information unit 14. Second, the parallax amount with the highest frequency is determined as the parallax amount (parallax parameter) to be given to the multiple setting information unit 14.

以上では、入力された左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲから、多重設定情報部１４に与える視差量（視差パラメータ）を決定する場合を説明したが、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲに加えて、他の情報（以下、補助情報と呼ぶ）をも、視差量（視差パラメータ）の決定に利用するようにしても良い。補助情報として、奥行き情報を利用できる。例えば、撮像カメラの焦点距離や、ズーム時の倍率などを、奥行情報として取込んで用いることができる。左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲに加えて、奥行き情報を利用することにより、視差量の探索処理を迅速化できる。例えば、視差パラメータの第１の決定方法に係る最も視差量の大きい点というのは、奥行きが最も短い点ということであるので、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの一方の点に対応する他方の画像の点は、奥行き情報で定まる視差量、若しくは、その近傍の値だけずれて存在する。従って、探索範囲を、奥行き情報に応じて定まる狭い範囲に限定し、視差量を計算するようにしても良い。 In the above, the case where the parallax amount (parallax parameter) to be given to the multiple setting information unit 14 is determined from the input left-eye image PICL and right-eye image PICR has been described. In addition to the left-eye image PICL and the right-eye image PICR Other information (hereinafter referred to as auxiliary information) may also be used for determining the amount of parallax (parallax parameter). Depth information can be used as auxiliary information. For example, the focal length of the imaging camera, the magnification at the time of zooming, and the like can be captured and used as the depth information. By using depth information in addition to the left-eye image PICL and the right-eye image PICR, the parallax amount search process can be speeded up. For example, the point with the largest amount of parallax according to the first method for determining the parallax parameter is the point with the shortest depth, so the other corresponding to one point of the left-eye image PICL and the right-eye image PICR. These image points are shifted by the amount of parallax determined by the depth information or a value in the vicinity thereof. Accordingly, the search range may be limited to a narrow range determined according to the depth information, and the parallax amount may be calculated.

第３の実施形態によれば、フレーム毎に視差パラメータを入力画像から推定するようにしたので、時間の変化に伴い、奥行きの変化が起こる立体映像に対しても、視差パラメータを動的に設定することができる。 According to the third embodiment, since the parallax parameter is estimated from the input image for each frame, the parallax parameter is dynamically set even for a stereoscopic image in which a change in depth occurs with a change in time. can do.

ここで、設定する視差パラメータを左右視点画像内の最大視差とした場合には、最も拒離の近い領域に対して、ノイズの影響を低減し、符号化効率を向上させる効果がある。また、このような視差量の大きい領域は、手前に写る領域となるため、この領域の符号化効率の向上と、視野闘争の抑制により、主観画質の向上が期待できる。また、設定する視差パラメータを左右視点画像内の視差量で最も多いものとした場合には、左右視点画像間の全体的な相関を高めることとなり、符号化効率が向上する。 Here, when the parallax parameter to be set is the maximum parallax in the left and right viewpoint images, there is an effect of reducing the influence of noise and improving the coding efficiency for the region closest to rejection. Moreover, since such a region with a large amount of parallax is a region that appears in the foreground, an improvement in subjective image quality can be expected by improving the coding efficiency of this region and suppressing the binocular rivalry. Further, when the parallax parameter to be set is the largest in the amount of parallax in the left and right viewpoint images, the overall correlation between the left and right viewpoint images is increased, and the coding efficiency is improved.

（Ｄ）第４の実施形態
次に、本発明による立体映像符号化装置、立体映像復号装置及び立体映像符号化システム、並びに、立体映像符号化プログラム及び立体映像復号プログラムの第４の実施形態を、図面を参照しながら説明する。 (D) Fourth Embodiment Next, a fourth embodiment of a stereoscopic video encoding device, a stereoscopic video decoding device, a stereoscopic video encoding system, a stereoscopic video encoding program, and a stereoscopic video decoding program according to the present invention will be described. This will be described with reference to the drawings.

第４の実施形態の立体映像符号化システムは、立体映像通信システムに組み込まれたものである。 The stereoscopic video encoding system of the fourth embodiment is incorporated in a stereoscopic video communication system.

（Ｄ−１）第４の実施形態の構成
第１〜第３の実施形態では、画像全体に対して全て同一の多重処理を行っていた。しかし、左右視点画像内の視差量は領域によって異なる。例えば、画像の上部には、視差量が小さい領域、すなわち、背景となるような奥行きが深い領域が広く写っており、画像の下部には、視差量が大きい領域、すなわち、手前に存在する物体の領域が広く写っている、ということがある。 (D-1) Configuration of Fourth Embodiment In the first to third embodiments, the same multiple processing is performed on the entire image. However, the amount of parallax in the left and right viewpoint images varies depending on the region. For example, a region with a small amount of parallax, that is, a region with a deep depth as a background is widely shown in the upper part of the image, and a region with a large amount of parallax, that is, an object existing in the foreground at the lower part of the image. There are times when the area of is widely reflected.

第４の実施形態の立体映像符号化システムは、このような点に鑑み、横方向のライン毎（１ライン毎に限定されず、複数ライン毎であっても良い）に視差パラメータを設定して多重処理を行う。以下では、１ライン毎に視差パラメータを設定する場合を説明する。 In view of such a point, the stereoscopic video encoding system according to the fourth embodiment sets a parallax parameter for each line in the horizontal direction (not limited to one line but may be a plurality of lines). Perform multiple processing. Hereinafter, a case where the parallax parameter is set for each line will be described.

図１０は、第４の実施形態の立体映像符号化システムを含む立体映像通信システムの構成を示すブロック図であり、既述した図面との同一、対応部分には同一、対応符号を付して示している。 FIG. 10 is a block diagram illustrating a configuration of a stereoscopic video communication system including the stereoscopic video encoding system according to the fourth embodiment. The same reference numerals are assigned to the same or corresponding parts as those in the above-described drawings. Show.

図１０において、立体映像通信システム１Ｃは、立体映像送信装置１０Ｃ及び立体映像受信装置２０Ｃを有する。 In FIG. 10, the stereoscopic video communication system 1C includes a stereoscopic video transmission device 10C and a stereoscopic video reception device 20C.

立体映像送信装置１０Ｃは、画像多重部１１Ｃ、符号化部１２、送信部１３Ｃ、多重設定情報部１４Ｃ及び視差パラメータ計算部１５Ｃを有する。符号化部１２は、第１の実施形態のものと同様である。ここで、画像多重部１１Ｃ、符号化部１２、多重設定情報部１４Ｃ及び視差パラメータ計算部１５Ｃが、第４の実施形態の立体映像符号化装置の構成要素である。 The stereoscopic video transmission apparatus 10C includes an image multiplexing unit 11C, an encoding unit 12, a transmission unit 13C, a multiple setting information unit 14C, and a parallax parameter calculation unit 15C. The encoding unit 12 is the same as that of the first embodiment. Here, the image multiplexing unit 11C, the encoding unit 12, the multiplexing setting information unit 14C, and the parallax parameter calculation unit 15C are constituent elements of the stereoscopic video encoding apparatus according to the fourth embodiment.

第４の実施形態における視差パラメータ計算部１５Ｃは、入力された左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲに基づいて（さらに、第３の実施形態で言及したように、補助情報を利用しても良い）、ライン毎の視差パラメータを計算し、ライン毎の視差パラメータを多重設定情報部１４Ｃに与えるものである。ライン毎の視差パラメータの計算方法も、第３の実施形態で説明した画像全体の視差パラメータの計算方法とほぼ同様である。すなわち、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの対応する点をブロックマッチング法などの方法で推定し、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲで対応する点の画素位置を比較することで、各点の視差量を推定する。そして、１ラインについて得られた複数の視差量から、例えば、最も大きい視差量を、多重設定情報部１４に与えるライン毎の視差量（視差パラメータ）に決定したり、最大頻度の視差量を、多重設定情報部１４に与えるライン毎の視差量（視差パラメータ）に決定したりする。 The parallax parameter calculation unit 15C in the fourth embodiment may use auxiliary information based on the input left-eye image PICL and right-eye image PICR (as described in the third embodiment). ), The parallax parameter for each line is calculated, and the parallax parameter for each line is given to the multiplex setting information unit 14C. The parallax parameter calculation method for each line is almost the same as the parallax parameter calculation method for the entire image described in the third embodiment. That is, the points corresponding to the left eye image PICL and the right eye image PICR are estimated by a method such as a block matching method, and the pixel positions of the corresponding points in the left eye image PICL and the right eye image PICR are compared. Is estimated. Then, from the plurality of parallax amounts obtained for one line, for example, the largest parallax amount is determined as the parallax amount (parallax parameter) for each line to be given to the multiplex setting information unit 14, The parallax amount (parallax parameter) for each line given to the multiplex setting information unit 14 is determined.

第４の実施形態における多重設定情報部１４Ｃは、視差パラメータ計算部１５Ｃから与えられたライン毎の視差パラメータを保持する。 The multiplex setting information unit 14C in the fourth embodiment holds the parallax parameter for each line given from the parallax parameter calculation unit 15C.

第４の実施形態における画像多重部１１Ｃは、多重設定情報部１４Ｃに保持されているライン毎の視差パラメータに基づいて、ライン毎に多重方法を切り替えながら、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲをライン毎に多重し、多重画像ＣＰＩＣＣを符号化部１２に与えるものである。画像多重部１１Ｃは、あるラインが多重処理の対象ラインとなったときには、多重設定情報部１４に保持されているそのラインの視差パラメータを参照し、視差パラメータに応じて、入力された左目用画像ＰＩＣＬの処理対象ラインに対して、横方向に、非交互多重領域と交互多重領域とに区分すると共に、入力された右目用画像ＰＩＣＲの処理対象ラインに対して、横方向に、交互多重領域と非交互多重領域とに区分し、その後、左目用画像ＰＩＣＬの処理対象ラインの非交互多重領域、左目用画像ＰＩＣＬの処理対象ラインの交互多重領域の画素と右目用画像ＰＩＣＲの処理対象ラインの交互多重領域の画素とを交互に配置させた領域、右目用画像ＰＩＣＲの処理対象ラインの非交互多重領域とを左側から配置することにより、多重画像ＣＰＩＣＡにおける処理対象ラインを形成する。 The image multiplexing unit 11C according to the fourth embodiment displays the left-eye image PICL and the right-eye image PICR while switching the multiplexing method for each line based on the parallax parameter for each line held in the multiplexing setting information unit 14C. Multiplexing is performed for each line, and a multiplexed image CPICCC is given to the encoding unit 12. The image multiplexing unit 11C refers to the parallax parameter of the line held in the multiplexing setting information unit 14 when a certain line becomes the target line of the multiplexing process, and inputs the left-eye image according to the parallax parameter. The PICL processing target line is divided into a non-alternating multiple region and an alternating multiple region in the horizontal direction, and the alternating multiple region in the horizontal direction with respect to the processing target line of the input right eye image PICR. After dividing into non-alternating multiple regions, the non-alternating multiple regions of the processing target line of the left-eye image PICL, the alternating multi-region pixels of the processing target line of the left-eye image PICL, and the processing target line of the right-eye image PICR alternately A multiple image is obtained by arranging a region where pixels of a multiple region are alternately arranged and a non-alternate multiple region of the processing target line of the right-eye image PICR from the left side. Forming the processing object line in PICA.

第４の実施形態に係る送信部１３Ｃは、符号化部１２から出力された符号化データと多重設定情報部１４Ｃで保持されているライン毎の視差パラメータとを多重した後、送信データ（例えばパケット）を形成して立体映像受信装置２０Ｃに向けて送信するものである。 The transmission unit 13C according to the fourth embodiment multiplexes the encoded data output from the encoding unit 12 and the parallax parameters for each line held in the multiplexing setting information unit 14C, and then transmits transmission data (for example, a packet). ) And transmitted to the stereoscopic video receiving device 20C.

立体映像受信装置２０Ｃは、受信部２１Ｃ、復号部２２、画像多重分離部２３Ｃ及び多重分離設定情報部２４Ｃを有する。復号部２２は、第１の実施形態のものと同様である。ここで、復号部２２、画像多重分離部２３Ｃ及び多重分離設定情報部２４Ｃが、第４の実施形態の立体映像復号装置の構成要素である。 The stereoscopic video receiving device 20C includes a receiving unit 21C, a decoding unit 22, an image demultiplexing unit 23C, and a demultiplexing setting information unit 24C. The decoding unit 22 is the same as that of the first embodiment. Here, the decoding unit 22, the image demultiplexing unit 23C, and the demultiplexing setting information unit 24C are constituent elements of the stereoscopic video decoding apparatus according to the fourth embodiment.

第４の実施形態に係る受信部２１Ｃは、立体映像送信装置１０Ｃが送信したデータを受信し、ライン毎の視差パラメータと符号化データとに多重分離し、ライン毎の視差パラメータを多重分離設定情報部２４Ｃに与え、符号化データを復号部２２に与えるものである。 The receiving unit 21C according to the fourth embodiment receives the data transmitted by the stereoscopic video transmission device 10C, demultiplexes the parallax parameter for each line and the encoded data, and demultiplexes the parallax parameter for each line. This is given to the unit 24C, and the coded data is given to the decoding unit 22.

第４の実施形態における多重分離設定情報部２４Ｃは、受信部２１Ｃから与えられたライン毎の視差パラメータを保持するものである。 The demultiplexing setting information unit 24C according to the fourth embodiment holds a parallax parameter for each line given from the receiving unit 21C.

第４の実施形態に係る画像多重分離部２３Ｃは、多重分離設定情報部２４Ｃに設定されているライン毎の視差パラメータを参照し、上述した画像多重部１１Ｃによるライン毎の多重処理の逆処理（ライン毎の多重分離処理）を行い、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲ（すなわち、左右視点画像）を再生するものである。 The image demultiplexing unit 23C according to the fourth embodiment refers to the parallax parameter for each line set in the demultiplexing setting information unit 24C, and reverses the multiplexing process for each line by the image multiplexing unit 11C described above ( The left-eye image PICL and the right-eye image PICR (that is, the left and right viewpoint images) are reproduced.

（Ｄ−２）第４の実施形態の動作
次に、第４の実施形態の立体映像符号化システムを含む立体映像通信システム１Ｃの動作を説明する。 (D-2) Operation of the Fourth Embodiment Next, the operation of the stereoscopic video communication system 1C including the stereoscopic video encoding system of the fourth embodiment will be described.

初めに、立体映像送信装置１０Ｃの動作を、図１１のフローチャートを参照しながら説明する。図１１において、第２の実施形態に係る図７との同一ステップには同一符号を付して示している。図１１は、同期している１フレーム分の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重して送信する動作を示しており、この図１１に示す動作が立体視の動画像を構成しているフレーム数分だけ繰り返される。 First, the operation of the stereoscopic video transmission apparatus 10C will be described with reference to the flowchart of FIG. In FIG. 11, the same steps as those in FIG. 7 according to the second embodiment are denoted by the same reference numerals. FIG. 11 shows an operation of multiplexing and transmitting the synchronized left-eye image PICL and right-eye image PICR for one frame, and the operation shown in FIG. 11 forms a stereoscopic moving image. Repeated for the number of frames.

まず、入力された左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲに基づいて（補助情報を利用する場合には補助情報にも基づく）、視差パラメータ計算部１５Ｃが、ライン毎の視差パラメータを計算し、多重設定情報部１４Ｃに保持されているライン毎の視差パラメータの情報が、計算によって得られたライン毎の視差パラメータに更新される（ステップＳ５０１）。 First, based on the input left-eye image PICL and right-eye image PICR (based on auxiliary information when auxiliary information is used), the parallax parameter calculation unit 15C calculates a parallax parameter for each line and performs multiplexing. The parallax parameter information for each line held in the setting information unit 14C is updated to the parallax parameters for each line obtained by calculation (step S501).

そして、画像多重部１１Ｃは、一対の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲ（左右の視点画像）を取得し（ステップＳ１０１）、ライＮ毎の視差パラメータに応じて、入力された左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲに対して、上述したライン毎の多重処理を行い、左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲの多重画像ＣＰＩＣＣを形成する（ステップＳ５０２）。 Then, the image multiplexing unit 11C acquires a pair of left-eye images PICL and right-eye images PICR (left and right viewpoint images) (step S101), and the input left-eye image PICL according to the parallax parameter for each line N Then, the above-mentioned multiplexing processing for each line is performed on the image PICR for the right eye and the multiple image CPICCC of the image PICL for the left eye and the image PICR for the right eye is formed (step S502).

画像多重部１１Ｃは、得られた多重画像ＣＰＩＣＣを符号化部１２に与え（ステップＳ１０３）、符号化部１２は、多重画像ＣＰＩＣＣを所定の符号化方式に従って符号化し（ステップＳ１０４）、得られた符号化データを送信部１３Ｃに与える（ステップＳ１０５）。 The image multiplexing unit 11C gives the obtained multiple image CPICCC to the encoding unit 12 (step S103), and the encoding unit 12 encodes the multiple image CPICCC according to a predetermined encoding method (step S104) and obtains it. The encoded data is given to the transmission unit 13C (step S105).

送信部１３Ｃは、多重設定情報部１４Ｃからライン毎の視差パラメータを取得した後（ステップＳ５０３）、例えば、符号化部１２から出力された符号化データのヘッダ部分の末尾へ、ライン毎の視差パラメータの情報を追加することにより、符号化データ及びライン毎の視差パラメータの多重化を行う（ステップＳ５０４）。 After acquiring the disparity parameter for each line from the multiplex setting information unit 14C (step S503), the transmission unit 13C, for example, the disparity parameter for each line to the end of the header portion of the encoded data output from the encoding unit 12 Is added, the encoded data and the parallax parameter for each line are multiplexed (step S504).

送信部１３Ｃは、与えられたライン毎の視差パラメータが追加された符号化データから、送信データを組み立て（ステップＳ１０６）、その送信データを立体映像受信装置２０Ｃへ向けて送信する（ステップＳ１０７）。 The transmission unit 13C assembles transmission data from the encoded data to which the given parallax parameter for each line is added (step S106), and transmits the transmission data to the stereoscopic video reception device 20C (step S107).

次に、立体映像受信装置２０Ｃの動作を、図１２のフローチャートを参照しながら説明する。図１２において、第２の実施形態に係る図８との同一ステップには同一符号を付して示している。図１２は、受信データに基づき、同期している１フレーム分の左目用画像ＰＩＣＬ及び右目用画像ＰＩＣＲを多重分離する動作を示しており、この図１２に示す動作が立体視の動画像を構成しているフレーム数分だけ繰り返される。以下では、送信データがパケットであるとして説明する。 Next, the operation of the stereoscopic video reception device 20C will be described with reference to the flowchart of FIG. In FIG. 12, the same steps as those in FIG. 8 according to the second embodiment are denoted by the same reference numerals. FIG. 12 shows an operation of demultiplexing the left-eye image PICL and the right-eye image PICR for one frame in synchronization based on the received data, and the operation shown in FIG. 12 forms a stereoscopic moving image. It is repeated as many times as the number of frames. In the following description, it is assumed that the transmission data is a packet.

受信部２１Ｃは、送信側より送られたパケットを受信し（ステップＳ２０１）、受信したパケットから、ライン毎の視差パラメータと符号化データを多重分離し（ステップＳ６０１）、多重分離設定情報部２４Ｃに保持されているライン毎の視差パラメータの情報を、受信したライン毎の視差パラメータの値に更新すると共に（ステップＳ６０２）、得られた符号化データを復号部２２に与える（ステップＳ２０３）。 The receiving unit 21C receives the packet sent from the transmitting side (step S201), demultiplexes the parallax parameter and the encoded data for each line from the received packet (step S601), and sends the demultiplexing setting information unit 24C to the demultiplexing setting information unit 24C. The stored parallax parameter information for each line is updated to the received parallax parameter value for each line (step S602), and the obtained encoded data is provided to the decoding unit 22 (step S203).

復号部２２は、入力された符号化データを復号した後（ステップＳ２０４）、今回の復号情報を含めて１フレーム分の復号画像が得られたか否かを判別する（ステップＳ２０５）。１フレーム分の復号画像が得られなかった場合には、ステップＳ２０１へ戻り、１フレーム分の復号画像が得られた場合には、復号部２２は、得られた復号画像を画像多重分離部２３Ｃに与える（ステップＳ２０６）。 After decoding the input encoded data (step S204), the decoding unit 22 determines whether a decoded image for one frame including the current decoding information is obtained (step S205). When the decoded image for one frame is not obtained, the process returns to step S201, and when the decoded image for one frame is obtained, the decoding unit 22 converts the obtained decoded image into the image demultiplexing unit 23C. (Step S206).

画像多重分離部２３Ｃは、多重分離設定情報部２４Ｃに保持されているライン毎の視差パラメータを参照し、復号画像（復号された多重画像）に対して、画像多重部１１Ｃによるライン毎の多重処理に対応したライン毎の多重分離処理を行って、左目用画像及び右目用画像を得る（ステップＳ６０３）。 The image demultiplexing unit 23C refers to the parallax parameter for each line held in the demultiplexing setting information unit 24C, and performs the multiplexing process for each line by the image multiplexing unit 11C on the decoded image (decoded multiple image). The left-eye image and the right-eye image are obtained by performing the demultiplexing process for each line corresponding to (step S603).

画像多重分離部２３Ｃは、得られた左目用画像及び右目用画像を表示機器３０に与え（ステップＳ２０８）、立体視表示させる。 The image demultiplexing unit 23C gives the obtained left-eye image and right-eye image to the display device 30 (step S208), and causes stereoscopic display.

（Ｄ−３）第４の実施形態の効果
第４の実施形態によれば、ライン毎に視差パラメータを設定し、ライン毎の視差パラメータに応じてライン毎に多重処理を切り替えるようにしたので、様々な奥行きを持つ左右視点画像に対して、パケットロスによるノイズの影響による視野闘争の発生を抑えることができる。また、ライン毎に適切な視差パラメータを設定するため、ライン内での相関が高くなり、符号化効率の向上が期待できる。 (D-3) Effect of the fourth embodiment According to the fourth embodiment, the parallax parameter is set for each line, and the multiplexing process is switched for each line according to the parallax parameter for each line. It is possible to suppress the occurrence of a field-of-view struggle due to the influence of noise due to packet loss on left and right viewpoint images having various depths. Moreover, since an appropriate parallax parameter is set for each line, the correlation within the line is increased, and an improvement in coding efficiency can be expected.

（Ｅ）他の実施形態
上記各実施形態の説明においても、種々変形実施形態に言及したが、さらに、以下に例示するような変形実施形態を挙げることができる。 (E) Other Embodiments In the description of each of the above embodiments, various modified embodiments have been referred to, but further modified embodiments as exemplified below can be given.

上記各実施形態では、入力された左目用画像及び右目用画像をそのまま多重し、多重画像の横方向の長さが、左目用画像、右目用画像の横方向の長さの２倍になるものを示したが、入力された左目用画像及び右目用画像をそれぞれ、横方向に１／２に縮小した後、２つの縮小画像に対して、上記各実施形態で説明した多重処理を行って、左目用画像、右目用画像の横方向の長さと同じ多重画像を形成するようにしても良い。 In each of the above embodiments, the input left-eye image and right-eye image are multiplexed as they are, and the horizontal length of the multiplexed image is twice the horizontal length of the left-eye image and right-eye image. However, after the input left-eye image and right-eye image are each reduced by half in the horizontal direction, the multiple processing described in the above embodiments is performed on the two reduced images. Multiple images having the same horizontal length as the left-eye image and the right-eye image may be formed.

第４の実施形態では、１又は複数の画素ライン単位に、その単位毎の視差パラメータに応じた多重処理を行うものを示したが、マクロブロックの縦方向の幅（例えば、８画素×８画素のマクロブロックであれば、縦方向の８画素）を単位に視差パラメータを計算し、その幅を単位に多重処理を行うようにしても良い。 In the fourth embodiment, one or a plurality of pixel line units is subjected to multiple processing according to the parallax parameter for each unit. However, the vertical width of the macroblock (for example, 8 pixels × 8 pixels) In the case of a macroblock, the parallax parameter may be calculated in units of 8 pixels in the vertical direction), and multiple processing may be performed in units of the width.

上記各実施形態では、左右視点画像の多重画像の符号化データを、２つの装置間で送受信するものを示したが、左右視点画像の多重画像の符号化データを記録媒体に記録し、その記録媒体から再生した符号化データを復号する場合にも、本発明の技術思想を適用することができる。
In each of the above embodiments, the encoded data of the multiplexed image of the left and right viewpoint images is transmitted and received between the two devices . However, the encoded data of the multiplexed image of the left and right viewpoint images is recorded on the recording medium, and the recording is performed. The technical idea of the present invention can also be applied when decoding encoded data reproduced from a medium.

輝度信号の左右視点画像を多重化した後、符号化して輝度符号化データを得ると共に、色差信号の左右視点画像を多重化した後、符号化して色差符号化データを得る装置に、上記第３や第４の実施形態の技術思想を適用する場合であれば、例えば、輝度信号について得た視差パラメータやライン毎の視差パラメータを、色差信号の処理系でそのまま流用するようにしても良い。 After the left and right viewpoint images of the luminance signal are multiplexed, encoding is performed to obtain luminance encoded data, and the left and right viewpoint images of the color difference signal are multiplexed and then encoded to obtain the color difference encoded data. If the technical idea of the fourth embodiment is applied, for example, the parallax parameter obtained for the luminance signal and the parallax parameter for each line may be used as they are in the color difference signal processing system.

１、１Ａ、１Ｂ、１Ｃ…立体映像通信システム、
１０、１０Ａ、１０Ｂ、１０Ｃ…立体映像送信装置、
１１、１１Ａ、１１Ｃ…画像多重部、１２…符号化部、１３、１３Ａ、１３Ｃ…送信部、１４、１４Ｃ…多重設定情報部、１５、１５Ｃ…視差パラメータ計算部、
２０、２０Ａ、２０Ｃ…立体映像受信装置、
２１、２１Ａ、２１Ｃ…受信部、２２…復号部、２３、２３Ａ、２３Ｃ…画像多重分離部、２４、２４Ｃ…多重分離設定情報部。 1, 1A, 1B, 1C ... stereoscopic video communication system,
10, 10A, 10B, 10C ... stereoscopic video transmission device,
11, 11A, 11C ... image multiplexing unit, 12 ... encoding unit, 13, 13A, 13C ... transmission unit, 14, 14C ... multiple setting information unit, 15, 15C ... parallax parameter calculation unit,
20, 20A, 20C ... stereoscopic video receiver,
21, 21 </ b> A, 21 </ b> C... Receiving unit, 22... Decoding unit, 23, 23 </ b> A, 23 </ b> C image demultiplexing unit, 24, 24C.

Claims

Image multiplexing means for generating a multiple image by multiplexing a left-eye image and a right-eye image having the same number of vertical and horizontal pixels;
Encoding means for encoding the generated multiple images,
The image multiplexing means mixes the information of the predetermined range of the horizontal line of the left-eye image and the right-eye image having the same vertical direction alternately in the horizontal direction in a predetermined number of pixels, and the vertical direction is the same. Generate information on the horizontal lines of the multiple images ,
The predetermined range of the horizontal line is the same for all horizontal lines, and is determined according to the amount of parallax between the left-eye image and the right-eye image, and corresponds to the left-eye image and the right-eye image. A stereoscopic video encoding apparatus characterized by being in a range in which there exists .

The stereoscopic video encoding apparatus according to claim 1 , wherein information on a parallax amount of the left-eye image and the right-eye image is given from outside.

There row the search for high correlation region of the left eye image and the right-eye image, among the plurality of parallax amounts obtained by the search, the parallax amount of the maximum parallax amount or maximum frequency, the image and for the left eye The stereoscopic video encoding apparatus according to claim 1 , further comprising a parallax parameter calculating unit that calculates the parallax amount information of the right-eye image.

Image multiplexing means for generating a multiple image by multiplexing a left-eye image and a right-eye image having the same number of vertical and horizontal pixels;
Encoding means for encoding the generated multiple images,
The image multiplexing means mixes the information of the predetermined range of the horizontal line of the left-eye image and the right-eye image having the same vertical direction alternately in the horizontal direction in a predetermined number of pixels, and the vertical direction is the same. Generate information on the horizontal lines of the multiple images,
The predetermined range of the horizontal line is different for each horizontal line or for each set of horizontal lines consisting of several lines, and the left eye image and the right eye of the horizontal line or horizontal line set to be processed Is determined according to the amount of parallax of the image for use, and is a range where there are regions corresponding to the image for the left eye and the image for the right eye,
There row the search for high correlation region of the left eye image and the right-eye image, among the plurality of parallax amounts obtained by the search, the parallax amount of the maximum parallax amount or maximum frequency, the image and for the left eye A stereoscopic video encoding apparatus, comprising: a parallax parameter calculating unit that sets a parallax amount of the right-eye image .

A stereoscopic video decoding device to which encoded data of a multiplexed image formed by the stereoscopic video encoding device according to claim 1 is input and parallax information is input from the stereoscopic video encoding device according to claim 1 . ,
Decoding means for decoding the encoded data of the input multiple images;
The left eye is demultiplexed by performing reverse processing of the image multiplexing means in the stereoscopic video encoding device according to claim 1 by applying the input parallax information to the multiplexed image obtained by decoding. A stereoscopic video decoding apparatus comprising: an image demultiplexing unit that separates and extracts the image for use and the image for the right eye.

A stereoscopic video decoding device to which encoded data of a multiplex image formed by the stereoscopic video encoding device according to claim 4 is input and parallax amount information is input from the stereoscopic video encoding device according to claim 4. ,
Decoding means for decoding the encoded data of the input multiple images;
The left eye is demultiplexed by performing reverse processing of the image multiplexing means in the stereoscopic video encoding device according to claim 4 by applying the input parallax information to the multiplexed image obtained by decoding. And image demultiplexing means for separating and extracting the image for the right eye and the image for the right eye
A stereoscopic video decoding apparatus characterized by that.

A stereoscopic video encoding system comprising the stereoscopic video encoding device according to claim 1 and the stereoscopic video decoding device according to claim 5 .

A stereoscopic video encoding system comprising the stereoscopic video encoding device according to claim 4 and the stereoscopic video decoding device according to claim 6.

Computer
A left-eye image and a right-eye image having the same number of vertical and horizontal pixels are multiplexed to generate a multiplexed image, and information on a predetermined range of horizontal lines of the left-eye image and the right-eye image having the same vertical direction Are multiplexed alternately in the horizontal direction in units of a predetermined number of pixels, and image multiplexing means for generating information on the horizontal lines of the multiple images having the same vertical direction;
A stereoscopic video encoding program for functioning as an encoding means for encoding a generated multiple image ,
The predetermined range of the horizontal line is the same for all the horizontal lines, and is determined according to the amount of parallax of the left-eye image and the right-eye image. Regardless of the parallax, the left-eye image and the right-eye line A stereoscopic video encoding program characterized in that a region corresponding to an image is present .

Computer
A left-eye image and a right-eye image having the same number of vertical and horizontal pixels are multiplexed to generate a multiplexed image, and information on a predetermined range of horizontal lines of the left-eye image and the right-eye image having the same vertical direction Are multiplexed alternately in the horizontal direction in units of a predetermined number of pixels, and image multiplexing means for generating information on the horizontal lines of the multiple images having the same vertical direction;
Encoding means for encoding the generated multiple images;
The left eye image and the right eye image are searched for a region having a high correlation, and among the plurality of parallax amounts obtained by the search, the maximum parallax amount or the highest frequency parallax amount is determined as the left eye image and the above-described parallax amount. A stereoscopic video encoding program that functions as a parallax parameter calculation unit that sets a parallax amount of a right-eye image,
The predetermined range of the horizontal line is different for each horizontal line or for each set of horizontal lines consisting of several lines, and the left eye image and the right eye of the horizontal line or horizontal line set to be processed It is determined in accordance with the amount of parallax of the image for use, and is a range in which areas corresponding to the image for the left eye and the image for the right eye exist regardless of the parallax.
A stereoscopic video encoding program characterized by the above.

The encoded data of the multiplexed image formed by the stereoscopic video encoding device according to claim 1 is input and mounted on a device to which information on the amount of parallax is input from the stereoscopic video encoding device according to claim 1. Computer
Decoding means for decoding the encoded data of the input multiple images;
The left eye is demultiplexed by performing reverse processing of the image multiplexing means in the stereoscopic video encoding device according to claim 1 by applying the input parallax information to the multiplexed image obtained by decoding. A stereoscopic video decoding program that functions as an image demultiplexing unit that separates and extracts a for-image and a right-eye image.

The encoded data of the multiplexed image formed by the stereoscopic video encoding device according to claim 4 is input and mounted on a device to which information on the amount of parallax is input from the stereoscopic video encoding device according to claim 4. Computer
Decoding means for decoding the encoded data of the input multiple images;
The left eye is demultiplexed by performing reverse processing of the image multiplexing means in the stereoscopic video encoding device according to claim 4 by applying the input parallax information to the multiplexed image obtained by decoding. And image demultiplexing means for separating and extracting the image for the right eye and the image for the right eye
3D video decoding program characterized by being made to function.