JP2014212459A

JP2014212459A - Video output device, caption server device, video output method, and program

Info

Publication number: JP2014212459A
Application number: JP2013087975A
Authority: JP
Inventors: Masao Fukutome (福留 正男); Teruhiro Nakatani (彰宏中谷); Itaru Sato (佐藤 至); Yuzo Oshima (大嶋雄三)
Original assignee: Astem Corp
Current assignee: Astem Corp
Priority date: 2013-04-19
Filing date: 2013-04-19
Publication date: 2014-11-13
Anticipated expiration: 2033-04-19
Also published as: JP6154181B2

Abstract

PROBLEM TO BE SOLVED: To provide a video output device capable of outputting video information and character string information indicating a caption with their timing appropriately matched. SOLUTION: A video output device comprises: a video reception unit 101 that receives video information from a video server device 3; a caption reception unit 102 that receives, from a caption server device 4, one or more pieces of caption information, each being character string information indicating a caption, and caption timing information indicating the timing at which each piece of caption information is output relative to the output of the video information; and an output unit 113 that combines the one or more pieces of caption information received by the caption reception unit 102 with the video information received by the video reception unit 101, using the caption timing information associated with each piece of caption information, and outputs the result.

Description

The present invention relates to a video output device that combines captions with video and outputs the result, a caption server device that provides the captions used for the combination, and the like.

As prior art, a character display processing device is known that combines character data with an input video signal and outputs the result to a display device. The device comprises character size selection means for selecting a character size based on the display method of the display device, character data expansion means for converting the character data into a character video signal of the selected character size, and video composition means for combining the character video signal with the input video signal (see, for example, Patent Document 1).

Patent Document 1: JP 2001-69418 A (page 1, FIG. 1, etc.)

However, conventional video output devices cannot output video information and character string information indicating captions with their timing appropriately matched. As a result, for example, the video and the captions drift apart, and the content expressed by the video can no longer be properly understood.

The video output device of the present invention comprises: a video reception unit that receives video information from a video server device; a caption reception unit that receives, from a caption server device, one or more pieces of caption information, each being character string information indicating a caption, and caption timing information indicating the timing at which each piece of caption information is output relative to the output of the video information; and an output unit that combines the one or more pieces of caption information received by the caption reception unit with the video information received by the video reception unit, using the caption timing information associated with each piece of caption information, and outputs the result.

With this configuration, video information and character string information indicating captions can be output with their timing appropriately matched.

Further, in the video output device of the present invention, the output unit acquires, as the video information is output, information indicating the current output position of the video information, and outputs the caption information associated with the caption timing information that corresponds to that position information.

With this configuration, caption information corresponding to the output position of the video information can be output.

Further, in the video output device of the present invention, the video reception unit receives video information that has a plurality of frame images and in which video timing information indicating the timing at which each frame image is output is associated with each frame image; the caption timing information is information that specifies one or more pieces of video timing information; and the output unit, when outputting a frame image, outputs the caption information associated with the caption timing information that specifies the video timing information associated with that frame image.

With this configuration, caption information corresponding to the video timing information associated with each frame image can be output, so the caption information and the video information can be output with their timing matched more reliably. For example, even when output of the video information starts partway through, caption information matching the frame image being output can be output.
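As a non-limiting illustration of the frame-based matching described above, the following Python sketch looks up the caption for a given frame. It assumes, purely for illustration, that the video timing information is a frame serial number and that the caption timing information is an inclusive range of such numbers; the names `FrameCaption` and `caption_for_frame` are hypothetical and do not appear in the specification.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class FrameCaption:
    """Hypothetical caption record: text plus the frame range it covers."""
    start_frame: int  # first frame on which the caption is shown
    end_frame: int    # last frame on which the caption is shown (inclusive)
    text: str


def caption_for_frame(captions: List[FrameCaption], frame_no: int) -> Optional[str]:
    """Return the caption text for frame_no, or None if no caption applies.

    Assumes captions are sorted by start_frame and do not overlap.
    """
    starts = [c.start_frame for c in captions]
    # Index of the last caption whose start_frame is <= frame_no.
    i = bisect_right(starts, frame_no) - 1
    if i >= 0 and frame_no <= captions[i].end_frame:
        return captions[i].text
    return None


CAPTIONS = [
    FrameCaption(0, 59, "Good evening."),
    FrameCaption(90, 149, "Here is the news."),
]

# Playback that starts partway through (frame 100) still finds the right caption,
# because the lookup depends only on the frame's own timing information.
print(caption_for_frame(CAPTIONS, 100))
```

Because the lookup is keyed on the timing information carried by each frame rather than on elapsed playback time, it does not matter where output began.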

Further, the video output device of the present invention further comprises a video storage unit in which the video information received by the video reception unit is temporarily stored, and a caption storage unit in which the caption information received by the caption reception unit is temporarily stored; the video reception unit stores the received video information in the video storage unit; the caption reception unit stores the received caption information in the caption storage unit; and the output unit reads out and outputs the video information stored in the video storage unit and the caption information stored in the caption storage unit.

With this configuration, video information and caption information to be output next and later can be stored in advance. Therefore, even if transmission of video information or caption information is delayed, for example because of communication conditions, their output is not delayed.

Further, the video output device of the present invention further comprises: an output status acquisition unit that acquires information indicating the output status of at least one of the video information and the caption information output by the output unit; and a request transmission unit that transmits, to the caption server device, caption request information, which is information requesting transmission of caption information, in accordance with the information indicating the output status acquired by the output status acquisition unit; and the caption server device transmits caption information in response to the caption request information.

With this configuration, caption information can be acquired in accordance with the output status of the video information and caption information, so an unnecessary amount of caption information is not acquired and the amount of caption information acquired can be optimized.

Further, in the video output device of the present invention, the output status acquisition unit acquires, as the information indicating the output status, information indicating at least one of the storage status of video information in the video storage unit and the storage status of caption information in the caption storage unit.

With this configuration, caption information can be acquired in accordance with the storage status of video information and caption information, so an unnecessary amount of caption information is not stored and the amount of caption information stored for output can be optimized.
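By way of a hedged example, the storage-status-driven request could be sketched in Python as follows. The low-watermark policy, the function name `captions_to_request`, and its parameters are assumptions made for illustration only; the specification states only that the request depends on the output or storage status.

```python
def captions_to_request(buffered_count: int, max_count: int, low_watermark: int) -> int:
    """Return how many captions to request from the caption server.

    buffered_count: captions currently held in the caption storage unit.
    max_count:      maximum number of captions the storage unit may hold.
    low_watermark:  hypothetical threshold at or below which a refill is requested.
    Returns 0 when enough captions are still buffered.
    """
    if buffered_count > low_watermark:
        return 0  # enough captions buffered; do not fetch more than needed
    # Request only as many captions as fit, never a negative amount.
    return max(max_count - buffered_count, 0)


# With 2 of at most 10 captions buffered and a watermark of 3, request 8 more.
print(captions_to_request(buffered_count=2, max_count=10, low_watermark=3))
```

Requesting only up to the configured maximum keeps the caption storage unit from holding more captions than the output will need soon.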

Further, in the video output device of the present invention, the maximum amount of video information stored in the video storage unit and the maximum amount of caption information stored in the caption storage unit differ from each other.

With this configuration, for example, the amount of stored caption information, which takes little time to transmit and receive because its data size is small, can be reduced to save storage capacity.

Further, the video output device of the present invention further comprises a storage amount setting unit that sets the maximum amount of video information stored in the video storage unit and the maximum amount of caption information stored in the caption storage unit each to a value according to the processing capability of the device itself.

With this configuration, the maximum storage amounts of the video storage unit and the caption storage unit can be optimized according to the processing capability.
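One conceivable way to set the two maxima is a lookup table keyed on a capability measure, sketched below in Python. The use of free memory as the capability measure, the table `LIMITS`, and every number in it are hypothetical values chosen for illustration; the specification does not give concrete thresholds.

```python
from typing import Tuple

# Hypothetical table, ordered from most to least capable:
# (minimum free memory in MB, max buffered video seconds, max buffered captions).
# The video and caption maxima deliberately differ from each other.
LIMITS = [
    (2048, 60, 20),
    (512, 30, 10),
    (0, 10, 5),
]


def storage_limits(free_memory_mb: int) -> Tuple[int, int]:
    """Return (video maximum, caption maximum) for the given free memory."""
    if free_memory_mb < 0:
        raise ValueError("free_memory_mb must be non-negative")
    for min_mb, video_max, caption_max in LIMITS:
        if free_memory_mb >= min_mb:
            return video_max, caption_max
    # Unreachable because the last row has a minimum of 0.
    raise AssertionError("LIMITS must end with a row whose minimum is 0")


print(storage_limits(1024))
```

A table keeps the policy in one place, so the maxima can be tuned per device class without changing the lookup logic.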

Further, the video output device of the present invention further comprises a designation transmission unit that transmits, to the caption server device, information designating transmission of either caption information associated with caption timing information or caption information not associated with caption timing information, according to the type of the video information received by the video reception unit; the caption reception unit further receives caption information that the caption server device transmits in response to the designating information and that is not associated with caption timing information; and when the caption information received by the caption reception unit is not associated with caption timing information, the output unit combines that caption information with the video information received by the video reception unit and outputs the result.

With this configuration, whether caption information is output using caption timing information can be controlled according to the type of video information.
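The type-dependent designation can be illustrated with the following Python sketch. The mapping of live video to captions without timing information is an assumption made here for illustration, as are the string values and the names `designate` and `should_display`.

```python
from typing import Optional, Tuple


def designate(video_type: str) -> str:
    """Choose which kind of caption information to request from the caption server.

    Assumption for illustration: live video gets captions without timing
    information; any other type gets captions with timing information.
    """
    return "untimed" if video_type == "live" else "timed"


def should_display(timing: Optional[Tuple[int, int]], frame_no: int) -> bool:
    """Decide whether a received caption is combined with the current frame.

    timing is an inclusive (start_frame, end_frame) pair, or None when the
    caption has no associated caption timing information.
    """
    if timing is None:
        return True  # untimed caption: combine with the video as soon as it is received
    start, end = timing
    return start <= frame_no <= end


print(designate("live"), should_display(None, 500))
```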

Further, the video output device of the present invention further comprises a sign language video reception unit that receives, from a sign language video server device, sign language video information, which is information indicating a sign language video that has a plurality of frame images and is associated with the video information received by the video reception unit, and in which sign language timing information indicating the timing at which each frame image is output is associated with each frame image; and the output unit further combines each frame image of the sign language video information received by the sign language video reception unit with the video information received by the video reception unit, using the sign language timing information corresponding to that frame image, and outputs the result.

With this configuration, video information, character string information indicating captions, and sign language video information can be output with their timing appropriately matched.

Further, the video output device of the present invention further comprises: a control request information transmission unit that transmits, to a control server device, control request information, which is information requesting transmission of control information used to control reception of the video information and the caption information; and a control information reception unit that receives the control information transmitted from the control server device in response to the control request information; and the video output device receives the video information and the caption information using the control information.

With this configuration, even if the video output device does not hold in advance the information for controlling reception of video information and caption information, it can control that reception using the control information.

Further, the caption server device of the present invention is a caption server device that transmits caption information to the video output device, and comprises: a caption server side storage unit in which one or more pieces of caption information and caption timing information indicating the timing at which each piece of caption information is output are stored in association with each other; and a caption server side transmission unit that reads out the caption information stored in the caption server side storage unit and transmits it to the video output device.

With this configuration, it is possible to provide character string information indicating captions that can be output with its timing appropriately matched to the video information.

According to the video output device and the like of the present invention, video information and character string information indicating captions can be output with their timing appropriately matched.

FIG. 1: Block diagram of the video system in an embodiment of the present invention
FIG. 2: Flowchart illustrating an example of the operation of the video output device of the video system
FIG. 3: Flowchart illustrating an example of the operation of the caption server device of the video system
FIG. 4: Conceptual diagram of the video system
FIG. 5: Diagram showing an example of the control information management table of the control server device of the video system
FIG. 6: Diagram showing an example of the video information management table of the video server device of the video system
FIG. 7: Schematic diagram showing an example of the video information of the video server device of the video system
FIG. 8: Diagram showing an example of the caption group information management table of the caption server device of the video system
FIG. 9: Diagram showing an example of the caption group information of the caption server device of the video system
FIG. 10: Diagram showing the sign language video management table of the sign language video server device of the video system
FIG. 11: Schematic diagram showing an example of the sign language video information of the sign language video server device of the video system
FIG. 12: Diagram showing the storage amount management table of the video output device of the video system
FIG. 13: Diagram showing an example of the control information received by the video output device of the video system
FIG. 14: Diagram showing the caption set management table of the video output device of the video system with the first caption set information stored (FIG. 14(a)), and with a plurality of pieces of caption set information subsequently appended (FIG. 14(b))
FIG. 15: Diagram showing a display example of video information on the video output device of the video system
FIG. 16: Diagram showing a display example of video information on the video output device of the video system
FIG. 17: Schematic diagram showing an example of the appearance of a computer that implements the video output device of the video system
FIG. 18: Diagram showing an example of the internal configuration of a computer that implements the video output device of the video system

Hereinafter, embodiments of the video output device and the like will be described with reference to the drawings. Components given the same reference numerals in the embodiments operate in the same way, so repeated description may be omitted.

(Embodiment)
FIG. 1 is a block diagram of a video system 1000 according to the present embodiment.
The video system 1000 includes one or more video output devices 1, one or more control server devices 2, one or more video server devices 3, one or more caption server devices 4, and one or more sign language video server devices 5. For convenience of explanation, the case where the system includes one each of the video output device 1, the control server device 2, the video server device 3, the caption server device 4, and the sign language video server device 5 is described here as an example. The video output device 1, the control server device 2, the video server device 3, the caption server device 4, and the sign language video server device 5 are connected via a wired or wireless network, a communication line, or the like so that they can transmit and receive data.

The video output device 1 includes a video reception unit 101, a caption reception unit 102, a video storage unit 103, a caption storage unit 104, a sign language video reception unit 105, a storage amount setting unit 106, an output status acquisition unit 107, a request transmission unit 108, a designation transmission unit 109, a video request information transmission unit 110, a control request information transmission unit 111, a control information reception unit 112, and an output unit 113.

The control server device 2 includes a control information storage unit 201, a control server side reception unit 202, and a control server side transmission unit 203.

The video server device 3 includes a video server side storage unit 301, a video server side reception unit 302, and a video server side transmission unit 303.

The caption server device 4 includes a caption server side storage unit 401, a caption server side reception unit 402, and a caption server side transmission unit 403.

The sign language video server device 5 includes a sign language video server side storage unit 501, a sign language video server side reception unit 502, and a sign language video server side transmission unit 503.

The video output device 1 is a device for outputting video. It is, for example, a terminal device that a user uses to output video. The video output device 1 is implemented by, for example, a computer, a mobile phone, or a multi-function mobile terminal such as a so-called smartphone or tablet terminal.

The video reception unit 101 receives one or more pieces of video information from one or more video server devices 3. Video information is moving image information. The video information received by the video reception unit 101 may be streaming-format video information that the video server device 3 divides into packets or the like and distributes, or video information provided in a progressive download format or a download format. A format that allows output (playback) while downloading, such as a streaming format or a progressive download format, is preferable because it shortens the time the user waits before output of the video information starts. The format, file type, compression method, aspect ratio, number of pixels, color depth, and the like of the video information do not matter. The video information may also include audio information. The protocol used for streaming does not matter. The audio information may or may not be synchronized with the video information.

The video reception unit 101 receives, for example, video information having a plurality of frame images. Video information having frame images may be understood here as video information from which the plurality of frame images to be output can ultimately be obtained. For example, it may be video information having a plurality of field images.

The video reception unit 101 receives, for example, video information that has a plurality of frame images and in which video timing information indicating the timing at which each frame image is output is associated with each frame image. That video timing information is associated with each frame image means, for example, that video timing information is attached to each frame image. The video timing information may be attached by being embedded in each frame image itself, or by being embedded in the header information, footer information, properties, or the like of each frame image. Alternatively, identification information of the frame associated with each frame image and the video timing information may be stored in association with each other in a storage medium or the like (not shown). The association may also take the form of storing, for a group of consecutive frame images, the range of video timing information assigned to the frame images or a plurality of pieces of video timing information.

Video timing information is information used to judge or decide the timing at which each frame image is output. The video timing information is, for example, a serial number indicating the output order of each frame image. It may also be information attached to each frame image that indicates the output time of that frame image, specifically a time code. A time code is expressed, for example, as a combination of hours, minutes, seconds, and frames. A time code is also called a time stamp. The format of the time code does not matter. The time code attached to each frame image is, for example, an SMPTE time code. The time referred to here may be an absolute time, such as a world standard time, or a relative time measured from, for example, the time at which output of the video information started.
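As an illustration of how a time code and a serial frame number relate, the following Python sketch converts a time code written as hours, minutes, seconds, and frames into a frame count. The `HH:MM:SS:FF` text layout, the fixed integer frame rate, and the function name `timecode_to_frame` are assumptions for this sketch; drop-frame time codes are not handled.

```python
def timecode_to_frame(timecode: str, fps: int = 30) -> int:
    """Convert a non-drop-frame 'HH:MM:SS:FF' time code to a zero-based frame number.

    fps is the assumed integer frame rate of the video information.
    """
    parts = timecode.split(":")
    if len(parts) != 4:
        raise ValueError("expected a time code of the form HH:MM:SS:FF")
    hours, minutes, seconds, frames = (int(p) for p in parts)
    if hours < 0 or not (0 <= minutes < 60 and 0 <= seconds < 60 and 0 <= frames < fps):
        raise ValueError("time code field out of range")
    # Total whole seconds times the frame rate, plus the frame offset within the second.
    return ((hours * 60 + minutes) * 60 + seconds) * fps + frames


# One minute and 15 frames at 30 frames per second: 60 * 30 + 15 frames.
print(timecode_to_frame("00:01:00:15"))
```

Converting both forms to a single frame count lets a serial number and a time code be compared directly when matching captions to frames.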

Video information may be associated with information that identifies it, referred to here as video identification information. The video identification information is, for example, the title of the video information or a code assigned to the video information.

The video information may also include information indicating the type of the video information. The type of video information may be, for example, a type classified by the timing at which the video information was created. Information indicating a type classified by creation timing is, for example, information indicating whether the video information is live video information. The type may also be one classified by the content shown in the video information. For example, the information indicating the type may indicate whether the video has story-like content or not. The information indicating the type of video information is, for example, character string information.

The video reception unit 101, for example, stores the received video information in the video storage unit 103. For example, when the video server device 3 transmits the video information in divided form, the video reception unit 101 stores the received video information in the video storage unit 103 sequentially. When the video information is delivered by streaming, for example, frame images constructed from received packets or other information may be stored in the video storage unit 103 sequentially. The video reception unit 101 may also decode the video information when storing it.

The video reception unit 101 may receive video information using control information, described later. Receiving using control information is a concept that includes, for example, receiving video information transmitted from a video server device 3 identified using the control information.

The video reception unit 101 is usually implemented by wireless or wired communication means, but may be implemented by means for receiving broadcasts. The video reception unit 101 may or may not be considered to include a communication device.

The caption reception unit 102 receives one or more pieces of caption information and caption timing information from the caption server device 4. Caption information is character string information indicating a caption. The character string information may be text information or still image information showing the character string. A character string here consists of one or more characters. Caption timing information is information indicating the timing at which each piece of caption information is output relative to the output of the video information. The video information here is, for example, the video information received by the video reception unit 101, and may be regarded as the video information with which the caption information is combined for output. A piece of caption information and the caption timing information indicating the timing at which it is output may be regarded as associated with each other. The caption reception unit 102 receives from the caption server device 4, for example, one or more pieces of caption set information, each of which is information having one piece of caption information and the caption timing information indicating the timing at which that caption information is combined with the video information and output.

字幕受信部１０２は、例えば、映像受信部１０１が受信した映像情報に対応付けられた字幕情報を受信する。映像情報に対応付けられた字幕情報を受信するということは、例えば、映像受信部１０１が受信した映像情報と同じ映像識別情報が対応付けられている字幕情報や字幕組情報を受信することである。字幕受信部１０２は、一の映像情報に対応付けられた全ての字幕情報をまとめて受信しても良い。 For example, the caption receiving unit 102 receives caption information associated with the video information received by the video receiving unit 101. Receiving caption information associated with video information means, for example, receiving caption information or caption group information associated with the same video identification information as the video information received by the video receiving unit 101. The caption receiving unit 102 may receive all the caption information associated with one piece of video information collectively.

映像情報の出力に対する字幕情報が出力されるタイミングとは、例えば、映像情報の一部（例えば、最初のフレーム画像）が出力されたタイミング（例えば時刻等）を基準とした字幕情報が出力されるタイミングや、映像情報の各フレーム画像が出力されるタイミングを示す情報（例えば、タイムコードやフレーム画像の出力順番等）を用いて指定されるタイミングである。字幕タイミング情報は、字幕情報の出力を開始するタイミングを指定する情報であっても良いし、字幕情報を出力する期間を指定する情報であっても良い。期間を指定する情報は、例えば、字幕情報の出力を開始するタイミングを指定する情報と、出力を終了するタイミングを指定する情報との組合せや、出力を開始するタイミングを指定する情報と、出力の継続期間とを指定する情報との組合せ等である。字幕情報の出力を開始するタイミングを示す字幕タイミング情報は、例えば、字幕情報に対応する映像情報の、所定のフレーム画像（通常は、最初のフレーム画像）を出力開始してからの経過時間や、所定のフレーム画像の出力を開始してから出力した総フレーム画像数や、次に出力するフレーム画像の出力順番を示す情報である。出力の継続期間を指定する情報とは、例えば、出力の継続時間や、出力されるフレーム数を指定する情報である。字幕情報の出力を開始するタイミングや、字幕情報の出力を行う期間は、例えば、時刻を示す情報を用いて指定される。例えば、期間は、開始時刻と終了時刻とをそれぞれ示す情報で指定される。ここでの時刻は、上述したような絶対的な時刻であっても相対的な時刻であってもよい。ここでの時刻を示す情報としては、例えば、上述したようなタイムコードが用いられる。 The timing at which caption information is output relative to the output of the video information is, for example, a timing defined relative to the timing (for example, the time) at which a part of the video information (for example, the first frame image) was output, or a timing specified using information indicating when each frame image of the video information is output (for example, a time code or the output order of the frame images). The caption timing information may specify the timing at which output of the caption information starts, or the period during which the caption information is output. Information specifying a period is, for example, a combination of information specifying the timing at which output of the caption information starts and information specifying the timing at which output ends, or a combination of information specifying the timing at which output starts and information specifying how long output continues. Caption timing information indicating the timing at which output of the caption information starts is, for example, the elapsed time since output of a predetermined frame image (usually the first frame image) of the video information corresponding to the caption information began, the total number of frame images output since output of the predetermined frame image began, or information indicating the output order of the frame image to be output next. Information specifying how long output continues is, for example, information specifying the output duration or the number of frames to be output. The timing at which output of the caption information starts and the period during which it is output are specified using, for example, information indicating a time. For example, a period is specified by information indicating a start time and an end time. The time here may be an absolute time or a relative time, as described above. As the information indicating the time here, for example, a time code as described above is used.

また、字幕タイミング情報は、１以上の映像タイミング情報を指定する情報であってもよい。例えば、字幕タイミング情報は、映像情報の１または２以上のフレーム画像に対応付けられた１または２以上のタイムコードを指定する情報であっても良い。例えば、字幕タイミング情報は、一のタイムコードや、タイムコードの範囲を示す情報（例えば、起点となるタイムコードと終点となるタイムコードとの組合せ）であっても良い。 The subtitle timing information may be information specifying one or more video timing information. For example, the subtitle timing information may be information specifying one or more time codes associated with one or more frame images of the video information. For example, the subtitle timing information may be one time code or information indicating a time code range (for example, a combination of a start time code and an end time code).
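
As an illustration only, one piece of caption group information could be modeled as below. This is a minimal sketch in Python, assuming timing is expressed as seconds elapsed since the first frame image; the names `CaptionSet`, `start_s`, `end_s`, `duration_s` and the 3-second fallback are hypothetical and do not appear in the specification.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class CaptionSet:
    """One caption paired with its timing (hypothetical model).

    Times are seconds elapsed since output of the first frame image began.
    """
    text: str
    start_s: float                      # timing at which output starts
    end_s: Optional[float] = None       # timing at which output ends, if given
    duration_s: Optional[float] = None  # alternative: how long output continues

    def interval(self, default_duration_s: float = 3.0) -> Tuple[float, float]:
        """Resolve the timing to a (start, end) pair in seconds.

        default_duration_s is an assumed fallback for start-only timing.
        """
        if self.end_s is not None:
            return (self.start_s, self.end_s)
        if self.duration_s is not None:
            return (self.start_s, self.start_s + self.duration_s)
        return (self.start_s, self.start_s + default_duration_s)

    def is_active(self, elapsed_s: float) -> bool:
        """Return True if the caption should be shown at elapsed_s."""
        start, end = self.interval()
        return start <= elapsed_s < end
```

The same shape would work with frame counts or time codes in place of seconds; only the unit of the three timing fields changes.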

字幕受信部１０２は、例えば、受信した字幕情報を、字幕タイミング情報と対応付けて字幕格納部１０４に蓄積する。字幕受信部１０２は、字幕情報と字幕タイミング情報との組、例えば、字幕組情報を字幕格納部１０４に蓄積するようにしても良い。 For example, the caption receiving unit 102 stores the received caption information in the caption storage unit 104 in association with the caption timing information. The subtitle receiving unit 102 may store a set of subtitle information and subtitle timing information, for example, subtitle group information, in the subtitle storage unit 104.

なお、字幕受信部１０２は、字幕サーバ装置４が送信する、字幕タイミング情報と対応付けられていない字幕情報を更に受信してもよい。この字幕情報は、例えば、後述するような指定送信部１０９が送信する字幕タイミング情報と対応付けられていない字幕情報の送信を指定する字幕指定情報に応じて、字幕サーバ装置４が送信する字幕情報である。 Note that the caption receiving unit 102 may further receive caption information that the caption server device 4 transmits without associated caption timing information. This caption information is, for example, caption information that the caption server device 4 transmits in response to caption designation information, transmitted by the designation transmission unit 109 described later, that designates transmission of caption information not associated with caption timing information.

字幕受信部１０２は、例えば、後述する制御情報を用いて字幕情報や字幕タイミング情報等を受信しても良い。制御情報を用いて受信するということは、例えば、制御情報を用いて特定された字幕サーバ装置４から送信される字幕情報や字幕タイミング情報等を受信することも含む概念である。 For example, the caption receiving unit 102 may receive caption information, caption timing information, and the like using control information described later. Receiving using control information is a concept including receiving subtitle information, subtitle timing information, and the like transmitted from subtitle server device 4 specified using control information, for example.

字幕受信部１０２は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段で実現されても良い。字幕受信部１０２は、通信デバイスを含むと考えても、含まないと考えても良い。 The caption receiving unit 102 is generally realized by a wireless or wired communication unit, but may be realized by a unit that receives a broadcast. The caption receiving unit 102 may or may not include a communication device.

映像格納部１０３には、映像受信部１０１が受信した映像情報が一時的に格納される。例えば、映像格納部１０３には、例えば、映像情報の１または２以上のフレーム画像が格納される。映像格納部１０３は、映像格納部１０３に格納されたフレーム画像は、通常、蓄積順や、フレーム画像に対応付けられた映像タイミング情報が示す順番に読み出されて、読み出されたフレーム画像は、削除される。例えば、映像格納部１０３は、フレームメモリやＶＲＡＭ等と考えても良い。映像格納部１０３は、映像受信部１０１が受信した映像情報を一時的にバッファリングするためのいわゆるバッファやキャッシュと考えても良い。なお、映像受信部１０１が、プログレッシブダウンロード形式や、ダウンロード形式で提供される映像情報を受信した場合、この映像情報は、一旦、映像格納部１０３や図示しない記憶媒体等に蓄積された後、蓄積された映像情報から読み出された１または２以上のフレーム画像が、映像格納部１０３に蓄積されるようにしても良い。 Video information received by the video receiving unit 101 is temporarily stored in the video storage unit 103. For example, one or more frame images of the video information are stored in the video storage unit 103. The frame images stored in the video storage unit 103 are normally read out in the order in which they were accumulated or in the order indicated by the video timing information associated with them, and each frame image is deleted once it has been read out. For example, the video storage unit 103 may be regarded as a frame memory, a VRAM, or the like. The video storage unit 103 may also be regarded as a so-called buffer or cache for temporarily buffering the video information received by the video receiving unit 101. When the video receiving unit 101 receives video information provided in a progressive download format or a download format, this video information may first be accumulated in the video storage unit 103, a storage medium (not shown), or the like, and one or more frame images read from the accumulated video information may then be accumulated in the video storage unit 103.

映像格納部１０３は、不揮発性の記録媒体が好適であるが、揮発性の記録媒体でも実現可能である。かかることは以下の格納部においても同様である。 The video storage unit 103 is preferably a non-volatile recording medium, but can also be realized by a volatile recording medium. The same applies to the following storage units.

字幕格納部１０４には、字幕受信部１０２が受信した１または２以上の字幕情報が一時的に格納される。字幕格納部１０４は、字幕受信部１０２が受信した字幕情報を一時的にバッファリングするためのいわゆるバッファと考えても良い。字幕格納部１０４には、更に、字幕情報に対応する字幕タイミング情報が、字幕情報と対応付けられて蓄積されても良い。字幕格納部１０４には、例えば、字幕受信部１０２が受信した１または２以上の字幕組情報が格納されても良い。 The caption storage unit 104 temporarily stores one or more pieces of caption information received by the caption receiving unit 102. The caption storage unit 104 may be considered as a so-called buffer for temporarily buffering the caption information received by the caption receiving unit 102. The subtitle storage unit 104 may further store subtitle timing information corresponding to the subtitle information in association with the subtitle information. In the caption storage unit 104, for example, one or more caption group information received by the caption receiving unit 102 may be stored.

なお、字幕格納部１０４の字幕情報の格納量の最大値と、映像格納部１０３の映像情報の格納量の最大値とは、異なっていることが好ましく、映像情報の格納量の最大値が、字幕情報の格納量の最大値よりも大きい値に設定することがより好ましい。通常、文字列で構成される字幕情報の方が、画像の情報で構成される映像情報よりも、同じ期間に出力されるデータ量が小さいため、字幕情報を字幕サーバ装置４から取得するために要する時間は、同じ時間分の映像情報を取得するために要する時間よりも短く、字幕情報の取得に要する時間は短くて済む。このため、映像情報とは異なり、大量の字幕情報を映像出力装置１に、予めバッファリングしておかなくても、通信状況の変化等による字幕情報の出力の遅れは発生しにくいため、字幕格納部１０４の格納量の最大値を映像情報よりも小さくしても、字幕情報の出力についての影響は少ない。また、字幕格納部１０４の格納量を小さくすることで、映像出力装置１の記憶媒体の使用量を削減することが可能となる。 It is preferable that the maximum storage amount of caption information in the caption storage unit 104 differ from the maximum storage amount of video information in the video storage unit 103, and more preferable that the maximum storage amount of video information be set larger than that of caption information. Caption information composed of character strings usually involves less data for the same output period than video information composed of image information, so acquiring caption information from the caption server device 4 takes less time than acquiring video information covering the same period. Therefore, unlike video information, output of caption information is unlikely to be delayed by changes in communication conditions or the like even if a large amount of caption information is not buffered in the video output device 1 in advance, and making the maximum storage amount of the caption storage unit 104 smaller than that for video information has little effect on caption output. Reducing the storage amount of the caption storage unit 104 also reduces the usage of the storage medium of the video output device 1.

なお、字幕格納部１０４及び映像格納部１０３は、一の記憶媒体にそれぞれ割り当てられた格納領域であっても良い。 Note that the caption storage unit 104 and the video storage unit 103 may be storage areas respectively allocated to one storage medium.

手話映像受信部１０５は、手話映像サーバ装置５から手話映像情報を受信する。手話映像情報は手話の映像を示す情報である。手話映像受信部１０５は、映像受信部１０１が受信した映像情報に対応付けられた手話映像情報を受信する。一の映像情報に対応付けられた手話映像情報とは、例えば、一の映像情報の映像識別情報と対応付けられた手話映像情報である。手話映像受信部１０５は、複数のフレーム画像を有する情報であって、各フレーム画像を出力するタイミングを示す手話タイミング情報が、各フレーム画像と対応付けられている手話映像情報を受信する。手話映像情報は、映像に示されている内容が手話であることを除けば、上述した映像情報と同様の情報であるため、ここでは詳細な説明は省略する。また、ここでの手話タイミング情報や、フレーム画像と手話タイミング情報との対応付け等については、上述した映像情報や、映像情報と映像タイミング情報との対応付けと同様であるので、詳細な説明は省略する。 The sign language video receiving unit 105 receives sign language video information from the sign language video server device 5. The sign language video information is information indicating a sign language video. The sign language video receiving unit 105 receives sign language video information associated with the video information received by the video receiving unit 101. The sign language video information associated with one piece of video information is, for example, sign language video information associated with the video identification information of that video information. The sign language video receiving unit 105 receives sign language video information that has a plurality of frame images and in which sign language timing information, indicating the timing at which each frame image is output, is associated with each frame image. Except that the content shown is sign language, the sign language video information is the same kind of information as the video information described above, so a detailed description is omitted here. Likewise, the sign language timing information and the association between frame images and sign language timing information are the same as the video information and the association between the video information and the video timing information described above, so a detailed description is omitted.

手話映像受信部１０５は、例えば、受信した手話映像を図示しない手話映像格納部に蓄積する。なお、この手話映像格納部は、例えば、上述した映像格納部１０３と同様の格納部である。 For example, the sign language video receiving unit 105 stores the received sign language video in a sign language video storage unit (not shown). The sign language video storage unit is a storage unit similar to the video storage unit 103 described above, for example.

手話映像受信部１０５は、例えば、映像受信部１０１と同様に、後述する制御情報を用いて手話映像情報等を受信しても良い。 For example, the sign language video receiving unit 105 may receive sign language video information or the like using control information to be described later, similarly to the video receiving unit 101.

手話映像受信部１０５は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段で実現されても良い。手話映像受信部１０５は、通信デバイスを含むと考えても、含まないと考えても良い。 The sign language video receiving unit 105 is usually realized by a wireless or wired communication means, but may be realized by a means for receiving a broadcast. The sign language video reception unit 105 may or may not include a communication device.

格納量設定部１０６は、映像格納部１０３に格納される映像情報の格納量の最大値と、字幕格納部１０４に格納される字幕情報の格納量の最大値とを、それぞれ、自装置の処理能力に応じた値に設定する。ここでの映像情報の格納量は、例えば、映像格納部１０３に格納され得る映像情報の総出力時間やフレーム数、データ容量（例えば、ビット数等）で表される。例えば、映像情報の格納量の最大値を、３０秒に設定するということは、映像格納部１０３に、最大で３０秒間分の映像情報（例えば、３０秒間分のフレーム画像）が格納可能であることを意味する。 The storage amount setting unit 106 sets the maximum storage amount of video information stored in the video storage unit 103 and the maximum storage amount of caption information stored in the caption storage unit 104 to values according to the processing capability of the own device. The storage amount of video information here is expressed by, for example, the total output time, the number of frames, or the data capacity (for example, the number of bits) of the video information that can be stored in the video storage unit 103. For example, setting the maximum storage amount of video information to 30 seconds means that up to 30 seconds of video information (for example, 30 seconds of frame images) can be stored in the video storage unit 103.

また、ここでの字幕情報の格納量は、例えば、字幕格納部１０４に格納される字幕情報の出力開始時刻の属する期間や、格納される字幕情報の出力時間（出力期間）、字幕情報または字幕組情報の数、または、字幕情報のデータ容量や総文字数で表される。例えば、字幕情報の格納量の最大値を１０秒に設定するということは、字幕格納部１０４に、出力される時間の合計時間が、１０秒以内の範囲となる１または２以上の字幕情報が格納可能であることを意味する。 The storage amount of caption information here is expressed by, for example, the period to which the output start times of the caption information stored in the caption storage unit 104 belong, the output time (output period) of the stored caption information, the number of pieces of caption information or caption group information, or the data capacity or total number of characters of the caption information. For example, setting the maximum storage amount of caption information to 10 seconds means that the caption storage unit 104 can store one or more pieces of caption information whose total output time is within 10 seconds.
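
The idea of a storage limit expressed as total output time can be sketched as follows. This is a hedged illustration, not the device's actual structure: the class name `CaptionBuffer` and its methods are hypothetical, and each caption is assumed to carry a known output duration in seconds.

```python
from collections import deque


class CaptionBuffer:
    """Caption store capped by total output time (hypothetical sketch)."""

    def __init__(self, max_total_s: float) -> None:
        self.max_total_s = max_total_s  # e.g. 10.0 for a 10-second limit
        self._items = deque()           # (text, duration_s), in output order
        self._total_s = 0.0

    @property
    def stored_s(self) -> float:
        """Total output time of the captions currently stored."""
        return self._total_s

    def try_add(self, text: str, duration_s: float) -> bool:
        """Store a caption unless doing so would exceed the limit."""
        if self._total_s + duration_s > self.max_total_s:
            return False
        self._items.append((text, duration_s))
        self._total_s += duration_s
        return True

    def pop_next(self) -> str:
        """Remove and return the next caption to output."""
        text, duration_s = self._items.popleft()
        self._total_s -= duration_s
        return text
```

Counting captions, characters, or bytes instead of seconds would only change what `try_add` accumulates.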

格納量設定部１０６は、例えば、自装置の処理能力に応じて上記の格納量の最大値を示す値を取得し、取得した値で、図示しない記憶媒体等に最大値を設定する値として蓄積する。また、図示しない記憶媒体に格納されている上記の格納量の最大値を設定する値を、自装置の処理能力に応じて取得した値で更新しても良い。 For example, the storage amount setting unit 106 acquires a value indicating the maximum storage amount according to the processing capability of the own device, and accumulates the acquired value in a storage medium (not shown) or the like as the value that sets the maximum. Alternatively, the value that sets the maximum storage amount, stored in a storage medium (not shown), may be updated with a value acquired according to the processing capability of the own device.

自装置の処理能力とは、映像出力装置１の処理能力を決定する１以上の要素、例えば、ＭＰＵ及び／またはＧＰＵの処理能力を示す値（例えば、動作クロック数や、コア数等）や、キャッシュやＲＡＭ等のメモリのサイズ、通信速度、表示解像度等の１以上の組合せで決定される。自装置の処理能力を決定する要素を示す情報は、例えばコンピュータシステムのプロファイルを取得するソフトウェア等として公知であるので、ここでは説明は省略する。 The processing capability of the own device is one or more elements that determine the processing capability of the video output device 1, for example, values indicating the processing capability of the MPU and / or GPU (for example, the number of operating clocks, the number of cores, etc.) It is determined by one or more combinations of the size of a memory such as a cache and RAM, communication speed, display resolution, and the like. Information indicating the elements that determine the processing capability of the own apparatus is well known as, for example, software for acquiring a profile of a computer system, and thus description thereof is omitted here.

格納量設定部１０６は、通常、自装置の処理能力が高い場合（例えば、処理が速い、容量が大きい等の場合）の映像情報の格納量の最大値が、処理能力の低い場合の最大値以上となるように、映像情報の格納量の最大値を決定する。かかることは、字幕情報の格納量の最大値についても同様である。 The storage amount setting unit 106 normally determines the maximum storage amount of video information so that the maximum value when the processing capability of the own device is high (for example, when processing is fast or capacity is large) is equal to or greater than the maximum value when the processing capability is low. The same applies to the maximum storage amount of caption information.

格納量設定部１０６は、例えば、自装置の処理能力に関する値、例えば上記のような１または２以上の要素の値を取得する。そして、この値を用いて、格納量の最大値を決定する。 The storage amount setting unit 106 acquires, for example, a value related to the processing capability of the own device, for example, the value of one or more elements as described above. Then, using this value, the maximum storage amount is determined.

格納量設定部１０６は、例えば、映像情報の格納量の最大値を、ＭＰＵ及び／またはＧＰＵの動作クロック数の値が増加するに従って、線形的に増加、あるいは段階的に増加する値に設定する。例えば、格納量設定部１０６は、クロック数を引数とした非減少関数の値を、映像情報の格納量に設定する。 For example, the storage amount setting unit 106 sets the maximum value of the storage amount of video information to a value that increases linearly or stepwise as the value of the number of operation clocks of the MPU and / or GPU increases. . For example, the storage amount setting unit 106 sets the value of the non-decreasing function with the number of clocks as an argument as the storage amount of video information.
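
A non-decreasing function of the clock rate could look like the sketch below. The function name `max_video_seconds` and all constants (a 10-second base, 10 seconds per GHz, a 60-second cap) are assumptions chosen for illustration; the specification does not give concrete values.

```python
def max_video_seconds(clock_mhz: float,
                      base_s: float = 10.0,
                      per_ghz_s: float = 10.0,
                      cap_s: float = 60.0) -> float:
    """Return an assumed maximum video storage amount in seconds.

    The result grows linearly with the clock rate and is clamped at cap_s,
    so it never decreases as clock_mhz increases.
    """
    return min(cap_s, base_s + per_ghz_s * clock_mhz / 1000.0)
```

For example, with these assumed constants a 2000 MHz clock gives 30 seconds, and any clock at or above 5000 MHz gives the 60-second cap.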

また、格納量設定部１０６は、例えば、映像情報の格納量の最大値を、メモリの容量の値が増加するに従って、線形的に増加、あるいは段階的に増加する値に設定する。例えば、格納量設定部１０６は、映像情報の格納量の最大値を、メモリの容量の値を引数とする非減少関数の値に設定する。 For example, the storage amount setting unit 106 sets the maximum value of the storage amount of video information to a value that increases linearly or increases stepwise as the memory capacity value increases. For example, the storage amount setting unit 106 sets the maximum value of the storage amount of video information to the value of a non-decreasing function that uses the value of the memory capacity as an argument.

また、格納量設定部１０６は、例えば、映像情報の格納量の最大値を、表示解像度の値が増加するに従って、線形的に増加、あるいは段階的に増加する値に設定する。例えば、格納量設定部１０６は、映像情報の格納量の最大値を、表示解像度を引数とする非減少関数の値に設定してもよい。 In addition, the storage amount setting unit 106 sets, for example, the maximum value of the storage amount of video information to a value that increases linearly or stepwise as the display resolution value increases. For example, the storage amount setting unit 106 may set the maximum value of the storage amount of video information to the value of a non-decreasing function that uses the display resolution as an argument.

また、格納量設定部１０６は、これらの組合せにより映像情報の格納量の最大値を決定しても良い。かかることは、字幕情報の格納量の最大値についても同様である。 Further, the storage amount setting unit 106 may determine the maximum value of the storage amount of video information by a combination thereof. The same applies to the maximum value of the subtitle information storage amount.

また、格納量設定部１０６は、自装置の処理能力に関する１以上の要素の値を取得し、図示しない記憶媒体等に予め格納されている、上述した１以上の要素の値の範囲と、映像情報の格納量の最大値と、字幕情報の格納量の最大値とを対応付けた情報から、上記の要素の値を含む値の範囲と対応付けられた映像情報の格納量の最大値と、字幕情報の格納量の最大値とをそれぞれ取得しても良い。 In addition, the storage amount setting unit 106 acquires the value of one or more elements related to the processing capability of the own device, and stores the above-described range of one or more element values stored in advance in a storage medium (not shown) and the video. From the information in which the maximum value of the information storage amount and the maximum value of the subtitle information storage amount are associated, the maximum value of the storage amount of the video information associated with the range of values including the values of the above elements, The maximum value of the storage amount of caption information may be acquired.
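
The table-based approach, in which ranges of a capability value are associated with both maximum values, can be sketched as follows. The table contents, the choice of RAM size as the capability element, and the names `CAPACITY_TABLE` and `storage_limits` are all hypothetical.

```python
from typing import Tuple

# Assumed table: (minimum RAM in MiB, max video storage in seconds,
# max caption storage in seconds). Rows are sorted by minimum RAM.
CAPACITY_TABLE = [
    (0, 10, 5),
    (1024, 30, 10),
    (4096, 60, 20),
]


def storage_limits(ram_mib: int) -> Tuple[int, int]:
    """Look up (max video seconds, max caption seconds) for a RAM size.

    The last row whose minimum RAM does not exceed ram_mib wins.
    """
    video_s, caption_s = CAPACITY_TABLE[0][1], CAPACITY_TABLE[0][2]
    for min_ram, row_video_s, row_caption_s in CAPACITY_TABLE:
        if ram_mib >= min_ram:
            video_s, caption_s = row_video_s, row_caption_s
    return video_s, caption_s
```

In this assumed table the video limit is always larger than the caption limit, matching the stated preference that the video storage maximum exceed the caption storage maximum.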

なお、格納量設定部１０６は、上記以外のどのような処理により、映像情報の格納量の最大値を設定しても良い。 The storage amount setting unit 106 may set the maximum value of the storage amount of video information by any process other than the above.

出力状況取得部１０７は、出力部１１３による映像情報及び字幕情報の少なくとも一方の出力状況を示す情報を取得する。映像情報の出力状況とは、映像情報が予め指定されたタイミングで出力されているか否か（フレーム画像の出力が停止していないか否か等）、あるいは、映像情報が出力可能な状況であるか否か（例えば、映像格納部１０３に出力可能な映像情報が格納されているか）を示す情報である。予め指定されたタイミングとは、例えば、各フレーム画像に対応付けられた映像タイミング情報が示すタイミングや、映像情報の各フレーム画像の再生レート等を示す情報が示すタイミング等である。字幕情報の出力状況についても同様である。 The output status acquisition unit 107 acquires information indicating the output status of at least one of video information and subtitle information by the output unit 113. The output status of the video information is whether the video information is output at a predetermined timing (whether the output of the frame image is not stopped, etc.), or the status in which the video information can be output. (For example, video information that can be output is stored in the video storage unit 103). The timing designated in advance is, for example, a timing indicated by video timing information associated with each frame image, a timing indicated by information indicating a reproduction rate of each frame image of the video information, or the like. The same applies to the output status of subtitle information.

出力状況取得部１０７は、例えば、映像格納部１０３における映像情報の格納状況、及び字幕格納部１０４における字幕情報の格納状況の少なくとも一方を示す情報を、出力状況を示す情報として取得する。ここでの映像情報の格納状況とは、例えば、未出力の映像情報の格納状況である。かかることは、字幕情報の格納状況についても同様である。映像情報の格納状況は、例えば、映像情報の各フレーム画像が、予め指定されたタイミングで出力できる状況にあるか否かを示す情報となることから、実質的に映像情報の出力状況を示す情報と考えられる。例えば、出力状況取得部１０７は、予め指定された映像情報の格納量の最大値または格納量設定部１０６が設定した映像情報の格納量の最大値のうちの、予め指定された比率以上の映像情報が、映像格納部１０３に格納されているか否かを示す情報を取得する。予め指定された比率は、通常、０から１００％までの間の比率である。あるいは、最大値に対する映像情報が格納されている比率を示す値を取得する。あるいは、予め指定された閾値以上の映像情報が格納されているか否かを判断しても良い。あるいは、格納量設定部１０６が設定した映像情報の格納量の最大値に応じて取得された閾値以上の映像情報が格納されているか否かを判断しても良い。この閾値は、映像情報の格納量の値を引数として、予め指定された関数を用いて算出されても良いし、映像情報の格納量の最大値の範囲と対応付けられて図示しない記憶媒体等に予め格納されている閾値から取得しても良い。字幕情報の格納状況に関しても同様である。なお、字幕情報の予め指定されたタイミングは、例えば、字幕情報に対応付けられた字幕タイミング情報が示すタイミングである。 The output status acquisition unit 107 acquires, for example, information indicating at least one of the storage status of video information in the video storage unit 103 and the storage status of caption information in the caption storage unit 104 as the information indicating the output status. The storage status of video information here is, for example, the storage status of video information that has not yet been output. The same applies to the storage status of caption information. The storage status of video information indicates, for example, whether each frame image of the video information can be output at the timing specified in advance, and can therefore be regarded as information that substantially indicates the output status of the video information. For example, the output status acquisition unit 107 acquires information indicating whether the video storage unit 103 holds video information amounting to at least a predetermined ratio of either a maximum storage amount of video information specified in advance or the maximum storage amount of video information set by the storage amount setting unit 106. The predetermined ratio is usually between 0 and 100%. Alternatively, the unit acquires a value indicating the ratio of stored video information to the maximum value. Alternatively, it may determine whether video information equal to or greater than a threshold specified in advance is stored. Alternatively, it may determine whether video information equal to or greater than a threshold acquired according to the maximum storage amount of video information set by the storage amount setting unit 106 is stored. This threshold may be calculated by a function specified in advance that takes the storage amount of video information as an argument, or may be acquired from thresholds stored in advance in a storage medium (not shown) or the like in association with ranges of the maximum storage amount of video information. The same applies to the storage status of caption information. Note that the timing specified in advance for caption information is, for example, the timing indicated by the caption timing information associated with the caption information.
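
The two kinds of storage status described here, a fill ratio relative to the maximum and a yes/no judgment against a predetermined ratio, can be sketched as below. The function names and the default ratio of 0.5 are assumptions for illustration; amounts may be in seconds, frames, or bytes as long as both arguments use the same unit.

```python
def fill_ratio(stored: float, maximum: float) -> float:
    """Return the stored amount as a fraction of the maximum, within 0..1."""
    if maximum <= 0:
        raise ValueError("maximum must be positive")
    return min(1.0, max(0.0, stored / maximum))


def is_sufficient(stored: float, maximum: float, min_ratio: float = 0.5) -> bool:
    """Return True if at least min_ratio of the maximum is stored.

    min_ratio stands in for the 'predetermined ratio'; 0.5 is an assumed value.
    """
    return fill_ratio(stored, maximum) >= min_ratio
```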

なお、出力状況を示す情報は、例えば、映像情報及び字幕情報の少なくとも一方が出力されているか否かを示す情報であってもよい。例えば、映像情報や字幕情報の出力が一時停止されているか否かや、スロー再生されているか否か等の情報であっても良い。また、映像受信部１０１や字幕受信部１０２の通信速度の情報であっても良い。 Note that the information indicating the output status may be, for example, information indicating whether or not at least one of video information and caption information is output. For example, it may be information such as whether output of video information or subtitle information has been paused or whether playback is slow. Further, it may be information on the communication speed of the video receiving unit 101 or the subtitle receiving unit 102.

要求送信部１０８は、字幕サーバ装置４に、字幕要求情報を送信する。字幕要求情報は、字幕情報の送信を要求する情報である。字幕要求情報は、字幕情報に対応する映像情報識別情報を有していても良い。要求送信部１０８は、例えば、後述する映像要求情報送信部１１０が送信する映像要求情報が有する映像識別情報と同じ映像識別情報を有する要求情報を送信する。 The request transmission unit 108 transmits caption request information to the caption server device 4. The caption request information is information for requesting transmission of caption information. The caption request information may include video information identification information corresponding to the caption information. For example, the request transmission unit 108 transmits request information having the same video identification information as the video identification information included in the video request information transmitted by the video request information transmission unit 110 described later.

字幕要求情報は、例えば、出力順番が、字幕受信部１０２が直前に受信した字幕情報の次以降である１または２以上の字幕情報の送信を要求する情報である。字幕要求情報は、送信を要求する字幕情報の量を指定する情報を有していても良い。ここでの量は、送信を要求する字幕情報が対応付けられた字幕タイミング情報の期間や、字幕情報または字幕組情報の数や、字幕情報のデータ容量や総文字数等で表される量である。 The caption request information is, for example, information requesting transmission of one or more pieces of caption information whose output order follows that of the caption information most recently received by the caption receiving unit 102. The caption request information may include information specifying the amount of caption information whose transmission is requested. The amount here is expressed by, for example, the period covered by the caption timing information associated with the requested caption information, the number of pieces of caption information or caption group information, or the data capacity or total number of characters of the caption information.

要求送信部１０８は、例えば、出力状況取得部１０７が取得した出力状況を示す情報に応じて、字幕サーバ装置４に、字幕要求情報を送信する。要求送信部１０８は、例えば、字幕格納部１０４に字幕情報があふれないように、あるいは、字幕格納部１０４の字幕情報が空にならないように、出力状況を示す情報に応じて字幕要求情報を送信する。例えば、要求送信部１０８は、出力状況を示す情報に応じたタイミングで字幕要求情報を送信する。例えば、要求送信部１０８は、出力状況取得部１０７が取得した出力状況を示す情報として、映像情報（または、字幕情報）の格納されている量を取得した場合に、この格納されている量の増加に応じて、字幕要求情報を送信する間隔を、線形的に増加、あるいは段階的に増加させるようにする。要求送信部１０８は、出力状況取得部１０７が、出力状況を示す情報として、映像情報（または、字幕情報）の格納量の最大値のうちの予め指定された比率以上の映像情報（または、字幕情報）が格納されているか否かを示す情報を取得する場合において、予め指定された比率以上の映像情報（または、字幕情報）が格納されていることを示す情報を取得した場合は、要求送信部１０８は、例えば、予め指定された比率未満の場合よりも、遅いタイミング毎に、即ち、長い間隔で字幕情報の送信を要求する情報を送信する。あるいは、出力状況取得部１０７が、最大値に対する映像情報（または、字幕情報）が格納されている比率を示す値を取得する場合、要求送信部１０８は、この比率を示す値を引数とした非増加関数により算出したタイミング毎に字幕情報の送信を要求する情報を送信しても良い。非増加関数は、例えば、線形的に減少する関数や、段階的に減少する関数等である。あるいは、要求送信部１０８は、この比率を示す情報の値の属する値域に対して予め設定された、比率が高くなる毎に非増加となるタイミング毎に、字幕情報の送信を要求する情報を送信しても良い。なお、要求情報は、映像情報の出力状況と字幕情報の出力状況の組合せに応じて字幕要求情報を送信しても良い。 For example, the request transmission unit 108 transmits caption request information to the caption server device 4 according to the information indicating the output status acquired by the output status acquisition unit 107. The request transmission unit 108 transmits the caption request information according to the information indicating the output status so that, for example, the caption storage unit 104 neither overflows with caption information nor runs empty. For example, the request transmission unit 108 transmits the caption request information at a timing that depends on the information indicating the output status. For example, when the stored amount of video information (or caption information) is acquired as the information indicating the output status, the request transmission unit 108 increases the interval at which it transmits caption request information, linearly or stepwise, as the stored amount increases. When the output status acquisition unit 107 acquires, as the information indicating the output status, information indicating whether video information (or caption information) amounting to at least a predetermined ratio of the maximum storage amount is stored, and the acquired information indicates that at least the predetermined ratio is stored, the request transmission unit 108 transmits the information requesting transmission of caption information at, for example, later timings, that is, at longer intervals, than when less than the predetermined ratio is stored. Alternatively, when the output status acquisition unit 107 acquires a value indicating the ratio of stored video information (or caption information) to the maximum value, the request transmission unit 108 may transmit the information requesting transmission of caption information at each timing calculated by a non-increasing function that takes the value indicating this ratio as an argument. The non-increasing function is, for example, a function that decreases linearly or a function that decreases stepwise. Alternatively, the request transmission unit 108 may transmit the information requesting transmission of caption information at timings that are set in advance for the value range to which the value indicating this ratio belongs and that are non-increasing as the ratio becomes higher. Note that the caption request information may be transmitted according to a combination of the output status of the video information and the output status of the caption information.
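
The pacing of caption requests can be sketched as follows, following the reading in which the interval between requests grows as more information is already stored. Both functions and all constants (1-second and 5-second intervals, a 0.5 threshold) are hypothetical; `ratio` is the stored amount divided by the maximum storage amount.

```python
def stepwise_interval_s(ratio: float,
                        threshold: float = 0.5,
                        short_s: float = 1.0,
                        long_s: float = 5.0) -> float:
    """Two-step pacing: wait longer once the store is at least threshold full."""
    return long_s if ratio >= threshold else short_s


def linear_interval_s(ratio: float,
                      min_s: float = 1.0,
                      max_s: float = 5.0) -> float:
    """Linear pacing: the interval rises from min_s (empty) to max_s (full)."""
    clamped = min(1.0, max(0.0, ratio))
    return min_s + (max_s - min_s) * clamped
```

Equivalently, the request rate (the reciprocal of the interval) falls as the fill ratio rises, which keeps the store from overflowing when it is nearly full and from running empty when it is nearly drained.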

また、出力状況を示す情報に応じて字幕要求情報を送信するということは、出力状況を示す情報に応じた量の字幕情報の送信を要求する情報を送信することであってもよい。例えば、要求送信部１０８は、上記において、要求情報を送信するタイミングを遅くする代わりに、字幕情報の送信量を減らすような要求を送信すればよい。 Further, transmitting the subtitle request information according to the information indicating the output status may be transmitting information requesting transmission of the subtitle information in an amount corresponding to the information indicating the output status. For example, in the above description, the request transmission unit 108 may transmit a request to reduce the transmission amount of caption information, instead of delaying the timing of transmitting the request information.

なお、ここでの出力状況を示す情報に応じて、字幕要求情報を送信するということは、出力状況に応じて、字幕要求情報を送信しないことも含む概念である。また、出力状況に応じて、字幕情報の送信を停止（一時停止も含む）を指示する字幕要求情報を送信しても良い。 Here, transmitting the subtitle request information according to the information indicating the output status is a concept including not transmitting the subtitle request information according to the output status. Further, subtitle request information for instructing to stop transmission (including pause) of subtitle information may be transmitted according to the output status.
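
Adjusting the requested amount, including the case where no request is sent at all, can be sketched as below. The function name `seconds_to_request` and the 5-second chunk size are assumptions; amounts are expressed as seconds of caption output.

```python
def seconds_to_request(stored_s: float, max_s: float,
                       chunk_s: float = 5.0) -> float:
    """Return how many seconds of captions to request next.

    The request never exceeds the free space in the store; a result of 0.0
    means no caption request information should be sent for now.
    """
    free_s = max(0.0, max_s - stored_s)
    return min(chunk_s, free_s)
```

A caller would skip the request when the result is 0.0 and otherwise place the result in the amount field of the caption request information.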

なお、要求送信部１０８は、映像識別情報を字幕要求情報として送信しても良い。また、映像識別情報の代わりに、映像識別情報が示す映像情報に対応する字幕情報の群を識別する識別情報を用いるようにしてもよい。かかることは、以下においても同様である。このような字幕群識別情報は、例えば、後述する制御情報から取得可能である。 Note that the request transmission unit 108 may transmit the video identification information as caption request information. Further, instead of the video identification information, identification information for identifying a group of caption information corresponding to the video information indicated by the video identification information may be used. The same applies to the following. Such caption group identification information can be acquired from, for example, control information described later.

要求送信部１０８は、例えば、後述する制御情報が示す字幕サーバ装置４に字幕要求情報を送信する。 For example, the request transmission unit 108 transmits the caption request information to the caption server device 4 indicated by control information described later.

要求送信部１０８は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。要求送信部１０８は、通信デバイスを含むと考えても、含まないと考えても良い。 The request transmission unit 108 is usually realized by a wireless or wired communication unit, but may be realized by a broadcasting unit. The request transmission unit 108 may or may not include a communication device.

指定送信部１０９は、映像受信部１０１が受信した映像情報の種類に応じて、字幕タイミング情報と対応付けられた字幕情報及び字幕タイミング情報と対応付けられていない字幕情報のいずれか一方の送信を指定する情報（以下、字幕指定情報と称す）を、字幕サーバ装置４に送信する。映像情報の種類は、映像受信部１０１が受信した映像情報の種類を示す情報から判断しても良い。また、後述する制御情報受信部１１２が受信した制御情報が有する映像情報の種類を示す情報から取得しても良い。字幕指定情報は、映像識別情報を有していても良い。 The designated transmission unit 109 transmits one of the subtitle information associated with the subtitle timing information and the subtitle information not associated with the subtitle timing information according to the type of the video information received by the video reception unit 101. Information to be designated (hereinafter referred to as caption designation information) is transmitted to the caption server device 4. The type of video information may be determined from information indicating the type of video information received by the video receiver 101. Moreover, you may acquire from the information which shows the kind of video information which the control information which the control information receiving part 112 mentioned later receives has. The caption designation information may include video identification information.

指定送信部１０９は、映像情報の種類を示す情報が、予め指定された種類を示す情報であるか否かを判断し、予め指定された情報であるか否かに応じて、異なる字幕指定情報を送信する。指定送信部１０９は、例えば、映像情報受信部が受信した映像情報の種類を示す情報が、ライブ（生）の映像情報であることを示す情報である場合に、字幕タイミング情報と対応付けられていない字幕情報の送信を指定する字幕指定情報を送信し、ライブの映像情報でないことを示す情報である場合（例えば、録画された映像情報であることを示す情報である場合）には、字幕タイミング情報と対応付けられた字幕情報の送信を指定する字幕指定情報を送信する。 The designated transmission unit 109 determines whether or not the information indicating the type of video information indicates a type specified in advance, and transmits different caption designation information depending on the result. For example, when the information indicating the type of video information received by the video information receiving unit indicates live video information, the designated transmission unit 109 transmits caption designation information designating transmission of caption information that is not associated with caption timing information. When the information indicates that the video information is not live (for example, that it is recorded video information), the designated transmission unit 109 transmits caption designation information designating transmission of caption information that is associated with caption timing information.
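The selection performed by the designated transmission unit 109 can be sketched as a small decision function. This is a minimal illustration under assumptions: the type strings (`"live"`, `"recorded"`), the `CaptionMode` enum, and the function name are hypothetical and do not appear in the specification.

```python
from enum import Enum


class CaptionMode(Enum):
    """Hypothetical values for the caption designation information."""
    TIMED = "timed"      # caption information associated with caption timing information
    UNTIMED = "untimed"  # caption information without caption timing information


def caption_designation(video_type: str,
                        untimed_types=frozenset({"live"})) -> CaptionMode:
    """Pick the caption designation from the video type.

    Types listed in `untimed_types` (assumed here to be only "live")
    get untimed captions; every other type gets timed captions.
    """
    if video_type in untimed_types:
        return CaptionMode.UNTIMED
    return CaptionMode.TIMED


if __name__ == "__main__":
    print(caption_designation("live"))
    print(caption_designation("recorded"))
```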

指定送信部１０９は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。指定送信部１０９は、通信デバイスを含むと考えても、含まないと考えても良い。 The designated transmission unit 109 is usually realized by a wireless or wired communication unit, but may be realized by a broadcasting unit. The designated transmission unit 109 may or may not include a communication device.

映像要求情報送信部１１０は、映像要求情報を映像サーバ装置３に送信する。映像要求情報は、映像情報の送信を要求する情報である。映像要求情報は、例えば、送信を要求する映像情報の映像識別情報を有する。映像識別情報は、例えば、映像情報のファイル名や、タイトル、映像情報に割り当てられたコード等である。また、映像要求情報は、映像情報を構成するフレーム画像を指定する情報を有していても良い。例えば、映像要求情報は、映像受信部１０１が直前に受信したフレーム画像の次以降の出力順番の１以上のフレーム画像を指定する情報を有していてもよい。また、映像要求情報は、送信する映像情報の量を指定する情報等を有していても良い。 The video request information transmission unit 110 transmits the video request information to the video server device 3. The video request information is information for requesting transmission of video information. The video request information includes, for example, video identification information of video information for which transmission is requested. The video identification information is, for example, a file name of video information, a title, a code assigned to the video information, and the like. Further, the video request information may include information for designating frame images constituting the video information. For example, the video request information may include information for designating one or more frame images in the output order subsequent to the frame image received by the video receiving unit 101 immediately before. Further, the video request information may include information specifying the amount of video information to be transmitted.

映像要求情報送信部１１０は、例えば、後述する制御情報が示す映像サーバ装置３に映像要求情報を送信する。 For example, the video request information transmission unit 110 transmits the video request information to the video server device 3 indicated by control information described later.

映像要求情報送信部１１０は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。映像要求情報送信部１１０は、通信デバイスを含むと考えても、含まないと考えても良い。 The video request information transmitting unit 110 is usually realized by a wireless or wired communication unit, but may be realized by a broadcasting unit. The video request information transmission unit 110 may or may not include a communication device.

制御要求情報送信部１１１は、制御要求情報を、制御サーバ装置２に送信する。制御要求情報は、制御情報の送信を要求する情報である。制御情報は、映像情報及び字幕情報の受信に関する制御に用いられる情報である。制御要求情報は、例えば、映像識別情報を有している。この映像識別情報は、例えば、上述した映像要求情報が有する映像識別情報と同じ映像情報を示す映像識別情報である。これにより、例えば、ユーザが視聴しようとする映像情報に対応する制御情報の送信を要求することができる。なお、映像識別情報を制御要求情報として用いてもよい。例えば、制御要求情報送信部１１１は、図示しない受付部等を介してユーザから受け付けた、ユーザが視聴しようとする映像情報を示す映像識別情報を有する制御要求情報を送信する。 The control request information transmission unit 111 transmits control request information to the control server device 2. The control request information is information for requesting transmission of control information. The control information is information used for control related to reception of video information and caption information. The control request information includes, for example, video identification information. This video identification information is, for example, video identification information indicating the same video information as the video identification information included in the above-described video request information. Thereby, for example, it is possible to request transmission of control information corresponding to video information that the user intends to view. Note that the video identification information may be used as control request information. For example, the control request information transmission unit 111 transmits control request information having video identification information indicating video information that the user intends to view and received from the user via a reception unit (not shown) or the like.

制御情報は、例えば、映像情報及び字幕情報を送信する映像サーバ装置３及び字幕サーバ装置４を指定する情報を有する。映像サーバ装置３及び字幕サーバ装置４を指定する情報は、例えば、映像サーバ装置３及び字幕サーバ装置４のＵＲＬやＩＰアドレス等のアドレス情報である。制御情報は、更に、手話映像情報を送信する手話映像サーバ装置５を指定する情報を有していても良い。制御情報は、更に、映像情報や字幕情報が格納されているディレクトリを指定する情報や、映像識別情報や字幕情報の群の識別情報等を有していても良い。手話映像サーバ装置５を指定する情報は、手話映像サーバ装置５のＵＲＬやＩＰアドレス等のアドレス情報である。制御情報は、更に、映像情報の種類を示す情報を有していても良い。また、映像情報や、字幕情報や、手話映像情報を出力する際の配置場所を示す配置情報等を有していてもよい。例えば、映像情報と、字幕情報と、手話映像情報とを、一の画面に合成して表示する際の、それぞれの配置を示す配置情報を有していても良い。配置情報は、例えば、１または２以上の座標や、サイズ等の情報で構成される。 The control information includes, for example, information that specifies the video server device 3 and the caption server device 4 that transmit the video information and the caption information. The information specifying the video server device 3 and the caption server device 4 is, for example, address information such as the URL or IP address of the video server device 3 and the caption server device 4. The control information may further include information specifying the sign language video server device 5 that transmits the sign language video information. The control information may further include information specifying a directory in which video information and caption information are stored, video identification information, identification information of a group of caption information, and the like. The information specifying the sign language video server device 5 is address information such as the URL or IP address of the sign language video server device 5. The control information may further include information indicating the type of video information. It may also include arrangement information indicating where the video information, the caption information, and the sign language video information are placed when they are output. For example, it may include arrangement information indicating the respective placement of the video information, the caption information, and the sign language video information when they are combined and displayed on one screen. The arrangement information consists of, for example, one or more coordinates, a size, and the like.

制御情報のデータ形式等は問わない。例えば、制御情報は、ＸＨＴＭＬやＸＭＬで記述されたデータである。 The data format and the like of the control information are not limited. For example, the control information is data described in XHTML or XML.
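As an illustration of control information written in XML, the sketch below parses a sample document with Python's standard `xml.etree.ElementTree`. The specification does not define a schema, so every element name, attribute name, and URL here (`control`, `videoServer`, `captionServer`, `signLanguageServer`, `region`, the `example.com` addresses) is an assumption made for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical control information; the schema is invented for this example.
SAMPLE_CONTROL_XML = """
<control videoId="lecture-001" videoType="recorded">
  <videoServer url="http://video.example.com/media/"/>
  <captionServer url="http://caption.example.com/subs/" group="lecture-001-captions"/>
  <signLanguageServer url="http://sign.example.com/media/"/>
  <layout>
    <region target="video" x="0" y="0" width="1280" height="720"/>
    <region target="caption" x="0" y="720" width="1280" height="120"/>
    <region target="signLanguage" x="960" y="480" width="320" height="240"/>
  </layout>
</control>
"""


def parse_control_info(xml_text: str) -> dict:
    """Extract server addresses, video type, and layout from control XML."""
    root = ET.fromstring(xml_text.strip())
    sign = root.find("signLanguageServer")  # optional element
    layout = {
        region.get("target"): tuple(
            int(region.get(key)) for key in ("x", "y", "width", "height")
        )
        for region in root.findall("layout/region")
    }
    return {
        "video_id": root.get("videoId"),
        "video_type": root.get("videoType"),
        "video_server": root.find("videoServer").get("url"),
        "caption_server": root.find("captionServer").get("url"),
        "caption_group": root.find("captionServer").get("group"),
        "sign_language_server": sign.get("url") if sign is not None else None,
        "layout": layout,
    }


if __name__ == "__main__":
    print(parse_control_info(SAMPLE_CONTROL_XML))
```

A receiver could use the parsed addresses to decide where to send the video and caption requests, and the layout entries to place each element on the screen.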

制御要求情報送信部１１１は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。制御要求情報送信部１１１は、通信デバイスを含むと考えても、含まないと考えても良い。 The control request information transmitting unit 111 is usually realized by a wireless or wired communication unit, but may be realized by a broadcasting unit. The control request information transmission unit 111 may or may not include a communication device.

制御情報受信部１１２は、上述した制御要求情報送信部１１１が送信した制御要求情報に応じて、制御サーバ装置２から送信される制御情報を受信する。制御情報受信部１１２は、受信した制御情報を、図示しない記憶媒体等に蓄積する。 The control information receiving unit 112 receives control information transmitted from the control server device 2 in accordance with the control request information transmitted by the control request information transmitting unit 111 described above. The control information receiving unit 112 accumulates the received control information in a storage medium (not shown).

制御情報受信部１１２は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段で実現されても良い。制御情報受信部１１２は、通信デバイスを含むと考えても、含まないと考えても良い。 The control information receiving unit 112 is usually realized by a wireless or wired communication means, but may be realized by a means for receiving a broadcast. The control information receiving unit 112 may or may not include a communication device.

出力部１１３は、映像受信部１０１が受信した映像情報を出力する。映像情報が更に音声情報を含む場合、この音声情報を更に出力しても良い。ここでの出力とは、ディスプレイへの表示、プロジェクターを用いた投影、外部の装置への送信、記録媒体への蓄積、他の処理装置や他のプログラムなどへの処理結果の引渡しなどを含む概念である。音声を出力するスピーカー等の出力デバイスを更に備えていても良い。 The output unit 113 outputs the video information received by the video receiving unit 101. When the video information further includes audio information, the audio information may also be output. Output here is a concept that includes display on a display, projection using a projector, transmission to an external device, storage in a recording medium, and delivery of a processing result to another processing device or another program. The output unit 113 may further include an output device such as a speaker that outputs audio.

出力部１１３は、例えば、映像情報を構成するフレーム画像を、フレーム画像に対応付けられた出力順番に従って、予め指定されたフレームレート等で出力する。また、出力部１１３は、例えば、映像情報を構成する各フレーム画像を、各フレーム画像に対応付けられた映像タイミング情報が示すタイミングで出力してもよい。例えば、出力部１１３は、例えば、映像情報を構成する各フレーム画像を、各フレーム画像に対応付けられたタイムコードが示すタイミングで出力する。 For example, the output unit 113 outputs the frame images constituting the video information at a frame rate specified in advance in accordance with the output order associated with the frame images. For example, the output unit 113 may output each frame image constituting the video information at a timing indicated by video timing information associated with each frame image. For example, the output unit 113 outputs, for example, each frame image forming the video information at a timing indicated by a time code associated with each frame image.

出力部１１３は、字幕受信部１０２が受信した１以上の字幕情報を、各字幕情報と対応付けられた字幕タイミング情報を用いて、映像受信部１０１が受信した映像情報と合成して出力する。出力部１１３は、例えば、字幕受信部１０２が受信した１以上の字幕情報を、各字幕情報と対応付けられた字幕タイミング情報が示すタイミングで、映像受信部１０１が受信した映像情報と合成して出力する。例えば、出力部１１３は、映像情報の出力に応じて、映像情報の出力箇所を示す情報を取得し、出力箇所を示す情報に対応する字幕タイミング情報と対応付けられた字幕情報を出力する。ここでの出力箇所とは、例えば、時間軸上の箇所である。ここでの出力箇所とは、例えば、映像情報の出力開始時点からの経過時間やフレーム数で表した現在の出力箇所である。ここでの出力箇所とは、出力されるタイミングと考えてもよい。出力部１１３は、例えば、映像情報の出力開始からの経過時間の計測を開始して、次に出力する映像情報（具体的には、次に出力するフレーム画像）が出力される経過時間を取得し、この経過時間と一致する時間を示す文字タイミング情報と対応付けられた字幕情報を、字幕受信部１０２が受信した字幕情報から取得する。そして、出力部１１３は、取得した字幕情報と映像情報とを合成して出力する。また、例えば、出力部１１３は、映像情報の出力開始から出力したフレーム数のカウントを開始して、次に出力する画像フレームのフレーム数をカウント結果から取得し、この画像フレームと一致するフレーム数を示す文字タイミング情報と対応付けられた字幕情報を、字幕受信部１０２が受信した字幕情報から取得する。そして、出力部１１３は、取得した字幕情報と映像情報とを合成して出力するようにしてもよい。なお、出力部１１３は、適宜、フレーム数を時刻を示す情報に変換したり時刻を示す情報をフレーム数に変換しても良い。 The output unit 113 combines the one or more pieces of subtitle information received by the subtitle reception unit 102 with the video information received by the video reception unit 101 using subtitle timing information associated with each piece of subtitle information, and outputs the combined information. For example, the output unit 113 combines one or more pieces of subtitle information received by the subtitle receiving unit 102 with the video information received by the video receiving unit 101 at a timing indicated by subtitle timing information associated with each subtitle information. Output. For example, the output unit 113 acquires information indicating the output location of the video information in response to the output of the video information, and outputs subtitle information associated with the subtitle timing information corresponding to the information indicating the output location. The output location here is, for example, a location on the time axis. Here, the output location is, for example, the current output location represented by the elapsed time or the number of frames from the output start time of the video information. 
The output location here may also be regarded as the output timing. For example, the output unit 113 starts measuring the elapsed time from the start of video information output, obtains the elapsed time at which the video information to be output next (specifically, the frame image to be output next) will be output, and acquires, from the caption information received by the caption receiving unit 102, the caption information associated with character timing information indicating a time that matches this elapsed time. The output unit 113 then combines the acquired caption information with the video information and outputs the result. As another example, the output unit 113 may start counting the frames output since the start of video information output, obtain from the count the frame number of the image frame to be output next, acquire from the caption information received by the caption receiving unit 102 the caption information associated with character timing information indicating a frame number that matches this image frame, and combine the acquired caption information with the video information for output. The output unit 113 may convert a frame number into information indicating a time, or information indicating a time into a frame number, as appropriate.
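The frame-count lookup and the conversion between frame numbers and times described above can be sketched as follows. This is a minimal illustration, assuming a constant frame rate and caption start times given in seconds from the start of output; the `Caption` class and the function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Caption:
    start: float  # seconds from the start of video output (assumed unit)
    text: str


def frame_to_seconds(frame_index: int, fps: float) -> float:
    """Convert a frame number into elapsed time, assuming a constant frame rate."""
    return frame_index / fps


def seconds_to_frame(seconds: float, fps: float) -> int:
    """Convert elapsed time into the nearest frame number."""
    return round(seconds * fps)


def captions_for_frame(frame_index: int, fps: float, captions) -> list:
    """Return the texts of captions whose start time falls on the given frame."""
    return [c.text for c in captions
            if seconds_to_frame(c.start, fps) == frame_index]


if __name__ == "__main__":
    received = [Caption(1.0, "Hello"), Caption(2.5, "World")]
    for frame in (29, 30, 75):
        print(frame, frame_to_seconds(frame, 30), captions_for_frame(frame, 30, received))
```

Matching on frame numbers rather than raw times avoids missing a caption whose start time falls between two frame times.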

また、出力部１１３は、例えば、映像情報の一のフレーム画像を出力する際に、出力するフレーム画像と対応付けられた映像タイミング情報を指定する字幕タイミング情報と対応付けられた字幕情報を出力するようにしてもよい。例えば、出力部１１３は、映像受信部１０１が受信した映像情報の一のフレーム画像を出力する際に、このフレーム画像に対応付けられた映像タイミング情報を取得し、この映像タイミング情報を指定する字幕タイミング情報と対応付けられた字幕情報を出力する。例えば、出力部１１３は、映像受信部１０１が、映像受信部１０１が受信した映像情報の一のフレーム画像を出力する際に、このフレーム画像に対応付けられた映像タイミング情報であるタイムコードを取得し、この映像タイミング情報であるタイムコードと一致するタイムコードと対応付けられた字幕情報を出力する。映像タイミング情報であるタイムコードと一致するタイムコードと対応付けられた字幕情報とは、例えば、映像タイミング情報であるタイムコードが示す時刻を示す字幕タイミング情報、または映像タイミング情報であるタイムコードが示す時刻を含む期間を示す字幕タイミング情報と対応付けられた字幕情報である。 For example, when outputting one frame image of video information, the output unit 113 outputs subtitle information associated with subtitle timing information that specifies video timing information associated with the output frame image. You may do it. For example, when the output unit 113 outputs one frame image of the video information received by the video receiving unit 101, the output unit 113 acquires the video timing information associated with the frame image and specifies the video timing information. The subtitle information associated with the timing information is output. For example, when the video receiving unit 101 outputs one frame image of the video information received by the video receiving unit 101, the output unit 113 acquires a time code that is video timing information associated with the frame image. Then, caption information associated with a time code that matches the time code that is the video timing information is output. The subtitle information associated with the time code that matches the time code that is the video timing information is, for example, the subtitle timing information that indicates the time indicated by the time code that is the video timing information, or the time code that is the video timing information. This is subtitle information associated with subtitle timing information indicating a period including time.

出力部１１３は、例えば、映像格納部１０３に格納されている映像情報と、字幕格納部１０４に格納されている字幕情報とを読み出し、合成して出力する。例えば、出力部１１３は、映像格納部１０３に格納されている映像情報を、フレーム画像ごとに読み出して出力する。例えば、出力部１１３は、映像格納部１０３に格納されている映像情報のフレーム画像を、蓄積された順番に、あるいは予めタイムコード等で指定されたタイミング（例えば、時刻）に読み出して出力する。読み出したフレーム画像は、例えば、映像格納部１０３から削除する。また、出力部１１３は、字幕情報を、字幕情報に対応付けられた字幕タイミング情報が示すタイミング（例えば、時刻）に読み出して出力する。 For example, the output unit 113 reads out the video information stored in the video storage unit 103 and the subtitle information stored in the subtitle storage unit 104, combines them, and outputs them. For example, the output unit 113 reads and outputs the video information stored in the video storage unit 103 for each frame image. For example, the output unit 113 reads out and outputs the frame images of the video information stored in the video storage unit 103 in the order in which they are stored or at a timing (for example, time) designated in advance by a time code or the like. For example, the read frame image is deleted from the video storage unit 103. Further, the output unit 113 reads and outputs the caption information at the timing (for example, time) indicated by the caption timing information associated with the caption information.

出力部１１３は、更に、手話映像受信部１０５が受信した手話映像情報の各フレーム画像を、この各フレーム画像に対応する手話タイミング情報を用いて、映像受信部１０１が受信した映像情報と合成して出力してもよい。例えば、出力部１１３は、手話映像受信部１０５が受信した手話映像情報の各フレーム画像を、この各フレーム画像に対応する手話タイミング情報が示すタイミングで、映像受信部１０１が受信した映像情報と合成して出力してもよい。例えば、出力部１１３は、映像受信部１０１が、映像受信部１０１が受信した映像情報の一のフレーム画像を出力する際に、このフレーム画像に対応付けられた映像タイミング情報であるタイムコードを取得し、このタイムコードと一致する手話タイミング情報であるタイムコードと対応付けられた手話映像情報のフレーム画像を、手話映像受信部１０５が受信した手話映像情報から取得し、映像情報と合成して出力する。 The output unit 113 may further combine each frame image of the sign language video information received by the sign language video receiving unit 105 with the video information received by the video receiving unit 101, using the sign language timing information corresponding to each frame image, and output the result. For example, the output unit 113 may combine each frame image of the sign language video information received by the sign language video receiving unit 105 with the video information received by the video receiving unit 101 at the timing indicated by the sign language timing information corresponding to that frame image, and output the result. For example, when outputting one frame image of the video information received by the video receiving unit 101, the output unit 113 acquires the time code that is the video timing information associated with that frame image, acquires, from the sign language video information received by the sign language video receiving unit 105, the frame image associated with a time code that serves as sign language timing information and matches the acquired time code, and combines that frame image with the video information for output.

また、出力部１１３は、字幕受信部１０２が受信した字幕情報が字幕タイミング情報と対応付けられていない字幕情報である場合には、字幕情報を映像受信部１０１が受信した映像情報と合成して出力する。つまり、字幕タイミング情報を用いずに、字幕情報を映像情報と合成して出力する。例えば、出力部１１３は、字幕情報を、映像情報に対する出力タイミングを考慮せずに、映像情報と合成して出力する。例えば、出力部１１３は、上述したような字幕タイミング情報を用いた字幕情報の出力されるタイミングを映像情報が出力されるタイミングと合わせる処理を行わずに字幕情報を出力する。例えば、出力部１１３は、字幕受信部１０２が受信した字幕情報を、受信した直後に、映像情報と合成して出力してもよい。あるいは、受信した字幕情報を字幕格納部１０４に蓄積し、字幕格納部１０４に蓄積されている字幕情報を、予め指定されたタイミングやトリガー等に応じて蓄積順に読み出して、例えば、読み出した時点で出力される映像情報（例えば、フレーム画像）と合成して出力しても良い。 When the caption information received by the caption receiving unit 102 is not associated with caption timing information, the output unit 113 combines the caption information with the video information received by the video receiving unit 101 and outputs the result. That is, the caption information is combined with the video information and output without using caption timing information. For example, the output unit 113 combines the caption information with the video information and outputs it without considering the output timing relative to the video information. For example, the output unit 113 outputs the caption information without performing the processing described above, which uses caption timing information to align the output timing of the caption information with the output timing of the video information. For example, the output unit 113 may combine the caption information received by the caption receiving unit 102 with the video information and output it immediately after reception. Alternatively, the received caption information may be stored in the caption storage unit 104, read out in the order of storage in response to a predesignated timing, trigger, or the like, and combined for output with, for example, the video information (for example, the frame image) being output at the time of reading.

映像情報に対する、字幕情報及び手話映像情報の少なくとも一方の合成は、例えば、どのように行われても良い。例えば、映像情報の一部の領域に重ねて字幕情報及び手話映像情報の少なくとも一方を表示しても良い。重ねる際には、字幕情報及び手話映像情報の可読性や視認性を損ねないようにすることが好ましい。例えば、字幕情報と手話映像情報をそのまま重ねて出力しても良いし、乗算モード等の予め指定された合成モードで合成を行って重ねて出力しても良い。あるいは、映像情報と、字幕情報と、手話映像情報とを、それぞれが重ならないように異なる領域に出力しても良い。通常、手話映像情報は、映像情報よりも小さいサイズの領域に配置して出力される。このことは、手話映像情報の元のサイズ（例えば、画素数）自体が、映像情報のサイズ（画素数）よりも小さいことであっても良いし、手話映像情報が映像情報よりも小さいサイズの領域に出力されるよう、出力部１１３が、手話映像情報が配置される領域のサイズを、映像情報が配置される領域のサイズに対して相対的に縮小することであっても良い。 At least one of the caption information and the sign language video information may be combined with the video information in any manner. For example, at least one of the caption information and the sign language video information may be displayed superimposed on a partial area of the video information. When superimposing, it is preferable not to impair the readability and visibility of the caption information and the sign language video information. For example, the caption information and the sign language video information may be superimposed and output as they are, or may be combined in a predesignated compositing mode such as a multiply mode and then superimposed and output. Alternatively, the video information, the caption information, and the sign language video information may be output to different areas so that they do not overlap one another. Usually, the sign language video information is placed and output in an area smaller than that of the video information. This may mean that the original size (for example, the number of pixels) of the sign language video information is itself smaller than the size (number of pixels) of the video information, or that the output unit 113 reduces the size of the area in which the sign language video information is placed relative to the size of the area in which the video information is placed, so that the sign language video information is output in a smaller area than the video information.
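Superimposing an overlay on a partial area of a frame in a multiply mode can be sketched as below. This is an illustration only: it assumes frames are row-major lists of 8-bit RGB tuples and uses the common multiply formula (channel product divided by 255), which the specification does not spell out. The function names are hypothetical.

```python
def multiply_pixel(base, overlay):
    """Multiply blend of two 8-bit RGB pixels (assumed formula: b * o // 255)."""
    return tuple(b * o // 255 for b, o in zip(base, overlay))


def composite(frame, overlay, x0, y0, blend=multiply_pixel):
    """Blend `overlay` onto a copy of `frame` with its top-left corner at (x0, y0).

    Both images are row-major lists of rows of RGB tuples. Overlay pixels
    that fall outside the frame are ignored.
    """
    out = [row[:] for row in frame]  # copy so the input frame is not modified
    for dy, overlay_row in enumerate(overlay):
        for dx, pixel in enumerate(overlay_row):
            y, x = y0 + dy, x0 + dx
            if 0 <= y < len(out) and 0 <= x < len(out[y]):
                out[y][x] = blend(out[y][x], pixel)
    return out


if __name__ == "__main__":
    white = (255, 255, 255)
    frame = [[white, white], [white, white]]
    caption_patch = [[(0, 0, 0)]]  # a single black "caption" pixel
    print(composite(frame, caption_patch, 1, 1))
```

Passing a different `blend` function (for example, one that simply returns the overlay pixel) gives plain superimposition instead of multiply compositing.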

映像情報と、字幕情報とを出力する際のそれぞれの配置や合成モード等はどのように決定しても良い。また、手話映像情報を更に出力する際の、配置や合成モード等はどのように決定しても良い。例えば、映像情報と、字幕情報と、手話映像情報とがそれぞれ配置される配置領域や合成モード等が、予めデフォルト等で指定されていても良い。また、上述した制御情報受信部１１２が受信した制御情報に含まれている配置を示す情報を用いて、映像情報と、字幕情報と、手話映像情報との配置を設定しても良い。 Each arrangement, composition mode, and the like when outputting video information and caption information may be determined in any way. In addition, the arrangement, composition mode, and the like when further outputting the sign language video information may be determined. For example, an arrangement area where the video information, subtitle information, and sign language video information are arranged, a composition mode, and the like may be designated in advance by default or the like. Further, the arrangement of the video information, the subtitle information, and the sign language video information may be set using information indicating the arrangement included in the control information received by the control information receiving unit 112 described above.

なお、出力部１１３は、字幕情報及び手話映像情報の少なくとも一方の、出力、非出力を、ユーザ等の指示に応じて切り替えられるようにしても良い。 Note that the output unit 113 may switch between output and non-output of at least one of caption information and sign language video information in accordance with an instruction from a user or the like.

なお、字幕情報の出力されている期間は、どのように決定されても良い。例えば、字幕情報に対応付けられた字幕タイミング情報が、字幕が表示される期間を指定する情報である場合には、出力部１１３は、例えば、この期間の開始から終了まで、字幕情報を表示する。また、字幕タイミング情報に、字幕情報の出力の開始のタイミングを指定する情報だけが含まれている場合、予め指定された経過時間だけ字幕情報を、映像情報と合成して出力しても良いし、あるいは、次の出力順番の字幕情報が出力されるタイミングまで、字幕情報を出力しても良い。 Note that the period during which the caption information is output may be determined in any manner. For example, when the subtitle timing information associated with the subtitle information is information specifying a period during which the subtitle is displayed, the output unit 113 displays the subtitle information from the start to the end of this period, for example. . In addition, when the subtitle timing information includes only information specifying the start timing of subtitle information output, the subtitle information may be combined with the video information and output for a predetermined elapsed time. Alternatively, the subtitle information may be output until the subtitle information in the next output order is output.
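The three ways of deciding the display period described above (an explicit period, a fixed elapsed time, or holding until the next caption) can be sketched as follows. This is a minimal illustration, assuming cues are sorted by start time and given as `(start, end_or_None, text)` tuples in seconds; the function names and the 3-second default are hypothetical.

```python
def display_periods(cues, default_duration=3.0, hold_until_next=False):
    """Resolve each cue to a (start, end, text) display period.

    cues: list of (start, end_or_None, text), sorted by start time.
    An explicit end is used as is. When the end is None, the caption is
    shown until the next cue starts (if hold_until_next is set and a next
    cue exists) or for default_duration seconds otherwise.
    """
    periods = []
    for i, (start, end, text) in enumerate(cues):
        if end is None:
            if hold_until_next and i + 1 < len(cues):
                end = cues[i + 1][0]
            else:
                end = start + default_duration
        periods.append((start, end, text))
    return periods


def active_captions(periods, t):
    """Texts of the captions to display at elapsed time t."""
    return [text for start, end, text in periods if start <= t < end]


if __name__ == "__main__":
    cues = [(0.0, 2.0, "A"), (5.0, None, "B"), (9.0, None, "C")]
    print(display_periods(cues))
    print(display_periods(cues, hold_until_next=True))
```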

出力部１１３は、ディスプレイ等の出力デバイスを含むと考えても含まないと考えても良い。出力部１１３は、出力デバイスのドライバーソフトまたは、出力デバイスのドライバーソフトと出力デバイス等で実現され得る。 The output unit 113 may be considered as including or not including an output device such as a display. The output unit 113 can be realized by output device driver software, or output device driver software and an output device.

なお、映像出力装置１は、図示しない手話映像要求情報送信部を有していても良い。手話映像要求情報送信部は、手話映像要求情報を手話映像サーバ装置５に送信する。手話映像要求情報は、手話映像情報の送信を要求する情報である。手話映像要求情報は、手話映像情報が対応する映像情報の映像識別情報を有していても良い。手話映像要求情報送信部は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。手話映像要求情報送信部は、通信デバイスを含むと考えても、含まないと考えても良い。 Note that the video output device 1 may include a sign language video request information transmission unit (not shown). The sign language video request information transmission unit transmits the sign language video request information to the sign language video server device 5. The sign language video request information is information requesting transmission of sign language video information. The sign language image request information may include image identification information of image information corresponding to the sign language image information. The sign language image request information transmitting unit is usually realized by a wireless or wired communication means, but may be realized by a broadcasting means. The sign language video request information transmission unit may or may not include a communication device.

制御サーバ装置２は、映像出力装置１に制御情報を送信するサーバ装置である。制御サーバ装置２は、例えば、コンピュータ等で実現可能である。 The control server device 2 is a server device that transmits control information to the video output device 1. The control server device 2 can be realized by a computer or the like, for example.

制御情報格納部２０１には、１または２以上の制御情報が格納される。制御情報は、映像識別情報と対応付けられて制御情報格納部２０１に格納されていても良い。一の映像識別情報と対応付けられた制御情報は、例えば、この映像識別情報が示す映像情報に関する情報を有している。一の映像識別情報と対応付けられた制御情報は、例えば、この映像識別情報が示す映像情報を送信する映像サーバ装置３、及びこの映像情報に対応する字幕情報を送信する字幕サーバ装置４を指定する情報を有する。この制御情報は、更に、この映像情報に対応する手話映像情報を送信する手話映像サーバ装置５を指定する情報を有していても良い。また、この制御情報は、更に、一の映像識別情報が示す映像情報や字幕情報が格納されているディレクトリを指定する情報や、一の映像識別情報が示す映像情報の種類を示す情報や、一の映像識別情報が示す映像情報やこれに対応する字幕情報や手話映像情報を出力する際の配置場所を示す配置情報等を有していてもよい。 The control information storage unit 201 stores one or more pieces of control information. The control information may be stored in the control information storage unit 201 in association with video identification information. The control information associated with one piece of video identification information includes, for example, information regarding the video information indicated by that video identification information. The control information associated with one piece of video identification information includes, for example, information specifying the video server device 3 that transmits the video information indicated by that video identification information and the caption server device 4 that transmits the caption information corresponding to that video information. The control information may further include information specifying the sign language video server device 5 that transmits the sign language video information corresponding to that video information. The control information may further include information specifying a directory in which the video information indicated by the video identification information and its caption information are stored, information indicating the type of the video information indicated by the video identification information, and arrangement information indicating where the video information indicated by the video identification information and the corresponding caption information and sign language video information are placed when they are output.

制御サーバ側受信部２０２は、映像出力装置１の制御要求情報送信部１１１から送信される制御要求情報を受信する。制御サーバ側受信部２０２は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段で実現されても良い。制御サーバ側受信部２０２は、通信デバイスを含むと考えても、含まないと考えても良い。 The control server side receiving unit 202 receives the control request information transmitted from the control request information transmitting unit 111 of the video output device 1. The control server side receiving unit 202 is usually realized by a wireless or wired communication means, but may be realized by a means for receiving a broadcast. The control server side receiving unit 202 may or may not include a communication device.

制御サーバ側送信部２０３は、制御情報格納部２０１に格納されている一の制御情報を読み出して、映像出力装置１に送信する。例えば、制御サーバ側送信部２０３は、制御サーバ側受信部２０２が受信した制御要求情報に対応した制御情報を読み出して、送信する。制御サーバ側送信部２０３は、例えば、制御要求情報が有する映像識別情報と一致する映像情報と対応付けて制御情報格納部２０１に格納されている制御情報を読み出して、制御要求情報の送信元となる映像出力装置１に送信する。制御サーバ側送信部２０３は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。制御サーバ側送信部２０３は、通信デバイスを含むと考えても、含まないと考えても良い。 The control server side transmission unit 203 reads out one piece of control information stored in the control information storage unit 201 and transmits it to the video output device 1. For example, the control server side transmission unit 203 reads out and transmits the control information corresponding to the control request information received by the control server side receiving unit 202. For example, the control server side transmission unit 203 reads out the control information stored in the control information storage unit 201 in association with the video information that matches the video identification information included in the control request information, and transmits it to the video output device 1 that is the transmission source of the control request information. The control server side transmission unit 203 is usually realized by a wireless or wired communication means, but may be realized by a broadcasting means. The control server side transmission unit 203 may or may not include a communication device.

映像サーバ装置３は、映像出力装置１に映像情報を送信するサーバ装置である。映像サーバ装置３は、例えば、コンピュータ等で実現可能である。 The video server device 3 is a server device that transmits video information to the video output device 1. The video server device 3 can be realized by, for example, a computer.

映像サーバ側格納部３０１には、１または２以上の映像情報が格納される。例えば、映像サーバ側格納部３０１には、１または２以上の映像情報が、映像識別情報と対応付けて格納される。映像情報は、映像情報の種類を示す情報と対応付けて格納されていても良い。 The video server side storage unit 301 stores one or more pieces of video information. For example, the video server side storage unit 301 stores one or more pieces of video information in association with the video identification information. Video information may be stored in association with information indicating the type of video information.

映像サーバ側受信部３０２は、映像出力装置１の映像要求情報送信部１１０から送信される映像要求情報を受信する。映像サーバ側受信部３０２は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段で実現されても良い。映像サーバ側受信部３０２は、通信デバイスを含むと考えても、含まないと考えても良い。 The video server side receiving unit 302 receives the video request information transmitted from the video request information transmitting unit 110 of the video output device 1. The video server side receiving unit 302 is usually realized by a wireless or wired communication means, but may be realized by means for receiving a broadcast. The video server side receiving unit 302 may or may not include a communication device.

映像サーバ側送信部３０３は、映像サーバ側格納部３０１に格納されている映像情報を読み出して、映像要求情報の送信元となる映像出力装置１に送信する。映像サーバ側送信部３０３は、例えば、映像要求情報に含まれる映像識別情報と一致する映像識別情報と対応付けられている映像情報を、映像サーバ側格納部３０１から読み出して送信する。また、映像サーバ側送信部３０３は、映像情報に対応付けて格納されている映像情報の種類を示す情報を、更に映像サーバ側格納部３０１から読み出して、映像出力装置１に送信してもよい。映像サーバ側送信部３０３は、いわゆるストリーミング配信等のように、一の映像情報をパケット等に分割して一定または不定のタイミング等で送信しても良いし、一の映像情報を複数に分割して送信しても良いし、一の映像情報を一度に送信しても良い。映像サーバ側送信部３０３は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。映像サーバ側送信部３０３は、通信デバイスを含むと考えても、含まないと考えても良い。 The video server side transmission unit 303 reads the video information stored in the video server side storage unit 301 and transmits it to the video output device 1 that is the transmission source of the video request information. For example, the video server side transmission unit 303 reads, from the video server side storage unit 301, the video information associated with the video identification information that matches the video identification information included in the video request information, and transmits it. The video server side transmission unit 303 may further read, from the video server side storage unit 301, information indicating the type of video information stored in association with the video information, and transmit that information to the video output device 1. The video server side transmission unit 303 may divide one piece of video information into packets or the like and transmit them at fixed or variable timing, as in so-called streaming distribution; it may divide one piece of video information into a plurality of pieces and transmit them; or it may transmit one piece of video information all at once. The video server side transmission unit 303 is usually realized by a wireless or wired communication means, but may be realized by a broadcasting means. The video server side transmission unit 303 may or may not include a communication device.
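
The division of one piece of video information into packets can be illustrated with a minimal sketch. The function name `packetize` and the fixed `packet_size` are assumptions made for illustration only; the description does not prescribe a packet format or size.

```python
from typing import Iterator


def packetize(video: bytes, packet_size: int) -> Iterator[bytes]:
    """Yield consecutive slices of `video`, each at most `packet_size` bytes.

    Hypothetical helper: a real server would also attach headers,
    timing information, and so on.
    """
    if packet_size <= 0:
        raise ValueError("packet_size must be positive")
    for offset in range(0, len(video), packet_size):
        yield video[offset:offset + packet_size]
```

Passing a `packet_size` equal to or larger than the length of the data corresponds to transmitting one piece of video information at a time.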

字幕サーバ装置４は、字幕要求情報に応じて字幕情報を送信するサーバ装置である。字幕サーバ装置４は、例えば、コンピュータ等で実現可能である。 The caption server device 4 is a server device that transmits caption information according to caption request information. The caption server device 4 can be realized by, for example, a computer.

字幕サーバ側格納部４０１には、１または２以上の字幕情報と、これらの各字幕情報を出力するタイミングを示す字幕タイミング情報とが対応付けて格納されている。字幕情報は、更に、映像識別情報と対応付けられて字幕サーバ側格納部４０１に格納されていても良い。なお、字幕サーバ側格納部４０１には、字幕タイミング情報と対応付けられていない１または２以上の字幕情報が格納されていても良い。 The subtitle server side storage unit 401 stores one or more pieces of subtitle information and subtitle timing information indicating the timing for outputting each piece of subtitle information in association with each other. The caption information may be further stored in the caption server side storage unit 401 in association with the video identification information. Note that the subtitle server side storage unit 401 may store one or more pieces of subtitle information that are not associated with subtitle timing information.
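
One possible in-memory layout for the subtitle server side storage unit 401 is sketched below. The class names `CaptionEntry` and `CaptionStore`, and the use of seconds for the timing fields, are assumptions for illustration; the description only requires that subtitle information may be associated with subtitle timing information and with video identification information.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class CaptionEntry:
    text: str                      # the caption character string
    start: Optional[float] = None  # assumed unit: seconds; None means no timing
    end: Optional[float] = None    # None when only a start time (or nothing) is known

    @property
    def has_timing(self) -> bool:
        return self.start is not None


class CaptionStore:
    """Hypothetical store mapping video identification info to captions."""

    def __init__(self) -> None:
        self._by_video: Dict[str, List[CaptionEntry]] = {}

    def add(self, video_id: str, entry: CaptionEntry) -> None:
        self._by_video.setdefault(video_id, []).append(entry)

    def for_video(self, video_id: str) -> List[CaptionEntry]:
        # Return a copy so callers cannot mutate the stored list.
        return list(self._by_video.get(video_id, []))
```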

字幕サーバ側受信部４０２は、映像出力装置１から送信される字幕要求情報を受信する。例えば、字幕サーバ側受信部４０２は、映像識別情報を有する字幕要求情報を受信する。 The subtitle server side receiving unit 402 receives subtitle request information transmitted from the video output device 1. For example, the caption server side receiving unit 402 receives caption request information having video identification information.

また、字幕サーバ側受信部４０２は、映像出力装置１から送信される字幕指定情報を受信する。 The subtitle server side receiving unit 402 receives subtitle designation information transmitted from the video output device 1.

字幕サーバ側受信部４０２は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段で実現されても良い。字幕サーバ側受信部４０２は、通信デバイスを含むと考えても、含まないと考えても良い。 The subtitle server side receiving unit 402 is normally realized by a wireless or wired communication means, but may be realized by a means for receiving a broadcast. The subtitle server side receiving unit 402 may or may not include a communication device.

字幕サーバ側送信部４０３は、字幕サーバ側格納部４０１に格納されている字幕情報を読み出して映像出力装置１に送信する。字幕サーバ側送信部４０３は、例えば、字幕要求情報が有する映像識別情報と対応付けられた字幕情報を、字幕サーバ側格納部４０１から読み出して送信する。字幕サーバ側送信部４０３は、一の字幕要求情報の受信に応じて、一の映像情報に対応する全ての字幕情報を映像出力装置１に送信しても良いし、一の字幕要求情報を受信するごとに、一の映像情報に対応する予め指定された量の字幕情報、例えば、一の映像情報に対応する１または２以上の字幕情報を、出力順等に従って送信しても良い。また、一の字幕要求情報に応じて送信する字幕情報の量は、字幕要求情報が有する字幕情報の送信量を指定する情報や、送信量の増減を指定する情報に応じて決定される量としてもよい。 The subtitle server side transmission unit 403 reads the subtitle information stored in the subtitle server side storage unit 401 and transmits it to the video output device 1. For example, the subtitle server side transmission unit 403 reads, from the subtitle server side storage unit 401, the subtitle information associated with the video identification information included in the subtitle request information, and transmits it. In response to receiving one piece of subtitle request information, the subtitle server side transmission unit 403 may transmit all of the subtitle information corresponding to one piece of video information to the video output device 1. Alternatively, each time it receives one piece of subtitle request information, it may transmit a predetermined amount of the subtitle information corresponding to one piece of video information, for example, one or more pieces of subtitle information corresponding to that video information, in output order or the like. The amount of subtitle information transmitted in response to one piece of subtitle request information may also be determined according to information in the subtitle request information that specifies the transmission amount of subtitle information, or information that specifies an increase or decrease in the transmission amount.
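
The choice between sending all subtitle information at once and sending a predetermined amount per request can be sketched as follows. The function `next_batch`, the `cursor` position, and the `amount` parameter are hypothetical names; how the server tracks what has already been sent is not specified in the description.

```python
from typing import List, Optional, Sequence, Tuple


def next_batch(
    captions: Sequence[str],
    cursor: int,
    amount: Optional[int] = None,
) -> Tuple[List[str], int]:
    """Return the captions to send for one request and the new cursor.

    `captions` is assumed to be in output order. `amount=None` sends
    everything that remains; otherwise at most `amount` captions are sent.
    """
    if amount is None:
        end = len(captions)
    else:
        end = min(cursor + max(amount, 0), len(captions))
    return list(captions[cursor:end]), end
```

A request that carries a transmission amount would simply pass that value as `amount`.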

また、字幕サーバ側送信部４０３は、字幕サーバ側受信部４０２が、字幕タイミング情報と対応付けられた字幕情報の送信を指定する字幕指定情報を受け付けた場合、字幕タイミング情報と対応付けられた字幕情報を送信する。例えば、字幕サーバ側受信部４０２が、字幕タイミング情報と対応付けられた字幕情報の送信を指定する字幕指定情報を受け付けた場合、字幕サーバ側送信部４０３は、この字幕指定情報が有する映像識別情報に対応付けられた字幕情報に対して、字幕タイミング情報と対応付けられた字幕情報を送信することを設定するための情報、例えば、いわゆるフラグ等の情報を付与する。この情報は、例えば、図示しない記憶媒体等に蓄積される。そして、この情報が付与された字幕情報を送信する際には、字幕タイミング情報と対応付けられた字幕情報を送信するようにしてもよい。 In addition, when the subtitle server side receiving unit 402 receives subtitle designation information designating transmission of subtitle information associated with subtitle timing information, the subtitle server side transmission unit 403 transmits subtitle information associated with subtitle timing information. For example, when the subtitle server side receiving unit 402 receives such subtitle designation information, the subtitle server side transmission unit 403 attaches, to the subtitle information associated with the video identification information included in the subtitle designation information, information for setting that subtitle information associated with subtitle timing information is to be transmitted, for example, a so-called flag. This information is accumulated in, for example, a storage medium (not shown). Then, when transmitting the subtitle information to which this information is attached, the subtitle information associated with the subtitle timing information may be transmitted.

また、字幕タイミング情報と対応付けられていない字幕情報の送信を指定する情報を受け付けた場合、字幕サーバ側送信部４０３は、字幕タイミング情報と対応付けられていない字幕情報を送信する。例えば、字幕サーバ側受信部４０２が、字幕タイミング情報と対応付けられていない字幕情報の送信を指定する字幕指定情報を受け付けた場合、字幕サーバ側送信部４０３は、この字幕指定情報が有する映像識別情報に対応付けられた字幕情報に対して、字幕タイミング情報と対応付けられていない字幕情報を送信することを設定するための情報、例えば、いわゆるフラグ等の情報を付与する。そして、この情報が付与された字幕情報を送信する際には、字幕タイミング情報と対応付けられていない字幕情報を送信する。字幕タイミング情報と対応付けられていない字幕情報を送信するということは、例えば、字幕情報が、字幕タイミング情報と対応付けられているか否かに関わらず、字幕情報は送信するが、字幕タイミング情報は送信しないことであっても良いし、同じ映像識別情報と対応付けられた字幕情報として、字幕タイミング情報と対応付けられた字幕情報と、字幕タイミング情報と対応付けられていない字幕情報があった場合に、字幕タイミング情報と対応付けられていない字幕情報を送信することであってもよい。 Also, when information specifying transmission of subtitle information that is not associated with subtitle timing information is received, the subtitle server side transmission unit 403 transmits subtitle information that is not associated with subtitle timing information. For example, when the subtitle server side receiving unit 402 receives subtitle designation information that designates transmission of subtitle information not associated with subtitle timing information, the subtitle server side transmission unit 403 attaches, to the subtitle information associated with the video identification information included in the subtitle designation information, information for setting that subtitle information not associated with subtitle timing information is to be transmitted, for example, a so-called flag. Then, when transmitting the subtitle information to which this information is attached, it transmits subtitle information that is not associated with subtitle timing information. Transmitting subtitle information that is not associated with subtitle timing information may mean, for example, transmitting the subtitle information but not the subtitle timing information, regardless of whether the subtitle information is associated with subtitle timing information. Alternatively, when the subtitle information associated with the same video identification information includes both subtitle information associated with subtitle timing information and subtitle information not associated with it, it may mean transmitting the subtitle information that is not associated with subtitle timing information.
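
The two readings of "transmitting subtitle information that is not associated with subtitle timing information" can be sketched as follows. The function `select_captions`, the `strip` switch, and the `(text, start)` tuple layout are assumptions for illustration and do not appear in the description.

```python
from typing import List, Optional, Tuple

# (caption text, start time in seconds or None when no timing is associated)
Caption = Tuple[str, Optional[float]]


def select_captions(
    entries: List[Caption],
    want_timing: bool,
    strip: bool = True,
) -> List[Caption]:
    """Pick what to send according to a hypothetical per-video flag.

    want_timing=True : send only captions that carry timing information.
    want_timing=False, strip=True  : send every caption, without its timing.
    want_timing=False, strip=False : send only captions stored without timing.
    """
    if want_timing:
        return [e for e in entries if e[1] is not None]
    if strip:
        return [(text, None) for text, _ in entries]
    return [e for e in entries if e[1] is None]
```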

字幕サーバ側送信部４０３は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。字幕サーバ側送信部４０３は、通信デバイスを含むと考えても、含まないと考えても良い。 The subtitle server side transmission unit 403 is usually realized by a wireless or wired communication means, but may be realized by a broadcasting means. The subtitle server side transmission unit 403 may or may not include a communication device.

手話映像サーバ装置５は、手話映像情報を送信するサーバ装置である。手話映像サーバ装置５は、例えば、コンピュータ等で実現可能である。 The sign language video server device 5 is a server device that transmits sign language video information. The sign language video server device 5 can be realized by, for example, a computer.

手話映像サーバ側格納部５０１には、１または２以上の手話映像情報が格納される。例えば、手話映像サーバ側格納部５０１には、１または２以上の手話映像情報が、映像識別情報と対応付けて格納される。 The sign language video server side storage unit 501 stores one or more pieces of sign language video information. For example, one or more sign language video information is stored in the sign language video server storage unit 501 in association with the video identification information.

手話映像サーバ側受信部５０２は、映像出力装置１の図示しない手話映像要求情報送信部から送信される手話映像要求情報を受信する。手話映像サーバ側受信部５０２は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段で実現されても良い。手話映像サーバ側受信部５０２は、通信デバイスを含むと考えても、含まないと考えても良い。 The sign language video server side receiving unit 502 receives sign language video request information transmitted from a sign language video request information transmitting unit (not shown) of the video output device 1. The sign language video server side receiving unit 502 is normally realized by a wireless or wired communication means, but may be realized by a means for receiving a broadcast. The sign language video server side reception unit 502 may or may not include a communication device.

手話映像サーバ側送信部５０３は、手話映像サーバ側格納部５０１に格納されている手話映像情報を読み出して、手話映像要求情報の送信元となる映像出力装置１に送信する。手話映像サーバ側送信部５０３は、例えば、手話映像要求情報に含まれる映像識別情報と一致する映像識別情報と対応付けられている手話映像情報を、手話映像サーバ側格納部５０１から読み出して送信する。手話映像サーバ側送信部５０３は、いわゆるストリーミング配信等のように、一の手話映像情報をパケット等に分割して一定または不定のタイミング等で送信しても良いし、一の手話映像情報を複数に分割して送信しても良いし、一の手話映像情報を一度に送信しても良い。手話映像サーバ側送信部５０３は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。手話映像サーバ側送信部５０３は、通信デバイスを含むと考えても、含まないと考えても良い。 The sign language video server side transmission unit 503 reads the sign language video information stored in the sign language video server side storage unit 501 and transmits it to the video output device 1 that is the transmission source of the sign language video request information. For example, the sign language video server side transmission unit 503 reads, from the sign language video server side storage unit 501, the sign language video information associated with the video identification information that matches the video identification information included in the sign language video request information, and transmits it. The sign language video server side transmission unit 503 may divide one piece of sign language video information into packets or the like and transmit them at fixed or variable timing, as in so-called streaming distribution; it may divide one piece of sign language video information into a plurality of pieces and transmit them; or it may transmit one piece of sign language video information all at once. The sign language video server side transmission unit 503 is usually realized by a wireless or wired communication means, but may be realized by a broadcasting means. The sign language video server side transmission unit 503 may or may not include a communication device.

次に、映像システム１０００の映像出力装置１の動作の一例について図２のフローチャートを用いて説明する。ここでは、映像受信部１０１が、映像情報として、フレーム画像が映像タイミング情報と対応付けられた受信し、この映像タイミング情報を利用して字幕情報の出力を適宜行う場合を例に挙げて説明する。また、手話映像情報を構成する各フレーム画像にも、映像タイミング情報と同様の情報（ここでは、手話映像タイミング情報と称す）が対応付けられているものとする。 Next, an example of the operation of the video output device 1 of the video system 1000 will be described with reference to the flowchart of FIG. 2. Here, an example will be described in which the video receiving unit 101 receives, as video information, frame images associated with video timing information, and subtitle information is output as appropriate using this video timing information. It is also assumed that information similar to the video timing information (here referred to as sign language video timing information) is associated with each frame image constituting the sign language video information.

（ステップＳ１０１）映像出力装置１は、図示しない受付部等を介して、一の映像情報を出力する指示を受け付けたか否かを判断する。受け付けた場合、ステップＳ１０２に進み、受け付けていない場合、ステップＳ１０１に戻る。 (Step S101) The video output device 1 determines whether or not an instruction to output one piece of video information has been received via a receiving unit (not shown) or the like. If accepted, the process proceeds to step S102. If not accepted, the process returns to step S101.

（ステップＳ１０２）格納量設定部１０６は、自装置の処理能力に関する値を取得し、取得した値に応じて映像格納部１０３と字幕格納部１０４の格納量の最大値をそれぞれ設定する。 (Step S102) The storage amount setting unit 106 acquires a value related to the processing capability of its own device, and sets the maximum storage amounts of the video storage unit 103 and the caption storage unit 104 according to the acquired value.

（ステップＳ１０３）制御要求情報送信部１１１は、制御サーバ装置２に制御要求情報を送信する。 (Step S103) The control request information transmission unit 111 transmits control request information to the control server device 2.

（ステップＳ１０４）制御情報受信部１１２は、制御情報を受信したか否かを判断する。受信した場合、ステップＳ１０５に進み、受信していない場合、ステップＳ１０４に戻る。 (Step S104) The control information receiving unit 112 determines whether or not control information has been received. If received, the process proceeds to step S105. If not received, the process returns to step S104.

（ステップＳ１０５）映像要求情報送信部１１０は、制御情報が示す映像サーバ装置３に映像要求情報を送信する。 (Step S105) The video request information transmission unit 110 transmits the video request information to the video server device 3 indicated by the control information.

（ステップＳ１０６）指定送信部１０９は、制御情報が示す映像情報の種類に応じた字幕指定情報を字幕サーバ装置４に送信する。 (Step S106) The designation transmitting unit 109 transmits caption designation information corresponding to the type of video information indicated by the control information to the caption server device 4.

（ステップＳ１０７）要求送信部１０８は、字幕要求情報を制御情報が示す字幕サーバ装置４に送信する。 (Step S107) The request transmission unit 108 transmits the subtitle request information to the subtitle server device 4 indicated by the control information.

（ステップＳ１０８）図示しない手話映像要求情報送信部は、手話映像要求情報を、制御情報が示す手話映像サーバ装置５に送信する。 (Step S108) A sign language video request information transmitting unit (not shown) transmits the sign language video request information to the sign language video server device 5 indicated by the control information.

（ステップＳ１０９）映像受信部１０１は、映像サーバ装置３から映像情報を受信したか否かを判断する。例えば、出力順にパケット単位等で分割されて送信される映像情報を構成する一以上のフレーム画像を受信したか否かを判断する。受信した場合、ステップＳ１１０に進み、受信していない場合、ステップＳ１１１に進む。 (Step S109) The video receiving unit 101 determines whether video information has been received from the video server device 3. For example, it determines whether one or more frame images constituting the video information, which is divided into packets or the like and transmitted in output order, have been received. If received, the process proceeds to step S110. If not received, the process proceeds to step S111.

（ステップＳ１１０）映像受信部１０１は、ステップＳ１０９で受信した映像情報を、映像格納部１０３に蓄積する。なお、映像受信部１０１は、例えば、ステップＳ１０９で受信した映像情報の一以上のパケットを図示しない記憶媒体等に一時記憶しておくようにし、一時記憶された一以上のパケットに対して復号化等を行うことにより、映像情報の一以上のフレーム画像を取得して、映像格納部１０３に蓄積してもよい。そして、ステップＳ１１１に進む。 (Step S110) The video receiving unit 101 accumulates the video information received in step S109 in the video storage unit 103. Note that the video receiving unit 101 temporarily stores one or more packets of the video information received in step S109 in a storage medium (not shown) or the like, and decodes the temporarily stored one or more packets. For example, one or more frame images of the video information may be acquired and stored in the video storage unit 103. Then, the process proceeds to step S111.

（ステップＳ１１１）字幕受信部１０２は、字幕サーバ装置４から、字幕情報を受信したか否かを判断する。字幕サーバ装置４から、字幕情報と、字幕タイミング情報との一以上の組が送信された場合、この組を受信したか否かを判断しても良い。受信した場合、ステップＳ１１２に進み、受信していない場合、ステップＳ１１３に進む。字幕情報と、この字幕情報と対応付けられた字幕タイミング情報との一以上の組は、例えば、上述した字幕組情報であってもよい。 (Step S111) The caption receiving unit 102 determines whether caption information has been received from the caption server device 4. When one or more sets of caption information and caption timing information are transmitted from the caption server device 4, it may determine whether such a set has been received. If received, the process proceeds to step S112. If not received, the process proceeds to step S113. The one or more sets of caption information and the caption timing information associated with it may be, for example, the caption set information described above.

（ステップＳ１１２）字幕受信部１０２は、受信した字幕情報を字幕格納部１０４に蓄積する。また、字幕情報と、字幕情報と対応付けられた字幕タイミング情報との一以上の組を受信した場合、字幕受信部１０２は、この組を、字幕格納部１０４に蓄積してもよい。そして、ステップＳ１１３に進む。 (Step S112) The caption receiving unit 102 accumulates the received caption information in the caption storage unit 104. Further, when one or more sets of subtitle information and subtitle timing information associated with the subtitle information are received, the subtitle receiving unit 102 may accumulate the sets in the subtitle storage unit 104. Then, the process proceeds to step S113.

（ステップＳ１１３）手話映像受信部１０５は、手話映像サーバ装置５から、手話映像情報を受信したか否かを判断する。例えば、パケット単位等で分割されて送信される手話映像情報を受信したか否かを判断する。受信した場合、ステップＳ１１４に進み、受信していない場合、ステップＳ１１５に進む。 (Step S113) The sign language video receiving unit 105 determines whether or not sign language video information has been received from the sign language video server device 5. For example, it is determined whether or not sign language video information divided and transmitted in packet units is received. If received, the process proceeds to step S114. If not received, the process proceeds to step S115.

（ステップＳ１１４）手話映像受信部１０５は、受信した手話映像情報を、図示しない手話映像格納部に一時記憶する。この処理は、ステップＳ１１０と同様の処理としても良い。そして、ステップＳ１１５に進む。 (Step S114) The sign language video receiving unit 105 temporarily stores the received sign language video information in a sign language video storage unit (not shown). This process may be the same process as step S110. Then, the process proceeds to step S115.

（ステップＳ１１５）出力部１１３は、映像情報を出力するタイミングであるか否かを判断する。例えば、映像格納部１０３に格納されている映像情報を構成するフレーム画像に対応付けられた映像タイミング情報のうちの、最も早い出力タイミングを示す映像タイミング情報が示すタイミングであるか否かを判断する。そして、このタイミングである場合、映像情報を出力するタイミング、ここでは、このタイミング情報と対応付けられたフレーム画像を出力するタイミングであると判断する。また、このタイミングでなければ、映像情報を出力するタイミングでないと判断する。なお、一番最初のフレーム画像を出力する際には、単に、一番最初のフレーム画像が映像格納部１０３に格納されているか否かを判断するようにしてもよい。あるいは、映像格納部１０３に１フレーム画像分以上の予め指定された量の映像情報が蓄積されている場合に、出力するタイミングであると判断しても良い。出力するタイミングである場合、ステップＳ１１６に進み、タイミングでない場合、ステップＳ１２４に進む。 (Step S115) The output unit 113 determines whether it is time to output the video information. For example, it determines whether the current time is the timing indicated by the video timing information that indicates the earliest output timing among the video timing information associated with the frame images constituting the video information stored in the video storage unit 103. If so, it determines that it is time to output the video information, here the frame image associated with that timing information. Otherwise, it determines that it is not time to output the video information. When the very first frame image is to be output, it may simply determine whether the first frame image is stored in the video storage unit 103. Alternatively, it may determine that it is time to output when a predetermined amount of video information, equal to or greater than one frame image, has accumulated in the video storage unit 103. If it is time to output, the process proceeds to step S116; if not, the process proceeds to step S124.

（ステップＳ１１６）出力部１１３は、映像格納部１０３から出力する映像情報を取得する。ここでは、例えば、映像格納部１０３から、最も早い出力時機を示す映像タイミング情報と対応付けられたフレーム画像を読み出す。なお、読み出したフレーム画像は、映像格納部１０３から削除する。ここでの削除は、削除されたことを示すフラグ等の情報を付与することであっても良い。かかることは以下においても同様である。 (Step S116) The output unit 113 acquires the video information to be output from the video storage unit 103. Here, for example, it reads from the video storage unit 103 the frame image associated with the video timing information indicating the earliest output timing. The frame image that has been read is deleted from the video storage unit 103. Deletion here may consist of attaching information, such as a flag, indicating that the image has been deleted. The same applies hereinafter.

（ステップＳ１１７）出力部１１３は、字幕格納部１０４に格納されている字幕情報が、字幕タイミング情報と対応付けられているか否かを判断する。対応付けられている場合、ステップＳ１１８に進み、対応付けられていない場合、ステップＳ１２７に進む。 (Step S117) The output unit 113 determines whether the subtitle information stored in the subtitle storage unit 104 is associated with the subtitle timing information. When it is associated, the process proceeds to step S118, and when it is not associated, the process proceeds to step S127.

（ステップＳ１１８）出力部１１３は、出力する映像情報、ここでは、例えば、ステップＳ１１６で読み出したフレーム画像に対応付けられている映像タイミング情報を取得する。映像タイミング情報は、例えば、フレーム画像に付与されているタイムコードである。 (Step S118) The output unit 113 acquires video information to be output, here, for example, video timing information associated with the frame image read in step S116. The video timing information is, for example, a time code given to the frame image.

（ステップＳ１１９）出力部１１３は、ステップＳ１１８で取得した映像タイミング情報を指定する字幕タイミング情報を、字幕格納部１０４に格納されている一以上の字幕情報に対応付けられた字幕タイミング情報の中から検出する処理を行う。例えば、出力部１１３は、字幕格納部１０４に格納されている字幕タイミング情報のうちの、最も早い時機（あるいは期間）を示す字幕タイミング情報が、ステップＳ１１８で取得した映像タイミング情報が指定する時機と一致する時機を含むか否かを判断する。そして、含むと判断された場合に、その字幕タイミング情報をステップＳ１１８で取得した映像タイミング情報を指定する字幕タイミング情報として検出する。含まれない場合、検出できなかったと判断する。 (Step S119) The output unit 113 performs processing to detect, from among the subtitle timing information associated with the one or more pieces of subtitle information stored in the subtitle storage unit 104, subtitle timing information that designates the video timing information acquired in step S118. For example, the output unit 113 determines whether the subtitle timing information indicating the earliest timing (or period) among the subtitle timing information stored in the subtitle storage unit 104 includes a timing that matches the timing designated by the video timing information acquired in step S118. If it does, that subtitle timing information is detected as the subtitle timing information designating the video timing information acquired in step S118. If it does not, the output unit 113 determines that no such information could be detected.

（ステップＳ１２０）出力部１１３は、ステップＳ１１９において、ステップＳ１１８で取得した映像タイミング情報を指定する字幕タイミング情報が検出できたか否かを判断する。検出できた場合、ステップＳ１２１に進み、検出できなかった場合、ステップＳ１２２に進む。 (Step S120) The output unit 113 determines whether subtitle timing information designating the video timing information acquired in step S118 was detected in step S119. If it was detected, the process proceeds to step S121; if not, the process proceeds to step S122.

（ステップＳ１２１）出力部１１３は、ステップＳ１１９で検出した字幕タイミング情報と対応付けて格納されている字幕情報を、字幕格納部１０４から取得する。なお、取得した字幕情報及び字幕タイミング情報は、字幕格納部１０４から削除する。 (Step S121) The output unit 113 acquires the caption information stored in association with the caption timing information detected in Step S119 from the caption storage unit 104. The acquired subtitle information and subtitle timing information are deleted from the subtitle storage unit 104.

（ステップＳ１２２）出力部１１３は、図示しない手話映像格納部から手話映像情報を取得する。例えば、出力部１１３は、ステップＳ１１６と同様に、手話映像情報のフレーム画像を取得する。出力部１１３は、例えば、取得した手話映像情報を図示しない手話映像格納部から削除する。 (Step S122) The output unit 113 acquires sign language video information from a sign language video storage unit (not shown). For example, the output unit 113 acquires a frame image of sign language video information as in step S116. For example, the output unit 113 deletes the acquired sign language video information from a sign language video storage unit (not shown).

（ステップＳ１２３）出力部１１３は、ステップＳ１１６で取得した映像情報と、ステップＳ１２１（または、後述するステップＳ１２７）で取得した字幕情報と、ステップＳ１２２で取得した手話映像情報とを合成して出力する。なお、ステップＳ１２１で取得した字幕情報が対応付けられている字幕タイミング情報が、字幕情報の出力される期間を示す情報である場合、出力部１１３は、例えば、この期間の間は、ステップＳ１２１で取得した字幕情報を、ステップＳ１１６で取得した映像情報を合成して出力する。また、ステップＳ１２１で取得した字幕情報が対応付けられている字幕タイミング情報が、字幕情報の出力開始時期を示す情報である場合、またはステップＳ１２７で取得した字幕情報である場合においては、出力部１１３は、例えば、予め指定された時間だけ、取得した字幕情報を、ステップＳ１１６で取得した映像情報を合成して出力するようにする。なお、ステップＳ１２１（または、後述するステップＳ１２７）で字幕情報が取得されなかった場合、出力部１１３は、字幕情報は出力しない。 (Step S123) The output unit 113 combines the video information acquired in step S116, the subtitle information acquired in step S121 (or step S127 described later), and the sign language video information acquired in step S122, and outputs the result. When the subtitle timing information associated with the subtitle information acquired in step S121 indicates a period during which the subtitle information is to be output, the output unit 113, for example, combines the subtitle information acquired in step S121 with the video information acquired in step S116 and outputs it throughout that period. When the subtitle timing information associated with the subtitle information acquired in step S121 indicates the output start time of the subtitle information, or when the subtitle information was acquired in step S127, the output unit 113, for example, combines the acquired subtitle information with the video information acquired in step S116 and outputs it for a predetermined length of time. When no subtitle information was acquired in step S121 (or step S127 described later), the output unit 113 does not output subtitle information.

（ステップＳ１２４）出力状況取得部１０７は、映像格納部１０３における映像情報の格納状況を示す情報、及び字幕格納部１０４における字幕情報の格納状況を示す情報の少なくとも一方を取得する。例えば、映像格納部１０３に格納されている映像情報の量が、映像格納部１０３に設定された映像情報の格納量の最大値の予め指定された比率以上（例えば、９０％以上）であるか否かを判断し、その判断結果を映像情報の格納状況を示す情報として取得する。 (Step S124) The output status acquisition unit 107 acquires at least one of information indicating the storage status of video information in the video storage unit 103 and information indicating the storage status of subtitle information in the subtitle storage unit 104. For example, whether the amount of video information stored in the video storage unit 103 is greater than or equal to a predetermined ratio (for example, 90% or more) of the maximum value of the video information storage amount set in the video storage unit 103. And the result of the determination is acquired as information indicating the storage status of the video information.

（ステップＳ１２５）要求送信部１０８は、ステップＳ１２４で取得した格納状況を示す値に応じて設定される字幕要求情報を送信するタイミングになったか否かを判断する。例えば、要求送信部１０８は、ステップＳ１２４で取得した格納状況を示す情報が、予め指定された比率以上の映像情報が格納されていることを示す判断結果であった場合には、字幕要求情報を送信するタイミングを第一の間隔、例えば、３０秒間隔に設定し、予め指定された比率以上の映像情報が格納されていないことを示す判断結果であった場合、タイミングを第二の間隔、例えば、１０秒間隔に設定する。そして、直前に字幕要求情報を送信してからの経過時間がこの設定した間隔となった場合に、字幕要求情報を送信するタイミングであると判断する。タイミングである場合、ステップＳ１２６に進み、タイミングでない場合、ステップＳ１０９に進む。 (Step S125) The request transmission unit 108 determines whether it is time to transmit subtitle request information, where the timing is set according to the value indicating the storage status acquired in step S124. For example, when the information indicating the storage status acquired in step S124 is a determination result indicating that video information at or above a predetermined ratio is stored, the request transmission unit 108 sets the timing for transmitting subtitle request information to a first interval, for example, every 30 seconds; when the determination result indicates that video information at or above the predetermined ratio is not stored, it sets the timing to a second interval, for example, every 10 seconds. Then, when the time elapsed since the subtitle request information was last transmitted reaches the set interval, it determines that it is time to transmit subtitle request information. If it is time, the process proceeds to step S126; if not, the process returns to step S109.

（ステップＳ１２６）要求送信部１０８は、字幕要求情報を字幕サーバ装置４に送信する。そして、ステップＳ１０９に戻る。 (Step S126) The request transmission unit 108 transmits the caption request information to the caption server device 4. Then, the process returns to step S109.

（ステップＳ１２７）出力部１１３は、字幕格納部１０４に格納されている一の字幕情報を取得する。例えば、出力部１１３は、蓄積されたタイミングが最も早い字幕情報を読み出す。読み出した字幕情報は、字幕格納部１０４から削除する。そして、ステップＳ１２２に進む。 (Step S127) The output unit 113 acquires one piece of subtitle information stored in the subtitle storage unit 104. For example, the output unit 113 reads the caption information with the earliest accumulated timing. The read subtitle information is deleted from the subtitle storage unit 104. Then, the process proceeds to step S122.
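
Steps S118 to S121 above can be sketched as a check of the earliest buffered caption against the time code of the frame about to be output. The function `take_caption`, the `(start, end, text)` tuple layout, and the use of seconds are assumptions for illustration; the caption storage unit 104 is modelled as a `deque` kept in order of timing.

```python
from collections import deque
from typing import Deque, Optional, Tuple

# (start, end, text): the period during which the caption should be shown
TimedCaption = Tuple[float, float, str]


def take_caption(buffer: Deque[TimedCaption], timecode: float) -> Optional[str]:
    """Return the earliest caption if its period contains `timecode`.

    The caption is removed from the buffer once acquired, mirroring the
    deletion in step S121. Returns None when nothing is detected (step S120).
    """
    if not buffer:
        return None
    start, end, text = buffer[0]
    if start <= timecode <= end:
        buffer.popleft()
        return text
    return None
```

Because the caption is removed when first detected, the caller is assumed to keep displaying it for the rest of its period, as described for step S123.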

なお、図２に示したフローチャートにおいては、ステップＳ１２４と、ステップＳ１２５等の間に、予め指定されたタイミングになった場合に映像要求情報送信部１１０が映像要求情報を映像サーバ装置３に送信する処理を有していても良い。同様に、予め指定されたタイミングになった場合に、図示しない手話映像要求情報送信部が、手話要求情報等を手話映像サーバ装置５に送信する処理を有していても良い。 In the flowchart shown in FIG. 2, the video request information transmitting unit 110 transmits the video request information to the video server device 3 when the timing designated in advance is reached between step S124 and step S125. You may have processing. Similarly, a sign language video request information transmission unit (not shown) may have a process of transmitting sign language request information or the like to the sign language video server device 5 when the timing designated in advance is reached.
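
The interval selection of steps S124 and S125 can be sketched as follows, using the example values from the description (a 90% ratio, 30 seconds, and 10 seconds). The function names and the representation of time as seconds are assumptions for illustration.

```python
def request_interval(
    stored: int,
    maximum: int,
    threshold: float = 0.9,
    slow: float = 30.0,
    fast: float = 10.0,
) -> float:
    """Choose the caption request interval from the video buffer fill level.

    A buffer at or above `threshold` of its maximum uses the longer
    interval; otherwise the shorter interval is used.
    """
    if maximum <= 0:
        raise ValueError("maximum must be positive")
    return slow if stored / maximum >= threshold else fast


def should_request(now: float, last_request: float, interval: float) -> bool:
    """True when the elapsed time since the last request reaches the interval."""
    return now - last_request >= interval
```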

また、出力部１１３が、映像情報のフレーム画像を出力する毎に、出力した映像情報のフレーム画像の数をカウントすることで、出力済のフレーム数を取得するようにし、この出力済のフレーム数から取得した次に出力する映像情報のフレーム画像の出力順番を示す値を、ステップＳ１１８において取得する映像タイミング情報の代わりに取得するようにしてもよい。 Alternatively, each time the output unit 113 outputs a frame image of the video information, it may count the number of frame images output so far to obtain the number of output frames, and may acquire a value indicating the output order of the next frame image to be output, derived from this number of output frames, in place of the video timing information acquired in step S118.
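
A minimal sketch of this frame-counting alternative follows. The class `FrameCounter` and the 1-based numbering of the output order are assumptions for illustration; the description does not state how the order value is numbered.

```python
class FrameCounter:
    """Tracks how many frame images have been output so far."""

    def __init__(self) -> None:
        self.output_frames = 0

    def next_index(self) -> int:
        # Output order of the frame about to be output (assumed 1-based).
        return self.output_frames + 1

    def mark_output(self) -> None:
        # Call once each time a frame image has been output.
        self.output_frames += 1
```

In this variant the caption timing information would be expressed as frame positions rather than time codes, so that `next_index()` can be compared against it directly.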

なお、図２のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 In the flowchart of FIG. 2, the process is terminated by powering off or a process termination interrupt.

次に、映像システム１０００の字幕サーバ装置４の動作の一例について図３のフローチャートを用いて説明する。 Next, an example of the operation of the caption server device 4 of the video system 1000 will be described with reference to the flowchart of FIG. 3.

（ステップＳ２０１）字幕サーバ側受信部４０２は、字幕指定情報を受け付けたか否かを判断する。受け付けた場合、ステップＳ２０２に進み、受け付けていない場合、ステップＳ２０３に進む。 (Step S201) The subtitle server side receiving unit 402 determines whether or not subtitle designation information has been received. If accepted, the process proceeds to step S202. If not accepted, the process proceeds to step S203.

（ステップＳ２０２）字幕サーバ受信部４０２は、ステップＳ２０１で受け付けた字幕指定情報に応じて、字幕タイミング情報と対応付けられた字幕情報を送信するか、字幕タイミング情報と対応付けられていない字幕情報を送信するかを設定する情報を更新する。この設定の情報を、以下、字幕タイミング設定情報と称す。字幕タイミング設定情報は、例えば、図示しない記憶媒体等に格納されているものとする。なお、字幕タイミング設定の情報は、更新されるまでは、例えば、デフォルトで指定された値であるとする。また、字幕タイミング設定情報が、既に格納されている字幕タイミング設定情報と同じ場合は、この字幕タイミング設定情報を蓄積しないようにしても良く、この場合も、字幕タイミング設定情報を、最新の情報に更新したと考えても良い。字幕タイミング設定情報は、例えば、字幕指定情報を送信した映像出力装置１の識別情報（例えば、ＩＰアドレスや、装置名等）と対応付けられて格納される。 (Step S202) The subtitle server side receiving unit 402 updates, according to the subtitle designation information received in step S201, the information that sets whether to transmit subtitle information associated with subtitle timing information or subtitle information not associated with subtitle timing information. This setting information is hereinafter referred to as subtitle timing setting information. The subtitle timing setting information is stored, for example, in a storage medium (not shown). Until it is updated, the subtitle timing setting information is assumed to hold, for example, a value designated by default. When the new subtitle timing setting information is the same as the subtitle timing setting information already stored, it need not be accumulated; in this case too, the subtitle timing setting information may be regarded as having been updated to the latest information. The subtitle timing setting information is stored in association with, for example, identification information (for example, an IP address or a device name) of the video output device 1 that transmitted the subtitle designation information.

（ステップＳ２０３）字幕サーバ受信部４０２は、字幕タイミング情報を受信したか否かを判断する。受信した場合、ステップＳ２０４に進み、受信していない場合、ステップＳ２０１に戻る。 (Step S203) The caption server reception unit 402 determines whether or not caption timing information has been received. If received, the process proceeds to step S204. If not received, the process returns to step S201.

（ステップＳ２０４）字幕サーバ側送信部４０３は、字幕タイミング設定情報が示す設定が、字幕タイミング情報と対応付けられた字幕情報を送信することを示す設定であるか否かを判断する。設定である場合、ステップＳ２０５に進み、設定でない場合、ステップＳ２０６に進む。 (Step S204) The subtitle server side transmission unit 403 determines whether or not the setting indicated by the subtitle timing setting information is a setting indicating that subtitle information associated with the subtitle timing information is transmitted. If it is set, the process proceeds to step S205. If it is not set, the process proceeds to step S206.

（ステップＳ２０５）字幕サーバ側送信部４０３は、字幕要求情報により特定される字幕情報であって、出力順番が早い未送信の１または２以上の字幕情報と、各字幕情報と対応付けられた字幕タイミング情報とを字幕サーバ側格納部４０１から取得する。字幕サーバ側送信部４０３は、例えば、字幕要求情報が有する映像識別情報と対応付けられた字幕情報を、字幕要求情報により特定される字幕情報として取得する。また、例えば、字幕サーバ側送信部４０３は、映像出力装置１が送信する字幕要求情報が、映像出力装置１が送信する映像要求情報が有する映像識別情報に対応する字幕群識別情報を有する場合、この字幕群識別情報に対応する字幕情報を、字幕要求情報により特定される字幕情報として取得してもよい。この字幕群識別情報は、例えば、制御情報受信部１１２が受信した制御情報から、要求送信部１０８等が取得する。また、字幕サーバ側送信部４０３は、例えば、字幕要求情報が指定するディレクトリ等に格納されている字幕情報を、字幕要求情報により特定される字幕情報として取得する。このディレクトリ等の情報は、例えば、制御情報から要求送信部１０８等が取得する。 (Step S205) The subtitle server side transmission unit 403 is subtitle information specified by subtitle request information, and one or two or more subtitle information that has not been transmitted in the early output order, and subtitles associated with each subtitle information Timing information is acquired from the subtitle server side storage unit 401. For example, the caption server side transmission unit 403 acquires caption information associated with video identification information included in the caption request information as caption information specified by the caption request information. Further, for example, when the subtitle request information transmitted from the video output device 1 includes the subtitle group identification information corresponding to the video identification information included in the video request information transmitted from the video output device 1, the subtitle server side transmission unit 403 includes: The caption information corresponding to the caption group identification information may be acquired as caption information specified by the caption request information. The subtitle group identification information is acquired by the request transmission unit 108 or the like from the control information received by the control information reception unit 112, for example. 
Also, the caption server side transmission unit 403 acquires, for example, caption information stored in a directory or the like specified by the caption request information as caption information specified by the caption request information. The information such as the directory is acquired by the request transmission unit 108 or the like from the control information, for example.

例えば、字幕サーバ側送信部４０３は、未送信の字幕情報の順番等をカウントして、そのカウント値から、次に送信される字幕情報を判断しても良い。例えば、カウント値に１を加算した値を示す順番の字幕情報を、出力対象の字幕情報と判断しても良い。ただし、どのように、送信済であるか否かを判断しても良い。例えば、送信した字幕情報に対して、送信済であることを示すフラグ等の情報を付与することで、この情報の有無により判断可能である。また、例えば、送信済の字幕情報の識別情報である字幕識別情報を、図示しない管理表等に登録するようにしてもよい。そして、ステップＳ２０７に進む。 For example, the subtitle server side transmission unit 403 may keep a count of the order of the subtitle information and determine from the count value which subtitle information is to be transmitted next. For example, the subtitle information whose position in the order equals the count value plus 1 may be determined to be the subtitle information to be output. However, whether subtitle information has already been transmitted may be determined in any manner. For example, information such as a flag indicating that transmission has been completed may be attached to transmitted subtitle information, so that the determination can be made from the presence or absence of this information. Alternatively, subtitle identification information, which identifies transmitted subtitle information, may be registered in a management table (not shown) or the like. Then, the process proceeds to step S207.

（ステップＳ２０６）字幕サーバ側送信部４０３は、字幕要求情報により特定される字幕情報であって、出力順番が早い未出力の１または２以上の字幕情報を字幕サーバ側格納部４０１から取得する。そして、ステップＳ２０７に進む。 (Step S206) The subtitle server side transmission unit 403 acquires, from the subtitle server side storage unit 401, one or two or more subtitle information that is subtitle information specified by the subtitle request information and whose output order is early. Then, the process proceeds to step S207.

（ステップＳ２０７）字幕サーバ側送信部４０３は、ステップＳ２０５で取得した字幕情報と字幕タイミング情報との一以上の組、またはステップＳ２０６で取得した一以上の字幕情報を、字幕要求情報の送信元となる映像出力装置１に送信する。 (Step S207) The subtitle server side transmission unit 403 transmits the one or more pairs of subtitle information and subtitle timing information acquired in step S205, or the one or more pieces of subtitle information acquired in step S206, to the video output device 1 that is the transmission source of the subtitle request information.

（ステップＳ２０８）字幕サーバ側送信部４０３は、送信した字幕情報の数を示す値を１増加させる。例えば、送信した字幕情報の数を示すカウンターの値を１インクリメントする。そして、ステップＳ２０１に戻る。 (Step S208) The caption server side transmission unit 403 increments the value indicating the number of transmitted caption information by one. For example, the counter value indicating the number of transmitted caption information is incremented by one. Then, the process returns to step S201.

なお、図３のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 In the flowchart of FIG. 3, the process ends when the power is turned off or the process is terminated.
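The flow of steps S201 to S208 can be sketched as a small state machine. This is an illustrative sketch for a single video output device, not the patent's implementation: the class `CaptionServer`, its method names, the `batch_size` parameter, and the payload dictionaries are all assumptions introduced here.

```python
class CaptionServer:
    """Hypothetical sketch of steps S201 to S208 for one video output device."""

    def __init__(self, captions, with_timing_default=True):
        # captions: list of (text, timing) tuples, already in output order
        self.captions = captions
        # Subtitle timing setting information; holds a default until updated.
        self.with_timing = with_timing_default
        # Number of caption items transmitted so far (counter of step S208).
        self.sent_count = 0

    def on_designation(self, with_timing):
        # S201/S202: update the setting from the subtitle designation information.
        self.with_timing = with_timing

    def on_request(self, batch_size=1):
        # S204 to S206: select the earliest untransmitted caption items.
        batch = self.captions[self.sent_count:self.sent_count + batch_size]
        if self.with_timing:
            payload = [{"text": t, "timing": tm} for t, tm in batch]  # S205
        else:
            payload = [{"text": t} for t, _ in batch]  # S206
        # S208: advance the counter by the number of items sent (an assumption;
        # the source describes incrementing by one).
        self.sent_count += len(batch)
        return payload  # S207: returned to the requesting device
```

Using the counter as the index of the next untransmitted item corresponds to the counting approach described for step S205; a per-item flag or a management table would serve equally well.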

以下、制御サーバ装置２の動作の一例について簡単に説明する。制御サーバ側受信部２０２は、制御要求情報を受け付けたか否かの判断を繰り返し、制御要求情報を受け付けた場合、制御サーバ側送信部２０３は、この制御要求情報に対応する制御情報を、制御情報格納部２０１から読み出す。制御要求情報に対応する制御情報とは、例えば、制御要求情報が有する映像識別情報と対応付けられて制御情報格納部２０１に格納されている制御情報である。制御要求情報に含まれる映像識別情報は、例えば、映像出力装置１が出力する指示を受け付けた映像情報の映像識別情報である。そして、制御サーバ側送信部２０３は、読み出した制御情報を制御要求情報の送信元となる映像出力装置１に送信する。そして、上記の、制御要求情報を受け付けたか否かを判断する処理に戻る。 Hereinafter, an example of the operation of the control server device 2 will be briefly described. The control server side receiving unit 202 repeatedly determines whether control request information has been received. When control request information has been received, the control server side transmitting unit 203 reads the control information corresponding to this control request information from the control information storage unit 201. The control information corresponding to the control request information is, for example, control information stored in the control information storage unit 201 in association with the video identification information included in the control request information. The video identification information included in the control request information is, for example, the video identification information of the video information for which the video output device 1 has received an output instruction. The control server side transmitting unit 203 then transmits the read control information to the video output device 1 that is the transmission source of the control request information. The process then returns to the determination of whether control request information has been received.

以下、映像サーバ装置３の動作の一例について簡単に説明する。映像サーバ側受信部３０２は、映像要求情報を受け付けたか否かの判断を繰り返し、映像要求情報を受け付けた場合、映像要求情報が指定する映像情報を映像サーバ側格納部３０１から読み出す。映像要求情報が指定する映像情報は、例えば、映像要求情報が有する映像識別情報と対応付けて映像サーバ側格納部３０１に格納されている映像情報や、映像要求情報が指定するディレクトリ等に格納されている映像情報である。このディレクトリ等の情報は、例えば、制御情報から映像要求情報送信部１１０等が取得する。そして、読み出した映像情報を、映像要求情報の送信元の映像出力装置１に送信する。そして、上記の、映像要求情報を受け付けたか否かを判断する処理に戻る。なお、映像サーバ側送信部３０３は、映像要求情報を受ける毎に、映像情報のうちの、未送信の一部を、出力順に沿って送信するようにしても良い。 Hereinafter, an example of the operation of the video server device 3 will be briefly described. The video server-side receiving unit 302 repeatedly determines whether video request information has been received. When video request information is received, the video server-side receiving unit 302 reads the video information specified by the video request information from the video server-side storage unit 301. The video information specified by the video request information is stored in, for example, video information stored in the video server side storage unit 301 in association with video identification information included in the video request information, a directory specified by the video request information, or the like. Video information. The information such as the directory is acquired by the video request information transmission unit 110 or the like from the control information, for example. Then, the read video information is transmitted to the video output device 1 that is the transmission source of the video request information. Then, the process returns to the process of determining whether or not the video request information has been received. Note that the video server-side transmission unit 303 may transmit a part of the video information that has not been transmitted in the order of output each time it receives the video request information.

以下、手話映像サーバ装置５の動作の一例について簡単に説明する。手話映像サーバ側受信部５０２は、手話映像要求情報を受け付けたか否かの判断を繰り返し、手話映像要求情報を受け付けた場合、手話映像要求情報が指定する手話映像情報を手話映像サーバ側格納部５０１から読み出す。手話映像要求情報が指定する手話映像情報は、例えば、手話映像要求情報が有する映像識別情報と対応付けて手話映像サーバ側格納部５０１に格納されている手話映像情報や、手話映像要求情報が指定するディレクトリ等に格納されている手話映像情報である。このディレクトリ等の情報は、例えば、制御情報から図示しない手話映像要求情報送信部等が取得する。そして、読み出した手話映像情報を、手話映像要求情報の送信元の映像出力装置１に送信する。そして、上記の、手話映像要求情報を受け付けたか否かを判断する処理に戻る。なお、手話映像サーバ側送信部５０３は、手話映像要求情報を受ける毎に、手話映像情報のうちの、未送信の一部を、出力順に沿って送信するようにしても良い。 Hereinafter, an example of the operation of the sign language video server device 5 will be briefly described. The sign language video server side receiving unit 502 repeatedly determines whether or not sign language video request information has been received. When sign language video request information is received, the sign language video server side storage unit 501 stores the sign language video information specified by the sign language video request information. Read from. The sign language image information specified by the sign language image request information is, for example, specified by the sign language image information stored in the sign language image server side storage unit 501 in association with the image identification information included in the sign language image request information or the sign language image request information. Sign language video information stored in a directory or the like. The information such as the directory is acquired by, for example, a sign language video request information transmission unit (not shown) from the control information. Then, the read sign language video information is transmitted to the video output device 1 that is the transmission source of the sign language video request information. Then, the process returns to the process of determining whether or not the sign language image request information has been received. 
Note that the sign language video server-side transmission unit 503 may transmit a part of the sign language video information that has not been transmitted in the order of output each time it receives the sign language video request information.

以下、本実施の形態における映像システム１０００の具体的な動作について説明する。映像システム１０００の概念図は図４である。ここでは、説明を簡単にするために、映像出力装置１、制御サーバ装置２、映像サーバ装置３、字幕サーバ装置４、及び手話映像サーバ装置５をそれぞれ一つずつ備えている場合を例に挙げて説明するが、本発明の映像システム１０００においては、映像出力装置１、制御サーバ装置２、映像サーバ装置３、字幕サーバ装置４、及び手話映像サーバ装置５は、それぞれ１または２以上であればよい。 Hereinafter, a specific operation of the video system 1000 in the present embodiment will be described. A conceptual diagram of the video system 1000 is shown in FIG. 4. To simplify the explanation, a case in which the system includes one each of the video output device 1, the control server device 2, the video server device 3, the caption server device 4, and the sign language video server device 5 is taken as an example. In the video system 1000 of the present invention, however, it suffices that there are one or more of each of the video output device 1, the control server device 2, the video server device 3, the caption server device 4, and the sign language video server device 5.

図５は、制御サーバ装置２の制御情報格納部２０１に格納されている制御情報を管理する制御情報管理表である。制御情報管理表は、「制御情報」と「映像ＩＤ」という属性を有している。「制御情報」は、制御情報のファイル名であるとする。「映像ＩＤ」は、制御情報が対応する映像情報の識別情報であるとする。 FIG. 5 is a control information management table for managing control information stored in the control information storage unit 201 of the control server device 2. The control information management table has attributes of “control information” and “video ID”. It is assumed that “control information” is a file name of the control information. The “video ID” is identification information of video information corresponding to the control information.

図６は、映像サーバ装置３の映像サーバ側格納部３０１に格納されている映像情報を管理する映像情報管理表の一例を示す図である。映像情報管理表は、「映像ＩＤ」と「映像情報」という属性を有している。「映像ＩＤ」は、図５の「映像ＩＤ」に対応している。「映像情報」は、映像ＩＤにより識別される映像情報である。 FIG. 6 is a diagram illustrating an example of a video information management table for managing video information stored in the video server side storage unit 301 of the video server device 3. The video information management table has attributes of “video ID” and “video information”. “Video ID” corresponds to “Video ID” in FIG. “Video information” is video information identified by a video ID.

図７は、図６に示した映像情報管理表で管理されている映像情報の一例を示す模式図であり、ここでは、映像ＩＤ「ｍ１００１」と対応付けられた映像情報を示している。映像情報は、複数のフレーム画像７１−７３を有しており、各フレーム画像には、映像タイミング情報として、タイムコード７１ａ−７３ａが対応付けられている。タイムコード７１ａ−７３ａは、ここでは、「時：分：秒：フレーム」を表しているものとする。映像情報の最初のフレーム画像の表示時刻が、ここでは、タイムコードの「０：００：００．０１」に設定される。なお、ここでの映像情報は、説明のための模式図であり、フレーム画像とタイムコード等とが格納されている映像情報の実際のデータ構造等を示すものではない。 FIG. 7 is a schematic diagram illustrating an example of video information managed in the video information management table illustrated in FIG. 6, and here, video information associated with the video ID “m1001” is illustrated. The video information includes a plurality of frame images 71-73, and time codes 71a-73a are associated with each frame image as video timing information. Here, it is assumed that the time codes 71a to 73a represent “hour: minute: second: frame”. In this example, the display time of the first frame image of the video information is set to “0: 00: 00.01” of the time code. Note that the video information here is a schematic diagram for explanation, and does not indicate an actual data structure or the like of the video information in which the frame image and the time code are stored.
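As an illustration of how a time code such as "0:00:00.01" could be turned into a frame position, the sketch below parses the hour, minute, second, and frame fields. The frame rate of 30 frames per second, the function name `timecode_to_frame_index`, and the 1-based numbering convention are assumptions; the source states none of them.

```python
import re

FPS = 30  # assumed frame rate; the source does not specify one


def timecode_to_frame_index(timecode, fps=FPS):
    """Convert 'H:MM:SS.FF' (or 'H:MM:SS:FF') into a frame position.

    Under the assumed convention, '0:00:00.01' maps to position 1.
    """
    m = re.fullmatch(r"(\d+):(\d{2}):(\d{2})[.:](\d{2})", timecode)
    if m is None:
        raise ValueError(f"unrecognized time code: {timecode!r}")
    hours, minutes, seconds, frame = (int(g) for g in m.groups())
    return ((hours * 60 + minutes) * 60 + seconds) * fps + frame
```

A numeric position of this kind makes it straightforward to compare video timing information with subtitle timing information expressed in the same time code format.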

図８は、字幕サーバ装置４の字幕サーバ側格納部４０１に格納されている字幕群情報を管理する字幕群情報管理表である。字幕群情報とは、一の映像情報に対応する字幕情報の群を有する情報であり、ここでは、一の映像情報に対応付けられた（具体的には、一の映像ＩＤと対応付けられた）１または２以上の字幕情報と、各字幕情報と対応付けられた字幕タイミング情報とを有する情報であるとする。ここでは、一例として、一の映像情報に対応付けられた字幕群情報は、一のファイルを構成しているものとする。字幕群情報管理表は、「映像ＩＤ」と、「字幕群情報」という属性を有している。「映像ＩＤ」は、図５および図７の「映像ＩＤ」に対応している。「字幕群情報」は、字幕群情報のファイル名を示している。ただし、ここでは拡張子等は省略している。 FIG. 8 is a caption group information management table for managing caption group information stored in the caption server side storage unit 401 of the caption server device 4. The caption group information is information having a group of caption information corresponding to one video information. Here, the caption group information is associated with one video information (specifically, associated with one video ID). ) It is assumed that the information includes one or more subtitle information and subtitle timing information associated with each subtitle information. Here, as an example, it is assumed that subtitle group information associated with one video information constitutes one file. The caption group information management table has attributes of “video ID” and “caption group information”. “Video ID” corresponds to “Video ID” in FIGS. 5 and 7. “Subtitle group information” indicates the file name of the subtitle group information. However, the extension is omitted here.

図９は、図８の字幕群情報管理表で管理されている字幕群情報の一例を示す図である。ここでは、ファイル名が「ｃ１００１」である字幕群情報を示している。ここでは、字幕群情報は、ＸＨＴＭＬ形式で記述された情報であるとする。図９に示した字幕群情報において、「ｐ」タグで囲まれた要素が、一の字幕情報を示す。「ｐ」タグ内の「ｂｅｇｉｎ」属性の属性値は、字幕情報に対応する字幕タイミング情報であり、字幕情報の開始時刻を示す情報である。また、「ｅｎｄ」属性の属性値は、字幕情報に対応する字幕タイミング情報であり、字幕情報の終了時刻を示す情報である。この字幕群情報内においては、字幕情報は、出力時刻が早いものから順に、上から下に向かって配置されているものとする。 FIG. 9 is a diagram illustrating an example of caption group information managed in the caption group information management table of FIG. Here, subtitle group information whose file name is “c1001” is shown. Here, it is assumed that the caption group information is information described in the XHTML format. In the caption group information shown in FIG. 9, an element surrounded by a “p” tag indicates one caption information. The attribute value of the “begin” attribute in the “p” tag is subtitle timing information corresponding to the subtitle information, and is information indicating the start time of the subtitle information. The attribute value of the “end” attribute is subtitle timing information corresponding to the subtitle information, and is information indicating the end time of the subtitle information. In the subtitle group information, the subtitle information is arranged from top to bottom in order from the earliest output time.
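A caption group file of the kind described, with one "p" element per caption and "begin" and "end" attributes, could be read as sketched below. The sample markup and caption text are invented for illustration, since the content of FIG. 9 is not reproduced in this text, and the function name `parse_caption_group` is hypothetical. The sketch assumes markup without an XML namespace; a real XHTML file declaring a namespace would need namespace-qualified tag names.

```python
import xml.etree.ElementTree as ET

# Hypothetical caption group file; not the actual content of FIG. 9.
SAMPLE = """<html><body>
<p begin="0:00:05.00" end="0:00:08.00">Good morning.</p>
<p begin="0:00:09.00" end="0:00:12.00">Here is the news.</p>
</body></html>"""


def parse_caption_group(xhtml_text):
    """Return (begin, end, text) for every <p> element, in document order."""
    root = ET.fromstring(xhtml_text)
    return [
        (p.get("begin"), p.get("end"), (p.text or "").strip())
        for p in root.iter("p")
    ]


captions = parse_caption_group(SAMPLE)
```

Because the captions are arranged from the earliest output time downward, document order already gives the transmission order.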

図１０は、手話映像サーバ装置５に格納されている手話映像情報を管理する手話映像管理表である。手話映像管理表は、「映像ＩＤ」と「手話ＩＤ」と「手話映像」という属性を有している。「映像ＩＤ」は、図５の「映像ＩＤ」に対応している。「手話ＩＤ」は、手話映像情報の識別情報である。「手話映像」は、手話ＩＤにより識別される手話映像情報である。 FIG. 10 is a sign language video management table for managing the sign language video information stored in the sign language video server device 5. The sign language video management table has the attributes “video ID”, “sign language ID”, and “sign language video”. “Video ID” corresponds to “video ID” in FIG. 5. “Sign language ID” is identification information of sign language video information. “Sign language video” is the sign language video information identified by the sign language ID.

図１１は、図１０に示した手話映像情報管理表で管理されている手話映像情報の一例を示す模式図であり、ここでは、手話ＩＤ「ｈ１００１」と対応付けられた手話映像情報を示している。手話映像情報は、複数のフレーム画像８１−８３を有しており、各フレーム画像には、手話映像情報のフレーム画像が出力されるタイミングを示す情報である手話映像タイミング情報として、映像タイミング情報と同様の、タイムコード８１ａ−８３ａが対応付けられている。なお、ここでの手話映像情報は、説明のための模式図であり、フレーム画像とタイムコード等とが格納されている手話映像情報の実際のデータ構造等を示すものではない。 FIG. 11 is a schematic diagram showing an example of sign language video information managed in the sign language video information management table shown in FIG. 10. Here, the sign language video information associated with the sign language ID “h1001” is shown. Yes. The sign language video information includes a plurality of frame images 81-83, and each frame image includes video timing information as sign language video timing information which is information indicating a timing at which a frame image of the sign language video information is output. Similar time codes 81a to 83a are associated with each other. Here, the sign language video information is a schematic diagram for explanation, and does not indicate an actual data structure or the like of the sign language video information in which the frame image, the time code, and the like are stored.

まず、映像出力装置１を操作するユーザが、図示しないＷＥＢサイトに接続して、映像情報のリスト（図示せず）を示すＷＥＢページの情報を取得し、出力部１１３と接続された図示しないモニタにこのＷＥＢページを表示させたとする（図示せず）。この映像情報のリストの各項目には、各項目の映像情報の識別情報である映像ＩＤが対応付けられているものとする。そして、ユーザが視聴したい一の映像情報を選択する操作（例えば、映像情報のアイコンをクリックする操作）等を行ったとする。 First, a user who operates the video output apparatus 1 connects to a WEB site (not shown), acquires information on a WEB page indicating a list of video information (not shown), and is connected to the output unit 113 (not shown). Suppose that this WEB page is displayed (not shown). Each item in the list of video information is associated with a video ID that is identification information of the video information of each item. Then, it is assumed that the user performs an operation of selecting one piece of video information that the user wants to view (for example, an operation of clicking an icon of the video information).

映像を設定する操作が行われると、格納量設定部１０６は、映像格納部１０３の格納量の最大値と、字幕格納部１０４の格納量の最大値とを設定する。ここでは、映像格納部１０３及び字幕格納部１０４として、映像出力装置１の記憶媒体であるメモリ（図示せず）が用いられるものとする。まず、格納量設定部１０６は、自装置である映像出力装置１の、映像格納部１０３及び字幕格納部１０４として利用される記憶媒体であるメモリの空き領域を検出し、その空き領域のデータ容量を取得する。例えば、取得した空き領域のデータ容量が１１ＧＢであったとする。 When an operation for setting a video is performed, the storage amount setting unit 106 sets the maximum storage amount of the video storage unit 103 and the maximum storage amount of the caption storage unit 104. Here, a memory (not shown) that is a storage medium of the video output device 1 is used as the video storage unit 103 and the caption storage unit 104. First, the storage amount setting unit 106 detects the free area of the memory of its own device, the video output device 1, that serves as the video storage unit 103 and the caption storage unit 104, and acquires the data capacity of that free area. For example, assume that the acquired data capacity of the free area is 11 GB.

図１２は、映像出力装置１のメモリの空き領域の容量と、映像格納部１０３に格納される映像情報の格納量の最大値と、字幕格納部１０４に格納される字幕情報の格納量の最大値との関係を示す格納量管理表である。格納量管理表は、「空き容量」と、「映像最大値」と、「字幕最大値」という属性を有している。「空き容量」は、メモリの空き領域の容量の範囲を示し、単位はバイトである。「映像最大値」は、映像格納部１０３に格納される映像情報の格納量の最大値を示し、単位は秒であるとする。つまり、「映像最大値」が示す秒数分の映像情報が、映像格納部１０３に格納される映像情報の最大値である。「字幕最大値」は、字幕格納部１０４に格納される字幕情報の格納量の最大値を示し、単位は秒であるとする。つまり、出力される期間の合計が、「字幕最大値」が示す秒数分以下となる１または２以上の字幕情報が、字幕格納部１０４に格納可能であることを示す。 FIG. 12 is a storage amount management table showing the relationship among the capacity of the free area in the memory of the video output device 1, the maximum storage amount of video information stored in the video storage unit 103, and the maximum storage amount of caption information stored in the caption storage unit 104. The storage amount management table has the attributes “free space”, “video maximum value”, and “caption maximum value”. “Free space” indicates a range of free memory capacity, in bytes. The “video maximum value” indicates the maximum storage amount of video information stored in the video storage unit 103, in seconds. That is, video information covering the number of seconds indicated by the “video maximum value” is the most that the video storage unit 103 stores. The “caption maximum value” indicates the maximum storage amount of caption information stored in the caption storage unit 104, in seconds. That is, one or more pieces of caption information whose total output period is no more than the number of seconds indicated by the “caption maximum value” can be stored in the caption storage unit 104.

格納量設定部１０６は、図１２に示した格納量管理表の各レコード（行）から、「空き容量」が示す範囲が、上記で取得した空き領域のデータ容量の値１１ＧＢを含む範囲であるレコードを検出し、検出したレコードの「映像最大値」と「字幕最大値」との属性値を取得する。ここでは、１１ＧＢは、上から３行目のレコードの「６ＧＢ以上１２ＧＢ未満」の範囲に属するため、格納量設定部１０６は、このレコードの「映像最大値」の属性値である３０秒と、「字幕最大値」の属性値である１０秒とを取得する。そして、映像格納部１０３の格納量の最大値を３０秒に、また、字幕格納部１０４の格納量の最大値を１０秒に設定する。例えば、このような設定を示す情報を、図示しない記憶媒体等に蓄積する。 From the records (rows) of the storage amount management table shown in FIG. 12, the storage amount setting unit 106 detects the record whose “free space” range includes the free-area data capacity acquired above, 11 GB, and acquires the “video maximum value” and “caption maximum value” attribute values of the detected record. Here, since 11 GB falls within the range “6 GB or more and less than 12 GB” of the record in the third row from the top, the storage amount setting unit 106 acquires 30 seconds, the attribute value of the “video maximum value” of this record, and 10 seconds, the attribute value of its “caption maximum value”. It then sets the maximum storage amount of the video storage unit 103 to 30 seconds and the maximum storage amount of the caption storage unit 104 to 10 seconds. For example, information indicating these settings is accumulated in a storage medium (not shown).

ここで設定される映像最大値は、映像情報の出力時間を示す。つまり、映像最大値が示す時間分の映像情報（例えば、フレーム画像）が、映像格納部１０３に格納可能である。また、ここで設定される字幕最大値は、字幕情報の出力開始時刻の属する範囲の最大値を示す。つまり、字幕格納部１０４には、出力の開始される時刻が、現在映像情報や字幕情報が出力されている時刻よりも、１０秒以内の範囲となる１または２以上の字幕情報が格納可能であることを意味する。 The video maximum value set here indicates an output duration of video information. That is, video information (for example, frame images) covering the time indicated by the video maximum value can be stored in the video storage unit 103. The caption maximum value set here indicates the upper limit of the range to which the output start times of caption information belong. That is, the caption storage unit 104 can store one or more pieces of caption information whose output start time falls within 10 seconds of the time at which video information and caption information are currently being output.
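The caption window described above can be expressed as a simple predicate. This is a sketch under assumptions: times are taken to be seconds from the start of the video, the function name `can_store` is hypothetical, and treating captions that start before the current time as not storable is an interpretation rather than something the source states.

```python
def can_store(caption_start_s, current_time_s, caption_max_s=10.0):
    """True if the caption starts within caption_max_s seconds after the current time."""
    return 0 <= caption_start_s - current_time_s <= caption_max_s
```

With the example values, a caption that starts at 25 s is storable while playback is at 20 s, whereas one that starts at 31 s is not yet.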

なお、図１２に示した格納量管理表の各行においては、「字幕最大値」の大きさは、「映像最大値」の大きさよりも小さい値に設定されている。このため、字幕格納部１０４の格納量の最大値や、映像格納部１０３の格納量の最大値よりも小さい値に設定される。 Note that in each row of the storage amount management table shown in FIG. 12, the “caption maximum value” is set to a value smaller than the “video maximum value”. Consequently, the maximum storage amount of the caption storage unit 104 is set to a value smaller than the maximum storage amount of the video storage unit 103.

また、ここでは、格納量管理表を用いて映像格納部１０３と字幕格納部１０４の格納量の最大値を設定したが、空き領域のデータ容量の値を引数とした予め指定された関数等を用いてこれらの最大値を算出しても良い。 In this example, the maximum storage amount of the video storage unit 103 and the caption storage unit 104 is set using the storage amount management table, but a function specified in advance using the data capacity value of the free area as an argument is used. These maximum values may be calculated by using them.
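The table lookup performed by the storage amount setting unit 106 can be sketched as a range search. Only the row "6 GB or more and less than 12 GB" with 30 seconds and 10 seconds comes from the text; the other rows, the binary definition of a gigabyte, and the names `STORAGE_TABLE` and `storage_limits` are assumptions made for illustration.

```python
GB = 1024 ** 3  # assumed definition of a gigabyte

# (lower bound inclusive, upper bound exclusive, video max [s], caption max [s])
# Only the third row is taken from the text; the remaining rows are invented.
STORAGE_TABLE = [
    (0, 2 * GB, 5, 2),
    (2 * GB, 6 * GB, 15, 5),
    (6 * GB, 12 * GB, 30, 10),
    (12 * GB, float("inf"), 60, 20),
]


def storage_limits(free_bytes):
    """Return (video maximum, caption maximum) in seconds for the free capacity."""
    for low, high, video_max_s, caption_max_s in STORAGE_TABLE:
        if low <= free_bytes < high:
            return video_max_s, caption_max_s
    raise ValueError("free capacity must be non-negative")
```

For a free capacity of 11 GB, `storage_limits(11 * GB)` selects the third row. A predesignated function of the free capacity, as mentioned above, could replace the table without changing the caller.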

次に、制御要求情報送信部１１１は、上記で選択された映像情報の映像ＩＤである「ｍ１００１」を取得し、上記のＷＥＢページにおいて指定されている制御サーバ装置２に、この映像ＩＤを有する制御要求情報を送信する。 Next, the control request information transmission unit 111 acquires “m1001”, the video ID of the video information selected above, and transmits control request information containing this video ID to the control server device 2 specified in the WEB page.

制御サーバ装置２の制御サーバ側受信部２０２は、映像出力装置１から送信される制御要求情報を受信する。 The control server side receiving unit 202 of the control server device 2 receives control request information transmitted from the video output device 1.

制御サーバ側送信部２０３は、制御サーバ側受信部２０２が受信した制御要求情報に含まれる映像ＩＤ「ｍ１００１」を取得し、図５に示した制御情報管理表から、「映像ＩＤ」が、「ｍ１００１」と一致するレコード（行）を検索し、検出したレコードの「制御情報」の属性値である「ｃｔｒｌ００１」を取得する。そして、制御情報格納部２０１から、ファイル名が「ｃｔｒｌ００１」である制御情報を読み出す。読み出した制御情報を、制御要求情報の送信元となる映像出力装置１に送信する。 The control server-side transmission unit 203 acquires the video ID “m1001” included in the control request information received by the control server-side reception unit 202, and the “video ID” is obtained from the control information management table illustrated in FIG. A record (row) that matches “m1001” is searched, and “ctrl001” that is an attribute value of “control information” of the detected record is acquired. Then, control information whose file name is “ctrl001” is read from the control information storage unit 201. The read control information is transmitted to the video output device 1 that is the transmission source of the control request information.

映像出力装置１の制御情報受信部１１２は、制御サーバ装置２から制御情報「ｃｔｒｌ００１」を受信する。 The control information receiving unit 112 of the video output device 1 receives the control information “ctrl001” from the control server device 2.

図１３は、制御情報受信部１１２が受信した制御情報「ｃｔｒｌ００１」の一例を示す図である。ここでは、制御情報「ｃｔｒｌ００１」を示している。制御情報は、ここでは、ＸＨＴＭＬ形式で記述されているものとする。ここでは、「ｉｄ」属性の属性値が「ｖｉｄｅｏｓｅｒｖｅｒ」である「ｐ」タグの要素、（即ち＜ｐ ｉｄ＝"ｖｉｄｅｏｓｅｒｖｅｒ"＞と＜／ｐ＞とで囲まれた情報）が、映像ＩＤが「ｍ１００１」である映像情報の取得先となる映像サーバ装置３のアドレス情報（ここではＵＲＬ）であるとする。また、「ｉｄ」属性の属性値が「ｃａｐｔｉｏｎｓｅｒｖｅｒ」である「ｐ」タグの要素が、映像ＩＤが「ｍ１００１」である映像情報に対応する字幕情報の取得先となる字幕サーバ装置４のアドレス情報（ここではＵＲＬ）であるとする。また、「ｉｄ」属性の属性値が「ｓｉｇｎ＿ｌａｎｇｕａｇｅｓｅｒｖｅｒ」である「ｐ」タグの要素が、映像ＩＤが「ｍ１００１」である映像情報に対応する手話映像情報の取得先となる手話映像サーバ装置５のアドレス情報（ここではＵＲＬ）であるとする。さらに、「ｉｄ」属性の属性値が「ｖｉｄｅｏｔｙｐｅ」である「ｐ」タグの要素が、映像ＩＤが「ｍ１００１」である映像情報の種類を示す情報であり、要素「ｒｅｃｏｒｄｅｄｖｉｄｅｏ」は、ライブ（生）でない映像情報であることを示す。 FIG. 13 is a diagram illustrating an example of the control information received by the control information receiving unit 112; here, the control information “ctrl001” is shown. The control information is assumed to be described in the XHTML format. The element of the “p” tag whose “id” attribute value is “videoserver” (that is, the information enclosed by <p id="videoserver"> and </p>) is the address information (here, a URL) of the video server device 3 from which the video information whose video ID is “m1001” is acquired. The element of the “p” tag whose “id” attribute value is “captionserver” is the address information (here, a URL) of the caption server device 4 from which the caption information corresponding to the video information whose video ID is “m1001” is acquired. The element of the “p” tag whose “id” attribute value is “sign_languageserver” is the address information (here, a URL) of the sign language video server device 5 from which the sign language video information corresponding to the video information whose video ID is “m1001” is acquired. Furthermore, the element of the “p” tag whose “id” attribute value is “videotype” is information indicating the type of the video information whose video ID is “m1001”, and the element “recordedvideo” indicates that the video information is not live.
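The control information described for FIG. 13 could be read into a lookup table keyed by the "id" attribute, as in the sketch below. The markup is a hypothetical reconstruction (FIG. 13 itself is not reproduced in this text); the "id" values and URLs are those quoted in the description, and the function name `parse_control_info` is an assumption.

```python
import xml.etree.ElementTree as ET

# Hypothetical reconstruction of control information "ctrl001".
CONTROL_INFO = """<html><body>
<p id="videoserver">http://www.video001.com/</p>
<p id="captionserver">http://www.caption002.com/</p>
<p id="sign_languageserver">http://www.www.handsign003.com/</p>
<p id="videotype">recordedvideo</p>
</body></html>"""


def parse_control_info(xhtml_text):
    """Map each <p> element's id attribute to its text content."""
    root = ET.fromstring(xhtml_text)
    return {
        p.get("id"): (p.text or "").strip()
        for p in root.iter("p")
        if p.get("id")
    }


info = parse_control_info(CONTROL_INFO)
```

Each request transmission unit can then pick its destination from this mapping, for example `info["captionserver"]` for the caption server device 4.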

制御情報受信部１１２が制御情報を受信すると、指定送信部１０９は、図１３に示した制御情報から、映像情報の種類を示す情報である「ｒｅｃｏｒｄｅｄｖｉｄｅｏ」を取得する。そして、この映像情報の種類を示す情報が、ライブ（生）である映像情報を示す「ｌｉｖｅｖｉｄｅｏ」という情報と一致するか否かを判断する。ここでは一致しないため、指定送信部１０９は、字幕タイミング情報と対応付けられた字幕情報の送信を指定する字幕指定情報の送信を決定する。そして、要求送信部１０８は、図１３に示した制御情報から、字幕サーバ装置４のアドレス情報である「ｈｔｔｐ：／／ｗｗｗ．ｃａｐｔｉｏｎ００２．ｃｏｍ／」を取得し、このアドレス情報が示す字幕サーバ装置４に、字幕タイミング情報と対応付けられた字幕情報の送信を指定する字幕指定情報を送信する。 When the control information receiving unit 112 receives the control information, the designated transmitting unit 109 acquires “recorded video” that is information indicating the type of video information from the control information illustrated in FIG. 13. Then, it is determined whether or not the information indicating the type of the video information matches the information “live video” indicating the live (live) video information. Since there is no match here, the designation transmission unit 109 determines transmission of caption designation information that designates transmission of caption information associated with the caption timing information. Then, the request transmission unit 108 acquires “http://www.caption002.com/” which is the address information of the caption server device 4 from the control information shown in FIG. 13, and the caption server device indicated by the address information 4, subtitle designation information for designating transmission of subtitle information associated with subtitle timing information is transmitted.
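The decision made by the designation transmission unit 109 reduces to a comparison with the string "livevideo". The sketch below is illustrative only: the function names and the dictionary layout of the designation information are assumptions, and the handling of the live case (requesting captions without timing information) is inferred, since this passage only walks through the recorded case.

```python
def wants_timed_captions(video_type):
    """Recorded video is designated to receive captions with timing information."""
    return video_type != "livevideo"


def build_designation(video_type):
    # Hypothetical payload layout for the subtitle designation information.
    return {"with_timing": wants_timed_captions(video_type)}
```

For the control information in the example, `build_designation("recordedvideo")` requests captions associated with subtitle timing information.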

The video request information transmission unit 110 acquires “http://www.video001.com/”, the address information of the video server device 3, from the received control information shown in FIG. 13. It then transmits video request information including the video ID “m1001” to the video server device 3 indicated by this address information. In other words, it transmits information requesting transmission of the video information whose video ID is “m1001”.

The request transmission unit 108 also acquires “http://www.caption002.com/”, the address information of the caption server device 4, from the control information shown in FIG. 13, and transmits caption request information including the video ID “m1001” to the caption server device 4 indicated by this address information.

Likewise, a sign language video request information transmission unit (not shown) acquires “http://www.www.handsign003.com/”, the address information of the sign language video server device 5, from the received control information shown in FIG. 13, and transmits sign language video request information including the video ID “m1001” to the sign language video server device 5 indicated by this address information.

The video server side receiving unit 302 of the video server device 3 receives the video request information including the video ID “m1001” transmitted from the video output device 1. The video server side transmission unit 303 refers to the video information management table shown in FIG. 6, detects the video information associated with the video ID “m1001” in the video server side storage unit 301, divides the frame images constituting this video information and the time codes associated with them into packets in ascending order of time code, and transmits the packets to the video output device 1.

The video receiving unit 101 of the video output device 1 receives the video information transmitted in packets from the video server device 3, reconstructs the frame images associated with time codes from one or more received packets, and accumulates them sequentially in the video storage unit 103. For example, the pair of the frame image 71 and the time code 71a in FIG. 7, the pair of the frame image 72 and the time code 72a, and so on are stored in the video storage unit 103 in ascending order of time code.

The caption server side receiving unit 402 of the caption server device 4 receives the caption designation information transmitted from the video output device 1 and sets the caption timing setting information accordingly. Here, the caption server side receiving unit 402 receives from the video output device 1 caption designation information that designates transmission of caption information associated with caption timing information. It therefore updates the caption timing setting information so that caption information associated with caption timing information will be transmitted.

The caption server side receiving unit 402 of the caption server device 4 also receives the caption request information including the video ID “m1001” transmitted from the video output device 1. The caption server side transmission unit 403 refers to the caption group management information shown in FIG. 8 and detects, in the caption server side storage unit 401, the caption group information associated with the video ID “m1001”; here, it detects the caption group information whose file name is “c1001”. Because, as described above, the caption timing setting information is set so that caption information associated with caption timing information is transmitted, the caption server side transmission unit 403 reads from the caption group information “c1001” shown in FIG. 9 a predetermined number of pairs of caption information and its associated caption timing information, in output order. The predetermined number is assumed here to be one pair. For example, from the caption group information with file name “c1001” shown in FIG. 9, the caption server side transmission unit 403 first reads the character string “日本の達人シリーズ” (“Japanese Master Series”), the element enclosed by the topmost “p” tag, as the caption information. It also acquires the value “0:0:20.01” of the “begin” attribute and the value “0:0:24.01” of the “end” attribute of the topmost “p” tag as the caption timing information. The caption server side transmission unit 403 then transmits caption set information consisting of the read caption information and caption timing information to the video output device 1.

Each time it reads caption information, the caption server side transmission unit 403 adds to a cumulative count of the caption information read so far; the next time it transmits caption information, it reads the caption information at the position given by the count value plus one. For example, when the caption server side receiving unit 402 receives further caption request information associated with the video ID “m1001”, the caption server side transmission unit 403 adds “1” to the previous count value “1” to obtain “2”, acquires the second pair from the top of the caption group information shown in FIG. 9, consisting of caption information and its associated caption timing information, and transmits it to the video output device 1.
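The per-request reading with a cumulative count could be sketched as below. The XML wrapper element, the second caption entry, and the class name `CaptionGroupReader` are assumptions for illustration; only the first entry's text and its “begin” and “end” values are taken from the description of FIG. 9.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for the caption group information "c1001" (FIG. 9).
# The first entry follows the description; the second one is invented.
CAPTION_GROUP = """<div>
  <p begin="0:0:20.01" end="0:0:24.01">日本の達人シリーズ</p>
  <p begin="0:0:25.01" end="0:0:30.01">second caption (hypothetical)</p>
</div>"""


class CaptionGroupReader:
    """Hands out caption entries in output order, a fixed number per request."""

    def __init__(self, xml_text, with_timing=True):
        self._entries = ET.fromstring(xml_text).findall("p")
        self._count = 0  # cumulative number of entries already read
        self.with_timing = with_timing

    def next_sets(self, n=1):
        """Read the next n entries, starting just after the counted ones."""
        chunk = self._entries[self._count:self._count + n]
        self._count += len(chunk)
        result = []
        for p in chunk:
            item = {"caption": p.text}
            if self.with_timing:
                # Timing is attached only when the client designated it.
                item["begin"] = p.get("begin")
                item["end"] = p.get("end")
            result.append(item)
        return result


reader = CaptionGroupReader(CAPTION_GROUP)
print(reader.next_sets())  # first request: topmost entry
print(reader.next_sets())  # second request: second entry
```

Setting `with_timing=False` models the live-video case, in which the same entries are returned without their timing attributes.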

The caption receiving unit 102 of the video output device 1 receives the single piece of caption set information transmitted from the caption server device 4 and accumulates it in the caption storage unit 104.

FIG. 14 shows the caption set management table that manages the caption set information stored in the caption storage unit 104. FIG. 14(a) shows the state in which the first caption set information has been accumulated, and FIG. 14(b) shows the state after further caption set information has been appended. The table has the attributes “caption information”, “start time”, and “end time”. “Caption information” is the caption information itself, here text. “Start time” is caption timing information indicating when output of the caption starts, expressed as a time code. “End time” is caption timing information indicating when output of the caption ends, also expressed as a time code.

The sign language video server side receiving unit 502 of the sign language video server device 5 receives the sign language video request information including the video ID “m1001” transmitted from the video output device 1. The sign language video server side transmission unit 503 refers to the sign language video management information shown in FIG. 10 and detects, in the sign language video server side storage unit 501, the sign language video information associated with the video ID “m1001”; here, it detects the sign language video information whose “sign language ID” is “h1001”. It then divides the frame images constituting the detected sign language video information and the time codes associated with them into packets in ascending order of time code, and transmits the packets to the video output device 1. The frame images constituting the sign language video information may also be called sign language frame images.

The sign language video receiving unit 105 of the video output device 1 receives the sign language video information transmitted in packets from the sign language video server device 5, reconstructs the frame images of the sign language video information associated with time codes from one or more received packets, and accumulates them sequentially in a sign language video storage unit (not shown). For example, the pair of the frame image 81 and the time code 81a in FIG. 11, the pair of the frame image 82 and the time code 82a, and so on are stored sequentially in the sign language video storage unit.

Next, the output unit 113 determines whether it is time to output the video information. Since no frame image has been output yet, the output unit 113 determines whether the video storage unit 103 holds at least a predetermined storage amount of video information. If the predetermined storage amount is 2 seconds, the output unit 113 determines whether 2 seconds or more of video information is stored in the video storage unit 103. Assuming that 2 seconds or more of video information, that is, of frame images, is not yet stored, the output unit 113 determines that it is not yet time to output the video information and does not perform the output processing.

Next, the output status acquisition unit 107 acquires information indicating the output status, as follows. It acquires at least one of the storage amount of the video information stored in the video storage unit 103 and the storage amount of the caption information stored in the caption storage unit 104. Here it acquires the storage amount of the caption information, defined as the total time for which the caption information stored in the caption storage unit 104 will be output. For example, it sums, over the records (rows) of the caption set management table in FIG. 14(a), the value obtained by subtracting “start time” from “end time”. Since only one piece of caption information is stored here, the storage amount is 4 seconds. It then determines whether the acquired storage amount is equal to or greater than the threshold set for the caption storage unit 104. The threshold is assumed here to be 70% of the maximum storage amount of the caption storage unit 104, that is, 7 seconds for the 10-second maximum used in this example. This ratio may be changed manually or automatically according to the amount of caption information, the type of video information, and so on. Since 4 seconds is less than 7 seconds, the output status acquisition unit 107 determines that the storage amount of the caption information does not exceed the threshold and acquires, as the information indicating the storage status, information indicating that the storage amount does not exceed the threshold.
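The storage-amount calculation can be worked through in a short sketch. It assumes that the part of a time code after the decimal point is hundredths of a second (the description does not say whether it counts frames or fractions), and the second record of `table_b` uses invented start and end times chosen only so that its duration is the 5 seconds mentioned for FIG. 14(b). All function and variable names are hypothetical.

```python
def timecode_to_cs(timecode):
    """Convert 'h:mm:ss.ff' to an integer number of centiseconds.

    Assumption: the two digits after the point are hundredths of a second.
    """
    hours, minutes, seconds = timecode.split(":")
    whole, _, frac = seconds.partition(".")
    frac = (frac + "00")[:2]  # pad or truncate to two digits
    total_seconds = (int(hours) * 60 + int(minutes)) * 60 + int(whole)
    return total_seconds * 100 + int(frac)


def stored_amount_cs(table):
    """Sum of (end time - start time) over all records, in centiseconds."""
    return sum(
        timecode_to_cs(row["end"]) - timecode_to_cs(row["start"])
        for row in table
    )


MAX_STORAGE_CS = 10 * 100                   # 10-second maximum
THRESHOLD_CS = MAX_STORAGE_CS * 70 // 100   # 70% of the maximum = 7 seconds

# State of FIG. 14(a): a single caption lasting 4 seconds.
table_a = [
    {"caption": "日本の達人シリーズ", "start": "0:00:20.01", "end": "0:00:24.01"},
]
# State of FIG. 14(b): a second, 5-second caption (times are hypothetical).
table_b = table_a + [
    {"caption": "second caption", "start": "0:00:25.01", "end": "0:00:30.01"},
]

for name, table in (("a", table_a), ("b", table_b)):
    amount = stored_amount_cs(table)
    print(name, amount / 100, "s, at or above threshold:", amount >= THRESHOLD_CS)
```

Integer centiseconds are used instead of floating-point seconds so that the subtraction of two time codes is exact.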

Because the output status acquisition unit 107 has acquired information indicating that the storage amount of the caption information does not exceed the threshold, the request transmission unit 108 acquires the default information indicating the timing for transmitting caption request information, here a transmission interval, and determines whether the time elapsed since the previous caption request information was transmitted is equal to or greater than this interval. For example, each time it transmits caption request information, the request transmission unit 108 measures the time elapsed since that transmission with a clock or the like (not shown). Suppose the default transmission interval is 2 seconds. If 2 seconds have elapsed, the request transmission unit 108 determines that it is time to transmit caption request information requesting the next caption information and transmits it; otherwise it does not. Here, 2 seconds have not yet elapsed, so no caption request information is transmitted.

The video receiving unit 101 then repeats the process of receiving video information, and the sign language video receiving unit 105 repeats the process of receiving sign language video information.

The output status acquisition unit 107 likewise repeats the process of acquiring information indicating the output status.

The request transmission unit 108 determines whether it is time to transmit caption request information and, if so, transmits it.

The caption receiving unit 102 receives the caption set information transmitted in response to the caption request information.

When the storage amount of the video information in the video storage unit 103 reaches 2 seconds or more, the output unit 113 determines that it is time to output the video information and outputs it at the timing indicated by the time codes associated with it. Specifically, the output unit 113 determines whether a frame image is stored whose associated time code matches the current time indicated by a clock or the like (not shown) that was set to “0:00:00.01” when output of the video information started. If such a frame image exists, the output unit 113 determines that it is time to output the video information and outputs that frame image.

The processing by which the output unit 113 outputs a frame image of the video information is described in detail below.

The output unit 113 reads from the video storage unit 103 the frame image associated with the time code that matches the current time. The frame image that has been read is deleted from the video storage unit 103.

The output unit 113 also reads, from the sign language video storage unit (not shown), the frame image of the sign language video information associated with the same time code as the frame image to be output. For example, when the time code of the frame image of the video information to be output is “0:00:00.01”, the output unit 113 reads the frame image of the sign language video information associated with the matching time code. The frame image that has been read is deleted from the sign language video storage unit.

The output unit 113 also uses the caption set management table shown in FIG. 14(a) to detect, and read from the caption storage unit 104, the caption information associated with caption timing information indicating a period that includes the time code of the frame image to be output. For example, for each record (row) of the caption set management table shown in FIG. 14(a) in turn, it determines whether the time indicated by the time code of the frame image to be output falls between the “start time” value and the “end time” value, and if so, reads the “caption information” value of that record. If no such record is found, it determines that there is no caption information to output. Here, for example, the time code of the frame image to be output is “0:00:00.01”, and no caption information is associated with caption timing information indicating a period that includes this time code, so no caption information is read.
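The record-by-record search can be sketched as follows. The sketch assumes that the fractional part of a time code is hundredths of a second and that the period is half-open, so a caption stops being selected once the frame time reaches its end time; the description says only that the time must lie between the two values. The function names are hypothetical.

```python
def timecode_to_cs(timecode):
    """Convert 'h:mm:ss.ff' to centiseconds (fraction assumed to be 1/100 s)."""
    hours, minutes, seconds = timecode.split(":")
    whole, _, frac = seconds.partition(".")
    frac = (frac + "00")[:2]
    return ((int(hours) * 60 + int(minutes)) * 60 + int(whole)) * 100 + int(frac)


def find_caption(table, frame_timecode):
    """Return the caption whose period contains the frame time, else None."""
    t = timecode_to_cs(frame_timecode)
    for row in table:
        # Assumed half-open period: start <= t < end.
        if timecode_to_cs(row["start"]) <= t < timecode_to_cs(row["end"]):
            return row["caption"]
    return None  # no caption to output for this frame


# Caption set management table in the state of FIG. 14(a).
table = [
    {"caption": "日本の達人シリーズ", "start": "0:00:20.01", "end": "0:00:24.01"},
]

print(find_caption(table, "0:00:00.01"))  # no caption yet
print(find_caption(table, "0:00:20.01"))  # caption period has started
```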

The output unit 113 then composes and outputs a frame image that combines the frame image of the video information and the frame image of the sign language video information acquired above. The sign language video information is displayed, for example, in a predetermined small region within the video information. The output here is, for example, display on the monitor 113a connected to the output unit 113.

FIG. 15 shows a display example of the video information output by the output unit 113. Since no caption information has been acquired here, one frame image of the sign language video information 152 is displayed in a partial region on one frame image of the video information 151.

In the same way, as the current time advances, the output unit 113 outputs new frame images that combine the video information, the sign language video information, and so on. Specifically, it replaces the frame image output immediately before with the new frame image.

Suppose now that the current time has become “0:00:20.01” and that the output unit 113 has detected, in the video storage unit 103, the frame image of the video information associated with the time code corresponding to the current time “0:00:20.01”. As above, the output unit 113 reads the frame image of the video information and the frame image of the sign language video information associated with the time code corresponding to the current time.

The output unit 113 also uses the caption set management table shown in FIG. 14(a) to detect, and read from the caption storage unit 104, the caption information associated with caption timing information indicating a period that includes the time code “0:00:20.01”. Here it determines that the time code “0:00:20.01” of the frame image to be output falls between the “start time” value “0:00:20.01” and the “end time” value “0:00:24.01” of the first record of the caption set management table shown in FIG. 14(a). The output unit 113 therefore reads the “caption information” value of this record, “日本の達人シリーズ” (“Japanese Master Series”).

The output unit 113 then combines the frame image of the video information acquired above, the frame image of the sign language video information, and the caption information “日本の達人シリーズ” (“Japanese Master Series”), and outputs the result to the monitor 113a.

FIG. 16 shows a display example of the video information output by the output unit 113 in this case.

In accordance with its associated caption timing information, the caption information output here continues to be combined with the frame images of the video information and so on, and output repeatedly, until the current time reaches “0:00:24.01”, even as the frame images of the video information and the sign language video information change. The caption information is deleted from the caption storage unit 104 when the current time matches the “end time” of “0:00:24.01”.

Suppose now that new caption set information has been accumulated in the caption storage unit 104, so that the caption set management table has been updated from the state shown in FIG. 14(a) to the state shown in FIG. 14(b), and that the output status acquisition unit 107 performs the process of acquiring information indicating the output status in the same way as above.

Specifically, to acquire the storage amount of the caption information stored in the caption storage unit 104, the output status acquisition unit 107 first sums, over the records (rows) of the caption set management table in FIG. 14(b), the value obtained by subtracting “start time” from “end time”. Since two pieces of caption information are stored here, the storage amount is 4 seconds + 5 seconds = 9 seconds. It then determines whether the acquired storage amount is equal to or greater than the threshold set for the caption storage unit 104. Since the threshold is 7 seconds, as described above, the output status acquisition unit 107 determines that the storage amount of the caption information has exceeded the threshold and acquires, as the information indicating the storage status, information indicating that the storage amount has exceeded the threshold.

Because the output status acquisition unit 107 has thus acquired information indicating that the storage amount of the caption information has exceeded the threshold, the request transmission unit 108 acquires new information indicating the timing for transmitting caption request information, here a transmission interval. In this example, it takes 50% of the maximum storage amount of caption information as the new interval. This ratio may be changed manually or automatically according to the amount of caption information, the type of video information, and so on. Since the maximum storage amount here is 10 seconds, 5 seconds is acquired as the new information indicating the timing, that is, as the new transmission interval.

The request transmission unit 108 then determines whether the time elapsed since the previous caption request information was transmitted is equal to or greater than the new transmission interval of 5 seconds. If it is, the request transmission unit 108 transmits caption request information; if it is less than the transmission interval, it does not.
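The interval selection and the elapsed-time check described above can be sketched with the numbers from this example (2-second default, 10-second maximum, 70% threshold, 50% relaxed interval). The function and constant names are hypothetical.

```python
DEFAULT_INTERVAL_S = 2.0   # default transmission interval
MAX_STORAGE_S = 10.0       # maximum caption storage amount
THRESHOLD_PERCENT = 70     # threshold as a share of the maximum
RELAXED_PERCENT = 50       # relaxed interval as a share of the maximum


def request_interval(stored_s):
    """Interval between caption requests for the current storage amount."""
    threshold_s = MAX_STORAGE_S * THRESHOLD_PERCENT / 100  # 7 seconds
    if stored_s >= threshold_s:
        # Enough captions are buffered, so requests can be spaced out.
        return MAX_STORAGE_S * RELAXED_PERCENT / 100       # 5 seconds
    return DEFAULT_INTERVAL_S


def should_send_request(stored_s, elapsed_s):
    """True when the time since the last request has reached the interval."""
    return elapsed_s >= request_interval(stored_s)


# FIG. 14(a): 4 seconds buffered, so the 2-second default applies.
print(request_interval(4.0), should_send_request(4.0, elapsed_s=3.0))
# FIG. 14(b): 9 seconds buffered, so the interval is relaxed to 5 seconds.
print(request_interval(9.0), should_send_request(9.0, elapsed_s=3.0))
```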

As a result, when the caption storage unit 104 already holds caption information covering a sufficient output period, delaying the transmission of caption request information reduces both the amount of caption information stored and the amount transmitted and received, to the extent that output of the caption information is not delayed.

The transmission timing of caption request information, which is changed according to the output status, is preferably such that the interval increases linearly or in steps as the storage amount of caption information increases.
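Two ways of letting the interval grow with the storage amount are sketched below, one linear and one stepwise. The description states only the preference, so the interval bounds, the intermediate step at 4 seconds, and the function names are all assumptions chosen to be consistent with the 2-second and 5-second intervals used in the example.

```python
def linear_interval(stored_s, min_interval=2.0, max_interval=5.0, max_storage=10.0):
    """Interval that grows in proportion to the storage amount."""
    ratio = min(max(stored_s / max_storage, 0.0), 1.0)  # clamp to [0, 1]
    return min_interval + (max_interval - min_interval) * ratio


def stepwise_interval(stored_s, steps=((7.0, 5.0), (4.0, 3.0)), default=2.0):
    """Interval that jumps at fixed storage amounts.

    steps holds (lower bound in seconds, interval in seconds) pairs,
    ordered from the highest lower bound to the lowest.
    """
    for lower_bound, interval in steps:
        if stored_s >= lower_bound:
            return interval
    return default


for stored in (1.0, 5.0, 9.0):
    print(stored, linear_interval(stored), stepwise_interval(stored))
```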

Suppose instead that the information indicating the type of video information in the control information received by the control information receiving unit 112, shown in FIG. 13, were not “recordedvideo” as above but matched “livevideo”, which indicates live video information. In that case, the designation transmission unit 109 decides to transmit caption designation information that designates transmission of caption information not associated with caption timing information. The request transmission unit 108 then acquires “http://www.caption002.com/”, the address information of the caption server device 4, from the control information shown in FIG. 13, and transmits this caption designation information to the caption server device 4 indicated by that address information.

When the caption server side receiving unit 402 receives from the video output device 1 caption designation information that designates transmission of caption information not associated with caption timing information, it updates the caption timing setting information so that caption information not associated with caption timing information will be transmitted.

In this case, when the caption server side receiving unit 402 of the caption server device 4 receives caption request information including the video ID “m1001” from the video output device 1, the caption server side transmission unit 403, as above, refers to the caption group management information shown in FIG. 8 and detects, in the caption server side storage unit 401, the caption group information associated with the video ID “m1001”. However, because the caption timing setting information is now set so that caption information not associated with caption timing information is transmitted, the caption server side transmission unit 403 reads a predetermined number of pieces of caption information from the detected caption group information in output order but does not read the caption timing information. The caption server side transmission unit 403 then transmits the read caption information to the video output device 1.

Because the caption information received by the caption receiving unit 102 is not associated with caption timing information, the caption receiving unit 102 accumulates the received caption information in the caption storage unit 104 without associating it with caption timing information. The accumulated caption information is read out sequentially, in the order of accumulation, at a predesignated timing (for example, at 2-second intervals), combined with frame images, and output. The period for which output of each piece of caption information continues is, for example, a predesignated time. That is, until the predesignated time elapses, the caption information is combined with each frame image of the video information and output, and when the time has elapsed, the caption information is deleted from the caption storage unit 104. When the predesignated timing for reading caption information is a time interval, this interval is normally set to be at least as long as the time for which output of the caption information continues. Alternatively, the caption receiving unit 102 may combine the received caption information with the video information and output it immediately after reception, without accumulating it in the caption storage unit 104.
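The relation between the read interval and the display duration can be sketched as a simple schedule. This is a minimal illustration: the function name `schedule_untimed` and the default values are assumptions (the 2-second interval follows the example in the text, and the 1.5-second duration is arbitrary).

```python
def schedule_untimed(captions, read_interval=2.0, display_duration=1.5, start=0.0):
    """Return (show_at, hide_at, text) tuples for captions without timing info.

    Captions are shown in accumulation order, one every `read_interval`
    seconds, each for `display_duration` seconds. The interval is required
    to be at least as long as the display duration, as the text describes.
    """
    if display_duration > read_interval:
        raise ValueError("read_interval must be at least display_duration")
    schedule = []
    for i, text in enumerate(captions):
        show_at = start + i * read_interval
        schedule.append((show_at, show_at + display_duration, text))
    return schedule


plan = schedule_untimed(["Hello", "Welcome"])
```

Each caption would be deleted from the store at its `hide_at` time; keeping the interval no shorter than the duration ensures that two captions are never on screen at once.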

In this way, when it is difficult to match the output timing of the caption information to the output timing of the video information, as with live video information, the caption information can be output with as little delay as possible relative to the video information. In addition, if the caption server device 4 stores caption timing information indicating, for example, when each caption was created, in association with the caption information in the caption server side storage unit 401 or the like, then when the video information is later provided as recorded video information, the output timing of the caption information can be matched to the output timing of the video information using the caption timing information.

As described above, according to the present embodiment, using the caption timing information makes it possible to output the video information and the character string information indicating the captions with their timing appropriately matched.

In the above embodiment, the configuration of the control server device 2 may be incorporated into the video output device 1. In this case, for example, the control request information transmission unit 111 and the control server side receiving unit 202, and the control information receiving unit 112 and the control server side transmission unit 203, may each be connected directly using a signal line, a bus, or the like as appropriate.

In the above embodiment, an example was described in which each piece of caption information stored in the caption storage unit 104 is deleted immediately after output. However, caption information may be kept in the caption storage unit 104 after its output ends, for example until a predesignated timing is reached. When a reception unit (not shown) or the like accepts an operation to rewind the output of the output unit 113, the output unit 113 may read from the caption storage unit 104, using the caption timing information associated with the caption information, the caption information corresponding to the past output time designated by the rewind operation, and output it. For example, the output unit 113 reads from the caption storage unit 104 and outputs caption information associated with caption timing information that indicates a time matching the past output time designated by the rewind operation, or that designates a period including this past output time. The operation to rewind the output may be an operation to rewind the video information or an operation to rewind the caption information. When it is an operation to rewind the video information, the video information designated by the operation may also be read, combined with the caption information, and output. Caption information whose output has ended may be kept in the caption storage unit 104 until a predesignated fixed time has elapsed since the end of output, or until a time set according to the maximum storage amount of the caption storage unit 104 has elapsed since the end of output. Alternatively, it may be kept until a predesignated trigger occurs (for example, until the next caption request information is transmitted). Rewinding here may be understood as a concept that also includes reverse playback and the like.
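The rewind lookup and the time-based retention described above can be sketched as follows. This is a hedged illustration: `StoredCaption`, `find_for_rewind`, and `purge_expired` are hypothetical names, and times are assumed to be seconds from the start of output.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class StoredCaption:
    """Hypothetical entry kept in the caption store after output."""
    text: str
    start: float                 # time indicated by the caption timing information
    end: Optional[float] = None  # None when the timing is a single instant


def find_for_rewind(store: List[StoredCaption], past_time: float) -> List[str]:
    """Return captions whose timing matches, or whose period includes, past_time."""
    hits = []
    for caption in store:
        if caption.end is None:
            if caption.start == past_time:
                hits.append(caption.text)
        elif caption.start <= past_time <= caption.end:
            hits.append(caption.text)
    return hits


def purge_expired(store: List[StoredCaption], now: float, retention: float) -> List[StoredCaption]:
    """Keep only captions whose output ended less than `retention` seconds ago."""
    def output_end(caption: StoredCaption) -> float:
        return caption.end if caption.end is not None else caption.start
    return [c for c in store if output_end(c) + retention > now]


store = [StoredCaption("Good evening", 0.0, 2.0), StoredCaption("Tonight's news", 2.5, 5.0)]
found = find_for_rewind(store, 3.0)
```

In this sketch retention is governed by elapsed time only; the trigger-based variant mentioned in the text (for example, discarding on the next caption request) would replace `purge_expired` with a call made when that trigger fires.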

When caption information is kept in the caption storage unit 104 after its output ends and an operation to rewind the output is accepted, the output unit 113 may output only the caption information. For example, when the video information is streaming data, frame images and the like are often deleted after output for reasons such as copyright protection. To rewind the output, the video information must then be acquired again from the video server device 3, so it takes time before it is output. By contrast, if the caption information is kept in the caption storage unit 104 and only the caption information is rewound and output, it suffices to read the past caption information stored in the caption storage unit 104, so the caption information can be output in a short time. Therefore, even when the user has missed a caption that was output immediately before, the caption alone can be rewound and output almost instantly.

In the above embodiment, each process (or each function) may be realized by centralized processing in a single device (or system), or by distributed processing across a plurality of devices.

In the above embodiment, it goes without saying that two or more communication means present in one device (for example, the video receiving unit and the sign language video receiving unit) may be physically realized by a single medium.

In the above embodiment, the case where the video output device is stand-alone was described; however, the video output device may be a stand-alone device or a server device in a server-client system. In the latter case, the output unit and the reception unit receive input and output screens via a communication line.

In the above embodiment, each component may be configured by dedicated hardware, or a component that can be realized by software may be realized by executing a program. For example, each component can be realized by a program execution unit such as a CPU reading and executing a software program recorded on a recording medium such as a hard disk or a semiconductor memory. During execution, the program execution unit may execute the program while accessing a storage unit (for example, a recording medium such as a hard disk or a memory).

The software that realizes the video output device in the above embodiment is a program such as the following. That is, this program causes a computer to function as: a video receiving unit that receives video information from a video server device; a caption receiving unit that receives, from a caption server device, one or more pieces of caption information, each being character string information indicating a caption, and caption timing information indicating the timing at which each piece of caption information is output relative to the output of the video information; and an output unit that combines the one or more pieces of caption information received by the caption receiving unit with the video information received by the video receiving unit, using the caption timing information associated with each piece of caption information, and outputs the result.

The functions realized by the above program do not include functions that can be realized only by hardware. For example, functions that can be realized only by hardware such as a modem or an interface card, in an acquisition unit that acquires information or an output unit that outputs information, are not included in the functions realized by the program.

The program may be executed by a single computer or by a plurality of computers. That is, centralized processing or distributed processing may be performed.

FIG. 17 is a schematic diagram showing an example of the external appearance of a computer that executes the above program to realize the video output device according to the above embodiment. The above embodiment can be realized by computer hardware and a computer program executed on it.

In FIG. 17, a computer system 900 includes a computer 901 that has a CD-ROM (Compact Disk Read Only Memory) drive 905 and an FD (Floppy (registered trademark) Disk) drive 906, a keyboard 902, a mouse 903, and a monitor 904.

FIG. 18 is a diagram showing the internal configuration of the computer system 900. In FIG. 18, in addition to the CD-ROM drive 905 and the FD drive 906, the computer 901 includes: an MPU (Micro Processing Unit) 911; a ROM 912 for storing programs such as a boot-up program; a RAM (Random Access Memory) 913 that is connected to the MPU 911, temporarily stores the instructions of application programs, and provides a temporary storage space; a hard disk 914 that stores application programs, system programs, and data; and a bus 915 that interconnects the MPU 911, the ROM 912, and the like. The computer 901 may also include a network card (not shown) that provides a connection to a LAN.

A program that causes the computer system 900 to execute the functions of the video output device and the like according to the above embodiment may be stored on a CD-ROM 921 or an FD 922, which is inserted into the CD-ROM drive 905 or the FD drive 906, and transferred to the hard disk 914. Alternatively, the program may be transmitted to the computer 901 via a network (not shown) and stored on the hard disk 914. The program is loaded into the RAM 913 when it is executed. The program may also be loaded directly from the CD-ROM 921, the FD 922, or the network.

The program does not necessarily have to include an operating system (OS), a third-party program, or the like that causes the computer 901 to execute the functions of the video output device according to the above embodiment. The program may include only those instructions that call appropriate functions (modules) in a controlled manner so that the desired result is obtained. How the computer system 900 operates is well known, and a detailed description is omitted.

The present invention is not limited to the above embodiment; various modifications are possible, and it goes without saying that these are also included within the scope of the present invention.

As described above, the video output device and the like according to the present invention are suitable as devices that output video information, and are particularly useful as devices that output video information together with caption information.

Description of Symbols
1 Video output device
2 Control server device
3 Video server device
4 Caption server device
5 Sign language video server device
101 Video receiving unit
102 Caption receiving unit
103 Video storage unit
104 Caption storage unit
105 Sign language video receiving unit
106 Storage amount setting unit
107 Output status acquisition unit
108 Request transmission unit
109 Designated transmission unit
110 Video request information transmission unit
111 Control request information transmission unit
112 Control information receiving unit
113 Output unit
201 Control information storage unit
202 Control server side receiving unit
203 Control server side transmission unit
301 Video server side storage unit
302 Video server side receiving unit
303 Video server side transmission unit
401 Caption server side storage unit
402 Caption server side receiving unit
403 Caption server side transmission unit
501 Sign language video server side storage unit
502 Sign language video server side receiving unit
503 Sign language video server side transmission unit
1000 Video system

Claims

A video output device comprising:
a video receiving unit that receives video information from a video server device;
a caption receiving unit that receives, from a caption server device, one or more pieces of caption information, each being character string information indicating a caption, and caption timing information indicating the timing at which each piece of caption information is output relative to the output of the video information; and
an output unit that combines the one or more pieces of caption information received by the caption receiving unit with the video information received by the video receiving unit, using the caption timing information associated with each piece of caption information, and outputs the result.

The video output device according to claim 1, wherein the output unit acquires, in accordance with the output of the video information, information indicating the output location of the video information, and outputs the caption information associated with the caption timing information corresponding to the information indicating the output location.

The video output device according to claim 1, wherein:
the video receiving unit receives video information that has a plurality of frame images and in which video timing information indicating the timing of outputting each frame image is associated with that frame image;
the caption timing information is information designating one or more pieces of video timing information; and
when outputting a frame image, the output unit outputs the caption information associated with the caption timing information that designates the video timing information associated with that frame image.

The video output device according to any one of claims 1 to 3, further comprising:
a video storage unit that temporarily stores the video information received by the video receiving unit; and
a caption storage unit that temporarily stores the caption information received by the caption receiving unit, wherein
the video receiving unit accumulates the received video information in the video storage unit,
the caption receiving unit accumulates the received caption information in the caption storage unit, and
the output unit reads out and outputs the video information stored in the video storage unit and the caption information stored in the caption storage unit.

The video output device according to claim 4, further comprising:
an output status acquisition unit that acquires information indicating the output status of at least one of the video information and the caption information output by the output unit; and
a request transmission unit that transmits, to the caption server device, caption request information, which is information requesting transmission of caption information, in accordance with the information indicating the output status acquired by the output status acquisition unit,
wherein the caption server device transmits caption information in response to the caption request information.

6. The video output device wherein the output status acquisition unit acquires, as the information indicating the output status, information indicating at least one of the storage status of video information in the video storage unit and the storage status of caption information in the caption storage unit.

The video output device according to any one of claims 4 to 6, wherein the maximum storage amount of video information in the video storage unit and the maximum storage amount of caption information in the caption storage unit differ from each other.

The video output device according to claim 4, further comprising a storage amount setting unit that sets the maximum storage amount of video information stored in the video storage unit and the maximum storage amount of caption information stored in the caption storage unit to values according to the processing capability of the device itself.

Item 9. The video output device according to any one of Items 8 to 9, further comprising a designated transmission unit that transmits, to the caption server device, information designating transmission of one of caption information associated with caption timing information and caption information not associated with caption timing information, depending on the type of video information received by the video receiving unit, wherein
the caption receiving unit further receives caption information that is transmitted by the caption server device in response to the information designating the transmission and is not associated with caption timing information, and
when the caption information received by the caption receiving unit is not associated with caption timing information, the output unit combines that caption information with the video information received by the video receiving unit and outputs the result.

The video output device according to claim 1, further comprising a sign language video receiving unit that receives, from a sign language video server device, sign language video information, which is information representing a sign language video that has a plurality of frame images and is associated with the video information received by the video receiving unit, and in which sign language timing information indicating the timing of outputting each frame image is associated with that frame image,
wherein the output unit further combines each frame image of the sign language video information received by the sign language video receiving unit with the video information received by the video receiving unit, using the sign language timing information corresponding to that frame image, and outputs the result.

The video output device according to claim 1, further comprising:
a control request information transmission unit that transmits, to a control server device, control request information, which is information requesting transmission of control information used for control related to the reception of video information and caption information; and
a control information receiving unit that receives, from the control server device, the control information transmitted in response to the control request information,
wherein the video information and the caption information are received using the control information.

A caption server device that transmits caption information to the video output device according to any one of claims 1 to 11, the caption server device comprising:
a caption server side storage unit in which one or more pieces of caption information and caption timing information indicating the timing of outputting each piece of caption information are stored in association with each other; and
a caption server side transmission unit that reads out the caption information stored in the caption server side storage unit and transmits the read caption information to the video output device.

A video output method performed using a video receiving unit, a caption receiving unit, and an output unit, the method comprising:
a video receiving step in which the video receiving unit receives video information from a video server device;
a caption receiving step in which the caption receiving unit receives, from a caption server device, one or more pieces of caption information, each being character string information indicating a caption, and caption timing information indicating the timing at which each piece of caption information is output relative to the output of the video information; and
an output step in which the output unit combines the one or more pieces of caption information received in the caption receiving step with the video information received in the video receiving step, using the caption timing information associated with each piece of caption information, and outputs the result.

A program for causing a computer to function as:
a video receiving unit that receives video information from a video server device;
a caption receiving unit that receives, from a caption server device, one or more pieces of caption information, each being character string information indicating a caption, and caption timing information indicating the timing at which each piece of caption information is output relative to the output of the video information; and
an output unit that combines the one or more pieces of caption information received by the caption receiving unit with the video information received by the video receiving unit, using the caption timing information associated with each piece of caption information, and outputs the result.