JP2004260711A

JP2004260711A - Method for creating video search database and recording medium

Info

Publication number: JP2004260711A
Application number: JP2003051284A
Authority: JP
Inventors: Sotoku Go; 宗▲徳▼ 呉
Original assignee: Institute for Information Industry
Current assignee: Institute for Information Industry
Priority date: 2003-02-27
Filing date: 2003-02-27
Publication date: 2004-09-16

Abstract

PROBLEM TO BE SOLVED: To provide a method for creating a video search database and a recording medium.
SOLUTION: Image data with superimposed characters (subtitles) are first divided into a plurality of shots. A plurality of key frames are extracted according to the video in the plurality of shots. An image browser search is set up from the plurality of key frames. Subsequently, a text region is captured according to image features in the image data. Text segmentation is performed on the text region, and a plurality of text features are generated. The plurality of text features are compared with text in a database, and text data are generated. A text index table is set up from the text data. Finally, the video search database is created from the image browser search and the text index table.
COPYRIGHT: (C) 2004, JPO & NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、語学教材製作に関するもので、特に、画像ブラウザ検索及びテキスト索引表を備える語学教材のインタラクティブビデオに関するものである。
【０００２】
【従来の技術】
国際化の趨勢下、様々な英語自己学習教材が発売され、中でも、インタラクティブ性が特に重視されている。しかし、語学教材は製作が難しく、かかる人力もコストも膨大で、又、消費者にとって、現在市販されているインタラクティブ式の画像語学教材は素材も限界があり、価格も高く、内容面から言っても、効果的に、しかも飽きずに学習を継続することが出来ない。
【０００３】
【発明が解決しようとする課題】
本発明は外国語ビデオをインタラクティブな語学教材ビデオに転換する、ビデオ検索データベースを作成する方法を提供することを目的とする。
【０００４】
【課題を解決するための手段】
上述の目的を達成するため、本発明はビデオ検索データベースを作成する方法を提供し、先ず、字幕が付いた画像データを複数のショットに分割する。複数のショット中のビデオに従って、複数のキーフレームを抽出する。複数のキーフレームにより、画像ブラウザ検索を設定する。続いて、画像データ中の画像特徴に従って、テキスト領域をとらえる。テキスト領域に対してテキストの分割を行い、複数のテキスト特徴を生成する。複数のテキスト特徴とデータベース中のテキストとを比較し、テキストデータを生成する。テキストデータにより、テキスト索引表を設定する。最後に、画像ブラウザ検索とテキスト索引表により、ビデオ検索データベースを作成する。
【０００５】
【発明の実施の形態】
上述した本発明の目的、特徴、及び長所をいっそう明瞭にするため、以下に本発明の好ましい実施の形態を挙げ、図を参照にしながらさらに詳しく説明する。
【０００６】
図１は、本発明の実施例による、ビデオ検索データベースを作成する方法のフローチャートであり、操作工程は以下の通りである。
【０００７】
先ず、工程Ｓ１０において、例えば、中国語字幕や英語字幕を含む外国語ビデオを分析し、工程Ｓ１２において、コンピューターシステムにより、自動的、且つ快速にショット検出を行い、ショット切り換えタイミングに従って、ビデオファイルを複数のショットに分割する。
【０００８】
工程Ｓ１４において、各ショット中、フレームの画面変化に基づいて、複数のキーフレームをとらえる。キーフレームは一ショットを代表する。工程Ｓ１６において、画像ブラウザ検索を設定し、一時間ちょっとのビデオは数千のショットを含み、キーフレームは一ショットを代表することにより、複数のキーフレームを用いて、速やかに画像ブラウザ検索の設定を完成することが出来、ユーザーはこのキーフレームの時間位置及び内容により、速やかに閲覧したいビデオ個所を探し出すことが出来る。
【０００９】
工程Ｓ２２において、画像データ中のビデオに対し強化を施し、即ち、ビデオ中のエッジを強化し、続いて、エッジ検出により、テキスト領域をとらえる。ビデオ中のイメージ部分のエッジは大きな弧度を備え、不規則で、ビデオ中の字幕部分のエッジは直線が多いため、この画像特徴に従って、画像中のテキスト領域をとらえる。
【００１０】
工程Ｓ２４において、テキスト領域はテキストの分割が行われ、テキスト領域を検出し、テキストの長さ、広さ、高さ、線の密度、構造に従って、テキスト領域中のテキストを複数のテキスト領域に分割し、ニ値化を用いて、それぞれのテキスト領域中の色を白黒の２色に分ける。一般のビデオ中のテキストは、大部分が複雑な画面上にあるため、複雑な背景を除去、つまりそれぞれのテキスト領域を白地に黒文字に転換して、テキストと背景を分け、テキスト特徴を生成する。
【００１１】
工程Ｓ２６において、それぞれのテキスト特徴とテキスト特徴データベース中のテキストを比較して、テキスト識別を実行し、類似のテキストを探し、テキストデータを作成する。
【００１２】
工程Ｓ２８において、テキストデータとビデオの対応関係を用いて、テキスト索引表を設定し、再生したいビデオ個所を探す。
【００１３】
工程Ｓ３２において、テキストデータにより、辞書データベースを作成する。
【００１４】
工程Ｓ３０において、画像ブラウザ検索とテキスト索引表を保存し、ビデオ検索データベースを作成する。
【００１５】
図２は本発明の実施例による、ビデオ検索データベースを示す図で、検索データベース４０中の画像ブラウザ検索５０により、ユーザーは画像ブラウザ検索中の任意のキーフレームを選択し、このキーフレームがあるビデオ個所を再生すると同時に、検索データベース４０中のテキスト索引表６０を用いて、テキスト索引表６０中の任意の検索フィールドを選択し、検索フィールドと対応するビデオ個所を再生することが出来る。この他、ユーザーはキーボード入力字幕個所を検索データとし、再生したいビデオを探すことが出来る。
【００１６】
また、ビデオを再生する時、辞書データベース機能を実行し、スクリーン画面を２つのウィンドウに分割し、ビデオを再生するのに用いられるビデオ再生区７５中のツールバー６７により、ビデオの一時停止、再生、早送り、巻き戻しの機能を操作することが出来る。
【００１７】
この他、字幕マスク６５により、ビデオ中のテキスト領域を遮蔽、即ちビデオ中の字幕を遮蔽し、字幕マスク６５を切り換え及び制御する機能により、字幕の遮蔽、或いは表示が選択できる。字幕ディスプレイ８０となるもう一つのウィンドウでビデオ中の字幕を表示し、字幕ディスプレイ８０中のテキストは、ビデオ中の字幕を分析して得られるテキストデータで、字幕ディスプレイ８０中のテキストデータを選択し、辞書データベース７０により、テキスト意義、語彙性質、熟語などを備えるテキストの注解を表示し、画像ディスプレイ８５で、字幕ディスプレイ８０中のテキストに対応するショットを表示する。
【００１８】
本発明が提供するビデオ検索データベースを作成する方法により、ユーザーは効果的に外国語ビデオをインタラクティブな語学教材に転換することが出来、ビデオ中の字幕に基づいて、索引データを作成し、ユーザーに最良のビデオブラウザ制御方法及び便利な語学教材道具を提供する。
【００１９】
本発明では好ましい実施例を前述の通り開示したが、これらは決して本発明に限定するものではなく、当該技術を熟知する者なら誰でも、本発明の精神と領域を脱しない範囲内で各種の変動や潤色を加えることができ、従って本発明の保護範囲は、特許請求の範囲で指定した内容を基準とする。
【００２０】
【発明の効果】
インタラクティブな語学ビデオ教材が得られる。
【図面の簡単な説明】
【図１】本発明の実施例によるビデオ検索データベースを作成する方法を示すフローチャートである。
【図２】本発明の実施例によるビデオ検索データベースを示す図である。
【符号の説明】
４０検索データベース
５０画像ブラウザ検索
６０テキスト索引表
６５字幕マスク
６７ツールバー
７０辞書データベース
７５ビデオ再生区
８０字幕ディスプレイ
８５画像ディスプレイ
[0001]
[Technical field of the invention]
The present invention relates to the production of language teaching materials, and more particularly to interactive language teaching video provided with an image browser search and a text index table.
[0002]
[Prior art]
With the trend toward internationalization, various self-study materials for English have been released, and interactivity is particularly emphasized. However, language teaching materials are difficult to produce and require enormous manpower and cost. For consumers, the interactive video language teaching materials currently on the market offer limited material, are expensive, and, in terms of content, do not allow learning to be continued effectively and without boredom.
[0003]
[Problems to be solved by the invention]
An object of the present invention is to provide a method for creating a video search database that converts foreign language videos into interactive language teaching videos.
[0004]
[Means for Solving the Problems]
In order to achieve the above object, the present invention provides a method for creating a video search database. First, image data with subtitles are divided into a plurality of shots. A plurality of key frames are extracted according to the video in the plurality of shots. An image browser search is set up from the plurality of key frames. Subsequently, a text region is captured according to image features in the image data. Text segmentation is performed on the text region to generate a plurality of text features. The plurality of text features are compared with text in a database to generate text data. A text index table is set up from the text data. Finally, a video search database is created from the image browser search and the text index table.
[0005]
[Mode for carrying out the invention]
In order to further clarify the objects, features and advantages of the present invention described above, preferred embodiments of the present invention will be described below with reference to the accompanying drawings.
[0006]
FIG. 1 is a flowchart of a method for creating a video search database according to an embodiment of the present invention, and the operation steps are as follows.
[0007]
First, in step S10, a foreign language video including, for example, Chinese subtitles and English subtitles is analyzed. In step S12, shot detection is performed automatically and quickly by a computer system, and the video file is divided into a plurality of shots according to the shot-switching timing.
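The description does not say how shot changes are detected in step S12. As a minimal sketch, assuming each frame is a flat list of 8-bit grayscale pixels and that a cut shows up as a large jump between the intensity histograms of consecutive frames (the names `histogram`, `split_into_shots`, and the threshold value are all hypothetical), the division into shots could look like this:

```python
from typing import List, Sequence, Tuple

Frame = Sequence[int]  # assumed representation: flat list of 8-bit grayscale pixels


def histogram(frame: Frame, bins: int = 16) -> List[float]:
    """Normalized intensity histogram of one frame."""
    counts = [0] * bins
    for pixel in frame:
        counts[pixel * bins // 256] += 1
    total = len(frame)
    return [c / total for c in counts]


def split_into_shots(frames: Sequence[Frame], threshold: float = 0.5) -> List[Tuple[int, int]]:
    """Return (start, end) frame index pairs, end exclusive, one pair per shot.

    A new shot starts wherever the L1 distance between consecutive
    histograms exceeds `threshold` (an assumed heuristic, not the
    patent's stated method).
    """
    if not frames:
        return []
    shots = []
    start = 0
    prev = histogram(frames[0])
    for i in range(1, len(frames)):
        cur = histogram(frames[i])
        diff = sum(abs(a - b) for a, b in zip(prev, cur))  # value in [0, 2]
        if diff > threshold:
            shots.append((start, i))
            start = i
        prev = cur
    shots.append((start, len(frames)))
    return shots


dark = [10] * 64
bright = [240] * 64
frames = [dark, dark, dark, bright, bright]
print(split_into_shots(frames))  # [(0, 3), (3, 5)]
```

Histogram comparison is only one common choice; it is cheap and tolerant of small motion, but a real system would likely need extra handling for fades and dissolves.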
[0008]
In step S14, a plurality of key frames are captured within each shot based on changes in the frame picture. A key frame represents one shot. In step S16, an image browser search is set up. A video of a little over one hour contains thousands of shots, and because a key frame represents one shot, the image browser search can be set up quickly using the plurality of key frames. The user can then quickly find the video location to be viewed based on the time position and content of the key frames.
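Step S14 can be illustrated with a small sketch. It assumes, beyond what the text states, that the first frame of each shot is always a key frame and that a further key frame is taken whenever the mean absolute pixel difference from the last key frame exceeds a threshold. The function names and the threshold are hypothetical.

```python
from typing import List, Sequence, Tuple

Frame = Sequence[int]  # assumed representation: flat list of 8-bit grayscale pixels


def mean_abs_diff(a: Frame, b: Frame) -> float:
    """Average absolute per-pixel difference between two frames of equal size."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def extract_key_frames(
    frames: Sequence[Frame],
    shots: Sequence[Tuple[int, int]],
    change_threshold: float = 30.0,
) -> List[int]:
    """Return indices of key frames: the first frame of every shot, plus any
    later frame in the shot that differs strongly from the last key frame."""
    key_frames = []
    for start, end in shots:
        last_key = start
        key_frames.append(start)
        for i in range(start + 1, end):
            if mean_abs_diff(frames[i], frames[last_key]) > change_threshold:
                key_frames.append(i)
                last_key = i
    return key_frames


frames = [[0] * 4, [5] * 4, [100] * 4, [102] * 4, [200] * 4, [201] * 4]
shots = [(0, 4), (4, 6)]
print(extract_key_frames(frames, shots))  # [0, 2, 4]
```

The key frame indices, together with their time positions, are what an image browser search would present to the user as thumbnails.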
[0009]
In step S22, the video in the image data is enhanced, that is, the edges in the video are enhanced, and a text region is then captured by edge detection. The edges of the picture portion of the video are strongly curved and irregular, whereas the edges of the subtitle portion consist mostly of straight lines. The text region in the image is therefore captured according to this image feature.
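The edge-based capture of step S22 can be approximated as follows. This sketch does not measure edge straightness as the text describes. As a simpler stand-in, it assumes that subtitle rows contain many strong horizontal intensity jumps (character strokes) and marks contiguous rows with a high edge density as a text band. All names and thresholds are hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed representation: rows of 8-bit grayscale pixels


def edge_density_per_row(img: Image, edge_threshold: int = 80) -> List[float]:
    """Fraction of horizontally adjacent pixel pairs in each row whose
    intensity difference exceeds `edge_threshold`."""
    densities = []
    for row in img:
        strong = sum(1 for a, b in zip(row, row[1:]) if abs(a - b) > edge_threshold)
        densities.append(strong / (len(row) - 1))
    return densities


def find_text_bands(img: Image, min_density: float = 0.3) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges, bottom exclusive, whose edge density
    stays at or above `min_density`."""
    bands = []
    top = None
    for y, density in enumerate(edge_density_per_row(img)):
        if density >= min_density and top is None:
            top = y
        elif density < min_density and top is not None:
            bands.append((top, y))
            top = None
    if top is not None:
        bands.append((top, len(img)))
    return bands


flat = [50] * 8
stripes = [0, 255, 0, 255, 0, 255, 0, 255]
image = [flat, flat, flat, stripes, stripes, flat]
print(find_text_bands(image))  # [(3, 5)]
```

A fuller implementation would first apply an edge-enhancement filter and then test the shape of the detected edges, as the paragraph above describes.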
[0010]
In step S24, text segmentation is performed on the text region: the text region is detected, and the text in it is divided into a plurality of text regions according to the length, width, height, line density, and structure of the text. Binarization is then used to reduce the colors in each text region to two, black and white. Because most text in ordinary video lies over a complex picture, the complex background is removed, that is, each text region is converted to black characters on a white background, which separates the text from the background and generates the text features.
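The binarization part of step S24 can be sketched as below. The thresholding rule is an assumption: the mean intensity is used as the threshold, and the smaller of the two pixel classes is taken to be the text, which is then drawn in black (0) on white (255). The function name is hypothetical.

```python
from typing import List, Optional

Region = List[List[int]]  # assumed representation: rows of 8-bit grayscale pixels


def binarize_text_region(region: Region, threshold: Optional[float] = None) -> Region:
    """Convert a text region to black text (0) on a white background (255)."""
    pixels = [p for row in region for p in row]
    if threshold is None:
        threshold = sum(pixels) / len(pixels)  # assumed rule: mean intensity
    bright = [[p > threshold for p in row] for row in region]
    bright_count = sum(v for row in bright for v in row)
    # Assume the text occupies fewer pixels than the background.
    text_is_bright = bright_count * 2 < len(pixels)
    return [[0 if v == text_is_bright else 255 for v in row] for row in bright]


region = [
    [30, 40, 35, 45],
    [30, 250, 240, 45],
    [30, 40, 35, 45],
]
for row in binarize_text_region(region):
    print(row)
# [255, 255, 255, 255]
# [255, 0, 0, 255]
# [255, 255, 255, 255]
```

Normalizing every region to the same polarity means the later comparison against stored text only has to deal with one form, regardless of whether the subtitles were light on dark or dark on light.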
[0011]
In step S26, each text feature is compared with the text in a text feature database to perform text recognition: similar text is searched for, and text data are created.
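The comparison in step S26 can be illustrated with nearest-template matching. The feature format is an assumption: each character is a small binary bitmap flattened into a tuple, and the closest database entry is chosen by Hamming distance. The 3x3 templates in `TEMPLATES` are toy data invented for the example.

```python
from typing import Dict, Sequence

Glyph = Sequence[int]  # assumed feature: flattened binary bitmap of one character


def hamming(a: Glyph, b: Glyph) -> int:
    """Number of positions at which two bitmaps differ."""
    return sum(x != y for x, y in zip(a, b))


def recognize(glyphs: Sequence[Glyph], templates: Dict[str, Glyph]) -> str:
    """Map each glyph to the most similar template character."""
    chars = []
    for glyph in glyphs:
        best = min(templates, key=lambda ch: hamming(glyph, templates[ch]))
        chars.append(best)
    return "".join(chars)


# Toy 3x3 templates, invented for illustration only.
TEMPLATES: Dict[str, Glyph] = {
    "I": (0, 1, 0, 0, 1, 0, 0, 1, 0),
    "L": (1, 0, 0, 1, 0, 0, 1, 1, 1),
    "T": (1, 1, 1, 0, 1, 0, 0, 1, 0),
}

noisy_l = (1, 0, 0, 1, 0, 0, 1, 1, 0)  # "L" with one flipped pixel
print(recognize([TEMPLATES["I"], noisy_l, TEMPLATES["T"]], TEMPLATES))  # ILT
```

Choosing the nearest entry rather than requiring an exact match corresponds to the search for similar text mentioned above, since binarized subtitle characters are rarely pixel-perfect.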
[0012]
In step S28, a text index table is set up using the correspondence between the text data and the video, so that the video location to be played can be found.
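One plausible shape for the text index table of step S28 is an inverted index from each recognized word to the times at which it appears. The sketch below assumes the text data arrive as (start time in seconds, subtitle text) pairs. The function names and the sample subtitles are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple


def build_text_index(subtitles: Sequence[Tuple[float, str]]) -> Dict[str, List[float]]:
    """Map each lower-cased word to the start times of the subtitles containing it."""
    index = defaultdict(list)
    for start, text in subtitles:
        words = {w.strip(".,!?;:").lower() for w in text.split()}
        for word in words:
            if word:
                index[word].append(start)
    return dict(index)


def lookup(index: Dict[str, List[float]], query: str) -> List[float]:
    """Return the video positions (in seconds) where `query` occurs."""
    return index.get(query.lower(), [])


subtitles = [(12.0, "Good morning"), (47.5, "Good night, John")]
index = build_text_index(subtitles)
print(lookup(index, "Good"))   # [12.0, 47.5]
print(lookup(index, "night"))  # [47.5]
```

Selecting an entry in the table, or typing a word on the keyboard, then reduces to one dictionary lookup followed by a seek to the returned time.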
[0013]
In step S32, a dictionary database is created from the text data.
[0014]
In step S30, the image browser search and the text index table are saved, and a video search database is created.
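The storage format used in step S30 is not specified. As a sketch under that caveat, the two parts could be combined into one dictionary and saved as JSON. The field names `image_browser` and `text_index` and both function names are hypothetical.

```python
import json
from typing import Dict, List


def build_search_database(
    key_frame_times: List[float],
    text_index: Dict[str, List[float]],
) -> dict:
    """Combine the image browser search and the text index table into one record."""
    return {
        "image_browser": [
            {"key_frame": i, "time": t} for i, t in enumerate(key_frame_times)
        ],
        "text_index": {word: sorted(times) for word, times in text_index.items()},
    }


def locations_for_word(db: dict, word: str) -> List[float]:
    """Video positions (seconds) stored for `word`, or an empty list."""
    return db["text_index"].get(word, [])


db = build_search_database([0.0, 12.0, 47.5], {"good": [47.5, 12.0], "night": [47.5]})
saved = json.dumps(db, sort_keys=True)  # what would be written to the recording medium
restored = json.loads(saved)

print(locations_for_word(restored, "good"))  # [12.0, 47.5]
print(restored["image_browser"][1])          # {'key_frame': 1, 'time': 12.0}
```

Keeping both lookups in a single saved record lets a player support browsing by key frame and searching by subtitle text from the same file.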
[0015]
FIG. 2 is a diagram showing a video search database according to an embodiment of the present invention. With the image browser search 50 in the search database 40, the user can select any key frame in the image browser search and play the video location containing that key frame. Likewise, with the text index table 60 in the search database 40, the user can select any search field in the text index table 60 and play the video location corresponding to that search field. In addition, the user can search for the video to be played by entering subtitle text from the keyboard as search data.
[0016]
When a video is played, the dictionary database function is executed and the screen is divided into two windows. The toolbar 67 in the video playback area 75, which is used for playing the video, operates the pause, play, fast-forward, and rewind functions.
[0017]
In addition, the subtitle mask 65 masks the text area in the video, that is, hides the subtitles in the video, and the function for switching and controlling the subtitle mask 65 lets the user choose whether the subtitles are hidden or displayed. The other window, the subtitle display 80, shows the subtitles of the video. The text in the subtitle display 80 is the text data obtained by analyzing the subtitles in the video. When text data in the subtitle display 80 are selected, the dictionary database 70 displays annotations for the text, including its meaning, part of speech, and idioms, and the image display 85 displays the shot corresponding to the text in the subtitle display 80.
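The subtitle mask can be pictured as a toggle that overwrites the detected text band before a frame is shown. The sketch below assumes a frame is a list of pixel rows and that the text area is a (top, bottom) row range with bottom exclusive. The function name and the fill value are hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed representation: rows of 8-bit grayscale pixels


def apply_subtitle_mask(
    frame: Image,
    text_band: Tuple[int, int],
    masked: bool = True,
    fill: int = 0,
) -> Image:
    """Return a copy of `frame` with the rows of `text_band` filled in when
    `masked` is True, or an unchanged copy when it is False."""
    top, bottom = text_band
    if not masked:
        return [row[:] for row in frame]
    return [
        [fill] * len(row) if top <= y < bottom else row[:]
        for y, row in enumerate(frame)
    ]


frame = [[1, 2], [3, 4], [5, 6]]
print(apply_subtitle_mask(frame, (2, 3)))                # [[1, 2], [3, 4], [0, 0]]
print(apply_subtitle_mask(frame, (2, 3), masked=False))  # [[1, 2], [3, 4], [5, 6]]
```

Returning a copy leaves the stored frame untouched, so the learner can switch the subtitles back on without decoding the frame again.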
[0018]
The method for creating a video search database provided by the present invention allows a user to effectively convert a foreign language video into interactive language teaching material. Index data are created based on the subtitles in the video, providing the user with the best video browsing control method and a convenient language learning tool.
[0019]
Although preferred embodiments of the present invention have been disclosed above, they are not intended to limit the present invention in any way, and any person skilled in the art may make various changes and refinements without departing from the spirit and scope of the present invention. The scope of protection of the present invention is therefore defined by the contents of the claims.
[0020]
[Effect of the invention]
Interactive language video teaching materials are obtained.
[Brief description of the drawings]
FIG. 1 is a flowchart illustrating a method for creating a video search database according to an embodiment of the present invention.
FIG. 2 is a diagram illustrating a video search database according to an embodiment of the present invention.
[Explanation of symbols]
40 search database
50 image browser search
60 text index table
65 subtitle mask
67 toolbar
70 dictionary database
75 video playback area
80 subtitle display
85 image display

Claims

A method for creating a video search database, comprising:
dividing image data with subtitles into a plurality of shots;
extracting a plurality of key frames according to changes in the video within the plurality of shots;
setting up an image browser search using the plurality of key frames;
capturing a text area according to image features in the image data;
subjecting the text area to text segmentation to generate a plurality of text features;
comparing the plurality of text features with text in a database to create text data;
setting up a text index table using the text data; and
creating a video search database using the image browser search and the text index table.

The method of claim 1, wherein the subtitles are Chinese and English subtitles.

The method of claim 1, wherein the plurality of key frames are generated according to a video having a large screen change.

The method of claim 1, wherein selecting an arbitrary search field in the text index table plays a video location represented by the search field.

The method according to claim 1, wherein a corresponding video portion is obtained by using a keyboard input device and using the input character data as search data.

The method of claim 1, wherein a shot with the key frame is played by selecting an arbitrary key frame in the image index table.

The method according to claim 1, wherein shielding or displaying of the text subtitle is selected by shielding the text area.

The method according to claim 1, wherein a dictionary database is created using the text data.

A recording medium recording a program that causes a computer to execute a method for creating a video search database, the method comprising:
dividing image data with subtitles into a plurality of shots;
extracting a plurality of key frames according to changes in the video within the plurality of shots;
setting up an image browser search using the plurality of key frames;
capturing a text area according to image features in the image data;
subjecting the text area to text segmentation to generate a plurality of text features;
comparing the plurality of text features with text in a database to create text data;
setting up a text index table using the text data; and
creating a video search database using the image browser search and the text index table.

The recording medium according to claim 9, wherein the subtitles are Chinese and English subtitles.

The recording medium according to claim 9, wherein the plurality of key frames are generated according to a video having a large screen change.

The recording medium according to claim 9, wherein a video portion represented by the search field is reproduced by selecting an arbitrary search field in the text index table.

The recording medium according to claim 9, wherein a corresponding video portion is obtained by using the input character data as search data using a keyboard input device.

The recording medium according to claim 9, wherein a shot having the key frame is reproduced by selecting an arbitrary key frame in the image index table.

The recording medium according to claim 9, wherein shielding or displaying of the text subtitle is selected by shielding the text area.

The recording medium according to claim 9, wherein a dictionary database is created using the text data.