JP4504216B2

JP4504216B2 - Image processing apparatus and image processing program

Info

Publication number: JP4504216B2
Application number: JP2005031796A
Authority: JP
Inventors: 豊一坂井
Original assignee: Xing Inc
Current assignee: Xing Inc
Priority date: 2005-02-08
Filing date: 2005-02-08
Publication date: 2010-07-14
Anticipated expiration: 2025-02-08
Also published as: JP2006221253A

Description

本発明は、入力画像を記号の集合体で構成した画像に置換して、かかる置換画像を動画再生する画像処理装置および画像処理プログラムに関する。 The present invention relates to an image processing apparatus and an image processing program for replacing an input image with an image composed of a collection of symbols and reproducing the replacement image as a moving image.

近年、電子メールやインターネットが広く普及し、これに伴い、電子メールやインターネットの掲示板等で、文章表現ツールの１種としていわゆる「顔文字」が広く使用されている。「顔文字」は、表情を模した画像が記号（文字）の組み合わせにより作成されたテキストデータである。 In recent years, e-mail and the Internet have become widespread, and accordingly, so-called “emoticons” are widely used as a kind of text expression tool in e-mail and Internet bulletin boards. “Emoji” is text data in which an image simulating an expression is created by a combination of symbols (characters).

かかる「顔文字」は、使用者個人が記号を組み合わせて作成するもの以外に、予めコンピュータ内部に記憶されていたり、サーバからのダウンロードにより取得することができる。使用者は、かかる「顔文字」を表現内容に合わせて適宜選択し、文章中に添付することにより、文章表現に加え、自己の気分や感情を表現する。この「顔文字」の添付により、文章だけでは伝達できない情緒が表現され、自己と他者との間で高度なコミュニケーションを行うことができる上、文章の娯楽性を向上させることができるのである。 Such “emoticons” can be stored in advance in the computer or obtained by downloading from a server, in addition to those created by a user combining symbols. The user appropriately selects such “emoticon” according to the expression content and attaches it to the sentence, thereby expressing his / her feelings and emotions in addition to the sentence expression. By attaching this “emoticon”, an emotion that cannot be transmitted only by the text is expressed, and advanced communication can be performed between the self and others, and the entertainment of the text can be improved.

また、かかる「顔文字」を顔画像のパターンに対応付けて記憶させており、カメラにて撮像された使用者の顔の表情（顔画像のパターン）に応じて、対応する「顔文字」を文中に入力する顔文字入力のためのシステムが開示されている（特許文献１参照）。
特開２００２−３１８６４９号公報 Further, the “emoticon” is stored in association with the face image pattern, and the corresponding “emoticon” is stored in accordance with the facial expression of the user (face image pattern) captured by the camera. A system for inputting emoticons to be input in a sentence is disclosed (see Patent Document 1).
JP 2002-318649 A

しかしながら、上記文献に開示されたシステムでは、カメラから取り込んだ使用者の顔の表情に応じて画像を添付できるものの、「顔文字」は、不特定多数の者が共通の認識を持つことのできる表情を模した画像であって個々の使用者の特徴が反映されたものではない。このため、「顔文字」を使用した表現は個性のない画一的な表現になってしまうという問題点があった。更に、「顔文字」等による表現は、広く普及しているので新鮮さに欠け、閲覧する者の興趣を高め難いという問題点があった。 However, in the system disclosed in the above document, although an image can be attached according to the facial expression of the user captured from the camera, the “emoticon” can be shared by many unspecified people. It is an image that imitates an expression and does not reflect the characteristics of each user. For this reason, there is a problem that the expression using the “emoticon” becomes a uniform expression without individuality. Furthermore, since expressions such as “emoticons” are widespread, there is a problem that they lack freshness and it is difficult to enhance the interest of the viewer.

一方で、入力された画像の各部を、濃淡に応じて予め定めた記号（テキストデータ）にて置換し、画像（いわゆる「アスキー画像」）を形成する手法が広く知られている。これによれば、入力された画像を「顔文字」同様、テキストデータにて表現することができ、入力された画像とは異なる態様でありながらも、入力された画像の特徴を反映した表現を行うことができる。しかし、かかる画像は、明瞭さに欠けた画像品質が悪いものとなりがちであるという問題点があった。また、動画再生においては、処理するデータ量が多くなり、その処理が煩雑となるという問題点があった。 On the other hand, a method of forming each image (so-called “ASCII image”) by replacing each part of the input image with a predetermined symbol (text data) according to the density is widely known. According to this, like the “emoticon”, the input image can be expressed by text data, and although the aspect is different from the input image, the expression reflecting the characteristics of the input image It can be carried out. However, such an image has a problem that the image quality lacking in clarity tends to be poor. Further, in the reproduction of moving images, there is a problem that the amount of data to be processed increases and the processing becomes complicated.

本発明は、上記問題点を解決するためになされたものであり、入力された画像の特徴が反映された簡素な画像を記号データで形成して動画再生することにより、高い娯楽性を有する演出を行うことのできる画像処理装置および画像処理プログラムを提供することを目的としている。 The present invention has been made in order to solve the above-mentioned problems, and by creating a simple image reflecting the characteristics of an input image with symbol data and reproducing the moving image, the present invention has high entertainment. An object of the present invention is to provide an image processing apparatus and an image processing program capable of performing the above.

この目的を達成するために、請求項１記載の画像処理装置は、複数の画像フレームを有し連続した画像を現出させ得る画像データを入力する画像データ入力手段と、その画像データ入力手段により入力された画像データを各画像フレーム毎に加工する加工手段と、その加工手段により加工された画像フレームをその入力順に従って順次出力することにより画像データを出力する出力手段とを備えた画像処理部と、その画像処理部の出力手段により出力された前記画像フレームを順に表示して前記画像データを動画表示する表示装置とを備えており、前記画像処理部は、複数の記号の画像データが記憶された記号データ記憶手段を備えており、前記画像処理部の加工手段は、前記画像データ入力手段にて入力された画像データを線画像データに変換する線画変換手段と、前記画像データ入力手段にて入力された画像データの画像フレーム又は前記線画変換手段により変換された線画像データを有する画像フレームを所定数のブロックに分割する分割手段と、その分割手段により分割されたブロック単位で、前記線画像データの態様に応じて割当てられている記号を前記記号データ記憶手段に記憶される記号の画像データの中から抽出する抽出手段と、その抽出手段により抽出された記号の画像データで、対応するブロックの前記線画像データを置換した置換画像データを形成する画像形成手段とを備えており、前記画像処理部の出力手段は、前記置換画像データにて形成された置換画像データを出力するものであり、前記表示装置にて前記置換画像データの動画表示を行い、前記画像処理部は、前記画像データ入力手段として撮影により画像データを取得する撮像装置を備えており、更に、その撮像装置により背景画像として取得された背景画像データを記憶する背景画像記憶手段を備えると共に、前記画像処理部の加工手段は、前記撮像装置により取得された画像データから前記背景画像記憶手段に記憶される背景画像データとの共通部分を除いて、前記画像フレーム毎に固有の固有画像データを抽出する固有画像データ抽出手段を備え、前記画像処理部の線画変換手段は、その固有画像データ抽出手段により抽出された固有画像データを線画像データに変換するものであり、前記画像処理部の画像形成手段は、その固有画像データから変換された線画像データによって置換画像データを形成するものであり、前記画像処理部の加工手段は、前記分割手段により前記画像フレームを分割した場合に、その画像フレーム内において前記線画像データまたは前記固有画像データを有するブロック数が予め定めた下限値以下であると、前記画像フレーム内において前記線画像データまたは前記固有画像データを有するブロック数を増加させる方向に、前記線画像データまたは前記固有画像データを伸張する画像データ伸張手段を備えており、前記画像処理部の画像形成手段は、その画像データ伸張手段により前記線画像データまたは固有画像データが伸張された場合には、伸張後の画像データに基づいて置換画像データを形成するものである。 In order to achieve this object, an image processing apparatus according to claim 1 includes image data input means for inputting image data having a plurality of image frames and capable of displaying a continuous image, and the image data input means. An image processing unit comprising processing means for processing input image data for each image frame, and output means for outputting image data by sequentially outputting the image frames processed by the processing means in accordance with the input order And a display device that sequentially displays the image frames output by the output means of the image processing unit and displays the image data as a moving image. The image processing unit stores image data of a plurality of symbols. Symbol data storage means, and the processing means of the image processing unit converts the image data input by the image data input means into line image data. A line drawing conversion means for converting, a dividing means for dividing an image frame of the image data input by the image data input means or an image frame having the line image data converted by the line drawing conversion means into a predetermined number of blocks; Extraction means for extracting symbols assigned in accordance with the mode of the line image data from the symbol image data stored in the symbol data storage means in units of blocks divided by the division means, and the extraction And image forming means for forming replacement image data obtained by replacing the line image data of the corresponding block with the image data of the symbol extracted by the means, and the output means of the image processing unit includes the replacement image data and it outputs a replacement image data formed by, perform moving picture display of the replacement image data at the display device, the image The processing unit includes an imaging device that acquires image data by photographing as the image data input unit, and further includes a background image storage unit that stores background image data acquired as a background image by the imaging device, The processing means of the image processing unit removes unique image data unique to each image frame by excluding a common part with the background image data stored in the background image storage means from the image data acquired by the imaging device. A unique image data extracting means for extracting, and the line drawing conversion means of the image processing section converts the unique image data extracted by the unique image data extracting means into line image data, and the image of the image processing section The forming means forms replacement image data from line image data converted from the unique image data, and the image processing When the image frame is divided by the dividing means, the processing means of the section is configured such that the number of blocks having the line image data or the unique image data in the image frame is not more than a predetermined lower limit Image data expansion means for expanding the line image data or the unique image data in a direction to increase the number of blocks having the line image data or the unique image data in a frame, and forming an image in the image processing unit The means forms replacement image data based on the decompressed image data when the line image data or unique image data is decompressed by the image data decompressing means .

請求項２記載の画像処理装置は、請求項１記載の画像処理装置において、前記画像データ伸張手段は、抽出元の画像フレームに対して前記固有画像データ抽出手段により抽出された前記固有画像データが占める割合が、所定の割合以下であるか否かを判断する判断手段を備えており、その判断手段により、前記固有画像データが占める割合が所定の割合以下であると判断された場合に、前記固有画像データを有するブロック数が予め定めた下限値以下であるとして、前記固有画像データの伸張を行うものである。 The image processing apparatus according to claim 2 is the image processing apparatus according to claim 1 , wherein the image data decompression unit is configured such that the unique image data extracted by the unique image data extraction unit with respect to an image frame from which the image is extracted is stored. A determination unit that determines whether or not the proportion of the specific image data is less than or equal to a predetermined proportion; The unique image data is expanded assuming that the number of blocks having the unique image data is equal to or less than a predetermined lower limit value.

請求項３記載の画像処理装置は、請求項１又は２に記載の画像処理装置において、前記画像処理部の加工手段は、前記分割手段により前記画像フレームを分割した場合に、その画像フレーム内において前記線画像データまたは前記固有画像データを有するブロック数が予め定めた上限値以上であると、前記線画像データまたは前記固有画像データを圧縮し、前記画像フレーム内において前記線画像データまたは前記固有画像データを有するブロック数を減少させる画像データ圧縮手段を備えており、前記画像処理部の画像形成手段は、その画像データ圧縮手段により前記画像データが圧縮された場合には、圧縮後の画像データに基づいて置換画像データを形成するものである。 The image processing apparatus of claim 3, wherein, in the image processing apparatus according to claim 1 or 2, the processing means of the image processing unit, in case of dividing the image frame by the dividing means, in the image frame When the number of blocks having the line image data or the unique image data is equal to or greater than a predetermined upper limit, the line image data or the unique image data is compressed, and the line image data or the unique image is included in the image frame. Image data compression means for reducing the number of blocks having data; and when the image data is compressed by the image data compression means, the image forming means of the image processing unit converts the image data into compressed image data. Based on this, replacement image data is formed.

請求項４記載の画像処理装置は、請求項１から３のいずれかに記載の画像処理装置において、伴奏情報を記憶する伴奏情報記憶手段と、その伴奏情報記憶手段に記憶される伴奏情報に基づいて伴奏を発生する伴奏発生手段と、音声データを入力する音声データ入力手段と、その音声データ入力手段により入力された音声データと、伴奏発生手段にて発生された伴奏とを発音する発音手段と、各伴奏情報に対応し、且つ、前記発音手段により発音される伴奏に合わせて前記表示装置に表示される歌詞データを記憶する歌詞データ記憶手段とを備えており、その歌詞データ記憶手段に記憶される歌詞データと共に、前記出力手段により出力された前記置換画像データを前記表示装置に表示するものである。 An image processing apparatus according to a fourth aspect is the image processing apparatus according to any one of the first to third aspects, based on accompaniment information storage means for storing accompaniment information and accompaniment information stored in the accompaniment information storage means. Accompaniment generating means for generating accompaniment, sound data input means for inputting sound data, sound generating means for sounding sound data input by the sound data input means and accompaniment generated by the accompaniment generating means Lyric data storage means corresponding to each accompaniment information and storing lyric data displayed on the display device in accordance with the accompaniment sounded by the sounding means, and stored in the lyric data storage means The replacement image data output by the output means is displayed on the display device together with the lyrics data.

請求項５記載の画像処理装置は、請求項１から４のいずれかに記載の画像処理装置において、請求項１から４のいずれかに記載の画像処理部を備え回線を介して相手側装置に接続されると共に、音声データを入力する音声データ入力手段と、その音声データ入力手段により入力された音声データを前記回線に出力するデータ信号に変換するデータ変換手段と、前記画像処理部の画像形成手段により形成された置換画像データに対応する識別コードを音声データのデータ信号に変換して前記データ変換手段により変換されたデータ信号に重畳する重畳手段とを有する送信側装置を備えており、前記画像処理部の出力手段は、その重畳手段により重畳されたデータ信号を前記回線に出力して前記相手側装置に置換画像データを出力するものである。 An image processing apparatus according to a fifth aspect is the image processing apparatus according to any one of the first to fourth aspects, wherein the image processing apparatus according to any one of the first to fourth aspects comprises the image processing unit according to any one of the first to fourth aspects. Audio data input means for inputting audio data, data conversion means for converting the audio data input by the audio data input means into a data signal output to the line, and image formation of the image processing unit A transmission-side device having a superimposing unit that converts an identification code corresponding to the replacement image data formed by the unit into a data signal of audio data and superimposes it on the data signal converted by the data conversion unit, The output unit of the image processing unit outputs the data signal superimposed by the superimposing unit to the line and outputs replacement image data to the counterpart device.

請求項６記載の画像処理装置は、請求項５記載の画像処理装置において、前記送信側装置は、前記音声データ入力手段により入力された音声データについて、非再生領域として設定される非再生周波数成分を除去する除去手段を備えており、前記重畳手段は、前記置換画像データの識別コードを、前記除去手段により除去された非再生周波数成分の帯域のデータ信号に変換して前記データ変換手段により変換されたデータ信号に重畳するものである。 The image processing apparatus according to claim 6, wherein, in the image processing apparatus according to claim 5, wherein said transmitting side apparatus, a non-regenerative frequency components above the audio data input by the voice data input unit, is set as the non-reproducing area The superimposing means converts the identification code of the replacement image data into a data signal in the band of the non-reproduction frequency component removed by the removing means and converts the data signal by the data converting means. Is superimposed on the data signal.

請求項７記載の画像処理装置は、請求項５または６に記載の画像処理装置において、前記送信側装置は、前記音声データ入力手段により入力された音声データについて、可聴限界を越えた限界周波数成分を除去する第２除去手段を備えており、前記重畳手段は、前記置換画像データの識別コードを、前記第２除去手段により除去された限界周波数成分のデータ信号に変換して前記データ変換手段により変換されたデータ信号に重畳するものである。 The image processing apparatus according to claim 7, wherein, in the image processing apparatus according to claim 5 or 6, wherein the transmitting-side apparatus, the audio data inputted by said voice data input means, the limit frequency components exceeding the audible limit The superimposing means converts the identification code of the replacement image data into the data signal of the limit frequency component removed by the second removing means, and the data converting means It is superimposed on the converted data signal.

請求項８記載の画像処理装置は、請求項５から７のいずれかに記載の画像処理装置において、前記回線を介して前記送信側装置に接続されると共に、前記表示装置と、音声データを受信する音声データ受信手段と、その音声データ受信手段により受信された音声データを音声で再生する音声再生手段と、前記音声データ受信手段により受信された音声データに前記置換画像データに対応する識別コードのデータ信号が含まれている場合にそのデータ信号から前記置換画像データを再生する画像データ再生手段とを有し、その画像データ再生手段により再生された置換画像データを前記表示装置にて表示する相手側装置を備えている。 The image processing apparatus according to claim 8, wherein the receiving the image processing apparatus according to any one of claims 5-7, which is connected to the transmitting-side device through the line, and the display device, the audio data Voice data receiving means, voice playback means for playing back the voice data received by the voice data receiving means, and voice data received by the voice data receiving means in an identification code corresponding to the replacement image data. An image data reproducing means for reproducing the replacement image data from the data signal when a data signal is included, and the display device for displaying the replacement image data reproduced by the image data reproducing means A side device is provided.

請求項９記載の画像処理プログラムは、複数の画像フレームを有し連続した画像を現出させ得る画像データを入力する画像データ入力手段を備えたコンピュータに、その画像データ入力手段により入力された画像データを各画像フレーム毎に加工する加工ステップと、その加工ステップにより加工された画像フレームをその入力順に従って順次出力することにより画像データを出力する出力ステップとを備えた画像処理ステップを実行させ、その画像処理ステップの出力ステップにより出力された前記画像フレームを順に表示して前記コンピュータの表示装置に前記画像データを動画表示させるものであって、前記コンピュータは、複数の記号の画像データが記憶された記号データ記憶手段を備えており、前記画像処理ステップの加工ステップは、前記画像データ入力手段にて入力された画像データを線画像データに変換する線画変換ステップと、前記画像データ入力手段にて入力された画像データの画像フレーム又は前記線画変換ステップにより変換された線画像データを有する画像フレームを所定数のブロックに分割する分割ステップと、その分割ステップにより分割されたブロック単位で、前記線画像データの態様に応じて割当てられている記号を前記記号データ記憶手段に記憶される記号の画像データの中から抽出する抽出ステップと、その抽出ステップにより抽出された記号の画像データで、対応するブロックの前記線画像データを置換した置換画像データを形成する画像形成ステップとを前記コンピュータに実行させ、前記画像処理ステップの出力ステップは、前記置換画像データにて形成された置換画像データを出力するものであり、前記表示装置にて前記置換画像データの動画表示を行い、前記コンピュータは、前記画像データ入力手段として撮影により画像データを取得する撮像装置を備えており、更に、その撮像装置により背景画像として取得された背景画像データを記憶する背景画像記憶手段を備え、前記画像処理ステップの加工ステップは、前記撮像装置により取得された画像データから前記背景画像記憶手段に記憶される背景画像データとの共通部分を除いて、前記画像フレーム毎に固有の固有画像データを抽出する固有画像データ抽出ステップを前記コンピュータに実行させ、前記画像処理ステップの線画変換ステップは、その固有画像データ抽出ステップにより抽出された固有画像データを線画像データに変換するものであり、前記画像処理ステップの画像形成ステップは、その固有画像データから変換された線画像データによって置換画像データを形成するものであり、前記画像処理ステップの加工ステップは、前記分割ステップにより前記画像フレームを分割した場合に、その画像フレーム内において前記線画像データまたは前記固有画像データを有するブロック数が予め定めた下限値以下であると、前記画像フレーム内において前記線画像データまたは前記固有画像データを有するブロック数を増加させる方向に、前記線画像データまたは前記固有画像データを伸張する画像データ伸張ステップをコンピュータに実行させ、前記画像処理ステップの画像形成ステップは、その画像データ伸張ステップにより前記線画像データまたは固有画像データが伸張された場合には、伸張後の画像データに基づいて置換画像データを形成するものである。 The image processing program according to claim 9 is an image input by a computer having image data input means for inputting image data having a plurality of image frames and capable of displaying a continuous image. An image processing step including a processing step for processing data for each image frame, and an output step for outputting image data by sequentially outputting the image frames processed by the processing step according to the input order, is executed, The image frames output in the output step of the image processing step are sequentially displayed and the image data is displayed as a moving image on the display device of the computer. The computer stores image data of a plurality of symbols. Symbol data storage means, and a processing step of the image processing step. Is a line drawing conversion step of converting the image data input by the image data input section to the line image data, converted by the image frame or the line drawing conversion step of the image data input by the image data input means A division step of dividing an image frame having line image data into a predetermined number of blocks, and symbols assigned according to the mode of the line image data in units of blocks divided by the division step, the symbol data storage means An extraction step of extracting from the symbol image data stored in the image, and an image formation step of forming replacement image data in which the line image data of the corresponding block is replaced with the symbol image data extracted by the extraction step preparative to execute the computer, the output step of the image processing step, the location The replacement image data formed by the image data is output, the moving image of the replacement image data is displayed on the display device, and the computer captures image data by photographing as the image data input means A background image storage means for storing background image data acquired as a background image by the imaging device, and the processing step of the image processing step is based on the image data acquired by the imaging device. Excluding a common part with the background image data stored in the background image storage means, the computer executes a unique image data extraction step for extracting unique image data for each image frame, and the image processing step The line drawing conversion step converts the unique image data extracted by the unique image data extraction step. Converting to line image data, and the image forming step of the image processing step forms replacement image data by line image data converted from the unique image data, and the processing step of the image processing step includes When the image frame is divided by the dividing step, if the number of blocks having the line image data or the unique image data in the image frame is equal to or less than a predetermined lower limit value, the line in the image frame Causing the computer to execute an image data expansion step of expanding the line image data or the unique image data in a direction to increase the number of blocks having the image data or the unique image data, and the image forming step of the image processing step includes: The line image data is obtained by the image data expansion step. Others when specific image data is decompressed, and forms a replacement image data based on the image data after decompression.

請求項１０記載の画像処理プログラムは、請求項９記載の画像処理プログラムにおいて、前記画像データ伸張ステップは、抽出元の画像フレームに対して前記固有画像データ抽出ステップにより抽出された前記固有画像データが占める割合が、所定の割合以下であるか否かを判断する判断ステップを前記コンピュータに実行させ、その判断ステップにより、前記固有画像データが占める割合が所定の割合以下であると判断された場合に、前記固有画像データを有するブロック数が予め定めた下限値以下であるとして、前記固有画像データの伸張を行うものである。 An image processing program according to a tenth aspect of the present invention is the image processing program according to the ninth aspect , wherein the unique image data extracted by the unique image data extracting step is extracted from the original image frame. When the computer executes a determination step for determining whether the proportion is less than or equal to a predetermined proportion , and the determination step determines that the proportion occupied by the unique image data is less than or equal to a predetermined proportion The unique image data is expanded on the assumption that the number of blocks having the unique image data is equal to or less than a predetermined lower limit value.

請求項１記載の画像処理装置によれば、画像データ入力手段にて入力された画像データは、線画変換手段により、線画像データに変換される。また、分割手段により、その入力された画像データの画像フレーム又は線画変換手段により変換された線画像データを有する画像フレームは、所定数のブロックに分割され、分割されたブロック単位で、線画像データの態様と近似する記号が、抽出手段により、記号データ記憶手段に記憶される記号の画像データの中から抽出される。そして、抽出された記号の画像データで対応するブロックの線画像データを置換した置換画像データが置換画像形成手段により形成される。形成された置換画像データは、出力手段により表示装置に出力され、表示装置にて置換画像データの動画表示が行われる。 According to the image processing apparatus of the first aspect, the image data input by the image data input means is converted into line image data by the line drawing conversion means. Further, the image frame of the input image data by the dividing unit or the image frame having the line image data converted by the line drawing converting unit is divided into a predetermined number of blocks, and the line image data is divided into divided blocks. The symbol that approximates this aspect is extracted from the symbol image data stored in the symbol data storage means by the extraction means. Then, replacement image data obtained by replacing the line image data of the corresponding block with the extracted symbol image data is formed by the replacement image forming means. The formed replacement image data is output to the display device by the output means, and the replacement image data is displayed as a moving image on the display device.

よって、入力された画像を記号の画像データで構成された画像に変換して出力することができ、単純な画像が自己の動作に応じてアクションするというコミカルで娯楽性の高い演出を提供することができる。更に、入力された画像が変換された線画から記号の画像データを形成するので、出力画像をシンプルな画像とすることができ、入力画像データ全体を記号データで構成した置換画像データに変換する場合に比べて、表示装置に表示する画像をシンプル且つシャープな画像とすることができるという効果がある。 Therefore, it is possible to convert an input image into an image composed of symbol image data and output it, and to provide a comical and highly entertaining effect that a simple image acts according to its own operation. Can do. Furthermore, since the image data of the symbol is formed from the line drawing obtained by converting the input image, the output image can be a simple image, and the entire input image data is converted to the replacement image data composed of the symbol data. As compared with the above, there is an effect that the image displayed on the display device can be a simple and sharp image.

加えて、表示装置にて表示される画像を、入力された実際の画像データとは異なる画像でありつつ、実画像の特徴的部分を残した画像することができるという効果がある。例えば、入力された画像中の人物を予め用意されたキャラクタ（「顔文字」を含む）に置換して画像出力する場合には、予め作成されたキャラクタに置き換えられるだけであるので、対象人物の特徴が反映され難い。また、特徴を少しでも反映しようとすれば多くのキャラクタデータが必要になる。しかし、置換画像データは、記号データを用いて入力された画像データから形成されるので、表示装置での出力画像が画一的な画像とならず、個々の画像データのニュアンスが反映された高度な表現を実現することができる。 In addition, there is an effect that the image displayed on the display device can be an image that is different from the input actual image data, but leaves the characteristic portion of the actual image. For example, when an image is output by replacing a person in the input image with a character prepared in advance (including “emoticon”), it is simply replaced with a character created in advance. The characteristics are difficult to be reflected. Also, a large amount of character data is required to reflect the features even a little. However, since the replacement image data is formed from the image data input using the symbol data, the output image on the display device does not become a uniform image, and the nuance of the individual image data is reflected. Can be realized.

また、撮像装置により取得された画像データから背景画像データとの共通部分を除いて、画像フレーム毎に、固有の固有画像データが固有画像データ抽出手段により抽出される。置換画像データは、抽出された固有画像データが変換された線画像データから形成される。 Further, the unique image data is extracted by the unique image data extraction unit for each image frame, except for the common part with the background image data from the image data acquired by the imaging device. The replacement image data is formed from line image data obtained by converting the extracted unique image data.

よって、背景画像部分を省略して置換画像データを形成することができ、画像データの全てから置換画像データを形成する場合に比べて、置換画像データの形成速度を高速にすることができるという効果がある。また、形成された置換画像データを表示装置等へ出力する場合に、そのデータ出力速度を高速にすることができるという効果がある。更に背景画像データを除去することにより、置換画像データの変換元の画像データの猥雑さを軽減することができる。従って、表示装置に簡素な置換画像データを出力でき、出力される画像が何を表現したものであるか不明確（識別困難）になることを回避できるという効果がある。背景画像データが含まれた猥雑（複雑）な画像データである場合、個々のオブジェクトの境界が不明確になりかねず、かかる画像データを置換画像データに変換すると、場合によっては、個々のオブジェクトとは異なる形状に分割あるいは連結された画像が出現しかねない。しかし、（背景画像データが除去された）固有画像データが変換された線画像データから置換画像データを形成するので、かかる不具合を軽減し、簡素で明瞭な置換画像データを表示装置に表示できる。 Accordingly, the replacement image data can be formed by omitting the background image portion, and the replacement image data can be formed at a higher speed than the case where the replacement image data is formed from all the image data. There is. Further, when the formed replacement image data is output to a display device or the like, the data output speed can be increased. Further, by removing the background image data, it is possible to reduce the complexity of the image data from which the replacement image data is converted. Therefore, simple replacement image data can be output to the display device, and it is possible to avoid unclear (difficult to identify) what the output image represents. In the case of complicated (complex) image data including background image data, the boundaries of individual objects may be unclear, and when such image data is converted into replacement image data, May appear in an image divided or connected into different shapes. However, since the replacement image data is formed from the line image data obtained by converting the unique image data (with the background image data removed), it is possible to reduce such problems and display simple and clear replacement image data on the display device.

また、分割手段により画像フレームを分割した場合に、その画像フレーム内において線画像データまたは固有画像データを有するブロック数が予め定めた下限値以下であると、画像データ伸張手段により、線画像データまたは固有画像データを有するブロック数を増加させる方向に、線画像データまたは固有画像データは伸張される。そして、伸張後の画像データに基づいて置換画像データが形成される。 Further, in case of dividing an image frame by dividing means, when the number of blocks is less than a predetermined lower limit value having a linear image data or unique image data in the image frame, the image data expansion means, the line image data or The line image data or the unique image data is expanded in the direction of increasing the number of blocks having the unique image data. Then, replacement image data is formed based on the decompressed image data.

撮像装置にて画像データを取得する場合、撮像装置と被撮影体（固有画像データに対応するもの）との距離によっては、被撮影体の大きさが背景画像に対して（著しく）小さくなる場合がある。かかる場合の画像データにおいては、画像フレーム内において、固有画像データ（線画像データ）を有するブロック数が減少する。少ない数の記号データで構成される置換画像データは、元の画像（被撮影体）の微細な部分の画像表現が省略されるため、その特徴を具有するものとはなり難く、場合によっては、オブジェクトを表現した画像とは認識できない単なる記号の配列になりかねない。 When acquiring image data with an imaging device, depending on the distance between the imaging device and the subject (corresponding to the unique image data), the size of the subject is (remarkably) smaller than the background image There is. In the image data in such a case, the number of blocks having unique image data (line image data) is reduced in the image frame. The replacement image data composed of a small number of symbol data is less likely to have the characteristics because the image representation of the fine part of the original image (the subject) is omitted, and in some cases, It may be just an array of symbols that cannot be recognized as an image representing the object.

しかし、固有画像データ（線画像データ）を伸張すれば、画像フレーム内において固有画像データ（線画像データ）を有するブロック数を増加させることができる。つまり、微細部分についても表現し得る画像に拡大した上で、置換画像データを形成することができるので、入力された画像データの固有画像データが小さくとも、元の画像の特徴を備えた置換画像データを形成することができるという効果がある。 However, if the unique image data (line image data) is expanded, the number of blocks having the unique image data (line image data) in the image frame can be increased. In other words, since the replacement image data can be formed after enlarging the image to be able to express even a fine portion, the replacement image having the characteristics of the original image even if the unique image data of the input image data is small. There is an effect that data can be formed.

請求項２記載の画像処理装置によれば、請求項１記載の画像処理装置の奏する効果に加え、抽出元の画像フレームに対して抽出された固有画像データの占める割合が所定の割合以下であると判断手段により判断された場合に、固有画像データを有するブロック数が予め定めた下限値以下であるとして固有画像データの伸張が行われれる。よって、伸張するべき固有画像データの選別を的確に行うことができるという効果がある。 According to the image processing apparatus according to claim 2, wherein, in addition to the effects of the image processing apparatus according to claim 1, wherein, the ratio of specific image data extracted on the extracted original image frame is less than a predetermined ratio If the determination means determines that the number of blocks having unique image data is equal to or less than a predetermined lower limit value, the unique image data is expanded. Therefore, there is an effect that the specific image data to be expanded can be accurately selected.

固有画像データの大きさをブロック数で測る（ブロック単位でカウントする）場合には、ブロック全面にドットが配されていなくとも１カウントされる。このため、ブロック単位で固有画像データの大きさを測る場合、簡便である一方、測定結果が実際の大きさよりも大きい側にシフトしがちになり、場合によっては、伸張する必要のある固有画像データが伸張されないという事態が発生しかねない。しかし、抽出元の画像フレームに対して抽出された固有画像データの占める割合が所定の割合以下である場合に、固有画像データの伸張を行うので、的確に伸張の必要のある固有画像データを選別することができるのである。 When the size of the unique image data is measured by the number of blocks (counted in units of blocks), 1 is counted even if no dots are arranged on the entire block. Therefore, when measuring the size of the unique image data in units of blocks, it is convenient, but the measurement result tends to shift to a larger side than the actual size, and in some cases, the unique image data that needs to be expanded There is a possibility that the situation will not be expanded. However, when the ratio of the extracted unique image data to the extraction source image frame is equal to or less than the predetermined ratio, the unique image data is decompressed, so that the unique image data that needs to be decompressed accurately is selected. It can be done.

請求項３記載の画像処理装置によれば、請求項１又は２に記載の画像処理装置の奏する効果に加え、分割手段により画像フレームを分割した場合に、その画像フレーム内において線画像データまたは固有画像データを有するブロック数が予め定めた上限値以上であると、画像データ圧縮手段により、線画像データまたは固有画像データは圧縮され、画像フレーム内において線画像データまたは固有画像データを有するブロック数を減少させる。置換画像データは、圧縮後の画像データに基づいて形成される。よって、表示装置の画面に適度な大きさで置換画像データを表示することができるという効果がある。 According to the image processing device of the third aspect , in addition to the effect produced by the image processing device according to the first or second aspect, when the image frame is divided by the dividing means, the line image data or the unique image is included in the image frame. If the number of blocks having image data is greater than or equal to a predetermined upper limit, the line image data or unique image data is compressed by the image data compression means, and the number of blocks having line image data or unique image data in the image frame is determined. Decrease. The replacement image data is formed based on the compressed image data. Therefore, there is an effect that the replacement image data can be displayed on the screen of the display device with an appropriate size.

請求項４記載の画像処理装置によれば、請求項１から３のいずれかに記載の画像処理装置の奏する効果に加え、音声データ入力手段により入力された音声データと伴奏発生手段にて発生された伴奏とが発音手段により発音される。また、各伴奏情報に対応した歌詞データが、発音手段により発音される伴奏に合わせて表示装置に表示される。そして、かかる歌詞データと共に、出力手段により出力された置換画像データが表示装置に表示される。よって、発音された伴奏情報に合わせつつ、表示される歌詞データを見ながら歌唱する環境を構築できる上、置換画像データによる演出を行うことができるという効果がある。 According to the image processing apparatus of the fourth aspect , in addition to the effect produced by the image processing apparatus according to any one of the first to third aspects, the sound data input by the sound data input means and the accompaniment generation means are generated. The accompaniment is pronounced by the pronunciation means. In addition, lyrics data corresponding to each accompaniment information is displayed on the display device in accordance with the accompaniment sounded by the sound generation means. Then, the replacement image data output by the output means is displayed on the display device together with the lyrics data. Therefore, there is an effect that it is possible to construct an environment for singing while viewing the displayed lyric data while matching the sounded accompaniment information, and it is possible to produce an effect using the replacement image data.

一般に、音声データ入力手段と、伴奏情報発生手段と、発音手段と、歌詞データが表示される表示装置とを有する装置は、「カラオケ」と称されており、歌唱者の歌唱を補助する娯楽用の装置（機能）である。故に、「カラオケ」の機能と置換画像データを表示装置にて動画表示する機能を共に有する装置を提供することにより、「カラオケ」に新たな興趣を付加することができ、その娯楽性を向上させることができる。また、置換画像データは歌詞データと共に表示装置に表示されるので、置換画像データが表示装置に表示されても、歌詞が非表示となることはなく、歌唱者が歌唱不能となることを回避できるという効果がある。 Generally, an apparatus having an audio data input means, an accompaniment information generation means, a sound generation means, and a display device on which lyrics data is displayed is called “karaoke” and is used for entertainment for assisting a singer's singing. This is a device (function). Therefore, by providing a device having both the function of “Karaoke” and the function of displaying the replacement image data as a moving image on the display device, it is possible to add a new interest to “Karaoke” and improve its entertainment. be able to. Further, since the replacement image data is displayed on the display device together with the lyrics data, even if the replacement image data is displayed on the display device, the lyrics are not hidden, and it is possible to prevent the singer from being unable to sing. There is an effect.

請求項５記載の画像処理装置によれば、請求項１から４のいずれかに記載の画像処理装置の奏する効果に加え、音声データ入力手段により入力された音声データは、データ変換手段により回線に出力するデータ信号に変換される。また、置換画像データに対応する識別コードは、重畳手段により音声データのデータ信号に変換され、データ変換手段により変換されたデータ信号に重畳される。そして、出力手段により、その重畳されたデータ信号が回線に出力され、これにより置換画像データが相手側装置に出力される。 According to the image processing apparatus of the fifth aspect , in addition to the effect produced by the image processing apparatus according to any one of the first to fourth aspects, the audio data input by the audio data input means is connected to the line by the data conversion means. It is converted into a data signal to be output. Further, the identification code corresponding to the replacement image data is converted into a data signal of audio data by the superimposing unit and is superimposed on the data signal converted by the data converting unit. Then, the superimposed data signal is output to the line by the output means, whereby the replacement image data is output to the counterpart device.

よって、置換画像データを、ビットマップデータなどよりも大幅にデータ量の少ない識別コードで相手側装置に送信することができ、送信する置換画像データのデータ量を低減することができるという効果がある。これによれば、（データ量が少ないため）音声データの一部として置換画像データ（識別コード）を相手側装置に送出することができる。また、かかる識別コードを、音声データの一部として相手側装置に送出するので、送信側装置は音声データを送出する手段を備えておれば良く、わざわざ、画像データを送信する為の手段（置換画像データを画像データの態様で送信するための手段）を設ける必要がない。従って、装置コストを低減することができるという効果がある。 Therefore, the replacement image data can be transmitted to the counterpart apparatus with an identification code whose data amount is significantly smaller than that of bitmap data or the like, and the data amount of the replacement image data to be transmitted can be reduced. . According to this, the replacement image data (identification code) can be sent to the counterpart device as part of the audio data (because the data amount is small). Further, since the identification code is sent to the counterpart device as a part of the voice data, the sending device only needs to be provided with a means for sending the voice data. There is no need to provide means for transmitting image data in the form of image data. Therefore, there is an effect that the apparatus cost can be reduced.

請求項６記載の画像処理装置によれば、請求項５記載の画像処理装置の奏する効果に加え、送信側装置において、入力された音声データは、非再生領域として設定される非再生周波数成分が除去手段により除去される。そして、重畳手段により、置換画像データの識別コードが、除去手段により除去された非再生周波数成分の帯域のデータ信号に変換されて、データ変換手段により変換されたデータ信号に重畳される。 According to the image processing apparatus of the sixth aspect , in addition to the effect produced by the image processing apparatus of the fifth aspect , in the transmission side apparatus, the input audio data has a non-reproduction frequency component set as a non-reproduction area. It is removed by removing means. Then, the identification code of the replacement image data is converted into a data signal in the band of the non-reproduction frequency component removed by the removing unit, and is superimposed on the data signal converted by the data converting unit.

よって、送信側装置からの音声データを受信した相手側装置において、置換画像データの識別コードが音声データとして再生されることを、非実行とすることができるという効果がある。つまり、元々再生されない非再生周波数成分の音声データに置換画像データの識別コードは変換されているので、相手側装置においては、置換画像データの識別コードを含む音声データを受信しても、再生されるべき正当な（置換画像データの識別コードを除いた）音声データのみが再生される。従って、置換画像データがノイズとなって音声データで出力されてしまうが故に、必要な音声データの聞き取りが不能になるといった不具合を回避でき、相手側装置側における受話者に、違和感を抱かせることがないという効果がある。このため、受信した音声データから置換画像データの識別コードを分離して音声データの再生を行うという処理および装置を相手側装置に設ける必要がなく、相手側装置のコストを低減することができる。 Therefore, there is an effect that it is possible to make it non-executable that the identification code of the replacement image data is reproduced as audio data in the counterpart device that has received the audio data from the transmitting device. That is, since the replacement image data identification code is converted into the non-reproduction frequency component audio data that is not originally reproduced, even if the other device receives the audio data including the replacement image data identification code, it is reproduced. Only valid audio data (excluding the identification code of the replacement image data) to be reproduced is reproduced. Therefore, since the replacement image data is output as voice data as noise, it is possible to avoid a problem that it becomes impossible to hear the necessary voice data, and to make the receiver on the other device side feel uncomfortable. There is an effect that there is no. For this reason, it is not necessary to provide the other apparatus with a process and apparatus for reproducing the audio data by separating the identification code of the replacement image data from the received audio data, and the cost of the other apparatus can be reduced.

請求項７記載の画像処理装置によれば、請求項５または６に記載の画像処理装置の奏する効果に加え、送信側装置は、音声データ入力手段により入力された音声データについて、可聴限界を越えた限界周波数成分が、第２除去手段により除去される。そして、重畳手段により、置換画像データの識別コードが、第２除去手段により除去された限界周波数成分のデータ信号に変換されデータ変換手段により変換されたデータ信号に重畳される。 According to the image processing apparatus of the seventh aspect , in addition to the effect achieved by the image processing apparatus of the fifth or sixth aspect , the transmitting side apparatus exceeds the audible limit for the audio data input by the audio data input means. The marginal frequency component is removed by the second removing means. Then, the identification code of the replacement image data is converted by the superimposing unit into the data signal of the limit frequency component removed by the second removing unit and superimposed on the data signal converted by the data converting unit.

よって、送信側装置からの音声データを受信した相手側装置において、置換画像データのコードを含んだ状態で音声データが再生されても、その再生された音声帯域が可聴限界範囲を超えているので、かかる置換画像コードに起因するノイズを、相手側装置側における受話者に、極めて聴きとられ難いものとすることができるという効果がある。言い換えれば、再生される音声データの聞き取り性が悪いという不快感や違和感を受話者に抱かせることがない。従って、受信した音声データから置換画像データの識別コードを分離して音声データの再生を行うという処理および装置を相手側装置に設ける必要がなく、相手側装置のコストを低減することができる。 Therefore, even if the audio data is reproduced with the replacement image data included in the counterpart device that has received the audio data from the transmission side device, the reproduced audio band exceeds the audible limit range. Thus, there is an effect that the noise caused by the replacement image code can be made extremely difficult to be heard by the receiver on the counterpart device side. In other words, the listener does not feel uncomfortable or uncomfortable that the hearability of the reproduced audio data is poor. Therefore, it is not necessary to provide the counterpart apparatus with a process and apparatus for reproducing the audio data by separating the identification code of the replacement image data from the received audio data, and the cost of the counterpart apparatus can be reduced.

特に、不特定多数の相手側装置が想定される場合には、相手側装置が画像を表示する手段を備えず、音声データの送受信のみ行う装置（例えば通常の電話機など）であることも十分に考えられるが、かかる装置であっても、受信した音声データの再生に支障を来すことなく、良好な品質で音声出力を行わせることができる。 In particular, when an unspecified number of other-side devices are assumed, it is also sufficient that the other-side device is not provided with a means for displaying an image and is a device that only transmits and receives audio data (for example, a normal telephone). Although it is conceivable, even with such a device, it is possible to output sound with good quality without hindering reproduction of received sound data.

請求項８記載の画像処理装置によれば、請求項５から７のいずれかに記載の画像処理装置の奏する効果に加え、表示装置を備えた相手側装置を備えており、該相手側装置において、音声データ受信手段により音声データが受信される。受信された音声データは、音声再生手段により音声で再生される。ここで、受信された音声データに置換画像データに対応する識別コードのデータ信号が含まれている場合には、画像データ再生手段によりそのデータ信号から置換画像データが再生される。そして、再生された置換画像データが表示装置にて表示される。 According to the image processing device of the eighth aspect , in addition to the effect of the image processing device according to any one of the fifth to seventh aspects, the counterpart device provided with the display device is provided. The voice data is received by the voice data receiving means. The received audio data is reproduced as audio by the audio reproducing means. If the received audio data includes a data signal of an identification code corresponding to the replacement image data, the replacement image data is reproduced from the data signal by the image data reproduction means. Then, the reproduced replacement image data is displayed on the display device.

よって、置換画像データに対応する識別コードを含んでいる音声データを受信した相手側装置において、表示装置に的確に置換画像データを出力することができる。このため、送信側装置と相手側装置とにより、実際の映像を出力するテレビ電話とは異なる趣を持った画像によるコミュニケーションをはかることができるという効果がある。相手側装置にて表示される置換画像データは、送信側装置にて実際に撮影（入力）された画像データでない、加工（デフォルメ）されたものである。つまり、相手側装置では、送信側装置にて入力された画像データが（入力された画像データが再現された状態で）出力されるわけではない。従って、送信側装置の操作者（送話者）がカメラ等にて撮影されることを好まない場合や、諸事情により実際の映像を相手側に送信したくない場合には、実際の映像（入力された画像でデータ）をデフォルメした置換画像データにより、送話者は相手側に伝達したい内容を伝えることができる。ここで、置換画像データは、入力された画像データから形成されており、入力された画像データの特徴的部分を反映させることができるものであるので、相手側装置において、実際の映像（入力された画像データ）に代えて置換画像データが出力されても、自己の表現したい事項を画像を通じて、送話者から相手側に的確に伝達する高度なコミュニケーションを行うことができるという効果がある。 Therefore, in the counterpart device that has received the audio data including the identification code corresponding to the replacement image data, the replacement image data can be accurately output to the display device. For this reason, there is an effect that communication by an image having a taste different from that of a videophone that outputs an actual video can be performed between the transmission side device and the counterpart side device. The replacement image data displayed on the counterpart device is processed (deformed), not image data actually captured (input) on the transmitting device. That is, the counterpart device does not output the image data input by the transmitting device (in a state where the input image data is reproduced). Therefore, if the operator (speaker) of the transmission side apparatus does not like to be photographed by a camera or the like, or if it is not desired to transmit the actual video to the other side due to various circumstances, the actual video ( By using the replacement image data obtained by deforming the data) with the input image, the transmitter can transmit the contents to be transmitted to the other party. Here, the replacement image data is formed from the input image data and can reflect the characteristic part of the input image data. Even if the replacement image data is output instead of the image data), there is an effect that it is possible to perform high-level communication that accurately transmits the item that the user wants to express to the other party through the image.

請求項９記載の画像処理プログラムによれば、入力された画像データは、線画変換ステップにより、線画像データに変換される。また、その入力された画像データの画像フレーム又は線画変換ステップにより変換された線画像データを有する画像フレームは、分割ステップにより所定数のブロックに分割され、分割されたブロック単位で、線画像データの態様と近似する記号が、抽出ステップにより、複数の記号の画像データの中から抽出される。そして、抽出された記号の画像データで対応するブロックの線画像データを置換した置換画像データが、置換画像形成ステップにより形成される。 According to the image processing program of the ninth aspect , the input image data is converted into line image data by the line drawing conversion step. Further, the image frame of the input image data or the image frame having the line image data converted by the line drawing conversion step is divided into a predetermined number of blocks by the dividing step, and the line image data is divided into divided blocks. The symbol that approximates the aspect is extracted from the image data of the plurality of symbols by the extraction step. Then, replacement image data in which the line image data of the corresponding block is replaced with the extracted symbol image data is formed by the replacement image forming step.

よって、表示装置などに、入力された画像を記号の画像データで構成された画像に変換して出力し得、コミカルな娯楽性の高い画像による演出を提供することができる。更に、入力された画像が変換された線画から記号の画像データを形成するので、出力画像をシンプルな画像とすることができ、入力画像データ全体を記号データで構成した置換画像データに変換する場合に比べて、表示装置に表示する画像をシンプル且つシャープな画像とすることができるという効果がある。 Therefore, an input image can be converted into an image composed of symbol image data and output to a display device or the like, and an effect can be provided by a comical and highly entertaining image. Furthermore, since the image data of the symbol is formed from the line drawing obtained by converting the input image, the output image can be a simple image, and the entire input image data is converted to the replacement image data composed of the symbol data. As compared with the above, there is an effect that the image displayed on the display device can be a simple and sharp image.

また、入力された画像データから背景画像データとして入力された背景画像データとの共通部分を除いて、画像フレーム毎に固有の固有画像データが、固有画像データ抽出ステップにより抽出される。その固有画像データ抽出ステップにより抽出された固有画像データは、線画変換ステップにより線画像データに変換され、画像形成ステップにより、その固有画像データから変換された線画像データから置換画像データが形成される。 Also, the unique image data unique to each image frame is extracted by the unique image data extraction step, except for the common part with the background image data inputted as background image data from the inputted image data. The unique image data extracted by the unique image data extraction step is converted into line image data by the line drawing conversion step, and the replacement image data is formed from the line image data converted from the unique image data by the image forming step. .

よって、背景画像部分を省略して置換画像データを形成することができるので、画像データの全てから置換画像データを形成する場合に比べて、置換画像データの形成速度を高速にすることができるという効果がある。また、形成された置換画像データを表示装置等へ出力する場合に、そのデータ出力速度を高速にすることができるという効果がある。更に背景画像データを除去することにより、置換画像データへの変換元の画像データの猥雑さを軽減することができる。従って、表示装置に出力する場合に、簡素な置換画像データを出力でき、出力される画像が何を表現したものであるか不明確（識別困難）になることを回避できるという効果がある。背景画像データが含まれた猥雑（複雑）な画像データである場合、個々のオブジェクトの境界が不明確になりかねず、かかる画像データを置換画像データに変換すると、場合によっては、個々のオブジェクトとは異なる形状に分割あるいは連結された画像が出現しかねない。しかし、（背景画像データが除去された）固有画像データが変換された線画像データから置換画像データを形成するので、かかる不具合を軽減し、簡素で明瞭な置換画像データを、表示装置に表示できるのである。 Therefore, since the replacement image data can be formed by omitting the background image portion, the replacement image data can be formed at a higher speed than the case where the replacement image data is formed from all the image data. effective. Further, when the formed replacement image data is output to a display device or the like, the data output speed can be increased. Further, by removing the background image data, it is possible to reduce the complexity of the original image data converted into the replacement image data. Therefore, when outputting to a display device, simple replacement image data can be output, and it is possible to avoid unclear (difficult to identify) what the output image represents. In the case of complicated (complex) image data including background image data, the boundaries of individual objects may be unclear, and when such image data is converted into replacement image data, May appear in an image divided or connected into different shapes. However, since the replacement image data is formed from the line image data obtained by converting the unique image data (with the background image data removed), it is possible to reduce such problems and display simple and clear replacement image data on the display device. It is.

また、分割ステップにより画像フレームを分割した場合に、その画像フレーム内において線画像データまたは固有画像データを有するブロック数が予め定めた下限値以下であると、画像フレーム内において線画像データまたは固有画像データを有するブロック数を増加させる方向に、線画像データまたは固有画像データが伸張される。そして、伸張後の画像データに基づいて置換画像データが形成される。 Further, when an image frame is divided by the dividing step, if the number of blocks having line image data or unique image data in the image frame is equal to or less than a predetermined lower limit value, the line image data or unique image in the image frame Line image data or unique image data is expanded in the direction of increasing the number of blocks having data. Then, replacement image data is formed based on the decompressed image data.

請求項１０記載の画像処理プログラムによれば、請求項９記載の画像処理プログラムの奏する効果に加え、抽出元の画像フレームに対して抽出された固有画像データの占める割合が所定の割合以下であると判断ステップにより判断された場合に、固有画像データを有するブロック数が予め定めた下限値以下であるとして固有画像データの伸張が行われれる。よって、伸張するべき固有画像データの選別を的確に行うことができるという効果がある。 According to claim 10, wherein the image processing program, in addition to the effects of the image processing program according to claim 9, wherein, the ratio of specific image data extracted on the extracted original image frame is less than a predetermined ratio When the determination step determines that the number of blocks having unique image data is equal to or less than a predetermined lower limit value, the unique image data is expanded. Therefore, there is an effect that the specific image data to be expanded can be accurately selected.

固有画像データの大きさをブロック数で測る（ブロック単位でカウントする）場合には、ブロック全面にドットが配されていなくとも１カウントされる。このため、ブロック単位で固有画像データの大きさを測る場合、簡便である一方、測定結果が実際の大きさよりも大きい側にシフトしがちになり、場合によっては、伸張する必要のある固有画像データが伸張されないという事態が発生しかねない。しかし、抽出元の画像フレームに対して抽出された固有画像データの占める割合が所定の割合以下である場合に、固有画像データの伸張を行うので、的確に伸張の必要のある固有画像データを的確に選別することができるのである。 When the size of the unique image data is measured by the number of blocks (counted in units of blocks), 1 is counted even if no dots are arranged on the entire block. Therefore, when measuring the size of the unique image data in units of blocks, it is convenient, but the measurement result tends to shift to a larger side than the actual size, and in some cases, the unique image data that needs to be expanded There is a possibility that the situation will not be expanded. However, since the unique image data is expanded when the ratio of the extracted unique image data to the extraction source image frame is equal to or less than a predetermined ratio, the unique image data that needs to be accurately decompressed is accurately detected. Can be sorted into

以下、本発明の好ましい実施例について、添付図面を参照して説明する。図１は、本発明の画像処理装置としてのカラオケ装置１を示す概略図である。このカラオケ装置１は、画像処理部としての本体１ａと、表示装置としてのＣＲＴディスプレイ２と、本体１ａを遠隔操作するための電子早見本３０とを備えており、本体１ａとＣＲＴディスプレイ２とはケーブル４を介して接続されている。該カラオケ装置１は、入力された画像データを記号で構成されたアスキー画像に変換する画像処理を実行し、変換されたアスキー画像をＣＲＴディスプレイ２に出力することにより演出を実行する機能を備えている。 Hereinafter, preferred embodiments of the present invention will be described with reference to the accompanying drawings. FIG. 1 is a schematic diagram showing a karaoke apparatus 1 as an image processing apparatus of the present invention. The karaoke apparatus 1 includes a main body 1a as an image processing unit, a CRT display 2 as a display device, and an electronic quick sample 30 for remotely operating the main body 1a. The main body 1a and the CRT display 2 are They are connected via a cable 4. The karaoke apparatus 1 has a function of executing an effect by executing image processing for converting input image data into an ASCII image composed of symbols, and outputting the converted ASCII image to a CRT display 2. Yes.

また、カラオケ装置１は、いわゆる通信カラオケであって、回線を介して接続されるカラオケサーバから配信された楽曲データ（伴奏情報）や歌詞データを、本体１ａ内に記憶している。そして、電子早見本３０から、歌唱者により選曲された楽曲を示す楽曲コードが送信されると、本体１ａ内から対応する楽曲データと歌詞データとを読出し、楽曲データに基づいて伴奏を自動演奏するとともに、その楽曲の進行に合わせて歌詞と映像（またはアスキー画像）とをＣＲＴディスプレイ２に表示するように構成されている。 The karaoke apparatus 1 is a so-called communication karaoke, and stores music data (accompaniment information) and lyrics data distributed from a karaoke server connected via a line in the main body 1a. When the music code indicating the music selected by the singer is transmitted from the electronic quick sample 30, the corresponding music data and lyrics data are read from the main body 1a, and the accompaniment is automatically played based on the music data. At the same time, lyrics and video (or ASCII images) are displayed on the CRT display 2 as the music progresses.

このカラオケ装置１の本体１ａは、制御部５と、該制御部５に接続されるＣＣＤカメラ３とマイクロフォン２３とを備えている。制御部５のケーシング前面には、数値を入力するためのテンキーや、各種コマンドを制御部５に入力するコマンドボタンを備えた操作パネル２０が配設されている。かかるテンキーやコマンドボタンの操作により、使用者は、選曲や、演奏テンポの設定、音程の設定、音量の設定、リバーブなどの音響効果の設定や、文字の入力などを行うことができる。また、該操作パネル２０には、液晶表示器（以下単に「ＬＣＤ」と称す）が設けられており、テンキーにより入力された数値やコマンドボタンにより入力されたコマンドの内容、電子早見本３０から送信されたコマンドの内容、更には、カラオケ装置１の動作状態等がこのＬＣＤに表示される。 The main body 1 a of the karaoke apparatus 1 includes a control unit 5, a CCD camera 3 and a microphone 23 connected to the control unit 5. An operation panel 20 having a numeric keypad for inputting numerical values and command buttons for inputting various commands to the control unit 5 is disposed on the casing front surface of the control unit 5. By operating the numeric keypad and the command button, the user can perform music selection, performance tempo setting, pitch setting, volume setting, sound effect setting such as reverb, character input, and the like. In addition, the operation panel 20 is provided with a liquid crystal display (hereinafter simply referred to as “LCD”), which transmits a numerical value input by a numeric keypad, a command content input by a command button, and an electronic quick sample 30. The contents of the received command and the operation status of the karaoke apparatus 1 are displayed on the LCD.

操作パネル２０の下方には、電子早見本３０との間で赤外線データによる無線通信を行うための無線通信部２６が配設されている。電子早見本３０から赤外線データで送信される各種コマンドや楽曲コードなどのデータは、この無線通信部２６により受信される。この無線通信部２６により、電子早見本３０からの楽曲コードや所定の処理の実行を指示するコマンドを受信した場合には、本体１ａは、その楽曲コードやコマンドに基づいた処理を実行する。無線通信部２６の右方には、入力端子を介してマイクロフォン２３が接続されており、歌唱者の発声は、かかるマイクロフォン２３から制御部５内に入力される。 Below the operation panel 20, a wireless communication unit 26 is provided for performing wireless communication using infrared data with the electronic sample 30. Data such as various commands and music codes transmitted as infrared data from the electronic quick sample 30 are received by the wireless communication unit 26. When the wireless communication unit 26 receives a music code from the electronic quick sample 30 or a command instructing execution of a predetermined process, the main body 1a executes a process based on the music code or the command. A microphone 23 is connected to the right side of the wireless communication unit 26 via an input terminal, and the utterance of the singer is input from the microphone 23 into the control unit 5.

ＣＣＤカメラ３は、連続的に画像を撮像する一般的な撮像装置であり、ＣＲＴディスプレイ２の筐体上面に設置されている。ＣＣＤカメラ３により撮像された画像（画像データ）は、非図示のケーブルを介して制御部５内に入力される。ＣＣＤカメラ３内には、Ａ／Ｄコンバータなどが内蔵されており、制御部５には、デジタルデータに変換された画像データが入力される。制御部５は、このＣＣＤカメラ３により入力された画像データに基づいて、アスキー画像を作成する。 The CCD camera 3 is a general imaging device that continuously captures images, and is installed on the upper surface of the casing of the CRT display 2. An image (image data) picked up by the CCD camera 3 is input into the control unit 5 via a cable (not shown). The CCD camera 3 includes an A / D converter and the like, and image data converted into digital data is input to the control unit 5. The control unit 5 creates an ASCII image based on the image data input by the CCD camera 3.

電子早見本３０は、楽曲の選択および選択した楽曲の指定（楽曲コードの本体１ａへの送信）に加え、本体１ａ（制御部５）を遠隔操作（演奏の中止やスキップ等）するための装置であり、タッチパネルディスプレイ４０を備えた表示本体３０ａと、該表示本体３０ａを立設した状態で維持するためのクレイドル４４とを備えている。表示本体３０ａには、本体１ａとの間で赤外線データによる無線通信を行うための無線通信部３６が、その筐体上部に設けられている。電子早見本３０へ送信される楽曲コード等のデータや、所定の処理の実行を指示するコマンドは、この無線通信部３６から赤外線データで出力される。 The electronic sample book 30 is an apparatus for remotely operating the main body 1a (control unit 5) (stopping or skipping performances) in addition to selecting a music piece and specifying the selected music piece (sending the music code to the main body 1a). And a display main body 30a having a touch panel display 40, and a cradle 44 for maintaining the display main body 30a in a standing state. The display main body 30a is provided with a wireless communication unit 36 for wireless communication using infrared data with the main body 1a at the top of the casing. Data such as a music code transmitted to the electronic quick sample 30 and a command for instructing execution of a predetermined process are output from the wireless communication unit 36 as infrared data.

また、表示本体３０ａは、案内画像を表示するタッチパネルディスプレイ４０を備えている。タッチパネルディスプレイ４０は、画像を表示するＬＣＤで構成された表示装置４１と、その表示装置４１上に設けられ押下により信号入力を実行するタッチパネル４２とを備えている。 The display body 30a includes a touch panel display 40 that displays a guide image. The touch panel display 40 includes a display device 41 configured by an LCD that displays an image, and a touch panel 42 that is provided on the display device 41 and executes signal input when pressed.

案内画像は、選曲のための各種データや、選曲の為の各種操作を行うための操作ボタンを表示する画像であり、使用者が視覚的に電子早見本３０の操作を理解することができるように構成されている。尚、案内画像（即ち表示装置４１）に表示された各操作ボタンの押下は、表示装置４１上に設けられたタッチパネル４２により、その押下位置に対応する電気信号として入力される。これにより、押下された操作ボタンの入力としてＣＰＵ１２（図２参照）に認識され、対応する動作（コマンド入力）が実行される。 The guide image is an image displaying various data for music selection and operation buttons for performing various operations for music selection, so that the user can visually understand the operation of the electronic quick sample 30. It is configured. Note that pressing of each operation button displayed on the guide image (that is, the display device 41) is input as an electrical signal corresponding to the pressed position by the touch panel 42 provided on the display device 41. As a result, the CPU 12 (see FIG. 2) recognizes the input of the pressed operation button, and the corresponding operation (command input) is executed.

具体的には、この案内画像は、複数画面で構成されており、各画面において、カテゴリや歌手毎に分類された項目や、その項目の選択によって表示される曲名などが表示される。各曲名の表示は、曲名を指定するための操作ボタンとなっており、この曲名の表示が押下されることにより選曲がなされる（楽曲データの指定）。この選曲が実行される画面内には、アスキーアートボタン４２ａが表示されている。 Specifically, this guidance image is composed of a plurality of screens, and on each screen, an item classified for each category and singer, a song name displayed by selection of the item, and the like are displayed. The display of each song name is an operation button for designating the song name, and the song selection is made by pressing this song name display (designation of song data). An ASCII art button 42a is displayed in the screen on which the music selection is executed.

アスキーアートボタン４２ａは、本体１ａにアスキー画像による演出を指示するためのものである。このアスキーアートボタン４２ａが、その画面上から（タッチパネル４２を介して）押下されると、選曲された楽曲コードに、アスキー画像による演出を本体１ａに指示するアスキーアートコードが付加されて、本体１ａに送信される。本体１ａでは、受信した楽曲コードにアスキーアートコードが付加されていると、かかる楽曲再生中のＣＲＴディスプレイ２における演出を、予め記憶された映像に代えて、アスキー画像を表示することにより実行する。 The ASCII art button 42a is for instructing the main body 1a to produce an ASCII image. When the ASCII art button 42a is pressed from the screen (via the touch panel 42), an ASCII art code for instructing the main body 1a to produce an ASCII image is added to the selected music code and transmitted to the main body 1a. Is done. In the main body 1a, when an ASCII art code is added to the received music code, the effect on the CRT display 2 during the reproduction of the music is executed by displaying an ASCII image instead of the previously stored video.

更に、操作ボタンとしては、楽曲の選択等、電子早見本３０においての各処理を実行させるボタンに加え、本体１ａを操作するためのボタンが各種設けられている。かかる操作ボタンの操作（表示された操作ボタンの画面上からの押下）により、本体１ａの操作パネル２０に備えられたボタン操作と同様に、選曲や、演奏テンポの設定、音程の設定、音量の設定、リバーブなどの音響効果の設定や、文字の入力などを行うことができる。つまり、上記の本体１ａの操作パネル２０の機能の一部は、この電子早見本３０による遠隔操作によっても実行される。 Furthermore, as the operation buttons, various buttons for operating the main body 1a are provided in addition to buttons for executing various processes in the electronic quick sample 30 such as selection of music. Similar to the button operation provided on the operation panel 20 of the main body 1a by the operation of the operation button (pressing the displayed operation button on the screen), the music selection, performance tempo setting, pitch setting, volume You can set sound effects such as setting and reverb, and input characters. That is, some of the functions of the operation panel 20 of the main body 1a are also executed by remote operation using the electronic quick sample 30.

クレイドル４４は、表示本体３０ａの下部と左右との三方を支持する矩形状に形成されており、このクレイドル４４と表示本体３０ａとは着脱可能に構成されている。また、クレイドル４４は、電子早見本３０と本体１ａとの接続架台であり、かかるクレイドル４４によって、本体１ａに取り付けられた電子早見本３０は、有線により本体１ａと電気的に接続される。このクレイドル４４を介した有線接続により、電子早見本３０に内蔵されたバッテリに対する充電が実行される。また、本体１ａから電子早見本３０に、新たな楽曲に対応する楽曲コードや曲名などのデータを送信することができる。 The cradle 44 is formed in a rectangular shape that supports the lower side and the left and right sides of the display main body 30a, and the cradle 44 and the display main body 30a are configured to be detachable. The cradle 44 is a connection frame between the electronic quick sample 30 and the main body 1a, and the electronic quick sample 30 attached to the main body 1a is electrically connected to the main body 1a by wire. By the wired connection via the cradle 44, the battery built in the electronic quick sample 30 is charged. Further, data such as a music code and a music title corresponding to a new music can be transmitted from the main body 1a to the electronic quick sample 30.

図２は、上記のように構成されたカラオケ装置１の電気的構成を示すブロック図である。カラオケ装置１は、本体１ａと、ＣＲＴディスプレイ２と、電子早見本３０とを備え、本体１ａは、更に、制御部５とＣＣＤカメラ３とマイクロフォン２３とを備えている。 FIG. 2 is a block diagram showing an electrical configuration of the karaoke apparatus 1 configured as described above. The karaoke apparatus 1 includes a main body 1a, a CRT display 2, and an electronic quick sample 30. The main body 1a further includes a control unit 5, a CCD camera 3, and a microphone 23.

制御部５は、演算装置であるＣＰＵ１１によりカラオケ装置１の制御を行うものであり、ＣＰＵ１１と、ＲＯＭ１２と、ＲＡＭ１３と、画像用ＲＡＭ２７と、ハードディスク１４と、モデム１５と、ディスクプレイヤー１６と、映像処理部１７と、入出力ポート（Ｉ／Ｏ）１９と、音源であるシンセサイザを備えた伴奏装置２１と、電子早見本３０（クレイドル４４）を有線接続するためのＬＡＮポート２５と、電子早見本３０からの赤外線データを受信するための無線通信部２６とを備えている。これらの各デバイスはＣＰＵ１１に接続され、ＣＰＵ１１により制御される。また、制御部５は、ディスクプレイヤー１６と映像処理部１７とに接続されたビデオミキサ１８と、入出力ポート（Ｉ／Ｏ）１９に接続された操作パネル２０と、伴奏装置２１に接続されたアンプミキサ２２と、アンプミキサ２２に接続されたスピーカ２４とを備えている。また、入出力ポート（Ｉ／Ｏ）１９には、ＣＣＤカメラ３が接続されている。 The control unit 5 controls the karaoke device 1 by the CPU 11 which is an arithmetic device. The CPU 11, ROM 12, RAM 13, image RAM 27, hard disk 14, modem 15, disk player 16, video A processing unit 17, an input / output port (I / O) 19, an accompaniment device 21 having a synthesizer as a sound source, a LAN port 25 for wired connection of the electronic sample 30 (cradle 44), and an electronic sample And a wireless communication unit 26 for receiving infrared data from 30. Each of these devices is connected to the CPU 11 and controlled by the CPU 11. The control unit 5 is connected to a video mixer 18 connected to the disc player 16 and the video processing unit 17, an operation panel 20 connected to an input / output port (I / O) 19, and an accompaniment device 21. An amplifier mixer 22 and a speaker 24 connected to the amplifier mixer 22 are provided. The CCD camera 3 is connected to the input / output port (I / O) 19.

ＣＰＵ１１は、電子早見本３０や操作パネル２０から楽曲コードが入力された場合、その楽曲コードを、ＲＡＭ１３の所定のエリアに設けられた予約曲リストエリアに登録したり、その予約曲リストエリアに記憶される楽曲コードを登録順に読み出して、楽曲の再生出力を制御するものである。尚、後述するカラオケ再生処理（図４参照）では、かかる予約曲リストエリアに楽曲コードが登録されていると、楽曲コードの入力有りと判断して、楽曲を再生する処理を実行する。楽曲が再生されると、対応する楽曲コードは、予約曲リストエリアから削除される。また、ＣＰＵ１１は、楽曲の演奏時に行われる演出の１つとして、ＣＣＤカメラ３により撮像された画像からアスキー画像を形成し、ＣＲＴディスプレイ２に出力する制御を行う。 When the music code is input from the electronic sample 30 or the operation panel 20, the CPU 11 registers the music code in a reserved music list area provided in a predetermined area of the RAM 13 or stores it in the reserved music list area. Are read out in the order of registration, and the reproduction output of the music is controlled. In the karaoke playback process (see FIG. 4), which will be described later, if a music code is registered in the reserved music list area, it is determined that a music code has been input, and a process of playing a music is executed. When the music is played, the corresponding music code is deleted from the reserved music list area. Further, the CPU 11 performs control to form an ASCII image from an image captured by the CCD camera 3 and output it to the CRT display 2 as one of the effects performed when the music is played.

ＲＯＭ１２は、ＣＰＵ１１により実行される各種のプログラムや固定値などを記憶した不揮発性のメモリである。 The ROM 12 is a nonvolatile memory that stores various programs executed by the CPU 11 and fixed values.

ＲＡＭ１３は、ＲＯＭ１２内に記憶されるプログラムや後述するハードディスク１４に記憶される制御プログラムの実行に際して各種のデータ等を一時的に記憶するためのメモリであり、背景画像メモリ１３ａと、アスキー画像バッファ１３ｂと、アスキーアートフラグ１３ｃとを備えている。 The RAM 13 is a memory for temporarily storing various data and the like when executing a program stored in the ROM 12 or a control program stored in the hard disk 14 described later, and includes a background image memory 13a and an ASCII image buffer 13b. And an ASCII art flag 13c.

背景画像メモリ１３ａは、背景画像としてＣＣＤカメラ３により撮像（取得）された画像データを記憶するためのメモリである。背景画像は、アスキー画像による１の演出、即ち１の楽曲の再生が実行される毎に、アスキー画像による演出に先だってＣＣＤカメラ３により撮像される。新たに背景画像が撮像されると、この背景画像メモリ１３ａに記憶される背景画像の画像データは、新たに撮像された背景画像の画像データによって更新される。 The background image memory 13a is a memory for storing image data captured (acquired) by the CCD camera 3 as a background image. The background image is picked up by the CCD camera 3 prior to the production by the ASCII image every time one production by the ASCII image, that is, reproduction of one music piece is executed. When a new background image is captured, the image data of the background image stored in the background image memory 13a is updated with the image data of the newly captured background image.

アスキー画像の形成に際しては、入力された画像データから、背景画像メモリ１３ａに記憶される背景画像の画像データに対応する部分の画像データが除かれ、背景画像の画像データが除かれた残りの部分（人物画像データ、固有画像データ）がアスキー画像に変換される。 When forming the ASCII image, the image data of the portion corresponding to the image data of the background image stored in the background image memory 13a is removed from the input image data, and the remaining portion of the image data of the background image is removed. (Person image data, unique image data) is converted into an ASCII image.

ＣＣＤカメラ３は、通常の撮像において、その撮像範囲に存在する各種のオブジェクトの画像データが、各オブジェクトが詳細に識別し得る解像度で、即ち、良好な画像品質を実現する画素数で取得されるように設計されている。このため、ＣＣＤカメラ３の設置場所によっては、取得される画像データが複雑なもの（複数のオブジェクトが混在する画像）となる。かかる複雑な画像データをアスキー画像に変換する場合には、入り組んだオブジェクトの輪郭線の処理が不的確になりかねず、個々のオブジェクトがアスキー画像にて的確に表現されないという事態が発生しかねない。しかし、取得された画像データから背景画像を除いた画像データ（人物画像データ）を、アスキー画像に変換するので、変換元の画像データを簡素にすることができる。このため、形成されるアスキー画像をオブジェクト（本実施の形態では主として人物）が的確かつ明確に示された画像とすることができる。 In normal imaging, the CCD camera 3 acquires image data of various objects existing in the imaging range at a resolution that allows each object to be identified in detail, that is, with a number of pixels that realizes good image quality. Designed to be For this reason, depending on the installation location of the CCD camera 3, the acquired image data is complicated (an image in which a plurality of objects are mixed). When converting such complex image data into an ASCII image, the processing of the contour lines of complicated objects may become inaccurate, and a situation may occur in which individual objects are not accurately represented in the ASCII image. . However, since the image data (person image data) obtained by removing the background image from the acquired image data is converted to the ASCII image, the conversion source image data can be simplified. Therefore, the ASCII image to be formed can be an image in which an object (mainly a person in the present embodiment) is accurately and clearly shown.

具体的には、背景画像以外の部分とは、例えば、ＣＣＤカメラ３による背景画像の撮像後、ＣＣＤカメラ３の撮像範囲に入ったオブジェクトの画像であり、本実施の形態では、人物（歌唱者）の画像である。つまり、本実施の形態では、アスキー画像への変換対象を人物（動きのある動体）の画像としており、変化の乏しい背景画像に比べてかかる人物画像（背景画像以外の部分）は、一連の撮像での特徴的部分となる。故に、一連の撮像を通して共通する変化の乏しい背景画像の画像データを除去して、その残りの部分である人物画像データにてアスキー画像を形成しても、演出の意図を大きく変化させることはないばかりか、画像の特徴的部分に絞り込んだ単純明快なアスキー画像をＣＲＴディスプレイ２に表示することができる。 Specifically, the portion other than the background image is, for example, an image of an object that has entered the imaging range of the CCD camera 3 after the background image is captured by the CCD camera 3. In this embodiment, a person (singer) ). In other words, in the present embodiment, an object to be converted to an ASCII image is an image of a person (moving moving object), and the person image (portion other than the background image) is captured in a series of images as compared to a background image with little change. It becomes a characteristic part in. Therefore, removing the image data of the background image with little change common through a series of imaging, and forming the ASCII image with the remaining person image data, does not greatly change the intention of the production. In addition, a simple and clear ASCII image narrowed down to a characteristic part of the image can be displayed on the CRT display 2.

アスキー画像バッファ１３ｂは、後述するアスキー変換処理（Ｓ２８、図６参照）において、形成されたアスキー画像を一時的に記憶するためのメモリである。アスキー画像は、記号データを組み合わせて形成された画像であり、即ち、巨視的に画像の態様を成す記号の集合体である。詳細には、アスキー画像は、アスキー変換処理（Ｓ２８）にて、変換元の線画像データをブロックに分割した後、各ブロックの分割画像を、近似する態様の記号データに置換（嵌め込み）して形成される。置換された記号データは、ブロックの位置に従って、順に、このアスキー画像バッファ１３ｂに記憶される。 The ASCII image buffer 13b is a memory for temporarily storing the ASCII image formed in the ASCII conversion process (S28, see FIG. 6) described later. The ASCII image is an image formed by combining symbol data, that is, a collection of symbols that macroscopically form an image. Specifically, the ASCII image is divided (inserted) into symbol data in an approximate form after dividing the line image data of the conversion source into blocks in the ASCII conversion process (S28). It is formed. The replaced symbol data is sequentially stored in the ASCII image buffer 13b in accordance with the block position.

ここで、アスキー画像バッファ１３ｂには、ブロックの位置に従って、順に記号データが記憶され、画像フレームの端部に達した場合には改行コードが記憶される。このため、アスキー画像バッファ１３ｂには、変換前の線画像データの画像フレームの座標に対応付けて、記号データが記憶されることとなる。その結果、記憶された記号データを、アスキー画像バッファ１３ｂから順に取り出すことにより、元画像に対応したアスキー画像を出力することができる。尚、アスキー画像バッファ１３ｂに記憶される記号データは、記号を示す記号コードであっても良く、記号コードが画像に展開されたドットデータ（記号の画像データ）であっても良い。 Here, the ASCII image buffer 13b stores the symbol data in order according to the position of the block, and stores the line feed code when the end of the image frame is reached. For this reason, the ASCII image buffer 13b stores symbol data in association with the coordinates of the image frame of the line image data before conversion. As a result, an ASCII image corresponding to the original image can be output by sequentially extracting the stored symbol data from the ASCII image buffer 13b. The symbol data stored in the ASCII image buffer 13b may be a symbol code indicating a symbol, or may be dot data (symbol image data) obtained by expanding the symbol code into an image.

アスキーアートフラグ１３ｃは、選曲された楽曲に対応する演出を、その楽曲データに対応して予め記憶されている映像で実行するか、アスキー画像で実行するかを識別するためのフラグである。このアスキーアートフラグ１３ｃは、電子早見本３０から送信された楽曲コードにアスキーアートコードが付加されていた場合にオンされ、入力された楽曲コードに対応する楽曲の再生が終了するとオフされる。 The ASCII art flag 13c is a flag for identifying whether an effect corresponding to the selected music is executed with a video stored in advance corresponding to the music data or an ASCII image. The ASCII art flag 13c is turned on when the ASCII art code is added to the music code transmitted from the electronic quick sample 30, and is turned off when the reproduction of the music corresponding to the inputted music code is completed.

画像用ＲＡＭ２７は、ＤＲＡＭまたはＳＤＲＡＭなどで構成された大容量のＲＡＭであり、画像データを一時的に記憶するものである。背景画像撮影後にＣＣＤカメラ３から入力された画像データ（背景と人物とが撮像された画像）は、１の画像フレーム毎に、この画像用ＲＡＭ２７に記憶される。記憶された画像データは、アスキー画像を作成する際に、１の画像フレーム毎に、この画像用ＲＡＭ２７から読み出されてアスキー画像へと変換される。 The image RAM 27 is a large-capacity RAM composed of DRAM, SDRAM, or the like, and temporarily stores image data. Image data (an image in which a background and a person are captured) input from the CCD camera 3 after background image shooting is stored in the image RAM 27 for each image frame. The stored image data is read out from the image RAM 27 and converted into an ASCII image for each image frame when an ASCII image is created.

ハードディスク１４は、書換可能な大容量の不揮発性のメモリであり、本カラオケ装置１を制御する制御プログラム１４ａと、記号データメモリ１４ｂと、楽曲データメモリ１４ｃと、歌詞データメモリ１４ｄとを備えている。尚、図４〜図６に示したフローチャートのプログラムは、制御プログラム１４ａの一部として格納されている。 The hard disk 14 is a rewritable large-capacity non-volatile memory, and includes a control program 14a for controlling the karaoke apparatus 1, a symbol data memory 14b, a music data memory 14c, and a lyrics data memory 14d. . The programs of the flowcharts shown in FIGS. 4 to 6 are stored as a part of the control program 14a.

記号データメモリ１４ｂは、記号の画像データ（記号データ）を記憶するメモリであり、各記号を示す記号コードに対応付けられて、各記号の画像がドットデータで記憶されている。アスキー画像の作成時には、この記号データメモリ１４ｂが参照され、線画像データに近似する記号のドットデータがテンプレートマッチングにより抽出されて、アスキー画像が形成される。尚、本カラオケ装置１においては、記号コードには、ＡＳＣＩＩコードに準じたＪＩＳ７単位符号（８ビットのデータで表現されるコード）が採用されている。尚、コード体系は、ＪＩＳ７単位符号に限られるものでなく、例えば、他のＡＳＣＩＩコードや、ＥＢＣＤＩＣコード、Ｕｎｉｃｏｄｅ、更には、ＪＩＳ漢字コードなどを適宜用いることができる。 The symbol data memory 14b is a memory for storing symbol image data (symbol data), and stores an image of each symbol as dot data in association with a symbol code indicating each symbol. When creating an ASCII image, the symbol data memory 14b is referred to, and dot data of a symbol that approximates line image data is extracted by template matching to form an ASCII image. In the present karaoke apparatus 1, a JIS 7 unit code (a code expressed by 8-bit data) conforming to the ASCII code is employed as the symbol code. The code system is not limited to JIS7 unit codes, and other ASCII codes, EBCDIC codes, Unicodes, and JIS kanji codes can be used as appropriate.

楽曲データメモリ１４ｃは、再生する楽曲の楽曲データを記憶するメモリである。各楽曲データは、１の楽曲コードを指定する楽曲コードのそれぞれに対応して記憶されている。楽曲コードが入力されると、その楽曲コードに対応する１の楽曲データ（ＭＩＤＩ（Musical Instrument Digital Interface）形式のデータや、音符長を示すデータなど）が、この楽曲データメモリ１４ｃから読み出され、後述する伴奏装置２１に出力される。 The music data memory 14c is a memory for storing music data of music to be reproduced. Each piece of music data is stored corresponding to each piece of music code that specifies one music code. When a music code is input, one music data (MIDI (Musical Instrument Digital Interface) format data, data indicating a note length, etc.) corresponding to the music code is read from the music data memory 14c. It is output to an accompaniment device 21 to be described later.

歌詞データメモリ１４ｄは、楽曲の歌詞データ（タイトルデータを含む）を記憶するメモリである。各歌詞データは、１の楽曲コードを指定する楽曲コードのそれぞれに対応して、テキストデータで記憶されている。また、この歌詞データメモリ１４ｄに記憶される歌詞データは、ＪＩＳ漢字コードで形成されている。楽曲コードが入力されると、ＣＰＵ１１により、この歌詞データメモリ１４ｄから対応する歌詞データ（タイトルデータを含む）が読み出されると共に、漢字コードに対応してハードディスク１４の所定の領域に記憶された文字の画像データに、読み出した歌詞データは展開され、歌詞文字画像データ（ドットデータ）が生成される。 The lyric data memory 14d is a memory for storing lyric data (including title data) of music. Each lyric data is stored as text data corresponding to each music code designating one music code. The lyric data stored in the lyric data memory 14d is formed of JIS kanji codes. When the music code is input, the CPU 11 reads out the corresponding lyric data (including title data) from the lyric data memory 14d, and the character stored in the predetermined area of the hard disk 14 corresponding to the kanji code. The read lyric data is expanded into image data, and lyric character image data (dot data) is generated.

新曲等のリリースにより、歌唱者が所望する楽曲は変化するため、楽曲データメモリ１４ｃに記憶される楽曲データと、歌詞データメモリ１４ｄに記憶される歌詞データとは、定期的に更新される。本カラオケ装置１は、上記したように通信カラオケであり、回線を介して楽曲データ等を配信するカラオケサーバに接続されると共に、本カラオケ装置１からカラオケサーバにアクセスし、楽曲データと、その楽曲データに対応する歌詞データとを（場合によっては楽曲データに対応する映像データも）、１の楽曲コードに対応付けられた一連のデータとしてダウンロードすることができるように構成されている。ダウンロードされた楽曲データは、この楽曲データメモリ１４ｃに書き加えられる（追加して記憶される）。また、歌詞データについては、歌詞データメモリ１４ｄに書き加えられる（追加して記憶される）。 Since the song desired by the singer changes due to the release of a new song or the like, the song data stored in the song data memory 14c and the lyrics data stored in the lyrics data memory 14d are regularly updated. As described above, the karaoke apparatus 1 is a communication karaoke, and is connected to a karaoke server that distributes music data and the like via a line. The lyrics data corresponding to the data (and video data corresponding to the music data in some cases) can be downloaded as a series of data associated with one music code. The downloaded music data is added to the music data memory 14c (added and stored). Also, the lyrics data is added to the lyrics data memory 14d (added and stored).

モデム１５は、変復調装置であり、アナログ信号とデジタル信号との変換を行う装置である。また、伝送制御用の各種手順信号を送受信するものである。カラオケ装置１から回線に出力されるデータ（例えば、カラオケサーバへのアクセス要求など）は、このモデム１５によってデジタル信号からアナログ信号に変換された後に回線に出力される。また、回線から受信したデータ（カラオケサーバからダウンロードされる楽曲データや歌詞データなど）は、このモデム１５によりアナログ信号からデジタル信号に変換される。 The modem 15 is a modulation / demodulation device that converts an analog signal and a digital signal. It also transmits and receives various procedure signals for transmission control. Data output from the karaoke device 1 to the line (for example, an access request to the karaoke server) is converted from a digital signal to an analog signal by the modem 15 and then output to the line. Further, data received from the line (music data or lyrics data downloaded from the karaoke server) is converted from an analog signal to a digital signal by the modem 15.

このモデム１５は、回線制御を行うためのネットワークコントロールユニット（以下単に「ＮＣＵ」と称す）を内蔵しており、このＮＣＵを介してカラオケ装置１は回線に接続されている。このＮＣＵにより通信相手を呼び出すためのダイヤル信号が送出される。 The modem 15 incorporates a network control unit (hereinafter simply referred to as “NCU”) for performing line control, and the karaoke apparatus 1 is connected to the line via the NCU. The NCU sends a dial signal for calling a communication partner.

ディスクプレイヤー１６は、歌唱者が歌詞を参照しながら歌唱する際に、ＣＲＴディスプレイ２に表示される映像を記憶したＣＤやＤＶＤを再生する再生装置である。本カラオケ装置１は、楽曲データの再生時におけるＣＲＴディスプレイ２での演出を、ディスクプレイヤー１６にて再生される映像とするか、ＣＣＤカメラ３により取得された画像データに基づくアスキー画像とするかを、選択可能に構成されている。かかる演出をアスキー画像で行うことが指定されている場合には、ディスクプレイヤー１６による映像の再生は非実行とされる。尚、アスキー画像による演出が指定された場合に、ディスクプレイヤー１６によっても映像を再生し、その映像とアスキー画像とを共にＣＲＴディスプレイ２に表示するように、本カラオケ装置１を構成しても良い。映像とアスキー画像とを共にＣＲＴディスプレイ２に表示する場合には、ＣＲＴディスプレイ２の画面を分割すると共に各分割画面にそれぞれ映像とアスキー画像とを個別に表示してもよく、映像にアスキー画像を重ねて１画面で表示しても良い。 The disc player 16 is a playback device that plays back a CD or DVD that stores video displayed on the CRT display 2 when a singer sings while referring to the lyrics. The karaoke apparatus 1 determines whether an effect on the CRT display 2 when reproducing the music data is an image reproduced by the disc player 16 or an ASCII image based on the image data acquired by the CCD camera 3. , Is configured to be selectable. When it is specified that such an effect is to be performed on an ASCII image, the video playback by the disc player 16 is not executed. The karaoke apparatus 1 may be configured such that when an effect by an ASCII image is designated, the video is also reproduced by the disc player 16 and both the video and the ASCII image are displayed on the CRT display 2. . When both the video and the ASCII image are displayed on the CRT display 2, the screen of the CRT display 2 may be divided and the video and the ASCII image may be individually displayed on each divided screen, and the ASCII image is displayed on the video. They may be displayed on a single screen.

映像処理部１７は、ＣＰＵ１１にて生成された歌詞文字画像データ（ドットデータ）のテロップやアスキー画像のＣＲＴディスプレイ２への出力を制御する装置である。ビデオミキサ１８は、ＣＰＵ１１にて生成され且つ映像処理部１７から出力される歌詞文字画像データと、ディスクプレイヤー１６によって再生される映像またはアスキー画像とを合成して、ＣＲＴディスプレイ２に表示させるための映像合成装置である。 The video processing unit 17 is a device that controls the output of telop of lyric character image data (dot data) generated by the CPU 11 and the ASCII image to the CRT display 2. The video mixer 18 combines the lyric character image data generated by the CPU 11 and output from the video processing unit 17 with the video or ASCII image reproduced by the disc player 16 and displayed on the CRT display 2. This is a video composition device.

伴奏装置２１は、伴奏を発生するものであって各種楽器音などの楽音波形を記憶している。（楽曲データメモリ１４ｃから読み出された楽曲データの）ＭＩＤＩ形式のデータが示す音色と音程と音量とに対応する音楽信号を、音符長を示すデータにて示される長さ分、楽音波形に基づいてデジタル信号で生成する。生成された音楽信号は、アナログ信号に変換されて、アンプミキサ２２に出力される。 The accompaniment device 21 generates accompaniment and stores musical sound waveforms such as various instrument sounds. The music signal corresponding to the tone, pitch, and volume indicated by the MIDI format data (of the song data read from the song data memory 14c) is based on the musical sound waveform for the length indicated by the data indicating the note length. To generate digital signals. The generated music signal is converted into an analog signal and output to the amplifier mixer 22.

アンプミキサ２２は、伴奏装置２１からの音楽信号とマイクロフォン２３から入力された発声（音声データ）とをミキシングして増幅する装置であり、ミキシングされた信号は、電気的に増幅されてスピーカ２４から出力される。 The amplifier mixer 22 is a device that mixes and amplifies the music signal from the accompaniment device 21 and the utterance (audio data) input from the microphone 23, and the mixed signal is electrically amplified and output from the speaker 24. Is done.

電子早見本３０は、上記したように表示本体３０ａと、クレイドル４４とを備えており、表示本体３０ａは、演算装置であるＣＰＵ３１と、ＣＰＵ３１により実行される各種のプログラムや固定値などを記憶した不揮発性のメモリであるＲＯＭ３２と、ＲＯＭ３２内に記憶される制御プログラムの実行に際して各種のデータ等を一時的に記憶するためのメモリであるＲＡＭ３３と、ハードディスク３４と、インターフェース３５と、無線通信部３６と、クレイドル４４を介して表示本体３０ａと本体１ａとを有線接続するためのＬＡＮポート３７と、表示装置４１とタッチパネル４２とを有するタッチパネルディスプレイ４０と、タッチパネルディスプレイ４０に設けられた表示装置４１を制御する表示制御部３８と、タッチパネルディスプレイ４０に設けられたタッチパネル４２による入力を制御する入力制御部と、ＬＡＮポート３７とクレイドル４４との間に介在して、表示本体３０ａとクレイドル４４とを接続するコネクタ４３とを備えている。 The electronic quick sample 30 includes the display main body 30a and the cradle 44 as described above, and the display main body 30a stores the CPU 31 that is an arithmetic unit, various programs executed by the CPU 31, fixed values, and the like. ROM 32 that is a non-volatile memory, RAM 33 that is a memory for temporarily storing various data and the like when executing a control program stored in ROM 32, a hard disk 34, an interface 35, and a wireless communication unit 36 A LAN port 37 for wired connection between the display main body 30a and the main body 1a via the cradle 44, a touch panel display 40 having a display device 41 and a touch panel 42, and a display device 41 provided on the touch panel display 40. Display control unit 38 to be controlled and touch panel display An input control unit for controlling the input by the touch panel 42 provided on the stomach 40, is interposed between the LAN port 37 and cradle 44, and a connector 43 for connecting the display main body 30a and the cradle 44.

ハードディスク３４は、書換可能な大容量の不揮発性のメモリであり、選曲可能な楽曲のタイトルデータが楽曲コードに対応つけて記憶されている。また、各楽曲のタイトルデータには、ジャンルや歌手名などのカテゴリデータが付加されており、カテゴリ毎に表示装置４１へ表示することができるようになっている。更に、ハードディスク３４には、案内画像の各画面の書式データが記憶されている。 The hard disk 34 is a rewritable large-capacity non-volatile memory, and stores title data of songs that can be selected in association with song codes. In addition, category data such as genre and singer name is added to the title data of each musical piece, and can be displayed on the display device 41 for each category. Further, the hard disk 34 stores format data for each screen of the guide image.

ＣＰＵ３１は、楽曲のタイトルデータの出力要求がなされる（使用者により選曲操作が開始される）と、選曲の操作手順に応じて、案内画像の各画面の書式や、タイトルデータなどをハードディスク３４から読み出す。そして、表示制御部３８に、楽曲のタイトルデータなどのデータを対応する案内画像の画面の書式で、表示装置４１に表示させる。画面上から（タッチパネル４２の押下により）選曲、即ち、楽曲データの指定が実行されると、ＣＰＵ３１は、選曲された楽曲コードを無線通信部３６から無線で出力する。ここで、選曲と共に画面上から（タッチパネル４２の押下により）アスキーアートボタン４２ａが押下された場合には、ＣＰＵ３１は、楽曲コードにアスキーアートコードを付加して出力する。アスキーアートコードは、使用者によりアスキー画像による演出を本体１ａに指示するコマンドであり、アスキーアートコードが楽曲コードに付加されていた場合には、本体１ａは、アスキー画像による演出指示と認識し、アスキー画像による演出を実行する。 When the CPU 31 is requested to output the title data of the music (the music selection operation is started by the user), the CPU 31 displays the format of each screen of the guide image, the title data, etc. from the hard disk 34 in accordance with the music selection operation procedure. read out. Then, the display control unit 38 causes the display device 41 to display data such as music title data in the format of the corresponding guide image screen. When music selection, that is, music data designation is executed from the screen (by pressing the touch panel 42), the CPU 31 wirelessly outputs the selected music code from the wireless communication unit 36. Here, when the ASCII art button 42a is pressed from the screen (by pressing the touch panel 42) along with the music selection, the CPU 31 adds the ASCII art code to the music code and outputs it. The ASCII art code is a command for instructing the main body 1a to produce the ASCII image by the user. When the ASCII art code is added to the music code, the main body 1a recognizes the production instruction using the ASCII image, and the ASCII image. The production by is executed.

次に、図４〜図６のフローチャートを参照して、本カラオケ装置１により実行される各処理について説明する。 Next, each process executed by the karaoke apparatus 1 will be described with reference to the flowcharts of FIGS.

図４は、本体１ａの制御部５のＣＰＵ１１により実行されるカラオケ再生処理のフローチャートである。カラオケ再生処理は、歌唱者等により指定された楽曲について、伴奏を発生させ、歌詞データを表示すると共に、ＣＲＴディスプレイ２にて楽曲の演出を行う処理である。このカラオケ再生処理は、楽曲の再生が指定されること、即ち、楽曲コードの入力により開始されるので、まず、楽曲コードが入力されたか否かを確認する（Ｓ１）。ここで、楽曲コードが入力されていなければ（Ｓ１：Ｎｏ）、このカラオケ再生処理を終了する。一方、楽曲コードが入力されていれば（Ｓ１：Ｙｅｓ）、入力された楽曲コードにアスキーコードが付加されているか否かを確認し（Ｓ２）、入力された楽曲コードにアスキーコードが付加されていなければ（Ｓ２：Ｎｏ）、Ｓ３〜Ｓ６の処理をスキップして、その処理を楽曲データ出力処理（Ｓ７）に移行する。一方、アスキーアートコードが付加されていると（Ｓ２：Ｙｅｓ）、楽曲再生中の演出をアスキー画像で行うことの要求であると認識し、アスキーアートフラグ１３ｃをオンする（Ｓ３）。 FIG. 4 is a flowchart of the karaoke playback process executed by the CPU 11 of the control unit 5 of the main body 1a. The karaoke playback process is a process of generating accompaniment and displaying lyrics data for the music designated by the singer or the like, and producing the music on the CRT display 2. Since this karaoke playback process is started by designating the playback of a music piece, that is, by inputting a music code, it is first confirmed whether or not a music code has been input (S1). Here, if the music code is not inputted (S1: No), this karaoke reproduction process is ended. On the other hand, if a music code has been input (S1: Yes), it is confirmed whether or not an ASCII code has been added to the input music code (S2), and an ASCII code has been added to the input music code. If not (S2: No), the process of S3 to S6 is skipped, and the process proceeds to the music data output process (S7). On the other hand, if an ASCII art code is added (S2: Yes), it recognizes that it is a request | requirement of performing the effect | action during music reproduction with an ASCII image, and turns on the ASCII art flag 13c (S3).

次に、「背景画像を撮影します」の表示を、所定時間ＣＲＴディスプレイ２に出力して背景画像の撮像を告知し（Ｓ４）、歌唱者などに注意を促す。本実施の形態では、アスキー画像による演出が指定されと、その楽曲の再生が開始される前に、背景画像の撮影を実行する。ここで、「背景画像を撮影します」の表示と共に撮影領域の画像がＣＲＴディスプレイ２に出力され、背景画像の撮影領域を歌唱者等に認識させることができるようになっている。 Next, the display of “capturing the background image” is output to the CRT display 2 for a predetermined time to notify the background image capturing (S4), and the singer or the like is alerted. In the present embodiment, when an effect by an ASCII image is designated, the background image is shot before the reproduction of the music is started. Here, an image of the shooting area is output to the CRT display 2 together with the display of “shooting a background image”, so that a singer or the like can recognize the shooting area of the background image.

その後、ＣＣＤカメラ３によって撮像された画像を取込み（１の画像フレームの取得）、取得した画像データを背景画像メモリ１３ａに書き込む（Ｓ５）。続いて、「撮影を開始します」の表示を、所定時間ＣＲＴディスプレイ２に出力する（Ｓ６）。そして、入力された楽曲コードに対応する楽曲データを、楽曲データメモリ１４ｃから読み出して、伴奏装置２１に出力する楽曲データ出力処理を実行する（Ｓ７）。伴奏装置２１には、１の楽曲コードに対応する楽曲データが一括出力され、出力された楽曲データは、伴奏装置２１内に設けられた所定のメモリに一括して記憶される。 Thereafter, an image captured by the CCD camera 3 is captured (acquisition of one image frame), and the acquired image data is written in the background image memory 13a (S5). Subsequently, the display of “start shooting” is output to the CRT display 2 for a predetermined time (S6). And the music data output process which reads the music data corresponding to the input music code from the music data memory 14c, and outputs it to the accompaniment apparatus 21 is performed (S7). The accompaniment device 21 collectively outputs music data corresponding to one music code, and the output music data is collectively stored in a predetermined memory provided in the accompaniment device 21.

次に、伴奏装置２１に出力された楽曲データに対応する歌詞データを、歌詞データメモリ１４ｄから読み出して映像処理部１７に出力する歌詞データ出力処理を実行する（Ｓ８）。この歌詞データ出力処理（Ｓ８）では、入力された楽曲コードにて指定される歌詞データ（タイトルデータを含む）を読み出すと共に、読み出した歌詞データを、その歌詞データを構成する漢字コードに対応してハードディスク１４の所定の領域に記憶された文字の画像データに展開し、歌詞文字画像データ（ドットデータ）を生成する。そして、生成された歌詞文字画像データを映像処理部１７に出力する。１の楽曲コードに対応する歌詞データ（歌詞文字画像データ）は、楽曲再生開始前に映像処理部１７内に設けられた所定のメモリに一括して記憶される。 Next, the lyric data output process which reads the lyric data corresponding to the music data output to the accompaniment apparatus 21 from the lyric data memory 14d and outputs it to the image | video process part 17 is performed (S8). In this lyric data output process (S8), the lyric data (including title data) specified by the input music code is read out, and the read out lyric data is corresponded to the kanji code constituting the lyric data. The image data is expanded into character image data stored in a predetermined area of the hard disk 14 to generate lyric character image data (dot data). Then, the generated lyric character image data is output to the video processing unit 17. Lyric data (lyric character image data) corresponding to one music code is collectively stored in a predetermined memory provided in the video processing unit 17 before the music reproduction is started.

次いで、アスキーアートフラグ１３ｃがオンであるか否かを確認し（Ｓ９）、アスキーアートフラグ１３ｃがオンであれば（Ｓ９：Ｙｅｓ）、ＣＣＤカメラ３により取得された画像データ（１の画像フレーム）から、１フレームのアスキー画像を形成するアスキーアート処理を実行する（Ｓ１０）。アスキーアート処理（Ｓ１０）の実行後は、楽曲の再生中であるか否かを確認し（Ｓ１１）、楽曲の再生中であれば（Ｓ１１：Ｙｅｓ）、Ｓ１５の処理に移行し、再生中でなければ（Ｓ１１：Ｎｏ）、楽曲が終了したタイミングであるか否かを確認する（Ｓ１２）。ここで、楽曲が終了したタイミングであれば（Ｓ１２：Ｙｅｓ）、アスキーアートフラグ１３ｃをオフして、このカラオケ再生処理を終了する。 Next, it is confirmed whether or not the ASCII art flag 13c is on (S9). If the ASCII art flag 13c is on (S9: Yes), from the image data (one image frame) acquired by the CCD camera 3, ASCII art processing for forming an ASCII image of one frame is executed (S10). After the execution of the ASCII art process (S10), it is confirmed whether or not the music is being reproduced (S11). If the music is being reproduced (S11: Yes), the process proceeds to S15 and the music must be being reproduced. (S11: No), it is confirmed whether or not it is the timing when the music is finished (S12). If the timing is the end of the music (S12: Yes), the ASCII art flag 13c is turned off, and the karaoke playback process ends.

また、Ｓ１２の処理で確認した結果、楽曲が終了したタイミングでなければ（Ｓ１２：Ｎｏ）、楽曲の再生を開始するタイミングであるので、再生開始を映像処理部１７およびに伴奏装置２１指示する（Ｓ１４）。これにより、映像処理部１７による制御によって、伴奏装置２１にて生成される伴奏に対応する歌詞データ（歌詞文字画像データ）が、ビデオミキサ１８を介してＣＲＴディスプレイ２に出力される。また、伴奏装置２１により伴奏が生成され、生成された伴奏がスピーカ２４から出力される。 As a result of checking in the process of S12, if the timing is not the end of the music (S12: No), it is the time to start playing the music, so the video processing unit 17 and the accompaniment device 21 are instructed to start playing ( S14). As a result, the lyrics data (lyric character image data) corresponding to the accompaniment generated by the accompaniment apparatus 21 is output to the CRT display 2 via the video mixer 18 under the control of the video processing unit 17. Further, an accompaniment is generated by the accompaniment apparatus 21, and the generated accompaniment is output from the speaker 24.

Ｓ１４の処理の後は、アスキー画像バッファ１３ｂに記憶されるアスキー画像をビデオミキサ１８に出力する（Ｓ１５）。出力されたアスキー画像は、ビデオミキサ１８において歌詞データ（歌詞文字画像データ）と合成され、映像処理部１７による制御によってＣＲＴディスプレイ２に出力される。 After the process of S14, the ASCII image stored in the ASCII image buffer 13b is output to the video mixer 18 (S15). The output ASCII image is combined with the lyrics data (lyric character image data) in the video mixer 18 and output to the CRT display 2 under the control of the video processing unit 17.

その後、その処理をＳ１０の処理に移行する。これにより、入力された画像データの１の画像フレーム毎にアスキー画像の１フレームを形成するアスキーアート処理（Ｓ１０）が、楽曲が終了する（Ｓ１２：Ｙｅｓ）まで繰り返して実行され、ＣＲＴディスプレイ２においてアスキー画像が動画表示される。 Thereafter, the process proceeds to S10. As a result, the ASCII art process (S10) for forming one ASCII image frame for each image frame of the input image data is repeatedly executed until the music ends (S12: Yes). The image is displayed as a movie.

一方、Ｓ９の処理で確認した結果、アスキーアートフラグ１３ｃがオフであれば（Ｓ９：Ｎｏ）、入力された楽曲コードにはアスキーアートコードが付加されていなかった、即ち、楽曲再生中の演出をアスキー画像で行うことは要求されていないと認識し、ディスク再生処理を実行する（Ｓ１６）。ディスク再生処理（Ｓ１６）は、楽曲毎に予め定められた映像をディスクプレイヤー１６に再生させる処理である。ハードディスク１４の所定の領域には、楽曲コードとＣＤまたはＤＶＤの識別コードとが対応付けられた対応テーブルが記憶されており、かかる対応テーブルに基づいて、ＣＰＵ１１は、入力された楽曲コードに対応するＣＤまたはＤＶＤの識別コードをディスクプレイヤー１６に指示する。ディスクプレイヤー１６におけるＣＤまたはＤＶＤの選択は、機械的な動作にて実行されるため、このディスク再生処理（Ｓ１６）において、ディスクプレイヤー１６におけるその選択動作が完了するまで、処理をウェイトする。 On the other hand, if the ASCII art flag 13c is turned off as a result of the confirmation in S9 (S9: No), the ASCII code has not been added to the input music code, that is, the effect during the music reproduction is displayed in the ASCII image. In step S16, it is recognized that it is not requested to perform the disk reproduction process. The disc playback process (S16) is a process for causing the disc player 16 to play a video predetermined for each piece of music. In a predetermined area of the hard disk 14, a correspondence table in which a music code and a CD or DVD identification code are associated with each other is stored. Based on the correspondence table, the CPU 11 corresponds to the input music code. The disc player 16 is instructed to identify the CD or DVD identification code. Since the selection of the CD or DVD in the disc player 16 is executed by a mechanical operation, in this disc reproduction process (S16), the process waits until the selection operation in the disc player 16 is completed.

ウェイト終了後、再生開始をディスクプレイヤー１６、映像処理部１７、伴奏装置２１に指示して（Ｓ１７）、このカラオケ再生処理を終了する。かかる再生指示により、入力された楽曲コードに対応する楽曲の伴奏がスピーカ２４から出力されると共に、その楽曲の歌詞データ（歌詞文字画像データ）と対応する映像とがビデオミキサ１８を介してＣＲＴディスプレイ２に出力される。 After the end of the wait, the disc player 16, the video processing unit 17, and the accompaniment device 21 are instructed to start playback (S17), and the karaoke playback process is ended. In accordance with such a reproduction instruction, an accompaniment of the music corresponding to the input music code is output from the speaker 24, and the lyrics data (lyric character image data) of the music and the corresponding video are displayed via the video mixer 18 on the CRT display. 2 is output.

尚、ＣＰＵ１１は、Ｓ１４，Ｓ１７の処理にて再生開始を指示した後は、映像処理部１７と伴奏装置２１とにタイミング信号を出力し、ＣＲＴディスプレイ２への歌詞データ（歌詞文字画像データ）の出力と、伴奏装置２１による伴奏の生成とを同期させる。 The CPU 11 outputs a timing signal to the video processing unit 17 and the accompaniment device 21 after instructing the start of reproduction in the processes of S14 and S17, and the lyrics data (lyric character image data) to the CRT display 2 is output. The output and the generation of the accompaniment by the accompaniment device 21 are synchronized.

図５は、図４のカラオケ再生処理の中で実行されるアスキーアート処理（Ｓ１０）のフローチャートである。この図５に示したアスキーアート処理（Ｓ１０）を、アスキー画像が形成される課程を模式的に示した図である図３を参照しつつ説明する。 FIG. 5 is a flowchart of the ASCII art process (S10) executed in the karaoke playback process of FIG. The ASCII art process (S10) shown in FIG. 5 will be described with reference to FIG. 3, which is a diagram schematically showing a process of forming an ASCII image.

アスキーアート処理（Ｓ１０）では、まず、画像用ＲＡＭ２７に記憶されている１の画像フレームの画像データを読み出す（Ｓ２１）。そして、画像用ＲＡＭ２７に記憶される画像データと、背景画像メモリ１３ａに記憶される背景画像の画像データとを、共に、８０×２５文字を作成するドット数となるように、データを圧縮する（Ｓ２２）。 In the ASCII art process (S10), first, image data of one image frame stored in the image RAM 27 is read (S21). Then, the image data stored in the image RAM 27 and the image data of the background image stored in the background image memory 13a are both compressed so that the number of dots for creating 80 × 25 characters is obtained ( S22).

１の記号は８×１６ドットで構成されるため、総ドット数を８０×２５×１２８とするデータサイズに画像データは圧縮される。背景画像メモリ１３ａおよび画像用メモリ２７に記憶される各画像データは、ＣＲＴディスプレイ２に出力した場合に良好な画像品質を実現し得る画素数を有しており、データ量が多い。このため、画像用ＲＡＭ２７に記憶される画像データを、そのまま、線画像データに変換し、更に、アスキー画像へ変換すると多大な処理時間が必要となり、出力されるアスキー画像の動画が、被撮影体の実際の動作から大きく遅延しかねない。更に、一般的な表示装置における標準の表示サイズは、全画面表示において８０×２５文字（記号）を表示するサイズとされているので、それ以上の文字（記号）にてアスキー画像が形成されると、その一部が非表示になりかねない。故に、アスキー画像作成にかかる処理時間を短縮し、更に、作成されたアスキー画像の大きさが一般的な表示装置に適合するように、このＳ２２の処理にてデータを圧縮している。尚、必ずしも、画像データを、総ドット数８０×２５×１２８とするデータサイズに圧縮する必要はなく、その圧縮後のデータサイズは、作成するアスキー画像の態様（細密度）や、表示装置等に応じて適宜採用することができる。 Since the symbol 1 is composed of 8 × 16 dots, the image data is compressed to a data size in which the total number of dots is 80 × 25 × 128. Each image data stored in the background image memory 13a and the image memory 27 has a number of pixels that can realize good image quality when output to the CRT display 2, and has a large amount of data. For this reason, if the image data stored in the image RAM 27 is converted into line image data as it is, and further converted into an ASCII image, a great amount of processing time is required. The actual operation of the can be greatly delayed. Furthermore, since a standard display size in a general display device is a size that displays 80 × 25 characters (symbols) in full screen display, an ASCII image is formed with more characters (symbols). And some of them can be hidden. Therefore, the processing time required for creating the ASCII image is shortened, and the data is compressed in the process of S22 so that the size of the created ASCII image is compatible with a general display device. Note that it is not always necessary to compress the image data to a data size having a total number of dots of 80 × 25 × 128. The data size after the compression depends on the aspect of the ASCII image to be created (fine density), the display device, and the like. Depending on the situation, it can be adopted as appropriate.

Ｓ２２の処理の後は、圧縮された両画像データをドット毎に比較し、共通部分を抽出して、その共通部分によりマスクを作成する（Ｓ２３）。次に、圧縮後の画像データから、作成されたマスクに対応する部分、即ち、背景画像部分を削除し、人物画像データ（背景画像以外の部分の画像データ）を抽出する（Ｓ２４）。この人物画像データが抽出された状態を図３（ａ）に示す。この図３（ａ）に表示される枠にて区画された画像が１の画像フレームを模式的に示しており、中心に描画された人物画像が抽出された人物画像データに対応している。図３（ａ）は、Ｓ２４の処理によりマスク処理が実行された後であるので、背景画像が消去され、画像フレーム内には人物画像のみが表示されている。 After the process of S22, both compressed image data are compared for each dot, a common part is extracted, and a mask is created using the common part (S23). Next, the portion corresponding to the created mask, that is, the background image portion is deleted from the compressed image data, and person image data (image data of a portion other than the background image) is extracted (S24). FIG. 3A shows a state where the person image data is extracted. The image divided by the frame displayed in FIG. 3A schematically shows one image frame, and corresponds to human image data in which a human image drawn at the center is extracted. In FIG. 3A, since the mask process is executed by the process of S24, the background image is deleted, and only the person image is displayed in the image frame.

尚、マスクは、圧縮後の画像データを比較することにより形成するものに限られず、画像用ＲＡＭ２７に記憶される画像データと、背景画像メモリ１３ａに記憶される背景画像の画像データとのそれぞれを、圧縮前に比較して、マスクを作成しても良く、さらに、画像用ＲＡＭ２７に記憶される圧縮前の画像データから作成したマスクを用いて人物画像データを抽出しても良い。 The mask is not limited to the one formed by comparing the compressed image data, and each of the image data stored in the image RAM 27 and the image data of the background image stored in the background image memory 13a. Compared with before compression, a mask may be created, and further, human image data may be extracted using a mask created from image data before compression stored in the image RAM 27.

そして、抽出された人物画像の面積（ドット数）が、画像フレームの全体面積に対して半分以上を占めているか否か（人物画像面積≧１／２×（画像フレームの全体面積））を調べる（Ｓ２５）。ここで、その抽出された人物画像の面積が、画像フレームの全体面積に対して半分以上を占めている場合は（Ｓ２５：Ｙｅｓ）、人物画像が十分な大きさで撮影されていると判断し、そのままアスキー画像データに変換するべく、人物画像データからノイズを除去する（Ｓ２６）。続いて、ノイズの除去された人物画像データをラプラシアン変換により線画像データに変換する（Ｓ２７）。これにより、面で表された画像データから、その特徴部分を線で表現した線画像に変換される。この人物画像データをラプラシアン変換により線画像データに変換した状態を図３（ｂ）に示す。その後、線画像データをアスキー画像に変換するアスキー変換処理を実行して（Ｓ２８）、このアスキーアート処理（Ｓ１０）を終了する。 Then, it is checked whether or not the area (number of dots) of the extracted person image occupies more than half of the entire area of the image frame (person image area ≧ 1/2 × (total area of the image frame)). (S25). Here, when the area of the extracted person image occupies more than half of the entire area of the image frame (S25: Yes), it is determined that the person image is captured with a sufficient size. Then, the noise is removed from the person image data in order to convert it into ASCII image data as it is (S26). Subsequently, the human image data from which noise has been removed is converted into line image data by Laplacian conversion (S27). As a result, the image data represented by the plane is converted into a line image in which the characteristic portion is represented by a line. FIG. 3B shows a state in which this person image data is converted into line image data by Laplacian conversion. Thereafter, an ASCII conversion process for converting the line image data into an ASCII image is executed (S28), and the ASCII art process (S10) is terminated.

一方、Ｓ２５の処理で確認した結果、抽出された人物画像の面積（ドット数）が、画像フレームの全体面積に対して半分未満であれば（Ｓ２５：Ｎｏ）、人物画像は基準よりも小さく不適切な大きさで撮影されていることが示されている。そこで、人物画像の面積が画像フレームの全体面積に対して半分以上となるように、人物画像データを拡大（伸張）する。アスキー画像を形成する場合に変換元の画像データ（本実施形態では人物画像データ）が小さくなると、変換元の画像データを有するブロック数が少なくなり、その結果、少ない数の記号にて小さなアスキー画像が形成されてしまう。少ない記号で組み立てられた小さなアスキー画像には、元の画像の特徴が反映され難く、場合によっては何を表したものであるかの識別が不能となる。そこで、本実施の形態では、画像フレーム全体の半分の面積を下限値とし、かかる値以下である場合に人物画像データを拡大して、元画像（撮像された人物）の特徴が反映されたアスキー画像が形成されるように構成されている。 On the other hand, if the area (number of dots) of the extracted person image is less than half of the entire area of the image frame as a result of checking in the process of S25 (S25: No), the person image is smaller than the reference and is not valid. It is shown that it was shot at an appropriate size. Therefore, the human image data is enlarged (expanded) so that the area of the human image is more than half of the entire area of the image frame. If the conversion source image data (person image data in this embodiment) is reduced when forming an ASCII image, the number of blocks having the conversion source image data decreases, and as a result, a small ASCII image with a small number of symbols. Will be formed. A small ASCII image assembled with a small number of symbols is difficult to reflect the characteristics of the original image, and in some cases, it is impossible to identify what it represents. Therefore, in the present embodiment, half the area of the entire image frame is set as the lower limit value, and the person image data is enlarged when the area is equal to or lower than the lower limit value, and the ASCII characteristic of the original image (captured person) is reflected. An image is formed.

故に、まず、画面（画像フレーム）横軸に対する人物画像データのドット分布を算出し、横軸方向の分布の中心（中央座標）を決定してから（Ｓ２９）、人物画像の面積を、画像フレーム全体の半分の面積にて除して拡大率を算出し（Ｓ３０）、Ｓ２９の処理で決定された中央座標を中心に、その算出された拡大率で人物画像データを拡大する（Ｓ３１）。これによれば、人物画像の中心から等方的にズームアップされた画像を形成できるので、拡大された画像がいびつな画像になることを回避できる。その後、Ｓ２６の処理に移行して、拡大された人物画像データに基づいたアスキー画像の形成を実行する。 Therefore, first, the dot distribution of the human image data with respect to the horizontal axis of the screen (image frame) is calculated, the center (central coordinate) of the distribution in the horizontal axis direction is determined (S29), and then the area of the human image is determined as the image frame. The enlargement ratio is calculated by dividing the area by half of the entire area (S30), and the person image data is enlarged with the calculated enlargement ratio around the central coordinates determined in the process of S29 (S31). According to this, since an image zoomed isotropically from the center of the person image can be formed, it is possible to avoid the enlarged image from becoming a distorted image. Thereafter, the process proceeds to S26, and ASCII image formation based on the enlarged person image data is executed.

尚、画像データの拡大を判断する下限値は、必ずしも、画像フレーム全体の半分の面積に限られるものでなく、良好なアスキー画像が形成できる範囲で任意に設定できる。更に、人物画像の大きさは、人物画像データのドット数によって判定したが、これに代えて、画像フレームを８０×２５に分割し、人物画像データを有するブロック数が所定の数（例えば半分）以下であるか否かによって、人物画像データの拡大の実行と非実行とを判定しても良い。これによれば、判定にかかる処理時間を短縮することができる。 Note that the lower limit value for determining the enlargement of the image data is not necessarily limited to half the area of the entire image frame, and can be arbitrarily set within a range in which a good ASCII image can be formed. Further, the size of the person image is determined by the number of dots of the person image data. Instead, the image frame is divided into 80 × 25, and the number of blocks having the person image data is a predetermined number (for example, half). Whether or not to enlarge the human image data may be determined depending on whether or not the following is true. According to this, the processing time for determination can be shortened.

図６は、図５のアスキーアート処理（Ｓ１０）の中で実行されるアスキー変換処理（Ｓ２８）のフローチャートである。この図６に示したアスキー変換処理（Ｓ２８）を図３を参照しつつ説明する。 FIG. 6 is a flowchart of the ASCII conversion process (S28) executed in the ASCII art process (S10) of FIG. The ASCII conversion process (S28) shown in FIG. 6 will be described with reference to FIG.

アスキー変換処理（Ｓ２８）では、まず、線画像データを有する画像フレームの横方向Ｘのサイズを、記号データの横サイズＺ１で除し、横分割数ｍを求める（Ｓ４１）。具体的には、本実施の形態では、１の記号データは８×１６ドットの大きさとされており、記号データの横サイズＺ１は、８ドットである。また、横方向Ｘに配列する文字（記号）数は、８０であるので、横方向Ｘのサイズは、ドット換算で、８ドット×８０、即ち、６４０ドットのサイズである。従って、このＳ４１の処理により、横分割数ｍは、６４０を８で除した値「８０」となる。 In the ASCII conversion process (S28), first, the size in the horizontal direction X of the image frame having line image data is divided by the horizontal size Z1 of the symbol data to obtain the horizontal division number m (S41). Specifically, in this embodiment, one symbol data has a size of 8 × 16 dots, and the horizontal size Z1 of the symbol data is 8 dots. Since the number of characters (symbols) arranged in the horizontal direction X is 80, the size in the horizontal direction X is 8 dots × 80, that is, a size of 640 dots in terms of dots. Therefore, by the processing of S41, the horizontal division number m becomes “80” obtained by dividing 640 by 8.

続いて、その画像フレームの縦方向Ｙのサイズを記号データの縦サイズＺ２で除し、縦分割数ｎを求める（Ｓ４２）。具体的には、本実施の形態では、記号データの縦サイズＺ２は、１６ドットである。また、縦方向Ｙに配列する文字（記号）数は、２５であるので、縦方向Ｙのサイズは、ドット換算で、１６ドット×２５、即ち、４００ドットのサイズである。従って、このＳ４２の処理により、縦分割数ｎは、４００を１６で除した値「２５」となる。 Subsequently, the vertical division number n is obtained by dividing the size of the image frame in the vertical direction Y by the vertical size Z2 of the symbol data (S42). Specifically, in the present embodiment, the vertical size Z2 of the symbol data is 16 dots. Since the number of characters (symbols) arranged in the vertical direction Y is 25, the size in the vertical direction Y is 16 dots × 25, that is, 400 dots in terms of dots. Therefore, the vertical division number n becomes a value “25” obtained by dividing 400 by 16 by the processing of S42.

このＳ４１，Ｓ４２の処理により、図３（ｂ）に示すように、画像フレームは所定の大きさのブロックに分割される。本実施の形態においては、画像フレームを、記号データと同じ８×１６ドットの大きさのブロックに分割する。これによれば、テンプレートマッチングに際し、ブロックに分割された線画像データの各部（分割画像）と記号データとの大きさを整合させる処理を行うことなく、直ちにマッチングを行うことができるので、迅速にマッチングを行うことができる。尚、画像データが、総ドット数８０×２５×１２８以上または以下のデータサイズに圧縮されている場合など、画像フレームを分割するＳ４１，Ｓ４２において生じた端数は繰り上げとし、８０×２５以上または８０×２５以下のブロック数に分割される。 By the processing of S41 and S42, the image frame is divided into blocks of a predetermined size as shown in FIG. In the present embodiment, the image frame is divided into blocks having the same size of 8 × 16 dots as the symbol data. According to this, at the time of template matching, matching can be performed immediately without performing processing for matching the size of each part (divided image) of the line image data divided into blocks and the symbol data. Matching can be performed. Note that, when the image data is compressed to a total dot number of 80 × 25 × 128 or less or less, the fraction generated in S41 and S42 for dividing the image frame is rounded up to 80 × 25 or more or 80 Divided into blocks of × 25 or less.

そして、変数ｉおよび変数ｊにそれぞれ「０」をセットした後（Ｓ４３）、Ｓ４４〜Ｓ５１の処理により、分割された線画像データを、各ブロック毎に、記号データに変換する処理を実行する。この記号データに変換する処理は、画像フレーム内の各ブロックの行毎に、上段から下段に向かって実行される。画像フレーム内の各位置は、横方向をＸ、縦方向をＹとする座標（Ｘｉ，Ｙｊ）で指定される。 Then, after each variable i and variable j are set to “0” (S43), the process of converting the divided line image data into symbol data is executed for each block by the processes of S44 to S51. The process of converting to symbol data is executed from the upper level to the lower level for each row of each block in the image frame. Each position in the image frame is specified by coordinates (Xi, Yj) in which the horizontal direction is X and the vertical direction is Y.

具体的には、Ｓ４４の処理により、変数ｊが縦分割数ｎ未満であるか（ｊ＜ｎ）否かを調べる（Ｓ４４）。ここで、変数ｊが縦分割数ｎ未満でなければ（Ｓ４４：Ｎｏ）、変数ｊが縦分割数ｎに到達したことが示されており、記号データへの変換が分割されたブロックの最下段まで完了したと判断して、このアスキー変換処理（Ｓ２８）を終了する。 Specifically, it is checked whether or not the variable j is less than the vertical division number n (j <n) by the process of S44 (S44). Here, if the variable j is not less than the vertical division number n (S44: No), it is indicated that the variable j has reached the vertical division number n, and the lowest level of the block into which the conversion to the symbol data is divided is shown. This ASCII conversion process (S28) is terminated.

一方、変数ｊが縦分割数ｎ未満であれば（Ｓ４４：Ｙｅｓ）、記号データへの変換が未完了であるので、変数ｉが横分割数ｍ未満であるか（ｉ＜ｍ）であるか否かを調べる（Ｓ４５）。変数ｉが横分割数ｍ未満であれば（Ｓ４５：Ｙｅｓ）、横方向Ｘの１行の変換が未完了であるので、変換する１の領域（ブロック）の座標を求める（Ｓ４６）。変換する１のブロックの横方向Ｘの座標は、変数ｉに記号データの横サイズＺ１に乗ずること（ｉ×Ｚ１→Ｘｉ）によって求められ、縦方向Ｙの座標は、変数ｊに記号データの横サイズＺ２に乗じて（ｊ×Ｚ２→Ｙｉ）求められる。例えば、開始時には、変数ｉ＝変数ｊ＝０であるので、座標（Ｘ０，Ｙ０）は、（０，０）となる。その次の変換対象のブロックの座標（Ｘ１，Ｙ０）は、（Ｘ０，Ｙ０）よりもそのブロックの大きさ分横方向Ｘにシフトした位置である。つまり、次に変換される１の領域（ブロック）のＸ座標（Ｘ１）は、（ｉ×Ｚ１→Ｘｉ）に従って、変数ｉ＝１を記号データの横サイズＺ１に乗じて求められる。同様に、次の段に移動した場合の座標（Ｘ０，Ｙ１）は、そのＹ座標が、（ｊ×Ｚ２→Ｙｉ）に従って、変数ｊ＝１を記号データの縦サイズＺ２に乗じて求められる。 On the other hand, if the variable j is less than the vertical division number n (S44: Yes), conversion to the symbol data is incomplete, so whether the variable i is less than the horizontal division number m (i <m). Whether or not is checked (S45). If the variable i is less than the horizontal division number m (S45: Yes), since the conversion of one row in the horizontal direction X is incomplete, the coordinates of one area (block) to be converted are obtained (S46). The coordinate in the horizontal direction X of one block to be converted is obtained by multiplying the variable i by the horizontal size Z1 of the symbol data (i × Z1 → Xi), and the coordinate in the vertical direction Y is the horizontal value of the symbol data in the variable j. It is obtained by multiplying the size Z2 (j × Z2 → Yi). For example, since variable i = variable j = 0 at the start, the coordinates (X0, Y0) are (0, 0). The coordinates (X1, Y0) of the next block to be converted are positions shifted in the horizontal direction X by the size of the block from (X0, Y0). That is, the X coordinate (X1) of one region (block) to be converted next is obtained by multiplying the variable i = 1 by the horizontal size Z1 of the symbol data according to (i × Z1 → Xi). Similarly, the coordinates (X0, Y1) when moving to the next stage are obtained by multiplying the variable j = 1 by the vertical size Z2 of the symbol data according to the Y coordinate (j × Z2 → Yi).

そして、求められた座標（Ｘｉ，Ｙｊ）と、座標（Ｘｉ＋Ｚ１，Ｙｊ＋Ｚ２）とを対角線とする領域（１のブロックの指定）に分割された線画像データの各部（分割画像）に最も近似する１の記号データを、記号データメモリ１４ｂに記憶される記号データの中からテンプレートマッチングにより検出する（Ｓ４７）。続いて、その検出された記号データを座標に対応付けてアスキー画像バッファ１３ｂに書込み（Ｓ４８）、その後、変数ｉに１を加算して（ｉ＋１→ｉ）（Ｓ４９）、その処理をＳ４５の処理に移行する。これにより、１の行に属するブロックについて変換が終わるまで、Ｓ４５〜Ｓ４９の処理が繰り返される。 Then, 1 that most closely approximates each part (divided image) of the line image data divided into a region (designation of one block) having the obtained coordinates (Xi, Yj) and coordinates (Xi + Z1, Yj + Z2) as diagonal lines. The symbol data is detected from the symbol data stored in the symbol data memory 14b by template matching (S47). Subsequently, the detected symbol data is written in the ASCII image buffer 13b in association with the coordinates (S48), and then 1 is added to the variable i (i + 1 → i) (S49), and the process is performed in S45. Migrate to Thereby, the processing of S45 to S49 is repeated until the conversion for the block belonging to one row is completed.

また、Ｓ４５の処理で確認した結果、変数ｉが横分割数ｍ未満でなければ（Ｓ４５：Ｎｏ）、変数ｉが横分割数ｍに到達したことが示されており、横方向Ｘの１の行について、各ブロックの線画像データの記号データへの変換が完了したと判断して、アスキー画像バッファに改行コードを書き込んだ後（Ｓ５０）、変数ｊに１を加算する（ｊ＋１→ｊ）と共に、変数ｉに「０」をセットして（Ｓ５１）、その処理をＳ４４の処理に移行する。これにより、次の段へと変換対象位置が移動する。そして、上記したように、Ｓ４４の処理にて、線画像データの記号データへの変換が終了した（Ｓ４４：Ｎｏ）と判断されるまで、Ｓ４４〜Ｓ５１の処理が繰り返される。これにより、図３（ｃ）に示すように、線画像データが、分割されたブロックごとに記号データへと変換されたアスキー画像が形成される。 As a result of checking in the process of S45, if the variable i is not less than the horizontal division number m (S45: No), it is indicated that the variable i has reached the horizontal division number m. After determining that the conversion of the line image data of each block into the symbol data has been completed for the row and writing the line feed code to the ASCII image buffer (S50), 1 is added to the variable j (j + 1 → j) and Then, “0” is set to the variable i (S51), and the process proceeds to S44. As a result, the conversion target position moves to the next stage. Then, as described above, the processes of S44 to S51 are repeated until it is determined in the process of S44 that the conversion of the line image data into the symbol data has been completed (S44: No). As a result, as shown in FIG. 3C, an ASCII image is formed by converting the line image data into symbol data for each divided block.

尚、本実施の形態では、線画像データに変換後（Ｓ２７）に、アスキー変換処理（Ｓ２８）において、画像フレームの分割（Ｓ４１，Ｓ４２）を行ったが、画像フレームの分割（Ｓ４１，Ｓ４２）後に、線画像データへの変換（Ｓ２７）を行っても良い。 In this embodiment, the image frame is divided (S41, S42) in the ASCII conversion process (S28) after being converted into line image data (S27). However, the image frame is divided (S41, S42). Later, conversion to line image data (S27) may be performed.

このように、本実施の形態のカラオケ装置１によれば、楽曲データの再生中におけるＣＲＴディスプレイ２での演出を、アスキー画像での演出にて実行することができ、楽曲毎に予め定められた映像を出力する演出とは異なる新鮮な興趣を提供することができる。また、歌唱者の動きがアスキー画像に反映されるため、歌唱者がコミカルな挙動などを歌唱と共に行えば、余興として高い娯楽性を実現することができる。 As described above, according to the karaoke apparatus 1 of the present embodiment, the effect on the CRT display 2 during the reproduction of the music data can be executed by the effect on the ASCII image, which is predetermined for each music. It is possible to provide fresh entertainment that is different from the production of outputting video. Moreover, since a singer's motion is reflected in an ASCII image, if a singer performs a comical behavior etc. with a song, high entertainment as entertainment can be implement | achieved.

尚、楽曲に応じて予め定められた映像の出力と、アスキー画像の出力とを切り替えるスイッチを設けると共に、かかるスイッチの状態を所定時間毎に監視する監視手段を設け、その監視手段により検出されるスイッチの状態に応じて、出力する画像を切り替えるように構成しても良い。これによれば、使用者が任意のタイミングにおいて、映像の出力と、アスキー画像の出力とを切り替えることができる。 In addition, a switch that switches between a predetermined video output and an ASCII image output according to the music is provided, and a monitoring unit that monitors the state of the switch every predetermined time is provided, and is detected by the monitoring unit. You may comprise so that the image to output may be switched according to the state of a switch. According to this, the user can switch between video output and ASCII image output at any timing.

更に、カラオケ装置１は、カラオケ装置１に設けられたＣＣＤカメラ３から入力される画像データをアスキー画像に変換して、ＣＲＴディスプレイ２に出力するように構成され、また、マイクロフォン２３から入力された音声データをカラオケ装置１のスピーカ２４から出力するように構成された。これに代えて、アスキー画像に変換する画像データの入力および変換されたアスキー画像の出力を、カラオケ装置１以外の外部装置にて実行するように構成しても良い。かかる外部装置は、カラオケ装置１とは別体で設けられ、且つ、カラオケ装置１と通信可能に形成されるデバイスであり、撮像装置（および表示装置）を備えた携帯電話や、ＰＤＡ（Personal Digital Assistance）、パーソナルコンピュータ
に加え、上記の電子早見本３０にＣＣＤカメラ３を搭載したもの等が例示される。また、マイクロフォン２３およびスピーカ２４に代えて、上記した外部装置に設けられたマイクロフォンやスピーカを利用して音声データの入出力を行うように構成しても良い。 Further, the karaoke apparatus 1 is configured to convert image data input from the CCD camera 3 provided in the karaoke apparatus 1 into an ASCII image and output the ASCII image to the CRT display 2, and input from the microphone 23. The audio data is output from the speaker 24 of the karaoke apparatus 1. Instead of this, the image data to be converted into an ASCII image and the output of the converted ASCII image may be executed by an external device other than the karaoke device 1. Such an external device is a device that is provided separately from the karaoke device 1 and is formed so as to be communicable with the karaoke device 1. Assistance), a personal computer, and the electronic quick sample 30 mounted with the CCD camera 3 are exemplified. Further, instead of the microphone 23 and the speaker 24, audio data may be input / output using a microphone or a speaker provided in the external device described above.

これによれば、各個人がそれぞれ所有する携帯電話等から画像データと音声データとを入力することにより、入力された画像データに基づくアスキー画像と音声データとを、カラオケ装置１を介して、他の携帯電話等の外部装置に出力することができる。このため、画像データや音声データの入出力デバイスの自由度を向上させることができ、特定の者にのみアスキー画像を閲覧させたいという要望や、携帯可能なデバイスでアスキー画像を手元で閲覧したいという要望等、多種多様な利用者の嗜好に応じることができる。 According to this, by inputting image data and sound data from a mobile phone or the like owned by each individual, the ASCII image and sound data based on the input image data can be transferred via the karaoke device 1 Can be output to an external device such as a mobile phone. For this reason, the degree of freedom of the input / output device for image data and audio data can be improved, and there is a desire to allow only a specific person to view the ASCII image, or to view the ASCII image at hand with a portable device. It is possible to meet a wide variety of user preferences such as requests.

次に、図７から図１１を参照して、本発明の第２の実施形態について説明する。第１の実施形態では、画像処理装置はカラオケ装置１にて構成された。これに代えて、第２の実施形態では、画像処理装置はインターネットプロトコル（Internet Protocol）を用いた
電話装置（以下単に「ＩＰ電話装置」と称す）にて構成され、画像処理部を有する送信側装置と、表示装置を有する受信側装置とを備えている。尚、上記した第１の実施形態と同じ部分には同じ符号を付し、その説明を省略する。 Next, a second embodiment of the present invention will be described with reference to FIGS. In the first embodiment, the image processing apparatus is configured by the karaoke apparatus 1. Instead, in the second embodiment, the image processing apparatus is configured by a telephone device using the Internet Protocol (hereinafter simply referred to as “IP telephone device”), and has a transmission side having an image processing unit. And a receiving device having a display device. In addition, the same code | symbol is attached | subjected to the same part as above-mentioned 1st Embodiment, and the description is abbreviate | omitted.

図７は、第２の実施形態の画像処理装置であるＩＰ電話装置２００を備えた電話システム２２０を示した概略図である。各ＩＰ電話装置２００（２００ａ，２００ｂ，２００ｃ）は、いずれもデータを送信する送信側装置であり且つデータを受信する相手側装置でもある。従って、各ＩＰ電話装置２００は、それぞれ、画像処理部と表示装置との両者を備えている。また、図７に示すように、ＩＰ電話装置２００ａ，２００ｂは、コンピュータ本体とそのコンピュータ本体へ各種データやコマンドなどを入力するためのキーボードと、コンピュータ本体で実行される処理をモニターするためのＣＲＴディスプレイ１１２とを備えたパーソナルコンピュータ（以下単に「ＰＣ」と略す）である。ＩＰ電話装置２００ｃは受話器と送話器とを備えた電話機である。尚、テレビ電話装置２００以外にＩＰパケット網に接続される他の電話装置は、図示を省略している。 FIG. 7 is a schematic diagram showing a telephone system 220 including an IP telephone apparatus 200 that is an image processing apparatus according to the second embodiment. Each of the IP telephone devices 200 (200a, 200b, 200c) is both a transmission-side device that transmits data and a partner-side device that receives data. Accordingly, each IP telephone device 200 includes both an image processing unit and a display device. As shown in FIG. 7, the IP telephone devices 200a and 200b include a computer main body, a keyboard for inputting various data and commands to the computer main body, and a CRT for monitoring processing executed in the computer main body. A personal computer (hereinafter simply referred to as “PC”) provided with a display 112. The IP telephone device 200c is a telephone provided with a receiver and a transmitter. In addition to the video phone device 200, other telephone devices connected to the IP packet network are not shown.

各ＩＰ電話装置２００ａ，２００ｂ，２００ｃは、ＩＰネットワーク網に接続され、かかるＩＰネットワーク網を介して、相互に音声データや画像データ等のデータのパケット通信を行う装置であり、音声データおよび画像データのパケット処理機能やＩＰ電話固有のプロトコル処理機能を備えＩＰアダプタとして機能する通信制御部１２０（図８参照）を内蔵している。また、各ＩＰ電話装置２００ａ，２００ｂ，２００ｃは、音声とともに画像を相手側装置との間で送受信するテレビ電話機能を有しており、送話者の画像を撮像するためのＣＣＤカメラ１１６と、受信した画像データを再生表示するための表示装置（例えばＣＲＴディスプレイ１１２等）とを備えている。 Each of the IP telephone devices 200a, 200b, and 200c is a device that is connected to an IP network and performs packet communication of data such as voice data and image data with each other via the IP network. And a communication control unit 120 (see FIG. 8) that functions as an IP adapter. Each of the IP telephone devices 200a, 200b, and 200c has a videophone function for transmitting and receiving an image together with voice to and from the other device, and a CCD camera 116 for capturing an image of the sender, A display device (for example, a CRT display 112) for reproducing and displaying the received image data.

送信側装置であるＩＰ電話装置２００（例えばＩＰ電話装置２００ａとする）は、送話者の要求（指定）に応じて、ＣＣＤカメラ１１６の入力画像をアスキー画像に変換し、変換されたアスキー画像に対応するコードを音声データと共に相手側装置のＩＰ電話装置２００ｂ，２００ｃに送出するように構成されている。送出されるデータは、ＩＰアダプタにより、所定の長さに区切られてデータブロック化され、１のデータブロックが１のパケットに入れられて相手側装置へと送出される。 The IP telephone device 200 (for example, the IP telephone device 200a), which is the transmission side device, converts the input image of the CCD camera 116 into an ASCII image in response to the request (designation) of the speaker, and the converted ASCII image. Is sent to the IP telephone devices 200b and 200c of the counterpart device together with the voice data. The data to be sent is divided into data blocks by the IP adapter, and one data block is put into one packet and sent to the counterpart device.

一方、相手側装置であるＩＰ電話装置２００（例えばＩＰ電話装置２００ａを送信側装置とした場合のＩＰ電話装置２００ｂ，２００ｃ）は、受信したパケットの遅延時間調整と、パケットの順序調整を行った後、パケットからデータを取り出すようになっており、受信したデータにアスキー画像に対応するコードが含まれていると、アスキー画像を再生して自己の表示装置（例えばＣＲＴディスプレイ１１２等）に出力するように構成されている。 On the other hand, IP telephone apparatus 200 (for example, IP telephone apparatuses 200b and 200c in the case where IP telephone apparatus 200a is used as a transmission-side apparatus) performs delay time adjustment of received packets and packet order adjustment. Thereafter, data is extracted from the packet. When the received data includes a code corresponding to the ASCII image, the ASCII image is reproduced and output to its own display device (for example, the CRT display 112). It is configured as follows.

各ＩＰ電話装置２００ａ，２００ｂ，２００ｃは、ＡＤＳＬ（Asymmetric
Degital Subscriber Line）をアクセス回線として、ＩＰネットワーク網に接続されている。ＩＰネットワーク網は、非図示のゲートキーパによって呼制御が行われる通信業者が専用に構築したネットワーク回線であり、回線経路に設置されたルータによって相手側装置まで、各パケットを送信するものである。更に、ＩＰネットワーク網は、非図示のＶｏＩＰゲートウェイを介して既存の電話網と相互に接続されており、一般の電話とも相互に通信可能に構成されている。 Each of the IP telephone devices 200a, 200b, and 200c has an ADSL (Asymmetric
(Degital Subscriber Line) is connected to the IP network as an access line. The IP network is a network line that is exclusively constructed by a communication company whose call control is performed by a gatekeeper (not shown), and transmits each packet to a partner apparatus by a router installed in the line route. Furthermore, the IP network is connected to an existing telephone network via a VoIP gateway (not shown), and is configured to be able to communicate with a general telephone.

尚、アクセス回線は、通常の一般公衆回線や、ＩＳＤＮ回線、ケーブルテレビのインターネット接続回線などであってもよい。また、ＩＰネットワーク網は、一般のインターネットのＩＰパケット網を使用しても良い。 The access line may be a normal general public line, an ISDN line, a cable TV internet connection line, or the like. The IP network may be a general Internet IP packet network.

図８は、上記のように構成された電話システム２２０のＩＰ電話装置２００の電気的構成を示すブロック図である。尚、ＩＰ電話装置２００ｂは、ＩＰ電話装置２００ａと同様に構成されているのでその説明を省略し、ＩＰ電話装置２００ｃについては、ＩＰ電話装置２００ａと異なる部分のみ説明する。 FIG. 8 is a block diagram showing an electrical configuration of the IP telephone apparatus 200 of the telephone system 220 configured as described above. The IP telephone apparatus 200b is configured in the same manner as the IP telephone apparatus 200a, and therefore the description thereof will be omitted. Only the parts different from the IP telephone apparatus 200a will be described for the IP telephone apparatus 200c.

ＩＰ電話装置２００ａは、ＰＣに電話機能を実現する各デバイスが搭載されたものであり、ＣＰＵ１０１、ＲＯＭ１０２、ＲＡＭ１０３、画像用ＲＡＭ１０５、音声用ＲＡＭ１０６を備えている。演算装置であるＣＰＵ１０１は、ＲＯＭ１０２に記憶される制御プログラムや、後述するハードディスク１０４に記憶される制御プログラム１０４ａにより、ＩＰ電話装置２００ａを制御するものである。ＲＯＭ１０２は、ＣＰＵ１０１により実行される各種のプログラムや固定値などを記憶した不揮発性のメモリである。ＲＡＭ１０３は、ＣＰＵ１０１による各種プログラムの実行に際して、各種のデータ等を一時的に記憶するためのメモリであり、背景画像メモリ１０３ａと、アスキー画像バッファ１０３ｂと、アスキーアートフラグ１０３ｃとを備えている。 The IP telephone device 200a is a device in which each device for realizing a telephone function is mounted on a PC, and includes a CPU 101, a ROM 102, a RAM 103, an image RAM 105, and an audio RAM 106. The CPU 101 that is an arithmetic device controls the IP telephone device 200a by a control program stored in the ROM 102 or a control program 104a stored in the hard disk 104 described later. The ROM 102 is a nonvolatile memory that stores various programs executed by the CPU 101 and fixed values. The RAM 103 is a memory for temporarily storing various data when the CPU 101 executes various programs, and includes a background image memory 103a, an ASCII image buffer 103b, and an ASCII art flag 103c.

背景画像メモリ１０３ａおよびアスキー画像バッファ１０３ｂは、第１の実施形態のカラオケ装置１（制御部５のＲＡＭ１３）に設けられた背景画像メモリ１３ａおよびアスキー画像バッファ１３ｂと同様のメモリである。背景画像メモリ１０３ａには、背景画像としてＣＣＤカメラ３により撮像（取得）された画像データが記憶され、アスキー画像バッファ１０３ｂには、アスキー変換処理（Ｓ２８、図６参照）において、形成されたアスキー画像が一時的に記憶される。 The background image memory 103a and the ASCII image buffer 103b are the same memories as the background image memory 13a and the ASCII image buffer 13b provided in the karaoke apparatus 1 (RAM 13 of the control unit 5) according to the first embodiment. The background image memory 103a stores image data captured (acquired) by the CCD camera 3 as a background image, and the ASCII image buffer 103b stores the ASCII image formed in the ASCII conversion process (S28, see FIG. 6). Is temporarily stored.

アスキーアートフラグ１０３ｃは、相手側装置に送出する画像データをＣＣＤカメラ１１６で撮像された画像とするか、アスキー画像とするかを識別するためのフラグである。このアスキーアートフラグ１０３ｃは、後述する電話処理（図９参照）を実行する入力画面（電話処理のメイン画面）において、アスキー画像指定コマンドが入力されるとオンされ、その後に回線が開放されるとオフされる。 The ASCII art flag 103c is a flag for identifying whether the image data sent to the counterpart device is an image captured by the CCD camera 116 or an ASCII image. The ASCII art flag 103c is turned on when an ASCII image designation command is input on an input screen (telephone processing main screen) for executing telephone processing (to be described later), and is turned off when the line is released thereafter. Is done.

画像用ＲＡＭ１０５および音声用ＲＡＭ１０６は、ＤＲＡＭまたはＳＤＲＡＭなどで構成された大容量のＲＡＭであり、画像用ＲＡＭ１０５は、第１の実施形態の画像用ＲＡＭ２７と同様に、背景画像撮影後にＣＣＤカメラ１１６にて取得された画像データを一時的に記憶するものである。また、音声用ＲＡＭ１０６は音声データを一時的に記憶するものであり、マイクロフォン１１５を介して入力された音声データ（送話者の発声）は、音声用ＣＯＤＥＣ１０９により符号化された後、この音声用ＲＡＭ１０６に記憶される。 The image RAM 105 and the audio RAM 106 are large-capacity RAMs configured by DRAMs or SDRAMs, and the image RAM 105 is connected to the CCD camera 116 after taking a background image in the same manner as the image RAM 27 of the first embodiment. The image data acquired in this way is temporarily stored. The audio RAM 106 temporarily stores audio data. The audio data (speaker's utterance) input via the microphone 115 is encoded by the audio CODEC 109 and then the audio data. Stored in the RAM 106.

図８に示すように、ＣＰＵ１０１、ＲＯＭ１０２、ＲＡＭ１０３、画像用ＲＡＭ１０５、音声用ＲＡＭ１０６はバスライン１０７を介して互いに接続されており、バスライン１０７はまた入出力ポート１０８にも接続されている。この入出力ポート１０８は、バスライン１０７の他に、ハードディスク１０４、音声用ＣＯＤＥＣ１０９、画像用ＣＯＤＥＣ１１０、ＣＲＴディスプレイ１１２、キーボード１１３、スピーカ１１４、マイクロフォン１１５、ＣＣＤカメラ１１６、通信制御部１２０と接続されている。 As shown in FIG. 8, the CPU 101, ROM 102, RAM 103, image RAM 105, and audio RAM 106 are connected to each other via a bus line 107, and the bus line 107 is also connected to an input / output port 108. In addition to the bus line 107, the input / output port 108 is connected to the hard disk 104, the audio CODEC 109, the image CODEC 110, the CRT display 112, the keyboard 113, the speaker 114, the microphone 115, the CCD camera 116, and the communication control unit 120. Yes.

ハードディスク１０４は、書換可能な大容量の不揮発性のメモリであり、本ＩＰ電話装置２００ａを制御する制御プログラム１０４ａと、記号データメモリ１０４ｂとを備えている。尚、図９〜図１１に示したフローチャートのプログラムは、制御プログラム１０４ａの一部として格納されている。記号データメモリ１０４ｂは、第１の実施形態のカラオケ装置１（制御部５のハードディスク１４）に設けられた記号データメモリ１４ｂと同様のメモリであり、記号の画像データが記憶されている。 The hard disk 104 is a rewritable large-capacity nonvolatile memory and includes a control program 104a for controlling the IP telephone apparatus 200a and a symbol data memory 104b. The programs of the flowcharts shown in FIGS. 9 to 11 are stored as a part of the control program 104a. The symbol data memory 104b is a memory similar to the symbol data memory 14b provided in the karaoke apparatus 1 (the hard disk 14 of the control unit 5) according to the first embodiment, and stores symbol image data.

音声用ＣＯＤＥＣ１０９および画像用ＣＯＤＥＣ１１０は、音声データと画像データとを、それぞれ、符号化および復号化する電子回路（ハードウェア）である。音声用ＣＯＤＥＣ１０９は、アナログデータで入力される音声データ（送話者の発声）を、所定のサンプリング周期で取得して、量子化、符号化することにより、デジタルデータに変換する処理（符号化）を行う回路である。符号化された音声データは、音声用ＲＡＭ１０６に書き込まれる。通信制御部１２０にて受信された相手側装置からの音声データは、この音声用ＣＯＤＥＣ１０９にて復号化（伸張）された後、スピーカ１１４から出力される。 The audio CODEC 109 and the image CODEC 110 are electronic circuits (hardware) that encode and decode audio data and image data, respectively. The audio CODEC 109 obtains audio data (speaker's utterance) input as analog data at a predetermined sampling period, and quantizes and encodes it to convert it into digital data (encoding). It is a circuit which performs. The encoded audio data is written into the audio RAM 106. The audio data from the counterpart device received by the communication control unit 120 is output from the speaker 114 after being decoded (expanded) by the audio CODEC 109.

画像用ＣＯＤＥＣ１１０は、ＣＣＤカメラ１１６から入力され、画像用ＲＡＭ１０５に記憶された画像データを、相手側装置に送信するデータ形式に圧縮符号化するものである。また、通信制御部１２０にて受信された相手側装置からの画像データは、この画像用ＣＯＤＥＣ１１０にて復号化（伸張）された後、ＣＲＴディスプレイ１１２から出力される。 The image CODEC 110 compresses and encodes image data input from the CCD camera 116 and stored in the image RAM 105 into a data format to be transmitted to the counterpart device. Further, the image data from the counterpart device received by the communication control unit 120 is output from the CRT display 112 after being decoded (expanded) by the image CODEC 110.

尚、ＩＰ電話装置２００ａでは、符号化された音声データに対し、更に、帯域圧縮符号化が行われている。この音声データの帯域圧縮符号化には、離散コサイン変換（ＤＣＴ）が適用されている。離散コサイン変換は、画像を小さなブロックに分割して周波数成分係数を量子化および符号化して圧縮する一般的な方式である。従って、相手側装置からの音声データを受信した場合には、まず、離散コサイン変換の逆変換（ＩＤＣＴ）処理を行ってから、音声用ＣＯＤＥＣ１０９にてアナログデータに復号化して再生出力する。 In IP telephone apparatus 200a, band compression encoding is further performed on the encoded audio data. Discrete cosine transform (DCT) is applied to the band compression encoding of the audio data. The discrete cosine transform is a general method in which an image is divided into small blocks, and frequency component coefficients are quantized and encoded to be compressed. Therefore, when voice data from the counterpart device is received, first, an inverse transform (IDCT) process of discrete cosine transform is performed, and then decoded into analog data by the voice CODEC 109 and reproduced and output.

通信制御部１２０は、ＩＰ電話装置２００ａが他のＩＰ電話装置２００ｂ，２００ｃ（または他の電話装置）との間で、ＩＰネットワーク網を介したパケット通信を実行するための装置であり、モデム１２１、信号分離部１２３、信号多重部１２４、バッファ１２５を備えている。 The communication control unit 120 is a device for the IP telephone apparatus 200a to perform packet communication with the other IP telephone apparatuses 200b and 200c (or other telephone apparatuses) via the IP network, and the modem 121. , A signal separation unit 123, a signal multiplexing unit 124, and a buffer 125.

モデム１２１は、ＡＤＳＬモデムであり、ＩＰ電話装置２００ａから送出するデータ（信号）をＡＤＳＬ回線に適応したＡＤＳＬ信号に変換すると共に、ＩＰ電話装置２００ａにて受信したＡＤＳＬ信号をＩＰ電話装置２００ａ内で処理する適切な信号（例えばイーサネット（登録商標）の信号）に変換する信号変換機である。また、モデム１２１は、通信相手を呼び出すためにダイヤル信号を送出し、回線を制御するＮＣＵの機能を備えている。 The modem 121 is an ADSL modem, converts data (signal) transmitted from the IP telephone apparatus 200a into an ADSL signal adapted to the ADSL line, and also receives the ADSL signal received by the IP telephone apparatus 200a in the IP telephone apparatus 200a. It is a signal converter that converts an appropriate signal to be processed (for example, an Ethernet (registered trademark) signal). The modem 121 has an NCU function for sending a dial signal to call a communication partner and controlling the line.

信号分離部１２３は、ＩＰ電話装置２００ｂ，２００ｃ等の相手側装置から受信したデータについて、画像データと音声データとに分離する装置である。後述するように、本電話システム２２０においてアスキー画像データは、音声データとして相手側装置から送出されているので、この信号分離部１２３においては分離されない。信号多重部１２４は、送信バッファ１２５ａに記憶される音声データ及び画像データを時間軸方向に、画像フレームと音声データとを多重化する装置であり、多重化されたデータは、モデム１２１を介して送出される。 The signal separation unit 123 is a device that separates data received from a counterpart device such as the IP telephone devices 200b and 200c into image data and audio data. As will be described later, in the telephone system 220, the ASCII image data is transmitted as audio data from the counterpart device, and therefore is not separated by the signal separation unit 123. The signal multiplexing unit 124 is a device that multiplexes the audio data and the image data stored in the transmission buffer 125 a in the time axis direction, and the image frame and the audio data. The multiplexed data is transmitted via the modem 121. Sent out.

バッファ１２５は、一時的にデータを記憶しておくメモリであり、送出するデータを一時的に記憶するための送信バッファ１２５ａと、受信したデータを一時的に記憶するための受信バッファ１２５ｂとを備えている。送信バッファ１２５ａには、後述する電話処理（図９参照）にて圧縮符号化された音声データ及び画像データが記憶されている。記憶される音声データ及び画像データは、所定のタイミングでこの送信バッファ１２５ａから読み出されて、信号多重部１２４に出力される。受信バッファ１２５ｂには、信号分離部１２３により分離された画像データと音声データとが、それぞれフレーム単位で記憶されている。受信バッファ１２５ｂにデータが記憶されていることがＣＰＵ１０１により認識されると、受信バッファ１２５ｂに記憶されるデータは、この受信バッファ１２５ｂから読み出され、復号化と再生出力とが実行される。 The buffer 125 is a memory for temporarily storing data, and includes a transmission buffer 125a for temporarily storing data to be transmitted and a reception buffer 125b for temporarily storing received data. ing. The transmission buffer 125a stores audio data and image data that have been compression-encoded by telephone processing (see FIG. 9) described later. The stored audio data and image data are read from the transmission buffer 125 a at a predetermined timing and output to the signal multiplexing unit 124. The reception buffer 125b stores the image data and audio data separated by the signal separation unit 123 in units of frames. When the CPU 101 recognizes that data is stored in the reception buffer 125b, the data stored in the reception buffer 125b is read from the reception buffer 125b, and decoding and reproduction output are executed.

このように、ＩＰ電話装置２００ａは、通信制御部１２０を介し（ＡＤＳＬ回線を経て）、ＩＰネットワーク網に接続されており、ＩＰネットワーク網を経由して他のＩＰ電話装置２００ｂ，２００ｃなどと相互に接続されている。これにより、ＩＰ電話装置２００ａと他のＩＰ電話装置２００ｂ，２００ｃ（または他の電話装置）との間で、音声データ及び画像データを送受信するテレビ電話を実行することができるようになっている。 As described above, the IP telephone device 200a is connected to the IP network via the communication control unit 120 (via the ADSL line), and is mutually connected to the other IP telephone devices 200b and 200c via the IP network. It is connected to the. Thus, a videophone that transmits and receives audio data and image data can be executed between the IP telephone apparatus 200a and the other IP telephone apparatuses 200b and 200c (or other telephone apparatuses).

尚、ＩＰ電話装置２００ｃは、電話機であるので、マイクフォン１１５として送話器が、スピーカ１１４としては受話器が機能し、ＣＲＴディスプレイ１１２の代わりにＬＣＤが設けられると共に、アスキー画像指定コマンドを入力するコマンド入力ボタンを備えている。 Since IP telephone apparatus 200c is a telephone, a transmitter functions as microphone 115, a receiver functions as speaker 114, an LCD is provided instead of CRT display 112, and an ASCII image designation command is input. A command input button is provided.

次に、図９から図１１を参照して、上記のように構成されたＩＰ電話装置２００にて実行される各処理について説明する。 Next, with reference to FIGS. 9 to 11, each process executed by the IP telephone apparatus 200 configured as described above will be described.

図９は、上記のＩＰ電話装置２００（２００ａ，２００ｂ）にて実行される電話処理を示したフローチャートである。ＩＰ電話装置２００ａ，２００ｂはＰＣであるので、電話処理は、キーボード１１３操作により、開始を要求する所定のコマンドが入力されると開始され、まず、電話処理のメイン画面をＣＲＴディスプレイ２に表示する（Ｓ１０１）。その後、相手側装置の電話番号が指定されたか否かを確認し（Ｓ１０２）、相手側装置の電話番号が指定されていなければ（Ｓ１０２：Ｎｏ）、その指定を待機する。尚、Ｓ１０２の処理においては、所定時間の経過が監視されており、所定時間が到来しても相手側装置の電話番号が指定されていない場合には、その処理をＳ１１４の各処理に移行する。 FIG. 9 is a flowchart showing telephone processing executed by the IP telephone apparatus 200 (200a, 200b). Since the IP telephone devices 200a and 200b are PCs, the telephone processing is started when a predetermined command for requesting start is input by operating the keyboard 113. First, the main screen of the telephone processing is displayed on the CRT display 2. (S101). Thereafter, it is confirmed whether or not the telephone number of the counterpart device is designated (S102). If the telephone number of the counterpart device is not designated (S102: No), the designation is awaited. In the process of S102, the elapse of a predetermined time is monitored, and when the telephone number of the counterpart device is not specified even when the predetermined time has come, the process proceeds to each process of S114. .

一方、相手側装置の電話番号が指定されていれば（Ｓ１０２：Ｙｅｓ）、アスキー画像指定コマンドが入力されたか否かを確認する（Ｓ１０３）。アスキー画像指定コマンドは、相手側装置に送信する画像データをアスキー画像とすることを指示するコマンドであり、メイン画面の表示に従って所定のキー操作を行うことにより入力される。 On the other hand, if the telephone number of the counterpart device is designated (S102: Yes), it is confirmed whether an ASCII image designation command has been input (S103). The ASCII image designation command is a command for instructing the image data to be transmitted to the counterpart device to be an ASCII image, and is input by performing a predetermined key operation according to the display on the main screen.

ここで、アスキー画像指定コマンドが入力されていなければ（Ｓ１０３：Ｎｏ）、Ｓ１０４の処理をスキップしてその処理をＳ１０５の処理に移行する。一方、アスキー画像指定コマンドが入力されていると（Ｓ１０３：Ｙｅｓ）、アスキーアートフラグ１０３ｃをオンし（Ｓ１０４）、その後、相手側装置を発呼する（Ｓ１０５）。Ｓ１０５の処理により発呼要求がＩＰネットワーク網に設置されたゲートキーパに送信され、ゲートキーパの動作によって、発呼側装置であるＩＰ電話装置２００ａと相手側装置との間の通話状態が確立される。 If no ASCII image designation command is input (S103: No), the process of S104 is skipped and the process proceeds to S105. On the other hand, if an ASCII image designation command is input (S103: Yes), the ASCII art flag 103c is turned on (S104), and then the counterpart device is called (S105). The call request is transmitted to the gatekeeper installed in the IP network by the processing of S105, and the call state between the IP telephone device 200a as the calling device and the partner device is established by the operation of the gatekeeper.

その後、アスキーアートフラグ１０３ｃがオンであるか否かを確認し（Ｓ１０６）、アスキーアートフラグ１０３ｃがオンであれば（Ｓ１０６：Ｙｅｓ）、アスキーアート処理を実行して（Ｓ１０７）、ＣＣＤカメラ１１６から取り込んだ画像データをアスキー画像データに変換する。アスキーアート処理（Ｓ１０７）は、図５に示した第１の実施形態のアスキーアート処理（Ｓ１０）同様の処理である。尚、第１の実施形態においては、アスキー変換処理（Ｓ２８）のＳ４７の処理により、アスキー画像バッファ１３ｂには、テンプレートマッチングにより検出された記号データ（画像データ）が書き込まれるように構成したが、第２の実施形態では、アスキー変換処理（Ｓ２８）のＳ４７の処理において、テンプレートマッチングにより検出された記号データに対応する記号コードが、アスキー画像バッファ１３ｂに書き込まれるものとする。 Thereafter, it is confirmed whether or not the ASCII art flag 103c is on (S106). If the ASCII art flag 103c is on (S106: Yes), ASCII art processing is executed (S107), and the image captured from the CCD camera 116 is obtained. Convert data to ASCII image data. The ASCII art process (S107) is the same process as the ASCII art process (S10) of the first embodiment shown in FIG. In the first embodiment, symbol data (image data) detected by template matching is written to the ASCII image buffer 13b by the process of S47 of the ASCII conversion process (S28). In the second embodiment, it is assumed that the symbol code corresponding to the symbol data detected by template matching in the processing of S47 of the ASCII conversion processing (S28) is written in the ASCII image buffer 13b.

また、Ｓ１０６の処理とＳ１０７の処理との間には、背景画像をＣＣＤカメラ１１６にて撮像し、撮像された背景画像の画像データを背景画像データメモリ１０３ａに書き込む非図示のステップが設けられている。かかるステップは、回線閉結後の初回のアスキーアートフラグ１０３ｃの状態確認（Ｓ１０６）直後に実行され、それ以降は、非実行とされる。 Further, a step (not shown) is provided between the process of S106 and the process of S107, in which a background image is captured by the CCD camera 116 and image data of the captured background image is written to the background image data memory 103a. Yes. This step is executed immediately after checking the state of the first ASCII art flag 103c after the line is closed (S106), and is not executed thereafter.

その後、相手側装置から受信した画像データが受信バッファ１２５ｂに記憶されているか否かを確認すると共に、画像データが受信バッファ１２５ｂに記憶されていると、その画像データをＣＲＴディスプレイ１１２に再生出力する画像出力処理を実行する（Ｓ１０８）。この画像出力処理により出力される画像データは、メイン画面の一部にマルチウィンドウを開いて表示される。尚、受信バッファ１２５ｂに記憶される画像データは、上記したように画像用ＣＯＤＥＣ１１０にて復号化されてＣＲＴディスプレイ１１２に出力される。 Thereafter, it is confirmed whether or not the image data received from the counterpart device is stored in the reception buffer 125b. When the image data is stored in the reception buffer 125b, the image data is reproduced and output to the CRT display 112. Image output processing is executed (S108). The image data output by this image output processing is displayed by opening a multi-window on a part of the main screen. Note that the image data stored in the reception buffer 125b is decoded by the image CODEC 110 and output to the CRT display 112 as described above.

画像出力処理（Ｓ１０８）の後は、マイクロフォン１１５から入力された音声データを相手側装置に送信するための音声送出処理を実行した後（Ｓ１０９）、相手側装置から受信した音声データを、スピーカ１１４から再生出力する音声出力処理を実行する（Ｓ１１０）。その後、通話が終了したか否かを確認し、通話終了であれば（Ｓ１１１：Ｙｅｓ）、相手側装置との通話状態を解消し（通話終了をゲートキーパに通知する）（Ｓ１１２）、アスキーアートフラグ１０３ａをオフする（Ｓ１１３）。続いて、各処理を実行して（Ｓ１１４）、受信した画像データを画像出力するためのマルチウィンドウ表示を終了する処理を行うと共に、電話処理を終了する終了コマンドや他のコマンド（例えば電話番号の登録処理の実行コマンドなど）の入力や、新たな電話番号の指定を待機する。そして、この各処理（Ｓ１１４）において、終了コマンドの入力が認識されると、メイン画面を終了してこの電話処理を終了する。また、その他のコマンドが入力された場合には、入力されたコマンドの内容に応じた処理を実行する。新たな電話番号の指定が認識された場合には、その処理をＳ１０３の処理に移行する。 After the image output process (S108), after executing the voice transmission process for transmitting the voice data input from the microphone 115 to the counterpart device (S109), the voice data received from the counterpart device is sent to the speaker 114. A sound output process for reproducing and outputting from is executed (S110). Thereafter, it is confirmed whether or not the call is finished. If the call is finished (S111: Yes), the call state with the other device is canceled (the end of the call is notified to the gatekeeper) (S112), and the ASCII art flag 103a. Is turned off (S113). Subsequently, each process is executed (S114), a process for ending the multi-window display for outputting the received image data as an image is performed, and at the same time, an end command for ending the telephone process or other commands (for example, telephone number Input of registration processing execution command, etc.) and waiting for designation of a new telephone number. In each process (S114), when the input of the end command is recognized, the main screen is ended and the telephone process is ended. When another command is input, processing corresponding to the content of the input command is executed. If the designation of a new telephone number is recognized, the process proceeds to S103.

更に、Ｓ１０６の処理で確認した結果、アスキーアートフラグ１０３ｃがオフであれば（Ｓ１０６：Ｎｏ）、ＣＣＤカメラ１１６から入力された画像データ（映像）を相手側装置に送出する通常の画像送出処理を実行する（Ｓ１１５）。背景画像の撮影後にＣＣＤカメラ１１６から入力された画像データは、画像用ＲＡＭ１０５に画像フレーム単位で記憶されている。この画像送出処理（Ｓ１１５）では、画像用ＲＡＭ１０５に記憶される画像データを、入力順に従って画像用ＣＯＤＥＣ１１０に出力する。画像用ＲＡＭ１０５から１の画像フレームの出力が終了すると、この画像送出処理（Ｓ１１５）を終了し、その処理をＳ１０８の処理に移行する。画像用ＣＯＤＥＣ１１０に出力された画像データは、圧縮符号化された後、通信制御部１２０に出力される。 Further, if the ASCII art flag 103c is turned off as a result of the confirmation in S106 (S106: No), normal image transmission processing for transmitting image data (video) input from the CCD camera 116 to the counterpart device is executed. (S115). Image data input from the CCD camera 116 after the background image is captured is stored in the image RAM 105 in units of image frames. In this image transmission process (S115), the image data stored in the image RAM 105 is output to the image CODEC 110 in the order of input. When the output of one image frame from the image RAM 105 is completed, this image transmission process (S115) is terminated, and the process proceeds to S108. The image data output to the image CODEC 110 is compressed and encoded, and then output to the communication control unit 120.

また、Ｓ１１１の処理で確認した結果、通話終了でなければ（Ｓ１１１：Ｎｏ）、その処理をＳ１０６の処理に移行し、通話終了が認識される（Ｓ１１１：Ｙｅｓ）まで、Ｓ１０６の処理〜Ｓ１１１の処理、または、Ｓ１０６，Ｓ１１５、Ｓ１０８〜Ｓ１１１の処理を繰り返し、通話状態を維持する。 If the result of confirmation in S111 is not the end of the call (S111: No), the process proceeds to S106, and the process from S106 to S111 is performed until the end of the call is recognized (S111: Yes). The process or the processes of S106, S115, and S108 to S111 are repeated to maintain the call state.

尚、ＩＰ電話装置２００ｃにおいても、上記と同様の電話処理が実行されるが、Ｓ１０１の処理は省略され、発呼と着呼とを交互に検出する待機状態となっている。そして、オフフック状態が検出されると、発呼要求と判断して、Ｓ１０２の処理が実行されることとなる。また、アスキー画像指定コマンドは、そのコマンド入力ボタンを操作することにより入力される。 In the IP telephone device 200c, the same telephone process as described above is executed, but the process of S101 is omitted, and the apparatus is in a standby state in which outgoing calls and incoming calls are detected alternately. When the off-hook state is detected, it is determined that the request is a call request, and the process of S102 is executed. Also, the ASCII image designation command is input by operating the command input button.

図１０は、図９の電話処理の中で実行される音声送出処理（Ｓ１０９）のフローチャートである。この音声送出処理（Ｓ１０９）は、上記したようにマイクロフォン１１５から入力された音声データ（送話者の発声）を、相手側装置に送出する処理である。マイクロフォン１１５から入力された音声データは、音声用ＣＯＤＥＣ１０９によりデジタルデータに変換され音声用ＲＡＭ１０６に記憶されている。そして、この音声用ＲＡＭ１０６に記憶される音声データをＤＣＴ処理（離散コサイン変換処理）した後（Ｓ１２１）、そのＤＣＴ処理された音声データについて最可聴限界以下の成分を除去する（Ｓ１２２）。 FIG. 10 is a flowchart of the voice transmission process (S109) executed in the telephone process of FIG. This voice sending process (S109) is a process for sending the voice data (speaker's voice) input from the microphone 115 to the counterpart device as described above. Audio data input from the microphone 115 is converted into digital data by the audio CODEC 109 and stored in the audio RAM 106. Then, the audio data stored in the audio RAM 106 is subjected to DCT processing (discrete cosine transform processing) (S121), and then the components below the audible limit are removed from the DCT-processed audio data (S122).

最可聴限界とは、通常の人間の聴力で聴きとることのできる限界の音声であり、周波数帯域と音量のレベルとにより決定される。例えば、２０ヘルツ〜５キロヘルツは、小さな音でも良く聞こえる周波数帯域であり、これよりも高い周波数や低い周波数になればなるほど可聴困難な領域となる。また、同じ周波数帯域であっても、音量レベルが低いと、可聴困難となる。最可聴限界以下とは、周波数帯域および音量レベルの因子により、通常、人間の聴力で聴きとることのできない音声の領域である。 The most audible limit is a limit sound that can be heard with normal human hearing, and is determined by a frequency band and a volume level. For example, 20 Hz to 5 kHz is a frequency band in which even a small sound can be heard well, and the higher the frequency or the lower the frequency, the more difficult it is to hear. Even in the same frequency band, if the volume level is low, it becomes difficult to hear. Below the maximum audible limit is a region of sound that cannot be heard by human hearing normally due to factors of frequency band and volume level.

従って、Ｓ１２２の処理により、最可聴限界以下の成分であるか否かは、ＤＣＴ処理された音声データの各成分の周波数帯域と音量レベルとにより判別される。尚、周波数帯域毎に定められた音量レベルが最可聴限界を示す閾値としてハードディスク１０４の所定のエリアに記憶されている。 Therefore, whether or not the component is below the maximum audible limit is determined by the processing of S122 based on the frequency band and volume level of each component of the audio data subjected to DCT processing. The volume level determined for each frequency band is stored in a predetermined area of the hard disk 104 as a threshold value indicating the maximum audible limit.

次に、アスキーアートフラグ１０３ａがオンであるか否かを確認し（Ｓ１２３）、オフであれば（１２３：Ｎｏ）、Ｓ１２４，Ｓ１２５の処理をスキップして、その処理をＳ１２６の処理に移行する。オンであれば（Ｓ１２３：Ｙｅｓ）、アスキー画像バッファ１０３ｂに記憶される記号コード（８ビットのデータ、２進数のデータ）を、除去した最可聴限界以下の成分として、音声データに付加する（Ｓ１２４）。つまり、除去された音声データの成分の周波数帯域に、記号データを、その周波数帯域に対応して予め定められた音量レベル以下となる大きさで組み込む（限界周波数成分のデータ信号に変換する）。 Next, it is confirmed whether or not the ASCII art flag 103a is on (S123). If it is off (123: No), the processes of S124 and S125 are skipped, and the process proceeds to the process of S126. If it is on (S123: Yes), the symbol code (8-bit data, binary data) stored in the ASCII image buffer 103b is added to the audio data as a component below the maximum audible limit (S124). ). That is, the symbol data is incorporated into the frequency band of the removed audio data component in a magnitude that is equal to or lower than a predetermined volume level corresponding to the frequency band (converted to a data signal of a limit frequency component).

これにより、記号データを、音声データの可聴限界以下の成分として、音声データに重畳（保有）させることができ、記号データが音声データ中に含まれていても、再生する音声の品質に大きな影響を与えることはない。従って、かかる記号コードの部分を除去せずに音声データを再生出力しても、付加された記号コードによる耳障りなノイズや可聴不能とする音声品質の低下が発生することはない。故に、相手側装置が、汎用の電話装置であっても、通話に支障を来すことはない。 As a result, the symbol data can be superimposed (held) on the audio data as a component below the audible limit of the audio data, and even if the symbol data is included in the audio data, the quality of the reproduced audio is greatly affected. Never give. Therefore, even if the audio data is reproduced and output without removing such a symbol code portion, the added symbol code does not cause annoying noise or voice quality deterioration that makes it inaudible. Therefore, even if the counterpart device is a general-purpose telephone device, it does not hinder the call.

尚、相手側装置が、最可聴限界以下の成分を音声として出力しないとする（再生しない周波数帯（非再生領域）が予め設けられている）ものであれば、相手側装置において、最可聴限界以下の成分として保有される記号データは、音声データとしては出力されない。かかる場合の、音声データの最可聴限界以下の成分を除去するＳ１２２の処理が、請求項８記載の第１除去手段に該当する。尚、再生しない周波数帯（非再生領域）は、最可聴限界以下の成分の領域に限られるものではなく、所定の周波数帯域に設定されていても良い。 If the counterpart device does not output a component below the maximum audible limit as a sound (a frequency band that is not reproduced (non-reproduction area) is provided in advance), the maximum audible limit is established in the counterpart device. Symbol data held as the following components is not output as audio data. In this case, the process of S122 for removing the component below the maximum audible limit of the audio data corresponds to the first removing means according to claim 8. The frequency band that is not reproduced (non-reproduction area) is not limited to the area of components below the maximum audible limit, and may be set to a predetermined frequency band.

また、アスキー画像バッファ１０３ｂには、分割されたブロックの位置（座標）に従って、順に記号コードが記憶されており、Ｓ１２４の処理では、アスキー画像バッファ１０３ｂから、その記憶された順（アドレスの順）に従って記号コードを読み出し、その読み出した順に音声データに付加する。このため、受信した音声データから順次、最可聴限界以下の成分を取り出せば、記号コードの配列を元画像の座標に従ったものとでき、かかる記号コードの配列から元のアスキー画像を再生出力することができる。 The ASCII image buffer 103b stores symbol codes in order according to the position (coordinates) of the divided blocks. In the process of S124, the stored order (address order) from the ASCII image buffer 103b. The symbol codes are read according to the above and added to the audio data in the read order. For this reason, if components below the maximum audible limit are sequentially extracted from the received audio data, the symbol code arrangement can be made in accordance with the coordinates of the original image, and the original ASCII image is reproduced and output from the symbol code arrangement. be able to.

また、第２の実施形態においては、音声用ＣＯＤＥＣ１０９にてサンプリングされた１サンプリング単位の音声データからＤＣＴ処理により変換した１単位の音声データに、１の記号コードを付加するものとする。尚、１単位の音声データに付加する記号コードは２以上の複数の記号コードであってもよく、１単位の音声データにいくつの記号コードを付加するかは、伝送速度や画像品質を加味して、適宜、設定されるものとする。 In the second embodiment, it is assumed that one symbol code is added to one unit of audio data converted by DCT processing from one sampling unit of audio data sampled by the audio CODEC 109. The symbol code added to one unit of audio data may be two or more symbol codes. The number of symbol codes added to one unit of audio data depends on the transmission speed and image quality. It shall be set as appropriate.

Ｓ１２４の処理の後は、音声データにアスキー画像を形成する記号コードが含まれていることを示すアスキー保有コードを、音声データに付加して（Ｓ１２５）、そのアスキー保有コードの付加された音声データを通信制御部１２０に出力し（Ｓ１２６）、この音声送出処理（Ｓ１０９）を終了する。 After the process of S124, an ASCII possession code indicating that the symbol data for forming the ASCII image is included in the speech data is added to the speech data (S125), and the speech data to which the ASCII possession code is added. Is output to the communication control unit 120 (S126), and the voice transmission process (S109) is terminated.

通信制御部１２０では、音声送出処理（Ｓ１０９）により入力された音声データを時間軸方向に区切ってパケット化する共に、相手側装置の識別子などと共にアスキー保有コードがパケットのヘッダデータとして付加される。そしてパケット化されたデータを所定の伝送方式で回線へ出力する。これにより、アスキー画像は、音声データの態様で相手側装置に送出されることとなる。 In the communication control unit 120, the voice data input by the voice transmission process (S109) is divided into packets in the time axis direction, and the ASCII holding code is added as the packet header data together with the identifier of the counterpart device. Then, the packetized data is output to the line by a predetermined transmission method. As a result, the ASCII image is sent to the counterpart device in the form of audio data.

尚、アスキー画像の出力でない場合など、画像送出処理（Ｓ１１５）により、画像用ＣＯＤＥＣ１１０を介して通信制御部１２０に送出された画像データが、送信バッファ１２５ａに記憶されていると、その画像データと対応する音声データとが信号多重部１２４により多重化された後、パケット化されて、回線に出力される。 When the image data sent to the communication control unit 120 via the image CODEC 110 by the image sending process (S115) is stored in the transmission buffer 125a, such as when the ASCII image is not output, the image data and The corresponding audio data is multiplexed by the signal multiplexing unit 124, then packetized and output to the line.

図１１は、図９の電話処理の中で実行される音声出力処理（Ｓ１１０）のフローチャートである。この音声出力処理（Ｓ１１０）では、まず、通信制御部１２０の受信バッファ１２５ｂに、音声データが記憶されているか否かを確認し（Ｓ１３１）、記憶されていなければ（Ｓ１３１：Ｎｏ）、この音声出力処理（Ｓ１１０）を終了する。一方、受信バッファ１２５ｂに音声データが記憶されていると（Ｓ１３１：Ｙｅｓ）、受信バッファ１２５ｂに記憶されている音声データを読み出し、読み出した音声データにアスキー保有コードが付加されているか否かを確認する（Ｓ１３２）。アスキー保有コードが付加されていた場合には（Ｓ１３２：Ｙｅｓ）、その音声データに最可聴限界以下の周波数成分として具有される２進数の値を、受信した音声データの順に従って、順次、ＲＡＭ１０３の所定領域に書き込む（Ｓ１３３）。尚、受信バッファ１２５ｂに記憶されている音声データは、最可聴限界以下の周波数成分の２進数の値をＲＡＭ１０３の所定領域に書き込んだ後、受信バッファ１２５ｂから消去される。その後、ＲＡＭ１０３の所定領域に書き込まれた２進数を８ビット毎に読み取り、ハードディスク１４の記号データメモリ１４ｂを参照して、読み取った８ビットの記号コードを記号データに変換する（Ｓ１３４）。そして、変換された記号データを、ＣＲＴディスプレイ１１２に表示される電話処理のメイン画面の所定領域（所定のウィンドウ）に出力する（Ｓ１３５）。尚、ＩＰ電話機２００ｃにおいては、このＳ１３５の処理により、アスキー画像データがＬＣＤに出力される。 FIG. 11 is a flowchart of the voice output process (S110) executed in the telephone process of FIG. In this audio output process (S110), first, it is confirmed whether or not audio data is stored in the reception buffer 125b of the communication control unit 120 (S131). If not stored (S131: No), this audio is processed. The output process (S110) ends. On the other hand, when the audio data is stored in the reception buffer 125b (S131: Yes), the audio data stored in the reception buffer 125b is read, and it is confirmed whether or not the ASCII holding code is added to the read audio data. (S132). When the ASCII possession code is added (S132: Yes), the binary value included in the sound data as a frequency component below the maximum audible limit is sequentially stored in the RAM 103 according to the order of the received sound data. Write to a predetermined area (S133). The audio data stored in the reception buffer 125b is erased from the reception buffer 125b after writing the binary value of the frequency component below the maximum audible limit in a predetermined area of the RAM 103. Thereafter, the binary number written in the predetermined area of the RAM 103 is read every 8 bits, and the read 8-bit symbol code is converted into symbol data by referring to the symbol data memory 14b of the hard disk 14 (S134). Then, the converted symbol data is output to a predetermined area (predetermined window) of the main screen of the telephone process displayed on the CRT display 112 (S135). In the IP telephone 200c, ASCII image data is output to the LCD by the process of S135.

その後、受信バッファ１２５ｂに記憶される音声データにＩＤＣＴ処理（離散コサイン逆変換処理）を行ってから（Ｓ１３６）、ＩＤＣＴ処理された音声データを、音声用ＣＯＤＥＣ１０９に出力して（Ｓ１３７）、この音声出力処理（Ｓ１１０）を終了する。音声用ＣＯＤＥＣ１０９に出力された音声データは、復号化によりアナログデータに変換されて、スピーカ１１４から再生出力される。 Thereafter, the audio data stored in the reception buffer 125b is subjected to IDCT processing (inverse discrete cosine transform processing) (S136), and the audio data subjected to IDCT processing is output to the audio CODEC 109 (S137). The output process (S110) ends. The audio data output to the audio CODEC 109 is converted into analog data by decoding and reproduced and output from the speaker 114.

一方、Ｓ１３２の処理で確認した結果、アスキー保有コードが付加されていなければ（Ｓ１３２：Ｎｏ）、音声データには、アスキー画像を形成するデータ（記号コード）が含まれていないと判断し、Ｓ１３３〜Ｓ１３５の処理をスキップして、その処理をＳ１３６の処理に移行する。 On the other hand, if the ASCII possession code is not added as a result of the confirmation in the process of S132 (S132: No), it is determined that the data (symbol code) forming the ASCII image is not included in the audio data, and S133. The process of S135 is skipped and the process proceeds to S136.

パケット内の各音声データから、アスキー画像を形成する記号コードが含まれているか否かを判断する場合には、最可聴限界以下の成分の２進数のデータが「０」が、記号コードが含まれていないが故であるのか、画像データの無地部分であるが故であるかの判別が困難になる。場合によっては、１の画像フレームに対応する全パケットについて、データの読み取りを終了するまで、その判別ができず、アスキー画像形成の処理速度を低下させがねない。しかし、第２の実施形態においては、アスキー画像を形成する記号コードを含む音声データであることは、アスキー保有コードにて示されるので、音声データに記号コードが含まれているか否かを容易に判定でき、アスキー画像形成をスムーズに実行することができる。 When it is determined from each audio data in the packet whether or not a symbol code forming an ASCII image is included, the binary data of the component below the maximum audible limit is “0” and the symbol code is included. It is difficult to determine whether this is because it is not, or because it is a plain part of the image data. In some cases, all packets corresponding to one image frame cannot be discriminated until data reading is completed, and the processing speed of ASCII image formation may be reduced. However, in the second embodiment, since the voice data including the symbol code forming the ASCII image is indicated by the ASCII possessed code, it is easily determined whether or not the symbol data is included in the voice data. As a result, ASCII image formation can be executed smoothly.

このように、第２の実施形態によれば、音声と画像とを相手側装置に出力するＩＰ電話装置２００において、相手側装置に送出する画像をアスキー画像とすることができる。このため、実際の映像を出力するテレビ電話とは異なる趣を持った画像によるコミュニケーションをはかることができる。また、実際の映像をそのまま相手側装置に送出したくない場合などにおいては、送話者側のプライバシーを守りつつ、元の映像に基づいた高度なコミュニケーションを行うことができる。また、アスキー画像は、記号の組み合わせにより描画されるものであるので、画像形成機能のない装置やプログラムであっても、文字記号を表示する機能を備えていれば、アスキー画像を現出させて演出を実行することができる。 As described above, according to the second embodiment, in the IP telephone device 200 that outputs sound and an image to the counterpart device, the image sent to the counterpart device can be an ASCII image. For this reason, it is possible to communicate with an image having a taste different from that of a videophone that outputs an actual video. Further, in the case where it is not desired to transmit the actual video as it is to the counterpart device, it is possible to perform advanced communication based on the original video while protecting the privacy of the sender. In addition, ASCII images are drawn by combinations of symbols, so even if an apparatus or program without an image forming function has a function of displaying character symbols, the ASCII image can be displayed. Production can be performed.

尚、第２の実施形態においては、送信側装置となるＩＰ電話装置２００は、アスキー画像指定コマンドが入力された場合に実際の映像に代えてアスキー画像を相手側装置に送出したが、実際の映像と共にアスキー画像を相手側装置に送出するように構成しても良い。アスキー画像と実際の映像（ＣＣＤカメラ１１６から入力された画像データ）とを共に相手側装置に送出することにより送信側装置から送出する画像データの量が増加しても、アスキー画像は音声データとして送出されるので、画像データを送出する処理（画像データ送出処理）が煩雑になることはない。 In the second embodiment, the IP telephone device 200 serving as the transmission side device sends an ASCII image to the counterpart device instead of the actual video when the ASCII image designation command is input. You may comprise so that an ASCII image may be sent to a partner apparatus with a video. Even if the ASCII image and the actual video (image data input from the CCD camera 116) are both sent to the counterpart device and the amount of image data sent from the sender device is increased, the ASCII image is converted into audio data. Since the data is transmitted, the process of transmitting the image data (image data transmission process) does not become complicated.

また、送信側装置となるＩＰ電話装置２００を、上記のように、実際の映像と共にアスキー画像を送出するように構成した場合において、相手側装置となるＩＰ電話装置２００に、実際の映像の出力とアスキー画像の出力とを切り替えるスイッチと、かかるスイッチの状態を割り込み処理にて監視する監視手段とを設け、その監視手段により検出されるスイッチの状態に応じて、受信した映像とアスキー画像との一方を出力するように構成しても良い。 In addition, when the IP telephone device 200 serving as the transmission side device is configured to send an ASCII image together with the actual video as described above, the actual video output to the IP telephone device 200 serving as the counterpart side device. And a switch for switching the output of the ASCII image, and monitoring means for monitoring the state of the switch by interrupt processing, and according to the state of the switch detected by the monitoring means, the received video and ASCII image You may comprise so that one may be output.

以上、上記各実施の形態に基づき本発明を説明したが、本発明は、上記各実施の形態に何ら限定されるものでなく、本発明の趣旨を逸脱しない範囲内で種々の改良変形が可能であることは容易に推察できるものである。 Although the present invention has been described based on the above embodiments, the present invention is not limited to the above embodiments, and various improvements and modifications can be made without departing from the spirit of the present invention. It can be easily guessed.

例えば、上記各実施の形態では、アスキー変換処理（Ｓ２８）において、画像フレームは、記号の大きさと同じ大きさのブロックに分割したが、記号の大きさブロックの大きさとは、テンプレートマッチングの際に整合されていればよく、マッチングする際に一方の画像を拡大または縮小しても良い。このため、分割されるブロックの大きさは、必ずしも記号の大きさと整合させる必要はない。 For example, in each of the above embodiments, in the ASCII conversion process (S28), the image frame is divided into blocks having the same size as the symbol size. The symbol size block size is the same as that in template matching. As long as they are matched, one image may be enlarged or reduced when matching. For this reason, the size of the block to be divided does not necessarily match the size of the symbol.

また、分割するブロック数は、８０×２５に限られるものでなく、更に、そのブロック数を可変値とし、使用者が操作パネルからその値を入力することにより任意に設定できるようにしてもよい。使用者によりブロック数が設定された場合には、Ｓ４１およびＳ４２の処理において、その設定されたブロック数となるように横分割数ｍと、縦分割数ｎが設定される。これにより、使用者の意向を反映したアスキー画像を作成でき、表現の自由度を向上させて娯楽性を向上させると共に、使用者が自己の操作によって所望の変形を施すことができるので、作成されるアスキー画像に対し、使用者に高い満足度を付与し得る。 Further, the number of blocks to be divided is not limited to 80 × 25. Further, the number of blocks may be a variable value, and may be arbitrarily set by the user inputting the value from the operation panel. . When the number of blocks is set by the user, the horizontal division number m and the vertical division number n are set so that the set number of blocks is obtained in the processing of S41 and S42. As a result, an ASCII image reflecting the user's intention can be created, and the degree of freedom of expression is improved and the entertainment is improved, and the user can perform desired deformation by his / her own operation. A high degree of satisfaction can be given to the user with respect to the ASCII image.

更に、上記各実施の形態では、背景画像は、アスキー画像による１の演出が開始される毎に行うように構成されたが、背景画像の取得（背景画像メモリ１３ａ，１０３ａに記憶される背景画像の画像データの更新）は、必ずしも、アスキー画像による演出が実行される毎に行われる必要はない。例えば、操作者により要求された場合にのみ背景画像を撮像し、背景画像が取得された場合に限って、背景画像メモリ１３ａ，１０３ａに記憶される背景画像の画像データを、新たに取得した背景画像の画像データにより更新するようにしても良い。 Further, in each of the above-described embodiments, the background image is configured to be performed every time an effect of the ASCII image is started. However, the background image is acquired (the background image stored in the background image memories 13a and 103a). (Update of the image data) does not necessarily have to be performed every time the effect by the ASCII image is executed. For example, the background image is captured only when requested by the operator, and the image data of the background image stored in the background image memories 13a and 103a is newly acquired only when the background image is acquired. You may make it update with the image data of an image.

また、上記各実施の形態においては、各画像フレームのそれぞれをアスキー画像に変換したが、取得された全画像データ（画像フレーム）から、所定間隔で画像フレームをサンプリングし、サンプリングされた画像フレームについてのみアスキー画像を形成して、他の画像フレームについては廃棄するように構成しても良い。また、サンプリングする所定間隔については、使用者が任意に設定できるようにしても良い。これによれば、効率的にアスキー画像を形成してその形成処理の速度全体を高速化できる上、コマ送り表示として再生されるアスキー画像の動画の娯楽性を高めることができる。 In each of the above embodiments, each of the image frames is converted to an ASCII image. However, image frames are sampled at predetermined intervals from all acquired image data (image frames), and the sampled image frames are sampled. Only the ASCII image may be formed and the other image frames may be discarded. In addition, the user may arbitrarily set the predetermined interval for sampling. According to this, it is possible to efficiently form an ASCII image and increase the overall speed of the forming process, and it is possible to improve the entertainment of the moving image of the ASCII image reproduced as a frame advance display.

更に、アスキーアート処理（Ｓ２８，Ｓ１０７）においては、人物画像の面積が画像フレーム全体の半分未満である場合に、人物画像の面積が画像フレーム全体の半分となるように１段階で画像の拡大を行った。これに代えて、１回で拡大される拡大率を固定倍率とし、段階的に元の人物画像を拡大するように構成しても良い。これによれば、操作者所望の倍率に人物画像を拡大することができる。 Further, in the ASCII art process (S28, S107), when the area of the person image is less than half of the entire image frame, the image is enlarged in one step so that the area of the person image becomes half of the entire image frame. It was. Instead of this, it may be configured to enlarge the original person image in a stepwise manner with an enlargement ratio that is enlarged once as a fixed magnification. According to this, the person image can be enlarged to a magnification desired by the operator.

加えて、上記各実施の形態では、画像フレームを分割した場合に、その画像フレーム内における人物画像の面積が所定の面積未満であると、人物画像データの拡大を行ったが、人物画像の面積が所定の面積以上（人物画像データを有するブロック数が所定数以上）である場合については、人物画像データの圧縮を実行しても良い。これによれば、表示画面に対して、人物画像が大きすぎるために視認性が悪化することを、回避できる。尚、かかる場合の人物画像データの圧縮を実行する処理が、請求項５記載の画像データ圧縮手段に該当する。 In addition, in each of the above embodiments, when the image frame is divided, the human image data is enlarged if the area of the human image in the image frame is less than the predetermined area. Is greater than or equal to a predetermined area (the number of blocks having person image data is greater than or equal to a predetermined number), the person image data may be compressed. According to this, it is possible to avoid deterioration in visibility due to the person image being too large with respect to the display screen. In this case, the process for executing the compression of the human image data corresponds to the image data compression means according to claim 5.

更に、上記各実施の形態では、画像データを入力する画像データ入力手段として、リアルタイムで画像データを取得するＣＣＤカメラを採用したが、入力される画像データはリアルタイムで取得されたものに限られるものでなく、予め画像データが記憶された記憶媒体とその記憶媒体に記憶される画像データを読み取る読み取り手段とにより画像データを入力する画像データ入力手段を構成しても良い。 Furthermore, in each of the above-described embodiments, a CCD camera that acquires image data in real time is adopted as the image data input means for inputting image data. However, the input image data is limited to that acquired in real time. Alternatively, the image data input means for inputting the image data may be configured by a storage medium in which image data is stored in advance and a reading means for reading the image data stored in the storage medium.

本発明の画像処理装置を備えたカラオケ装置を示す概略図である。It is the schematic which shows the karaoke apparatus provided with the image processing apparatus of this invention. カラオケ装置の電気的構成を示すブロック図である。It is a block diagram which shows the electric constitution of a karaoke apparatus. アスキー画像が形成される課程を模式的に示した図である。It is the figure which showed typically the process in which an ASCII image is formed. 図４は、カラオケ装置本体の制御部にて実行されるカラオケ再生処理のフローチャートである。FIG. 4 is a flowchart of karaoke playback processing executed by the control unit of the karaoke apparatus main body. 図４のカラオケ再生処理の中で実行されるアスキーアート処理のフローチャートである。5 is a flowchart of ASCII art processing executed in the karaoke playback processing of FIG. 図５のアスキーアート処理の中で実行されるアスキー変換処理のフローチャートである。6 is a flowchart of an ASCII conversion process executed in the ASCII art process of FIG. 第２の実施形態の画像処理装置であるＩＰ電話装置を備えた電話システムを示した概略図である。FIG. 5 is a schematic diagram showing a telephone system including an IP telephone device that is an image processing device of a second embodiment. 電話システムのＩＰ電話装置の電気的構成を示すブロック図である。It is a block diagram which shows the electrical constitution of the IP telephone apparatus of a telephone system. 第２の実施形態の画像処理装置であるＩＰ電話装置にて実行される電話処理を示したフローチャートである。6 is a flowchart showing telephone processing executed by an IP telephone device which is an image processing device according to a second embodiment. 図９の電話処理の中で実行される音声送出処理のフローチャートである。10 is a flowchart of voice transmission processing executed in the telephone processing of FIG. 9. 図９の電話処理の中で実行される音声出力処理のフローチャートである。It is a flowchart of the audio | voice output process performed in the telephone process of FIG.

１カラオケ装置（画像処理装置）
１ａ本体（画像処理部）
２ＣＲＴディスプレイ（表示装置）
３，１１６ＣＣＤカメラ（画像データ入力手段、撮像装置）
１３ａ，１０３ａ背景画像メモリ（背景画像記憶手段）
１４ｂ，１０４ｂ記号データメモリ（記号データ記憶手段）
１４ｃ楽曲データメモリ（伴奏情報記憶手段）
１４ｄ歌詞データメモリ（歌詞データ記憶手段）
２１，１１５伴奏装置（伴奏発生手段）
２３マイクロフォン（音声データ入力手段）
２４スピーカ（発音手段）
１０９音声用ＣＯＤＥＣ（データ変換手段の一部、音声再生手段の一部）
１２０通信制御部（出力手段、データ変換手段の一部、音声データ受信手段）
２００ＩＰ電話装置、相手側装置、送信側装置
Ｓ１５出力手段の一部
Ｓ２３，Ｓ２４固有画像データ抽出手段、固有画像データ抽出ステップ
Ｓ２５，Ｓ２９〜Ｓ３１画像データ伸張手段、画像データ伸張ステップ
Ｓ２５判断手段、判断ステップ
Ｓ２７線画変換手段、線画変換ステップ
Ｓ４１，Ｓ４２分割手段、分割ステップ
Ｓ４７抽出手段、抽出ステップ
Ｓ４８画像形成手段の一部、画像形成ステップ
Ｓ１２１，Ｓ１２２データ変換手段の一部
Ｓ１２２第２除去手段
Ｓ１２４重畳手段
Ｓ１３２，Ｓ１３７音声再生手段の一部
Ｓ１３４，Ｓ１３５画像データ再生手段の一部
Ｓ１０アスキーアート処理（加工手段の一部、画像処理プログラム） 1 Karaoke device (image processing device)
1a Body (image processing unit)
2 CRT display (display device)
3,116 CCD camera (image data input means, imaging device)
13a, 103a Background image memory (background image storage means)
14b, 104b Symbol data memory (symbol data storage means)
14c Music data memory (accompaniment information storage means)
14d Lyric data memory (lyric data storage means)
21,115 Accompaniment device (accompaniment generating means)
23 Microphone (voice data input means)
24 Speaker (pronunciation means)
109 CODEC for voice (part of data conversion means, part of voice playback means)
120 Communication control unit (output means, part of data conversion means, voice data receiving means)
200 IP telephone apparatus, counterpart apparatus, transmission apparatus S15 part of output means S23, S24 unique image data extraction means, unique image data extraction steps S25, S29-S31 image data expansion means, image data expansion step S25 determination means, Determination step S27 Line drawing conversion means, line drawing conversion steps S41, S42 division means, division step S47 extraction means, extraction step S48 part of image formation means, image formation steps S121, S122 part of data conversion means S122 second removal means S124 Superimposing means S132, S137 Part of sound reproducing means S134, S135 Part of image data reproducing means S10 ASCII art processing (part of processing means, image processing program)

Claims

Image data input means for inputting image data having a plurality of image frames and capable of appearing continuous images, processing means for processing the image data input by the image data input means for each image frame, and An image processing unit including an output unit that outputs image data by sequentially outputting image frames processed by the processing unit according to the input order; and the image frames output by the output unit of the image processing unit in order In an image processing apparatus comprising a display device for displaying and displaying the image data as a moving image,
The image processing unit includes a symbol data storage unit that stores image data of a plurality of symbols, and the processing unit of the image processing unit converts the image data input by the image data input unit into line image data A line drawing conversion means for converting the image data into a predetermined number of blocks; and an image frame of the image data input by the image data input means or an image frame having line image data converted by the line drawing conversion means Extraction means for extracting, in block units divided by the dividing means, symbols assigned according to the mode of the line image data from the symbol image data stored in the symbol data storage means; An image forming method for forming replacement image data by replacing the line image data of the corresponding block with the image data of the symbol extracted by the extraction means Includes bets, output means of the image processing unit, and outputs a replacement image data formed by the replacement image data,
It performs video display of the replacement image data at the display device,
The image processing unit includes an imaging device that acquires image data by photographing as the image data input unit, and further includes a background image storage unit that stores background image data acquired as a background image by the imaging device. With
The processing means of the image processing unit removes unique image data unique to each image frame by excluding a common part with the background image data stored in the background image storage means from the image data acquired by the imaging device. A unique image data extracting means for extracting,
The line drawing conversion means of the image processing unit converts the unique image data extracted by the unique image data extraction means into line image data,
The image forming unit of the image processing unit forms replacement image data by line image data converted from the unique image data,
When the image processing unit of the image processing unit divides the image frame, the number of blocks having the line image data or the unique image data in the image frame is equal to or less than a predetermined lower limit value. An image data expansion means for expanding the line image data or the unique image data in a direction to increase the number of blocks having the line image data or the unique image data in the image frame;
The image forming unit of the image processing unit forms replacement image data based on the decompressed image data when the line image data or unique image data is decompressed by the image data decompressing unit. An image processing apparatus.

The image data decompression unit includes a determination unit that determines whether or not a ratio of the unique image data extracted by the unique image data extraction unit to a source image frame is equal to or less than a predetermined ratio. And
When the determining means determines that the ratio occupied by the unique image data is equal to or less than a predetermined ratio, the number of blocks having the unique image data is determined to be equal to or less than a predetermined lower limit value. The image processing apparatus according to claim 1 , wherein the image processing apparatus performs expansion.

When the image processing unit of the image processing unit divides the image frame, the number of blocks having the line image data or the unique image data in the image frame is greater than or equal to a predetermined upper limit value. Image data compression means for compressing the line image data or the unique image data and reducing the number of blocks having the line image data or the unique image data in the image frame;
The image forming unit of the image processing unit forms replacement image data based on the compressed image data when the image data is compressed by the image data compression unit. Item 3. The image processing apparatus according to Item 1 or 2 .

Accompaniment information storage means for storing accompaniment information;
Accompaniment generating means for generating accompaniment based on the accompaniment information stored in the accompaniment information storage means;
Voice data input means for inputting voice data;
A sounding means for sounding the sound data input by the sound data input means and the accompaniment generated by the accompaniment generating means;
Lyric data storage means corresponding to each accompaniment information and storing lyric data displayed on the display device in accordance with the accompaniment sounded by the sounding means,
With the lyrics data stored in the lyrics data memory means, according to any one of claims 1 to 3, characterized in that the replacement image data output by said output means is for displaying on said display device Image processing device.

5. An audio data input unit that includes the image processing unit according to any one of claims 1 to 4 and is connected to a counterpart device via a line, and that inputs audio data; and an audio that is input by the audio data input unit Data conversion means for converting data into a data signal to be output to the line, and an identification code corresponding to replacement image data formed by the image forming means of the image processing section is converted into a data signal of audio data to convert the data A transmitting side device having superimposing means for superimposing on the data signal converted by the means,
Said output means of the image processing unit of claims 1 to 4, characterized in that in which the data signal superimposed by the superimposing means and outputting to said line and outputs the replacement image data to the partner apparatus The image processing apparatus according to any one of the above.

The transmission side device includes first removal means for removing a non-reproduction frequency component set as a non-reproduction area for the audio data input by the audio data input means,
The superimposing unit converts the identification code of the replacement image data into a data signal of a band of the non-reproducing frequency component removed by the first removing unit and superimposes it on the data signal converted by the data converting unit The image processing apparatus according to claim 5, wherein:

The transmission side device includes second removal means for removing a limit frequency component exceeding an audible limit for the voice data input by the voice data input means,
The superimposing means converts the identification code of the replacement image data into a data signal of a limit frequency component removed by the second removing means and superimposes it on the data signal converted by the data converting means. The image processing apparatus according to claim 5 or 6 .

The display device, voice data receiving means for receiving voice data, and voice playback means for playing back voice data received by the voice data receiving means by voice while being connected to the transmission side device via the line And when the audio data received by the audio data receiving means includes a data signal of an identification code corresponding to the replacement image data, image data reproduction means for reproducing the replacement image data from the data signal a, the image processing apparatus according to any one of claims 5, characterized in that it comprises a mating device 7 for displaying the replacement image data reproduced by the image data reproduction means at the display device.

Processing for processing image data input by the image data input means for each image frame in a computer having image data input means for inputting image data having a plurality of image frames and capable of appearing continuous images An image processing step comprising: a step, and an output step for outputting image data by sequentially outputting the image frame processed by the processing step according to the input order; In the image processing program for displaying the image frames in order and displaying the image data as a moving image on the display device of the computer ,
The computer includes symbol data storage means in which image data of a plurality of symbols is stored,
The processing step of the image processing step includes a line drawing conversion step of converting the image data input by the image data input means into line image data, and an image frame of the image data input by the image data input means or the A division step for dividing an image frame having line image data converted by the line drawing conversion step into a predetermined number of blocks, and a block unit divided by the division step , are allocated according to the mode of the line image data . An extraction step for extracting the symbol from the symbol image data stored in the symbol data storage means, and a replacement image obtained by replacing the line image data of the corresponding block with the symbol image data extracted by the extraction step and an image forming step of forming a data is executed on the computer, the image processing Output step of step is for outputting the replacement image data formed by the replacement image data,
The moving image of the replacement image data is displayed on the display device,
The computer includes an imaging device that acquires image data by photographing as the image data input unit, and further includes a background image storage unit that stores background image data acquired as a background image by the imaging device,
In the processing step of the image processing step, unique image data unique to each image frame is removed from the image data acquired by the imaging device except for a common portion with the background image data stored in the background image storage unit. Causing the computer to execute a unique image data extraction step to extract,
The line drawing conversion step of the image processing step converts the unique image data extracted by the unique image data extraction step into line image data,
The image forming step of the image processing step is to form replacement image data by line image data converted from the unique image data,
In the processing step of the image processing step, when the image frame is divided by the dividing step, the number of blocks having the line image data or the unique image data in the image frame is equal to or less than a predetermined lower limit value. , Causing the computer to execute an image data expansion step of expanding the line image data or the unique image data in a direction to increase the number of blocks having the line image data or the unique image data in the image frame,
In the image forming step of the image processing step, when the line image data or unique image data is expanded by the image data expansion step, replacement image data is formed based on the expanded image data. An image processing program characterized by the above.

The image data decompression step includes a determination step of determining whether a ratio of the unique image data extracted by the unique image data extraction step to a source image frame is equal to or less than a predetermined ratio. Let the computer run,
In the determination step, when it is determined that the ratio occupied by the unique image data is equal to or less than a predetermined ratio, the number of blocks having the unique image data is determined to be equal to or less than a predetermined lower limit value. claim 9, wherein the image processing program, characterized in that performs a decompression.