JP2014238773A - Character recognition device, character recognition method, and character recognition program

Info

Publication number: JP2014238773A
Application number: JP2013121992A
Authority: JP
Inventors: Katsutoshi Obara; Kazuo Nakamura
Original assignee: Fujitsu Frontech Ltd
Current assignee: Fujitsu Frontech Ltd
Priority date: 2013-06-10
Filing date: 2013-06-10
Publication date: 2014-12-18
Anticipated expiration: 2033-06-10
Also published as: JP6081298B2

Abstract

Problem: To automatically generate a definition body. Solution: A character recognition device includes an acquisition unit, an extraction unit, and a generation unit. The acquisition unit acquires images of one or more items from an image of a sheet. The extraction unit extracts, from the acquired item images, the images of items of a first size or larger. The generation unit generates a definition body that stores the type of the sheet from which the item images were acquired in association with the extracted item images. Selected drawing: FIG. 1

Description

The present invention relates to a technique for recognizing characters.

In recent years, character recognition devices with an OCR (Optical Character Reader) function have been used to recognize characters written on forms (hereinafter also called form character recognition). When such a device performs character recognition on a form, it uses, for example, a definition body that stores the positions and types of the characters written on the form.

When the character recognition device handles multiple types of forms, the definition body used for each form additionally stores the form type in association with a figure specific to that form. To recognize the characters on a form, the device acquires a figure from the input form image and searches the definition bodies for one that stores the same figure. The device thereby determines that the target form is of the type indicated by the retrieved definition body. It then performs character recognition on the form using the character positions, character types, and other information stored in that definition body.

As a related technique, when a character recognition unit recognizes characters written on a form and the rejection rate is very high, the image data is saved in an image data storage unit and statistics on the recognition results are compiled. When judged necessary, a format information generation unit automatically generates format information from the image data of the form. A technique is also known that re-registers font information to counter a drop in recognition rate caused by differing fonts.

As another related technique, a character recognition information storage means stores in advance character recognition information that includes the position of the target range on the form to be recognized. Similarly, a syntax rule information storage means stores syntax rule information that includes the character recognition items represented by the character strings in the target range and the rules governing those strings. A character recognition means receives the image information of a form read optically by an image scanner, extracts the image information of the target range based on the character recognition information read from the storage means, and performs character recognition. The recognition result is sent to a syntax analysis means, which analyzes the recognized character string of the target range based on the syntax rule information and associates it with a character recognition item.

As another related technique, partial image data that is input by an image input device and held in an image storage device is converted into ruled lines and character codes by image recognition processing and stored in a storage device. Format definition data is created from the ruled lines and character codes of each partial area and stored in a format storage device. Format data is generated per table from the input partial images and then combined to produce format definition data for the entire form.

As another related technique, there is an image identification device that includes a master image input unit, a data image input unit, a mask area input unit, an image matching unit, a difference extraction unit, and a difference degree output unit. The master image input unit inputs a master image as a first image. The data image input unit inputs a data image as a second image. The mask area input unit inputs a set of mask areas specified for the master image. The image matching unit aligns the master image with the data image. The difference extraction unit extracts the difference between the aligned master image and data image, excluding the mask areas. The difference degree output unit outputs the degree of difference between the master image and the data image based on the magnitude of the extracted difference.

As another related technique, a ruled-line feature list extracted from a form to be classified is matched against the ruled-line feature list of a reference form in a form format database; a correction amount detection means calculates a correction amount for the ruled-line positions, and a ruled-line feature correction means corrects the ruled-line positions of the reference form. A ruled-line feature collation means collates the feature list of the form to be classified with the corrected feature list of the reference form and obtains their similarity. The form is classified as having the same format as the reference form with the highest similarity. The correction amount is obtained from the correspondences of all combinations between the ruled-line positions of the form to be classified and those of the one reference form being compared (see, for example, Patent Documents 1 to 5).

Patent documents: JP H09-73500 A; JP 2004-199529 A; JP H05-67189 A; JP 2013-61764 A; JP 2003-109007 A

In the character recognition techniques described above, when the form type is determined from a specific figure, for example, the user selects a figure specific to each form from among the figures printed on it and stores the selected figure in advance in the definition body used for that form. Consequently, as the number of form types to be recognized increases, the work of creating definition bodies can become burdensome.

In one aspect, the present invention provides a technique for automatically generating a definition body.

One of the character recognition devices disclosed in this specification includes an acquisition unit, an extraction unit, and a generation unit. The acquisition unit acquires images of one or more items from an image of a sheet. The extraction unit extracts, from the acquired item images, the images of items of a first size or larger. The generation unit generates a definition body that stores the type of the sheet from which the item images were acquired in association with the extracted item images.

According to one embodiment, a definition body can be generated automatically.

FIG. 1 is a functional block diagram showing an example of the character recognition device.
FIGS. 2 and 3 are flowcharts showing the process of determining the form type.
FIGS. 4 to 9 are flowcharts showing the process of generating a definition body.
FIGS. 10 and 11 are diagrams showing an example of a form.
FIG. 12 is a diagram showing an example of form discrimination information.
FIG. 13 is a diagram showing an example of character recognition information.
FIG. 14 is a diagram showing an example of discrimination data.
FIG. 15 is a diagram showing an example of transaction data.
FIG. 16 is a diagram showing an example of discrimination data.
FIG. 17 is a diagram showing an example of item data.
FIGS. 18 and 19 are diagrams showing examples of extraction data.
FIG. 20 is a diagram showing an example of heading data.
FIGS. 21 and 22 are diagrams explaining the recognition area.
FIG. 23 is a block diagram showing an example of a computer device.

The character recognition device of the embodiment is described below.
FIG. 1 is a functional block diagram showing an example of the character recognition device.

The character recognition device 1 is described with reference to FIG. 1.
The character recognition device 1 includes a control unit 10, a storage unit 20, a reading unit 30, and a display unit 40. The character recognition device 1 is, for example, the computer device described later.

The control unit 10 provides the functions of an acquisition unit 11, an extraction unit 12, a generation unit 13, a recognition unit 14, and a determination unit 15.

The acquisition unit 11 acquires images of one or more items from an image of a sheet. A sheet is a piece of paper on which headings and data are written in association with each other, such as a form, an answer sheet, a health checkup chart, or a questionnaire. An item is, for example, a figure or a character string written on the sheet. In the following description, a character string means wording that contains one or more characters.

The extraction unit 12 extracts, from the acquired item images, the images of items of a first size or larger. The first size is, for example, a threshold used to extract item images that characterize the sheet. An item image of the first size or larger is, for example, an item image that represents a feature of the sheet. In the following description, such an item image is also called a feature image.

The first size may include both a vertical size and a horizontal size for the item image. In that case, the extraction unit 12 may extract, from the acquired item images, those whose vertical size is at least the vertical size included in the first size and whose horizontal size is at least the horizontal size included in the first size.
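The two-dimensional size test described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the implementation in the patent: the `Item` class, the function name `extract_feature_items`, the pixel unit, and all numeric values are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """A figure or character string found on the sheet (hypothetical model)."""
    name: str
    width: int   # horizontal size, assumed to be in pixels
    height: int  # vertical size, assumed to be in pixels


def extract_feature_items(items, min_width, min_height):
    """Keep items whose width AND height both reach the first-size thresholds."""
    return [it for it in items
            if it.width >= min_width and it.height >= min_height]


# Illustrative values only; the source gives no concrete sizes.
items = [Item("SH1", 120, 80), Item("SH5", 60, 40), Item("CH1", 200, 30)]
features = extract_feature_items(items, min_width=100, min_height=30)
```

Requiring both dimensions to pass keeps long but very thin items, or tall but very narrow ones, out of the feature set whenever either threshold is not met.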

When the acquired item images include images of one or more character strings, the extraction unit 12 extracts from them the images of character strings that have at least a first number of characters and contain characters of a predetermined character size or larger. The predetermined character size is, for example, a character-size threshold set in order to extract character string images that characterize the sheet. It may include both a vertical size and a horizontal size for a character, and it may include a first character size and a second character size smaller than the first. The first number of characters is, for example, a threshold used to extract character string images that characterize the sheet.

When the storage unit already holds a definition body that stores the same ruled-line layout as the sheet and the same item images as those extracted from the sheet using the first size, the extraction unit 12 extracts the images of items of a second size or larger, the second size being smaller than the first size. The second size is, for example, a threshold used to extract item images that characterize the sheet.

Likewise, when the storage unit already holds a definition body that stores the same ruled-line layout as the sheet and the same character string images as those extracted from the sheet using the first number of characters, the extraction unit 12 extracts the images of character strings with at least a second number of characters, the second number being smaller than the first. The second number of characters is, for example, a threshold used to extract character string images that characterize the sheet.
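The fallback from the first threshold to the smaller second threshold can be sketched as follows. This is a simplified illustration: sizes are reduced to a single number, feature images are represented by their names, and `stored_feature_sets` is a hypothetical stand-in for the definition bodies that already share the sheet's ruled-line layout.

```python
def select_by_threshold(sizes, threshold):
    """Return the names of items whose size is at least `threshold`."""
    return {name for name, size in sizes.items() if size >= threshold}


def extract_with_fallback(sizes, first_size, second_size, stored_feature_sets):
    """Use the first size; fall back to the smaller second size on a collision.

    `stored_feature_sets` stands in for definition bodies that already have
    the same ruled-line layout as this sheet (an assumption of this sketch).
    """
    selected = select_by_threshold(sizes, first_size)
    if selected in stored_feature_sets:
        # An existing definition body already holds the same features, so the
        # first size cannot tell the two sheets apart: widen the selection.
        selected = select_by_threshold(sizes, second_size)
    return selected


# Illustrative values only.
sizes = {"SH1": 120, "SH3": 110, "SH5": 70, "CH3": 65, "note": 20}
existing = [{"SH1", "SH3"}]
result = extract_with_fallback(sizes, first_size=100, second_size=60,
                               stored_feature_sets=existing)
```

With these sample values the first pass selects SH1 and SH3, which collide with an existing definition body, so the second pass also picks up SH5 and CH3.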

The generation unit 13 generates a definition body that stores the type of the sheet from which the item images were acquired in association with the extracted item images.

The generation unit 13 may also generate a definition body that stores, in association with one another, the type of the sheet from which the item images were acquired, the extracted item images, and the image area indicating the region in which each extracted item image appears.

When the storage unit holds no definition body that stores the same ruled-line layout as the sheet from which the item images were acquired, the generation unit 13 generates a definition body that stores the sheet type in association with the extracted item images.

When data of the kind corresponding to an item type lies near a character string recognized as indicating that item type, the generation unit 13 generates a definition body that stores the item type in association with a recognition area, that is, the region in which the data is written. An item type is, for example, the category of a heading on the sheet. When the sheet is a form, heading categories include bank name, branch name, deposit type, account number, amount, payee, and client. The kind of data refers to, for example, kanji, kana, alphabetic characters, or numerals. In the following description, a character string recognized as indicating an item type is also called heading wording, and the data corresponding to an item type is also called item data.

When the data is enclosed by ruled lines, the generation unit 13 uses the region enclosed by the ruled lines as the recognition area.

When the data is not enclosed by ruled lines, the generation unit 13 uses as the recognition area a region that surrounds the data and contains no other wording.
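The two cases for choosing a recognition area can be sketched as follows. The rectangle representation `(left, top, right, bottom)`, the helper names, and the choice of the smallest enclosing cell are assumptions made for this illustration; the source does not specify how the enclosing region is found.

```python
def contains(outer, inner):
    """True if rectangle `inner` lies inside rectangle `outer`.

    Rectangles are (left, top, right, bottom) tuples.
    """
    return (outer[0] <= inner[0] and outer[1] <= inner[1]
            and outer[2] >= inner[2] and outer[3] >= inner[3])


def recognition_area(data_box, ruled_cells):
    """Pick the recognition area for one piece of item data."""
    enclosing = [cell for cell in ruled_cells if contains(cell, data_box)]
    if enclosing:
        # Enclosed by ruled lines: use the smallest enclosing cell
        # (an assumption of this sketch).
        return min(enclosing, key=lambda c: (c[2] - c[0]) * (c[3] - c[1]))
    # Not enclosed: use the data's own bounding box, which surrounds the
    # data without taking in neighbouring wording.
    return data_box


# Illustrative coordinates only.
cells = [(0, 0, 100, 50), (0, 0, 300, 200)]
boxed = recognition_area((10, 10, 90, 40), cells)
unboxed = recognition_area((400, 400, 450, 420), cells)
```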

When the acquired item images include an image of a character string, the recognition unit 14 searches the heading information for the item type associated with heading wording identical to the wording of the character string, and recognizes that the character string indicates the retrieved item type.
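The lookup performed by the recognition unit can be pictured as a dictionary search from heading wording to item type. The table contents, the name `HEADING_INFO`, and the case-insensitive normalization below are illustrative assumptions, not the actual heading information of the device.

```python
# Hypothetical heading information: heading wording -> item type.
HEADING_INFO = {
    "bank name": "bank name",
    "branch name": "branch name",
    "deposit type": "type",
    "account number": "account number",
    "transfer amount": "amount",
    "payee": "payee",
    "client": "client",
}


def recognize_item_type(text, heading_info=HEADING_INFO):
    """Return the item type for a recognized string, or None if no heading matches."""
    # Normalizing whitespace and case is an assumption of this sketch.
    return heading_info.get(text.strip().lower())


amount_type = recognize_item_type("Transfer amount")
not_a_heading = recognize_item_type("1,000")
```

Several wordings can map to one item type, which is how differently worded headings on different sheets can be treated as the same kind of field.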

The determination unit 15 searches for a definition body that stores the same item image as an acquired item image, and determines that the sheet from which the item was acquired is of the sheet type stored in the retrieved definition body.

Alternatively, the determination unit 15 searches for a definition body that stores, in association with each other, the same item image as an acquired item image and the same image area as that of the acquired item, and determines that the sheet from which the item was acquired is of the sheet type stored in the retrieved definition body.
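Both search variants of the determination unit can be sketched as follows. The sketch makes several assumptions that the source does not state: feature images are compared through a stand-in key such as a hash, a definition body matches only when all of its stored features are found among the acquired items, and equality is exact. The names `Feature` and `determine_sheet_type` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    image_key: str  # stand-in for the feature image, e.g. a hash
    area: tuple     # image area as (left, top, right, bottom)


def _key(feature, match_area):
    """Comparison key: image only, or image together with its area."""
    return (feature.image_key, feature.area) if match_area else feature.image_key


def determine_sheet_type(acquired, definition_bodies, match_area=True):
    """Return the sheet type of the first definition body whose stored
    features all appear among the acquired items, or None if none matches."""
    seen = {_key(f, match_area) for f in acquired}
    for sheet_type, stored in definition_bodies.items():
        if stored and all(_key(f, match_area) in seen for f in stored):
            return sheet_type
    return None


# Illustrative data only.
definitions = {
    "form N": [Feature("SH1", (10, 10, 50, 40))],
    "form M": [Feature("SH9", (0, 0, 5, 5))],
}
acquired = [Feature("SH1", (10, 10, 50, 40)), Feature("CH1", (60, 10, 200, 30))]
moved = [Feature("SH1", (11, 10, 51, 40))]
```

Matching on the image area as well as the image distinguishes sheets that carry the same figure in different positions, at the cost of being sensitive to any shift in the scanned image.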

The storage unit 20 stores definition body information 21, transaction information 22, discrimination information 23, item information 24, extraction information 25, heading information 26, and setting information 27.

The definition body information 21 stores, for example, one definition body per type of sheet to be recognized; each definition body holds the information used to determine the sheet type and to recognize the characters on the sheet.

The transaction information 22 stores, for example, transaction data for each type of sheet to be recognized; the transaction data holds information about the data read from the sheet for each item type.

The discrimination information 23 stores, for example, discrimination data that holds information about the result of determining the sheet type.

The item information 24 stores, for example, item data for each type of sheet to be recognized; the item data holds information about the items that the acquisition unit 11 acquired from the sheet.

The extraction information 25 stores, for example, extraction data for each type of sheet to be recognized; the extraction data holds information indicating the items extracted by the extraction unit 12.

The heading information 26 stores, for example, heading data for each item type found on sheets; the heading data holds information about the heading wording used for that item type.

The setting information 27 stores settings such as the first size, the second size, the first character size, the second character size, the first number of characters, and the second number of characters.

The reading unit 30 acquires an image of the sheet. The reading unit 30 may be, for example, a scanner, and has a function of optically reading the image of the sheet.

The display unit 40 displays information input from the control unit 10.
The character recognition device 1 is now described in more detail.

The following description uses a form as an example of a sheet on which the character recognition device 1 performs character recognition. The character recognition device 1 is not limited to forms, however, and can be applied to character recognition on any kind of sheet on which headings and data are written in association with each other.

FIGS. 2 and 3 are flowcharts showing the process of determining the form type.
The process of determining the form type and the process of recognizing item data are described with reference to FIGS. 2 and 3.

In the following description, the storage unit 20 is assumed to hold the definition body information 21, the heading information 26, and the setting information 27 in advance. The form is assumed to have been placed in the reading unit 30, for example by the user, and the reading unit 30 is assumed to have read the image of the form 100 described later. The character recognition device 1 may recognize figures, character strings, and ruled lines using, for example, an OCR function. When assigning identifiers or names to data, the device may generate arbitrary ones using, for example, an algorithm based on random numbers or on a predetermined computation.

The description continues with reference to FIG. 2.
The reading unit 30 reads the image of the form 100 shown in FIG. 10 (S101) and outputs the image to the acquisition unit 11.

The form 100 read by the reading unit 30 is described with reference to FIG. 10.
FIG. 10 is a diagram showing an example of a form.

As shown in FIG. 10, the form 100 carries the heading wording "bank name", "branch name", "deposit type", "account number", "transfer amount", "payee", and "client". These headings indicate the item types bank name, branch name, type, account number, amount, payee, and client, respectively. The form 100 also contains the figures SH1 and SH3 and the character strings CH1 and CH2 as items of the first size or larger. In addition, it contains the figure SH5 and the character string CH3 as items that are smaller than the first size but at least the second size.

The description returns to FIG. 2.
When the image of the form 100 is input from the reading unit 30, the acquisition unit 11 acquires the layout of the ruled lines contained in the image (S102) and outputs the acquired layout to the determination unit 15.

The process by which the acquisition unit 11 acquires the ruled-line layout is described with reference to FIG. 11.
FIG. 11 is a diagram showing an example of a form; it is an enlarged view of part of the form 100 shown in FIG. 10. The following description covers the acquisition of the layout of the ruled line L1. The acquisition unit 11 acquires the layout of the other ruled lines (for example, the ruled lines L2 to L7 shown in FIG. 11) in the same way. The method by which the acquisition unit 11 acquires the ruled-line layout is not limited to the one described below.

The acquisition unit 11 extracts the ruled line L1 from the image of the form 100 using, for example, an OCR function. It then acquires the ruled-line coordinates (A1, B1)-(A2, B1) as the layout of the ruled line L1. These coordinates indicate that the ruled line L1 is the straight line connecting the points (A1, B1) and (A2, B1) in the coordinate system set on the form 100. The point (A1, B1) is, for example, the start point of the ruled line L1, and the point (A2, B1) is, for example, its end point.

The description returns to FIG. 2.
When the ruled-line layout of the form 100 is input from the acquisition unit 11, the determination unit 15 refers to the definition bodies stored in the definition body information 21 and determines whether any of them stores the same ruled-line layout as the form 100 (hereinafter also called a definition body with matching ruled lines) (S103). For example, the determination unit 15 may receive from the acquisition unit 11 the ruled-line coordinates of each ruled line on the form 100 and compare them with the ruled-line coordinates of each stored definition body to decide whether a definition body with matching ruled lines exists.
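The match test of step S103 can be sketched as a comparison of coordinate sets. This is a minimal illustration assuming exact equality of coordinates and ignoring line direction; a real scanner would likely need a positional tolerance, which the source does not discuss. The function names and sample coordinates are hypothetical.

```python
def normalize(line):
    """Order the two endpoints so that direction does not affect comparison."""
    start, end = line
    return (start, end) if start <= end else (end, start)


def find_matching_definition(sheet_lines, definitions):
    """Return the identifier of a definition body whose ruled lines equal
    the sheet's ruled lines, or None if there is no such definition body."""
    target = {normalize(line) for line in sheet_lines}
    for form_id, lines in definitions.items():
        if {normalize(line) for line in lines} == target:
            return form_id
    return None


# Each ruled line is ((x1, y1), (x2, y2)); values are illustrative only.
definitions = {"form N": [((10, 20), (200, 20)), ((10, 20), (10, 300))]}
sheet = [((10, 300), (10, 20)), ((10, 20), (200, 20))]
match = find_matching_definition(sheet, definitions)
```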

The information stored in a definition body is described with reference to FIGS. 12 and 13.
FIG. 12 is a diagram showing an example of form discrimination information. FIG. 13 is a diagram showing an example of character recognition information. A definition body stores the form discrimination information 200 shown in FIG. 12 and the character recognition information 201 shown in FIG. 13. The following description takes the information stored in the definition body for the form 100 as an example. The information stored in a definition body is not limited to the form discrimination information 200 and the character recognition information 201; it is sufficient that the definition body stores the information the character recognition device 1 uses to determine the sheet type and to recognize the characters on the sheet. The character recognition device 1 may also store definition bodies of the same format for the other forms it recognizes.

As shown in FIG. 12, the form discrimination information 200 stores a form type, ruled line information, and feature information in association with each other.

The form type stores a form identifier indicating the type of the form 100. "Form N" is the form identifier indicating the type of the form 100.

The ruled line information stores ruled line identifiers and ruled line coordinates in association with each other. A ruled line identifier is information that identifies each ruled line described in the form 100. The ruled line identifier may be assigned to each record when the character recognition device 1 generates the ruled line information.

The feature information stores a feature identifier, an image area, and a feature image in association with each other.
The feature identifier is information that identifies each feature image described in the form 100. The feature identifier may be assigned to each record when the character recognition device 1 generates the feature information.

The image area indicates the area of the form 100 in which a feature image is described. The image area (C3, D3)-(C4, D4) corresponding to the feature identifier SH1 indicates, for example, as shown in FIG. 11, the upper left coordinates (C3, D3) and the lower right coordinates (C4, D4) of the rectangle surrounding the figure SH1. The image area (C3, D3)-(C4, D4) is used as information indicating the rectangle whose diagonal is the line connecting the upper left coordinates (C3, D3) and the lower right coordinates (C4, D4). The rectangle indicated by the image area may be, for example, the smallest rectangle that surrounds the feature image.
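As a small illustration of an image area given by its upper left and lower right corners, the sketch below computes the smallest rectangle surrounding a set of pixel positions. The `Region` class and the `bounding_region` helper are hypothetical names, and the coordinate system (origin at the upper left, y increasing downward) is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    left: int    # x of the upper left corner, e.g. C3
    top: int     # y of the upper left corner, e.g. D3
    right: int   # x of the lower right corner, e.g. C4
    bottom: int  # y of the lower right corner, e.g. D4

    @property
    def width(self):
        return self.right - self.left

    @property
    def height(self):
        return self.bottom - self.top


def bounding_region(points):
    """Return the smallest axis-aligned rectangle that contains all (x, y) points."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return Region(min(xs), min(ys), max(xs), max(ys))


sh1 = bounding_region([(12, 30), (40, 22), (25, 48)])
```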

The feature image is image data of an item unique to the form 100. The feature image IM2 is, for example, image data of the graphic 1 cut out from the area indicated by the image area (C3, D3)-(C4, D4).

As shown in FIG. 13, the character recognition information 201 stores an item type, a recognition area, a data type, and a maximum number of digits in association with the form type.

The recognition area is information indicating the area of the form 100 in which the item data corresponding to an item type is described. As shown in FIG. 11, the recognition area (G1, H1)-(G2, H2) corresponding to the bank name (item type) indicates the upper left coordinates (G1, H1) and the lower right coordinates (G2, H2) of the rectangle surrounding "Minami Tama", the item data corresponding to the bank name. The recognition area (G1, H1)-(G2, H2) is thus used as information indicating the rectangle whose diagonal is the line connecting the upper left coordinates (G1, H1) and the lower right coordinates (G2, H2). The rectangle indicated by the recognition area may be, for example, the smallest rectangle that surrounds the item data.

The maximum number of digits is information indicating the maximum number of characters of the item data. When performing character recognition on the form 100, the recognition unit 14 does not, for example, recognize as item data any data that has more characters than the maximum number of digits associated with the item type.

The description continues with reference to FIG. 2.
When no definition body storing the same ruled line arrangement as the form 100 is stored in the definition body information 21 (No in S103), the determination unit 15 determines that the form 100 is a new form for which there is no definition body with matching ruled lines (S104). The determination unit 15 then executes the process of S206 described later.

The description continues with reference to FIG. 3.
When a definition body storing the same ruled line arrangement as the form 100 (hereinafter also referred to as a definition body with matching ruled lines) is stored in the definition body information 21 (Yes in S103), the determination unit 15 acquires the image area of each feature image stored in the definition body with matching ruled lines (S201).

The determination unit 15 acquires, from the form 100, the image of each item described in the image areas acquired in S201 (S202). Even if an item image described on the page and a feature image stored in the definition body are the same image, the determination unit 15 does not acquire the image of that item from the page when their image areas differ. When an item image described on the page and a feature image stored in the definition body are the same image and their image areas are also the same, the determination unit 15 acquires the image of that item from the page.

The determination unit 15 then determines whether the image of each item acquired in S202 matches the corresponding feature image stored in the definition body (hereinafter also referred to as the feature images matching) (S203). In other words, by executing S201 to S203, the determination unit 15 determines whether there is a definition body that stores, in association with each other, a feature image identical to the acquired item image and a feature image area identical to the image area of the acquired item.

When the images of the items acquired in S202 and the feature images stored in the definition body do not all match (No in S203), the determination unit 15 determines that the form 100 is a new form for which a definition body with matching ruled lines exists but whose feature images do not match that definition body (S204). The determination unit 15 then executes the process of S206 described later.

When the images of the items acquired in S202 and the feature images stored in the definition body all match (Yes in S203), the determination unit 15 determines that the form 100 is an existing form (S205).

When the determination unit 15 determines in S103 that a plurality of definition bodies whose ruled lines match those of the form 100 are stored in the definition body information 21, it may execute the processes of S201 to S205 for each definition body with matching ruled lines.
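The determination flow from S103 to S205 can be summarized in a short sketch. The dictionary layout, the comparison of images by simple equality, and the choice of the first candidate as the ruled line matching form are assumptions made for illustration; the result labels follow the three determination results used in this description.

```python
def classify_form(form_lines, form_items, definitions):
    """Classify a form as existing or new.

    form_items maps an image area to the item image found at that area on the form.
    Returns (determination result, form identifier of the matching definition or None).
    """
    # S103: look for definition bodies with the same ruled line arrangement.
    candidates = [d for d in definitions if d["ruled_lines"] == form_lines]
    if not candidates:
        return "new: ruled line mismatch", None                 # S104
    for d in candidates:
        # S201 to S203: an item only counts when it sits at the stored image area.
        if all(form_items.get(area) == image
               for area, image in d["features"].items()):
            return "existing", d["form_type"]                   # S205
    # S204: ruled lines matched but the feature images did not (first candidate assumed).
    return "new: feature mismatch", candidates[0]["form_type"]


defs = [{
    "form_type": "form A",
    "ruled_lines": [(0, 0, 10, 0)],
    "features": {(1, 1, 5, 5): b"logo"},
}]
same = classify_form([(0, 0, 10, 0)], {(1, 1, 5, 5): b"logo"}, defs)
moved = classify_form([(0, 0, 10, 0)], {(2, 2, 6, 6): b"logo"}, defs)
other = classify_form([(0, 0, 9, 0)], {}, defs)
```

The `moved` case shows the rule stated above: the same image at a different image area does not count as a match.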

The determination unit 15 then stores the form identifier (form N) of the form 100, the image data (SPn) of the form 100, and the determination result for the type of the form 100 (new: ruled line mismatch) in the discrimination data in association with each other (S206). At this time, the determination unit 15 may generate an arbitrary form identifier using random numbers and store it in association with the image of the form 100.

Information stored in the discrimination data will be described with reference to FIG. 14.
FIG. 14 is a diagram illustrating an example of the discrimination data.

As shown in FIG. 14, the discrimination data 300 stores a form identifier, a form image, a determination result, a transaction identifier, and a ruled line matching form in association with each other.

The form image stores the image data of the form, that is, the image data of the form read by the reading unit 30.

The determination result is information indicating the result of determining the form type. "Existing" indicates that the definition body information 21 stores a definition body whose ruled lines, feature images, and feature image areas match the ruled lines, item images, and item image areas described in the form. "New: ruled line mismatch" indicates that the definition body information 21 stores no definition body whose ruled lines match those described in the form. "New: feature mismatch" indicates that the definition body information 21 stores a definition body whose ruled lines match those described in the form, but that no item matching the feature image is described in the area of the form corresponding to the image area of the feature image stored in that definition body.

The transaction identifier is an identifier used to search the transaction information 22 for the transaction data corresponding to a form. When generating the discrimination data 300, the character recognition device 1 may assign to each record the transaction identifier indicating the corresponding transaction data.

The ruled line matching form is the form identifier of a definition body whose ruled lines match those of the form indicated by the form identifier.

The description continues with reference to FIG. 3.
The recognition unit 14 acquires transaction data from the image of the form 100 (S207).

Information stored in the transaction data will be described with reference to FIG. 15.
FIG. 15 is a diagram illustrating an example of the transaction data.

As shown in FIG. 15, the transaction data 400 stores a form type, a transaction identifier, an item type, and item data in association with each other. As an example, the transaction data 400 shows the transaction result corresponding to the contents of the form 100.

The transaction data 400 stores a form type, a transaction identifier, an item type, and item data.

The transaction identifier is information for identifying the transaction data of each form. The transaction identifier may be assigned to each record when the character recognition device 1 generates the transaction data.

However, the information stored in the transaction data 400 is not limited to the form type, the transaction identifier, the item type, and the item data; it suffices to store information on the transaction result that the character recognition device 1 has recognized from the page. The character recognition device 1 may also store transaction data of the same format for other forms subject to character recognition.

The process in which the recognition unit 14 acquires the transaction data 400 in S207 will be described with reference to FIGS. 13 and 15.

The following description uses, as an example, the information stored in the transaction data 400 corresponding to the form 100.

The recognition unit 14 acquires each recognition area stored in the character recognition information 201 shown in FIG. 13. The recognition unit 14 then searches the image of the form 100, acquires the item data described at the position indicated by each recognition area, and stores each piece of item data in the transaction data 400 in association with its item type.

When some item data cannot be acquired from the image of the form 100, for example because the form 100 is dirty, the recognition unit 14 may display the image of the form 100 on the display unit 40 and prompt the user to enter values into the transaction data 400. The user may then store in the transaction data 400 the item data of the item types that the recognition unit 14 could not acquire, while referring to the image of the form 100 displayed on the display unit 40.

Further, when the definition body information 21 does not store a definition body containing the character recognition information 201 corresponding to the form 100, the recognition unit 14 may display the image of the form 100 on the display unit 40 and prompt the user to enter values into the transaction data 400. The user may then enter the item data corresponding to each item type into the transaction data 400 while referring to the image of the form 100 displayed on the display unit 40.

When recognizing each piece of item data, the recognition unit 14 may refer to the data type associated with the item type in the character recognition information 201 and perform character recognition using a character recognition algorithm suited to the type of data to be recognized. Further, when recognizing each piece of item data, the recognition unit 14 may refer to the maximum number of digits associated with the item type in the character recognition information 201 and recognize only data whose number of characters is equal to or less than the maximum number of digits. In this way, the recognition unit 14 can improve the accuracy of character recognition.
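The way the recognition unit 14 might combine the recognition area, data type, and maximum number of digits can be sketched as below. The function name, the dictionary keys, the reader callables standing in for per-data-type recognition algorithms, and the "account number" item type are all hypothetical; a value of `None` stands for an item left to manual entry by the user.

```python
def recognize_items(page_image, recognition_info, readers):
    """Build item data for one form from its character recognition information.

    readers maps a data type to a callable (page_image, area) -> str or None,
    standing in for a character recognition algorithm suited to that data type.
    """
    transaction = {}
    for entry in recognition_info:
        read = readers[entry["data_type"]]          # algorithm chosen by data type
        text = read(page_image, entry["area"])      # read the recognition area
        if text is not None and len(text) <= entry["max_digits"]:
            transaction[entry["item_type"]] = text
        else:
            transaction[entry["item_type"]] = None  # unreadable or too long
    return transaction


# Stand-in "page": a mapping from area to the text written there.
page = {(0, 0, 10, 5): "Minami Tama", (0, 6, 10, 10): "12345678"}
readers = {
    "text": lambda img, area: img.get(area),
    "numeric": lambda img, area: img.get(area),
}
info = [
    {"item_type": "bank name", "area": (0, 0, 10, 5),
     "data_type": "text", "max_digits": 20},
    {"item_type": "account number", "area": (0, 6, 10, 10),
     "data_type": "numeric", "max_digits": 7},
]
result = recognize_items(page, info, readers)
```

In this example the eight-character account number exceeds its maximum of seven digits, so it is not accepted as item data.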

The description continues with reference to FIG. 3.
The recognition unit 14 stores the transaction identifier (TRn) of the transaction data 400 in the discrimination data 301 in association with the form identifier (form N) of the form 100 (S208). As a result, the recognition unit 14 generates a record corresponding to the determination result of the form 100, as shown in the discrimination data 301 of FIG. 16. FIG. 16 shows the discrimination data 301 including the record that, through the processes of S206 and S208, stores the form identifier (form N), the form image (SPn), the determination result (new: ruled line mismatch), and the transaction identifier (TRn) corresponding to the determination result of the form 100.

Through the above, the character recognition device 1 executes the process of determining the type of the form 100 and the process of recognizing each piece of item data described in the form 100.

FIGS. 4 to 9 are flowcharts showing the process of generating a definition body.
FIGS. 4 to 6 are flowcharts showing the process of generating the form discrimination information 200. FIGS. 7 to 9 are flowcharts showing the process of generating the character recognition information 201.

The process of generating the form discrimination information 200 will be described with reference to FIGS. 4 to 6.
In the following description, it is assumed that the character recognition device 1 has executed the form type determination process described with reference to FIGS. 2 and 3 and that the discrimination data 301 containing the determination result of each form subject to character recognition has been generated. The following description also uses the process of generating the definition body of the form 100 as an example. The character recognition device 1 may generate form discrimination information for other forms as well by executing the process described below.

The acquisition unit 11 acquires a record from the discrimination data 301 (S301). Here, it is assumed that the acquisition unit 11 has acquired the record corresponding to form N.

The acquisition unit 11 acquires the form image included in the record acquired in S301 (S302).

The acquisition unit 11 then acquires the images of the items included in the form image acquired in S302 and stores them in the item data (S303). For example, when the form from which item images are acquired is the form 100, the acquisition unit 11 may use the OCR function to acquire item images by cutting out the image areas of the figures and character strings described in the form 100 shown in FIG. 10.

FIG. 17 is a diagram illustrating an example of the item data.
As an example, the item data 500 shows the item data corresponding to the form 100.

As shown in FIG. 17, the item data 500 stores a form type, an item identifier, an item name, an image area, and an item image in association with each other.

The item identifier is information that identifies each item described in the form 100. The item identifier may be assigned to each record when the character recognition device 1 generates the item data.

The item name is information indicating the name of each item described in the form 100. For example, when the item is a character string, the character string recognized by the recognition unit 14 may be stored as the item name.

The image area indicates the area of the form 100 in which an item is described. The image area (C3, D3)-(C4, D4) corresponding to the item identifier SH1 indicates, for example, as shown in FIG. 11, the upper left coordinates (C3, D3) and the lower right coordinates (C4, D4) of the rectangle surrounding the figure SH1. The image area (C3, D3)-(C4, D4) is thus used as information indicating the rectangle whose diagonal is the line connecting the upper left coordinates (C3, D3) and the lower right coordinates (C4, D4). The rectangle indicated by the image area may be, for example, the smallest rectangle that surrounds the item.

The item image is the image data of an item described in the form 100. The item image IM2 is, for example, image data of the graphic 1 cut out from the area indicated by the image area (C3, D3)-(C4, D4).

The description continues with reference to FIG. 4.
The acquisition unit 11 determines whether the determination result included in the record acquired in S301 is "new" (S304).

When the determination result included in the record acquired in S301 is not "new" (that is, it is "existing") (No in S304), the acquisition unit 11 executes the process of S601 described later.

When the determination result included in the record acquired in S301 is "new" (Yes in S304), the acquisition unit 11 determines whether that determination result is a ruled line mismatch (S305).

When the determination result included in the record acquired in S301 is not a ruled line mismatch (No in S305), the acquisition unit 11 refers to the definition body corresponding to the form whose ruled lines matched and acquires the ruled line coordinates (S306). The acquisition unit 11 then outputs the acquired ruled line coordinates to the generation unit 13. The generation unit 13 executes the process of S308 described later.

When the determination result included in the record acquired in S301 is a ruled line mismatch (Yes in S305), the acquisition unit 11 acquires the ruled line coordinates included in the form image (S307). The acquisition unit 11 then outputs the acquired ruled line coordinates to the generation unit 13. At this time, for example, when the form is the form 100, the acquisition unit 11 may use the OCR function to acquire the ruled line coordinates described in the form 100 shown in FIG. 11. Because the form 100 (form identifier: form N) has no ruled line matching form, as shown in the discrimination data 301 of FIG. 16, it falls under the case of Yes in S305.

When the ruled line coordinates are input, the generation unit 13 generates a new definition body whose ruled line information stores ruled line identifiers in association with the acquired ruled line coordinates (S308). For example, when generating the definition body of the form 100, the generation unit 13 may store in the definition body the information shown under the ruled line identifiers and ruled line coordinates of the form discrimination information 200 in FIG. 12. The form identifier included in the record acquired in S301 may be stored as the form type of the definition body. In the case of the form 100, for example, form N may be stored as the form type.

The generation unit 13 stores the generated new definition body in the definition body information 21 (S309).
The description continues with reference to FIG. 5.

The extraction unit 12 acquires the first size from the setting information 27. The extraction unit 12 then searches the image areas stored in the item data 500 and extracts the image of an item whose height is equal to or greater than the height included in the first size and whose width is equal to or greater than the width included in the first size (S401). When an item stored in the item data 500 is a character string, the extraction unit 12 may instead use the first character size and extract the image of an item whose height is equal to or greater than the height included in the first character size and whose width is equal to or greater than the width included in the first character size.

The extraction unit 12 then determines whether the extracted item image is an image of a character string (S402).

When the extracted item image is not an image of a character string (that is, it is a figure) (No in S402), the extraction unit 12 executes the process of S404 described later.

When the extracted item image is an image of a character string (Yes in S402), the extraction unit 12 determines whether the number of characters included in the character string image is equal to or greater than the first number of characters (S403).

When the number of characters included in the character string image is less than the first number of characters (No in S403), the extraction unit 12 executes the process of S406 described later.

When the number of characters included in the character string image is equal to or greater than the first number of characters (Yes in S403), the extraction unit 12 executes the process of S404. In other words, when the item image extracted in S401 is an image of a character string, the extraction unit 12 extracts the character string image as a feature image candidate if the number of characters in the string is equal to or greater than the first number of characters.

Further, the generation unit 13 acquires from the item data the record containing the extracted item (S404).

The generation unit 13 then stores the record acquired in S404 in the extracted data (S405).

FIG. 18 is a diagram illustrating an example of the extracted data.
As shown in FIG. 18, the extracted data 600 stores an item identifier, an item name, an image area, and an item image in association with each other. The extracted data 600 is data corresponding to the form 100. The extracted data 600 stores the result of extracting, from the images of the items described in the form 100, the images of items of the first size or larger. In other words, the extracted data 600 stores the feature image candidates of the form 100.

The extraction unit 12 then determines whether all the image areas stored in the item data have been searched (S406). In other words, the extraction unit 12 determines whether all items of the first size or larger have been extracted.

When not all the image areas stored in the item data have been searched (No in S406), the extraction unit 12 executes the process of S401.

When all the image areas stored in the item data have been searched (Yes in S406), the extraction unit 12 executes the process of S501. In other words, the extraction unit 12 executes the process of S501 once it has extracted from the form the images of all items that are feature image candidates.
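The loop over S401 to S406 amounts to filtering the item data by size and, for character strings, by character count. The following is a minimal sketch under several assumptions: sizes are (width, height) pairs, an item is a dictionary whose `text` is `None` for figures, and the optional first character size is always used for character strings. All names are hypothetical.

```python
def extract_candidates(items, min_size, min_char_size, min_chars):
    """Return the items that qualify as feature image candidates."""
    extracted = []
    for item in items:                              # repeated until S406 answers Yes
        left, top, right, bottom = item["area"]
        width, height = right - left, bottom - top
        is_text = item["text"] is not None
        need_w, need_h = min_char_size if is_text else min_size
        if width < need_w or height < need_h:       # S401: size threshold
            continue
        if is_text and len(item["text"]) < min_chars:   # S402 and S403
            continue
        extracted.append(item)                      # S404 and S405
    return extracted


items = [
    {"name": "logo", "area": (0, 0, 50, 40), "text": None},
    {"name": "mark", "area": (0, 0, 10, 10), "text": None},
    {"name": "title", "area": (0, 0, 80, 12), "text": "Transfer Request"},
    {"name": "label", "area": (0, 0, 25, 12), "text": "No."},
]
picked = extract_candidates(items, min_size=(30, 30),
                            min_char_size=(20, 10), min_chars=4)
names = [item["name"] for item in picked]
```

Here the small mark fails the size threshold and the short label fails the character count, so only the logo and the title remain as candidates.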

The description continues with reference to FIG. 6.
The generation unit 13 determines whether the determination result included in the record acquired in S301 is a ruled line mismatch (S501).

When the determination result included in the record acquired in S301 is a ruled line mismatch (Yes in S501), the generation unit 13 stores each record of the extracted data in the new definition body (S502). For example, in the definition body of the form 100 generated in S308, the generation unit 13 stores the item identifier, image area, and item image of the extracted data 600 shown in FIG. 18 as the item identifier, image area, and feature image, respectively, of the form discrimination information 200 shown in FIG. 12. In other words, by executing S308 and S502, the generation unit 13 generates the form discrimination information 200 for the definition body of the form 100. The generation unit 13 then executes the process of S601 described later.

When the determination result included in the record acquired in S301 is a ruled line match (No in S501), the generation unit 13 determines whether the image area stored in the definition body corresponding to the ruled line matching form matches the image area of the extracted item (S503). At this time, the generation unit 13 may set a predetermined tolerance in consideration of, for example, reading errors of the reading unit 30, and may determine that the image area stored in the definition body corresponding to the ruled line matching form matches the image area of the extracted item if the difference is within that tolerance. The definition body corresponding to the ruled line matching form is, for example, the definition body corresponding to the form identifier stored as the ruled line matching form in the record acquired in S301.
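A tolerance-based comparison of two image areas, as allowed for in S503, might look like the following sketch. The function name and the choice of a per-coordinate tolerance in pixels are assumptions; the description only states that a predetermined error may be set.

```python
def regions_match(a, b, tolerance=0):
    """Compare two (left, top, right, bottom) areas, allowing a per-coordinate error."""
    return all(abs(p - q) <= tolerance for p, q in zip(a, b))


stored = (10, 20, 110, 60)    # image area from the definition body
scanned = (12, 19, 109, 61)   # image area of the extracted item
within = regions_match(stored, scanned, tolerance=2)
strict = regions_match(stored, scanned, tolerance=1)
```

With a tolerance of 2 the two areas are treated as the same; with a tolerance of 1 the left edge differs too much and they are not.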

When the image area stored in the definition body corresponding to the ruled line matching form does not match the image area of the extracted item (No in S503), the generation unit 13 executes the process of S502. In this case, in S502, the generation unit 13 generates a new definition body that stores, in association with each other, the ruled line information stored in the definition body corresponding to the ruled line matching form and feature information that stores the extracted data generated in the processes of S401 to S406.

When the image area stored in the definition body corresponding to the ruled line matching form matches the image area of the extracted item (Yes in S503), the generation unit 13 determines whether the feature image stored in the definition body corresponding to the ruled line matching form matches the image of the extracted item (S504).

When the feature image stored in the definition body corresponding to the ruled line matching form does not match the image of the extracted item (No in S504), the generation unit 13 executes the process of S502. In this case, in S502, the generation unit 13 generates a new definition body that stores, in association with each other, the ruled line information stored in the definition body corresponding to the ruled line matching form and feature information that stores the extracted data generated in the processes of S401 to S406.

When the feature image stored in the definition body corresponding to the ruled line matching form matches the image of the extracted item (Yes in S504), the generation unit 13 determines whether an unused size threshold and an unused character count threshold are stored in the setting information 27 (S505). Here, the unused size threshold is, for example, a second size that is stored in the setting information 27 and is smaller than the first size, when the first size was used in S401. Likewise, the unused character count threshold is, for example, a second number of characters that is stored in the setting information 27 and is smaller than the first number of characters, when the first number of characters was used in S403.

生成部１３は、未使用のサイズの閾値と、未使用の文字数の閾値とが設定情報２７に格納されていないとき（Ｓ５０５にてＮｏ）、後述するＳ６０１の処理を実行する。このとき、生成部１３は、定義体を生成する処理をエラーとして終了しても良い。また、生成部１３は、表示部４０にエラー情報と、帳票の画像を表示させ、ユーザに対して手入力による帳票判別情報２００の生成を促しても良い。そして、生成部１３は、ユーザが帳票判別情報２００を生成したあと、後述するＳ６０１の処理を実行しても良い。 When the unused size threshold value and the unused character count threshold value are not stored in the setting information 27 (No in S505), the generation unit 13 executes the process of S601 described later. At this time, the generation unit 13 may end the process of generating the definition body as an error. The generation unit 13 may display error information and a form image on the display unit 40 and prompt the user to generate the form determination information 200 by manual input. Then, after the user generates the form determination information 200, the generation unit 13 may execute the process of S601 described later.

When an unused size threshold and an unused character count threshold are stored in the setting information 27 (Yes in S505), the generation unit 13 changes the size threshold and the character count threshold and executes the processes of S401 to S406 again (S506).
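The branching of S501 to S506 can be summarized in a short sketch. The `Action` enumeration, the `next_action` function, and the boolean inputs are hypothetical names introduced for illustration; they stand in for the determinations that the generation unit 13 makes at each step.

```python
from enum import Enum


class Action(Enum):
    NEW_DEFINITION = "S502"        # store the extracted data in a new definition body
    RETRY_WITH_THRESHOLDS = "S506"  # lower the thresholds and rerun S401 to S406
    PROCEED = "S601"                # go on to generating character recognition information


def next_action(ruled_lines_match: bool, areas_match: bool,
                images_match: bool, unused_thresholds: bool) -> Action:
    """Mirror the decisions S501, S503, S504 and S505 in order."""
    if not ruled_lines_match:   # S501: ruled line mismatch
        return Action.NEW_DEFINITION
    if not areas_match:         # S503: image areas differ
        return Action.NEW_DEFINITION
    if not images_match:        # S504: feature images differ
        return Action.NEW_DEFINITION
    if unused_thresholds:       # S505: smaller thresholds are still available
        return Action.RETRY_WITH_THRESHOLDS
    return Action.PROCEED       # S505: No (optionally treated as an error)


print(next_action(True, True, True, True).value)
```

The retry branch is reached only when the extracted features are indistinguishable from those of an existing form, which is the case in which extracting smaller items may reveal a distinguishing feature.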

FIG. 19 is a diagram illustrating an example of extracted data.
With reference to FIG. 10, FIG. 18, and FIG. 19, the extracted data obtained when the size threshold and the character count threshold are changed in S506 will be described. In the following description, it is assumed that the generation unit 13 has changed the size threshold from the first size to the second size and the character count threshold from the first number of characters to the second number of characters. The first number of characters is assumed to be, for example, five, and the second number of characters is assumed to be, for example, two.

The extracted data 600 shown in FIG. 18 is the data obtained when the extraction unit 12 extracts item images from the form 100 using the first size and the first number of characters. It stores the figures SH1 and SH3, which the extraction unit 12 extracted because they are at least the first size. It also stores the image of the character string 払込取扱表 (CH2, "payment handling slip"), which the extraction unit 12 extracted because it is at least the first size and has at least the first number of characters (5 characters).

The extracted data 600 shown in FIG. 19 is the data obtained when the extraction unit 12 extracts item images from the form 100 using the second size and the second number of characters. It stores the figures SH1, SH3, and SH5, which the extraction unit 12 extracted because they are at least the second size. It also stores the images of the character strings 東京 (CH1, "Tokyo") and 払込取扱表 (CH2, "payment handling slip"), which the extraction unit 12 extracted because they are at least the second size and have at least the second number of characters (2 characters).
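The difference between FIG. 18 and FIG. 19 can be reproduced with a small filter. The sketch below is illustrative only: the `Item` type, the `extract` function, the pixel dimensions, and the threshold values 80 and 40 are all assumptions chosen so that the result matches the items named in the description; the actual sizes are not given there.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Item:
    name: str
    width: int                  # hypothetical pixel width
    height: int                 # hypothetical pixel height
    text: Optional[str] = None  # None for figures, the string for character items


def extract(items: List[Item], min_size: int, min_chars: int) -> List[str]:
    """Keep items whose width and height both reach min_size and,
    for character strings, whose length reaches min_chars."""
    kept = []
    for item in items:
        if item.width < min_size or item.height < min_size:
            continue
        if item.text is not None and len(item.text) < min_chars:
            continue
        kept.append(item.name)
    return kept


FORM_100_ITEMS = [
    Item("SH1", 120, 120),
    Item("SH3", 100, 110),
    Item("SH5", 50, 60),
    Item("CH1", 90, 45, "東京"),        # 2 characters
    Item("CH2", 200, 80, "払込取扱表"),  # 5 characters
]

print(extract(FORM_100_ITEMS, min_size=80, min_chars=5))  # first size, first character count
print(extract(FORM_100_ITEMS, min_size=40, min_chars=2))  # second size, second character count
```

Under these assumed dimensions the first call keeps SH1, SH3, and CH2, and the second call additionally keeps SH5 and CH1.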

Note that when at least one of an unused size threshold and an unused character count threshold is stored in the setting information 27 in S505, the generation unit 13 may change at least one of the size threshold and the character count threshold in S506. In addition, when a second character size that is smaller than the first character size is stored in the setting information 27, the generation unit 13 may change the first character size to the second character size in S506.

With the above, the character recognition device 1 ends the process of generating the form determination information 200. Subsequently, the character recognition device 1 executes the process of generating the character recognition information 201.

The process of generating the character recognition information 201 will be described with reference to FIGS. 7 to 9.
The description begins with FIG. 7. In the following, generation of the character recognition information 201 of the form 100 is described as an example. Note that the character recognition device 1 may also generate character recognition information for other forms by executing the processing described below.

生成部１３は、アイテムデータ５００から文字列（アイテム名）を取得する（Ｓ６０１）。生成部１３は、後述するＳ６０４の処理に続いて、Ｓ６０１の処理を実行するとき、未取得の文字列をアイテムデータ５００から取得しても良い。 The generation unit 13 acquires a character string (item name) from the item data 500 (S601). The generation unit 13 may acquire an unacquired character string from the item data 500 when executing the process of S601 following the process of S604 described later.

そして、生成部１３は、見出しデータ７００を検索し、Ｓ６０１で取得した文字列と同じ見出し文言があるか否かを判定する（Ｓ６０２）。 Then, the generation unit 13 searches the heading data 700 and determines whether or not there is the same heading wording as the character string acquired in S601 (S602).

FIG. 20 is a diagram illustrating an example of heading data.
The heading data 700 stores a heading identifier, an item type, a heading wording, a data type, and a maximum number of digits.

The heading identifier is information for identifying each heading.
When the generation unit 13 searches the heading data 700 and finds a heading wording identical to the character string acquired in S601 (Yes in S602), it recognizes that the character string is a heading indicating the item type associated with that heading wording (S603). The generation unit 13 then stores the character string in the setting information 27 as the heading wording indicating the item type. As a result, the setting information 27 stores each item type in association with its corresponding heading wording. The generation unit 13 then executes the process of S604.

When the generation unit 13 searches the heading data 700 and finds no heading wording identical to the character string acquired in S601 (No in S602), it executes the process of S604.

生成部１３は、Ｓ６０１において、アイテムデータ５００の全ての文字列を取得したか否かを判定する（Ｓ６０４）。 The generation unit 13 determines whether all the character strings of the item data 500 have been acquired in S601 (S604).

生成部１３は、Ｓ６０１において、アイテムデータ５００の全ての文字列を取得していないとき（Ｓ６０４にてＮｏ）、Ｓ６０１の処理を実行する。 When the generation unit 13 has not acquired all the character strings of the item data 500 in S601 (No in S604), the generation unit 13 executes the process of S601.

When the generation unit 13 has acquired all the character strings of the item data 500 in S601 (Yes in S604), it executes the process of S701. At this point, it is assumed that the generation unit 13 has recognized all the heading wordings corresponding to the item types described in the form 100. If the generation unit 13 cannot recognize all the heading wordings corresponding to the item types described in the form 100, it may end the process of generating the definition body as an error. In that case, the generation unit 13 may display error information on the display unit 40 to prompt the user to generate the definition body by manual input.
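The loop of S601 to S604 amounts to looking up each character string of the item data in the heading data. The following is a minimal sketch under assumed data shapes: the dictionary keys, the `recognize_headings` function, and the sample records are hypothetical, and only the "amount" item type with the "transfer amount" heading wording is taken from the example used in the description.

```python
from typing import Dict, Iterable, List


def recognize_headings(item_strings: Iterable[str],
                       heading_data: List[dict]) -> Dict[str, str]:
    """Return a mapping of item type to the heading wording found on the form.

    Each string is compared with the heading wordings (S602); a match is
    recorded as the heading for the associated item type (S603).
    """
    wording_to_type = {h["wording"]: h["item_type"] for h in heading_data}
    recognized: Dict[str, str] = {}
    for text in item_strings:              # S601, repeated until S604 is Yes
        item_type = wording_to_type.get(text)
        if item_type is not None:
            recognized[item_type] = text
    return recognized


HEADING_DATA = [
    {"item_type": "amount", "wording": "transfer amount",
     "data_type": "numeric", "max_digits": 10},   # data type and digits are assumed
    {"item_type": "payee", "wording": "payee name",
     "data_type": "text", "max_digits": 20},      # hypothetical second record
]

print(recognize_headings(["transfer amount", "12,000", "Tokyo"], HEADING_DATA))
```

Item types that never appear in the returned mapping correspond to the case in which not all heading wordings could be recognized, which the description allows to be handled as an error.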

The description continues with reference to FIG. 8.
The generation unit 13 acquires a character string (item name) from the item data 500 (S701). When executing the process of S701 following the process of S703 described later, the generation unit 13 may acquire a character string that has not yet been acquired from the item data 500.

生成部１３は、取引データ４００にＳ７０１で取得した文字列と同じ文字列を示す項目データがあるか否かを判定する（Ｓ７０２）。 The generation unit 13 determines whether there is item data indicating the same character string as the character string acquired in S701 in the transaction data 400 (S702).

生成部１３は、取引データ４００にＳ７０１で取得した文字列と同じ文字列を示す項目データがないとき（Ｓ７０２にてＮｏ）、Ｓ７０１においてアイテムデータ５００の全ての文字列を取得したか否かを判定する（Ｓ７０３）。 When there is no item data indicating the same character string as the character string acquired in S701 in the transaction data 400 (No in S702), the generation unit 13 determines whether or not all the character strings of the item data 500 are acquired in S701. Determination is made (S703).

生成部１３は、Ｓ７０１においてアイテムデータ５００の全ての文字列を取得していないとき（Ｓ７０３にてＮｏ）、Ｓ７０１の処理を実行する。 When the generation unit 13 has not acquired all the character strings of the item data 500 in S701 (No in S703), the generation unit 13 performs the process of S701.

生成部１３は、Ｓ７０１において、アイテムデータ５００の全ての文字列を取得したとき（Ｓ７０３にてＹｅｓ）、定義体を生成する処理を終了する。このとき、生成部１３は、後述するＳ８０５において、全ての項目種に対応するレコードを生成していないと判定されている場合、定義体を生成する処理をエラーとして終了しても良い。そして、生成部１３は、表示部４０にエラー情報と、生成していない文字認識情報２０１のレコードとを表示させ、ユーザに対して手入力による定義体の生成を促しても良い。 When the generation unit 13 acquires all the character strings of the item data 500 in S701 (Yes in S703), the generation unit 13 ends the process of generating the definition body. At this time, if it is determined in S805, which will be described later, that the records corresponding to all the item types have not been generated, the generation unit 13 may end the process of generating the definition body as an error. Then, the generation unit 13 may cause the display unit 40 to display error information and a record of the character recognition information 201 that has not been generated, and prompt the user to generate a definition body by manual input.

In S702, when the transaction data 400 contains item data indicating the same character string as the character string acquired in S701 (Yes in S702), the generation unit 13 acquires, from the transaction data 400, the item type associated with that item data (S704).

生成部１３は、設定情報２７を参照して、Ｓ７０４で取得した項目種を示す見出し文言を取得する（Ｓ７０５）。 The generation unit 13 refers to the setting information 27 and acquires a headline wording indicating the item type acquired in S704 (S705).

そして、生成部１３は、Ｓ７０１で取得した文字列が、Ｓ７０５で取得した項目種を示す見出し文言の近傍にあるか否かを判定する（Ｓ７０６）。生成部１３は、例えば、アイテムデータ５００から、Ｓ７０５で取得した項目種を示す見出し文言に対応する画像領域を取得し、Ｓ７０１で取得した文字列に対応する画像領域との位置関係を判定する。これにより、生成部１３は、Ｓ７０１で取得した文字列が、Ｓ７０５で取得した項目種を示す見出しの近傍にあるか否かを判定する。見出しの近傍とは、例えば、紙面の種類や見出し種別により異なる。見出しの近傍とは、紙面が帳票１００であるとき、例えば、見出しの下、右下、および右側にある所定の領域のことを言う。 Then, the generation unit 13 determines whether the character string acquired in S701 is in the vicinity of the headline wording indicating the item type acquired in S705 (S706). For example, the generation unit 13 acquires an image area corresponding to the headline wording indicating the item type acquired in S705 from the item data 500, and determines the positional relationship with the image area corresponding to the character string acquired in S701. Thereby, the generation unit 13 determines whether or not the character string acquired in S701 is near the heading indicating the item type acquired in S705. The vicinity of a headline differs depending on, for example, the type of paper and the type of headline. The vicinity of the heading means, for example, a predetermined area under the heading, the lower right, and the right side when the paper surface is the form 100.
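One way to express the vicinity test of S706 is as a search window that extends to the right of and below the heading. The sketch below is an assumption-laden illustration: the `Region` type, the `is_near_heading` function, and the `max_dx` and `max_dy` margins are hypothetical, since the description says only that the vicinity is a predetermined area below, to the lower right of, and to the right of the heading for the form 100.

```python
from typing import NamedTuple


class Region(NamedTuple):
    left: int
    top: int
    right: int
    bottom: int


def is_near_heading(heading: Region, candidate: Region,
                    max_dx: int, max_dy: int) -> bool:
    """True if the candidate lies right of, below, or lower right of the
    heading and fits inside a window extended by max_dx and max_dy."""
    inside_window = (
        candidate.left >= heading.left
        and candidate.top >= heading.top
        and candidate.right <= heading.right + max_dx
        and candidate.bottom <= heading.bottom + max_dy
    )
    clear_of_heading = (candidate.left >= heading.right
                        or candidate.top >= heading.bottom)
    return inside_window and clear_of_heading


heading = Region(100, 100, 180, 120)
print(is_near_heading(heading, Region(190, 100, 300, 120), 150, 60))  # to the right
print(is_near_heading(heading, Region(100, 60, 220, 80), 150, 60))    # above
```

Because the vicinity depends on the type of paper and the type of heading, the margins would in practice be selected per form rather than fixed.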

生成部１３は、Ｓ７０１で取得した文字列が、Ｓ７０５で取得した項目種を示す見出しの近傍にないとき（Ｓ７０６にてＮｏ）、Ｓ７０３の処理を実行する。 When the character string acquired in S701 is not in the vicinity of the heading indicating the item type acquired in S705 (No in S706), the generation unit 13 executes the process of S703.

生成部１３は、Ｓ７０１で取得した文字列が、Ｓ７０５で取得した項目種を示す見出しの近傍にあるとき（Ｓ７０６にてＹｅｓ）、文字列が項目種に対応する項目データであると認識する（Ｓ７０７）。 When the character string acquired in S701 is in the vicinity of the heading indicating the item type acquired in S705 (Yes in S706), the generation unit 13 recognizes that the character string is item data corresponding to the item type ( S707).

なお、生成部１３は、Ｓ７０６において、Ｓ７０５で取得した見出し文言に対応するデータ種を見出しデータ７００から取得しても良い。そして、生成部１３は、取得したデータ種がＳ７０１で取得した文字列の種類に対応するとき、Ｓ７０７の処理を実行しても良い。また、生成部１３は、取得したデータ種がＳ７０１で取得した文字列の種類に対応しないとき、Ｓ７０３の処理を実行しても良い。 Note that the generation unit 13 may acquire the data type corresponding to the headline wording acquired in S705 from the headline data 700 in S706. Then, the generation unit 13 may execute the process of S707 when the acquired data type corresponds to the character string type acquired in S701. Further, the generation unit 13 may execute the process of S703 when the acquired data type does not correspond to the character string type acquired in S701.

生成部１３は、Ｓ７０７で項目データと認識した文字列を囲む認識領域を取得する（Ｓ７０８）。そして、Ｓ８０１の処理を実行する。 The generation unit 13 acquires a recognition area surrounding the character string recognized as item data in S707 (S708). Then, the process of S801 is executed.

FIG. 21 and FIG. 22 are diagrams illustrating the recognition area.
The setting of the recognition area will be described with reference to FIG. 21. The following describes the process of setting the recognition area of the item data that corresponds to the heading wording (transfer amount) of the item type (amount).

As shown in FIG. 21, when the item data 800 is surrounded by ruled lines, the generation unit 13 acquires the upper-left coordinates (G9, H9) and the lower-right coordinates (G10, H10) of the ruled lines surrounding the item data 800. The generation unit 13 thereby acquires the image area (G9, H9)-(G10, H10), which denotes the rectangle whose diagonal is the line connecting the upper-left coordinates (G9, H9) and the lower-right coordinates (G10, H10). The generation unit 13 then sets the acquired image area (G9, H9)-(G10, H10) as the recognition area in which the item data 800 is recognized.

When the item data 801 is not surrounded by ruled lines, as shown in FIG. 22(a), the generation unit 13 generates a rectangle that encloses the item data 801 and contains no other wording, as shown in FIG. 22(b). The generation unit 13 then acquires the upper-left coordinates (G9, H9) and the lower-right coordinates (G10, H10) of the generated rectangle. The generation unit 13 thereby acquires the image area (G9, H9)-(G10, H10), which denotes the rectangle whose diagonal is the line connecting the upper-left coordinates (G9, H9) and the lower-right coordinates (G10, H10). The generation unit 13 then sets the acquired image area (G9, H9)-(G10, H10) as the recognition area in which the item data 801 is recognized. Note that the rectangle denoted by the recognition area may be, for example, the smallest of the rectangles that enclose the item data.
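The two cases of FIG. 21 and FIG. 22 can be sketched as a single function. This is an illustrative sketch, not the method of the embodiment: the `recognition_region` function, the tuple layout (left, top, right, bottom), and the use of per-character bounding boxes are assumptions. When no ruled cell is available, it returns the smallest rectangle enclosing the item data, which is the option mentioned at the end of the paragraph above.

```python
from typing import List, Optional, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom)


def recognition_region(char_boxes: List[Box],
                       enclosing_cell: Optional[Box] = None) -> Box:
    """Return the recognition area for one piece of item data.

    If the item data sits inside a ruled cell, the cell itself is used
    (FIG. 21). Otherwise the smallest rectangle that encloses all
    character boxes is used (FIG. 22).
    """
    if enclosing_cell is not None:
        return enclosing_cell
    if not char_boxes:
        raise ValueError("at least one character box is required")
    lefts, tops, rights, bottoms = zip(*char_boxes)
    return (min(lefts), min(tops), max(rights), max(bottoms))


digits = [(10, 20, 18, 32), (20, 21, 28, 33), (30, 19, 38, 31)]
print(recognition_region(digits))                    # no ruled lines
print(recognition_region(digits, (5, 15, 60, 40)))   # ruled cell available
```

A minimal rectangle leaves no margin for handwriting that drifts outside the printed sample, so a practical implementation might pad the result; that choice is outside what the description specifies.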

The description continues with reference to FIG. 9.
The generation unit 13 determines whether the determination result included in the record acquired in S301 is "new" (S801).

生成部１３は、Ｓ３０１で取得したレコードに含まれる判別結果が新規であるとき（Ｓ８０１にてＹｅｓ）、文字認識情報２０１に、Ｓ７０４で取得した項目種に対応するレコードを生成する。そして、生成部１３は、Ｓ７０４で取得した項目種と関連付けて、Ｓ７０８で取得した認識領域を文字認識情報２０１に格納する（Ｓ８０２）。 When the determination result included in the record acquired in S301 is new (Yes in S801), the generation unit 13 generates a record corresponding to the item type acquired in S704 in the character recognition information 201. Then, the generation unit 13 stores the recognition area acquired in S708 in the character recognition information 201 in association with the item type acquired in S704 (S802).

生成部１３は、見出しデータ７００を検索し、Ｓ７０４で取得した項目種を含むレコードを取得する（Ｓ８０３）。 The generation unit 13 searches the heading data 700 and acquires a record including the item type acquired in S704 (S803).

生成部１３は、取得したレコードに含まれる情報を文字認識情報２０１に格納する（Ｓ８０４）。すなわち、生成部１３は、Ｓ７０４で取得した項目種を含むレコードから、データ種、および最大桁数を取得し、取得した情報をＳ７０４で取得した項目種に関連付けて文字認識情報２０１に格納する。 The generation unit 13 stores the information included in the acquired record in the character recognition information 201 (S804). That is, the generation unit 13 acquires the data type and the maximum number of digits from the record including the item type acquired in S704, and stores the acquired information in the character recognition information 201 in association with the item type acquired in S704.

そして、生成部１３は、全ての項目種に対応するレコードを生成したか否かを判定する（Ｓ８０５）。 Then, the generation unit 13 determines whether records corresponding to all item types have been generated (S805).

生成部１３は、全ての項目種に対応するレコードを生成していないとき（Ｓ８０５にてＮｏ）、Ｓ７０１の処理を実行する。 When the generation unit 13 has not generated records corresponding to all the item types (No in S805), the generation unit 13 executes the process of S701.

生成部１３は、全ての項目種に対応するレコードを生成したとき（Ｓ８０５にてＹｅｓ）、定義体を生成する処理を終了する。 When the generation unit 13 generates records corresponding to all item types (Yes in S805), the generation unit 13 ends the process of generating the definition body.

In S801, when the determination result included in the record acquired in S301 is "existing" (No in S801), the generation unit 13 acquires the recognition area associated with the item type acquired in S704 (hereinafter also referred to as the existing recognition area) from the definition body corresponding to the ruled line matching form included in the record acquired in S301. The generation unit 13 then determines whether the recognition area acquired in S708 differs from the existing recognition area (S806).

生成部１３は、Ｓ７０８で取得した認識領域と、既存の認識領域とが同じとき（Ｓ８０６にてＮｏ）、Ｓ８０５の処理を実行する。 When the recognition area acquired in S708 is the same as the existing recognition area (No in S806), the generation unit 13 executes the process of S805.

When the recognition area acquired in S708 differs from the existing recognition area (Yes in S806), the generation unit 13 updates the recognition area stored in the character recognition information 201 of the existing form to the recognition area acquired in S708 (S807). The generation unit 13 then executes the process of S805. In this way, when only the recognition area of item data has changed in an existing form, the generation unit 13 automatically updates the recognition area in the definition body corresponding to that form.
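The create-or-update behavior of S801 to S807 can be sketched as follows. The dictionary layout of the character recognition information, the `store_recognition_region` function, and its return strings are hypothetical; the sketch also assumes that an existing form already holds a record for the item type, a case the description does not spell out.

```python
from typing import Dict, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom)


def store_recognition_region(recognition_info: Dict[str, dict], item_type: str,
                             region: Box, is_new_form: bool,
                             heading_record: dict) -> str:
    """Create a record for a new form, or update the region of an existing one."""
    if is_new_form:                                   # S801: Yes
        recognition_info[item_type] = {               # S802 to S804
            "region": region,
            "data_type": heading_record["data_type"],
            "max_digits": heading_record["max_digits"],
        }
        return "created"
    existing = recognition_info[item_type]            # assumed to exist already
    if existing["region"] != region:                  # S806: Yes
        existing["region"] = region                   # S807
        return "updated"
    return "unchanged"                                # S806: No


info: Dict[str, dict] = {}
heading = {"data_type": "numeric", "max_digits": 10}  # assumed heading data record
print(store_recognition_region(info, "amount", (10, 19, 38, 33), True, heading))
print(store_recognition_region(info, "amount", (10, 19, 44, 33), False, heading))
print(store_recognition_region(info, "amount", (10, 19, 44, 33), False, heading))
```

The three calls show, in order, a record being created for a new form, its region being updated after the layout changes, and no change when the same region is seen again.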

FIG. 23 is a block diagram illustrating an embodiment of a computer device.
The configuration of the character recognition device 1 will be described with reference to FIG. 23.

In FIG. 23, a computer device 900 includes a control circuit 901, a storage device 902, a reading device 903, a recording medium 904, a communication interface 905 (communication I/F), an input/output interface 906 (input/output I/F), a display device 907, and a network 908. These components are connected by a bus 909.

制御回路９０１は、コンピュータ装置９００全体の制御をする。そして、制御回路９０１は、例えば、ＣＰＵ、マルチコアＣＰＵ、ＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）およびＰＬＤ（ＰｒｏｇｒａｍｍａｂｌｅＬｏｇｉｃＤｅｖｉｃｅ）などである。制御回路９０１は、例えば、図１において、制御部１０として機能する。なお、ＣＰＵ、ＦＰＧＡ、およびＰＬＤのキャッシュは、例えば、図１に示す設定情報２７を記憶しても良い。 The control circuit 901 controls the entire computer device 900. The control circuit 901 is, for example, a CPU, a multi-core CPU, an FPGA (Field Programmable Gate Array), a PLD (Programmable Logic Device), or the like. For example, the control circuit 901 functions as the control unit 10 in FIG. Note that the CPU, FPGA, and PLD cache may store, for example, the setting information 27 shown in FIG.

The storage device 902 stores various data. The storage device 902 includes, for example, memory such as a ROM (Read Only Memory) and a RAM (Random Access Memory), and an HD (Hard Disk). The storage device 902 functions, for example, as the storage unit 20 in FIG. 1. The storage device 902 may store, for example, the definition body information 21, transaction information 22, discrimination information 23, item information 24, extraction information 25, heading information 26, and setting information 27 shown in FIG. 1.

また、ＲＯＭは、ブートプログラムなどのプログラムを記憶している。ＲＡＭは、制御回路９０１のワークエリアとして使用される。ＨＤは、ＯＳ、アプリケーションプログラム、ファームウェアなどのプログラム、および各種データを記憶している。 The ROM stores a program such as a boot program. The RAM is used as a work area for the control circuit 901. The HD stores an OS, an application program, a program such as firmware, and various data.

記憶装置９０２は、例えば、制御回路９０１を、制御部１０として機能させる文字認識プログラムを記憶する。 The storage device 902 stores, for example, a character recognition program that causes the control circuit 901 to function as the control unit 10.

When performing the process of determining the type of a form or the process of generating a definition body, the character recognition device 1 reads the character recognition program stored in the storage device 902 into the RAM. The character recognition device 1 then performs the process of determining the type of the form or the process of generating the definition body by having the control circuit 901 execute the character recognition program read into the RAM.

なお、文字認識プログラムは、制御回路９０１が通信インターフェイス９０５を介してアクセス可能であれば、ネットワーク９０８上のサーバが有する記憶装置に記憶されていても良い。 Note that the character recognition program may be stored in a storage device included in a server on the network 908 as long as the control circuit 901 is accessible via the communication interface 905.

The reading device 903 is controlled by the control circuit 901 and reads and writes data on the removable recording medium 904. The reading device 903 is, for example, an FDD (Floppy Disk Drive), a CDD (Compact Disc Drive), a DVDD (Digital Versatile Disk Drive), a BDD (Blu-ray (registered trademark) Disk Drive), or USB (Universal Serial Bus).

The recording medium 904 stores various data. The recording medium 904 stores, for example, the character recognition program. The recording medium 904 may further store, for example, the definition body information 21, transaction information 22, discrimination information 23, item information 24, extraction information 25, heading information 26, and setting information 27 shown in FIG. 1.

そして、記録媒体９０４は、読書装置９０３を介してバス９０９に接続され、制御回路９０１が読書装置９０３を制御することにより、データのリード／ライトが行なわれる。また、記録媒体９０４は、例えば、ＦＤ（ＦｌｏｐｐｙＤｉｓｋ）、ＣＤ（ＣｏｍｐａｃｔＤｉｓｃ）、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）、ＢＤ（Ｂｌｕ−ｒａｙ(登録商標）Ｄｉｓｋ）、およびフラッシュメモリなどである。 The recording medium 904 is connected to the bus 909 via the reading device 903, and the control circuit 901 controls the reading device 903 to read / write data. The recording medium 904 is, for example, an FD (Floppy Disk), a CD (Compact Disc), a DVD (Digital Versatile Disk), a BD (Blu-ray (registered trademark) Disk), and a flash memory.

通信インターフェイス９０５は、ネットワーク９０８を介してコンピュータ装置９００と他の装置とを通信可能に接続する。 The communication interface 905 connects the computer apparatus 900 and other apparatuses via a network 908 so that they can communicate with each other.

入出力インターフェイス９０６は、例えば、キーボード、マウス、タッチパネル、およびスキャナなどと接続され、接続された装置から各種情報を示す信号が入力されると、バス９０９を介して入力された信号を制御回路９０１に出力する。また、入出力インターフェイス９０６は、制御回路９０１から出力された各種情報を示す信号がバス９０９を介して入力されると、接続された各種装置にその信号を出力する。入出力インターフェイス９０６は、例えば、第１サイズ、第２サイズ、第１文字サイズ、第２文字サイズ、第１文字数、および第２文字数の設定値の入力を受け付けても良い。また、入出力インターフェイス９０６に接続されるスキャナは、例えば、図１に示す読取部３０として機能する。 The input / output interface 906 is connected to, for example, a keyboard, a mouse, a touch panel, a scanner, and the like. When signals indicating various types of information are input from the connected devices, the control circuit 901 receives the signals input via the bus 909. Output to. When a signal indicating various information output from the control circuit 901 is input via the bus 909, the input / output interface 906 outputs the signal to various connected devices. The input / output interface 906 may accept input of setting values for the first size, the second size, the first character size, the second character size, the first character number, and the second character number, for example. The scanner connected to the input / output interface 906 functions as the reading unit 30 shown in FIG. 1, for example.

表示装置９０７は、例えば、入出力インターフェイス９０６に接続され、制御部１０から入力される信号に基づいて、各種情報を表示する。また、表示装置９０７は、例えば、例えばＣＲＴ（ＣａｔｈｏｄｅＲａｙＴｕｂｅ）、ＬＣＤ（ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ）、ＰＤＰ（ＰｌａｓｍａＤｉｓｐｌａｙＰａｎｅｌ）、およびＯＥＬＤ（ＯｒｇａｎｉｃＥｌｅｃｔｒｏｌｕｍｉｎｅｓｃｅｎｃｅＤｉｓｐｌａｙ）などである。そして、表示装置９０７は、例えば、図１において、表示部４０として機能する。 The display device 907 is connected to the input / output interface 906, for example, and displays various types of information based on signals input from the control unit 10. The display device 907 is, for example, a CRT (Cathode Ray Tube), an LCD (Liquid Crystal Display), a PDP (Plasma Display Panel), or an OELD (Organic Electroluminescence Display). The display device 907 functions as the display unit 40 in FIG.

ネットワーク９０８は、例えば、ＬＡＮ、無線通信、またはインターネットなどであり、コンピュータ装置９００と他の装置を通信接続する。 The network 908 is, for example, a LAN, wireless communication, the Internet, or the like, and connects the computer apparatus 900 and other apparatuses for communication.

以上のように、実施形態の文字認識装置１は、紙面に記載されたアイテムの画像から所定のサイズ以上の画像を抽出し、抽出したアイテムの画像を特徴画像として格納する定義体を生成する。これにより、実施形態の文字認識装置１は、定義体を自動生成することができる。 As described above, the character recognition device 1 according to the embodiment extracts an image having a predetermined size or more from an item image described on a sheet, and generates a definition body that stores the extracted item image as a feature image. Thereby, the character recognition device 1 of the embodiment can automatically generate the definition body.

実施形態の文字認識装置１は、読取部３０で読み取った紙面の画像からアイテムの画像を取得し、取得したアイテムの画像と同じ画像を、定義体に格納されている特徴画像から検索する。これにより、実施形態の文字認識装置１は、読取部３０で読み取った紙面の種類を、検索された特徴画像を格納した定義体に格納された紙面の種類であると認識することができる。 The character recognition device 1 according to the embodiment acquires an image of an item from a paper image read by the reading unit 30 and searches the feature image stored in the definition body for the same image as the acquired image of the item. Thereby, the character recognition device 1 of the embodiment can recognize that the type of the page read by the reading unit 30 is the type of the page stored in the definition body storing the searched feature image.

From the item images acquired from the page, the character recognition device 1 of the embodiment extracts the item images whose height and width are both equal to or larger than predetermined sizes, and generates a definition body that stores the extracted item images as feature images. Consequently, the character recognition device 1 of the embodiment does not extract sentences, lines, and the like that are long in only one of the vertical and horizontal directions. The character recognition device 1 of the embodiment can thereby accurately extract, from the item images on the page, feature images that do not appear on other pages.
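The two-dimensional size test described above can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the `Item` class, the function name `extract_feature_candidates`, the sample items, and the pixel thresholds are all assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    name: str    # hypothetical label, used only for illustration
    width: int   # horizontal size in pixels (assumed unit)
    height: int  # vertical size in pixels (assumed unit)


def extract_feature_candidates(items, min_width, min_height):
    """Keep only items that meet BOTH the width and the height threshold."""
    return [it for it in items
            if it.width >= min_width and it.height >= min_height]


# Hypothetical items found on one page.
items = [
    Item("logo", 120, 80),
    Item("ruled_line", 600, 2),   # long in one direction only
    Item("sentence", 400, 12),    # long in one direction only
    Item("stamp_box", 60, 60),
]
candidates = extract_feature_candidates(items, min_width=50, min_height=50)
print([it.name for it in candidates])  # ['logo', 'stamp_box']
```

Requiring both dimensions to pass is what excludes the ruled line and the sentence, which are large in only one direction.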

When an item image includes an image of a character string, the character recognition device 1 of the embodiment generates a definition body that stores, as a feature image, an item image whose characters are of a predetermined size or larger. Consequently, the character recognition device 1 of the embodiment does not extract, as a feature image, an item image whose characters are smaller than the predetermined size. The character recognition device 1 of the embodiment can thereby accurately extract, from the item images on the page, feature images that do not appear on other pages.

When an item image includes an image of a character string, the character recognition device 1 of the embodiment generates a definition body that stores, as a feature image, an item image whose character string contains a predetermined number of characters or more. Consequently, the character recognition device 1 of the embodiment does not extract, as a feature image, an item image whose character string contains fewer characters than the predetermined number. The character recognition device 1 of the embodiment can thereby accurately extract, from the item images on the page, feature images that do not appear on other pages.
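The character-size and character-count conditions for character string images can be combined in one filter, as in the sketch below. The `TextItem` class, the function name, the sample strings, and the threshold values are hypothetical, and the character height is assumed to be uniform within a string.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextItem:
    text: str         # recognized wording of the character string
    char_height: int  # character height in pixels (assumed uniform)


def extract_text_features(items, min_char_height, min_chars):
    """Keep strings whose characters are large enough and numerous enough."""
    return [
        it for it in items
        if it.char_height >= min_char_height and len(it.text) >= min_chars
    ]


# Hypothetical character strings found on one page.
text_items = [
    TextItem("TRANSFER REQUEST FORM", 32),                        # large and long
    TextItem("No.", 32),                                          # large but short
    TextItem("Please fill in all fields in block letters.", 9),   # long but small
]
selected = extract_text_features(text_items, min_char_height=20, min_chars=5)
print([it.text for it in selected])  # ['TRANSFER REQUEST FORM']
```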

The character recognition device 1 of the embodiment generates a definition body that stores each feature image in association with an image area indicating the region in which the feature image appears. When determining the type of a page, the character recognition device 1 of the embodiment checks whether the image area of an item image on the page matches the image area of the corresponding feature image stored in the definition body. When the image areas of the page and the definition body match, the character recognition device 1 of the embodiment can thereby recognize the type of the page read by the reading unit 30 as the page type stored in the definition body whose image area matched.

When no definition body storing the same ruled-line layout as that of the page from which the item images were acquired is stored in the storage unit 20, the character recognition device 1 of the embodiment generates a definition body that stores the type of that page in association with the extracted item images. Because the character recognition device 1 of the embodiment does not redundantly generate a definition body that already exists, it can keep the processing from becoming complicated.

The character recognition device 1 of the embodiment stores a first size and a second size as thresholds used when extracting feature images. When a definition body storing the same ruled-line layout as that of the page, together with an item image identical to the item image extracted from the page using the first size, is stored in the storage unit 20, the character recognition device 1 of the embodiment extracts item images of the second size or larger. Because the size of the item images extracted as feature images is thus reduced in stages, the character recognition device 1 of the embodiment can keep the number of extracted feature images from growing and complicating the processing, while still extracting feature images that do not appear on other forms.
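The staged use of the first and second sizes can be sketched as follows. This is an illustration under simplifying assumptions: item images are represented by name strings, "identical image" is modeled as equality of those names, and a stored definition body is reduced to a `(layout, feature set)` pair. The function names and sample values are hypothetical.

```python
def extract(items, size):
    """Return the names of items at least as large as size = (width, height)."""
    min_w, min_h = size
    return frozenset(name for name, w, h in items if w >= min_w and h >= min_h)


def extract_distinctive(items, layout, stored, first_size, second_size):
    """Extract with the first size; fall back to the smaller second size
    only when a stored definition body already has the same ruled-line
    layout and the same extracted features."""
    features = extract(items, first_size)
    collides = any(l == layout and f == features for l, f in stored)
    if collides:
        features = extract(items, second_size)
    return features


# Hypothetical page: (name, width, height) per item.
items = [("bank_logo", 120, 80), ("branch_mark", 40, 40)]
stored = [("layout_A", frozenset({"bank_logo"}))]

result = extract_distinctive(items, "layout_A", stored, (100, 60), (30, 30))
print(sorted(result))  # ['bank_logo', 'branch_mark']
```

Starting from the larger threshold keeps the feature set small, and the smaller threshold is applied only when the larger one fails to distinguish the page from an existing definition body.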

When a definition body storing the same ruled-line layout as that of the page, together with a character string image identical to the character string image extracted from the page using a first number of characters, is stored in the storage unit 20, the character recognition device 1 of the embodiment extracts images of character strings having at least a second number of characters, the second number being smaller than the first. Because the number of characters required of a character string extracted as a feature image is thus reduced in stages, the character recognition device 1 of the embodiment can keep the number of extracted feature images from growing and complicating the processing, while still extracting feature images that do not appear on other forms.

When data of the kind corresponding to an item type is located near a character string recognized as indicating that item type, the character recognition device 1 of the embodiment generates a definition body that stores the item type in association with a recognition area indicating the region in which that data appears. The character recognition device 1 of the embodiment can thereby automate the creation of a definition body that indicates the recognition areas of the data corresponding to the item types printed on the page.

When the data corresponding to an item type is surrounded by ruled lines, the character recognition device 1 of the embodiment uses the region surrounded by the ruled lines as the recognition area. The character recognition device 1 of the embodiment can thereby set the recognition area automatically.

When the data is not surrounded by ruled lines, the character recognition device 1 of the embodiment uses, as the recognition area, a region that surrounds the data and contains no other wording. The character recognition device 1 of the embodiment can thereby set the recognition area automatically.
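The two rules for setting the recognition area can be sketched as below. Rectangles are assumed to be `(x0, y0, x1, y1)` tuples, and the function names are hypothetical. For data not enclosed by ruled lines, this sketch simply returns the tight bounding box of the data, which contains no other wording as long as no other text overlaps the data; choosing the smallest enclosing cell is likewise an assumption made here.

```python
def contains(outer, inner):
    """True when rectangle `inner` lies entirely within rectangle `outer`."""
    ox0, oy0, ox1, oy1 = outer
    ix0, iy0, ix1, iy1 = inner
    return ox0 <= ix0 and oy0 <= iy0 and ix1 <= ox1 and iy1 <= oy1


def area(box):
    x0, y0, x1, y1 = box
    return (x1 - x0) * (y1 - y0)


def recognition_area(data_box, ruled_cells):
    """Use the smallest ruled-line cell enclosing the data if one exists;
    otherwise fall back to the bounding box of the data itself."""
    enclosing = [c for c in ruled_cells if contains(c, data_box)]
    if enclosing:
        return min(enclosing, key=area)
    return data_box


# Hypothetical ruled-line cells on a page.
cells = [(0, 0, 200, 40), (200, 0, 500, 40)]
print(recognition_area((210, 10, 300, 30), cells))   # (200, 0, 500, 40)
print(recognition_area((50, 100, 140, 120), cells))  # (50, 100, 140, 120)
```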

The character recognition device 1 of the embodiment searches for a definition body that stores a feature image identical to an item image acquired from the page, and determines that the type of the page from which the item was acquired is the page type stored in the retrieved definition body. The character recognition device 1 of the embodiment can thereby determine the type of a page using the definition bodies.

The character recognition device 1 of the embodiment searches for a definition body that stores, in association with each other, a feature image identical to the acquired item image and a feature-image area identical to the image area of the acquired item, and determines that the type of the page from which the item was acquired is the page type stored in the retrieved definition body. The character recognition device 1 of the embodiment can thereby improve the accuracy with which the page type is determined using the definition bodies.
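The effect of also comparing image areas during page type determination can be sketched as follows. The classes, the function `determine_page_type`, the page type names, and the coordinates are hypothetical, and image identity is modeled as equality of an `image_id` string rather than actual image matching.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    image_id: str  # stand-in for the feature image itself
    area: tuple    # (x0, y0, x1, y1) region where the image appears


@dataclass(frozen=True)
class DefinitionBody:
    page_type: str
    features: frozenset


def determine_page_type(page_items, definitions, use_area=True):
    """Return the page type of the first definition body with a matching
    feature image (and, optionally, a matching image area)."""
    for body in definitions:
        for feature in body.features:
            for item in page_items:
                same_image = item.image_id == feature.image_id
                same_area = item.area == feature.area
                if same_image and (same_area or not use_area):
                    return body.page_type
    return None


# Two hypothetical forms carry the same logo at different positions.
definitions = [
    DefinitionBody("deposit_slip",
                   frozenset({Feature("bank_logo", (20, 20, 140, 100))})),
    DefinitionBody("transfer_slip",
                   frozenset({Feature("bank_logo", (400, 20, 520, 100))})),
]
page = [Feature("bank_logo", (400, 20, 520, 100))]

print(determine_page_type(page, definitions, use_area=False))  # deposit_slip
print(determine_page_type(page, definitions, use_area=True))   # transfer_slip
```

With image matching alone, the first definition body containing the logo is returned; adding the area comparison selects the definition body whose logo position agrees with the scanned page.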

Note that the present embodiment is not limited to the embodiment described above, and various configurations or embodiments may be adopted without departing from the gist of the present embodiment.

DESCRIPTION OF SYMBOLS
1 Character recognition device
10 Control unit
11 Acquisition unit
12 Extraction unit
13 Generation unit
14 Recognition unit
15 Determination unit
20 Storage unit
21 Definition body information
22 Transaction information
23 Determination information
24 Item information
25 Extraction information
26 Information
27 Setting information
30 Reading unit
40 Display unit
100 Form
200 Form determination information
201 Character recognition information
300, 301 Determination data
400 Transaction data
500 Item data
600 Extraction data
700 Heading data
800, 801 Item data
900 Computer device
901 Control circuit
902 Storage device
903 Reading/writing device
904 Recording medium
905 Communication interface
906 Input/output interface
907 Display device
908 Network
909 Bus

Claims

A character recognition device comprising:
an acquisition unit that acquires images of one or more items from an image of a page;
an extraction unit that extracts, from the acquired images of the one or more items, an image of an item of a first size or larger; and
a generation unit that generates a definition body that stores, in association with each other, the type of the page from which the item images were acquired and the extracted item image.

The character recognition device according to claim 1, wherein
the first size includes a height and a width of an item image, and
the extraction unit extracts, from the acquired images of the one or more items, an image of an item whose height is equal to or larger than the height included in the first size and whose width is equal to or larger than the width included in the first size.

The character recognition device according to claim 1, wherein,
when the acquired images of the one or more items include images of one or more character strings, the extraction unit extracts, from the images of the one or more character strings, an image of a character string that has a first number of characters or more and includes characters of a predetermined character size or larger.

The character recognition device according to any one of claims 1 to 3, wherein
the generation unit generates a definition body that stores, in association with one another, the type of the page from which the item images were acquired, the extracted item image, and an image area indicating the region in which the extracted item image appears.

The character recognition device according to claim 1, further comprising
a storage unit that stores one or more definition bodies, wherein
the definition body further stores the type of the page in association with a ruled-line layout, and
the generation unit generates, when no definition body storing the same ruled-line layout as that of the page from which the item images were acquired is stored in the storage unit, a definition body that stores, in association with each other, the type of the page from which the item images were acquired and the extracted item image.

The character recognition device according to claim 1, further comprising
a storage unit that stores one or more definition bodies, wherein
the definition body further stores the type of the page in association with a ruled-line layout, and
the extraction unit extracts, when a definition body storing the same ruled-line layout as that of the page and the same item image as the item image extracted from the page using the first size is stored in the storage unit, an image of an item of a second size or larger, the second size being smaller than the first size.

The character recognition device according to claim 3, further comprising
a storage unit that stores one or more definition bodies, wherein
the definition body further stores the type of the page in association with a ruled-line layout, and
the extraction unit extracts, when a definition body storing the same ruled-line layout as that of the page and the same character string image as the character string image extracted from the page using the first number of characters is stored in the storage unit, an image of a character string having a second number of characters or more, the second number of characters being smaller than the first number of characters.

The character recognition device according to claim 1, wherein
the storage unit stores heading information that stores heading item types in association with heading wordings,
the character recognition device further comprises a recognition unit that, when the acquired images of the one or more items include an image of a character string, searches the heading information for the item type associated with the same heading wording as the wording of the character string, and recognizes that the character string indicates the retrieved item type, and
the generation unit generates, when data of the kind corresponding to the item type is located near the character string recognized as indicating the item type, a definition body that stores the item type in association with a recognition area indicating the region in which the data of the kind corresponding to the item type appears.

The character recognition device according to claim 8, wherein,
when the data is surrounded by ruled lines, the generation unit uses the region surrounded by the ruled lines as the recognition area.

The character recognition device according to claim 8, wherein,
when the data is not surrounded by ruled lines, the generation unit uses, as the recognition area, a region that surrounds the data and contains no other wording.

The character recognition device according to claim 1, further comprising:
a storage unit that stores one or more definition bodies; and
a determination unit that searches for a definition body storing the same item image as the acquired item image, and determines that the type of the page from which the item was acquired is the page type stored in the retrieved definition body.

The character recognition device according to claim 4, further comprising:
a storage unit that stores one or more definition bodies; and
a determination unit that searches for a definition body storing, in association with each other, the same item image as the acquired item image and the same image area as the image area of the acquired item, and determines that the type of the page from which the item was acquired is the page type stored in the retrieved definition body.

A character recognition method executed by a computer, the method comprising:
acquiring images of one or more items from an image of a page;
extracting, from the acquired images of the one or more items, an image of an item of a first size or larger; and
generating a definition body that stores, in association with each other, the type of the page from which the item images were acquired and the extracted item image.

A character recognition program that causes a computer to execute a process comprising:
acquiring images of one or more items from an image of a page;
extracting, from the acquired images of the one or more items, an image of an item of a first size or larger; and
generating a definition body that stores, in association with each other, the type of the page from which the item images were acquired and the extracted item image.