JP2021144469A

JP2021144469A - Data input support system, data input support method, and program

Info

Publication number: JP2021144469A
Application number: JP2020042708A
Authority: JP
Inventors: 陽子井戸; Yoko Ido
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2020-03-12
Filing date: 2020-03-12
Publication date: 2021-09-24

Abstract

To solve a problem such that, in data input work, a user needs to check whether or not an item subjected to character recognition processing is correct, and the user conventionally has to select an item for which character recognition cannot be performed and manually input a correct item, resulting in complicated check work.SOLUTION: In data input support processing of the present invention, processing of extracting a character string from a scanned image is performed to display it on a check screen. When a user designates a character string that has failed in extraction on a check screen, a character string is newly extracted according to a cause of failure in extraction. As a result, it is possible to enhance efficiency of check work as compared with a case where the user himself/herself manually performs correction.SELECTED DRAWING: Figure 9

Description

本発明は、テータ入力業務におけるユーザの確認作業を支援するためのデータ入力支援装置、データ入力支援方法及びプログラムに関する。 The present invention relates to a data input support device, a data input support method, and a program for supporting a user confirmation work in a data input business.

従来から、スキャナを用いて帳票に記載されている金額などの項目を読み取り、読み取った項目をＯＣＲ（Optical Character Recognition）などの文字認識処理により抽出し、自動的にシステムに登録するデータ入力業務がある。しかし、項目に網掛けがされているために読み取れない場合や、読み取れたとしても、文字認識処理をした結果があらかじめ設定された書式ルールと一致しないために抽出できない場合など、帳票からの項目の自動的な抽出に失敗するケースもある。したがって、データ入力業務に対しては、文字認識処理された項目が正しいかを確認し、正しくなかった場合にはそれを修正する、ユーザによる確認作業が必要である。 Conventionally, data entry work has been performed in which items such as the amount of money written on a form are read using a scanner, the read items are extracted by character recognition processing such as OCR (Optical Character Recognition), and automatically registered in the system. be. However, if the item cannot be read because it is shaded, or even if it can be read, it cannot be extracted because the result of character recognition processing does not match the preset format rules. In some cases, automatic extraction fails. Therefore, for the data input work, it is necessary for the user to confirm whether the item whose character recognition processing is correct is correct, and if it is not correct, correct it.

そのため、自動抽出された項目をユーザが確認する作業を支援するためのデータ入力支援システムが存在する。
しかし、読み取り対象の文書によっては、多様なレイアウトが存在する。例えば、帳票の一種である請求書について考えると、通常、請求書は発行元が独自のテンプレートを用いて作成され、多様なレイアウトが存在する。このため、データ入力支援システムでは、文書のレイアウト毎に項目の位置を登録する作業が必要であり、多様なレイアウトが存在し得る帳票に対して適用することは困難であった。 Therefore, there is a data input support system for assisting the user in confirming the automatically extracted items.
However, there are various layouts depending on the document to be read. For example, considering an invoice, which is a type of form, the invoice is usually created by the issuer using a template unique to the issuer, and there are various layouts. For this reason, in the data input support system, it is necessary to register the position of the item for each layout of the document, and it is difficult to apply it to a form in which various layouts can exist.

このような問題に対応する技術として、特許文献１には、帳票に記載された文字列を認識する帳票認識装置が開示されている。特許文献１では、多種のレイアウトが混在する帳票に対応するために、データの属性を表す文字列である項目名とデータを表す項目値を画像の文字認識結果から検索し、両者の位置関係に基づいて項目名と項目値を対応付けることで項目値の抽出を行う。
そして、表示画面に項目値の文字認識結果を表示するとともに、認識対象となった項目値の領域を太線の枠で囲んで表示する。また、特許文献１には、認識できない項目値があった場合には、操作部を通じて選択された項目値の入力を受け付けることが記載されている。 As a technique for dealing with such a problem, Patent Document 1 discloses a form recognition device that recognizes a character string described in a form. In Patent Document 1, in order to deal with a form in which various layouts are mixed, an item name representing a data attribute and an item value representing the data are searched from the character recognition result of the image, and the positional relationship between the two is determined. The item value is extracted by associating the item name with the item value based on the item value.
Then, the character recognition result of the item value is displayed on the display screen, and the area of the item value to be recognized is displayed by being surrounded by a thick line frame. Further, Patent Document 1 describes that when there is an unrecognizable item value, the input of the selected item value is accepted through the operation unit.

特開２０１６−５１３３９３号公報Japanese Unexamined Patent Publication No. 2016-513393

しかしながら、特許文献１においては、帳票認識装置が認識できなかった項目値については、ユーザがそれらを選択し、自ら正しい項目値を手入力する必要があるため、ユーザによる確認作業が煩雑となる。また、認識できなかった項目値について、再度、文字認識処理を行わせたとしても、認識に失敗した原因が解消されない限り、再び認識に失敗する蓋然性が高い。 However, in Patent Document 1, it is necessary for the user to select the item values that cannot be recognized by the form recognition device and manually input the correct item values by himself / herself, which complicates the confirmation work by the user. Further, even if the character recognition process is performed again for the item value that could not be recognized, there is a high possibility that the recognition will fail again unless the cause of the recognition failure is resolved.

本発明は、上記のような事情に鑑みてなされたものであり、ユーザによる確認作業の効率化を図るデータ入力支援システムを提供することを目的とする。 The present invention has been made in view of the above circumstances, and an object of the present invention is to provide a data input support system for improving the efficiency of confirmation work by a user.

本発明は、画像上の複数の項目について、前記項目に含まれる文字列を抽出する第１の抽出処理を行う第１の抽出手段と、前記複数の項目についての前記第１の抽出処理による第１の結果が表示される表示手段と、前記表示手段に前記第１の結果として表示された前記複数の項目のうち、１又は複数の項目の指定を受け付ける入力手段と、前記指定がされた項目について、前記第１の抽出処理とは異なる条件により、前記指定された項目に含まれる文字列を抽出する第２の抽出処理を行う第２の抽出手段と、を有することを特徴とするデータ入力支援システムである。 The present invention comprises a first extraction means for performing a first extraction process for extracting a character string included in a plurality of items on an image, and a first extraction process for the plurality of items. A display means for displaying the result of 1, an input means for accepting the designation of one or a plurality of items among the plurality of items displayed as the first result on the display means, and the designated item. The data input is characterized by having a second extraction means for performing a second extraction process for extracting a character string included in the designated item under conditions different from the first extraction process. It is a support system.

本発明によれば、データ入力業務におけるユーザによる確認作業を効率化することができる According to the present invention, it is possible to streamline the confirmation work by the user in the data input work.

データ入力支援装置のハードウェア構成を示す図である。It is a figure which shows the hardware configuration of the data input support device. 操作パネルに表示される画面の例である。This is an example of the screen displayed on the operation panel. データ入力支援装置のソフトウェア構成を示す図である。It is a figure which shows the software structure of the data input support device. 帳票画像の一例である。This is an example of a form image. 帳票画像から取得した項目情報の例である。This is an example of item information obtained from the form image. 確認画面の一例である。This is an example of a confirmation screen. 確認画面の一例である。This is an example of a confirmation screen. 二値化処理後帳票画像の例である。This is an example of a form image after binarization processing. 実施形態１のデータ入力支援処理を示すフローチャートである。It is a flowchart which shows the data input support process of Embodiment 1. 抽出修正処理を示すフローチャートである。It is a flowchart which shows the extraction correction processing. 確認画面の一例である。This is an example of a confirmation screen. 確認画面の一例である。This is an example of a confirmation screen. タッチ座標の領域を判断する方法を説明するための図である。It is a figure for demonstrating the method of determining the area of a touch coordinate. 実施形態２のデータ入力支援処理を示すフローチャートである。It is a flowchart which shows the data input support process of Embodiment 2.

以下に、図面を参照して、本発明を実施するための各実施形態について説明する。ただし、以下に説明する各実施形態はあくまで例示であり、本発明の範囲をそれらに限定する趣旨のものではない。また、以下の各実施形態で説明されている特徴の組み合わせのすべてが本発明の解決手段に必須のものとは限らない。 Hereinafter, embodiments for carrying out the present invention will be described with reference to the drawings. However, each embodiment described below is merely an example, and is not intended to limit the scope of the present invention to them. In addition, not all combinations of features described in the following embodiments are essential to the means of solving the present invention.

＜実施形態１＞
＜ハードウェア構成＞
図１は、実施形態１に係るデータ入力支援システムを実現するためのデータ入力支援装置１００のハードウェア構成を示す図である。
データ入力支援装置１００は、例えば、複合機（Multi Function Peripheral：ＭＦＰ）などの画像処理装置から構成される。データ入力支援装置１００は、制御部１０１と、ＲＯＭ１０２と、ＲＡＭ１０３と、ＨＤＤ１０４と、表示部１０５と、入力部１０６と、スキャナ１０７と、を有する。
制御部１０１は、ＣＰＵなどによって構成され、ＲＯＭ１０２に記憶された制御プログラムを読み出して各種処理を実行する。ＲＡＭ１０３は、制御部１０１の主メモリ、ワークエリアなどの一時記憶領域として用いられる。ＨＤＤ１０４は、各種データや各種プログラムなどを記憶する。なお、後述するデータ入力支援装置１００の機能や処理は、制御部１０１がＲＯＭ１０２またはＨＤＤ１０４に格納されているプログラムを読み出し、このプログラムを実行することにより実現される。 <Embodiment 1>
<Hardware configuration>
FIG. 1 is a diagram showing a hardware configuration of a data input support device 100 for realizing the data input support system according to the first embodiment.
The data input support device 100 is composed of, for example, an image processing device such as a multifunction device (MFP). The data input support device 100 includes a control unit 101, a ROM 102, a RAM 103, an HDD 104, a display unit 105, an input unit 106, and a scanner 107.
The control unit 101 is composed of a CPU or the like, reads a control program stored in the ROM 102, and executes various processes. The RAM 103 is used as a temporary storage area such as a main memory and a work area of the control unit 101. The HDD 104 stores various data, various programs, and the like. The functions and processes of the data input support device 100, which will be described later, are realized by the control unit 101 reading a program stored in the ROM 102 or the HDD 104 and executing this program.

表示部１０５は、液晶ディスプレイなどを有し、各種情報を表示する。入力部１０６は、キーボードやマウスを有し、ユーザによる各種操作を受け付ける。なお、表示部１０５と入力部１０６は、タッチパネルのように操作部として一体的に構成されてもよい。また、表示部１０５は、プロジェクタにより画像を投影するものであってもよい。この場合、入力部１０６は、投影された画像に対する指先の位置をカメラで認識するものであってもよい。
スキャナ１０７は、文書を読み取ってスキャン画像を生成する。なお、スキャナ１０７は、接触型スキャナに限られず、書画カメラやスマートフォンなどを用いた非接触型スキャナであってもよい。
本実施形態においては、スキャナ１０７は、帳票などの文書を読み取り、スキャン画像として帳票画像を生成する。そして、生成された帳票画像は、ＨＤＤ１０４などの記憶装置に記憶される。 The display unit 105 has a liquid crystal display and the like, and displays various information. The input unit 106 has a keyboard and a mouse, and accepts various operations by the user. The display unit 105 and the input unit 106 may be integrally configured as an operation unit like a touch panel. Further, the display unit 105 may project an image by a projector. In this case, the input unit 106 may recognize the position of the fingertip with respect to the projected image by the camera.
The scanner 107 reads the document and generates a scanned image. The scanner 107 is not limited to the contact type scanner, and may be a non-contact type scanner using a document camera, a smartphone, or the like.
In the present embodiment, the scanner 107 reads a document such as a form and generates a form image as a scanned image. Then, the generated form image is stored in a storage device such as HDD 104.

＜ＵＩ＞
図２は、本実施形態におけるデータ入力支援装置１００の表示部１０５及び入力部１０６を構成するＵＩ（User Interface）としての操作パネル２０１に表示される画面の例である。
操作パネル２０１は、タッチパネル２０２及びテンキー２０３を備える。タッチパネル２０２は、ログイン中のユーザＩＤを表示する欄２０２ａ、メインメニューなどが表示される欄２０２ｂなどを有する。
本実施形態において、ＵＩは、処理対象の帳票画像や、帳票画像から抽出した項目情報などをユーザに提供するための一手段であり、これらの情報はタッチパネル２０２上で提供される。なお、これらの情報は、タッチパネル２０２に限定されず、ＰＣに接続されたディスプレイ上に提供されてもよい。 <UI>
FIG. 2 is an example of a screen displayed on the operation panel 201 as a UI (User Interface) constituting the display unit 105 and the input unit 106 of the data input support device 100 in the present embodiment.
The operation panel 201 includes a touch panel 202 and a numeric keypad 203. The touch panel 202 has a column 202a for displaying a logged-in user ID, a column 202b for displaying a main menu, and the like.
In the present embodiment, the UI is one means for providing the user with a form image to be processed, item information extracted from the form image, and the like, and these information are provided on the touch panel 202. The information is not limited to the touch panel 202, and may be provided on a display connected to the PC.

＜ソフトウェア構成＞
図３は、データ入力支援装置１００を実現するソフトウェア構成を示すブロック図である。
図３に示されるように、データ入力支援装置１００を実現するソフトウェアは各種のモジュール３０１〜３１１から構成される。ソフトウェアを実現するプログラムは、ＲＯＭ１０２またはＨＤＤ１０４に記憶される。 <Software configuration>
FIG. 3 is a block diagram showing a software configuration for realizing the data input support device 100.
As shown in FIG. 3, the software that realizes the data input support device 100 is composed of various modules 301 to 311. The program that realizes the software is stored in the ROM 102 or the HDD 104.

制御手段３０１は、プログラムを制御し、各種モジュールに対する指示及び管理を行う。
表示手段３０２は、制御手段３０１からの指示に従い、データ入力支援装置１００をユーザが操作するためのＵＩや及び各種の処理結果を表示部１０５に提供する。
入力手段３０３は、データ入力支援装置１００に対するユーザの操作を受け付ける。
記憶手段３０４は、プログラム、及びプログラムが管理するその他の情報をＲＯＭ１０２またはＨＤＤ１０４に記憶する。
文字認識手段３０５は、帳票画像上の数字などの文字あるいは文字列（以下、まとめて「文字列」という）について、文字認識処理を行って文字列を抽出し、また、それらの座標及び文字種を特定する。 The control means 301 controls the program and gives instructions and management to various modules.
The display means 302 provides the display unit 105 with a UI for the user to operate the data input support device 100 and various processing results in accordance with the instruction from the control means 301.
The input means 303 accepts a user's operation on the data input support device 100.
The storage means 304 stores the program and other information managed by the program in the ROM 102 or the HDD 104.
The character recognition means 305 performs character recognition processing on characters or character strings such as numbers on a form image (hereinafter collectively referred to as "character strings") to extract character strings, and extracts their coordinates and character types. Identify.

項目情報抽出手段３０６は、帳票画像からデータ入力業務の対象となる項目から文字列を項目情報として抽出する。項目情報抽出手段３０６は、以下のサブモジュール３０７〜３１１を有する。ここで、項目とは、帳票画像においてまとまった文字列を含む各単位を指す。
項目値領域検出手段３０７は、帳票画像からデータ入力業務の対象データとなる文字列が記載されている領域を項目値領域として検出する。
項目名領域検出手段３０８は、帳票画像から項目値の名称を表す文字列が記載されている領域を項目名領域として検出する。
項目値取得手段３０９は、文字認識手段３０５により得られた項目値領域の文字列を項目値として取得する。
項目名取得手段３１０は、文字認識手段３０５により得られた項目名領域の文字列を項目名として取得する。
抽出成否取得手段３１１は、項目種類、項目値、項目名を抽出できたか否かに関する抽出可否情報を取得し、また、抽出できなかった場合にその原因である抽出失敗原因を判定し取得する。抽出失敗原因を判定する方法の詳細については、後述する。 The item information extracting means 306 extracts a character string as item information from the item to be the target of the data input business from the form image. The item information extraction means 306 has the following submodules 307 to 311. Here, the item refers to each unit including a set character string in the form image.
The item value area detecting means 307 detects an area in which a character string to be the target data of the data input business is described as the item value area from the form image.
The item name area detecting means 308 detects an area in which a character string representing the name of the item value is described as the item name area from the form image.
The item value acquisition means 309 acquires a character string in the item value area obtained by the character recognition means 305 as an item value.
The item name acquisition means 310 acquires the character string of the item name area obtained by the character recognition means 305 as the item name.
The extraction success / failure acquisition means 311 acquires extraction availability information regarding whether or not the item type, item value, and item name can be extracted, and if extraction is not possible, determines and acquires the cause of the extraction failure, which is the cause. Details of the method for determining the cause of extraction failure will be described later.

なお、項目値領域、項目名領域、項目値、項目名は、例えば、特許文献１で開示されるような公知の方法を用いて取得することができる。
また、文字認識手段３０５は帳票画像全体の文字列を文字認識処理の対象とする必要はなく、項目値取得手段３０９及び項目名取得手段３１０が必要とする文字列を認識すればよい。例えば、文字認識手段３０５は、文字列が記入された領域を抽出した後、該領域の位置、サイズ、領域間のレイアウトなどに基づき文字列が項目名ではないと判定した場合、該文字列について文字種を特定する処理を行わないことで計算量を軽減できる。 The item value area, item name area, item value, and item name can be obtained by using, for example, a known method as disclosed in Patent Document 1.
Further, the character recognition means 305 does not need to target the character string of the entire form image to the character recognition process, and may recognize the character string required by the item value acquisition means 309 and the item name acquisition means 310. For example, when the character recognition means 305 extracts an area in which a character string is entered and then determines that the character string is not an item name based on the position, size, layout between areas, etc. of the area, the character string is used. The amount of calculation can be reduced by not performing the process of specifying the character type.

＜項目情報＞
図４は、本実施形態において用いられる帳票を読み取ることにより生成された帳票画像４００の一例を示す図である。また、図５は、項目情報抽出手段３０６を用いて帳票画像４００から取得した、各項目に関する項目情報５０１である。
項目情報５０１は、１又は複数の項目（図５における各行）に関する各種情報を有する。各種情報としては、項目種類、項目値、項目名（複数ある場合もある）、正規形、抽出成否に関する情報、及び、項目値・項目名毎に領域の頂点座標に関する情報（不図示）などがある。 <Item information>
FIG. 4 is a diagram showing an example of a form image 400 generated by reading a form used in the present embodiment. Further, FIG. 5 is item information 501 for each item acquired from the form image 400 using the item information extraction means 306.
Item information 501 has various information about one or more items (each line in FIG. 5). Various information includes item type, item value, item name (there may be more than one), normal form, information on extraction success / failure, and information on the vertex coordinates of the area for each item value / item name (not shown). be.

図４及び図５の例では、図５におけるＮｏ．１〜３の項目についての項目情報は、それぞれ、図４における各領域４０１、４０３、４０５から抽出されたものである。同様に、図５におけるＮｏ．４〜７の項目についての項目情報は、それぞれ、図４における各領域４０６〜４０９から抽出されたものである。
また、図５におけるＮｏ．２〜６についての項目情報の各項目名１は、それぞれ、図４における領域４０３ａ、４０５ａ、４０４ａから抽出されたものである。同様に、図５におけるＮｏ．３についての項目情報の項目名２は、図４における領域４０５ｂから抽出されたものである。 In the examples of FIGS. 4 and 5, No. 5 in FIG. The item information for the items 1 to 3 is extracted from the regions 401, 403, and 405 in FIG. 4, respectively. Similarly, No. 5 in FIG. The item information for the items 4 to 7 is extracted from each region 406 to 409 in FIG. 4, respectively.
In addition, No. 5 in FIG. Each item name 1 of the item information about 2 to 6 is extracted from the regions 403a, 405a, and 404a in FIG. 4, respectively. Similarly, No. 5 in FIG. The item name 2 of the item information about No. 3 is extracted from the area 405b in FIG.

図５において、「項目種類」は、抽出された項目情報の種類を表す。図５の例では、項目種類として、「発行日」、「請求金額」、「電話番号」、「行小計」の４種類が抽出されている。なお、項目種類のうち、「行小計」項目は、請求金額の内訳を示す項目情報である。
「項目値」は、項目種類についてのデータの値であり、帳票画像に記載される文字列から抽出される。図５の例では、例えば、図４における領域４０６、４０８から、それぞれ、「４，４５０」、「５，０００」が抽出されている。しかし、領域４０７からは、文字認識に失敗したため、「？」とされている。また、領域４０８からは、数字の前後に括弧がつけられた「（１，２３６）」が抽出されている。
「項目名」は、項目種類に対応する文字列である。図５の例では、例えば、「電話番号」に対応する項目名として「ＴＥＬ」、「本社」が抽出されている。 In FIG. 5, the “item type” represents the type of extracted item information. In the example of FIG. 5, four types of items, "issue date", "billing amount", "telephone number", and "line subtotal", are extracted. Of the item types, the "line subtotal" item is item information indicating the breakdown of the billed amount.
The "item value" is a data value for the item type and is extracted from the character string described in the form image. In the example of FIG. 5, for example, “4,450” and “5,000” are extracted from the regions 406 and 408 in FIG. 4, respectively. However, from the area 407, the character recognition has failed, so the question is "?". Further, from the area 408, "(1,236)" with parentheses before and after the number is extracted.
The "item name" is a character string corresponding to the item type. In the example of FIG. 5, for example, "TEL" and "head office" are extracted as item names corresponding to "telephone number".

「正規形」は、項目種類毎に設定された書式ルールを適用することにより、項目値が変換（正規化）された文字列である。例えば、「発行日」については、「ＹＹＹＹＭＭＤＤ」という書式ルールが設定されている。これにより、図５の例では、Ｎｏ．１についての「２０１９年３月８日」という項目値は「２０１９０３０８」という文字列に正規化されている。同様に、「請求金額」や「行小計」については、「小数点以下２桁の実数」という書式ルールが、また、「電話番号」については、「数字のみで構成される文字列」という書式ルールが設定されている。これにより、帳票毎に異なる項目値の表記の揺れを解消することができる。 The "normal form" is a character string in which the item value is converted (normalized) by applying the format rule set for each item type. For example, for the "issue date", a format rule of "YYYYMMDD" is set. As a result, in the example of FIG. 5, No. The item value "March 8, 2019" for 1 is normalized to the character string "20190308". Similarly, for "billing amount" and "line subtotal", the format rule is "real number with two decimal places", and for "phone number", the format rule is "character string consisting only of numbers". Is set. As a result, it is possible to eliminate fluctuations in the notation of item values that differ for each form.

「抽出成否」には、項目値の抽出に成功したか失敗したかを示す情報と、抽出が失敗した場合にその原因（抽出失敗原因）である、抽出成否情報が記入される。例えば、行小計項目についての抽出成否の欄には、行小計である項目値（文字列）の抽出に成功した場合、成功を示す「〇」が記入される。また、抽出に失敗した場合は、抽出失敗原因が記入される。
なお、抽出に失敗した場合とは、文字列がそもそも抽出できなかった場合に加えて、抽出された文字列が設定された書式ルールに一致していない場合なども含まれる。本実施形態において、抽出失敗原因としては、具体的には、例えば、「文字抽出失敗」、「型不一致」などがある。 In the "extraction success / failure", information indicating whether the extraction of the item value was successful or unsuccessful, and the extraction success / failure information which is the cause (extraction failure cause) when the extraction fails are entered. For example, in the extraction success / failure column for the line subtotal item, when the item value (character string) which is the line subtotal is successfully extracted, "○" indicating success is entered. If the extraction fails, the cause of the extraction failure is entered.
The case where the extraction fails includes the case where the character string cannot be extracted in the first place and the case where the extracted character string does not match the set format rule. Specifically, in the present embodiment, the cause of the extraction failure includes, for example, "character extraction failure" and "type mismatch".

抽出失敗原因のうち、「文字抽出失敗」は、項目に含まれる文字列がそもそも抽出できない場合に記入される。例えば、文字抽出失敗は、行小計項目についての領域（行小計領域）において、文字列部分とそれ以外の画素が分離できない場合に生じる。例えば、行小計領域内には、書き込みがあったり、ノイズが重畳されていたり、網掛けがなされていたり、背景色が濃いなどの場合がある。このような場合、文字認識処理により文字列が抽出できないことがある。 Among the causes of extraction failure, "character extraction failure" is entered when the character string included in the item cannot be extracted in the first place. For example, a character extraction failure occurs when the character string portion and other pixels cannot be separated in the area for the line subtotal item (line subtotal area). For example, in the line subtotal area, there may be writing, noise may be superimposed, shading may be performed, or the background color may be dark. In such a case, the character string may not be extracted by the character recognition process.

例えば、図４に示す帳票画像４００の例では、領域４０７には網掛けがなされている。このよう帳票画像４００について、文字認識前処理として二値化処理を行うと、領域４０７は、図８の二値化処理後帳票画像８００において領域８０１で示されるように、黒く塗りつぶされたような画像となる場合がある。
このような場合、領域４０７からは、文字列を抽出することができない。ただし、領域４０７は、帳票画像４００における他の項目との位置関係から、行小計領域であると推測されるため、図５のＮｏ．５に示すように、項目種類には「行小計？」と記入される。そして、文字列が抽出できないため、項目値には「？」、正規形にも「？」、抽出成否には「文字抽出失敗」と記入される。 For example, in the example of the form image 400 shown in FIG. 4, the area 407 is shaded. When the form image 400 is binarized as the character recognition pre-processing, the area 407 is filled in black as shown by the area 801 in the post-binarized form image 800 of FIG. It may be an image.
In such a case, the character string cannot be extracted from the area 407. However, since the area 407 is presumed to be the line subtotal area from the positional relationship with other items in the form image 400, the No. 5 in FIG. As shown in 5, the item type is entered as "line subtotal?". Then, since the character string cannot be extracted, "?" Is entered in the item value, "?" Is entered in the normal form, and "character extraction failure" is entered in the extraction success / failure.

抽出失敗原因が文字抽出失敗である場合、文字認識処理の前に行う、ノイズ除去・二値化処理などの文字認識前処理を、条件を変更して行うことにより、正しい文字列が抽出されることがある。 If the cause of the extraction failure is a character extraction failure, the correct character string is extracted by performing the character recognition pre-processing such as noise removal / binarization processing that is performed before the character recognition processing by changing the conditions. Sometimes.

一方、抽出失敗原因のうち、「型不一致」は、行小計領域から項目値（文字列）を抽出することはできたものの、抽出された文字列が行小計としての文字列に対してあらかじめ設定されていた書式ルールに一致しない場合に記入される。
例えば、行小計としての文字列に対しては、「数字とカンマのみからなり、千の位の数字の後にカンマが存在する」という書式ルールがあらかじめ設定されているとする。この場合、図４の例では、帳票画像４００内の領域４０９は、文字抽出結果が「（１，２３６）」であり、数字とカンマ以外の括弧「（」「）」が含まれているため、書式ルールに一致していないことになる。 On the other hand, among the causes of extraction failure, "type mismatch" is set in advance for the character string as the line subtotal, although the item value (character string) could be extracted from the line subtotal area. Filled in if it does not match the formatting rules that were used.
For example, for a character string as a line subtotal, it is assumed that a format rule "consisting only of numbers and commas, and a comma exists after a number in the thousands" is set in advance. In this case, in the example of FIG. 4, the area 409 in the form image 400 has a character extraction result of "(1,236)" and includes parentheses "(" and ")" other than numbers and commas. , Will not match the formatting rules.

このような場合、領域４０９は、帳票画像４００における他の項目との位置関係から、行小計領域であると推測されるため、図５のＮｏ．７に示すように、項目種類には「行小計？」と記入される。また、文字列を抽出できたため、項目値としては「（１，２３６）」が記入される。
しかし、項目値が書式ルールに一致していないため、正規形には「？」、抽出成否には「型不一致」と記入される。
なお、「型不一致」の例としては、これ以外にも、該当領域内にチェックマークなどの書き込みがあったり、抽出された文字列の前後に非対応文字が存在したり、カンマがなかったりする場合なども考えられる。 In such a case, the area 409 is presumed to be the line subtotal area from the positional relationship with other items in the form image 400. As shown in 7, the item type is entered as "line subtotal?". Further, since the character string can be extracted, "(1,236)" is entered as the item value.
However, since the item value does not match the format rule, "?" Is entered in the normal form and "type mismatch" is entered in the extraction success / failure.
In addition to this, as an example of "type mismatch", there are writings such as check marks in the corresponding area, non-corresponding characters before and after the extracted character string, and no commas. There may be cases.

抽出失敗原因が型不一致である場合、書式ルールに一致するように正規化することにより、正しい文字列が抽出されることがある。なお、ユーザが手入力を行うことにより、正しい文字列に修正することもできる。 If the cause of the extraction failure is a type mismatch, the correct character string may be extracted by normalizing to match the format rules. It should be noted that the user can manually input the character string to correct the character string.

＜データ入力支援処理＞
次に、図９を用いて、データ入力支援処理について説明する。図９は、本実施形態のデータ入力支援処理の流れを示すフローチャートである。
本フローチャートの各ステップにおける処理は、ＣＰＵなどから構成される制御部１０１が、ＲＯＭ１０２に記憶された制御プログラムを読み出して実行することにより行われる。 <Data entry support processing>
Next, the data input support process will be described with reference to FIG. FIG. 9 is a flowchart showing the flow of the data input support process of the present embodiment.
The processing in each step of this flowchart is performed by the control unit 101 composed of a CPU or the like reading and executing the control program stored in the ROM 102.

まず、ステップＳ９０１において、制御部１０１は、ＨＤＤ１０４に記憶されている帳票画像４００を取得する。
次に、ステップＳ９０２において、制御部１０１は、帳票画像４００に対して文字認識処理を行う。なお、文字認識処理を行う前には、必要に応じて、二値化処理などの文字認識前処理を行う。これにより、文字認識結果として、帳票画像４００から各項目情報に対応する文字列などが取得される。
次に、ステップＳ９０３において、制御部１０１は、文字認識結果に基づいて、帳票画像４００から項目情報５０１を取得する。項目情報には、抽出成否に関する情報も含まれる。 First, in step S901, the control unit 101 acquires the form image 400 stored in the HDD 104.
Next, in step S902, the control unit 101 performs character recognition processing on the form image 400. Before performing the character recognition processing, character recognition pre-processing such as binarization processing is performed as necessary. As a result, as a character recognition result, a character string or the like corresponding to each item information is acquired from the form image 400.
Next, in step S903, the control unit 101 acquires item information 501 from the form image 400 based on the character recognition result. The item information also includes information on the success or failure of extraction.

次に、ステップＳ９０４において、制御部１０１は、帳票画像４００及び項目情報５０１をユーザに確認又は修正をさせるために、確認画面を生成し表示させる。 Next, in step S904, the control unit 101 generates and displays a confirmation screen in order for the user to confirm or correct the form image 400 and the item information 501.

図６は、ステップＳ９０４において表示される確認画面６００の一例である。
確認画面６００の概略について説明すると、確認画面６００の左側には、帳票画像４００をサムネイル表示した俯瞰画像６０１が表示される。確認画面６００の右側には、帳票画像４００から抽出された各項目情報が表示される。また、確認画面６００の右側には、下部に、確認作業を終了し、次の画面に移行させるための終了ボタン６１０が表示される。確認画面６００は、ポインタ６１１を用いた操作、又は、直接に指を用いたタッチによる操作などにより、画面上におけるユーザの指示を受け付ける。
なお、確認画面６００の詳細については、改めて後述する。 FIG. 6 is an example of the confirmation screen 600 displayed in step S904.
Explaining the outline of the confirmation screen 600, a bird's-eye view image 601 in which the form image 400 is displayed as a thumbnail is displayed on the left side of the confirmation screen 600. On the right side of the confirmation screen 600, each item information extracted from the form image 400 is displayed. Further, on the right side of the confirmation screen 600, an end button 610 for ending the confirmation work and moving to the next screen is displayed at the lower part. The confirmation screen 600 receives a user's instruction on the screen by an operation using the pointer 611, an operation by directly touching with a finger, or the like.
The details of the confirmation screen 600 will be described later.

次に、ステップＳ９０５において、制御部１０１は、ユーザによる確認が終了したか否かを判断する。
確認画面６００においてユーザにより終了ボタン６１０がタッチされると、制御部１０１は確認が終了したと判断し、図９のデータ入力支援処理を終了する。
一方、確認画面においてユーザにより終了ボタン６１０以外の領域がタッチされると、制御部１０１は、項目情報の修正が必要であるために、確認が終了していないと判断し、処理をステップＳ９０６に進める。 Next, in step S905, the control unit 101 determines whether or not the confirmation by the user has been completed.
When the user touches the end button 610 on the confirmation screen 600, the control unit 101 determines that the confirmation has been completed, and ends the data input support process of FIG.
On the other hand, when the user touches an area other than the end button 610 on the confirmation screen, the control unit 101 determines that the confirmation has not been completed because the item information needs to be corrected, and proceeds to step S906. Proceed.

ステップＳ９０６において、制御部１０１は、ユーザにより指定された確認画面６００上の位置の座標（タッチ座標）を取得する。
次に、ステップＳ９０７において、制御部１０１は、タッチ座標が、行小計項目が表示されている領域（行小計領域）内であるか否かを判断する。
ユーザがいずれかの行小計領域を指定した場合、すなわち、タッチ座標が行小計領域内である場合、制御部１０１は処理をステップＳ９０８に進める。
一方、タッチ座標が行小計領域外である場合は、制御部１０１は処理をステップＳ９１１に進める。そして、制御部１０１は、タッチされた位置が抽出修正処理（後述）の対象外の領域であることをユーザに通知し、ステップＳ９０４の確認画面６００の表示に戻す。 In step S906, the control unit 101 acquires the coordinates (touch coordinates) of the position on the confirmation screen 600 designated by the user.
Next, in step S907, the control unit 101 determines whether or not the touch coordinates are within the area where the line subtotal item is displayed (line subtotal area).
When the user specifies any row subtotal area, that is, when the touch coordinates are within the row subtotal area, the control unit 101 advances the process to step S908.
On the other hand, when the touch coordinates are outside the row subtotal area, the control unit 101 advances the process to step S911. Then, the control unit 101 notifies the user that the touched position is an area not subject to the extraction correction process (described later), and returns to the display of the confirmation screen 600 in step S904.

ここで、図１３を用いて、確認画面６００においてタッチ座標が行小計領域内にあるか否かを判断する手法について説明する。
図１３は、図６の確認画面６００における俯瞰画像６０１を拡大した拡大画像１３００を示したものである。
図１３において、座標のＸ方向及びＹ方向を１３０１のように示す。行小計領域４０６〜４０９を囲む各矩形座標は、（左端Ｘ、右端Ｘ、上端Ｙ、下端Ｙ）のように表される。すなわち、各行小計領域４０６〜４０９は、それぞれ、図１３（ａ）に示すように、（Ｌ１，Ｒ１，Ｔ１，Ｂ１）、（Ｌ２，Ｒ２，Ｔ２，Ｂ２）、（Ｌ３，Ｒ３，Ｔ３，Ｂ３）、（Ｌ４，Ｒ４，Ｔ４，Ｂ４）のように表される。 Here, a method of determining whether or not the touch coordinates are within the row subtotal area on the confirmation screen 600 will be described with reference to FIG.
FIG. 13 shows an enlarged image 1300 which is an enlarged view of the bird's-eye view image 601 on the confirmation screen 600 of FIG.
In FIG. 13, the X direction and the Y direction of the coordinates are shown as 1301. Each rectangular coordinate surrounding the row subtotal areas 406 to 409 is represented as (left end X, right end X, upper end Y, lower end Y). That is, as shown in FIG. 13A, each row subtotal region 406 to 409 is (L1, R1, T1, B1), (L2, R2, T2, B2), (L3, R3, T3, B3, respectively. ), (L4, R4, T4, B4).

ここで、タッチ座標が、Ｘ方向において行小計領域内にあるかを判断する場合、図１３（ａ）に示すように、左端Ｌ１〜Ｌ４の中の最小値Ｘｍｉｎから、右端Ｒ１〜Ｒ４の中の最大値Ｘｍａｘの間にあるか否かを判断する。
また、Ｙ方向において行小計領域内にあるかを判断する場合、図１３（ｂ）に示すように、行小計領域の一番上にある領域４０６の上端Ｔ１となる最小値Ｙｍｉｎから、一番下にある領域４０９の下端Ｂ４となる最大値Ｙｍａｘの間にあるか否かを判断する。 Here, when determining whether the touch coordinates are within the row subtotal area in the X direction, as shown in FIG. 13 (a), from the minimum value Xmin in the left ends L1 to L4, in the right ends R1 to R4. It is determined whether or not it is between the maximum value Xmax of.
Further, when determining whether or not the device is within the row subtotal region in the Y direction, as shown in FIG. 13B, the minimum value Ymin, which is the upper end T1 of the region 406 at the top of the row subtotal region, is the highest. It is determined whether or not it is between the maximum value Ymax which is the lower end B4 of the lower region 409.

その上で、さらに、行小計とみなされる領域の高さ範囲内にタッチ座標が含まれているか確認する。具体的には、図１３（ｃ）に示すように、まず、高さ方向において各行小計領域の中心座標ｍ１〜ｍ４を取得する。すなわち、中心座標間の間隔ｓ１〜ｓ３がほぼ同じとなるような中心座標ｍ１〜ｍ４を取得する。次に、中心座標ｍ１〜ｍ４を中心として高さ方向にｈ１〜ｈ４の範囲にタッチ座標が含まれるかを判定する。なお、各範囲にはある程度のマージンを持たせてもよい。 Then, it is further confirmed whether the touch coordinates are included in the height range of the area regarded as the row subtotal. Specifically, as shown in FIG. 13C, first, the center coordinates m1 to m4 of each row subtotal region are acquired in the height direction. That is, the center coordinates m1 to m4 are acquired so that the intervals s1 to s3 between the center coordinates are substantially the same. Next, it is determined whether or not the touch coordinates are included in the range of h1 to h4 in the height direction with the center coordinates m1 to m4 as the center. In addition, each range may have a certain margin.

図９のフローチャートの説明に戻り、ステップＳ９０８において、制御部１０１は、タッチ座標に対応する項目を項目情報５０１から検索し、その項目の抽出成否情報を取得する。
次に、ステップＳ９０９において、制御部１０１は、取得した抽出成否情報に基づいて、抽出修正処理を行う。抽出修正処理の詳細については、図１０を用いて後述する。
抽出修正処理が終了したら、ステップＳ９１０において、制御部１０１は、修正した項目情報を、不図示の外部システムに送信し、登録する。これにより、図９のデータ入力支援処理は終了する。 Returning to the description of the flowchart of FIG. 9, in step S908, the control unit 101 searches the item information 501 for the item corresponding to the touch coordinates, and acquires the extraction success / failure information of the item.
Next, in step S909, the control unit 101 performs extraction correction processing based on the acquired extraction success / failure information. The details of the extraction correction process will be described later with reference to FIG.
When the extraction correction process is completed, in step S910, the control unit 101 transmits the corrected item information to an external system (not shown) and registers it. As a result, the data input support process of FIG. 9 is completed.

＜確認画面＞
再び、図６を用いて、前述のステップＳ９０４で生成される確認画面６００について説明する。
確認画面６００は、ユーザが項目情報５０１の確認又は修正を行うことができるように、抽出された項目情報５０１をユーザに提示するための画面である。ユーザは、確認画面６００において、確認作業として、項目値が正しい領域から検出されているか、また正しい項目値が抽出されているかの確認を行い、誤りがあればその修正を行う。確認画面６００には、俯瞰画像６０１、項目種類テキスト６０２ａ〜ｄ、項目値テキスト６０３ａ〜ｃ、ｄａ、ｄｂ、項目画像６０４ａ〜ｃ、ｄａ、ｄｂ、終了ボタン６１０、などが含まれる。 <Confirmation screen>
Again, with reference to FIG. 6, the confirmation screen 600 generated in step S904 described above will be described.
The confirmation screen 600 is a screen for presenting the extracted item information 501 to the user so that the user can confirm or modify the item information 501. On the confirmation screen 600, as a confirmation operation, the user confirms whether the item value is detected from the correct area and whether the correct item value is extracted, and if there is an error, corrects the error. The confirmation screen 600 includes a bird's-eye view image 601, item type texts 602a to d, item value texts 603a to c, da, db, item images 604a to c, da, db, an end button 610, and the like.

俯瞰画像６０１は、帳票画像４００をサムネイル表示したものである。図６の俯瞰画像６０１では、項目情報５０１おける項目値と項目名に対応する領域４０１、４０３、４０３ａ、４０５、４０５ａ、４０５ｂ、４０６、４０８がハイライト表示されている。また、項目値に対応する領域４０１、４０３、４０５、４０６、４０８と、項目名に対応する領域４０３ａ、４０５ａ、４０５ｂとは、それぞれ区別してハイライト表示されている。俯瞰画像６０１は、スワイプ操作やピンチイン・ピンチアウト操作を行うことにより、表示位置や表示倍率を変更することが可能である。 The bird's-eye view image 601 is a thumbnail display of the form image 400. In the bird's-eye view image 601 of FIG. 6, the areas 401, 403, 403a, 405, 405a, 405b, 406, and 408 corresponding to the item value and the item name in the item information 501 are highlighted. Further, the areas 401, 403, 405, 406, 408 corresponding to the item values and the areas 403a, 405a, 405b corresponding to the item names are highlighted separately. The bird's-eye view image 601 can change the display position and the display magnification by performing a swipe operation or a pinch-in / pinch-out operation.

項目種類テキスト６０２ａ〜ｄには、前述のステップＳ９１０で外部システムに送信される項目種類の名称が表示される。図６の例では、項目種類テキスト６０２ａには「発行日」が、項目種類テキスト６０２ｂには「請求金額」が、項目種類テキスト６０２ｃには「電話番号」が、項目種類テキスト６０２ｄには「行小計」が、それぞれ、表示されている。 In the item type texts 602a to d, the name of the item type transmitted to the external system in step S910 described above is displayed. In the example of FIG. 6, the item type text 602a has an "issue date", the item type text 602b has a "billing amount", the item type text 602c has a "phone number", and the item type text 602d has a "line". "Subtotal" is displayed respectively.

項目値テキスト６０３ａ〜ｃ、ｄａ、ｄｂには、俯瞰画像６０１にハイライト表示された領域に対応する項目値がテキストとして表示される。項目値テキストの欄ではユーザ入力が可能であり、文字認識結果に誤りがある場合には、ユーザはこの欄において修正を行うことができる。 In the item value texts 603a to c, da, and db, the item values corresponding to the areas highlighted in the bird's-eye view image 601 are displayed as text. User input is possible in the item value text field, and if there is an error in the character recognition result, the user can make corrections in this field.

項目画像６０４ａ〜ｃ、ｄａ、ｄｂには、俯瞰画像６０１にハイライト表示された領域に対応する項目値や項目名から抽出した画像が表示される。ユーザにより各項目画像が選択されると、選択された項目画像に対応する項目値テキストが確認画面６００上において表示されるように、さらに、ハイライト表示されるように、構成してもよい。 In the item images 604a to c, da, and db, images extracted from the item values and item names corresponding to the areas highlighted in the bird's-eye view image 601 are displayed. When each item image is selected by the user, the item value text corresponding to the selected item image may be displayed on the confirmation screen 600 and may be further highlighted.

終了ボタン６１０は、確認画面６００を終了させるためのボタンである。確認画面６００において項目情報５０１の確認作業が終了すると、ユーザは終了ボタン６１０を押下し、確認画面６００を閉じる。 The end button 610 is a button for ending the confirmation screen 600. When the confirmation work of the item information 501 is completed on the confirmation screen 600, the user presses the end button 610 to close the confirmation screen 600.

＜抽出修正処理＞
次に、図１０を用いて、図９のステップＳ９０９で実行される抽出修正処理について説明する。図１０は、抽出修正処理の詳細を示すフローチャートである。
本フローチャートの各ステップにおける処理も、ＣＰＵなどから構成される制御部１０１が、ＲＯＭ１０２に記憶された制御プログラムを読み出して実行することにより行われる。 <Extraction correction processing>
Next, the extraction correction process executed in step S909 of FIG. 9 will be described with reference to FIG. FIG. 10 is a flowchart showing the details of the extraction correction process.
The processing in each step of this flowchart is also performed by the control unit 101 composed of a CPU or the like reading and executing the control program stored in the ROM 102.

ステップＳ１００１において、制御部１０１は、前述のステップＳ９０６でユーザがタッチした項目に対応する項目情報５０１の抽出成否に記入された抽出失敗原因を取得し、抽出失敗理由が「型不一致」であるか否かを判断する。
抽出失敗理由が「型不一致」である場合、制御部１０１は処理をステップＳ１００２に進める。一方、抽出失敗理由が「型不一致」でない場合は、処理をステップＳ１００４に進める。 In step S1001, the control unit 101 acquires the extraction failure cause entered in the extraction success / failure of the item information 501 corresponding to the item touched by the user in the above-mentioned step S906, and whether the extraction failure reason is "type mismatch". Judge whether or not.
If the reason for the extraction failure is "type mismatch", the control unit 101 advances the process to step S1002. On the other hand, if the reason for the extraction failure is not "type mismatch", the process proceeds to step S1004.

ステップＳ１００２において、制御部１０１は、ステップＳ９０６でユーザがタッチし項目に対応する項目情報５０１から項目値を取得して、更新した確認画面を生成し表示させる。 In step S1002, the control unit 101 acquires an item value from the item information 501 corresponding to the item touched by the user in step S906, and generates and displays an updated confirmation screen.

ここで、図７を用いて、ステップＳ１００２で生成される確認画面７００について説明する。図７は、確認画面７００の一例である。
例えば、図６に示されるように、ステップＳ９０６で、確認画面６００においてユーザが領域Ａを指定したとする。この場合、図７に示されるように、確認画面７００において、俯瞰画像６０１は行小計領域７０７が拡大表示された拡大画像７０１に変換される。それに伴い、「品名」などの他の列との比較表示もできるように、タッチされた列を固定して他の列をスクロール可能な表示方法に変換される。スクロールさせるためには、左ボタン７０９又は右ボタン７１０を利用すればよい。 Here, the confirmation screen 700 generated in step S1002 will be described with reference to FIG. 7. FIG. 7 is an example of the confirmation screen 700.
For example, as shown in FIG. 6, it is assumed that the user specifies the area A on the confirmation screen 600 in step S906. In this case, as shown in FIG. 7, on the confirmation screen 700, the bird's-eye view image 601 is converted into an enlarged image 701 in which the line subtotal area 707 is enlarged and displayed. Along with this, the touched column is fixed and the other columns are converted to a scrollable display method so that the comparison display with other columns such as "product name" can be performed. To scroll, the left button 709 or the right button 710 may be used.

ここで、ユーザが指定した領域Ａが行小計領域７０４であるため、ステップＳ１００１で取得される抽出失敗原因は「型不一致」となる。この場合、文字認識処理そのものはされているため、確認画面７００において項目値テキスト７０５には文字認識結果がそのまま表示される。また、対応する項目画像７０６も併せて表示される。
図７の例では、文字認識処理された項目値テキスト７０５に「（」「）」が含まれているため、あらかじめ設定されている「数字とカンマのみからなる」という行小計の書式ルールに一致していない。そのため、書式ルールに一致していないことをユーザに分かりやすく通知するために、項目値テキスト７０５に色を塗るなどの強調表示がされている。 Here, since the area A specified by the user is the line subtotal area 704, the cause of the extraction failure acquired in step S1001 is "type mismatch". In this case, since the character recognition process itself is performed, the character recognition result is displayed as it is on the item value text 705 on the confirmation screen 700. In addition, the corresponding item image 706 is also displayed.
In the example of FIG. 7, since the item value text 705 that has undergone character recognition processing contains "(" and ")", it is one of the preset format rules for the line subtotal "consisting of only numbers and commas". I haven't done it. Therefore, in order to notify the user in an easy-to-understand manner that the format rules are not matched, the item value text 705 is highlighted by coloring it.

図１０のフローチャートの説明に戻り、ステップＳ１００３において、確認画面７００において行われるユーザによる修正の指示に基づいて、制御部１０１は項目値を修正する。
図７の例では、項目値テキスト７０５に対して、行小計の書式ルールに一致するように、「（」「）」を削除する修正が行われる。修正がなされると、図１０の抽出修正処理は終了する。
ステップＳ１００１で抽出失敗理由が「型不一致」でなかった場合、ステップＳ１００４において、制御部１０１は、抽出失敗理由が「文字抽出失敗」であるか否かを判断する。
抽出失敗理由が「文字抽出失敗」である場合、すなわち、前述のステップＳ９０６においてタッチ座標が行小計領域内であったにもかかわらず、文字列が抽出できなかった場合、制御部１０１は処理をステップＳ１００５に進める。それ以外の場合は、処理をステップＳ１００９に進める。 Returning to the description of the flowchart of FIG. 10, in step S1003, the control unit 101 corrects the item value based on the correction instruction by the user performed on the confirmation screen 700.
In the example of FIG. 7, the item value text 705 is modified by deleting "(" and ")" so as to match the format rule of the line subtotal. When the correction is made, the extraction correction process of FIG. 10 ends.
If the extraction failure reason is not "type mismatch" in step S1001, in step S1004, the control unit 101 determines whether or not the extraction failure reason is "character extraction failure".
If the reason for the extraction failure is "character extraction failure", that is, if the character string cannot be extracted even though the touch coordinates are within the line subtotal area in step S906 described above, the control unit 101 performs processing. Proceed to step S1005. Otherwise, the process proceeds to step S1009.

ステップＳ１００５において、制御部１０１は、条件を変更して文字認識前処理を行う。
例えば、帳票画像４００に対して文字認識前処理として二値化処理を行った結果、網掛けされている領域４０７が、図８に示すように、領域８０１のように黒く塗りつぶされたような画像になったとする。このような場合、ステップＳ１００５では、文字列が抽出できる画像が取得できるように、条件を変更して改めて文字認識前処理を行う。文字認識前処理の条件を変更する例としては、二値化処理における閾値の変更や、ノイズ除去方法の変更などが考えられる。 In step S1005, the control unit 101 changes the conditions and performs character recognition preprocessing.
For example, as a result of performing binarization processing on the form image 400 as character recognition preprocessing, as shown in FIG. 8, the shaded area 407 is an image that is painted black like the area 801. Suppose that In such a case, in step S1005, the character recognition preprocessing is performed again by changing the conditions so that an image from which the character string can be extracted can be obtained. Examples of changing the conditions of the character recognition preprocessing include changing the threshold value in the binarization processing and changing the noise removal method.

ステップＳ１００６において、制御部１０１は、ステップＳ１００５で文字認識前処理をした帳票画像４００に対して文字認識処理を改めて行う。ステップＳ１００６で行われる文字認識処理は、ステップＳ９０２で行われる文字認識処理と同じ手法であっても、違う手法であっても構わない。 In step S1006, the control unit 101 performs character recognition processing again on the form image 400 that has undergone character recognition preprocessing in step S1005. The character recognition process performed in step S1006 may be the same method as the character recognition process performed in step S902, or may be a different method.

ステップＳ１００７において、制御部１０１は、項目情報の抽出に成功したか否かをユーザに確認させるために、更新した確認画面を生成し表示する。
ステップＳ１００８において、制御部１０１は、ユーザによる確認が終了したか否かを判断する。
ユーザによる確認が行われた場合、本フローチャートの抽出確認処理は終了する。ユーザによる確認が行われない場合、制御部１０１は処理をステップＳ１００９に進める。 In step S1007, the control unit 101 generates and displays an updated confirmation screen in order to make the user confirm whether or not the item information has been successfully extracted.
In step S1008, the control unit 101 determines whether or not the confirmation by the user has been completed.
When the user confirms, the extraction confirmation process of this flowchart ends. If the user does not confirm, the control unit 101 advances the process to step S1009.

ここで、図１１を用いて、ステップＳ１００７において表示される確認画面１１００について説明する。図１１は、確認画面１１００の一例である。
例えば、図６に示されるように、ステップＳ９０６で、確認画面６００においてユーザが領域Ｂを指定したとする。この場合、図１１に示されるように、確認画面１１００において、俯瞰画像６０１は行小計領域７０７が拡大表示された拡大画像１１０１に変換される。 Here, the confirmation screen 1100 displayed in step S1007 will be described with reference to FIG. FIG. 11 is an example of the confirmation screen 1100.
For example, as shown in FIG. 6, it is assumed that the user specifies the area B on the confirmation screen 600 in step S906. In this case, as shown in FIG. 11, on the confirmation screen 1100, the bird's-eye view image 601 is converted into the enlarged image 1101 in which the line subtotal area 707 is enlarged and displayed.

ここで、ユーザが指定した領域Ｂが網掛けされている行小計領域１１０２であるため、確認画面１１００には、行小計領域１１０２に対して条件を変更して文字認識前処理を行った後に文字認識処理を行った結果が項目値テキスト１１０３に表示されている。なお、項目値テキスト１１０３は、ユーザによる確認を容易にするために、色を塗るなどして強調表示されている。また、項目値テキスト１１０３に対応して、項目画像１１０４も併せて表示されている。 Here, since the area B specified by the user is the shaded line subtotal area 1102, the confirmation screen 1100 shows the characters after changing the conditions for the line subtotal area 1102 and performing the character recognition preprocessing. The result of the recognition process is displayed in the item value text 1103. The item value text 1103 is highlighted by painting or the like in order to facilitate confirmation by the user. In addition, the item image 1104 is also displayed corresponding to the item value text 1103.

再び図１０のフローチャートの説明に戻り、ステップＳ１００９において、制御部１０１は、ユーザの入力にしたがい項目値を修正する。例えば、ユーザは、項目値テキスト１１０３に正しい文字列が表示されていないと判断した場合、正しい文字列を手入力することにより修正する。
以上で、抽出確認処理を終了する。 Returning to the description of the flowchart of FIG. 10 again, in step S1009, the control unit 101 corrects the item value according to the input of the user. For example, when the user determines that the correct character string is not displayed in the item value text 1103, the user corrects it by manually inputting the correct character string.
This completes the extraction confirmation process.

なお、図１０のフローチャートでは、文字認識前処理についての条件の変更は１回のみ行うようにしたが、文字列の抽出に成功しなかった場合には、複数回、条件を変更して文字認識前処理を行っても構わない。
また、本実施形態では、行小計に対して抽出確認処理を行う例について説明したが、抽出確認処理の対象は行小計に限られるものではない。行小計以外の項目情報に対しても、帳票画像から必要な項目情報で抽出できなかった項目値があった場合、前述と同様の抽出確認処理を適用することができる。 In the flowchart of FIG. 10, the condition for the character recognition preprocessing is changed only once, but if the character string extraction is not successful, the condition is changed a plurality of times for character recognition. Preprocessing may be performed.
Further, in the present embodiment, an example in which the extraction confirmation processing is performed on the row subtotal has been described, but the target of the extraction confirmation processing is not limited to the row subtotal. For item information other than the line subtotal, if there is an item value that could not be extracted from the form image with the required item information, the same extraction confirmation process as described above can be applied.

以上のように、本実施形態では、帳票から各項目に含まれる文字列を抽出する抽出処理において、文字列の抽出に失敗した場合、抽出失敗原因に応じて条件を変更して改めて抽出処理を行う。
これにより、抽出された文字列を確認画面においてユーザが確認・修正する確認作業において、抽出に失敗した文字列をユーザが自ら見つけて、手入力により修正する場合に比べて、作業の効率化を図ることができる。 As described above, in the present embodiment, when the extraction process for extracting the character string included in each item from the form fails, the condition is changed according to the cause of the extraction failure and the extraction process is performed again. conduct.
As a result, in the confirmation work in which the user confirms and corrects the extracted character string on the confirmation screen, the work efficiency is improved as compared with the case where the user finds the extracted character string by himself and manually corrects it. Can be planned.

＜実施形態２＞
実施形態１では、抽出に失敗した行小計項目に対して抽出修正処理を行い、確認画面を更新する例について説明した。
実施形態２では、さらに、抽出修正処理を行った行小計項目を、確認画面において適切な位置に表示するように処理をする。 <Embodiment 2>
In the first embodiment, an example in which the extraction correction process is performed on the row subtotal item for which extraction has failed and the confirmation screen is updated has been described.
In the second embodiment, the line subtotal item for which the extraction correction processing has been performed is further processed so as to be displayed at an appropriate position on the confirmation screen.

図１２は、本実施形態において表示される確認画面１２００の一例である。なお、図１２の例では、確認画面１２００は、抽出修正処理が行われた後の状態を示しており、行小計領域１２０１、１２０２には、抽出修正処理の対象となる項目値が記入されている。 FIG. 12 is an example of the confirmation screen 1200 displayed in the present embodiment. In the example of FIG. 12, the confirmation screen 1200 shows the state after the extraction correction processing is performed, and the item values to be the target of the extraction correction processing are entered in the line subtotal areas 1201 and 1202. There is.

図１４は、実施形態２におけるデータ入力支援処理のフローチャートである。図１４のフローチャートでは、実施形態１で説明した図９のフローチャートに対して、ステップＳ１４０１からステップＳ１４０３が追加されている。なお、ステップＳ９０１からステップＳ９０９までの処理は、実施形態１と同様である。
また、本フローチャートの各ステップにおける処理も、ＣＰＵなどから構成される制御部１０１が、ＲＯＭ１０２に記憶された制御プログラムを読み出して実行することにより行われる。
以下では、実施形態１との相違であるステップＳ１４０１からステップＳ１４０３までの処理について説明する。 FIG. 14 is a flowchart of the data input support process according to the second embodiment. In the flowchart of FIG. 14, steps S1401 to S1403 are added to the flowchart of FIG. 9 described in the first embodiment. The processes from step S901 to step S909 are the same as those in the first embodiment.
Further, the processing in each step of this flowchart is also performed by the control unit 101 composed of a CPU or the like reading and executing the control program stored in the ROM 102.
Hereinafter, the processes from step S1401 to step S1403, which are different from the first embodiment, will be described.

ステップＳ１４０１において、制御部１０１は、抽出修正処理が行われた項目値と同一の項目種類として、他の項目値が存在するか否かを確認する。
同一の項目種類として他の項目値が存在する場合、制御部１０１は処理をステップＳ１４０２に進める。同一の項目種類の項目値テキストが存在しない場合は、処理をステップＳ１４０３に進める。
図１２の例では、抽出修正処理が行われた行小計領域１２０１、１２０２と同じ列に、抽出済みの行小計領域７０２、７０３が存在する。 In step S1401, the control unit 101 confirms whether or not there is another item value as the same item type as the item value for which the extraction correction process has been performed.
When another item value exists as the same item type, the control unit 101 advances the process to step S1402. If the item value text of the same item type does not exist, the process proceeds to step S1403.
In the example of FIG. 12, the extracted row subtotal areas 702 and 703 exist in the same column as the row subtotal areas 1201 and 1202 for which the extraction correction process has been performed.

ステップＳ１４０２において、制御部１０１は、抽出修正処理が行われた項目値を、確認画面６００において表示されている項目値の位置に対応するように、確認画面１２００の右側の適切な位置に表示する。
図１２の例では、行小計領域１２０１から抽出された項目値テキスト１２０３ａが、拡大画面１２０５で示される行小計領域と同様に、項目値テキスト６０３ｄａと項目値テキスト６０３ｄｂの間に表示される。その際、項目画像１２０４ａも項目値テキスト１２０３ａに対応して表示される。
また、項目値テキスト１２０３ｂと項目画像１２０４ｂも、項目値テキスト１２０３ａと項目画像１２０４ａと同様に表示される。 In step S1402, the control unit 101 displays the item value for which the extraction correction process has been performed at an appropriate position on the right side of the confirmation screen 1200 so as to correspond to the position of the item value displayed on the confirmation screen 600. ..
In the example of FIG. 12, the item value text 1203a extracted from the line subtotal area 1201 is displayed between the item value text 603da and the item value text 603db, similarly to the line subtotal area shown on the enlarged screen 1205. At that time, the item image 1204a is also displayed corresponding to the item value text 1203a.
Further, the item value text 1203b and the item image 1204b are also displayed in the same manner as the item value text 1203a and the item image 1204a.

一方、ステップＳ１４０３において、制御部１０１は、抽出修正処理が行われた項目値をユーザにより指示された位置に追加して表示する。ユーザが位置を指示する方法としては、例えば、図１２に示すように、ポインタ１２０６を用いて項目値テキスト１２０３ａをドラッグし、確認画面１２００上の適切な位置に移動させる方法などがある。 On the other hand, in step S1403, the control unit 101 adds and displays the item value for which the extraction correction process has been performed at the position instructed by the user. As a method of instructing the position by the user, for example, as shown in FIG. 12, there is a method of dragging the item value text 1203a using the pointer 1206 and moving the item value text 1203a to an appropriate position on the confirmation screen 1200.

以上のように、実施形態２によれば、抽出修正処理が行われた文字列を、確認画面において適切な位置に配置して表示するようにする。
これにより、抽出された文字列を確認画面においてユーザが確認・修正する確認作業において、さらなる効率化を図ることができる。 As described above, according to the second embodiment, the character string subjected to the extraction correction processing is arranged and displayed at an appropriate position on the confirmation screen.
As a result, it is possible to further improve the efficiency of the confirmation work in which the user confirms / corrects the extracted character string on the confirmation screen.

＜その他の実施形態＞
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。
また、本発明は、複数の機器から構成されるシステムに適用しても、１つの機器からなる装置に適用してもよい。
本発明は上述の実施形態に限定されるものではなく、本発明の趣旨に基づき種々の変形が可能であり、それらを本発明の範囲から除外するものではない。すなわち、上述した構成例及びその変形例を組み合わせた構成もすべて本発明に含まれるものである。 <Other Embodiments>
The present invention supplies a program that realizes one or more functions of the above-described embodiment to a system or device via a network or storage medium, and one or more processors in the computer of the system or device reads and executes the program. It can also be realized by the processing to be performed. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.
Further, the present invention may be applied to a system composed of a plurality of devices or a device composed of one device.
The present invention is not limited to the above-described embodiment, and various modifications can be made based on the gist of the present invention, and these are not excluded from the scope of the present invention. That is, all the configurations in which the above-mentioned configuration examples and modifications thereof are combined are also included in the present invention.

１００データ入力支援装置
１０１制御部
４００帳票画像
６００確認画面 100 Data input support device 101 Control unit 400 Form image 600 Confirmation screen

Claims

With respect to a plurality of items on the image, a first extraction means for performing a first extraction process for extracting a character string included in the item, and
A display means for displaying the first result of the first extraction process for the plurality of items, and
An input means that accepts the designation of one or a plurality of items among the plurality of items displayed as the first result on the display means.
It has a second extraction means for performing a second extraction process for extracting the character string included in the designated item under conditions different from the first extraction process for the designated item. A data input support system featuring.

When the extraction of the character string included in the specified item fails in the first extraction process, the second extraction means is based on the conditions set according to the cause of the failure. The data input support system according to claim 1, wherein a second extraction process is performed.

The second extraction means said that the cause of the failure was the first cause that the character string was not extracted, or the extracted character string did not match the set rule. The data input support system according to claim 2, wherein the second extraction process is performed based on a condition set according to the cause.

The data input according to claim 3, wherein when the cause of the failure is the second cause, the character string that failed to be extracted in the first extraction process is converted so as to match the rule. Support system.

The data input support system according to any one of claims 1 to 4, wherein the second extraction process includes a character recognition process and a preprocess performed before the character recognition process. ..

The data input support system according to claim 5, wherein the preprocessing is a binarization process of the image.

The data input support system according to claim 5 or 6, wherein the second extraction means performs the second extraction process in the pretreatment under conditions different from those of the first extraction process.

When the extraction of the character string included in the specified item fails due to the first cause that the character string was not extracted in the first extraction process, the second extraction means is the above-mentioned front. The data input support system according to any one of claims 5 to 7, wherein the second extraction process is performed under conditions different from those of the first extraction process.

The data input support system according to any one of claims 1 to 8, wherein a character string extracted by the first extraction process and the image are displayed on the display means. ..

The data input support system according to claim 9, wherein the designated item is specified based on the coordinates received by the input means.

The data input support system according to any one of claims 1 to 10, wherein the display means displays the second result of the second extraction process.

The data input support system according to claim 11, wherein the display means displays the second result so as to correspond to the first result.

For a plurality of items on the image, a step of performing a first extraction process for extracting a character string included in the item, and
A step of displaying the first result of the first extraction process for the plurality of items, and
A step of accepting the designation of one or a plurality of items among the plurality of items displayed as the first result, and
It is characterized by having a step of performing a second extraction process for extracting the character string included in the designated item under conditions different from the first extraction process for the designated item. Data entry support method.

A program for causing a computer to execute the data input support method according to claim 13.