JPH0644406A

JPH0644406A - Method and device for processing image

Info

Publication number: JPH0644406A
Application number: JP4199746A
Authority: JP
Inventors: Hiroaki Ikeda; 裕章池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1992-07-27
Filing date: 1992-07-27
Publication date: 1994-02-18

Abstract

PURPOSE:To enhance the precision for segmenting a character, and to decrease an erroneous recognition by tracking vertically a white picture element from a dividable position in image information containing plural characters. CONSTITUTION:First of all, in order to segment a first character, a method for taking a projection is used. As a result, with respect to the whole inputted image, a projection is taken in the line direction and a line rectangle is segmented, and at every segmented line rectangle, a projection is taken in the direction being vertical to the line, and character images 301-304 are segmented. Subsequently, a dividable position of the character image is derived. Thereafter, lines 305-308 are drawn at height of half of a character rectangle, and a middle point of the part in which a white picture element continues on the line becomes a divibable position. Next, a second character segmentation processing is executed. In this case, a divided position is searched vertically from the obtained dividable position and the processing for segmenting a character is finished. That is, the character rectangle 302 is divided into three of 309-311 and the character segmentation processing is finished.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は画像処理方法及び装置に
関し、特に入力した画像情報から文字を１文字ずつ切り
出す為の画像処理方法及び装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image processing method and apparatus, and more particularly to an image processing method and apparatus for cutting out characters one by one from input image information.

【０００２】[0002]

【従来の技術】従来の光学的文字認識装置における文字
認識処理の一例を図１３に示す。2. Description of the Related Art FIG. 13 shows an example of character recognition processing in a conventional optical character recognition device.

【０００３】まず、イメージスキャナ等を用いて原稿を
読み取り（Ｓ３０１）、続いて入力された画像から１文
字分の文字画像を切り出す（Ｓ３０２）。次に切り出さ
れた文字画像の特徴を抽出し（Ｓ３０３）、予め容易さ
れている各カテゴリの標準的な特徴等を用いて類似度を
計算する（Ｓ３０４）。その結果、類似度が最も大きい
カテゴリが認識結果となり、ＣＲＴ等に表示する（Ｓ３
０５）。First, an original is read using an image scanner or the like (S301), and then a character image for one character is cut out from the input image (S302). Next, the features of the cut-out character image are extracted (S303), and the similarity is calculated using the standard features of each category that are facilitated in advance (S304). As a result, the category with the highest degree of similarity becomes the recognition result and is displayed on the CRT or the like (S3).
05).

【０００４】入力画像から文字を切り出す方法としてい
くつか知られているが、その中で最も一般的な方法の一
つである文字画像の射影を取る方法を、簡単に説明を行
う。There are some known methods for cutting out a character from an input image, but a method for projecting a character image, which is one of the most general methods, will be briefly described.

【０００５】図１４に示すような横書きの文字列を例に
とると、横方向の射影面４０１に射影４０２をとること
で、射影の長さｈを高さとする行矩形４０３が得られ
る。さらに得られた行矩形を縦方向に射影を取ることで
射影面４０４に射影４０５〜４１０が得られ、文字が切
り出される。Taking a horizontal character string as shown in FIG. 14 as an example, by taking a projection 402 on a horizontal projection plane 401, a line rectangle 403 whose height is the length h of the projection is obtained. Further, by projecting the obtained line rectangle in the vertical direction, projections 405 to 410 are obtained on the projection surface 404, and characters are cut out.

【０００６】[0006]

【発明が解決しようとしている課題】しかしながら、上
記従来例では、図１５のように「こ」と「う」と「で」
がオーバーラップしてしまっている場合、画像の射影を
取ったのでは、枠１５で示すように、３文字合わせて
「こうで」が１文字として切り出されてしまう。そこ
で、１度切り出した矩形の中の、高さと幅の比率等から
更に分割する必要がある矩形を判断し、更なる分割が必
要な矩形に対する文字切り出し処理を再度行わなければ
ならない。However, in the above conventional example, as shown in FIG. 15, "ko", "uu" and "de" are used.
If the two overlap, the projection of the image is taken, and as shown by the frame 15, three characters are combined and “kode” is cut out as one character. Therefore, it is necessary to judge a rectangle that needs to be further divided from the once cut-out rectangles based on the ratio of the height to the width, and perform the character cutting process again for the rectangle that needs to be further divided.

【０００７】しかし、図１６のように、文字切り出しの
対象が英文等の文字幅が文字によって大きく異なる文字
列である場合には、平均文字幅での切り出しはできず、
また「ＡＷＡ」を縦一線で切ると正しく１文字分が得ら
れず、後の認識が正しく行われないという欠点がある。However, as shown in FIG. 16, when an object to be cut out is a character string such as an English sentence whose character width greatly differs depending on the character, it cannot be cut out with the average character width.
Further, if "AWA" is cut in a vertical line, one character cannot be obtained correctly, and subsequent recognition cannot be performed correctly.

【０００８】また、図１７のように、隣りどうしの文字
が接触している場合、文字画像の境界を追跡していく方
法でも２文字を分けることができないという欠点があ
る。Further, as shown in FIG. 17, when adjacent characters are in contact with each other, there is a drawback that the two characters cannot be separated even by the method of tracing the boundary of the character image.

【０００９】[0009]

【課題を解決するための手段】上記課題を解決する為
に、本発明は複数文字を含む画像情報において、分割可
能位置を求め、前記分割可能位置から上下に白画素を追
跡し、前記追跡して得た分割線によって前記画像情報を
分割することを特徴とする画像処理方法及び装置を提供
する。In order to solve the above problems, the present invention obtains a dividable position in image information containing a plurality of characters, tracks white pixels above and below the dividable position, and traces the white pixels. There is provided an image processing method and device, characterized in that the image information is divided by the dividing line obtained.

【００１０】上記課題を解決する為に、好ましくは前記
複数文字を含む画像情報は、文書画像情報から文字切り
処理を行って得た一つの画像情報枠とする。In order to solve the above problems, preferably, the image information containing the plurality of characters is one image information frame obtained by performing character cutting processing from the document image information.

【００１１】上記課題を解決する為に、好ましくは前記
分割可能位置は、前記複数文字を含む画像情報中に設定
した線分上の白画素連続領域上とする。In order to solve the above-mentioned problems, it is preferable that the dividable position is on a white pixel continuous area on a line segment set in the image information including the plurality of characters.

【００１２】上記課題を解決する為に、好ましくは前記
画像情報中に設定する線分は、該画像情報の高さの半分
の位置とする。In order to solve the above problems, it is preferable that the line segment set in the image information is at a position half the height of the image information.

【００１３】上記課題を解決する為に、本発明は複数文
字を含む画像情報において、分割候補位置を導出し、前
記導出した分割候補位置で分割した画像情報を各分割領
域毎に認識して類似度を演算し、前記演算した類似度に
従って前記分割位置を決定することを特徴とする画像処
理方法及び装置。In order to solve the above-mentioned problems, the present invention derives a division candidate position in image information containing a plurality of characters, recognizes the image information divided at the derived division candidate position for each divided area, and resembles each other. An image processing method and apparatus, characterized in that a degree is calculated and the division position is determined according to the calculated similarity.

【００１４】上記課題を解決する為に、好ましくは前記
複数文字を含む画像情報は、文書画像情報から文字切り
処理を行って得た一つの画像情報枠とする。In order to solve the above-mentioned problems, it is preferable that the image information containing the plurality of characters is one image information frame obtained by performing character cutting processing from the document image information.

【００１５】[0015]

【Example】

〔実施例１〕図１は本実施例の画像処理装置の構成を示
すブッロク図である。図１において、１０１はＲＯＭ１
０４に格納されている制御プログラムに従って画像の入
力、文字画像の切り出し等の処理や本装置全体の制御等
を行うＣＰＵ、１０２は文字の入力や修正等を行うキー
ボード（ＫＢ）、１０３はマウス等のポインティングデ
バイス（ＰＤ）、１０４はＣＰＵ１０１が実行する後述
するフローチャートに示す処理の制御プログラム等を格
納するＲＯＭ、１０５は文字画像や文字切り出し結果や
認識結果等を記憶するＲＡＭ、１０６は切り出された文
字画像について各文字との類似度を計算する類似度計算
部、１０７は画像を読み取るイメージスキャナ（ＳＣＡ
Ｎ）であり、１０８は無イメージスキャナのインターフ
ェース（Ｉ／Ｆ）、１０９は文字認識結果等を表示する
ディスプレイである。[Embodiment 1] FIG. 1 is a block diagram showing the configuration of an image processing apparatus according to this embodiment. In FIG. 1, 101 is a ROM 1
A CPU for inputting images, cutting out character images, and controlling the entire apparatus according to a control program stored in 04, a keyboard (KB) 102 for inputting and correcting characters, and a mouse 103. Pointing device (PD), 104 is a ROM for storing a control program for the processing executed by the CPU 101, which will be described later with reference to the flowchart, 105 is a RAM for storing character images, character cutout results, recognition results, etc., and 106 is a cutout. A similarity calculation unit that calculates the similarity between each character image and each character, 107 is an image scanner (SCA) that reads the image.
N), 108 is an interface (I / F) of the imageless scanner, and 109 is a display for displaying a character recognition result and the like.

【００１６】本実施例は、図１６のように文字幅の異な
る英文等の文字切り出しを分割可能点の追跡により求め
る方法を示す。図１に示す構成の文字認識装置が実行す
る本実施例の文字切り出し処理全体の概略を図２のフロ
ーチャートに示し説明する。This embodiment shows a method for obtaining character cut-outs of English sentences having different character widths as shown in FIG. 16 by tracing the dividable points. An outline of the entire character segmentation process of the present embodiment executed by the character recognition device configured as shown in FIG. 1 will be described with reference to the flowchart of FIG.

【００１７】まず、Ｓ２０１で従来よりあった写影を取
ることによる第１の文字切り出しを行う。切り出された
文字画像が更に分割できるか否かを判定し（Ｓ２０
２）、分割でる場合は、その位置を分割可能位置として
求める（Ｓ２０３）。Ｓ２０３で求めた分割可能位置に
従って第２の文字切り出し処理を行うか否か判断し（Ｓ
２０４）、第２の文字切り出しを行う。更に詳細な説明
をするために、図３のプロポーショナルピッチのアルフ
ァベットの文字画像を用いて各ステップを説明する。図
３（ａ）は入力さりた文字画像で、この画像情報に対し
文字切り出しを行う。まず、Ｓ２０１において、第１の
文字切り出しとして、写影を取る方法を用いる。これ
は、入力した画像全体に対して、まず行方向に写影を取
り行矩形を切り出し、その後切り出した行矩形ごとに行
に垂直な方向に写影を取り、文字画像を切り出すもので
ある。その結果、図３（ｂ）に示すように、４つの矩形
３０１，３０２，３０３，３０４が得られる。First, in step S201, a first character is cut out by taking a conventional mapping. It is determined whether the cut-out character image can be further divided (S20).
2) In the case of division, the position is obtained as a dividable position (S203). According to the dividable position obtained in S203, it is determined whether or not the second character cutting process is performed (S
204), the second character is cut out. For more detailed description, each step will be described using the character image of the alphabet of proportional pitch in FIG. FIG. 3A shows the input character image, and character cutting is performed on this image information. First, in S201, a method of taking a projection is used as the first character cutout. In this, a mapping is first performed in the row direction and a line rectangle is cut out for the entire input image, and then a mapping is performed in a direction perpendicular to the line for each cut out line rectangle, and a character image is cut out. As a result, four rectangles 301, 302, 303, 304 are obtained as shown in FIG.

【００１８】ここで、Ｓ２０１で得た各矩形を更に分割
するか否かを決める（Ｓ２０２）。この決め方として
は、例えば各矩形（３０１，３０２，３０３，３０４）
の高さと幅の比率から判断したり、或いは幅がある基準
値以上（以下）であることにより判断することができ
る。また、このＳ２０２は省略し、すぐＳ２０３で分割
可能位置を求めても良い。Here, it is determined whether or not each rectangle obtained in S201 is further divided (S202). As a method of determining this, for example, each rectangle (301, 302, 303, 304)
It can be determined from the ratio of the height to the width of the sheet, or can be determined from the width being equal to or larger than (equal to or less than) a reference value. Further, this step S202 may be omitted, and the dividable position may be immediately obtained in step S203.

【００１９】次に、文字画像の分割可能位置を求める
（Ｓ２０３）。これは、図３（ｃ）のように文字矩形の
半分の高さに線３０５，３０６，３０７，３０８を引
き、線上で白画素が連続する部分の中点を分割可能位置
とする。Ｓ２０４では、各文字矩形（３０１，３０２，
３０３，３０４）のなかにＳ２０３で求めた分割可能位
置が存在するかどうかを調べ、存在するならば、第２の
文字切り出し処理Ｓ２０５を行う。この例の場合、文字
矩形３０１と３０２に分割可能位置（図３（ｃ）におい
てｘで示す）が、存在するので、その２つの矩形につい
て第２の文字切り出し処理を行う。Ｓ２０３で求めた分
割可能位置から上下に追跡を行い、矩形を更に分割する
方法を、図４及び図５のフローチャートを用いて説明す
る。図４は上方向への追跡、図５は下方向への追跡を行
う、文字切り出し処理（Ｓ２０５）のフローチャートで
ある。座標軸は、横方向をｘ軸、縦方向をｙ軸とする。Next, the dividable position of the character image is obtained (S203). As shown in FIG. 3C, lines 305, 306, 307, and 308 are drawn at the height of a half of the character rectangle, and the midpoint of the part where white pixels are continuous on the line is set as the dividable position. In S204, each character rectangle (301, 302,
(303, 304) whether or not the dividable position obtained in S203 exists, and if there is, a second character cutout process S205 is performed. In the case of this example, since there are dividable positions (indicated by x in FIG. 3C) in the character rectangles 301 and 302, the second character cutout process is performed on the two rectangles. A method for further dividing the rectangle by tracing up and down from the dividable position obtained in S203 will be described with reference to the flowcharts of FIGS. 4 and 5. FIG. 4 is a flowchart of the character segmentation processing (S205) for tracing in the upward direction and FIG. 5 for tracing in the downward direction. The coordinate axes are x-axis in the horizontal direction and y-axis in the vertical direction.

【００２０】まず、得られた分割可能位置をＰｓとし、
分割可能位置Ｐｓから上下に分割位置を捜していく為の
追跡点ＰをＰｓに置く（Ｓ４０１）。次に、上方向への
分割を試みるため、Ｐを上へ１画素ずらす（Ｓ４０
２）。Ｐが黒画素でなければどんどん上へずらし、Ｐが
文字矩形の上部に達する（文字矩形の上方向の分割終
了）まで行う（Ｓ４０３）。分割線追跡中に、Ｐが黒画
素上にきた場合（Ｓ４０４）、文字画像の（黒画素領
域）の境界線を追跡することにする。黒画素領域の境界
線を右回りに追跡する方法を、図７のように「８」から
「Ｐ」に追跡が移動した時を例に説明する。追跡点Ｐの
回りの８画素に対して、一つ前のＰの位置である「８」
の右隣りすなわち「１」から順に黒画素を調べ、初めて
黒画素があった点、すなわち「５」を追跡点Ｐの次なる
移動先として進めるものである。左回りの追跡の場合
は、Ｓ４１２〜Ｓ４１８において左回りに調べる。境界
線の追跡は、まずＰの座標をＰｍに記憶しておく（Ｓ４
０５）。次に右回りに次の境界線上の黒画素を見つけＰ
を進める（Ｓ４０６）。もし、Ｐが文字矩形の上部に達
すれば上方向の分割は終了である（Ｓ４０７）。また、
もしＰのｙ座標がＰｓのｙ座標と等しくなったら、追跡
が下向きに進んでいるとし右回りの追跡を中止する（Ｓ
４０８）。追跡を進めた結果、Ｐのｘ座標がＰｓのｘ座
標と等しくなったら、境界線による追跡は終了するが
（Ｓ４０９）、ただし、その時ＰがＰｍより上に存在し
なければ、その点は以前にＰが通過した点であり、右回
りの追跡は続行不可能となり中止する（Ｓ４１０）。そ
の様子を図６に示す。なお、Ｐの上の画素が黒画素の場
合、境界線がまだ続いているので、再びＳ４０６に戻る
（Ｓ４１１）。そうでなければ、再び上部に向かい分割
を試みる。First, let Ps be the obtained dividable position,
The tracking point P for searching for the division position up and down from the dividable position Ps is set at Ps (S401). Next, P is shifted upward by one pixel in order to attempt the upward division (S40
2). If P is not a black pixel, the pixel is gradually moved upward until P reaches the upper part of the character rectangle (end of division of the character rectangle in the upward direction) (S403). When P is on the black pixel during the dividing line tracking (S404), the boundary line of the (black pixel area) of the character image is tracked. A method of tracking the boundary line of the black pixel region in the clockwise direction will be described by taking the case where the tracking moves from "8" to "P" as shown in FIG. For the 8 pixels around the tracking point P, the position of the previous P is "8".
The black pixels are sequentially examined from the right adjacent to, ie, “1”, and the point where the black pixel is present for the first time, that is, “5” is advanced as the next movement destination of the tracking point P. In the case of counterclockwise tracking, the counterclockwise check is performed in S412 to S418. To trace the boundary line, the coordinate of P is first stored in Pm (S4
05). Next, find a black pixel on the next boundary line clockwise and set P
(S406). If P reaches the upper portion of the character rectangle, the upward division is completed (S407). Also,
If the y coordinate of P becomes equal to the y coordinate of Ps, it is determined that the tracking is proceeding downward, and the clockwise tracking is stopped (S
408). When the x-coordinate of P becomes equal to the x-coordinate of Ps as a result of advancing the tracking, the tracking by the boundary line ends (S409). It is a point where P has passed, and the clockwise tracking cannot be continued and is stopped (S410). This is shown in FIG. If the pixel above P is a black pixel, the boundary line is still continuing, and therefore the process returns to S406 again (S411). If not, try heading up again to split.

【００２１】右回りの追跡で文字矩形の分割が出来なか
った場合、Ｐの座標をＰｍに戻して（Ｓ４１２）、左回
りの追跡で分割を試みる（Ｓ４１３）。これも右回りと
同様に処理を進め、追跡が中止になる条件になった場
合、その文字矩形は分割できないと判断し、追跡をやめ
る（Ｓ４１９）。When the character rectangle cannot be divided by the clockwise tracking, the coordinate of P is returned to Pm (S412), and the division is attempted by the counterclockwise tracking (S413). In this case as well, the process proceeds in the same manner as clockwise, and if the condition for stopping the tracking is met, it is determined that the character rectangle cannot be divided, and the tracking is stopped (S419).

【００２２】上方向の分割が成功したなら、次に下方向
に分割を試みる。その処理を図５のフローチャートに示
す。これも、上方向の分割とほぼ同様であり、追跡点Ｐ
が文字矩形の下部に達すれば分割が成功である（Ｓ５１
９）。最終的には図３（ｄ）のように３０２の文字矩形
が３０９，３１０，３１１の３つに分割され、文字切り
出しの処理が終了する。If the upward division is successful, then the downward division is tried. The process is shown in the flowchart of FIG. This is also similar to the upward division, and the tracking point P
If the character reaches the bottom of the character rectangle, the division is successful (S51).
9). Finally, as shown in FIG. 3D, the character rectangle 302 is divided into three parts 309, 310, and 311 and the character cutting process is completed.

【００２３】従って本実施例に従えば、射影を用いた文
字切り出しと、境界線追跡による文字切り出しを用いる
ことで、文字画像がオーバーラップしている場合でも、
文字の切り出しが可能となり、斜文字の文書などでも文
字が切り出せる。また、境界線の追跡は局部的に行わ
れ、必要以外の場所では行われないので、処理を高速に
行える効果がある。Therefore, according to the present embodiment, by using the character segmentation using the projection and the character segmentation by the boundary line tracking, even when the character images overlap each other,
Characters can be cut out, and it is possible to cut out characters even in italicized documents. Further, since the boundary line is traced locally and not in a place other than necessary, there is an effect that the processing can be performed at high speed.

【００２４】先の説明では、Ｓ２０３において分割可能
位置を探す為に文字矩形の半分の高さに線を引いたが、
この位置は変化させてもよく、或いは各矩形毎に横方向
の黒画素のヒストグラムを取り、最大となった部分に線
を引いてもよい。その場合、分割可能位置を減らせ、第
２の文字切り出し処理を行う回数を減少させる効果があ
る。また、前述の実施例において、分割可能位置を白画
素の線分の中点としたが、中心線上の黒画素から白画素
に変わる点、或いは白画素から黒画素に変わる点として
もよい。In the above description, a line is drawn at half the height of the character rectangle in order to search for a dividable position in S203.
This position may be changed, or a histogram of black pixels in the horizontal direction may be taken for each rectangle and a line may be drawn at the maximum portion. In that case, there is an effect that the dividable position is reduced and the number of times the second character cutout process is performed is reduced. Further, in the above-described embodiment, the dividable position is the midpoint of the line segment of the white pixel, but it may be the point where the black pixel on the center line changes to the white pixel or the point where the white pixel changes to the black pixel.

【００２５】〔実施例２〕本実施例は、図１７のように
隣り合う文字が接触している場合の文字切り出しの方法
を示す。[Embodiment 2] This embodiment shows a method of cutting out characters when adjacent characters are in contact as shown in FIG.

【００２６】本実施例における画像処理装置の構成は、
実施例１と同様であり、図１に示すものである。The configuration of the image processing apparatus in this embodiment is as follows.
This is similar to the first embodiment and is shown in FIG.

【００２７】本実施例に示す文字切り出し処理を図８の
フローチャートに示し、詳細に説明する。The character cutout process shown in this embodiment is shown in the flowchart of FIG. 8 and will be described in detail.

【００２８】まず、Ｓ８０１でイメージスキャナ１０７
から画像を入力し、文字画像をＲＡＭ１０５に格納す
る。Ｓ８０２では、従来例で説明した、或いは他の公知
の方法、例えば射影を取る等の方法で文字切り出しを行
う。そして、切り出した枠から、横書きならば標準文字
幅、縦書きならば標準文字高を演算し求める（Ｓ８０
３）。標準文字幅を求める処理は図１２のフローチャー
トに示し、後で詳細に説明する。First, in step S801, the image scanner 107
The image is input from and the character image is stored in the RAM 105. In step S802, character extraction is performed by the method described in the conventional example or another known method, for example, a method such as projection. Then, from the cut-out frame, the standard character width is calculated for horizontal writing and the standard character height is calculated for vertical writing (S80).
3). The process of obtaining the standard character width is shown in the flowchart of FIG. 12 and will be described later in detail.

【００２９】Ｓ８０４では、Ｓ８０３で求めた標準文字
幅（高）を用いてＳ８０２で切り出した１つのブロック
を複数のブロックに分割する必要があるか否かを判断す
る。このＳ８０４における判断は、例えば、横書きの場
合、Ｓ８０２で切り出された各文字ブロックの幅がＳ８
０３で求めた標準文字幅の１．５倍を超えた場合に、分
割する必要があると判断することができる。Ｓ８０４で
分割することが認められた場合、分割ブロック作成を行
う（Ｓ８０５）。In step S804, it is determined whether or not one block cut out in step S802 needs to be divided into a plurality of blocks using the standard character width (high) obtained in step S803. In the determination in S804, for example, in the case of horizontal writing, the width of each character block cut out in S802 is S8.
If it exceeds 1.5 times the standard character width obtained in 03, it can be determined that division is necessary. If the division is recognized in S804, division blocks are created (S805).

【００３０】Ｓ８０５の更なる分割の処理を、図９の文
字ブロックの接触が起こりやすい、文字間がほとんどな
い日本語の横書きの文字画像を例に説明する。Ｓ８０４
で選択された文字ブロックが標準文字幅Ｗのほぼ整数倍
になっていれば、標準文字幅毎に分割した場合（９０
１，９０２）と分割しない場合（９０３）の３つの文字
ブロックを作成しておく（Ｓ８０５）。図１０は、図９
に半角文字が含まれたような複雑な場合の例である。半
角や倍角文字を考慮して、標準文字幅の半分を分割の単
位にし、文字ブロックの幅が標準文字幅の１．５倍を超
えた場合、分割ブロックの作成を開始するようにする。
この例の場合、半角ブロック（１００１〜１００６）、
全角ブロック（１００７〜１０１１）、倍角ブロック
（１０１２〜１０１４）を作成しておく。The process of further division in S805 will be described by taking a horizontally written character image of Japanese as shown in FIG. S804
If the character block selected in step is almost an integral multiple of the standard character width W, it is divided by the standard character width (90
1, 902) and when not divided (903), three character blocks are created (S805). FIG. 10 shows FIG.
This is an example of a complicated case where half-width characters are included in. Considering half-width and double-width characters, half of the standard character width is used as a unit of division, and when the width of the character block exceeds 1.5 times the standard character width, creation of the divided block is started.
In the case of this example, half-width blocks (1001 to 1006),
Full-width blocks (1007 to 1011) and double-width blocks (1012 to 1014) are created.

【００３１】以上の様に作成された文字ブロックを特徴
抽出（Ｓ８０５）、類似度計算（Ｓ８０７）し、Ｓ８０
５で分割ブロックが作成されたものについてどのブロッ
クを採用するかをＳ８０９で決定する。The character block created as described above is subjected to feature extraction (S805), similarity calculation (S807), and S80.
In S809, it is determined which block is to be adopted for the divided block created in 5.

【００３２】Ｓ８０９の決定方法について、図９、図１
０の分割例を用いて説明する。図９では、ブロック９０
１と９０２の類似度が小さい方と、９０３の類似度を比
較し、９０３の類似度が大きければ９０１と９０２は採
用せず、９０３を文字切り結果とし、その逆の場合、９
０１と９０２を採用し９０３は使用しないとする。この
例では９０３の類似度が小さくなることが予想されるの
で、接触文字であっても、文字切り枠９０１と９０２が
求まる。次に図１０の場合であるが、上記例と同様に１
００１から１０１４を用いて全ての組合せを考え、各組
合せで最小類似度となる分割ブロックの中で、類似度が
最も大きい組合せを文字切り結果として採用する。或い
は、左側から文字切り結果に採用するブロックを決定し
ていく、すなわち、まず、１００１を含んだブロック
（１００１，１００７，１０１２）の中で類似度が最大
となるものもを文字切り結果とし、次のブロックに進
む。この場合１００７が最大類似度となることが予想さ
れ、それを採用する。次は、１００３，１００９，１０
１４で同様の判定を行う。その結果、上記２例とも１０
０７，１００３，１０１０，１００６が採用されること
が予想され、正しい文字切り結果が得られる。Regarding the determination method of S809, FIG. 9 and FIG.
This will be described using an example of division of 0. In FIG. 9, block 90
The similarity between 1 and 902 is smaller, and the similarity between 903 is compared. If the similarity between 903 is large, 901 and 902 are not adopted, 903 is taken as the character cutting result, and vice versa.
01 and 902 are adopted and 903 is not used. In this example, it is expected that the degree of similarity of 903 will be small, so that the character cutting frames 901 and 902 can be obtained even for a contact character. Next, in the case of FIG. 10, as in the above example, 1
All combinations are considered using 001 to 1014, and among the divided blocks having the minimum similarity in each combination, the combination having the highest similarity is adopted as the character cutting result. Alternatively, the block to be adopted as the character cutting result is determined from the left side, that is, the block having the maximum similarity among the blocks (1001, 1007, 1012) including 1001 is set as the character cutting result. Go to the next block. In this case, 1007 is expected to be the maximum similarity and that is adopted. Next is 1003, 1009, 10
The same judgment is made at 14. As a result, both of the above two examples are 10
It is expected that 07, 1003, 1010, 1006 will be adopted, and a correct character cutting result can be obtained.

【００３３】以上の様にして得られた認識結果をディス
プレイ１０９に表示する（Ｓ８１０）。The recognition result obtained as described above is displayed on the display 109 (S810).

【００３４】本発明により、特に日本語文書等のように
文字画像の外接矩形がほぼ正方形に近い文字で大分部が
構成される文書に含まれる接触文字を、標準文字サイズ
と類似度の比較のみで精度良く認識が可能となり、なお
かつ、文字サイズの異なる文字例えば半角や倍角等が接
触文字内に混在していても正しい認識が可能である。According to the present invention, contact characters included in a document whose circumscribed rectangle of a character image is almost square and whose major part is large, such as a Japanese document, are compared only with a standard character size and similarity. Allows accurate recognition, and correct recognition is possible even when characters having different character sizes, such as half-width characters and double-width characters, are mixed in the contact character.

【００３５】ここで、Ｓ８０５の分割ブロックの作成方
法についての他の例を説明する。Now, another example of the method of creating divided blocks in S805 will be described.

【００３６】図１１は、文字が接触し１つの文字ブロッ
クとなった文字画像である。この場合、文字ブロックの
幅は標準文字幅よりわずかに大きいだけで、前実施例の
様に標準文字幅、或いはその半分を分割位置とした場
合、正しい認識結果が得られないような分割がなされる
（１１０１）。そこで、分割パターンを少しずつ変えた
ものを類似度計算を行う前に作成しておく。類似度計算
後、最も類似度の大きい組合せを採用する。FIG. 11 shows a character image in which characters come into contact with each other to form one character block. In this case, the width of the character block is only slightly larger than the standard character width, and if the standard character width or half of the standard character width is used as the division position as in the previous embodiment, division is performed so that a correct recognition result cannot be obtained. (1101). Therefore, a pattern in which the division pattern is changed little by little is created before the similarity calculation. After the similarity calculation, the combination with the highest similarity is adopted.

【００３７】本発明により、標準文字サイズと異なる文
字が接触をした場合でも、高精度の認識結果が得られ、
また、英文の様に文字幅が文字により異なる文書につい
ても同様の効果が得られる。According to the present invention, even if a character different from the standard character size comes into contact, a highly accurate recognition result can be obtained.
Also, the same effect can be obtained for documents such as English whose character width varies depending on the characters.

【００３８】なお、以上の実施例では文字の切り出し処
理をＣＰＵ１０１で行うとして説明したが、文字切り出
し処理部を独立させた構成でも良く、また、認識計算を
ＣＰＵ１０１で行っても良い。In the above embodiments, the character cutout processing is performed by the CPU 101, but the character cutout processing unit may be independent, and the recognition calculation may be performed by the CPU 101.

【００３９】また、画像の入力はイメージスキャナ１０
７からに限るものではなく、外部記憶装置等が構成され
ていれば、別の手段で得られた画像データを一時的に記
憶しておき、そこから取り込んでも良い。The image is input by the image scanner 10.
However, if the external storage device is configured, the image data obtained by another means may be temporarily stored and taken in from there.

【００４０】先に述べたＳ８０３の標準文字幅（高）を
求める方法を、図１２のフローチャートを用いて説明す
る。A method of obtaining the standard character width (height) in S803 described above will be described with reference to the flowchart of FIG.

【００４１】まず、Ｓ８０２で切り出された各文字ブロ
ックの高さが最大であるものを見つけ、その高さをｈｍ
ａｘとする（Ｓ１２０）。注目する文字ブロックを行の
先頭のブロックとし（Ｓ１２１）、行内のブロックすべ
てをチェックするまで以下の処理を行う。First, the character block cut out in S802 is found to have the maximum height, and its height is hm.
Ax (S120). The character block of interest is set as the first block of the line (S121), and the following processing is performed until all blocks in the line are checked.

【００４２】注目ブロックの幅Ｗとｈｍａｘとを比較し
（Ｓ１２３）、注目文字ブロックの幅がｈｍａｘに比べ
十分狭かったり、十分広くなければ、このブロックを標
準文字ブロックとし（Ｓ１２４）、標準文字幅を求める
のに使用する。The width W of the target block is compared with hmax (S123), and if the width of the target character block is sufficiently narrow or not wide enough than hmax, this block is set as a standard character block (S124), and the standard character width is set. Used to ask for.

【００４３】もし、ブロック幅がｈｍａｘに比べて十分
狭い場合、次の文字ブロックと合成し（Ｓ１２６）、合
成した文字ブロックの幅とｈｍａｘを比較する（Ｓ１２
７）。合成したブロックの幅がＳ１２７の判定条件と同
様の条件を満たせば、そのブロックを標準文字ブロック
とし、まだ狭ければ更に合成して同様の判定をする。一
方、十分大きい場合は合成しすぎであり、合成を取り消
す。標準文字ブロックかどうかの判定が終了したら、注
目ブロックを次に移す（Ｓ１２５）。このようにして求
まった標準文字ブロックの幅の平均を計算し、これを標
準文字幅Ｗとする（Ｓ１２９）。If the block width is sufficiently narrower than hmax, it is combined with the next character block (S126) and the width of the combined character block is compared with hmax (S12).
7). If the width of the combined block satisfies the same condition as the judgment condition of S127, the block is set as a standard character block, and if it is still narrow, it is further combined and the same judgment is performed. On the other hand, if it is sufficiently large, it means that the composition is too much and the composition is canceled. When it is determined whether the block is a standard character block, the block of interest is moved to the next (S125). The average width of the standard character blocks thus obtained is calculated, and this is set as the standard character width W (S129).

【００４４】縦書きの場合も、横方向を高さ方向に、縦
方向を幅方向に置き換えれば、同様な方法で標準文字高
さを求められる。Also in the case of vertical writing, if the horizontal direction is replaced by the height direction and the vertical direction is replaced by the width direction, the standard character height can be obtained by the same method.

【００４５】[0045]

【発明の効果】以上説明したように、文字ピッチや文字
幅が一定しない文書に関し、第１の文字切り出しのみで
は、十分に確実さが得られなかったものに対しても、分
離可能位置を探し、第２の文字切り出しを行うことで文
字切り出しの精度が向上し、誤認識が減少する効果があ
る。これにより、その後の修正作業が軽減され、文書入
力時間を短縮できる利点も有する。As described above, regarding a document in which the character pitch and the character width are not constant, a separable position is searched for even if the first character cutout alone cannot provide sufficient certainty. By performing the second character cutout, there is an effect that the accuracy of the character cutout is improved and erroneous recognition is reduced. As a result, the subsequent correction work is reduced, and the document input time can be shortened.

【００４６】以上説明したように、これまでの文字切り
方法では正確に行うことが出来なかつた接触文字等が存
在しても、疑わしい文字画像についていくつかの文字切
りパターンを用意し、識別計算後最も確からしいパター
ンを認識結果として選択することで、誤認識が減少し、
修正作業が軽減される効果がある。As described above, some character cutting patterns are prepared for suspicious character images even if there are contact characters etc. that cannot be accurately performed by the conventional character cutting methods, and after the identification calculation. By selecting the most probable pattern as the recognition result, false recognition is reduced,
This has the effect of reducing correction work.

【００４７】また、既存文書の入力作業が短時間で容易
に行える効果がある。Further, there is an effect that the input operation of the existing document can be easily performed in a short time.

[Brief description of drawings]

【図１】本実施例の文字認識装置のブロック図。FIG. 1 is a block diagram of a character recognition device according to an embodiment.

【図２】実施例１の文字認識処理のフローチャート。FIG. 2 is a flowchart of character recognition processing according to the first embodiment.

【図３】実施例１を説明するための文字画像。FIG. 3 is a character image for explaining the first embodiment.

【図４】実施例１における文字切り出し処理の第２のフ
ローチャート。FIG. 4 is a second flowchart of character cutting processing according to the first embodiment.

【図５】実施例１における文字切り出し処理の第２のフ
ローチャート。FIG. 5 is a second flowchart of character cutting processing according to the first embodiment.

【図６】文字切り出しの例示図。FIG. 6 is an exemplary diagram of character cutout.

【図７】境界線追跡の説明図。FIG. 7 is an explanatory diagram of boundary line tracking.

【図８】実施例２の文字認識処理のフローチャート。FIG. 8 is a flowchart of character recognition processing according to the second embodiment.

【図９】分割パターンの第１の例示図。FIG. 9 is a first exemplary diagram of a division pattern.

【図１０】分割パターンの第２の例示図。FIG. 10 is a second exemplary diagram of a division pattern.

【図１１】分割パターンの第３の例示図。FIG. 11 is a third exemplary diagram of a division pattern.

【図１２】標準文字幅（高）を求める処理のフローチャ
ート。FIG. 12 is a flowchart of processing for obtaining a standard character width (height).

【図１３】従来の文字認識処理のフローチャート。FIG. 13 is a flowchart of conventional character recognition processing.

【図１４】射影による文字切り出しの例示図。FIG. 14 is an exemplary diagram of character segmentation by projection.

【図１５】文字切り出しの例示図。FIG. 15 is an exemplary diagram of character segmentation.

【図１６】文字切り対象の第１の例示図。FIG. 16 is a first exemplary diagram of a character cutting target.

【図１７】文字切り対象の第２の例示図。FIG. 17 is a second exemplary diagram of a character cutting target.

Claims

[Claims]

1. The image information including a plurality of characters is obtained with a dividable position, white pixels are traced up and down from the dividable position, and the image information is divided by dividing lines obtained by the tracing. Image processing method.

2. The image processing method according to claim 1, wherein the image information including the plurality of characters is one image information frame obtained by performing character cutting processing from document image information.

3. The image processing method according to claim 1, wherein the dividable position is on a white pixel continuous area on a line segment set in the image information including the plurality of characters.

4. The image processing method according to claim 3, wherein the line segment set in the image information is at a position half the height of the image information.

5. In image information including a plurality of characters, a division candidate position is derived, image information divided at the derived division candidate position is recognized for each divided area, and a similarity is calculated. An image processing method, wherein the division position is determined according to the degree of similarity.

6. The image processing method according to claim 5, wherein the image information including the plurality of characters is one image information frame obtained by performing character cutting processing from document image information.

7. In image information including a plurality of characters, a dividable position deriving means for obtaining a dividable position, a tracing means for tracing white pixels up and down from the dividable position, and a tracing means for tracing the white pixels. An image processing apparatus comprising: a dividing unit that divides the image information by a dividing line.

8. The image processing apparatus according to claim 7, wherein the image information including the plurality of characters is one image information frame obtained by performing character cutting processing from the document image information.

9. The image processing apparatus according to claim 7, wherein the dividable position is on a white pixel continuous area on a line segment set in the image information including the plurality of characters.

10. The image processing apparatus according to claim 9, wherein the line segment set in the image information is at a position half the height of the image information.

11. In image information including a plurality of characters, a division candidate position is derived, image information divided at the derived division candidate position is recognized for each divided region, a similarity is calculated, and the calculated similarity is calculated. An image processing method, characterized in that the division position is determined according to the degree.

12. The image processing method according to claim 11, wherein the image information including the plurality of characters is one image information frame obtained by performing character cutting processing from document image information.

13. In image information including a plurality of characters, division candidate position deriving means for deriving a division candidate position, and image information divided at the division candidate position derived by the division candidate position deriving means is recognized for each divided area. An image processing apparatus comprising: a similarity calculation means for calculating a similarity and a division position determination means for determining the division position according to the similarity calculated by the similarity calculation means.

14. The image processing apparatus according to claim 13, wherein the image information including the plurality of characters is one image information frame obtained by performing character cutting processing from document image information.