JP2020086405A

JP2020086405A - Information processor, information processing system and information processing program

Info

Publication number: JP2020086405A
Application number: JP2018225798A
Authority: JP
Inventors: 浩一長田; Koichi Osada
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2018-11-30
Filing date: 2018-11-30
Publication date: 2020-06-04

Abstract

To shorten time for replacing a character matching a specific replacement condition with a character having a different character shape.SOLUTION: An information processor 1 includes: an image data generation section 2 for generating image data of a second character on the basis of a character image indicating an image of the second character having a character shape different from a character shape of a first character described in a sentence; and a character replacement section 3 for replacing the first character with the second character on the basis of the image data of the second character when a character positioned before or after the first character matches a specific replacement condition in the sentence.SELECTED DRAWING: Figure 1

Description

本発明は、情報処理装置、情報処理システム及び情報処理プログラムに関する。 The present invention relates to an information processing device, an information processing system, and an information processing program.

特許文献１には、文字の字形の切り換え機能を持たないアプリケーションによる文章印刷であっても、ユーザが希望する異体字の字形での印刷を可能にする技術が開示される。特許文献１に開示される技術では、印刷データから複数の異体字データを持つ文字コードが検出され、複数の異体字が候補字形として表示器の画面に表示され、表示器の画面に表示された候補字形をユーザが参照しながら印刷に使う文字の字形を選択することで、字形が異なる文字への置換が行われる。 Patent Document 1 discloses a technique that enables printing in a variant character shape desired by a user even in the case of text printing by an application that does not have a function of switching the character shape. In the technique disclosed in Patent Document 1, a character code having a plurality of variant data is detected from print data, the plurality of variant characters are displayed as candidate glyphs on the screen of the display unit, and displayed on the screen of the display unit. When the user refers to the candidate glyph and selects the glyph of the character to be used for printing, replacement with a character having a different glyph is performed.

しかしながら、特許文献１に代表される技術では、特定の置換条件に合致する文字だけを、字形が異なる文字へ一括変更することができない。従って、ユーザは、特定の置換条件に合致する文字だけを選択する必要があるため、置換対象の文字数が増えるほどユーザに多大な負担がかかると共に、作業時間が長時間に及ぶという問題があった。 However, with the technique typified by Patent Document 1, it is not possible to collectively change only the characters that meet a specific replacement condition into characters with different glyphs. Therefore, the user needs to select only the characters that meet a specific replacement condition, and thus the larger the number of characters to be replaced, the greater the burden on the user and the long working time. ..

本発明は、上記課題に鑑み、特定の置換条件に合致する文字を字形が異なる文字へ置換する時間を短縮する。 In view of the above problems, the present invention shortens the time for replacing a character that matches a specific replacement condition with a character having a different glyph.

上記課題に鑑み、本発明に係る情報処理装置は、文章に記載される第１文字の字形と異なる字形の第２文字の画像を示す文字画像に基づき、前記第２文字の画像データを生成する画像データ生成部と、前記文章において、前記第１文字の前又は後に位置する文字が特定の置換条件に合致するとき、前記第２文字の画像データに基づき前記第１文字を前記第２文字に置き換える文字置換部と、を備える。 In view of the above problem, the information processing apparatus according to the present invention generates image data of the second character based on a character image showing an image of a second character having a character shape different from the character shape of the first character described in the sentence. In the image data generation unit and in the sentence, when a character positioned before or after the first character matches a specific replacement condition, the first character is changed to the second character based on the image data of the second character. And a character replacement unit for replacement.

本発明によれば、特定の置換条件に合致する文字を字形が異なる文字へ置換する時間を短縮できるという効果を奏する。 According to the present invention, it is possible to reduce the time required to replace a character that matches a specific replacement condition with a character having a different glyph.

本発明の実施の形態に係る情報処理装置の機能ブロックを示す一例の図である。It is an example of a functional block diagram of the information processing apparatus according to an embodiment of the present invention. フォントの字形の違いを示す図である。It is a figure which shows the difference of the character shape of a font. 本実施の形態に係る情報処理装置の動作を説明するためのフローチャートである。7 is a flowchart for explaining the operation of the information processing device according to the present embodiment. イメージバリアブルプログラムによる文字画像の置換前後のイメージを示す図である。It is a figure which shows the image before and after the replacement of the character image by an image variable program. イメージバリアブルデータの画像に置き換えるイメージを示す第１図である。It is FIG. 1 which shows the image replaced by the image of image variable data. イメージバリアブルデータの画像に置き換えるイメージを示す第２図である。It is FIG. 2 which shows the image replaced by the image of image variable data. 文章に記載される文字の一部が置換される様子を説明するための図である。It is a figure for explaining a mode that a part of character described in a sentence is replaced. 第１変形例に係る情報処理装置１のの機能ブロックを示す一例の図である。It is an example of a functional block diagram of the information processing apparatus 1 according to the first modification. 第１変形例に係る情報処理装置１の動作を説明するためのフローチャートである。9 is a flowchart for explaining the operation of the information processing device 1 according to the first modification. 第２変形例に係る情報処理装置１の機能ブロックを示す一例の図である。It is an example of a functional block diagram of the information processing apparatus 1 according to the second modification. 第２変形例に係る情報処理装置１の動作を説明するためのフローチャートである。11 is a flowchart for explaining the operation of the information processing device 1 according to the second modification. 第３変形例に係る情報処理装置１の機能ブロックを示す一例の図である。It is an example of a functional block diagram of the information processing apparatus 1 according to the third modification. 第３変形例に係る情報処理装置１の動作を説明するためのフローチャートである。It is a flow chart for explaining operation of information processor 1 concerning the 3rd modification. 本発明の実施の形態に係る情報処理装置１を実現するためのハードウェア構成例を示す図である。It is a figure which shows the hardware structural example for implement|achieving the information processing apparatus 1 which concerns on embodiment of this invention. 本実施の形態に係る情報処理装置１を備えた情報処理システム１Ａの構成例を示す図である。It is a figure which shows the structural example of 1 A of information processing systems provided with the information processing apparatus 1 which concerns on this Embodiment. 置換条件提案部４又は置換条件提案部４１に適用される機械学習部４００の一例を示す図である。It is a figure which shows an example of the machine learning part 400 applied to the replacement condition proposal part 4 or the replacement condition proposal part 41.

以下、本発明を実施するための形態について図面を用いて説明する。 Embodiments for carrying out the present invention will be described below with reference to the drawings.

実施の形態．
図１は本発明の実施の形態に係る情報処理装置の機能ブロックを示す一例の図である。情報処理装置１は、ＰＣ（Personal Computer）などである。情報処理装置１は、画像データ生成部２、文字置換部３、第１格納部１０及び第２格納部２０を備える。画像データ生成部２は、ユーザが生成した、文章に記載される第１文字の字形と異なる字形の第２文字の画像を示す文字画像に基づき、第２文字の画像データをユーザが生成する。文字置換部３は、当該文章において、第１文字の前又は後に位置する文字が特定の置換条件に合致するか否かを判定し、判定の結果、第１文字の前又は後に位置する文字が特定の置換条件に合致するとき、第２文字の画像データに基づき第１文字を前記第２文字に置き換える。特定の置換条件についての説明は後述する。文字置換部３によって文字が置き換えられた文章のデータは、不図示のメモリに格納される。 Embodiment.
FIG. 1 is a diagram showing an example of functional blocks of an information processing apparatus according to an embodiment of the present invention. The information processing device 1 is a PC (Personal Computer) or the like. The information processing device 1 includes an image data generation unit 2, a character replacement unit 3, a first storage unit 10, and a second storage unit 20. The image data generation unit 2 generates the image data of the second character by the user based on the character image that is generated by the user and shows the image of the second character having a character shape different from the character shape of the first character described in the sentence. The character replacement unit 3 determines whether the character positioned before or after the first character in the sentence matches a specific replacement condition, and as a result of the determination, the character positioned before or after the first character is When the specific replacement condition is met, the first character is replaced with the second character based on the image data of the second character. The specific replacement condition will be described later. The text data in which the characters have been replaced by the character replacement unit 3 is stored in a memory (not shown).

情報処理装置１は、例えば、過去に作成された文書中に、「JIS 90」に従った字形が複数記載されている場合、それらの字形の内、特定の置換条件に合致するものだけを「JIS 2004」に従った字形に変更することができるように構成されている。なお、変更前後の字形の種類はこれらに限定されるものではない。以下では、従来の問題点とフォントの字形について説明し、その後、情報処理装置１の動作を説明する。 For example, when a plurality of glyphs conforming to “JIS 90” are described in a document created in the past, the information processing apparatus 1 selects only those glyphs that meet a specific replacement condition from “glyph”. It is configured so that it can be changed to a character shape according to JIS 2004. It should be noted that the types of glyphs before and after the change are not limited to these. Below, the conventional problems and the shape of the font will be described, and then the operation of the information processing apparatus 1 will be described.

過去に作成した文書に含まれている文字の字形を一括して変更したいという要望が存在する。字形は、特定の骨格を有する文字の現れた形（デザイン）を意味する。デザインが同一の文字同士であっても、文字を形作る線の数、この線の太さ、文字を形作る点の数、この点の角度などが異なる場合があるため、文字を形作る線、点などが異なる文字同士は、字形は異なると言える。例えば、文字集合に関する代表的な規格には、1990年に改正された「JIS X 0208」と、2004年に改正された「JIS X 0213」とが存在する。「JIS X 0208」は、日本語表記、地名、人名などで用いられる図形文字を含む、情報交換用の2バイト符号化文字集合を規定する日本工業規格である。「JIS X 0208」の規格名称は、「7ビット及び8ビットの2バイト情報交換用符号化漢字集合」である。「JIS X 0208」は前述した「JIS 90」に相当する。「JIS X 0213」は、「JIS 90」を拡張した、日本語用の符号化文字集合を規定する日本工業規格である。「JIS X 0213」の規格名称は、「7ビット及び8ビットの2バイト情報交換用符号化拡張漢字集合」である。「JIS X 0213」は前述した「JIS 2004」に相当する。 There is a demand to collectively change the glyphs of characters included in documents created in the past. The glyph means the appearance (design) of a character having a specific skeleton. Even if the characters have the same design, the number of lines that form a character, the thickness of this line, the number of points that form the character, the angle of this point, etc. may differ, so the line that forms the character, the point, etc. It can be said that different characters have different glyphs. For example, representative standards for character sets include "JIS X 0208" revised in 1990 and "JIS X 0213" revised in 2004. "JIS X 0208" is a Japanese industrial standard that defines a 2-byte coded character set for information exchange, including graphic characters used in Japanese notation, place names, personal names, and the like. The standard name of "JIS X 0208" is "7-bit and 8-bit 2-byte information exchange coded kanji character set". "JIS X 0208" corresponds to "JIS 90" described above. "JIS X 0213" is a Japanese Industrial Standard that extends "JIS 90" and defines a coded character set for Japanese. The standard name of “JIS X 0213” is “7-bit and 8-bit 2-byte information exchange encoded extended kanji character set”. "JIS X 0213" corresponds to "JIS 2004" described above.

「JIS 2004」と「JIS 90」は何れも文字集合の規格であり、字形を規定する規格ではない。規格書に使われている字形は、あくまでも例示であって、規範ではない。しかしながら、各社がフォントを作成する際、規格書の例示字形を尊重する傾向があり、結果的に「JIS 2004」に従ったフォントと、「JIS 90」に従ったフォントとが存在する。これらのフォントの一例を図２に示す。 Both "JIS 2004" and "JIS 90" are standards for character sets, not standards for defining glyphs. The glyphs used in the standards are merely examples, not norms. However, each company tends to respect the exemplary glyphs in the standard when creating fonts, and as a result, there are fonts according to “JIS 2004” and fonts according to “JIS 90”. An example of these fonts is shown in FIG.

図２はフォントの字形の違いを示す図である。図２の左側には、「JIS 2004」に従ったフォントの字形が示される。図２の右側には、「JIS 90」に従ったフォントの字形が示される。「つじ」を示す２種類の漢字を比較した場合、「JIS 2004」に従ったフォントの字形は、「JIS 90」に従ったフォントの字形と異なる。「かつ」を示す２種類の漢字についても、「JIS 2004」に従ったフォントの字形は、「JIS 90」に従ったフォントの字形と異なる。 FIG. 2 is a diagram showing a difference in font shape. On the left side of FIG. 2, the font shape according to “JIS 2004” is shown. On the right side of FIG. 2, the font shape according to "JIS 90" is shown. When two types of kanji representing "tsuji" are compared, the font shape according to "JIS 2004" is different from the font shape according to "JIS 90". Regarding the two types of kanji representing "Katsu", the font shape according to "JIS 2004" is different from the font shape according to "JIS 90".

ここで、例えば、過去に作成された文書中に、「JIS 90」に従った字形が複数記載されている場合、それら全ての字形を「JIS 2004」に従った字形に変更したい、すなわち置換したいという要望が存在する。「JIS 90フォント」と「JIS 2004フォント」との両方が、置換対象の文字以外の全文字について、同一の字形を有する場合、文書中で指定されるフォント名を「JIS 90フォント」から「JIS 2004フォント」に変更するだけで、文章中に記載される全ての字形を一括変更できる。しかしながら、「JIS 90フォント」と「JIS 2004フォント」の内、同一デザインで提供されているものは、「ＭＳ明朝」、「ＭＳゴシック」など、一部のフォントに限られている。そのため、過去に使用された「JIS 90字形」のフォントに対して、同一デザインの「JIS 2004字形」のフォントは提供されていないことが多い。この場合、「JIS 90字形」を「JIS 2004字形」に一括変更するためには、「JIS 2004字形」のフォントを独自に用意する必要があった。ただし、フォントの作成には多大な費用と時間がかかるため、字形の一括変更は容易ではなかった。 Here, for example, if there are multiple glyphs conforming to "JIS 90" in a document created in the past, you want to change all of these glyphs to conform to "JIS 2004", that is, replace them. There is a demand that. If both "JIS 90 font" and "JIS 2004 font" have the same glyph for all characters except the character to be replaced, the font name specified in the document is changed from "JIS 90 font" to "JIS You can change all the glyphs in the text at once by simply changing to "2004 font". However, among the "JIS 90 fonts" and "JIS 2004 fonts", those provided with the same design are limited to some fonts such as "MS Mincho" and "MS Gothic". Therefore, in many cases, the "JIS 2004 character" font of the same design is not provided to the "JIS 90 character" font used in the past. In this case, in order to collectively change the "JIS 90 glyph" to the "JIS 2004 glyph", it was necessary to prepare the "JIS 2004 glyph" font independently. However, it takes a lot of money and time to create a font, so it is not easy to change all the glyphs.

さらに、過去に作成された文書に「JIS 90字形」のフォントが利用している場合において、この文章中の複数の文字の内、特定の置換条件に合致する文字の字形のみを、「JIS 2004字形」に変更したい、という要望も存在する。例えば、「辻Ａ太郎」と「辻Ｂ子」の名前が、同一文書に混在していると仮定する。さらに、この文書には「JIS 90字形」のフォントが使われており、「つじ」を表す漢字は、全て一点しんにょうの「辻」と仮定する。ここで、「辻Ｂ子」さんの「辻」は本人の強い希望に基づき二点しんにょうの「辻」に変更し、「辻Ａ太郎」さんの「辻」は本人の希望に基づき一点しんにょうのままとする場合について説明する。 In addition, when the font of "JIS 90 glyph" is used in the document created in the past, only the glyph of the character that meets the specific replacement condition among the plurality of characters in this document is specified in "JIS 2004 There is also a desire to change to "glyphs". For example, it is assumed that the names "Tsuji Ataro" and "Tsuji Boko" are mixed in the same document. In addition, this document uses a "JIS 90 glyph" font, and it is assumed that all kanji that represent "tsuji" are single-point Shinji "tsuji." Here, "Tsuji" of "Tsuji Boko" is changed to "Tsuji" of two points based on his strong hope, and "Tsuji" of "Taro Tsuji" is one point based on his wish. The case of leaving it as it is will be described.

この文書には、以下の特徴があるものとする。 This document shall have the following features:

第１特徴として、「辻Ａ太郎」は、同一文書中に「辻客員教授」と表記されることがある。また「客員教授の辻先生」と表記されることもある。この場合、「客員教授」は必ず「辻」の前又は後の１０文字以内に存在するものとする。 The first feature is that "Tsuji Ataro" may be written as "Tsuji visiting professor" in the same document. It is also sometimes referred to as "visiting professor Tsuji". In this case, "Visiting Professor" must exist within 10 characters before or after "Tsuji".

第２特徴として、「辻Ｂ子」は、同一文書中に「辻名誉教授」と表記されることがある。また「名誉教授の辻先生」と表記されることもある。この場合、「名誉教授」は必ず「辻」の前後１０文字以内に存在するものとする。 As a second feature, "Boji Tsuji" may be described as "Professor Emeritus Tsuji" in the same document. It is also sometimes referred to as "Professor Emeritus Tsuji". In this case, "Professor Emeritus" must exist within 10 characters before and after "Tsuji".

第３特徴として、「辻Ａ太郎」さんに関する記述と、「辻Ｂ子」さんに関する記述とは、文章中で互いに離れて記載されており、例えば「辻」の前後１０文字以内に「客員教授」と「名誉教授」の両方が存在することはないものとする。さらに、「辻Ａ太郎」の前後１０文字以内に「名誉教授」が存在することはなく、「辻Ｂ子」の前後１０文字以内に「客員教授」が存在することもないものとする。 As a third feature, the description about “Taro Tsuji” and the description about “Boko Tsuji” are described separately from each other in the text, for example, within 10 characters before and after “Tsuji” "And "Professor Emeritus" shall never exist. Furthermore, it is assumed that there is no "professor emeritus" within 10 characters before and after "Taro Tsuji" and there is no "visiting professor" within 10 characters before and after "Boko Tsuji".

このように特徴のある文章において、特定の置換条件が満たされているときだけ、「辻Ｂ子」という文字列の一点しんにょうの「辻」を、二点しんにょうの「辻」に変更する、という要望に対応する必要がある。特定の置換条件は、例えば「辻Ｂ子」という文字列の前後１０文字以内、又は「辻」の前後１０文字以内に、「名誉教授」が存在する場合、一点しんにょうの「辻」を二点しんにょうの「辻」に変更し、それ以外の場合、一点しんにょうの「辻」のままにする、という条件である。 In such a characteristic sentence, only when a specific replacement condition is satisfied, the single point Shinji "Tsuji" of the character string "Tsuji Boko" is changed to the two-point Shinji "Tsuji". Need to respond to requests. For example, if "Professor Emeritus" exists within 10 characters before or after the character string "Tsuji Biko", or within 10 characters before or after "Tsuji", then one point is two Shinji "Tsuji". The condition is that it is changed to Shinji's "Tsuji", and in all other cases, it remains as a single point Shinji's "Tsuji".

人名以外の固有名詞でも、同様の要望がある。例えば、作品名、曲名などに使われている文字に異体字が存在する場合がある。例えば、山梨県の地名である大菩薩峠を解説した文書がある。この文書には「JIS 90字形」のフォントが使われているものとする。この文書には、中里介山の小説「大菩薩峠」に言及している部分がある。ところが、作者の遺族の強い要望により、小説名「大菩薩峠」の「薩」だけを「JIS 2004字形」の「薩」に変更する必要があるとする。そして、この文書では、小説名の「大菩薩峠」が、必ず"小説「大菩薩峠」"、"大菩薩峠という小説"などの形式で記載されており、「大菩薩峠」という文字列の前後７文字以内には、「小説」という文字列が必ず存在するものとする。すなわち、「大菩薩峠」という文字列の前後７文字以内に「小説」という文字列が存在するとき、「大菩薩峠」は必ず小説名を表す。この場合、「大菩薩峠」の前後７文字以内に「小説」という文字列が存在するときだけ、「小説」という文字列の前後７文字以内に存在する「薩」を「JIS 2004字形」の「薩」に変更し、「小説」という文字列の前後７文字以内に存在していない「薩」に関しては「JIS 90字形」のまま残す、という要望に対応する必要がある。 There are similar requests for proper nouns other than personal names. For example, there may be variants in the characters used in the title of a work, the title of a song, and the like. For example, there is a document explaining the Daibosatsutoge Pass, which is a place name in Yamanashi Prefecture. This document assumes that "JIS 90 glyph" fonts are used. There is a part of this document that refers to Nakazato Uesan's novel "The Great Bodhisattva Pass". However, due to the strong demands of the author's bereaved family, it is necessary to change only the "satsu" in the novel name "Daisatsusatsutoge" to "JIS 2004" "satsu". And, in this document, the novel name "Daibosatsu Pass" is always written in the form of "Novel "Daisatsusatsu Pass", "Daisatsu Daisatsusugi Novel", and the character string "Daisatsusatsu Pass" It is assumed that the character string "novel" always exists within 7 characters before and after. That is, when the character string "novel" exists within 7 characters before and after the character string "daibosatsutoge", "daibosatsutoge" always represents the novel name. In this case, only when the character string "novel" exists within 7 characters before and after "Daibatsusatsutoge", the "satsu" that exists within 7 characters before and after the character string "novel" is converted into "JIS 2004 character form". It is necessary to change to "Satsu" and to respond to the request that "Satsu", which does not exist within 7 characters before and after the character string "novel", be left as "JIS 90 character form".

このように、特定の置換条件に合致する文字の字形のみを変更する場合、特許文献１に代表される従来技術は、特定の置換条件を判定する手段を備えていないため、特定の置換条件に合致する文字の字形のみを一括して変更することが困難であった。このような問題を解決するため、本実施の形態に係る情報処理装置１は、文章中の特定の文字について、当該文字の前又は後に位置する文字が特定の置換条件に合致するとき、字形が異なる文字に置き換えるように構成されている。以下では、情報処理装置１の動作を説明する。 In this way, when changing only the glyphs of the characters that match the specific replacement condition, the conventional technique typified by Patent Document 1 does not include a unit for determining the specific replacement condition. It was difficult to change only the glyphs of the matching characters all at once. In order to solve such a problem, the information processing apparatus 1 according to the present embodiment determines a character shape of a specific character in a sentence when a character positioned before or after the character matches a specific replacement condition. It is configured to replace with a different letter. The operation of the information processing device 1 will be described below.

図３は本実施の形態に係る情報処理装置の動作を説明するためのフローチャートである。ステップＳ１において、ユーザによって、イメージバリアブルプログラム用の文字画像データが生成される。イメージバリアブルプログラムは、バリアブル印刷を実行させるためのプログラムである。バリアブル印刷は、入力データに基づいて、印刷する内容を変えて印刷を行うことである。バリアブル印刷は、オフィスで行われる「差し込み印刷」を高機能化したものと考えてよい。イメージバリアブルプログラムは、バリアブル印刷の手法を使い、写真、絵画などの中に、文字を画像として埋め込むプログラムである。主に名前入りカレンダー、ダイレクトメールに使われている。本実施の形態では、イメージバリアブルプログラムが図１に示される画像データ生成部２として機能する。 FIG. 3 is a flow chart for explaining the operation of the information processing apparatus according to this embodiment. In step S1, the user generates character image data for an image variable program. The image variable program is a program for executing variable printing. Variable printing is to change the content to be printed based on input data and perform printing. Variable printing can be considered to be a high-performance version of "mail merge printing" performed in the office. The image variable program is a program that embeds characters as images in photographs, paintings, etc. by using a variable printing method. It is mainly used for calendars with names and direct mail. In the present embodiment, the image variable program functions as the image data generation unit 2 shown in FIG.

本実施の形態では、イメージバリアブルプログラムに、例えば「FusionPro Expression」（登録商標）が利用される。「FusionPro Expression」（登録商標）は、データと画像を組み合わせ、パーソナライズの画像を生成するバリアブル印刷用ソフトウェアである。「FusionPro Expression」（登録商標）によれば、ユーザが独自に準備した画像などを挿入し、ユニークなオリジナルデザインを作成することができる。なお、イメージバリアブルプログラムは、バリアブル印刷を実行させるためのプログラムであればよく、「FusionPro Expression」（登録商標）に限定されるものではない。 In this embodiment, for example, "FusionPro Expression" (registered trademark) is used for the image variable program. “Fusion Pro Expression” (registered trademark) is variable printing software that combines data and images to generate personalized images. According to “Fusion Pro Expression” (registered trademark), it is possible to insert a unique image prepared by the user and create a unique original design. The image variable program may be any program as long as it is a program for executing variable printing, and is not limited to “FusionPro Expression” (registered trademark).

イメージバリアブルプログラムの中には、「画像文字」(Image Character)機能を持つものがある。これは、文字列を構成する各文字の字形を、文字ごとに用意した画像に置き換えて出力するものである。図４はイメージバリアブルプログラムによる文字画像の置換前後のイメージを示す図である。図４には、例えば文章中の文字「Ｍ」、「Ａ」及び「Ｘ」が、これらに対応する文字画像に置き換えられた例が示される。図４の左側の文字が文章中の文字であり、図４の右側の文字が文字画像に置き換えられた文字である。文字画像には、ＰＮＧ（画像データを圧縮して記録するファイル形式の一つ）、ＴＩＦなどの画像形式ファイルが、１文字ごとに用意される。「Ｍ」、「Ａ」及び「Ｘ」などの文字の画像形式ファイルに、「文字コード．拡張子」のファイル名を与えることにより、イメージバリアブルプログラムが文字画像ファイルを識別できるようにする。例えば、上記の「Ｍ」の文字画像ファイルは、「Ｍ」という文字の画像形式ファイルに、「Ｍの文字コード．ｐｎｇ」、「Ｍの文字コード．ｔｉｆ」などのファイル名を与えることによって作成される。このようにして作成されたデータが、前述したイメージバリアブルプログラム用の文字画像データである。 Some image variable programs have an "Image Character" function. This is to replace the glyphs of each character forming the character string with an image prepared for each character and output the image. FIG. 4 is a diagram showing an image before and after replacement of a character image by the image variable program. FIG. 4 shows an example in which the characters “M”, “A”, and “X” in the text are replaced with the corresponding character images. The character on the left side of FIG. 4 is the character in the sentence, and the character on the right side of FIG. 4 is the character replaced by the character image. For the character image, an image format file such as PNG (one of file formats for compressing and recording image data) and TIF is prepared for each character. By giving a file name of "character code.extension" to the image format file of characters such as "M", "A", and "X", the image variable program can identify the character image file. For example, the character image file of "M" is created by giving a file name such as "M character code.png" or "M character code.tif" to the image format file of the character "M". To be done. The data created in this way is the character image data for the image variable program described above.

イメージバリアブルプログラムで扱われる文字画像ファイルは、装飾文字であることが多いが、本実施の形態に係る情報処理装置１で利用される文字画像ファイルは、装飾文字ではなく、通常のフォントである。 The character image file handled by the image variable program is often a decorative character, but the character image file used by the information processing device 1 according to the present embodiment is not a decorative character but a normal font.

図３に戻り、ステップＳ２において、画像データ生成部２は、イメージバリアブルプログラム用の文字画像データを用いて、イメージバリアブルデータを生成する。 Returning to FIG. 3, in step S2, the image data generation unit 2 generates image variable data using the character image data for the image variable program.

前述した「FusionPro Expression」（登録商標）には、「FusionPro VDP CREATOR」（登録商標）が組み合わされる。「FusionPro VDP CREATOR」（登録商標）は、バリアブル印刷プログラムの一例（差し込み印刷プログラムの一例）であり、図１に示される文字置換部３として機能する。本実施の形態では、バリアブル印刷プログラムが図１に示される文字置換部３として機能する。 The above-mentioned "FusionPro Expression" (registered trademark) is combined with "FusionPro VDP CREATOR" (registered trademark). “FusionPro VDP CREATOR” (registered trademark) is an example of a variable print program (an example of a merge print program), and functions as the character replacement unit 3 shown in FIG. In this embodiment, the variable print program functions as the character replacement unit 3 shown in FIG.

ステップＳ３において、文字置換部３は、文章中の特定の文字について、当該文字の前又は後に位置する文字が特定の置換条件に合致するか否かを判定する。当該文字の前又は後に位置する文字が特定の置換条件に合致する場合（ステップＳ３、Ｙｅｓ）、ステップＳ４の処理が実行される。当該文字の前又は後に位置する文字が特定の置換条件に合致しない場合（ステップＳ３、Ｎｏ）、文字が置換されることなく、処理動作が終了する。 In step S3, the character replacement unit 3 determines, with respect to a specific character in the sentence, whether or not the character positioned before or after the character matches the specific replacement condition. When the character positioned before or after the character matches the specific replacement condition (step S3, Yes), the process of step S4 is executed. When the character positioned before or after the character does not meet the specific replacement condition (No in step S3), the processing operation ends without the character being replaced.

なお、ステップＳ３において、特定の文字の前又は後に位置する文字が特定の置換条件に合致するか否かの判定方法の詳細については後述する。 The details of the method of determining whether or not the character positioned before or after the specific character matches the specific replacement condition in step S3 will be described later.

ステップＳ４において、文字置換部３は、文書中の指定されたフィールド(テキストフレーム)の文字をすべて、イメージバリアブルデータの画像に置き換える。すなわち、文字置換部３は、文章中の特定の文字のみを、イメージバリアブルデータの画像に置き換えるように構成されている。画像の置き換え方法を図５及び図６を用いて説明する。 In step S4, the character replacement unit 3 replaces all the characters in the designated field (text frame) in the document with the image of the image variable data. That is, the character replacement unit 3 is configured to replace only specific characters in a sentence with an image of image variable data. An image replacement method will be described with reference to FIGS.

図５はイメージバリアブルデータの画像に置き換えるイメージを示す第１図である。図５の左側の画面１００Ａには、文書中のテキストフレームＴＦが指定されている。図５の右側の画面１００Ｂには、背景画像ＢＧＩが指定されている。このようにテキストフレームＴＦ及び背景画像ＢＧＩが設定された後、バリアブル印刷プログラムが実行される。すると、バリアブル印刷プログラムは、テキストフレームＴＦの「Thessa」の部分を、差し込みデータ(名前の一覧など)から読み込んだ文字列(「MAX」など)に置き換え、さらに文字列を構成する各文字「M」、「A」、「X」を、それぞれに対応する画像に置き換える。このように、バリアブル印刷プログラムでは、ユーザが作成した文字画像ファイルに対応する文字を文書の中から検索し、見つかった文字すべてを、対応する画像に置き換える。なお、文字の検索は、例えばＪＩＳコードが利用されるが、ＪＩＳコードに限定されない。例えば、文章中の文字のＪＩＳコードと、イメージバリアブルデータに変化された文字のＪＩＳコードが比較され、ＪＩＳコードが一致する文字の画像が置換される。 FIG. 5 is a first diagram showing an image to be replaced with an image of image variable data. The text frame TF in the document is specified on the screen 100A on the left side of FIG. The background image BGI is specified on the screen 100B on the right side of FIG. After the text frame TF and the background image BGI are set in this way, the variable print program is executed. Then, the variable print program replaces the "Thessa" portion of the text frame TF with the character string (such as "MAX") read from the merge data (such as the list of names), and further, the characters "M" that make up the character string , "A", "X" are replaced with the corresponding images. In this way, the variable printing program searches the document for the character corresponding to the character image file created by the user, and replaces all the found characters with the corresponding image. In addition, for example, the JIS code is used for the character search, but the search is not limited to the JIS code. For example, the JIS code of the character in the sentence is compared with the JIS code of the character changed to the image variable data, and the image of the character having the matching JIS code is replaced.

バリアブル印刷プログラムが実行されることにより置き換えられた文字画像の例を図６に示す。図６はイメージバリアブルデータの画像に置き換えるイメージを示す第２図である。図６に示すように、図５の画面１００Ａで指定されたテキストフレームＴＦに変更が加えられている。このように置き換えられた画像は、PDFファイルとして保存される。 FIG. 6 shows an example of the character image replaced by executing the variable printing program. FIG. 6 is a second diagram showing an image to be replaced with an image of image variable data. As shown in FIG. 6, the text frame TF designated on the screen 100A of FIG. 5 is modified. The image thus replaced is saved as a PDF file.

なお、本実施の形態では、イメージバリアブルプログラムとして機能する画像データ生成部２と、バリアブル印刷プログラムとして機能する文字置換部３とは、ユーザがテキストフレームＴＦを指定しなくても、文書全体を処理対象とみなすように構成されている。 In the present embodiment, the image data generation unit 2 that functions as an image variable program and the character replacement unit 3 that functions as a variable print program process the entire document without the user designating the text frame TF. It is configured to be considered a subject.

次に、実際の文章に記載される文字の一部が置換される様子を、図７を用いて説明する。図７は文章に記載される文字の一部が置換される様子を説明するための図である。ステップＳ１では、画像作成プログラム(ペイントソフトプログラムなど)を利用して文字画像データが作成される。作成された文字画像データは、イメージバリアブルプログラム用文字画像データとして、第１格納部１０に格納される。 Next, how a part of the characters described in the actual sentence is replaced will be described with reference to FIG. FIG. 7 is a diagram for explaining how a part of the characters described in a sentence is replaced. In step S1, character image data is created using an image creation program (paint software program or the like). The created character image data is stored in the first storage unit 10 as image image data for an image variable program.

ここでは、イメージバリアブルプログラム用文字画像データの画像形式ファイルに、以下のようなファイル名が与えられている。「辻」のファイル名は「92D2.tif」である。「薩」のファイル名は「8E46.tif」である。 Here, the following file names are given to the image format file of the character image data for the image variable program. The file name of "Tsuji" is "92D2.tif". The file name of "Satsu" is "8E46.tif".

ステップＳ２では、第１格納部１０に格納されたデータが、イメージバリアブルプログラムによって、バリアブル印刷プログラムが参照できるデータ形式(*.fpx)に変換される。変換されたデータはイメージバリアブルデータとして、第２格納部２０に格納される。 In step S2, the data stored in the first storage unit 10 is converted by the image variable program into a data format (*.fpx) that can be referenced by the variable print program. The converted data is stored in the second storage unit 20 as image variable data.

ステップＳ３では、バリアブル印刷プログラムによって、文書ファイル内が検索され、文字列置き換え条件を示す情報に基づき、文章中の特定の文字の前又は後に位置する文字が、特定の置換条件に合致するか否かが判定される。以下では、説明の便宜上、「文字列置き換え条件」を単に「置換条件」と称する場合がある。置換条件を示す情報は第１格納部１０又は第２格納部２０に格納されている。置換条件の一例は、以下の通りである。 In step S3, the variable print program searches the document file, and based on the information indicating the character string replacement condition, whether or not the character positioned before or after the specific character in the sentence matches the specific replacement condition. Is determined. Hereinafter, for convenience of description, the “character string replacement condition” may be simply referred to as a “replacement condition”. The information indicating the replacement condition is stored in the first storage unit 10 or the second storage unit 20. An example of the replacement condition is as follows.

例えば第１置換条件は、「辻Ｂ子」及び「辻」の前後10文字以内に「名誉教授」が存在する場合、「名誉教授」の前後１０文字以内に存在する一点しんにょうの「辻」を、イメージバリアブルデータ「92D2.tif」に置き換える、というものである。 For example, the first replacement condition is that if "Professor Emeritus" exists within 10 characters before and after "Tsuji Buko" and "Tsuji", replace "Tsuji" with one point that exists within 10 characters before and after "Professor Emeritus". The image variable data is replaced with "92D2.tif".

第２置換条件は、「大菩薩峠」の前後７文字以内に「小説」が存在する場合、「小説」という文字列の前後７文字以内に存在する「薩」を、イメージバリアブルデータ「8E46.tif」に置き換える、というものである。 The second replacement condition is that when a "novel" exists within 7 characters before and after "Daibatsusatsutoge", the "satsu" that exists within 7 characters before and after the character string "novel" is converted to the image variable data "8E46. It is replaced with "tif".

ステップＳ４では、ステップＳ３の判定結果に従って、文字が置き換えられる。文字が置き換えられた文章は、PDF形式で所定の記憶部に格納される。なお、文字の置換方法は、元の文字を抜き出して、文字が抜き出した部分に新しい文字を埋め込むことでもよいし、元の文字の部分に新しい文字を重ねてもよい。 In step S4, the character is replaced according to the determination result of step S3. The text in which the characters have been replaced is stored in PDF format in a predetermined storage unit. The character replacement method may be such that the original character is extracted and a new character is embedded in the extracted portion, or a new character may be superimposed on the original character portion.

なお、置換条件は、上記の例に限定されるものではない。置換条件は、特定の文字列ABC(単一文字Aであってもよい)の前後ｎ文字以内に、別の特定文字列DEF(単一文字Dであってもよい)を検出したとき、文字列ABC (又は単一文字A)を別の文字列XYZ(又は単一文字X)に置き換える、という条件でもよい。ｎは１以上の自然数である。 The replacement condition is not limited to the above example. The replacement condition is that when another specific character string DEF (which may be a single character D) is detected within n characters before and after the specific character string ABC (which may be a single character A), the character string ABC is detected. (Or the single character A) may be replaced with another character string XYZ (or the single character X). n is a natural number of 1 or more.

また、置換条件は、例えば、敬称の有無、敬称の種類、称号の有無、敬称の種類に基づく条件にしてもよい。例えば「辻名誉教授」又は「辻Ｂ子名誉教授」という文字列に含まれる、一点しんにょうの「辻」は、二点しんにょうの「辻」に置き換えるが、この文字列以外の文字列、例えば「辻客員教授」又は「辻Ａ太郎客員教授」に含まれる、一点しんにょうの「辻」は、一点しんにょうの「辻」のままにする、という条件を採用してもよい。 Further, the replacement condition may be, for example, a condition based on the presence or absence of a title, the type of title, the presence or absence of a title, and the type of title. For example, "Tsuji" of 1 point Shinnyo, which is included in the character string "Professor Emeritus Tsuji" or "Professor Emeritus of Buko Tsuji", is replaced with "Tsuji" of 2 points Shinnyo. The one-point Shinji "Tsuji" included in "Tsuji visiting professor" or "Tsuji Ataro visiting professor" may remain as "one-point Shinji".

また、置換条件は、例えば、文章の分野別の辞書に基づく条件にしてもよい。例えば、料理関係の文章に記載される一点しんにょうの「辻」は、二点しんにょうの「辻」に置き換えるが、音楽関係の文章に記載される一点しんにょうの「辻」は、一点しんにょうの「辻」のままにする、という条件を採用してもよい。この場合、分野別の辞書データが例えば第１格納部１０などに格納されており、文字置換部３は、この辞書データを参照することによって、文書中に出現する語彙の種類、語彙の使用頻度を判定する。これにより、文字置換部３は、例えば、文章に記載される「辻Ａ太郎のお料理教室」、「辻Ａ太郎のレシピ」などの文字列に含まれる一点しんにょうの「辻」は、二点しんにょうの「辻」に置き換える。また、文字置換部３は、別の文章に記載される「辻Ａ太郎の音楽学校」、「辻Ａ太郎が作曲した交響曲」などの文字列に含まれる一点しんにょうの「辻」は、一点しんにょうの「辻」のままにする。 Further, the replacement condition may be, for example, a condition based on a dictionary for each field of text. For example, the single point Shinji "Tsuji" written in cooking-related texts is replaced with the two-point Shinji "Tsuji", but the single-point Shinji "Tsuji" written in music-related texts is the single point Shinji "Tsuji". It is also possible to adopt the condition of leaving it as ". In this case, dictionary data for each field is stored in, for example, the first storage unit 10, and the character replacement unit 3 refers to the dictionary data to refer to the type of vocabulary appearing in the document and the frequency of use of the vocabulary. To judge. As a result, the character replacement unit 3 includes, for example, one point Shinji "Tsuji" included in a character string such as "Tsuji Ataro's cooking class" and "Tsuji Ataro's recipe" described in the sentence. Replaced with Shinji's "Tsuji". In addition, the character replacement unit 3 includes a single point, "Tsuji", which is included in a character string such as "Tsuji A Taro's Music School" or "Tsuji A Taro's Symphony" written in a separate sentence. Leave as "Tsuji".

なお、このような置換条件を応用することによって、置換される文字を逆にするようにしてもよい。例えば、料理分野の文章に記載される特定の文字列に含まれる二点しんにょうの「辻」は、一点しんにょうの「辻」に置き換え、音楽分野の文章に記載される特定の文字列に含まれる一点しんにょうの「辻」は、二点しんにょうの「辻」に置き換える、ということも可能である。 The characters to be replaced may be reversed by applying such a replacement condition. For example, the two-point Shinji "Tsuji" included in a specific character string written in the text of the cooking field is replaced with the one-point Shinji "Tsuji" included in the specific character string described in the text of the music field. It is possible to replace one-point Shinji "Tsuji" with two-point Shinjyo "Tsuji".

なお、ユーザの期待通りの置換結果が得られるような置換条件を最初から設定することは難しいため、このような置換条件を設定するためには、ある程度の試行錯誤を経て、文字の置き換え精度を高めていくことが必要である。以下では、置換条件を最適化する構成例について説明する。 Note that it is difficult to set the replacement condition from the beginning that can obtain the replacement result that the user expects. Therefore, in order to set such a replacement condition, it takes some trial and error to improve the character replacement accuracy. It is necessary to raise it. Below, an example of composition which optimizes a substitution condition is explained.

図８は第１変形例に係る情報処理装置１の機能ブロックを示す一例の図である。図８に示される情報処理装置１は、画像データ生成部２及び文字置換部３に加えて、置換条件提案部４及び修正受付部６を備える。 FIG. 8 is an example of functional blocks of the information processing apparatus 1 according to the first modification. The information processing apparatus 1 illustrated in FIG. 8 includes a replacement condition proposing unit 4 and a correction receiving unit 6 in addition to the image data generating unit 2 and the character replacing unit 3.

図９は第１変形例に係る情報処理装置１の動作を説明するためのフローチャートである。ステップＳ１〜４の処理動作は、図３に示される処理動作と同様のため、説明を割愛する。 FIG. 9 is a flowchart for explaining the operation of the information processing device 1 according to the first modification. The processing operation of steps S1 to S4 is the same as the processing operation shown in FIG.

前述した第１置換条件に従って、ステップＳ３による置換が行われた結果、ユーザの期待通りの置換が行われない場合、ユーザの手作業によって、ユーザの期待通りの置換が行われない部分（文字）の置換、すなわち文字の修正を修正受付部６が受け付ける。（ステップＳ５）。 As a result of performing the replacement in step S3 according to the above-described first replacement condition, when the replacement as expected by the user is not performed, the replacement by manual operation by the user is not performed (character) Is accepted, that is, the modification accepting unit 6 accepts the modification of the character. (Step S5).

このようにして文字が修正される前の文章の内容を示す情報と、文字が修正された後の文章の内容を示す情報とは、置換条件を示す情報と対応付けて、第１格納部１０又は第２格納部２０に格納される。 In this way, the information indicating the content of the sentence before the characters are corrected and the information indicating the content of the sentence after the characters are corrected are associated with the information indicating the replacement condition, and are stored in the first storage unit 10 Alternatively, it is stored in the second storage unit 20.

置換条件提案部４は、第１格納部１０又は第２格納部２０に格納されたこれらの情報に基づき、置換条件毎に、文字が修正される前後の文章を比較して、置換条件と置換された文字との相関関係を分析する（ステップＳ６）。 The replacement condition proposing unit 4 compares the sentences before and after the characters are corrected for each replacement condition based on the information stored in the first storage unit 10 or the second storage unit 20, and replaces the replacement condition with the replacement condition. The correlation with the written character is analyzed (step S6).

具体的には、一点しんにょうの「辻」を二点しんにょうの「辻」に置換するための、２つの置換条件が存在していると仮定する。例えば、一方の置換条件（前述した第１置換条件）は、一点しんにょうの「辻」の前後１０文字以内に「名誉教授」が存在するときには一点しんにょうの「辻」を二点しんにょうの「辻」に置き換えるという置換条件である。他方の置換条件（ユーザが気づいていない潜在的なルールである第２置換条件）は、例えば、一点しんにょうの「辻」の前後５０文字以内に「ヴァイオリン」又は「バイオリン」が存在するときには一点しんにょうの「辻」を二点しんにょうの「辻」に置き換えるという置換条件である。置換条件提案部４は、これらの置換条件と、置換された文字との相関関係を分析する。 Specifically, it is assumed that there are two substitution conditions for substituting one-point Shinji "Tsuji" with two-point Shinji "Tsuji". For example, one substitution condition (the above-mentioned first substitution condition) is that when there is a "Professor Emeritus" within 10 characters before and after "Tsuji" of 1 point Shinnyo, "Tsuji" of 1 point Shinnyo is 2 points of Shinji "Tsuji". The replacement condition is to replace with. The other replacement condition (the second replacement condition, which is a potential rule that the user is not aware of) is, for example, when a "violin" or a "violin" exists within 50 characters before or after "Tsuji" of a single-point Shinnyo. The replacement condition is to replace "tsuji" with "tsuji". The replacement condition proposing unit 4 analyzes the correlation between these replacement conditions and the replaced character.

これにより、第１置換条件で置換される場合よりも、第２置換条件で置換される方が、置換の精度が高いと判定することができる。すなわち、置換の誤りが少ない置換条件（例えば第２置換条件）を判定することができる。この判定の結果、置換条件提案部４は、第１置換条件の代わりに第２置換条件を、修正が少ない置換条件としてユーザに提案する（ステップＳ７）。 As a result, it is possible to determine that the replacement accuracy is higher when the replacement is performed under the second replacement condition than when the replacement is performed under the first replacement condition. That is, it is possible to determine a replacement condition (for example, the second replacement condition) with few replacement errors. As a result of this determination, the replacement condition proposing unit 4 proposes the second replacement condition instead of the first replacement condition to the user as a replacement condition with few modifications (step S7).

置換条件提案部４は、置換条件の提案を以下に示すような方法でユーザに通知する（ステップＳ８）。置換条件提案部４は、例えば情報処理装置１に接続される表示部に置換条件の内容を示すメッセージを表示させてもよいし、情報処理装置１に接続される音声出力部から、置換条件の内容を示す音声メッセージを出力させてもよい。 The replacement condition proposing unit 4 notifies the user of the replacement condition proposal by the method as described below (step S8). The replacement condition proposing unit 4 may display a message indicating the contents of the replacement condition on the display unit connected to the information processing device 1, or the replacement condition proposal unit 4 may display the replacement condition from the voice output unit connected to the information processing device 1. A voice message indicating the content may be output.

図１０は第２変形例に係る情報処理装置１の機能ブロックを示す一例の図である。図１０に示される情報処理装置１は、画像データ生成部２及び文字置換部３に加えて、置換条件提案部４１を備える。置換条件提案部４１は、効率の良い文字置き換え作業を実現するために、ユーザが置換条件を考案しなくても推奨される置換条件を提案する機能である。 FIG. 10 is an example of functional blocks of the information processing apparatus 1 according to the second modification. The information processing device 1 illustrated in FIG. 10 includes a replacement condition proposing unit 41 in addition to the image data generating unit 2 and the character replacing unit 3. The replacement condition proposing unit 41 has a function of proposing a recommended replacement condition without the user having to devise the replacement condition in order to realize an efficient character replacement work.

図１１は第２変形例に係る情報処理装置１の動作を説明するためのフローチャートである。ステップＳ１〜４の処理動作は、図３に示される処理動作と同様のため、説明を割愛する。 FIG. 11 is a flowchart for explaining the operation of the information processing device 1 according to the second modification. The processing operation of steps S1 to S4 is the same as the processing operation shown in FIG.

例えば、文字の置換が行われる前後の文章の一定範囲（例えば先頭から５分の１の範囲）について、文字の置き換え（修正）が手作業で行われ、修正受付部６がユーザによる修正を受け付けたと仮定する（ステップＳ１５）。 For example, the character replacement (correction) is manually performed in a certain range (for example, a range of ⅕ from the beginning) before and after the character replacement, and the correction receiving unit 6 receives the correction by the user. (Step S15).

このようにして文字が修正される前の文章の内容を示す情報と、文字が修正された後の文章の内容を示す情報とが、文字が置換された範囲を示す情報と対応付けて、第１格納部１０又は第２格納部２０に複数格納される。 In this way, the information indicating the content of the sentence before the characters are corrected and the information indicating the content of the sentence after the characters are corrected are associated with the information indicating the range in which the characters are replaced, A plurality of data are stored in the first storage unit 10 or the second storage unit 20.

置換条件提案部４１は、第１格納部１０又は第２格納部２０に格納されたこれらの情報を参照することで、文字が修正される前の文章の一定範囲内の文章内容と、文字が修正された後の文章の一定範囲内の文章内容とを比較する。ここで比較される一定範囲は、修正前後の文書の同じ箇所であるとする。そして、置換条件提案部４１は、修正される前の文章内容と修正された文章内容との相関関係を分析する（ステップＳ１６）。 By referring to these pieces of information stored in the first storage unit 10 or the second storage unit 20, the replacement condition proposing unit 41 determines whether the text content within a certain range of the text before the text is corrected and the text The content of the revised text is compared with the text content within a certain range. It is assumed that the certain range compared here is the same part of the document before and after the correction. Then, the replacement condition proposing unit 41 analyzes the correlation between the text contents before being corrected and the corrected text contents (step S16).

置換条件提案部４１は、分析の結果、ユーザが文字の置き換えを行った一定範囲以外の範囲について、文字の置き換えを実行する置換条件を推定して、第１格納部１０又は第２格納部２０に格納する。また、置換条件提案部４１は、推定した置換条件を以下に示すような方法でユーザに通知する（ステップＳ１７）。 As a result of the analysis, the replacement condition proposing unit 41 estimates the replacement condition for executing the character replacement in a range other than the certain range in which the user replaces the character, and then the first storage unit 10 or the second storage unit 20. To store. Further, the replacement condition proposing unit 41 notifies the user of the estimated replacement condition by the method described below (step S17).

例えば情報処理装置１に接続される表示部に推定した置換条件の内容を示すメッセージを表示させてもよいし、情報処理装置１に接続される音声出力部から、推定した置換条件の内容を示す音声メッセージを出力させてもよい。その結果、ユーザは、置換条件を考案しなくても、情報処理装置１が推定した置換条件を利用して、効率の良い文字置き換え作業を実現できる。 For example, a message indicating the content of the estimated replacement condition may be displayed on the display unit connected to the information processing apparatus 1, or the content of the estimated replacement condition may be displayed from the voice output unit connected to the information processing apparatus 1. A voice message may be output. As a result, the user can realize efficient character replacement work using the replacement condition estimated by the information processing device 1 without devising the replacement condition.

図１２は第３変形例に係る情報処理装置１の機能ブロックを示す一例の図である。図１２に示される情報処理装置１は、画像データ生成部２及び文字置換部３に加えて、誤字通知部５を備える。誤字通知部５は、元の文書中に記述されている文字を置換条件に照合した結果、元の文書中に記述されている文字に誤りがある場合、そのことをユーザに通知する機能である。 FIG. 12 is an example of functional blocks of the information processing apparatus 1 according to the third modification. The information processing device 1 illustrated in FIG. 12 includes a typographical error notification unit 5 in addition to the image data generation unit 2 and the character replacement unit 3. The typographical error notifying unit 5 has a function of notifying a user of a character described in the original document, if the character described in the original document has an error as a result of collating the character described in the original document with the replacement condition. ..

図１３は第３変形例に係る情報処理装置１の動作を説明するためのフローチャートである。ステップＳ１〜４の処理動作は、図３に示される処理動作と同様のため、説明を割愛する。 FIG. 13 is a flowchart for explaining the operation of the information processing device 1 according to the third modification. The processing operation of steps S1 to S4 is the same as the processing operation shown in FIG.

誤字通知部５は、文字が修正される前の文章の内容を示す情報と、文字が修正された後の文章の内容を示す情報と、誤字判定条件とを用いて、元の文書中に存在する誤りを検出する（ステップＳ２５）。 The typographical error notification unit 5 exists in the original document using information indicating the content of the sentence before the character is corrected, information indicating the content of the sentence after the character is corrected, and the typographical error determination condition. Error is detected (step S25).

上記の例で言えば、一点しんにょうの「辻」を含む「辻名誉教授」を、二点しんにょうの「辻」を含む「辻名誉教授」に置き換える、という置換条件を元に、誤字判定条件が作成される。 In the above example, the typographical error condition is based on the replacement condition of replacing "Tsuji", which contains "Tsuji" with one point, with "Tsuji", which contains "Tsuji" with two points. Created.

ここでの誤字判定条件は、例えば、「辻ＸＸ教授」の「ＸＸ」が「名誉」と一致するか否かである。誤字通知部５は、文字が修正される前の文章の内容を確認し、「辻ＸＸ教授」という文字列の「ＸＸ」が「名誉」以外の文字である場合、「名誉」以外の文字を含む文字列、例えば「辻客員教授」に対しては、一点しんにょうが正しいと判定する。 The typographical error determination condition here is, for example, whether or not "XX" of "Professor Tsuji XX" matches "honor". The typographical error notification unit 5 confirms the content of the sentence before the characters are corrected, and when "XX" in the character string "Tsuji XX" is a character other than "honor", the character other than "honor" is displayed. For a character string that contains, for example, "Tsuji visiting professor," it is determined that the one-point score is correct.

このような判定処理を行うことにより、誤字通知部５は、「辻客員教授」の「辻」が二点しんにょうで表記されている場合、一点しんにょうが正しいと判定し、例えば「辻客員教授の辻は一点しんにょうの誤りではありませんか？」などのメッセージをユーザに通知する（ステップＳ２６）。メッセージは、表示メッセージでもよいし音声メッセージでもよい。これにより、ユーザは、文字の置き換え前の文章中の「辻客員教授」の「辻」が、二点しんにょうで表記されている場合でも、この「辻」は二点しんにょうではなく一点しんにょうが本来正しいということを知ることができる。 By performing the determination process as described above, the typographical error notification unit 5 determines that one point is correct when "Tsuji" in "Tsuji visiting professor" is indicated by two points, and for example, "Tsuji visiting professor" The user is notified of a message such as "Is Tsuji all right?" (step S26). The message may be a display message or a voice message. As a result, even if "Tsuji" of "Tsuji visiting professor" in the text before the replacement of characters is written as a two-dot line, this "Tsuji" is not a two-dot line, but a one-point line. You can know that it is correct.

なお、誤字通知部５の機能は、第２変形例及び第３変形例に係る情報処理装置１に組み合わせることも可能である。 The function of the typographical error notification unit 5 can be combined with the information processing device 1 according to the second modification and the third modification.

図１４は本発明の実施の形態に係る情報処理装置１を実現するためのハードウェア構成例を示す図である。情報処理装置１は、プロセッサ１０１と、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）などで構成されるメモリ１０２と、ネットワークに接続するための入出力インタフェース１０３とにより実現することが可能である。プロセッサ１０１、メモリ１０２及び入出力インタフェース１０３は、バス１０４に接続され、バス１０４を介して、データ、制御情報などの受け渡しを相互に行うことが可能である。 FIG. 14 is a diagram showing a hardware configuration example for realizing the information processing device 1 according to the embodiment of the present invention. The information processing device 1 can be realized by a processor 101, a memory 102 including a RAM (Random Access Memory), a ROM (Read Only Memory), and an input/output interface 103 for connecting to a network. is there. The processor 101, the memory 102, and the input/output interface 103 are connected to the bus 104, and can exchange data, control information and the like with each other via the bus 104.

情報処理装置１を実現する場合、情報処理装置１用のプログラムをメモリ１０２に格納しておき、このプログラムをプロセッサ１０１が実行することにより、情報処理装置１の画像データ生成部２、文字置換部３、置換条件提案部４、修正受付部６、置換条件提案部４１、誤字通知部５などが実現される。情報処理装置１用のプログラムは、画像データ生成部２、文字置換部３、置換条件提案部４、置換条件提案部４１、誤字通知部５などの機能を実行するプログラムである。入出力インタフェース１０３は、文字画像データを第１格納部１０に保存するときなどに利用される。 When the information processing apparatus 1 is realized, a program for the information processing apparatus 1 is stored in the memory 102, and the processor 101 executes the program, so that the image data generation unit 2 and the character replacement unit of the information processing apparatus 1 are executed. 3, a replacement condition proposing unit 4, a correction receiving unit 6, a replacement condition proposing unit 41, a typographical error notifying unit 5, and the like are realized. The program for the information processing device 1 is a program that executes functions such as the image data generation unit 2, the character replacement unit 3, the replacement condition proposal unit 4, the replacement condition proposal unit 41, and the typographical error notification unit 5. The input/output interface 103 is used when, for example, the character image data is stored in the first storage unit 10.

以上に説明したように本実施の形態に係る情報処理装置１は、文章に記載される第１文字の字形と異なる字形の第２文字の画像を示す文字画像に基づき、第２文字の画像データを生成する画像データ生成部２と、文章において、第１文字の前又は後に位置する文字が特定の置換条件に合致するとき、第２文字の画像データに基づき第１文字を第２文字に置き換える文字置換部３とを備える。この構成により、特定の置換条件に合致する文字だけを、字形が異なる文字へ一括変更できる。従って、ユーザは特定の置換条件に合致する文字だけを選択する必要がなくなり、ユーザにかかる負担が大幅に軽減され、文字置換の作業時間が大幅に短縮される。 As described above, the information processing device 1 according to the present embodiment uses the image data of the second character based on the character image showing the image of the second character having a character shape different from the character shape of the first character described in the sentence. When the character positioned before or after the first character in the sentence matches the specific replacement condition, the first character is replaced with the second character based on the image data of the second character. The character replacement unit 3 is provided. With this configuration, only the characters that match the specific replacement condition can be collectively changed to the characters having different glyphs. Therefore, the user does not need to select only the characters that match a specific replacement condition, the burden on the user is significantly reduced, and the work time for character replacement is significantly reduced.

図１５は本実施の形態に係る情報処理装置１を備えた情報処理システム１Ａの構成例を示す図である。情報処理システム１Ａは、情報処理装置１と、通信ネットワーク２００と、処理装置であるサーバ３００とを備える。サーバ３００は、通信ネットワーク２００を介して、情報処理装置１と通信可能に接続される。情報処理システム１Ａによれば、例えばサーバ３００に保管された文字画像データを利用して文字の置換が可能であり、情報処理装置１の汎用性を高めることができる。 FIG. 15 is a diagram showing a configuration example of an information processing system 1A including the information processing device 1 according to the present embodiment. The information processing system 1A includes an information processing device 1, a communication network 200, and a server 300 that is a processing device. The server 300 is communicatively connected to the information processing device 1 via the communication network 200. According to the information processing system 1A, for example, the character image data stored in the server 300 can be used to replace characters, and the versatility of the information processing apparatus 1 can be enhanced.

また実施の形態に係る情報処理プログラムは、コンピュータに、文章に記載される第１文字の字形と異なる字形の第２文字の画像を示す文字画像に基づき、第２文字の画像データを生成させ、文章において、第１文字の前又は後に位置する文字が特定の置換条件に合致するとき、第２文字の画像データに基づき第１文字を第２文字に置き換えさせる。実施の形態に係る情報処理プログラムによれば、特定の置換条件に合致する文字だけを、字形が異なる文字へ一括変更できる。従って、ユーザは特定の置換条件に合致する文字だけを選択する必要がなくなり、ユーザにかかる負担が大幅に軽減され、文字置換の作業時間が大幅に短縮される。 Further, the information processing program according to the embodiment causes a computer to generate image data of a second character based on a character image indicating an image of a second character having a character shape different from the character shape of the first character described in a sentence, In the sentence, when the character positioned before or after the first character matches the specific replacement condition, the first character is replaced with the second character based on the image data of the second character. According to the information processing program according to the embodiment, only the characters that match a specific replacement condition can be collectively changed to the characters having different glyphs. Therefore, the user does not need to select only the characters that match a specific replacement condition, the burden on the user is significantly reduced, and the work time for character replacement is significantly reduced.

図１６は置換条件提案部４又は置換条件提案部４１に適用される機械学習部４００の一例を示す図である。機械学習とは、コンピュータに人のような学習能力を獲得させるための技術であり、コンピュータが、データ識別等の判断に必要なアルゴリズムを、事前に取り込まれる学習データから自律的に生成し、新たなデータについてこれを適用して予測を行う技術のことをいう。機械学習方法の一例を以下に説明する。
（教師あり学習）事前に与えられたサンプルとなるデータをもとにデータの識別や法則性の導出を行う手法（例：回帰分析、SVM（サポートベクターマシーン））。
（教師なし学習）サンプルとなるデータがない状態で、実データそのものを解析することで、データに存在する本質的な構造や特徴を抽出する手法（例：ｋ平均法（K-Means）、潜在的意味インデックス（LSI）、トピックモデル手法（LDA））。
（半教師学習）少数のサンプルを用いて学習をおこない、その後ある程度の実データを分類して、その結果のうち高い確度のものをサンプルと捉えなおして再度学習をする手法。（例：ブートストラップ法、Adaboost）
（構造学習）個別にデータを推定せず、データ全体の構造に最適化した形で個々の推定をまとめて行う手法。（例：構造化SVM、条件付き確率場（CRF））。
（強化学習）サンプルが存在しないが、代わりに学習した後からフィードバック情報を得ることでさらなる学習の手がかりとする手法。（例：バンディットアルゴリズム、UCBアルゴリズム）。
（深層学習）人の脳の構造をソフトウェア的に模倣するニューラルネットという手法を多層化し、高度化を図った手法（例：制限付きボルツマン機械、Category2Vec）。図１６には、例えばニューラルネットで構成された機械学習部４００が示される。 FIG. 16 is a diagram showing an example of the machine learning unit 400 applied to the replacement condition proposing unit 4 or the replacement condition proposing unit 41. Machine learning is a technology for allowing a computer to acquire human-like learning ability.The computer autonomously generates algorithms necessary for judgment such as data identification from learning data that is captured in advance, and This is a technique for applying predictions to various data to make predictions. An example of the machine learning method will be described below.
(Supervised learning) A method of identifying data and deriving the rules based on sample data given in advance (eg regression analysis, SVM (Support Vector Machine)).
(Unsupervised learning) A method of extracting the essential structure and features existing in the data by analyzing the actual data itself in the absence of sample data (eg, k-means (K-Means), latent Semantic Index (LSI), Topic Model Method (LDA)).
(Semi-supervised learning) A method in which learning is performed using a small number of samples, then actual data is classified to some extent, and the one with high accuracy among the results is re-learned as a sample. (Example: Bootstrap method, Adaboost)
(Structure learning) A method of collectively estimating individual data in a form optimized for the structure of the entire data, without estimating the data individually. (Eg structured SVM, conditional random field (CRF)).
(Reinforcement learning) There is no sample, but instead, it is used as a clue for further learning by obtaining feedback information after learning. (Example: Bandit algorithm, UCB algorithm).
(Deep learning) A technique that multi-layers a technique called a neural network that mimics the structure of the human brain in a software manner to improve its performance (eg, restricted Boltzmann machine, Category2Vec). FIG. 16 shows a machine learning unit 400 including, for example, a neural network.

上記の機械学習のための学習方法は、教師あり学習、教師なし学習、半教師学習、構造学習、強化学習、深層学習のいずれかの方法でもよく、さらに、これらの学習方法を組み合わせた学習方法でもよく、機械学習のための学習方法は問わない。 The learning method for machine learning may be any of supervised learning, unsupervised learning, semi-supervised learning, structure learning, reinforcement learning, and deep learning, and a learning method combining these learning methods. However, the learning method for machine learning does not matter.

置換条件提案部４は、修正後の文章と修正前の文章とを用いて機械学習された機械学習部４００である場合、分析対象の文章の分野、置換条件などを問わずに、置換の誤りが少ない置換条件を判定することが可能になる。 When the replacement condition proposing unit 4 is the machine learning unit 400 that is machine-learned using the corrected sentence and the uncorrected sentence, the replacement error may occur regardless of the field of the analysis target sentence, the replacement condition, or the like. It is possible to determine a replacement condition with less.

置換条件提案部４１は、修正後の文章の一定範囲に記述された文字と、修正前の文章の内、一定範囲と同じ範囲に記述された文字とを用いて、機械学習された機械学習部４００である場合、最適な置換条件を速やかに提案することができる。 The substitution condition proposing unit 41 uses a character described in a fixed range of the corrected sentence and a character described in the same range as the fixed range in the uncorrected sentence to perform a machine learning machine learning unit. When it is 400, the optimum replacement condition can be promptly proposed.

１：情報処理装置
１Ａ：情報処理システム
２：画像データ生成部
３：文字置換部
４：置換条件提案部
５：誤字通知部
６：修正受付部
１０：第１格納部
２０：第２格納部
４１：置換条件提案部
１００Ａ：画面
１００Ｂ：画面
１０１：プロセッサ
１０２：メモリ
１０３：入出力インタフェース
１０４：バス
２００：通信ネットワーク
３００：サーバ
４００：機械学習部 1: Information processing apparatus 1A: Information processing system 2: Image data generation unit 3: Character replacement unit 4: Replacement condition proposal unit 5: Error character notification unit 6: Correction acceptance unit 10: First storage unit 20: Second storage unit 41 : Replacement condition proposing unit 100A: Screen 100B: Screen 101: Processor 102: Memory 103: Input/output interface 104: Bus 200: Communication network 300: Server 400: Machine learning unit

特開２０１４−０８１７５３号公報JP, 2014-081753, A

Claims

An image data generation unit that generates image data of the second character based on a character image showing an image of a second character having a character shape different from the character shape of the first character described in the sentence;
In the sentence, when a character positioned before or after the first character matches a specific replacement condition, a character replacement unit that replaces the first character with the second character based on image data of the second character,
An information processing apparatus including.

The information processing apparatus according to claim 1, further comprising a correction acceptance unit that accepts a correction by the user for the sentence including the second character replaced by the character replacement unit.

When the correction receiving unit receives a correction made to the sentence by the user, the plurality of replacement conditions used for the replacement with the second character by comparing the corrected sentence with the uncorrected sentence. The information processing apparatus according to claim 2, further comprising: a replacement condition suggesting unit configured to analyze a correlation between the replaced second character and the replaced second character, and notify the replacement condition with few replacement errors by a display message or a voice message. .

The information processing apparatus according to claim 3, wherein the replacement condition proposing unit is a machine learning unit that is machine-learned using the corrected sentence and the uncorrected sentence.

When the correction accepting unit receives a correction by the user for a certain range of the sentence, the characters described in the certain range of the corrected sentence and the same range of the uncorrected sentence as the certain range are described. By analyzing the correlation between the content of the text within the certain range before correction and the content of the text within the certain range after modification by comparing the characters that have been corrected, and the replacement condition for other than the certain range of the text. The information processing apparatus according to claim 2, further comprising: a replacement condition proposing unit that notifies, by a display message or a voice message.

The replacement condition proposing unit uses characters described in a certain range of the corrected sentence and characters described in the same range as the certain range in the sentence in which the first character is described, The information processing apparatus according to claim 5, which is a machine learning unit that has been machine-learned.

Based on the typographical error determination condition for determining that the character described in the sentence before being replaced with the second character is erroneous, it is detected that the character matching the substitution condition is typographical error, and the detected result is displayed. The information processing apparatus according to any one of claims 3 to 6, further comprising a typographical error notifying unit that notifies a message or a voice message.

An information processing system comprising: the information processing apparatus according to claim 1; a communication network; and a processing apparatus communicatively connected to the information processing apparatus via the communication network.

On the computer,
Generating image data of the second character based on a character image showing an image of a second character having a character shape different from the character shape of the first character described in the sentence;
An information processing program for replacing the first character with the second character based on the image data of the second character when a character positioned before or after the first character in the sentence matches a specific replacement condition.