JP6076285B2

JP6076285B2 - Translation apparatus, translation method, and translation program

Info

Publication number: JP6076285B2
Application number: JP2014062001A
Authority: JP
Inventors: 光昭小関
Original assignee: Zenrin Datacom Co Ltd
Current assignee: Zenrin Datacom Co Ltd
Priority date: 2014-03-25
Filing date: 2014-03-25
Publication date: 2017-02-08
Anticipated expiration: 2034-03-25
Also published as: JP2015184998A

Description

この発明は、例えば、地図上に記載される文字情報であるいわゆる地図注記などのように、大量に存在する名詞句の羅列データの翻訳を行うための装置、方法、プログラムに関する。 The present invention relates to an apparatus, a method, and a program for translating enumeration data of a large number of noun phrases such as so-called map notes that are character information described on a map.

従来から機械翻訳装置に関する種々の発明がなされている。例えば、後に記す特許文献１には、固有名詞を翻訳する場合であっても、固有名詞辞書を適切に整備しておくことにより正確に翻訳することができることが記載されている。また、同文献には、固有名詞辞書に登録された各固有名詞に対する捕捉語を記憶保持する捕捉語辞書をも用いることにより、固有名詞がどのようなものかについても翻訳結果に含めるようにすることが記載されている。これにより、翻訳結果をより理解し易いものとすることができるという点で効果がある。 Conventionally, various inventions relating to machine translation devices have been made. For example, Patent Document 1 described later describes that even when a proper noun is translated, it can be accurately translated by properly preparing a proper noun dictionary. In addition, the same document also includes a captured word dictionary that stores captured words for each proper noun registered in the proper noun dictionary, so that the proper result is included in the translation result. It is described. This is advantageous in that the translation result can be made easier to understand.

特開２００４−２２０４１６号公報JP 2004-220416 A

従来の機械翻訳装置の場合、翻訳のために種々の辞書が必要になっている。上述した特許文献１に記載の発明でも固有名詞などの翻訳辞書に加えて捕捉語辞書も必要なっている。翻訳のために辞書を検索する処理にはある程度時間がかかるために、大量の固有名詞などの名詞句の羅列の翻訳を行う場合には時間が掛かる。例えば、大量の名詞句の羅列の翻訳行う場合の例として、地図注記の翻訳を行う場合について考える。 In the case of a conventional machine translation device, various dictionaries are required for translation. The invention described in Patent Document 1 described above also requires a captured word dictionary in addition to a translation dictionary such as proper nouns. Since it takes a certain amount of time to search the dictionary for translation, it takes time to translate a list of noun phrases such as a large number of proper nouns. For example, consider the case of translating map notes as an example of translating a large number of noun phrases.

地図には、例えば、市街図、道路図、広域図、地方図、全国図といった種々のものがあり、さらに同じ種類の地図でも縮尺の異なる複数の地図が存在する。このように、種類や縮尺の異なる全ての地図を対象にした場合、地図注記は数千万件も存在する。そして、数字、アルファベット、アイコンのみにより構成される地図注記などの翻訳が必要ないものは除外し、重複する地図注記は１つに集約したとしても、翻訳対象となる地図注記は数百万件も存在する。 There are various maps such as city maps, road maps, regional maps, regional maps, and national maps, and there are a plurality of maps of different scales even with the same type of map. Thus, when all types of maps of different types and scales are targeted, there are tens of millions of map notes. And even if map annotations consisting only of numbers, alphabets, and icons are excluded, those that do not need to be translated are excluded, and even if multiple overlapping map notes are consolidated into one, millions of map notes are subject to translation. Exists.

この数百万件の地図注記を、従来の辞書を参照する方式で多言語に機械翻訳する場合には、数週間を要しているのが現状である。また、地図注記は、地図上の地物が無くなったり、新たにできたりした場合には、必ず変わるため、所定のタイミングで現地調査をして更新し、翻訳し直す必要がある。 It takes several weeks to machine-translate these millions of map notes into multiple languages using a conventional dictionary reference method. In addition, map notes always change when features on the map are lost or newly created, so it is necessary to conduct a field survey at a predetermined timing to update and re-translate.

このように、地図注記は、翻訳を繰り返す必要があるため、できるだけ高速に翻訳したいとする要求が従来からある。このことは地図注記だけでなく、住所リスト、電話帳リスト、顧客リストなどに記載された、例えば、地名、名称、名前、住所といった多数の名詞句の羅列を翻訳する場合にも同様に生じている課題である。 As described above, since map annotation needs to be repeatedly translated, there is a conventional request to translate as fast as possible. This also occurs when translating a large number of noun phrases such as place names, names, names, addresses, etc., not only in map notes but also in address lists, phone book lists, customer lists, etc. It is a problem.

以上のことに鑑み、この発明は、大量に存在する名詞句の羅列データを、従来の機械翻訳装置に比べて飛躍的に速く、且つ、正確に翻訳できるようにすることを目的とする。 In view of the above, an object of the present invention is to make it possible to translate a large amount of noun phrase enumeration data much faster and more accurately than conventional machine translation devices.

上記課題を解決するため、請求項１に記載の発明の翻訳装置は、
文字列とその読み仮名とを対応付けた読み仮名辞書と、
前記読み読み仮名辞書の内容をプログラムロジックに変換し、入力された文字列に読み仮名を付与する読み仮名付与プログラムを作成する第１のプログラム作成手段と、
読み仮名とその読み仮名に対応する所定の言語の文字列とを対応付けた翻訳辞書と、
前記翻訳辞書の内容をプログラムロジックに変換し、入力された読み仮名を所定の言語に翻訳する翻訳プログラムを作成する第２のプログラム作成手段と、
前記第１のプログラム作成手段で作成された前記読み仮名付与プログラムを実行し、翻訳対象の文字列に読み仮名を付与する読み仮名付与手段と、
前記第２のプログラム作成手段で作成された翻訳プログラムを実行し、前記読み仮名付与手段で付与された読み仮名を目的とする言語に翻訳する翻訳手段と
を備えることを特徴とする。 In order to solve the above problem, a translation device according to claim 1 is provided:
A phonetic dictionary that associates strings with their phonetic kana,
First program creating means for converting a content of the reading-reading kana dictionary into program logic and creating a reading-kana adding program for adding a reading kana to an input character string;
A translation dictionary associating a reading kana with a character string of a predetermined language corresponding to the reading kana,
Second program creating means for converting the contents of the translation dictionary into program logic and creating a translation program for translating the input reading pseudonym into a predetermined language;
A reading pseudonym giving unit that executes the reading pseudonym giving program created by the first program creation unit and gives a reading pseudonym to a character string to be translated;
A translation unit configured to execute the translation program created by the second program creation unit and translate the reading pseudonym given by the reading pseudonym granting unit into a target language.

この請求項１に記載の発明の翻訳装置によれば、読み仮名辞書は、第１のプログラム作成手段によって、その内容がプログラムロジックに変換され、入力さえた文字列に対してその読み仮名を付与する読み仮名付与プログラムとなる。同様に、翻訳辞書は、第２のプログラム作成手段によって、その内容がプログラムロジックに変換され、読み仮名を所定の言語に翻訳する翻訳プログラムとなる。 According to the translation device of the first aspect of the present invention, the reading kana dictionary is converted into the program logic by the first program creating means, and the reading kana is given to the inputted character string. It becomes a reading pseudonym grant program. Similarly, the translation dictionary is converted into a program logic by the second program creation means, and becomes a translation program for translating reading kana into a predetermined language.

当該翻訳装置では、読み仮名変換手段が、第１のプログラム作成手段で作成された前記読み仮名付与プログラムを実行し、翻訳対象の文字列に読み仮名を付与する。当該読み仮名付与プログラムは、読み仮名辞書の内容がそのままプログラムロジックとされたものなので、外部の読み仮名辞書を参照する必要は全く無い。 In the translation apparatus, the reading-kana conversion unit executes the reading-kana adding program created by the first program creating unit, and gives a reading-kana to the character string to be translated. Since the contents of the reading kana dictionary are directly used as the program logic, the reading kana giving program does not need to refer to an external reading kana dictionary at all.

当該翻訳装置では、翻訳手段が、第２のプログラム作成手段で作成された翻訳プログラムを実行し、読み仮名付与手段で付与された読み仮名を目的とする言語に翻訳する。当該翻訳プログラムは、翻訳辞書の内容がそのままプログラムロジックとされたものなので、外部の翻訳辞書を参照する必要は全く無い。 In the translation apparatus, the translation means executes the translation program created by the second program creation means, and translates the reading kana given by the reading kana giving means into the target language. In the translation program, the contents of the translation dictionary are directly used as program logic, so there is no need to refer to an external translation dictionary.

このように、当該翻訳装置では、翻訳対象の文字列に読み仮名を付与する場合にも、また、付与された読み仮名を目的とする言語に翻訳する場合にも、外部の辞書を全く参照しないため、極めて高速に翻訳を行える。従って、翻訳対象の文字列が多数存在していても、超高速に翻訳が行える。 In this way, the translation device does not refer to an external dictionary at all when adding a reading kana to a character string to be translated or when translating a given reading kana into a target language. Therefore, translation can be performed at extremely high speed. Therefore, even if there are many character strings to be translated, translation can be performed at a very high speed.

この発明によれば、例えば、地図注記などの大量に存在する名詞句の羅列を、従来の機械翻訳装置に比べて飛躍的に速く、且つ、正確に翻訳できる。 According to the present invention, for example, a large number of noun phrases such as map notes can be translated significantly faster and more accurately than conventional machine translation devices.

実施の形態の翻訳装置の構成例と動作の概要を説明するためのブロック図である。It is a block diagram for demonstrating the example of a structure of the translation apparatus of embodiment, and the outline | summary of operation | movement. 読み仮名辞書群１１１の内の固有名詞読み仮名辞書の一例を示す図である。It is a figure which shows an example of the proper noun reading kana dictionary in the reading kana dictionary group 111. FIG. 図２に示した固有名詞読み仮名辞書をプログラムロジックに変換して形成した読み仮名付与プログラムの例を示す図である。FIG. 3 is a diagram showing an example of a reading kana giving program formed by converting the proper noun reading kana dictionary shown in FIG. 2 into program logic. 入力データ「２３，１３８，小碓通」を翻訳処理する場合の例を説明するための図である。It is a figure for demonstrating the example in the case of translating input data "23,138, Kominatotsu". 従来の翻訳装置における翻訳処理の概要を説明するための図である。It is a figure for demonstrating the outline | summary of the translation process in the conventional translation apparatus. 実施の形態の翻訳装置で行われる翻訳処理のための事前準備処理を説明するためのフローチャートである。It is a flowchart for demonstrating the prior preparation process for the translation process performed with the translation apparatus of embodiment. 実施の形態の翻訳装置で行われる翻訳処理を説明するためのフローチャートである。It is a flowchart for demonstrating the translation process performed with the translation apparatus of embodiment.

以下、図を参照しながら、この発明の装置、方法、プログラムの一実施の形態について説明する。この発明は、地図注記、電話帳リスト、顧客リストなどに記載された地名、名称、名前、住所といった、大量に存在する名詞句の羅列データについて翻訳を行う場合に適用可能なものである。「名詞句」との文言は、物体、物質、人物、場所など具体的な対象を示す語句を意味する。この明細書において、翻訳対象となる「名詞句の羅列」の具体例として、地図注記、電話帳リスト、顧客リストなどに記載された、例えば、地名、名称、名前、住所といったものを想定している。 Hereinafter, embodiments of the apparatus, method, and program of the present invention will be described with reference to the drawings. The present invention can be applied to translation of enumerated data of a large number of noun phrases such as place names, names, names, and addresses described in map notes, telephone directory lists, customer lists, and the like. The term “noun phrase” means a phrase indicating a specific object such as an object, a substance, a person, or a place. In this specification, as a specific example of “a list of noun phrases” to be translated, for example, a place name, a name, a name, an address described in a map note, a phone book list, a customer list, etc. Yes.

以下においては、地図注記の翻訳を行う場合を例にして具体的に説明する。地図注記は、上述もしたように大量に存在し、頻繁に更新されるため、その更新の都度、翻訳し直さなければならないものであること。また、近年おいては、外国から日本を訪れる観光客やビジネスマンも増えてきており、最新の地図の地図注記を所定の外国語に翻訳して表した外国語対応の最新の地図を迅速に提供することが望まれていること。しかも、外国語は、中国語（簡体）、中国語（繁体）、英語、韓国語などというように、複数の言語に対応することが求められることなどの事情がある。 In the following, a specific description will be given by taking as an example the case of translating a map note. Map annotations exist in large quantities as described above, and are frequently updated. Therefore, map annotations must be re-translated each time they are updated. In recent years, the number of tourists and business people visiting Japan from abroad has increased, and the latest maps for foreign languages, which are translated from the map annotations of the latest maps into the specified foreign languages, can be quickly displayed. What you want to provide. In addition, foreign languages are required to support a plurality of languages such as Chinese (simplified), Chinese (traditional), English, and Korean.

このような背景の下、以下に説明する実施の形態の装置、方法、プログラムは、大量に存在する地図注記を、従来の機械翻訳装置に比べて、飛躍的に速く、しかも適切に、且つ、複数の言語に翻訳することを実現している。 Under such a background, the apparatus, method, and program according to the embodiments described below are capable of converting a large amount of map annotations significantly faster and more appropriately than a conventional machine translation apparatus, and It has been translated into multiple languages.

［ビッグデータ翻訳装置の構成例］
まず、この実施の形態のビッグデータ翻訳装置（以下、単に翻訳装置と記載する。）の構成例と動作の概要について説明する。図１は、この実施の形態の翻訳装置の構成例と動作の概要を説明するためのブロック図である。図１に示す翻訳装置において、制御部１０１は、当該翻訳装置の各部を制御する機能を、記憶装置１０２は情報記憶保持機能を実現し、操作部１０３は、ユーザーインターフェース機能を実現する。 [Configuration example of big data translation device]
First, a configuration example and an outline of operation of a big data translation apparatus (hereinafter simply referred to as a translation apparatus) of this embodiment will be described. FIG. 1 is a block diagram for explaining a configuration example and an outline of operation of the translation apparatus according to this embodiment. In the translation apparatus shown in FIG. 1, the control unit 101 implements a function for controlling each unit of the translation apparatus, the storage device 102 implements an information storage holding function, and the operation unit 103 implements a user interface function.

読み仮名辞書群１１１は、固有名詞読み仮名辞書、一般名詞読み仮名辞書、接頭語読み仮名辞書、接尾語読み仮名辞書、カタカナ語読み仮名辞書などの複数の読み仮名辞書を備え、日本語の地図注記データに対する読み仮名を提供する。なお、図示していないが、接頭語読み仮名辞書は、固有名詞接頭語読み仮名辞書、一般名詞接頭語読み仮名辞書、カタカナ語接頭語読み仮名辞書がある。同様に、接尾語読み仮名辞書は、固有名詞接尾語読み仮名辞書、一般名詞接尾語読み仮名辞書、カタカナ語接尾語読み仮名辞書がある。 The reading kana dictionary group 111 includes a plurality of reading kana dictionaries such as a proper noun reading kana dictionary, a general noun reading kana dictionary, a prefix reading kana dictionary, a suffix reading kana dictionary, a katakana reading kana dictionary, and a Japanese map. Provides reading kana for note data. Although not shown, the prefix reading kana dictionary includes a proper noun prefix reading kana dictionary, a general noun prefix reading kana dictionary, and a katakana prefix reading kana dictionary. Similarly, the suffix reading kana dictionary includes a proper noun suffix reading kana dictionary, a general noun suffix reading kana dictionary, and a katakana suffix reading kana dictionary.

また、固有名詞読み仮名辞書、一般名詞読み仮名辞書、カタカナ語読み仮名辞書のそれぞれは、部分一致により漢字の読み仮名の検索が可能なものである。また、各接頭語読み仮名辞書は、接頭語の読み仮名を提供するものであるので、前方一致により漢字の読み仮名の検索が可能なものである。また、各接尾語読み仮名辞書は、接尾語の読み仮名を提供するものであるので、後方一致により漢字の読み仮名の検索が可能なものである。 Each of the proper noun reading kana dictionary, the general noun reading kana dictionary, and the katakana reading kana dictionary is capable of searching for kanji reading kana by partial matching. Each prefix reading kana dictionary provides the reading kana of the prefix, so that it is possible to search for kanji reading kana by forward matching. Each suffix reading kana dictionary provides the reading kana of the suffix, so that the kanji reading kana can be searched by backward matching.

読み仮名付与プログラム作成部（第１のプログラム作成手段）１１２は、読み仮名辞書群１１１の各読み仮名辞書の内容をプログラムロジックに変換し、地図注記データに対して読み仮名を付与する読み仮名付与プログラムを作成する。読み仮名付与プログラム作成部１１２により作成された読み仮名付与プログラムは、読み仮名付与プログラム格納部１１３に格納される。読み仮名付与プログラム格納部１１３は、例えば、記憶装置１０２の所定の格納領域に形成される。 The reading kana giving program creation unit (first program creating means) 112 converts the contents of each reading kana dictionary in the reading kana dictionary group 111 into program logic, and gives the reading kana giving the reading kana to the map note data. Create a program. The reading pseudonym assignment program created by the reading pseudonym assignment program creation unit 112 is stored in the reading pseudonym assignment program storage unit 113. The reading pseudonym assignment program storage unit 113 is formed in a predetermined storage area of the storage device 102, for example.

図２は、読み仮名辞書群１１１の内の固有名詞読み仮名辞書の一例を示す図である。図３は、図２に示した固有名詞読み仮名辞書の一例から読み仮名付与プログラム作成部１１２により作成され、読み仮名付与プログラム格納部１１３に格納される読み仮名付与プログラムの一例を示す図である。 FIG. 2 is a diagram illustrating an example of the proper noun reading kana dictionary in the reading kana dictionary group 111. FIG. 3 is a diagram showing an example of a reading pseudonym assignment program created by the reading pseudonym assignment program creation unit 112 from the example of the proper noun reading kana dictionary shown in FIG. .

図２に示すように、読み仮名辞書は、エリアコード、属性番号、固有名詞及びその読み仮名からなる。エリアコードは、所定のエリアを一意に特定する情報である。エリアコードで特定される所定のエリアとしては、例えば都道府県がある。もちろん、都道府県よりも細分化したエリアとすることも可能である。属性番号は、地図注記として用いられる固有名詞がどのような属性のものかを示すものであり、例えば、地名、教育機関、公共機関、山岳、河川など１００以上の属性（レイヤ）に分類されている。 As shown in FIG. 2, the reading kana dictionary includes an area code, an attribute number, a proper noun, and its reading kana. The area code is information that uniquely identifies a predetermined area. As the predetermined area specified by the area code, for example, there is a prefecture. Of course, it is possible to make the area more detailed than the prefecture. The attribute number indicates what kind of attribute the proper noun used as the map annotation is, and for example, it is classified into 100 or more attributes (layers) such as place names, educational institutions, public institutions, mountains, rivers, etc. Yes.

また、図２において、固有名詞は、地図注記として用いられる固有名詞であり、読み仮名は、対応する固有名詞の読み仮名である。図２に示した固有名詞読み仮名辞書の一例は、愛知県（エリアコード＝２３）の、地名（属性番号＝１３８）の一部を示すものである。なお、ここでは、固有名詞読み仮名辞書の例を示したが、その他の一般名詞読み仮名辞書、接頭語読み仮名辞書、接尾語読み仮名辞書、カタカナ語読み仮名辞書も基本的に図２に示した固有名詞読み仮名辞書と同様の構成を有する。 Moreover, in FIG. 2, a proper noun is a proper noun used as a map note, and a reading kana is a reading kana of a corresponding proper noun. An example of the proper noun reading kana dictionary shown in FIG. 2 shows a part of a place name (attribute number = 138) in Aichi Prefecture (area code = 23). In addition, although the example of the proper noun reading kana dictionary was shown here, the other general noun reading kana dictionary, the prefix reading kana dictionary, the suffix reading kana dictionary, and the katakana reading kana dictionary are basically shown in FIG. It has the same configuration as the proper noun reading kana dictionary.

図２に示した固有名詞読み仮名辞書を、読み仮名付与プログラム作成部１１２によりプログラムロジックに変換すると、図３に示す読み仮名付与プログラムが得られる。図３に示した読み仮名付与プログラムは、プログラミング言語としてＰｅｒｌ（パール）によって記述されたものである。図３に示す読み仮名付与プログラムは、エリアコードと属性番号が一致し、且つ、地図注記（この例では固有名詞である地名）が一致したら、地図注記と読み仮名を出力するものである。 When the proper noun reading kana dictionary shown in FIG. 2 is converted into program logic by the reading kana giving program creation unit 112, the reading kana giving program shown in FIG. 3 is obtained. The reading pseudonym assignment program shown in FIG. 3 is written in Perl as a programming language. The reading kana giving program shown in FIG. 3 outputs a map note and a reading kana when the area code and the attribute number match and the map note (a place name which is a proper noun in this example) matches.

読み仮名付与プログラム作成部１１２は、具体例を示せば、図２に示した固有名詞読み仮名辞書から、図３に示した読み仮名付与プログラムを作成するものである。この実施の形態においては、最長一致法により地図注記の読み仮名を特定する。このため、読み仮名付与プログラム作成部１１２は、読み仮名辞書の内容を、名詞句（図２、図３に示した例では固有名詞）を長いもの順に並び変えたプログラムを作成している。図３に示した読み仮名付与プログラムを見れば分かるように、読み仮名辞書の内容がそのままプログラムロジックとなっているので、外部の辞書を参照する必要の全く無いプログラムとなっている。 If a specific example is shown, the reading kana giving program creation unit 112 creates the reading kana giving program shown in FIG. 3 from the proper noun reading kana dictionary shown in FIG. In this embodiment, the kana reading of the map note is specified by the longest match method. For this reason, the reading-kana adding program creating unit 112 creates a program in which the contents of the reading-kana dictionary are rearranged in the order of long noun phrases (proper nouns in the examples shown in FIGS. 2 and 3). As can be seen from the reading-a-kana giving program shown in FIG. 3, the contents of the reading-a-kana dictionary are directly program logic, so that the program does not need to refer to an external dictionary at all.

実際には、読み仮名辞書群１１１は、上述したように複数の読み仮名辞書を備えているため、読み仮名付与プログラム作成部１１２により作成される読み仮名付与プログラムは、各読み仮名辞書に対応したプログラムロジック部を有する。より具体的には、読み仮名付与プログラムは、初期処理部、ユーザー辞書提供部、固有名詞接尾語辞書（後方一致）ロジック部、固有名詞接頭語辞書（前方一致）ロジック部、固有名詞辞書（部分一致）ロジック部、一般名詞接尾語辞書（後方一致）ロジック部、一般名詞接頭語辞書（前方一致）ロジック部、一般有名詞辞書（部分一致）ロジック部、カタカナ語一般名詞接尾語辞書（後方一致）ロジック部、カタカナ語一般名詞接頭語辞書（前方一致）ロジック部、カタカナ語一般有名詞辞書（部分一致）ロジック部、例外処理部、終了処理部の各部からなる。 Actually, since the reading-kana dictionary group 111 includes a plurality of reading-kana dictionaries as described above, the reading-kana adding program created by the reading-kana adding program creating unit 112 corresponds to each reading kana dictionary. It has a program logic part. More specifically, the reading pseudonym assignment program includes an initial processing unit, a user dictionary providing unit, a proper noun suffix dictionary (backward match) logic unit, a proper noun prefix dictionary (forward match) logic unit, a proper noun dictionary (part) Match) logic part, general noun suffix dictionary (backward match) logic part, general noun prefix dictionary (forward match) logic part, general noun dictionary (partial match) logic part, Katakana general noun suffix dictionary (backward match) ) Logic unit, Katakana general noun prefix dictionary (forward match) logic unit, Katakana general noun dictionary (partial match) logic unit, exception processing unit, and termination processing unit.

読み仮名付与プログラム作成部１１２は、上記の各ロジック部の作成（生成）時には、（１）処理対象の読み仮名辞書から同じエリアコードと属性番号の辞書データ（エリアコード、属性番号、名詞句、読み仮名）を抽出する。そして、（２）読み仮名付与プログラム作成部１１２は、抽出した辞書データを名詞句の文字数の多いもの順に並べ変える。最長一致により読み仮名の抽出を可能にするためである。当該（１）、（２）の処理を処理対象の辞書から同じエリアコードと属性番号の辞書データが無くなるまで繰り返し、無くなれば、次のエリアコードと属性番号を有する辞書データの処理に移る。 At the time of creation (generation) of each of the logic units described above, the reading-kana giving program creating unit 112 (1) dictionary data (area code, attribute number, noun phrase, (Reading Kana) is extracted. Then, (2) the reading pseudonym assignment program creating unit 112 rearranges the extracted dictionary data in the order of the noun phrase having the largest number of characters. This is because it is possible to extract a reading kana by the longest match. The processes (1) and (2) are repeated until there is no dictionary data with the same area code and attribute number from the dictionary to be processed, and when there is no more, the process moves to processing of dictionary data having the next area code and attribute number.

このようにして、処理対処の全ての辞書データについて処理が完了すると、当該処理対処の読み仮名辞書に対応するロジック部が作成（生成）できる。そして、読み仮名辞書群１１１の各読み仮名辞書について、プログラムロジックを作成することにより、前方一致、後方一致、部分一致で用いられる各読み仮名辞書を最長一致法で読み仮名を抽出する読み仮名付与プログラムロジックに変換できる。 In this way, when processing is completed for all dictionary data for processing, a logic unit corresponding to the reading-kana dictionary for the processing can be created (generated). Then, by creating a program logic for each reading-kana dictionary in the reading-kana dictionary group 111, each reading-kana dictionary used for forward matching, backward matching, and partial matching is extracted with the longest matching method. Can be converted to program logic.

言語別翻訳辞書群１２１は、この実施の形態では、日本語の地図注記データの読み仮名を、目的とする外国語に翻訳するための言語別の翻訳辞書を備える。この実施の形態では、中国語（簡体）、中国語（繁体）、英語、韓国語の４言語に対応するため、４言語の翻訳辞書を備える。更に、各言語の翻訳辞書は、大きく分けると、一般名詞に関する辞書やカタカナ語に関する辞書を備える。一般名詞に関する辞書は、一般名詞接尾語辞書、一般名詞接頭語辞書、一般名詞辞書からなる。カタカナ語に関する辞書は、カタカナ語固有名詞接尾語辞書、カタカナ語固有名詞接頭語辞書、カタカナ語固有一般名詞辞書、カタカナ語一般名詞接尾語辞書、カタカナ語一般名詞接頭語辞書、カタカナ語一般名詞辞書からなる。 In this embodiment, the language-specific translation dictionary group 121 includes language-specific translation dictionaries for translating the reading pseudonym of Japanese map annotation data into a target foreign language. In this embodiment, a four-language translation dictionary is provided to support four languages: Chinese (simplified), Chinese (traditional), English, and Korean. Furthermore, the translation dictionaries for each language are roughly divided into a dictionary for general nouns and a dictionary for katakana. The dictionary for general nouns consists of a general noun suffix dictionary, a general noun prefix dictionary, and a general noun dictionary. The dictionary for Katakana is Katakana proper noun suffix dictionary, Katakana proper noun prefix dictionary, Katakana proper noun dictionary, Katakana general noun suffix dictionary, Katakana general noun prefix dictionary, Katakana general noun dictionary Consists of.

また、一般名詞辞書、カタカナ語固有名詞辞書、カタカナ語一般名詞辞書のそれぞれは、部分一致により読み仮名などに対応する翻訳データ（翻訳語句）の検索が可能なものである。また、各接頭語辞書は、接頭語に対応する翻訳データを提供するものであるので、前方一致により接頭語の翻訳データの検索が可能なものである。また、各接尾語辞書は、接尾語の翻訳データを提供するものであるので、後方一致により接尾語の翻訳データの検索が可能なものである。 Each of the general noun dictionary, the katakana proper noun dictionary, and the katakana general noun dictionary is capable of searching translation data (translation word / phrase) corresponding to a reading kana by partial matching. Each prefix dictionary provides translation data corresponding to the prefix, so that the prefix translation data can be searched by forward matching. Since each suffix dictionary provides suffix translation data, the suffix translation data can be searched by backward matching.

言語別翻訳辞書群１２１が有する翻訳辞書もまた、図２に示した読み仮名辞書の場合と同様に、エリアコード、属性番号を備える。翻訳辞書の場合、図２における「固有名詞」が「読み仮名」などとなり、図２における「読み仮名」が「所定の言語の文字列（翻訳データ）」となるものである。 The translation dictionary included in the language-specific translation dictionary group 121 also includes an area code and an attribute number, as in the case of the reading kana dictionary shown in FIG. In the case of a translation dictionary, the “proper noun” in FIG. 2 is “Yomikana” or the like, and the “Yomikana” in FIG. 2 is “a character string (translation data) in a predetermined language”.

言語別翻訳プログラム作成部（第２のプログラム作成手段）１２２は、言語別翻訳辞書群１２１の各翻訳辞書の内容をプログラムロジックに変換し、地図注記データの読み仮名などを、所定の言語に変換する言語別翻訳プログラムを作成する。言語別翻訳プログラム作成部１２２により作成された言語別翻訳プログラムは、言語別翻訳プログラム格納部１２３に格納される。言語別翻訳プログラム格納部１２３、例えば、記憶装置１０２の所定の格納領域に形成される。 The language-specific translation program creation unit (second program creation means) 122 converts the contents of each translation dictionary in the language-specific translation dictionary group 121 into program logic, and converts the reading kana of map annotation data into a predetermined language. Create a language-specific translation program. The language-specific translation program created by the language-specific translation program creation unit 122 is stored in the language-specific translation program storage unit 123. The language-specific translation program storage unit 123 is formed in a predetermined storage area of the storage device 102, for example.

言語別翻訳プログラム作成部１２２により作成される言語別翻訳プログラムのプログラムロジック自体は、図３に示した読み仮名変換プログラムと同様のものである。すなわち、言語別翻訳プログラム作成部１２２により作成される言語別翻訳プログラムは、エリアコードと属性番号が一致し、且つ、読み仮名など（この例では地図注記の読み仮名など）が一致したら、当該読み仮名などと対応する所定の言語の文字列（翻訳データ）を出力するものである。 The program logic of the language-specific translation program created by the language-specific translation program creation unit 122 is the same as that of the reading-kana conversion program shown in FIG. In other words, the language-specific translation program created by the language-specific translation program creation unit 122 has the same area code and attribute number, and if the reading kana and the like (in this example, the reading kana of a map note) match, A character string (translation data) in a predetermined language corresponding to a kana or the like is output.

実際には、言語別翻訳辞書群１２１は、上述したように複数の翻訳辞書を備えているため、言語別翻訳プログラム作成部１２２により作成される言語別翻訳プログラムは、各翻訳辞書に対応したプログラムロジック部を有する。より具体的には、言語別翻訳プログラムは、初期処理部、ユーザー辞書提供部、一般名詞接尾語辞書（後方一致）ロジック部、一般名詞接頭語辞書（前方一致）ロジック部、一般名詞辞書（部分一致）ロジック部、カタカナ語固有名詞接尾語辞書（後方一致）ロジック部、カタカナ語固有名詞接頭語辞書（前方一致）ロジック部、カタカナ語固有有名詞辞書（部分一致）ロジック部、カタカナ語一般名詞接尾語辞書（後方一致）ロジック部、カタカナ語一般名詞接頭語辞書（前方一致）ロジック部、カタカナ語一般有名詞辞書（部分一致）ロジック部、例外処理部、終了処理部などの各部からなる。 Actually, since the language-specific translation dictionary group 121 includes a plurality of translation dictionaries as described above, the language-specific translation program created by the language-specific translation program creation unit 122 is a program corresponding to each translation dictionary. It has a logic part. More specifically, the language-specific translation program includes an initial processing unit, a user dictionary providing unit, a general noun suffix dictionary (backward match) logic unit, a general noun prefix dictionary (forward match) logic unit, a general noun dictionary (part) Match) Logic part, Katakana proper noun suffix dictionary (backward match) logic part, Katakana proper noun prefix dictionary (forward match) logic part, Katakana proper noun dictionary (partial match) logic part, Katakana general noun It consists of a suffix dictionary (backward match) logic unit, a katakana general noun prefix dictionary (forward match) logic unit, a katakana general noun dictionary (partial match) logic unit, an exception processing unit, and an end processing unit.

言語別翻訳プログラム作成部１２２は、上記の各ロジック部の作成（生成）時には、（Ａ）処理対象の翻訳辞書から同じエリアコードと属性番号の辞書データ（エリアコード、属性番号、名詞句、翻訳データ）を抽出する。そして、（Ｂ）言語別翻訳プログラム作成部１２２は、抽出した辞書データを名詞句の文字数の多いもの順に並べ変える。最長一致により翻訳データの抽出を可能にするためである。当該（Ａ）、（Ｂ）の処理を処理対象の辞書から同じエリアコードと属性番号の辞書データが無くなるまで繰り返し、無くなれば、次のエリアコードと属性番号を有する辞書データの処理に移る。 When creating (generating) each logic unit described above, the language-specific translation program creating unit 122 (A) dictionary data of the same area code and attribute number (area code, attribute number, noun phrase, translation) from the translation dictionary to be processed Data). Then, (B) the language-specific translation program creation unit 122 rearranges the extracted dictionary data in descending order of the number of characters in the noun phrase. This is because translation data can be extracted by the longest match. The processes (A) and (B) are repeated until there is no dictionary data with the same area code and attribute number from the dictionary to be processed. When there is no more dictionary data, the process moves to processing of dictionary data having the next area code and attribute number.

このようにして、処理対処の全ての辞書データについて処理が完了すると、当該処理対処の翻訳辞書に対応するロジック部が作成（生成）できる。そして、言語別翻訳辞書群１２１の各翻訳辞書について、プログラムロジックを作成することにより、前方一致、後方一致、部分一致で用いられる各翻訳辞書を最長一致法で読み仮名を抽出する言語別翻訳プログラムロジックに変換できる。 In this way, when processing is completed for all dictionary data to be processed, a logic unit corresponding to the translation dictionary to be processed can be created (generated). Then, by creating a program logic for each translation dictionary in the language-specific translation dictionary group 121, each translation dictionary used for forward matching, backward matching, and partial matching is read by the longest matching method to extract a kana. Can be converted to logic.

翻訳対象データファイル（図１では、翻訳対象データＦと記載。）１３１は、翻訳対象となる多数の地図注記データを保持する。具体的に、翻訳対象データファイル１３１には、「エリアコード、属性番号、地図注記データ」からなる翻訳対象データが多数保持されている。この翻訳対象データファイル１３１には、例えば、図示しない外部インターフェイスなどを通じて外部機器から提供された翻訳対象データなどが格納される。 A translation target data file (described as translation target data F in FIG. 1) 131 holds a large number of map note data to be translated. Specifically, the translation target data file 131 holds a large number of translation target data composed of “area code, attribute number, map annotation data”. The translation target data file 131 stores, for example, translation target data provided from an external device through an external interface (not shown).

読み仮名付与プログラム実行部１３２は、読み仮名付与プログラム作成部１１２により作成され、読み仮名付与プログラム格納部１１３に格納されている読み仮名付与プログラムを読み出して実行する。これにより、翻訳対象データファイル１３１に格納されている翻訳対象データの地図注記データに対して、実行された読み仮名付与プログラムにより読み仮名が付与される。 The reading pseudonym assignment program execution unit 132 reads and executes the reading pseudonym assignment program created by the reading pseudonym assignment program creation unit 112 and stored in the reading pseudonym assignment program storage unit 113. As a result, a reading pseudonym is assigned to the map annotation data of the translation target data stored in the translation target data file 131 by the executed reading pseudonym assignment program.

このようにして、翻訳対象データに対して読み仮名が付与されて形成された読み仮名データは、読み仮名データファイル（図１では、読み仮名データＦと記載。）１３３に記録される。当該読み仮名データは、「エリアコード、属性番号、地図注記データ、対応する読み仮名」からなるものである。読み仮名データファイル１３３は、ハードディスクなどの大容量記録媒体に作成される。 In this way, the reading kana data formed by adding the reading kana to the translation target data is recorded in the reading kana data file (described as reading kana data F in FIG. 1) 133. The reading kana data consists of “area code, attribute number, map note data, corresponding reading kana”. The reading kana data file 133 is created on a large-capacity recording medium such as a hard disk.

言語別翻訳プログラム実行部１３４は、言語別翻訳プログラム作成部１２２により作成され、言語別翻訳プログラム格納部１２３に格納されている言語別翻訳プログラムを読み出して実行する。これにより、読み仮名データファイル１３３に格納されている各読み仮名データの読み仮名が、実行された言語別翻訳プログラムにより所定の言語に翻訳され、読み仮名データに対して、翻訳データが付加された言語別翻訳データが形成される。 The language-specific translation program execution unit 134 reads and executes the language-specific translation program created by the language-specific translation program creation unit 122 and stored in the language-specific translation program storage unit 123. As a result, the reading kana of each reading kana data stored in the reading kana data file 133 is translated into a predetermined language by the executed language-specific translation program, and the translation data is added to the reading kana data. Language-specific translation data is formed.

言語別翻訳プログラム実行部１３４により実行された言語別翻訳プログラムにより作成された言語別翻訳データは、言語別翻訳データファイル１３５に記録される。当該言語別翻訳データは、「エリアコード、属性情報、地図注記データ、対応する読み仮名、所定の言語の文字列（翻訳データ）」からなるものである。言語別翻訳データファイル１３５は、ハードディスクなどの大容量記録媒体に作成される。 The language-specific translation data created by the language-specific translation program executed by the language-specific translation program execution unit 134 is recorded in the language-specific translation data file 135. The language-specific translation data includes “area code, attribute information, map note data, corresponding reading pseudonym, character string (translation data) in a predetermined language”. The language-specific translation data file 135 is created on a large-capacity recording medium such as a hard disk.

このように、この実施の形態の翻訳装置は、読み仮名辞書をプログラムロジックに変換して、読み仮名付与プログラムを作成し、この読み仮名付与プログラムを用いて、翻訳対象の地図注記データに対して読み仮名を付与する。また、この実施の形態の翻訳装置は、言語別の翻訳辞書をプログラムロジックに変換して、言語別翻訳プログラムを作成し、この言語別翻訳プログラムを用いて、地図注記データに対して付与された読み仮名を目的とする言語に翻訳する。 Thus, the translation apparatus of this embodiment converts the reading kana dictionary into program logic, creates a reading kana giving program, and uses this reading kana giving program to map note data to be translated. Give a reading. In addition, the translation device of this embodiment converts a language-specific translation dictionary into program logic, creates a language-specific translation program, and uses this language-specific translation program to give to map annotation data Translate the reading to the target language.

このように、使用される種々の辞書は、プログラムロジックに変換され、プログラムとして機能するので、種々の辞書を参照することが無く、プログラムを通じて、地図注記に対して読み仮名を付与し、この読み仮名が所定の言語に翻訳される構成になっている。これにより、外部の辞書を参照しないので、大量の地図注記を高速に目的とする言語に翻訳できる。 In this way, the various dictionaries used are converted into program logic and function as a program. Therefore, the reading dictionary is assigned to the map annotation through the program without referring to the various dictionaries. The kana is translated into a predetermined language. Thereby, since an external dictionary is not referred to, a large amount of map notes can be translated into a target language at high speed.

［翻訳装置の動作の具体例］
次に、図１を用いて説明したこの実施の形態の翻訳装置における翻訳処理の具体例について説明する。この実施の形態の翻訳装置は、大量の地図注記を超高速に翻訳できるものであるが、その処理内容を簡単に説明するため、１件分の入力データがどのように処理されるのかを具体的に説明する。 [Specific example of translation device operation]
Next, a specific example of translation processing in the translation apparatus of this embodiment described with reference to FIG. 1 will be described. The translation device of this embodiment is capable of translating a large amount of map notes at an ultra-high speed, but in order to explain the processing contents in a simple manner, it is necessary to specifically describe how one input data is processed. I will explain it.

図４は、図１に示した翻訳装置において、エリアコードが「２３（愛知県）」で、属性番号が「１３８（地名）」で、地図注記が「小碓通」である入力データを処理する場合の例を説明するための図である。図４に示すように、「２３（エリアコード），１３８（属性番号），小碓通（地図注記）」である入力データが翻訳対象データファイル１３１に用意されている（ステップＳ１）。 FIG. 4 shows a case where the translation apparatus shown in FIG. 1 processes input data with an area code “23 (Aichi Prefecture)”, an attribute number “138 (place name)”, and a map note “Kominato”. It is a figure for demonstrating the example of. As shown in FIG. 4, input data “23 (area code), 138 (attribute number), Kosugetsu (map note)” is prepared in the translation object data file 131 (step S1).

そして、読み仮名辞書がプログラムロジックに変換されて形成された読み仮名付与プログラムを、読み仮名付与プログラム実行部１３２が実行する（ステップＳ２）。当該読み仮名付与プログラムは、翻訳対象データファイル１３１の当該入力データを読み出して、当該入力データ中の地図注記データに対応する読み仮名を付与した読み仮名データを形成し、これを読み仮名データファイル１３３に記録する（ステップＳ３）。 Then, the reading kana giving program execution unit 132 executes the reading kana giving program formed by converting the reading kana dictionary into the program logic (step S2). The reading pseudonym assigning program reads the input data of the translation object data file 131 to form reading pseudonym data to which the reading pseudonym corresponding to the map annotation data in the input data is assigned. (Step S3).

ステップＳ３で読み仮名データファイル１３３に記録される読み仮名データは、図４のステップＳ３に示したように、入力データに対して、更に読み仮名「こうすどおり」が付与されたものである。この読み仮名データは、どの言語に翻訳する場合にも共通に用いられる。すなわち、読み仮名データファイル１３３の読み仮名データは、各言語の翻訳プログラムによって共通に用いられる。 The reading kana data recorded in the reading kana data file 133 in step S3 is obtained by further adding the reading kana “Kodosori” to the input data as shown in step S3 of FIG. This kana data is commonly used when translating into any language. That is, the reading kana data in the reading kana data file 133 is commonly used by the translation programs of the respective languages.

次に、言語別翻訳辞書がプログラムロジックに変換されて形成された言語別翻訳プログラムを、言語別翻訳プログラム実行部１３４が実行する（ステップＳ４）。当該言語別翻訳プログラムは、読み仮名データファイル１３３の当該読み仮名データを読み出して、当該読み仮名データ中の読み仮名に対応する所定の言語の文字列（翻訳データ）を付与した言語別翻訳データを形成し、これを言語別翻訳データファイル１３５に記録する（ステップＳ５）。 Next, the language-specific translation program formed by converting the language-specific translation dictionary into the program logic is executed by the language-specific translation program execution unit 134 (step S4). The language-specific translation program reads the kana data in the reading kana data file 133 and reads the language-specific translation data to which a character string (translation data) of a predetermined language corresponding to the reading kana in the reading kana data is given. This is formed and recorded in the language-specific translation data file 135 (step S5).

ステップＳ５で言語別翻訳データファイル１３５に記録される翻訳データは、図４のステップＳ５に示したように、読み仮名データに対して、更に各言語の翻訳データが付与されたものである。この実施の形態においては、中国語（簡体）、中国語（繁体）、英語、韓国語の４言語に翻訳されるため、各言語の翻訳プログラムにより、図４のステップＳ５に示したように、４言語のそれぞれの言語別の翻訳データが形成される。 The translation data recorded in the language-specific translation data file 135 in step S5 is obtained by further adding translation data of each language to the reading kana data as shown in step S5 of FIG. In this embodiment, since it is translated into four languages of Chinese (simplified), Chinese (traditional), English, and Korean, as shown in step S5 of FIG. Translation data for each of the four languages is formed.

これに対して、図５は、従来の翻訳装置における翻訳処理の概要を説明するための図である。図５に示すように、従来の翻訳装置では、図４に示した例と同様の入力データを翻訳する場合、言語別翻訳プログラムが、まず、翻訳対象の文字列（この例の場合には地図注記）を形態素解析などの手法を用いて意味のある文字列単位に分解する。そして、この分解した各文字列（あるいは文字）を、翻訳辞書を用いて翻訳する。 On the other hand, FIG. 5 is a diagram for explaining an outline of translation processing in a conventional translation apparatus. As shown in FIG. 5, in the conventional translation apparatus, when the input data similar to the example shown in FIG. 4 is translated, the language-specific translation program first converts the character string to be translated (in this example, the map Note) is decomposed into meaningful character string units using techniques such as morphological analysis. Then, each decomposed character string (or character) is translated using a translation dictionary.

この従来の翻訳装置の場合には、形態素解析の段階や翻訳処理の段階で種々の辞書（データベース）を参照する必要がある。このように種々に辞書にアクセスする分の時間は、翻訳処理に掛かる時間の多くの部分を占めており、翻訳処理は時間の掛かるものとなっていた。 In the case of this conventional translation apparatus, it is necessary to refer to various dictionaries (databases) at the stage of morphological analysis and the stage of translation processing. As described above, the time required to access the dictionary in various ways occupies a large part of the time required for the translation process, and the translation process takes time.

しかし、図４と図５とを比較しても分かるように、この出願の翻訳装置の場合には、外部の辞書（データベース）を全く参照しないため、翻訳処理の高速化が実現できる。また、地図注記に読み仮名を付与することで、正しい読み仮名に応じた翻訳が可能となる。また、地図注記データに対して読み仮名が付与されて形成された読み仮名データは、各言語の翻訳プログラムで共通に使用される。このため、各言語の翻訳プログラムにおいて、地図注記に読み仮名を付与する処理は行わなくてもよい。 However, as can be seen from a comparison between FIG. 4 and FIG. 5, in the translation apparatus of this application, since an external dictionary (database) is not referred to at all, the translation process can be speeded up. Moreover, by assigning a reading kana to a map note, translation according to the correct reading kana becomes possible. Further, the reading kana data formed by adding the reading kana to the map annotation data is commonly used in the translation programs of the respective languages. For this reason, in the translation program of each language, it is not necessary to perform the process of giving a reading pseudonym to a map note.

［実施の形態の翻訳装置で行われる処理のまとめ］
この実施の形態の翻訳装置で行われる処理は、読み仮名付与プログラムと言語別翻訳プログラムを作成する事前準備処理と、作成された読み仮名付与プログラムと言語別翻訳プログラムを用いて地図注記を翻訳する翻訳処理とに大きく分けることができる。事前準備処理は、基になる読み仮名辞書や言語別翻訳辞書が変更されない限り、繰り返し行う必要は無い。また、この実施の形態の翻訳装置において行われる翻訳処理は、まず、地図注記データに対して読み仮名を付与して読み仮名データを形成する処理と、形成された読み仮名データの読み仮名を翻訳する処理とに分けられる。以下においては、この実施の形態の翻訳装置で行われる、事前準備処理と翻訳処理のそれぞれについて、具体的に説明する。 [Summary of Processing Performed by Translation Device of Embodiment]
The processing performed by the translation apparatus of this embodiment is a preparatory process for creating a reading pseudonym assignment program and a language-specific translation program, and a map annotation is translated using the created reading pseudonym assignment program and language-specific translation program. It can be broadly divided into translation processing. The pre-preparation process does not need to be repeated unless the base reading kana dictionary or language-specific translation dictionary is changed. In addition, the translation processing performed in the translation apparatus of this embodiment is performed by first assigning reading kana to map annotation data to form reading kana data, and translating the reading kana of the formed reading kana data. It is divided into processing to do. In the following, each of the preliminary preparation process and the translation process performed by the translation apparatus of this embodiment will be described in detail.

［事前準備処理］
図６は、この実施の形態の翻訳装置で行われる事前準備処理を説明するためのフローチャートである。事前準備処理は、図６に示すように、まず、読み仮名付与プログラム作成部１１２が、読み仮名辞書群１１１の各読み仮名辞書のぞれぞれの内容をプログラムロジックに変換して、読み仮名付与プログラムを作成する（ステップＳ２１）。ステップＳ２１の処理では、前方一致、後方一致、部分一致で使用される読み仮名辞書のそれぞれが、最長一致法で読み仮名を置換するプログラムロジックに変換される。 [Preparation process]
FIG. 6 is a flowchart for explaining pre-preparation processing performed in the translation apparatus of this embodiment. As shown in FIG. 6, in the preparatory process, first, the reading kana giving program creation unit 112 converts the contents of each reading kana dictionary in the reading kana dictionary group 111 into program logic, and reads the reading kana. A grant program is created (step S21). In the process of step S21, each of the reading kana dictionaries used for the forward match, the backward match, and the partial match is converted into a program logic that replaces the reading kana by the longest match method.

次に、言語別翻訳プログラム作成部１２２が、言語別翻訳辞書群１２１の各翻訳辞書の内容をプログラムロジックに変換して、言語別翻訳プログラムを作成する（ステップＳ２２）。ステップＳ２２の処理でも、前方一致、後方一致、部分一致で使用される翻訳辞書のそれぞれが、最長一致法で対象翻訳言語に置換するプログラムロジックに変換される。 Next, the language-specific translation program creation unit 122 converts the contents of each translation dictionary in the language-specific translation dictionary group 121 into program logic to create a language-specific translation program (step S22). Also in the process of step S22, each of the translation dictionaries used for the forward match, the backward match, and the partial match is converted into a program logic that is replaced with the target translation language by the longest match method.

これら２つの処理を通じて、読み仮名付与プログラム格納部１１３には読み仮名付与プログラムが格納され、言語別翻訳プログラム格納部１２３には、各言語別の翻訳プログラムが格納され、地図注記の翻訳処理の準備が整えられる。なお、言語別翻訳プログラム格納部１２３には、言語別の複数の翻訳プログラムが格納されることになる。 Through these two processes, the reading pseudonym assignment program storage unit 113 stores the reading pseudonym assignment program, the language-specific translation program storage unit 123 stores the translation program for each language, and prepares the map annotation translation processing. Is arranged. The language-specific translation program storage unit 123 stores a plurality of language-specific translation programs.

このように、図６に示した処理を通じて、翻訳処理前の事前準備ができる。そして、上述もしたように、図６に示す事前準備は、読み仮名辞書群１１１の読み仮名辞書に変更が生じたり、言語別翻訳辞書群１２１の翻訳辞書に変更が生じたりした場合に実行すればよい。もちろん、読み仮名辞書群１１１の読み仮名辞書だけに変更が生じた場合には、図６に示したステップＳ２１の処理だけを行えばよい。また、言語別翻訳辞書群１２１の翻訳辞書だけに変更が生じた場合には、図６に示したステップＳ２２の処理だけを行えばよい。 In this way, advance preparation before translation processing can be performed through the processing shown in FIG. Then, as described above, the advance preparation shown in FIG. 6 is executed when a change occurs in the reading kana dictionary of the reading kana dictionary group 111 or a change occurs in the translation dictionary of the language-specific translation dictionary group 121. That's fine. Of course, when only the reading kana dictionary of the reading kana dictionary group 111 is changed, only the process of step S21 shown in FIG. Further, when only the translation dictionary in the language-specific translation dictionary group 121 is changed, only the process of step S22 shown in FIG.

［翻訳処理］
図７は、この実施の形態の翻訳装置で行われる翻訳処理を説明するフローチャートである。当該翻訳処理は、図７に示すように、読み仮名付与処理（図７（Ａ））と、言語別翻訳処理（図７（Ｂ））からなる。読み仮名付与処理（図７（Ａ））は、読み仮名付与プログラム実行部１３２において実行される読み仮名付与プログラムによる処理である。言語別翻訳処理（図７（Ｂ））は、言語別翻訳プログラム実行部１３４において実行される言語別翻訳プログラムによる処理である。 [Translation processing]
FIG. 7 is a flowchart for explaining a translation process performed by the translation apparatus of this embodiment. As shown in FIG. 7, the translation process includes a reading pseudonym assignment process (FIG. 7A) and a language-specific translation process (FIG. 7B). The reading pseudonym assignment process (FIG. 7A) is a process performed by the reading pseudonym assignment program executed by the reading pseudonym assignment program execution unit 132. The language-specific translation process (FIG. 7B) is a process performed by the language-specific translation program executed by the language-specific translation program execution unit 134.

当該翻訳処理では、まず、読み仮名付与プログラム実行部１３２が読み仮名付与プログラムを実行し、翻訳対象データファイル１３１をインプットファイルとし、読み仮名データファイル１３３をアウトプットファイルとして、図７（Ａ）に示す読み仮名付与処理を行う。当該読み仮名付与処理において、読み仮名付与プログラム実行部１３２は、翻訳対象データファイル１３１から翻訳対象データを順次に読み出す（ステップＳ３１）。読み仮名付与プログラム実行部１３２は、翻訳対象データが読み出せたか否か（全ての翻訳対象データの読み出しが終了したか否か）を判別する（ステップＳ３２）。 In the translation process, first, the reading pseudonym assignment program execution unit 132 executes the reading pseudonym assignment program, the translation object data file 131 is set as an input file, and the reading pseudonym data file 133 is set as an output file as shown in FIG. The kana adding process shown is performed. In the reading pseudonym assignment process, the reading pseudonym assignment program execution unit 132 sequentially reads the translation target data from the translation target data file 131 (step S31). The reading pseudonym assignment program execution unit 132 determines whether the translation target data has been read (whether reading of all the translation target data has been completed) (step S32).

ステップＳ３２の判別処理において、翻訳対象データが読み出せたと判別したとする。この場合、読み仮名付与プログラム実行部１３２は、その読み出した翻訳対象データの地図注記データに対して読み仮名を付与して読み仮名データを形成し、これを読み仮名データファイル１３３に書き込む処理を行う（ステップＳ３３）。この後、ステップＳ３１からの処理を繰り返す。 Assume that it is determined in the determination process of step S32 that the data to be translated has been read. In this case, the reading pseudonym assignment program execution unit 132 forms a reading pseudonym data by assigning a reading pseudonym to the read map annotation data of the data to be translated, and performs a process of writing this into the reading pseudonym data file 133. (Step S33). Thereafter, the processing from step S31 is repeated.

ステップＳ３２の判別処理において、全ての翻訳対処データの読み出しが終了したと判別したときには、図７（Ａ）に示す読み仮名付与処理を終了する。 If it is determined in step S32 that the reading of all translation handling data has been completed, the reading pseudonym assignment process shown in FIG.

当該読み仮名付与プログラムによる読み仮名付与処理が終了すると、言語別翻訳プログラム実行部１３４が言語別翻訳プログラムを実行し、読み仮名データファイル１３３をインプットファイルとし、言語別翻訳データファイル１３５をアウトプットファイルとして、図７（Ｂ）に示す言語別翻訳処理を行う。当該言語別翻訳処理において、言語別翻訳プログラム実行部１３４は、読み仮名データファイル１３３から読み仮名データを順次に読み出す（ステップＳ４１）。言語別翻訳プログラム実行部１３４は、読み仮名データが読み出せたか否か（全ての読み仮名データの読み出しが終了したか否か）を判別する（ステップＳ４２）。 When the reading pseudonym assignment process by the reading pseudonym assignment program is completed, the language-specific translation program execution unit 134 executes the language-specific translation program, the reading pseudonym data file 133 is used as an input file, and the language-specific translation data file 135 is output as an output file. Then, the language-specific translation processing shown in FIG. In the language-specific translation process, the language-specific translation program execution unit 134 sequentially reads the reading kana data from the reading kana data file 133 (step S41). The language-specific translation program execution unit 134 determines whether reading kana data has been read (whether reading of all reading kana data has been completed) (step S42).

ステップＳ４２の判別処理において、読み仮名データが読み出せたと判別したとする。この場合、言語別翻訳プログラム実行部１３４は、その読み出した読み仮名データの読み仮名に対する所定の言語の文字列（翻訳データ）を付与して言語別翻訳データを形成し、これを言語別翻訳データファイル１３５に書き込む処理を行う（ステップＳ４３）。この後、ステップＳ４１からの処理を繰り返す。 Assume that it is determined in the determination process of step S42 that the reading kana data has been read. In this case, the language-specific translation program execution unit 134 assigns a character string (translation data) of a predetermined language to the reading kana of the read kana data and forms language-specific translation data, which is converted into the language-specific translation data. A process of writing to the file 135 is performed (step S43). Thereafter, the processing from step S41 is repeated.

ステップＳ４２の判別処理において、読み仮名データが読み出せなかった（全ての読み仮名データの読み出しが終了した）と判別したときには、図７（Ｂ）に示す言語別翻訳与処理を終了する。なお、この図７（Ｂ）に示す処理は、言語別の翻訳プログラムによって言語別に行われる処理である。 If it is determined in the determination process of step S42 that the reading kana data cannot be read (the reading of all reading kana data has been completed), the language-specific translation processing shown in FIG. The process shown in FIG. 7B is a process performed for each language by a language-specific translation program.

このように、読み仮名付与処理（図７（Ａ））と言語別翻訳処理（図７（Ｂ））の処理を通じて翻訳処理が行われる。そして、読み仮名付与処理と言語別翻訳処理のいずれにおいても、インプットファイルとアウトプットファイルが存在するだけで、参照データベースは一切存在しない。すなわち、読み仮名辞書や翻訳辞書は、プログラムロジックに変換されて、それぞれ、読み仮名付与プログラム、言語別翻訳プログラムとされている。このため、地図注記に対する読み仮名の付与は、読み仮名付与プログラムの中だけで完結するように処理され、付与された読み仮名の所定の言語への翻訳は、言語別翻訳プログラムの中だけで完結するように処理される
このようにして、この実施の形態の翻訳装置では、大量の地図注記を極めて高速に翻訳することを実現している。 As described above, the translation process is performed through the reading pseudonym assignment process (FIG. 7A) and the language-specific translation process (FIG. 7B). In both of the reading pseudonym assignment process and the language-specific translation process, only the input file and the output file exist, and no reference database exists. That is, the reading kana dictionary and the translation dictionary are converted into program logic, which are a reading kana giving program and a language-specific translation program, respectively. For this reason, the assignment of a reading kana to a map note is processed so as to be completed only in the reading kana giving program, and the translation of the given reading kana to a predetermined language is completed only in the language-specific translation program. In this way, the translation apparatus according to this embodiment realizes extremely high-speed translation of a large number of map annotations.

［実施の形態の効果］
上述した実施の形態の翻訳装置によれば、大量に存在する地図注記を、従来の機械翻訳装置に比べて飛躍的に速く、且つ、正確に翻訳できる。 [Effect of the embodiment]
According to the translation apparatus of the embodiment described above, a large amount of map notes can be translated significantly faster and more accurately than conventional machine translation apparatuses.

また、地図注記について読み仮名を付与し、この付与した読み仮名を翻訳する構成としているので、実際の読み方に即した翻訳ができる。不自然な翻訳となることが無い。 Moreover, since the reading kana is given to the map note, and the given reading kana is translated, the translation according to the actual reading can be performed. There is no unnatural translation.

また、読み仮名が付与されて形成された読み仮名データは、各言語の翻訳プログラムで共通に使用できる。つまり、地図注記を複数の言語に翻訳する場合であっても、各言語の翻訳プログラムなどで重複して読み仮名データを形成する処理を行わなくても済む。 Moreover, the reading kana data formed by adding the reading kana can be commonly used in the translation programs of the respective languages. That is, even when the map note is translated into a plurality of languages, it is not necessary to perform the process of forming the reading kana data redundantly by the translation program of each language.

そして、数百万件の地図注記を翻訳する場合、従来は翻訳に数週間かかっていたが、これを数時間に短縮することができた。各言語の翻訳プログラムによる処理は、それぞれ数十分の処理時間を実現している。 And when translating millions of map notes, it used to take weeks to translate, but it could be reduced to hours. Each language processing program realizes several tens of minutes of processing time.

［変形例など］
なお、上述した実施の形態では、地図注記データを中国語（簡体）、中国語（繁体）、英語、韓国語の４言語に翻訳するものとして説明したが、これに限るものではない。この他の種々の言語に翻訳することももちろんできる。翻訳対象の言語の組み合わせは種々の組み合わせとすることができる。もちろん、単一の言語に翻訳する場合にも対応できる。 [Modifications]
In the above-described embodiment, the map annotation data is described as being translated into four languages of Chinese (simplified), Chinese (traditional), English, and Korean. However, the present invention is not limited to this. Of course, it can be translated into other various languages. The combination of languages to be translated can be various combinations. Of course, it is also possible to translate into a single language.

また、上述した実施の形態では、地図注記データを翻訳する場合を例にして説明したが、これに限るものではない。例えば、電話帳リスト、顧客リスト、住所リストといった多数の名前、住所などの名詞句の羅列が含まれるものの当該多数の名詞句の羅列を翻訳する場合に、この発明を適用できる。 In the above-described embodiment, the case where the map annotation data is translated has been described as an example. However, the present invention is not limited to this. For example, the present invention can be applied to a case where a large number of noun phrases such as a phone book list, a customer list, and an address list are included, but a large number of noun phrases are translated.

また、上述した実施の形態では、読み仮名に対して翻訳を行うものとして説明した。しかし、これに限るものではない。例えば、ひらがな、カタカナ、漢字、アルファベット、数字、記号からなる翻訳対象そのものと作成した読み仮名とを比較しながら、より適切に翻訳を行うようにすることができる。 Further, in the above-described embodiment, it has been described that translation is performed on a reading pseudonym. However, it is not limited to this. For example, it is possible to perform translation more appropriately while comparing the translation target itself made up of hiragana, katakana, kanji, alphabet, numbers, and symbols with the created kana.

［その他］
上述した実施の形態の説明からも分かるように、請求項に記載した翻訳装置の読み仮名辞書は、実施の形態の翻訳装置の読み仮名辞書群の各読み仮名辞書に対応し、また、請求項に記載した翻訳装置の翻訳辞書は、実施の形態の翻訳装置の言語別翻訳辞書群の各翻訳辞書に対応している。また、請求項に記載した翻訳装置の第１のプログラム作成手段は、実施の形態の翻訳装置の読み仮名付与プログラム作成部１１２に対応し、請求項に記載した翻訳装置の第２のプログラム作成手段は、この実施の形態の翻訳装置の言語別翻訳プログラム作成部１２２に対応している。 [Others]
As can be understood from the above description of the embodiment, the reading device's reading kana dictionary corresponds to each reading device dictionary in the reading device dictionary group of the translation device according to the embodiment. The translation dictionary of the translation device described in the above corresponds to each translation dictionary in the language-specific translation dictionary group of the translation device of the embodiment. The first program creation means of the translation device recited in the claims corresponds to the reading pseudonym assignment program creation unit 112 of the translation device according to the embodiment, and the second program creation means of the translation device according to the claims. Corresponds to the language-specific translation program creation unit 122 of the translation apparatus of this embodiment.

また、請求項に記載した翻訳装置の読み仮名付与手段は、この実施の形態の翻訳装置の読み仮名付与プログラム実行部１３２に対応し、また、請求項に記載した翻訳装置の翻訳手段は、この実施の形態の翻訳装置の言語別翻訳プログラム実行部１３４に対応している。 Further, the reading device of the translation device described in the claims corresponds to the reading device of the translation device of the translation device of this embodiment, and the translation device of the translation device described in the claim This corresponds to the language-specific translation program execution unit 134 of the translation apparatus according to the embodiment.

また、図５、図６を用いて説明した翻訳処理は、この発明の翻訳方法の一実施の形態が適用されたものであり、図５、図６を用いて説明した翻訳処理を実行するプログラムは、この発明の翻訳プログラムの一実施の形態が適用されたものである。また、図１に示した読み仮名付与プログラム作成部１１２、言語別翻訳プログラム作成部１２２、読み仮名付与プログラム実行部１３２、言語別翻訳プログラム実行部１３４の各機能は、制御部１０１で実行されるプログラムにより、制御部１０１の機能として実現することもできる。 Moreover, the translation processing explained using FIG. 5 and FIG. 6 is one to which an embodiment of the translation method of the present invention is applied, and a program for executing the translation processing explained using FIG. 5 and FIG. Is applied with an embodiment of the translation program of the present invention. The functions of the reading-kana assignment program creating unit 112, the language-specific translation program creating unit 122, the reading-kana adding program execution unit 132, and the language-specific translation program execution unit 134 shown in FIG. It can also be realized as a function of the control unit 101 by a program.

１０１…制御部、１０２…記憶装置、１０３…操作部、１１１…読み仮名辞書群、１１２…読み仮名付与プログラム作成部、１１３…読み仮名付与プログラム格納部、１２１…言語別翻訳辞書群、１２２…言語別翻訳プログラム作成部、１２３…言語別翻訳プログラム格納部、１３１…翻訳対象データファイル、１３２…読み仮名付与プログラム実行部、１３３…読み仮名データファイル、１３４…言語別翻訳プログラム実行部、１３５…言語別翻訳データファイル DESCRIPTION OF SYMBOLS 101 ... Control part, 102 ... Memory | storage device, 103 ... Operation part, 111 ... Reading kana dictionary group, 112 ... Reading kana provision program creation part, 113 ... Reading kana provision program storage part, 121 ... Translation dictionary group according to language, 122 ... Language-specific translation program creation unit, 123... Language-specific translation program storage unit, 131... Translation target data file, 132 ... reading-kana adding program execution unit, 133 ... reading-kana data file, 134. Translation data file by language

Claims

A phonetic dictionary that associates strings with their phonetic kana,
First program creating means for converting a content of the reading-reading kana dictionary into program logic and creating a reading-kana adding program for adding a reading kana to an input character string;
A translation dictionary associating a reading kana with a character string of a predetermined language corresponding to the reading kana,
Second program creating means for converting the contents of the translation dictionary into program logic and creating a translation program for translating the input reading pseudonym into a predetermined language;
A reading pseudonym giving unit that executes the reading pseudonym giving program created by the first program creation unit and gives a reading pseudonym to a character string to be translated;
A translation apparatus comprising: a translation unit that executes the translation program created by the second program creation unit and translates the reading pseudonym assigned by the reading pseudonym granting unit into a target language.

The translation device according to claim 1,
The translation apparatus characterized in that the second program creation means can convert the program logic into a program logic that matches a predetermined translation condition.

A translation device according to claim 1 or 2, wherein
2. The translation apparatus according to claim 1, wherein the reading kana dictionary and the translation dictionary are made up of a plurality of dictionaries in which search word matching methods are different depending on forward matching, backward matching, and partial matching.

The translation device according to claim 3,
The first program creation means converts each of the reading kana dictionary used for forward matching, backward matching, and partial matching into a program logic for replacing the reading kana by the longest matching method,
The second program creation means converts each of the translation dictionaries used for forward matching, backward matching, and partial matching into a program logic that replaces the target translation language with the longest matching method. Translation device to do.

A translation device according to any one of claims 1, 2, 3, or 4,
The character string to be translated is related to a place, and in the reading kana conversion dictionary, corresponding to each character string, area information indicating a region to which the character string belongs and an attribute indicating an attribute of the character string Information is added,
The translation apparatus characterized in that the first program creation means converts the contents of the reading-reading kana conversion dictionary into program logic including the area information and the attribute information.

The first program creating means converts the contents of a reading kana dictionary in which a character string and its reading kana are associated with each other into program logic, and creates a reading kana adding program for adding a reading kana to the input character string. 1 program creation process;
The second program creation means converts the contents of the translation dictionary in which the reading kana and the character string of the predetermined language corresponding to the reading kana are associated with each other into program logic, and translates the input reading kana into the predetermined language A second program creation step for creating a translation program to be performed;
The reading pseudonym assigning step executed by the reading pseudonym assigning unit to give the reading pseudonym to the character string to be translated;
A translation method comprising: a translation step executed by a translation unit for translating the translation program created in the second program creation step and translating the reading pseudonym assigned in the reading pseudonym assignment step into a target language .

A computer mounted on an information processing apparatus comprising a reading kana dictionary that associates a character string with its reading kana, and a translation dictionary that associates the reading kana with a character string of a predetermined language corresponding to the reading kana The
First program creating means for converting a content of the reading-reading kana dictionary into program logic and creating a reading-kana adding program for adding a reading kana to an input character string;
Second program creating means for converting the contents of the translation dictionary into program logic and creating a translation program for translating the input reading pseudonym into a predetermined language;
A reading pseudonym giving unit that executes the reading pseudonym giving program created by the first program creation unit and gives a reading pseudonym to a character string to be translated;
A translation program that executes the translation program created by the second program creation means and functions as a translation means for translating the reading pseudonym given by the reading pseudonym giving means into a target language .