JP2719453B2

JP2719453B2 - Machine translation equipment

Info

Publication number: JP2719453B2
Application number: JP3043477A
Authority: JP
Inventors: 秀造釘宮; 徳行平井; 陽士福持; いち子佐田; 毅九津見
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1991-03-08
Filing date: 1991-03-08
Publication date: 1998-02-25
Anticipated expiration: 2013-02-25
Also published as: JPH04280369A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of industrial application] The present invention relates to a machine translation apparatus.

[0002]

[Prior art] In a conventional machine translation apparatus, a morphological analysis unit uses a predetermined dictionary to divide a source-language sentence entered from a keyboard into morphemes and obtains the part of speech of each morpheme. A syntax analysis unit then analyzes the sentence divided into morphemes, using a predetermined dictionary and predetermined grammar rules, to obtain the syntactic structure of the sentence. A conversion unit converts the syntactic structure obtained by the syntax analysis unit into the syntactic structure of a target-language sentence, and a translation generation unit generates the target-language sentence from the syntactic structure obtained by the conversion unit.

[0003] Machine translation methods are broadly divided into two types according to the translation procedure.

[0004] One is the pivot method, in which the source sentence is analyzed down to a concept that depends on neither the source language nor the target language (called an intermediate language), and the translation is then obtained by proceeding through context generation, semantic generation, syntactic generation, and morphological generation.

[0005] The other is the transfer method, in which analysis is carried out as far as one of morphological analysis, syntactic analysis, semantic analysis, or context analysis to obtain the internal structure of the source sentence, from which the translation is generated.

[0006]

[Problem to be solved by the invention] In the conventional machine translation apparatus described above, when the syntax analysis unit parses the sentence from the parts of speech obtained by morphological analysis, using the dictionary and the grammar rules, it assembles every possible syntactic structure, and the conversion unit and the translation generation unit then produce multiple translations from those structures. The user therefore has to pick the correct translation from among multiple results, which is a tedious selection task, and it takes a long time to reach the final translation.

[0007] In view of the above problems of the conventional machine translation apparatus, an object of the present invention is to provide a machine translation apparatus that makes it possible to obtain a correct translation quickly.

[0008]

[Means for solving the problem] The present invention provides a machine translation apparatus comprising: syntax analysis means for analyzing the syntactic structure of a sentence written in a source language and divided into morphemes; symbol adding means for adding to the sentence a part-of-speech designation symbol that designates the part of speech of a phrase contained in the sentence; symbol detecting means for detecting the added part-of-speech designation symbol; and storage means for counting the number of words contained in the phrase when the part-of-speech designation symbol is detected and storing the part of speech of the phrase designated by the symbol together with that number of words. The syntax analysis means identifies the phrase from the number of words stored in the storage means and analyzes the syntactic structure using the part of speech stored in the storage means as the part of speech of the identified phrase.

[0009]

[Operation] To designate the part of speech of a phrase contained in a sentence written in the source language, a part-of-speech designation symbol is added to the sentence in advance by the symbol adding means. When the source-language sentence is divided into morphemes and the part of speech of each morpheme is obtained, the symbol detecting means, on detecting a part-of-speech designation symbol in the sentence, counts the number of words contained in the phrase, and the storage means stores the part of speech and the number of words of the phrase designated by the symbol. The syntax analysis means identifies the phrase from the stored number of words and analyzes the syntactic structure using the stored part of speech as the part of speech of the identified phrase, and a syntactic structure in the target language is then obtained.

[0010]

[Embodiment] An embodiment of the machine translation apparatus of the present invention is described in detail below with reference to the drawings.

[0011] FIG. 1 is a flowchart showing the operation of the translation module, which is the main part of the machine translation apparatus according to the present invention.

[0012] FIG. 2 shows the functional configuration of the translation module of FIG. 1.

[0013] Functionally, the translation module 10 consists of a morphological analysis unit 11, which divides the source sentence into morphemes using a dictionary and obtains the part of speech of each morpheme; a syntax analysis unit 12, which analyzes the sentence divided into morphemes by the morphological analysis unit 11, using the dictionary and grammar rules, to obtain the syntactic structure of the sentence; a conversion unit 13, which converts the syntactic structure obtained by the syntax analysis unit 12 into a target-language syntactic structure; and a translation generation unit 14, which generates the translation from the syntactic structure obtained by the conversion unit 13.

[0014] FIG. 3 shows the specific configuration of the translation module 10.

[0015] The translation CPU 15 implements each of the functions in FIG. 2 according to a translation program. The memory 16 stores the predetermined translation program and is connected to the translation CPU 15 by the bus 17.

[0016] Buffers A to E store various data at each stage of the translation procedure and are all connected to the bus 17. The dictionary/rule memory 18 referenced by the module 10 is also connected to the bus 17.

[0017] FIG. 4 is a block diagram showing one configuration example of the machine translation apparatus of this embodiment. As shown in the figure, the apparatus consists of a main CPU (central processing unit) 19, a main memory 20, a CRT (cathode-ray tube) 21, a keyboard 22, the translation module 10, and the dictionary/rule memory 18, which is connected to the translation module 10 and stores the translation dictionary, grammar rules, tree-structure conversion rules, and the like. All of these components are connected by the bus 23.

[0018] When a source-language sentence (the source sentence) is input, the translation module 10 translates it and outputs a target-language sentence (the translation).

[0019] When a source sentence is entered from the keyboard 22, the CPU 19 sends it to the translation module 10. On receiving the source sentence, the translation module 10 translates it into the target language, as described later, using the dictionary, grammar rules, tree-structure conversion rules, and the like stored in the dictionary/rule memory 18.

[0020] The translation result is temporarily stored in the main memory 20 and displayed on the CRT 21. FIG. 5 shows the translation procedure in the translation module 10.

[0021] As shown in the figure, the translation procedure in the translation module 10 can be broken down into levels. Given a source sentence, the translation module 10 advances through the procedure: dictionary lookup at level L1, morphological analysis at level L2, syntactic analysis at level L3, and so on.

[0022] Each analysis shown in FIG. 5 is described below.

[0023] (a) Dictionary lookup and morphological analysis (L1, L2): The dictionary stored in the dictionary/rule memory 18 is consulted, the source sentence is divided into a morpheme sequence (word sequence), grammatical information such as the part of speech of each word and its translation equivalents are obtained, and tense, person, number, and the like are analyzed.

[0024] (b) Syntactic analysis (L3): The structure of the sentence, such as the dependency relations between words, is determined as a structural analysis tree.

[0025] (c) Semantic analysis (L4): From the multiple results of syntactic analysis, those that are semantically correct are distinguished from those that are not.

[0026] (d) Context analysis (L5): The topic is understood, and ellipses and ambiguities are resolved.

[0027] The translation module 10 of this embodiment performs transfer-method machine translation and carries out analysis at least up to the syntactic analysis of level L3.
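As a rough illustration of the level structure described in [0021] to [0027], the levels can be modelled as an ordered enumeration. This is only a sketch: the names `Level` and `levels_to_run` are hypothetical, and the patent does not prescribe any particular data structure.

```python
from enum import IntEnum
from typing import List


class Level(IntEnum):
    """Analysis levels L1 to L5 of FIG. 5 (identifier names are illustrative)."""
    DICTIONARY_LOOKUP = 1
    MORPHOLOGICAL_ANALYSIS = 2
    SYNTACTIC_ANALYSIS = 3
    SEMANTIC_ANALYSIS = 4
    CONTEXT_ANALYSIS = 5


def levels_to_run(deepest: Level) -> List[Level]:
    """Return the levels executed, in order, when analysis stops at `deepest`.

    The embodiment analyzes at least up to level L3, so a shallower
    stopping point is rejected.
    """
    if deepest < Level.SYNTACTIC_ANALYSIS:
        raise ValueError("the embodiment analyzes at least up to level L3")
    return [level for level in Level if level <= deepest]
```

Stopping at `Level.SYNTACTIC_ANALYSIS` yields levels L1, L2, and L3 in that order.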

[0028] Next, the operation of translating English into Japanese without designating the part of speech of a phrase is described with reference to FIG. 6.

[0029] A source sentence entered from the keyboard 22 (see FIG. 4) and passed to the translation module 10, for example "This is a pen.", is stored in buffer A in the form shown in FIG. 6. The dictionary-lookup morphological analysis unit 11 (see FIG. 2) uses the dictionary in the dictionary/rule memory 18 to obtain information such as the translation equivalent of each word of the source sentence stored in buffer A, and stores the result in buffer B. The part-of-speech information, which is one part of the information obtained, is stored in buffer B in the form shown in Table 1.

[0030]

[Table 1]

[0031] Here, "this" is a word with more than one possible part of speech, and its part of speech is uniquely determined by the syntax analysis unit 12. Based on the dictionary and grammar rules in the dictionary/rule memory 18, the syntax analysis unit 12 determines the structural analysis tree (FIG. 7) representing the dependency relations between the words and stores it in buffer C.

[0032] Specifically, the structural analysis tree is determined as follows.

[0033] First, the following rules are obtained from the grammar rules in the dictionary/rule memory 18:

sentence → subject, predicate
subject → noun phrase
predicate → verb, noun phrase
noun phrase → pronoun
noun phrase → article, noun

The first rule, for example, states that a sentence consists of a subject and a predicate. The structural analysis tree is determined according to these rules.
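The five rules of [0033] are enough to parse "This is a pen." A minimal sketch of how such rules could drive a parser follows. The backtracking top-down strategy, the function names, and the small `LEXICON` are assumptions made for illustration (Table 1 is not reproduced in the text, and the patent does not state which parsing algorithm unit 12 uses).

```python
# Grammar rules from paragraph [0033]; each symbol maps to its alternatives.
GRAMMAR = {
    "sentence": [["subject", "predicate"]],
    "subject": [["noun_phrase"]],
    "predicate": [["verb", "noun_phrase"]],
    "noun_phrase": [["pronoun"], ["article", "noun"]],
}

# Hypothetical dictionary entries. "this" is given two parts of speech to
# mirror the remark that it is ambiguous until parsing settles it.
LEXICON = {
    "this": {"pronoun", "demonstrative"},
    "is": {"verb"},
    "a": {"article"},
    "pen": {"noun"},
}


def parse(symbol, words, start):
    """Yield (tree, next_index) for every way `symbol` matches words[start:]."""
    if symbol not in GRAMMAR:  # terminal symbol: compare with the lexicon
        if start < len(words) and symbol in LEXICON.get(words[start].lower(), ()):
            yield (symbol, words[start]), start + 1
        return
    for rhs in GRAMMAR[symbol]:
        partial = [([], start)]  # (children so far, position reached)
        for child in rhs:
            partial = [
                (kids + [tree], nxt)
                for kids, pos in partial
                for tree, nxt in parse(child, words, pos)
            ]
        for kids, pos in partial:
            yield (symbol, kids), pos


def parse_sentence(text):
    """Return every tree that covers the whole sentence."""
    words = text.rstrip(".").split()
    return [tree for tree, end in parse("sentence", words, 0) if end == len(words)]
```

With these assumed entries, `parse_sentence("This is a pen.")` returns exactly one tree, in which "This" is read as a pronoun because no rule can use its other part of speech.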

[0034] In the conversion unit 13, as in the syntax analysis unit 12, the tree-structure conversion rules in the dictionary/rule memory 18 are used to convert the structure of the structural analysis tree for the source sentence (FIG. 7) into a structure for a Japanese sentence as shown in FIG. 8. The result is stored in buffer D.

[0035] The translation generation unit 14 attaches the appropriate particle 「は」 and auxiliary verbs to the resulting Japanese string 「これペンである。」, turning it into the Japanese sentence 「これはペンである。」 ("This is a pen."), and stores it in buffer E. The translation stored in buffer E is output from the translation module 10, stored in the main memory 20, and displayed on the CRT 21.
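The walkthrough in [0029] to [0035] passes each stage's result through its own buffer: A holds the source sentence, B the morphological result, C the source tree, D the converted tree, and E the translation. The data flow can be sketched as below. The function `run_pipeline` and the four stage callables are hypothetical stand-ins, not the patent's implementation of units 11 to 14.

```python
from typing import Any, Callable, Dict


def run_pipeline(
    source: str,
    analyze: Callable[[str], Any],   # stands in for unit 11
    parse: Callable[[Any], Any],     # stands in for unit 12
    convert: Callable[[Any], Any],   # stands in for unit 13
    generate: Callable[[Any], str],  # stands in for unit 14
) -> Dict[str, Any]:
    """Run the four stages in order, recording each result in buffers A to E."""
    buffers: Dict[str, Any] = {"A": source}
    buffers["B"] = analyze(buffers["A"])
    buffers["C"] = parse(buffers["B"])
    buffers["D"] = convert(buffers["C"])
    buffers["E"] = generate(buffers["D"])
    return buffers


# Toy stages that only demonstrate the hand-off between buffers.
demo = run_pipeline(
    "This is a pen.",
    analyze=lambda text: text.rstrip(".").split(),
    parse=lambda words: ("source-tree", words),
    convert=lambda tree: ("target-tree", tree[1]),
    generate=lambda tree: " ".join(tree[1]),
)
```

Keeping every intermediate result makes it possible to inspect each stage separately, which matches the way the description refers back to buffers B and C later on.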

[0036] Next, the operation of translating English into Japanese with the part of speech of a phrase designated is described using the flowchart of FIG. 1.

[0037] In this embodiment, the part-of-speech designation symbols used to designate the part of speech of a phrase are the phrase part-of-speech designation start symbol "[" of Table 2, which indicates the kind of part of speech and the start point of the phrase, and the phrase part-of-speech designation end symbol "]", which indicates the end point of the phrase.

[0038]

[Table 2]

[0039] In Table 2, entry 1 is the symbol that designates the phrase it begins as a noun; likewise, entry 2 designates a verb, entry 3 an adjective, and entry 4 an adverb. When entering the source sentence with the keyboard 22, the user designates the part of speech of a phrase by inserting these symbols into the sentence.

[0040] The keyboard 22 and the CPU 19 function as the symbol adding means for adding part-of-speech designation symbols to the source sentence; the user can include the symbols by operating the keyboard 22.

[0041] The operation of the translation module 10 is described in detail below with reference to FIG. 1, taking as an example the input sentence: The price increased 20% d_[the previous year].

[0042] On receiving this source sentence, the translation module 10 stores it in buffer A (step S1).

[0043] Next, the morphological analysis unit 11, operating as the symbol detecting means, first sets the pointer indicating the word position to 0 so that it points to the first word, "The" (step S2).

[0044] It then determines whether the currently pointed word begins with a phrase part-of-speech designation start symbol (step S3); if not, it determines whether the word has a phrase part-of-speech designation end symbol (step S4).

[0045] If step S4 finds no end symbol, the currently pointed word, "The", is stored in buffer B together with its word position, 0, in the form shown in Table 3 (step S5). It is then determined whether there is a next word (step S6); if there is, the word position pointer is incremented by one (step S7) and processing returns to step S3. As a result, the words up to "The price increased 20%" are stored in buffer B.

[0046]

[Table 3]

[0047] When the word position pointer advances to word position 4 and step S3 is entered, the current word, "d_[the", begins with a part-of-speech designation start symbol. The designated part of speech, adverb, and the word position, 4, are therefore stored in buffer F in the form shown in Table 4 (steps S9 and S10). The number of words up to the end symbol "]" is then counted, and the count, 3, is stored in buffer F (step S11).

[0048]

[Table 4]

[0049] The morphological analysis unit 11 then stores the word, with the start symbol removed, in buffer B (step S12) and moves to step S6 to determine whether any words remain.

[0050] When the word position pointer advances to word position 6, the word "year]" is processed. The morphological analysis unit 11 checks in step S3 whether the word begins with a start symbol; since it does not, processing proceeds to step S4, which checks for an end symbol. Because the currently pointed word, "year]", ends with the end symbol, the word is stored in buffer B with the symbol removed (step S8).

[0051] When steps S3 to S12 have been repeated in this way and the words up to the end of the sentence have been stored in buffer B, the morphological analysis unit 11 proceeds from step S6 to step S13. It looks up each headword stored in buffer B in the dictionary, attaches part-of-speech and word-count information to each headword, and stores the result in buffer B in the form shown in Table 5.
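The scan of steps S2 to S12 can be sketched as a single pass over the words. This is an illustrative reading of the flowchart description, not the patent's code: the names `scan`, `phrase_length`, and `POS_BY_PREFIX` are hypothetical, only the adverb prefix `d_[` is taken from the example sentence (the other entries of Table 2 are not reproduced in the text and are left out), and stripping the sentence-final period before splitting is a simplifying assumption.

```python
import re
from typing import List, Tuple

# Only "d_[" (adverb) is confirmed by the example in [0041] and [0047].
POS_BY_PREFIX = {"d": "adverb"}

START_SYMBOL = re.compile(r"^([a-z])_\[")
END_SYMBOL = "]"


def phrase_length(words: List[str], start: int) -> int:
    """Count the words from `start` up to the word carrying the end symbol (S11)."""
    for count, word in enumerate(words[start:], start=1):
        if word.endswith(END_SYMBOL):
            return count
    raise ValueError("start symbol without a matching end symbol")


def scan(sentence: str) -> Tuple[List[Tuple[int, str]], List[Tuple[int, str, int]]]:
    """Return (buffer_b, buffer_f) for a sentence with designation symbols.

    buffer_b holds (word position, word) with the symbols removed;
    buffer_f holds (word position, part of speech, word count).
    """
    words = sentence.rstrip(".").split()            # S1: sentence held in buffer A
    buffer_b: List[Tuple[int, str]] = []
    buffer_f: List[Tuple[int, str, int]] = []
    for position, word in enumerate(words):         # S2, S6, S7: walk the pointer
        match = START_SYMBOL.match(word)
        if match:                                   # S3: start symbol found
            pos = POS_BY_PREFIX[match.group(1)]     # S9, S10: designated part of speech
            buffer_f.append((position, pos, phrase_length(words, position)))
            word = word[match.end():]               # S12: drop the start symbol
        if word.endswith(END_SYMBOL):               # S4: end symbol found
            word = word[: -len(END_SYMBOL)]         # S8: drop the end symbol
        buffer_b.append((position, word))           # S5, S8, S12: store in buffer B
    return buffer_b, buffer_f
```

For the example sentence this yields seven plain words in buffer B and the single entry `(4, "adverb", 3)` in buffer F. Unlike the flowchart as described, the sketch also checks a start-marked word for the end symbol, so a one-word phrase is handled as well.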

[0052]

[Table 5]

[0053] Next, the morphological analysis unit 11 adds the information stored in buffer F (Table 4) to buffer B. The word "the" at word position 4 had the part of speech "article" from dictionary lookup, and the information in buffer F now adds the part of speech "adverb", with a word count of 3, at word position 4. Buffer B therefore finally holds the information shown in Table 6.

[0054] The word count is 3 here so that the three words "the previous year" are treated as a single idiom and can be processed as one adverb during syntactic analysis.
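The merge described in [0053] and [0054] can be pictured as appending one extra candidate to the entry at the designated word position. The sketch below assumes a record layout for buffer B (a list of dictionaries with a `candidates` list of part-of-speech and word-count pairs) purely for illustration, since Tables 5 and 6 are not reproduced in the text; `merge_designations` is a hypothetical name.

```python
from typing import Dict, List, Tuple


def merge_designations(
    buffer_b: List[Dict],
    buffer_f: List[Tuple[int, str, int]],
) -> List[Dict]:
    """Return a copy of buffer B with each buffer F entry added as a candidate.

    Each buffer B entry keeps the candidates found by dictionary lookup,
    and the designated (part of speech, word count) pair is appended to them.
    """
    merged = [dict(entry, candidates=list(entry["candidates"])) for entry in buffer_b]
    for position, pos, word_count in buffer_f:
        merged[position]["candidates"].append((pos, word_count))
    return merged


# Assumed dictionary-lookup result for the example sentence.
looked_up = [
    {"word": "The", "candidates": [("article", 1)]},
    {"word": "price", "candidates": [("noun", 1)]},
    {"word": "increased", "candidates": [("verb", 1)]},
    {"word": "20%", "candidates": [("noun", 1)]},
    {"word": "the", "candidates": [("article", 1)]},
    {"word": "previous", "candidates": [("adjective", 1)]},
    {"word": "year", "candidates": [("noun", 1)]},
]
final_b = merge_designations(looked_up, [(4, "adverb", 3)])
```

After the merge, position 4 carries both `("article", 1)` and `("adverb", 3)`, so the parser can treat the three words starting there as one adverb.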

[0055]

[Table 6]

[0056] The syntax analysis unit 12 refers to the contents of the dictionary/rule memory 18 and determines the syntactic structure based on the part-of-speech information of each word stored in buffer B. Functioning as the part-of-speech determining means, it fixes the part of speech of "the previous year" as adverb, builds the syntactic structure tree shown in FIG. 9, and stores the result in buffer C.

[0057] Starting from this syntactic structure tree, the conversion unit 13 uses the tree-structure conversion rules in the dictionary/rule memory 18 to derive the Japanese syntactic structure tree and stores the result in buffer D.

[0058] The translation generation unit 14 generates the translation from the information stored in buffer D and finally outputs 「価格は前年２０％増大した。」 ("The price increased 20% the previous year."). This result is stored in the main memory 20 and displayed on the CRT 21.

[0059] If "the previous year" in the source sentence had not been designated as an adverb, ordinary syntactic analysis would treat it as a noun phrase. The syntactic structure could then not be assembled into a single sentence as in FIG. 9, the analysis would fail, and the translation would come out split, as in 「価格は２０％増大した｜前年」 ("The price increased 20% | previous year"). With the machine translation apparatus of the present invention, the part of speech of "the previous year" can be designated as described above, so the phrase is treated as an adverb and the correct translation 「価格は前年２０％増大した。」 is easily obtained.
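The contrast drawn in [0059] can be reproduced with a toy recognizer in which each word position offers candidates of the form (part of speech, word count). Everything below is an assumption made for illustration: the grammar, the category names, and the function names are hypothetical, and the patent does not disclose its actual rule set for this sentence.

```python
# Hypothetical grammar, just large enough for the example sentence.
GRAMMAR = {
    "sentence": [["noun_phrase", "verb_phrase"]],
    "verb_phrase": [["verb", "noun_phrase"], ["verb", "noun_phrase", "adverb"]],
    "noun_phrase": [["article", "noun"], ["article", "adjective", "noun"], ["quantity"]],
}


def ends(symbol, lattice, start):
    """Return the set of positions where `symbol` can end if it starts at `start`."""
    if symbol not in GRAMMAR:  # terminal: a candidate may span several words
        if start >= len(lattice):
            return set()
        return {start + length for category, length in lattice[start] if category == symbol}
    reached = set()
    for rhs in GRAMMAR[symbol]:
        positions = {start}
        for child in rhs:
            positions = {end for pos in positions for end in ends(child, lattice, pos)}
        reached |= positions
    return reached


def is_sentence(lattice):
    """True if one sentence covers every word position."""
    return len(lattice) in ends("sentence", lattice, 0)


# "The price increased 20% the previous year" without any designation.
plain = [
    [("article", 1)], [("noun", 1)], [("verb", 1)], [("quantity", 1)],
    [("article", 1)], [("adjective", 1)], [("noun", 1)],
]
# Same sentence after the designated adverb (3 words) is added at position 4.
marked = [list(candidates) for candidates in plain]
marked[4].append(("adverb", 3))
```

Under these assumed rules `is_sentence(plain)` is false, because the sentence ends after "20%" and the trailing noun phrase is left over, while `is_sentence(marked)` is true, because the three-word adverb candidate lets the verb phrase absorb the rest of the input.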

[0060]

[Effects of the invention] The machine translation apparatus of the present invention comprises syntax analysis means for analyzing the syntactic structure of a sentence written in a source language and divided into morphemes; symbol adding means for adding to the sentence a part-of-speech designation symbol that designates the part of speech of a phrase contained in the sentence; symbol detecting means for detecting the added part-of-speech designation symbol; and storage means for counting the number of words contained in the phrase when the symbol is detected and storing the part of speech of the phrase designated by the symbol together with that number of words. Because the syntax analysis means identifies the phrase from the stored number of words and analyzes the syntactic structure using the stored part of speech as the part of speech of the identified phrase, it becomes easy to identify the phrase and to fix its part of speech, the number of candidates during syntactic analysis can be reduced, and the processing time can be shortened. As a result, a correct translation is obtained quickly without wasted syntactic analysis.

[Brief description of the drawings]

FIG. 1 is a flowchart for explaining the operation of the translation module constituting the machine translation apparatus with a part-of-speech designation function according to the present invention.

FIG. 2 is a diagram showing the configuration of the machine translation apparatus with a part-of-speech designation function that includes the translation module of FIG. 1.

FIG. 3 is a specific block diagram of the translation module of FIG. 2.

FIG. 4 is a block diagram showing the whole of the machine translation apparatus with a part-of-speech designation function of FIG. 2.

FIG. 5 is an explanatory diagram showing the machine translation procedure of the machine translation apparatus of FIG. 2 broken down into levels.

FIG. 6 is an explanatory diagram showing the contents of buffer A of the translation module of FIG. 2.

FIG. 7 is a diagram showing the result of syntactic analysis in the translation module of FIG. 2.

FIG. 8 is a diagram showing the result of tree-structure conversion in the translation module of FIG. 2.

FIG. 9 is a diagram showing the result of syntactic analysis in the translation module of FIG. 2.

[Explanation of symbols]

10 Translation module
11 Dictionary-lookup morphological analysis unit
12 Syntax analysis unit
13 Conversion unit
14 Translation generation unit
15 Translation CPU
16 Translation program memory
17, 23 Bus
18 Dictionary/rule memory
A to E Buffers
19 Main CPU
20 Main memory
21 CRT
22 Keyboard

Continued from the front page

(72) Inventor: 佐田いち子, c/o Sharp Corporation, 22-22 Nagaike-cho, Abeno-ku, Osaka-shi, Osaka
(72) Inventor: 九津見毅, c/o Sharp Corporation, 22-22 Nagaike-cho, Abeno-ku, Osaka-shi, Osaka

(56) References cited:
Shinobu Takamatsu and Fujio Nishida, "English-Japanese Machine Translation Based on Verb Patterns and Case Structures" (動詞パターンと格構造に基づく英日機械翻訳), 電子通信学会論文誌, Vol. J64-D, No. 9, pp. 815-822 (September 1981)
Suzuki et al., "English-Japanese Machine Translation System DUET-E/J" (英日機械翻訳システムＤＵＥＴ−Ｅ／Ｊ), bit 別冊 機械翻訳, pp. 167-172, Kyoritsu Shuppan (September 1988)

Claims

(57) [Claims]

1. A machine translation apparatus comprising: syntax analysis means for analyzing the syntactic structure of a sentence written in a source language and divided into morphemes; symbol adding means for adding to the sentence a part-of-speech designation symbol that designates the part of speech of a phrase contained in the sentence; symbol detecting means for detecting the added part-of-speech designation symbol; and storage means for counting the number of words contained in the phrase when the part-of-speech designation symbol is detected and storing the part of speech of the phrase designated by the symbol together with the number of words, wherein the syntax analysis means identifies the phrase from the number of words stored in the storage means and analyzes the syntactic structure using the part of speech stored in the storage means as the part of speech of the identified phrase.

2. The machine translation apparatus according to claim 1, wherein the part-of-speech designation symbol comprises a symbol indicating that the phrase is one of a noun, a verb, an adjective, and an adverb, and a symbol indicating the start point or the end point of the phrase.