JPS63138465A

JPS63138465A - Analyzing device for syntax structure

Info

Publication number: JPS63138465A
Application number: JP61286348A
Authority: JP
Inventors: Yuji Sugano; 祐司菅野; Kenji Nagao; 健司長尾; Ryuichi Mato; 隆一間藤; Yoshihiro Ueda; 芳弘上田; Osamu Iwasaki; 修岩崎; Kenichi Ueda; 謙一上田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1986-12-01
Filing date: 1986-12-01
Publication date: 1988-06-10
Anticipated expiration: 2011-03-21
Also published as: JPH0827797B2

Abstract

PURPOSE:To improve the working efficiency with analysis of syntax structures by applying only a fixed syntax structure rule before analysis of the syntax structure to decrease the number of morphemes of an input sentence and reducing the analysis load of syntax structure at the next stage. CONSTITUTION:A syntax structure analyzing device contains a morpheme analyzing means 12 for input Japanese word sentences, a fixed rule deciding means 16 which decides whether just a single syntax rule is applicable to a part of the morpheme analysis result or not, a fixed rule application means 17 which uses the output of the means 16 to apply the syntax structure rule to the morpheme analysis result, and a syntax structure analyzing device 19 which applies the syntax structure analysis to a morpheme string undergone application of a fixed rule. The syntax structure rule is applied to an area where an applicable syntax structure rule can be fixed among those morpheme analysis results. Thus the number of morphemes are successively decreased and the result of this reduction is used as the input of a syntax structure analyzing part. In such a way, the analyzing time is shortened.

Description

【発明の詳細な説明】産業上の利用分野本発明は、機械翻訳システムやデータベース検索システ
ム等の自然言語処理部の構文解析装置に関するものであ
る。DETAILED DESCRIPTION OF THE INVENTION Field of Industrial Application The present invention relates to a parsing device for a natural language processing unit of a machine translation system, a database search system, or the like.

従来の技術自然言語の文を解析して構文構造を求める方法としては
、第２図のような構成のものが知られている。図におい
て、入力文１は文字列として形態素解析装置２に読み込
まれ、品詞と表記とを持つ形態素の列に分解される。こ
のとき、形態素の同定のために形態素辞書３を用いる。BACKGROUND OF THE INVENTION As a conventional method of analyzing natural language sentences to obtain syntactic structures, a structure as shown in FIG. 2 is known. In the figure, an input sentence 1 is read into a morphological analysis device 2 as a character string and decomposed into a string of morphemes having parts of speech and notation. At this time, the morpheme dictionary 3 is used to identify morphemes.

形態素解析装置２の出力は、形態素の列４として構文解
析装置５に供給され、構文規則６を用いて統語構造が解
析され、構文木７として出力される。構文規則６として
現在よく用いられている形式の１つに、文脈自由文法、
または各規則毎に、何らかの拡張機能が付加された拡張
文脈自由文法がある。文脈自由文法を用いる利点の１つ
は、構文解析が有限ステップで終了することが保障され
た手法（アルゴリズム）が知られていることである。The output of the morphological analysis device 2 is supplied to the syntactic analysis device 5 as a morpheme sequence 4, the syntactic structure is analyzed using the syntactic rules 6, and the result is output as a syntax tree 7. One of the formats currently commonly used as syntax rule 6 is context-free grammar,
Alternatively, for each rule, there is an extended context-free grammar with some extended functions added. One of the advantages of using a context-free grammar is that a method (algorithm) is known that guarantees that parsing is completed in finite steps.

発明が解決しようとする問題点しかし、従来の解析手法では、解析に要する時間や記憶
容量が、入力文の形態素数ｎが多くなるにつれて急激に
増大するという問題点があった。Problems to be Solved by the Invention However, conventional analysis methods have the problem that the time and storage capacity required for analysis rapidly increase as the number n of morphemes in an input sentence increases.

最も効率がよいとされているアー！Ｊ−（Ｅａｒｌｅｙ
）の解析手法を用いた場合でも、解析に要する時間はｒ
Ｌ３のオーダーであシ、他の手法の中にはｅ％のオーダ
ーのものもある。本発明は上記問題点を解決するもので
、構文解析を行なう前に、適用可能な構文規則が一意的
に定まる箇所について、あらかじめ構文規則を適用して
、入力文の形態素数を減らし、その結果を構文解析部の
入力とする事によシ、効率のよい構文解析手法を提供す
ることを目的とする。Ah! is said to be the most efficient! J-(Earley
), the time required for analysis is r
It is on the order of L3; some other methods are on the order of e%. The present invention solves the above-mentioned problems by applying syntactic rules in advance to locations where applicable syntactic rules are uniquely determined before performing syntactic analysis to reduce the number of morphemes in the input sentence. The purpose is to provide an efficient syntactic analysis method by using this as input to the syntax analysis section.

問題点を解決するための手段本発明は上記目的を達成するために、入力された日本語
文を形態素解析する手段と、形態素解析結果の一部分に
適用できる構文規則が、ただ１つかどうかの判断を行な
う確定規則判別手段と、前記確定規則判別手段の出力を
用いて、形態素解析結果に構文規則を適用する確定規則
適用手段と、確定規則適用後の形態素列を構文解析する
構文解析装置とを設けたものである。Means for Solving the Problems In order to achieve the above object, the present invention provides means for morphologically analyzing an input Japanese sentence and determining whether only one syntactic rule is applicable to a portion of the morphologically analyzed result. a deterministic rule discriminating means for applying a syntactic rule to a morphological analysis result using the output of the deterministic rule discriminating means, and a syntactic analysis device for parsing a morpheme sequence after applying the deterministic rule. It is something that

作用本発明は上記構成によシ、形態素解析結果のうち、適用
でき石構文規則が確定できる箇所について、構文規則を
適用して形態素数を順次減らしてゆき、その結果を、構
文解析部の入力とすることによシ解析に要する時間を大
幅に短縮するようにしたものである。According to the above-mentioned structure, the present invention applies syntactic rules to sequentially reduce the number of morphemes in the portions of the morphological analysis results for which applicable syntactic rules can be determined, and inputs the results into the syntactic analysis unit. By doing so, the time required for analysis is greatly reduced.

実施例以下、図面を参照しながら本発明の実施例について説明
する。第１図は本発明の実施例による構文解析装置であ
る。図において、分かち書きされていない、べた書きの
漢字かな混じ９日本語文１１が、形態素解析装置１２に
入力される。形態素解析装置１２は、形態素辞書１３を
参照して、文中の形態素を同定し、隣接形態素が接続し
得るかどうかのチェックを行ない、形態素の列１４を、
妥当性の高い順に出力する。確定規則判別装置１６は、
形態素の列１４と文脈自由文法規則１５から、形態素の
部分列に対し、一意的に適用可能な文脈自由文法規則を
探索し、もし存在した場合には、それらの規則を確定構
文規則として確定規則適用装置１７に出力する。確定規
則適用装置１７は、確定構文規則を形態素の列１４に対
して適用し、その結果、形態素の列１４は確定構文規則
による変更を受けて構文要素の数が減少する。変更後の
形態素の列に対して、再び確定規則判別装置１６が確定
構文規則を探索し、確定構文規則が存在する場合には、
再び確定構文規則の適用が、確定規則適用装置１７で行
なわれる。Embodiments Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 shows a parsing device according to an embodiment of the present invention. In the figure, nine Japanese sentences 11 written in solid letters, including kanji and kana, without any separation, are input to the morphological analysis device 12. The morpheme analysis device 12 refers to the morpheme dictionary 13, identifies morphemes in the sentence, checks whether adjacent morphemes can be connected, and converts the morpheme string 14 into
Output in order of validity. The confirmed rule discriminator 16 is
Search for context-free grammar rules that are uniquely applicable to the morpheme subsequence from the morpheme sequence 14 and context-free grammar rules 15, and if any exist, use these rules as definite syntax rules. It is output to the application device 17. The deterministic rule applying device 17 applies the deterministic syntax rules to the morpheme sequence 14, and as a result, the morpheme sequence 14 is modified by the deterministic syntactic rules and the number of syntactic elements is reduced. The definite rule discriminator 16 searches for a definite syntax rule again for the changed morpheme sequence, and if a definite syntax rule exists,
Application of the definite syntax rules again takes place in the definite rule application device 17.

この手続きは、確定構文規則が無くなるまで繰シ返され
、形態素の列の長さは順次減少してゆき、最終的に、短
縮された形態素列１８が構文解析装置１９に出力される
。構文解析装置１９は短縮された形態素列１８を受は取
って文脈自由文法規則１５を用いて、構文解析し、構文
木２ｏが得られる。This procedure is repeated until there are no more definite syntactic rules, and the length of the morpheme sequence is successively reduced.Finally, a shortened morpheme sequence 18 is output to the syntactic analysis device 19. The syntax analysis device 19 receives the shortened morpheme sequence 18 and parses it using the context-free grammar rules 15 to obtain a syntax tree 2o.

発明の効果以上のように、本発明は構文解析する前に、確定構文規
則のみを適用して入力文の形態素数を減らし、後段の構
文解析の負担を減らすことにより、効率のよい構文解析
を行なうことができ、その効果は大きい。Effects of the Invention As described above, the present invention reduces the number of morphemes in an input sentence by applying only definite syntactic rules before parsing, and reduces the burden of subsequent syntactic analysis, thereby achieving efficient syntactic analysis. It can be done and the effects are great.

[Brief explanation of the drawing]

第１図は本発明の実施例における日本語構文解析装置の
概念図、第２図は従来の構文解析装置の概念図である。１１・・・べた書き・漢字かな混じ構文、１２・・・形
態素解析装置、１３・・・形態素辞書、１４・・・形態
素の列、１５・・・文脈自由文法規則、１６・・・確定
規則判別装置、１７・・・確定規則適用装置、１８・・
・短縮された形態素列、１９・・・構文解析装置、２０
・・・構文木。代理人の氏名　弁理士　中　尾　敏　男　ほか１名第１
図第２図FIG. 1 is a conceptual diagram of a Japanese language syntax analysis device according to an embodiment of the present invention, and FIG. 2 is a conceptual diagram of a conventional syntax analysis device. 11...Solid writing/Kanji-kana mixed syntax, 12...Morphological analyzer, 13...Morpheme dictionary, 14...Sequence of morphemes, 15...Context-free grammar rules, 16...Determined rules Discrimination device, 17... Determined rule application device, 18...
・Shortened morpheme sequence, 19... Syntactic analysis device, 20
...Syntax tree. Name of agent: Patent attorney Toshio Nakao and 1 other person No. 1
Figure 2

Claims

[Claims]

(1) A morpheme sequence after applying the syntactic rules, comprising a means for morphologically analyzing an input sentence and a means for applying the syntactic rules to a location where only one syntactic rule applicable to the morphologically analyzed sentence can be determined. A syntactic analysis device, characterized in that: is used as an input to a syntactic analysis means.

(2) The syntactic analysis device according to claim 1, characterized in that a format of a context-free grammar or an extended context-free grammar is used as a syntax rule.