JPS63138465A - Analyzing device for syntax structure - Google Patents
Analyzing device for syntax structure
Info
- Publication number
- JPS63138465A
- Authority
- JP
- Japan
- Prior art keywords
- syntax structure
- rule
- analysis
- syntax
- morpheme
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Machine Translation (AREA)
Abstract
Description
DETAILED DESCRIPTION OF THE INVENTION
Field of Industrial Application
The present invention relates to a syntactic analysis device for the natural language processing section of a machine translation system, a database retrieval system, or the like.
Prior Art
As a conventional method of analyzing a natural language sentence to obtain its syntactic structure, the configuration shown in FIG. 2 is known. In the figure, an input sentence 1 is read as a character string into a morphological analysis device 2 and decomposed into a sequence of morphemes, each having a part of speech and a surface form. A morpheme dictionary 3 is used to identify the morphemes. The output of the morphological analysis device 2 is supplied as a morpheme sequence 4 to a syntactic analysis device 5, which analyzes the syntactic structure using syntax rules 6 and outputs the result as a syntax tree 7. One format currently in common use for the syntax rules 6 is the context-free grammar, or an extended context-free grammar in which some extension function is attached to each rule. One advantage of using a context-free grammar is that parsing methods (algorithms) are known that are guaranteed to terminate in a finite number of steps.
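The conventional flow of FIG. 2, from a morpheme sequence to a parse with context-free rules, can be illustrated with a small sketch. It uses the CYK algorithm, which is chosen here only for brevity and is not named in the patent; the toy grammar, the romanized example sentence, and the names `BINARY_RULES` and `cyk_recognize` are all assumptions made for illustration. The loops are bounded by the sentence length, which is the sense in which termination in a finite number of steps is guaranteed.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy grammar (not from the patent), binary rules only:
# (left, right) -> categories A such that A -> left right.
BINARY_RULES: Dict[Tuple[str, str], Set[str]] = {
    ("N", "P"): {"NP"},    # noun + particle
    ("NP", "V"): {"VP"},   # noun phrase + verb
    ("NP", "VP"): {"S"},   # sentence
}


def cyk_recognize(pos_tags: List[str],
                  rules: Dict[Tuple[str, str], Set[str]],
                  start: str = "S") -> bool:
    """Return True if the part-of-speech sequence derives `start`."""
    n = len(pos_tags)
    if n == 0:
        return False
    # table[i][j] holds every category that spans pos_tags[i..j] inclusive.
    table = [[set() for _ in range(n)] for _ in range(n)]
    for i, tag in enumerate(pos_tags):
        table[i][i].add(tag)
    # Span length, start position and split point are all bounded by n,
    # so the amount of work is finite (on the order of n^3 combinations).
    for length in range(2, n + 1):
        for i in range(n - length + 1):
            j = i + length - 1
            for k in range(i, j):
                for left in table[i][k]:
                    for right in table[k + 1][j]:
                        table[i][j] |= rules.get((left, right), set())
    return start in table[0][n - 1]


# Morpheme sequence as (surface form, part of speech) pairs.
morphemes = [("neko", "N"), ("ga", "P"), ("sakana", "N"),
             ("o", "P"), ("taberu", "V")]
accepted = cyk_recognize([pos for _, pos in morphemes], BINARY_RULES)
```

A real system would also build the syntax tree 7 rather than only accept or reject; the recognizer is kept minimal to show the bounded loop structure.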
Problems to Be Solved by the Invention
However, conventional analysis methods have the problem that the time and memory required for analysis grow rapidly as the number of morphemes n in the input sentence increases. Even with Earley's parsing method, which is regarded as the most efficient, the time required for analysis is on the order of n^3, and some other methods are on the order of e^n. The present invention solves this problem. Its object is to provide an efficient syntactic analysis method in which, before syntactic analysis is performed, syntax rules are applied in advance at those locations where the applicable syntax rule is uniquely determined, thereby reducing the number of morphemes in the input sentence, and the result is used as the input to the syntactic analysis section.
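To make the growth rates concrete, the following sketch compares how much work is saved when the number of morphemes is reduced before parsing. The cost functions are idealized models (constant factors ignored), and the figures 20 and 12 are arbitrary illustrative values, not measurements from the patent.

```python
import math


def cubic_cost(n: int) -> float:
    """Idealized cost of a parser whose time grows as n^3."""
    return float(n ** 3)


def exponential_cost(n: int) -> float:
    """Idealized cost of a parser whose time grows as e^n."""
    return math.exp(n)


def speedup(cost, n_before: int, n_after: int) -> float:
    """Ratio of the modeled cost before and after shortening the input."""
    return cost(n_before) / cost(n_after)


# Hypothetical case: pre-reduction shortens a 20-morpheme input to 12.
cubic_gain = speedup(cubic_cost, 20, 12)              # 8000 / 1728, about 4.6
exponential_gain = speedup(exponential_cost, 20, 12)  # e^8, about 2981
```

Under these assumptions even a modest reduction in n pays off several times over for an n^3 method, and by orders of magnitude for an e^n method.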
Means for Solving the Problems
To achieve the above object, the present invention comprises: means for morphologically analyzing an input Japanese sentence; definite rule discrimination means for judging whether exactly one syntax rule is applicable to a portion of the morphological analysis result; definite rule application means for applying syntax rules to the morphological analysis result using the output of the definite rule discrimination means; and a syntactic analysis device for parsing the morpheme sequence obtained after the definite rules have been applied.
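The patent does not spell out how "exactly one applicable rule" is tested. One possible reading, sketched below purely as an assumption, is that a rule match is definite when no other rule match overlaps the same portion of the sequence. The names `Rule`, `Match`, `find_definite_matches` and the toy rules are hypothetical.

```python
from typing import List, NamedTuple, Sequence, Tuple


class Rule(NamedTuple):
    lhs: str
    rhs: Tuple[str, ...]   # assumed non-empty


class Match(NamedTuple):
    start: int
    rule: Rule


def all_matches(seq: Sequence[str], rules: Sequence[Rule]) -> List[Match]:
    """Every position where some rule's right-hand side matches."""
    return [Match(i, r)
            for r in rules
            for i in range(len(seq) - len(r.rhs) + 1)
            if tuple(seq[i:i + len(r.rhs)]) == r.rhs]


def overlaps(a: Match, b: Match) -> bool:
    """True if the two matched spans share at least one position."""
    return (a.start < b.start + len(b.rule.rhs)
            and b.start < a.start + len(a.rule.rhs))


def find_definite_matches(seq: Sequence[str],
                          rules: Sequence[Rule]) -> List[Match]:
    """Assumed criterion: keep a match only if no other match overlaps it."""
    matches = all_matches(seq, rules)
    return [m for i, m in enumerate(matches)
            if not any(i != j and overlaps(m, o)
                       for j, o in enumerate(matches))]


# Hypothetical rules: NP -> N P, NP -> N N (compound noun), VP -> NP V.
RULES = [Rule("NP", ("N", "P")), Rule("NP", ("N", "N")), Rule("VP", ("NP", "V"))]

clear = find_definite_matches(["N", "P", "N", "P", "V"], RULES)
ambiguous = find_definite_matches(["N", "N", "P", "V"], RULES)
```

In the first sequence both `N P` spans are matched by one rule only, so both are reported; in the second, `N N` and `N P` compete for the middle noun, so neither is treated as definite and the decision is left to the full parser. A stricter criterion based on the whole grammar is equally conceivable.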
Operation
With the above configuration, the present invention applies syntax rules at those locations in the morphological analysis result where the applicable syntax rule can be determined, successively reducing the number of morphemes, and uses the result as the input to the syntactic analysis section, thereby greatly shortening the time required for analysis.
Embodiment
An embodiment of the present invention is described below with reference to the drawings. FIG. 1 shows a syntactic analysis device according to an embodiment of the present invention. In the figure, a Japanese sentence 11 in mixed kanji and kana, written solid without spaces between words, is input to a morphological analysis device 12. The morphological analysis device 12 refers to a morpheme dictionary 13 to identify the morphemes in the sentence, checks whether adjacent morphemes can be connected, and outputs morpheme sequences 14 in descending order of plausibility. A definite rule discrimination device 16 searches, using the morpheme sequence 14 and context-free grammar rules 15, for context-free grammar rules that are uniquely applicable to a subsequence of the morphemes and, if any exist, outputs them as definite syntax rules to a definite rule application device 17. The definite rule application device 17 applies the definite syntax rules to the morpheme sequence 14; as a result, the morpheme sequence 14 is modified by the definite syntax rules and its number of syntactic elements decreases. The definite rule discrimination device 16 then searches the modified morpheme sequence for definite syntax rules again, and if any exist, the definite rule application device 17 applies them again.
This procedure is repeated until no definite syntax rules remain, so that the length of the morpheme sequence decreases step by step, and finally a shortened morpheme sequence 18 is output to a syntactic analysis device 19. The syntactic analysis device 19 receives the shortened morpheme sequence 18 and parses it using the context-free grammar rules 15, yielding a syntax tree 20.
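The repeat-until-stable loop of the embodiment can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes that a match is definite when no other rule match overlaps it, that every rule has at least two symbols on its right-hand side (so each application shortens the sequence and the loop terminates), and that partial trees are kept in a hypothetical `Node` class. The rules and the romanized sentence are invented for the example.

```python
from dataclasses import dataclass, field
from typing import List, Sequence, Tuple

Rule = Tuple[str, Tuple[str, ...]]   # (left-hand side, right-hand side)


@dataclass
class Node:
    cat: str                                   # part of speech or nonterminal
    children: List["Node"] = field(default_factory=list)
    surface: str = ""                          # set only on morpheme leaves


def definite_spans(nodes: Sequence[Node], rules: Sequence[Rule]):
    """Spans (start, end, lhs) matched by a rule and overlapped by no other."""
    cats = [n.cat for n in nodes]
    spans = [(i, i + len(rhs), lhs)
             for lhs, rhs in rules
             for i in range(len(cats) - len(rhs) + 1)
             if tuple(cats[i:i + len(rhs)]) == rhs]
    return [s for k, s in enumerate(spans)
            if not any(k != m and s[0] < o[1] and o[0] < s[1]
                       for m, o in enumerate(spans))]


def shorten(nodes: Sequence[Node], rules: Sequence[Rule]) -> List[Node]:
    """Apply definite rules repeatedly until none remains."""
    nodes = list(nodes)
    while True:
        found = definite_spans(nodes, rules)
        if not found:
            return nodes   # would be handed to the full parser
        # Replace from right to left so earlier indices stay valid.
        for start, end, lhs in sorted(found, reverse=True):
            nodes[start:end] = [Node(lhs, children=nodes[start:end])]


# Hypothetical rules with right-hand sides of length 2.
RULES: List[Rule] = [("NP", ("N", "P")), ("NP", ("N", "N")),
                     ("VP", ("NP", "V")), ("S", ("NP", "VP"))]

words = [("neko", "N"), ("ga", "P"), ("sakana", "N"),
         ("ryouri", "N"), ("o", "P"), ("taberu", "V")]
shortened = shorten([Node(pos, surface=w) for w, pos in words], RULES)
```

Here only `neko ga` is reduced in advance, because `sakana ryouri` and `ryouri o` compete for the same noun; the six-element input therefore reaches the parser as five elements, with the ambiguous part left untouched.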
Effects of the Invention
As described above, the present invention applies only definite syntax rules before syntactic analysis to reduce the number of morphemes in the input sentence, which lightens the load on the subsequent syntactic analysis and enables efficient parsing; the benefit is substantial.
Brief Description of the Drawings
FIG. 1 is a conceptual diagram of a Japanese syntactic analysis device according to an embodiment of the present invention, and FIG. 2 is a conceptual diagram of a conventional syntactic analysis device.
11: Japanese sentence in mixed kanji and kana, written solid; 12: morphological analysis device; 13: morpheme dictionary; 14: morpheme sequence; 15: context-free grammar rules; 16: definite rule discrimination device; 17: definite rule application device; 18: shortened morpheme sequence; 19: syntactic analysis device; 20: syntax tree.
Name of agent: Patent attorney Toshio Nakao and one other
FIG. 1
FIG. 2
Claims (2)
(1) A syntactic analysis device comprising: means for morphologically analyzing an input sentence; and means for applying a syntax rule at each location in the morphologically analyzed sentence where the applicable syntax rule can be determined to be exactly one, wherein the morpheme sequence obtained after the syntax rules have been applied is used as the input to a syntactic analysis means.
(2) The syntactic analysis device according to claim 1, wherein a context-free grammar or an extended context-free grammar is used as the format of the syntax rules.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP61286348A JPH0827797B2 (en) | 1986-12-01 | 1986-12-01 | Parser |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP61286348A JPH0827797B2 (en) | 1986-12-01 | 1986-12-01 | Parser |
Publications (2)
Publication Number | Publication Date |
---|---|
JPS63138465A | 1988-06-10 |
JPH0827797B2 | 1996-03-21 |
Family
ID=17703222
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP61286348A Expired - Fee Related JPH0827797B2 (en) | 1986-12-01 | 1986-12-01 | Parser |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPH0827797B2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8838440B2 (en) | 2010-09-14 | 2014-09-16 | International Business Machines Corporation | Generating parser combination by combining language processing parsers |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS59140582A (en) * | 1983-01-31 | 1984-08-11 | Hitachi Ltd | Natural language translation assisting system |
JPS59183469A (en) * | 1983-03-31 | 1984-10-18 | Fujitsu Ltd | Machine interpreter |
JPS61105671A (en) * | 1984-10-29 | 1986-05-23 | Hitachi Ltd | Natural language processing device |
Also Published As
Publication number | Publication date |
---|---|
JPH0827797B2 (en) | 1996-03-21 |
Similar Documents
Publication | Title |
---|---|
US5890103A (en) | Method and apparatus for improved tokenization of natural language text |
JPH0242572A (en) | Preparation/maintenance method for co-occurrence relation dictionary |
JPS638864A (en) | Translating device |
US6535886B1 (en) | Method to compress linguistic structures |
US7302384B2 (en) | Left-corner chart parsing |
JPS63138465A (en) | Analyzing device for syntax structure |
JP2807236B2 (en) | Morphological analysis method |
Carter | Lattice-based word identification in CLARE |
JPS62139076A (en) | Language analysis system |
JPS63221475A (en) | Analyzing method for syntax |
JPH0320866A (en) | Text base retrieval system |
JPS6368972A (en) | Unregistered word processing system |
JP2954592B2 (en) | Language analyzer |
JP2719453B2 (en) | Machine translation equipment |
JP3197110B2 (en) | Natural language analyzer and machine translator |
JPH0785057A (en) | Syntax analyzing method and device therefor |
JPS6389975A (en) | Language analyzer |
JPH04296969A (en) | Mechanical translation device |
JPS62263568A (en) | Word processor |
JPS63143669A (en) | Method for analyzing syntactical structure |
JPS63255773A (en) | Mechanical translation processing system |
JPH01145772A (en) | Word retrieving device |
JPH01213750A (en) | Salvage method in syntax analysis for mechanical translation |
JPH0528178A (en) | Japanese word input device |
JPH02110771A (en) | Electronic translation device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
LAPS | Cancellation because of no payment of annual fees |