[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

JPS63138465A - Analyzing device for syntax structure - Google Patents

Analyzing device for syntax structure

Info

Publication number
JPS63138465A
JPS63138465A JP61286348A JP28634886A JPS63138465A JP S63138465 A JPS63138465 A JP S63138465A JP 61286348 A JP61286348 A JP 61286348A JP 28634886 A JP28634886 A JP 28634886A JP S63138465 A JPS63138465 A JP S63138465A
Authority
JP
Japan
Prior art keywords
syntax structure
rule
analysis
syntax
morpheme
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP61286348A
Other languages
Japanese (ja)
Other versions
JPH0827797B2 (en
Inventor
Yuji Sugano
祐司 菅野
Kenji Nagao
健司 長尾
Ryuichi Mato
隆一 間藤
Yoshihiro Ueda
芳弘 上田
Osamu Iwasaki
修 岩崎
Kenichi Ueda
謙一 上田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP61286348A priority Critical patent/JPH0827797B2/en
Publication of JPS63138465A publication Critical patent/JPS63138465A/en
Publication of JPH0827797B2 publication Critical patent/JPH0827797B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

PURPOSE:To improve the working efficiency with analysis of syntax structures by applying only a fixed syntax structure rule before analysis of the syntax structure to decrease the number of morphemes of an input sentence and reducing the analysis load of syntax structure at the next stage. CONSTITUTION:A syntax structure analyzing device contains a morpheme analyzing means 12 for input Japanese word sentences, a fixed rule deciding means 16 which decides whether just a single syntax rule is applicable to a part of the morpheme analysis result or not, a fixed rule application means 17 which uses the output of the means 16 to apply the syntax structure rule to the morpheme analysis result, and a syntax structure analyzing device 19 which applies the syntax structure analysis to a morpheme string undergone application of a fixed rule. The syntax structure rule is applied to an area where an applicable syntax structure rule can be fixed among those morpheme analysis results. Thus the number of morphemes are successively decreased and the result of this reduction is used as the input of a syntax structure analyzing part. In such a way, the analyzing time is shortened.

Description

【発明の詳細な説明】 産業上の利用分野 本発明は、機械翻訳システムやデータベース検索システ
ム等の自然言語処理部の構文解析装置に関するものであ
る。
DETAILED DESCRIPTION OF THE INVENTION Field of Industrial Application The present invention relates to a parsing device for a natural language processing unit of a machine translation system, a database search system, or the like.

従来の技術 自然言語の文を解析して構文構造を求める方法としては
、第2図のような構成のものが知られている。図におい
て、入力文1は文字列として形態素解析装置2に読み込
まれ、品詞と表記とを持つ形態素の列に分解される。こ
のとき、形態素の同定のために形態素辞書3を用いる。
BACKGROUND OF THE INVENTION As a conventional method of analyzing natural language sentences to obtain syntactic structures, a structure as shown in FIG. 2 is known. In the figure, an input sentence 1 is read into a morphological analysis device 2 as a character string and decomposed into a string of morphemes having parts of speech and notation. At this time, the morpheme dictionary 3 is used to identify morphemes.

形態素解析装置2の出力は、形態素の列4として構文解
析装置5に供給され、構文規則6を用いて統語構造が解
析され、構文木7として出力される。構文規則6として
現在よく用いられている形式の1つに、文脈自由文法、
または各規則毎に、何らかの拡張機能が付加された拡張
文脈自由文法がある。文脈自由文法を用いる利点の1つ
は、構文解析が有限ステップで終了することが保障され
た手法(アルゴリズム)が知られていることである。
The output of the morphological analysis device 2 is supplied to the syntactic analysis device 5 as a morpheme sequence 4, the syntactic structure is analyzed using the syntactic rules 6, and the result is output as a syntax tree 7. One of the formats currently commonly used as syntax rule 6 is context-free grammar,
Alternatively, for each rule, there is an extended context-free grammar with some extended functions added. One of the advantages of using a context-free grammar is that a method (algorithm) is known that guarantees that parsing is completed in finite steps.

発明が解決しようとする問題点 しかし、従来の解析手法では、解析に要する時間や記憶
容量が、入力文の形態素数nが多くなるにつれて急激に
増大するという問題点があった。
Problems to be Solved by the Invention However, conventional analysis methods have the problem that the time and storage capacity required for analysis rapidly increase as the number n of morphemes in an input sentence increases.

最も効率がよいとされているアー!J−(Earley
)の解析手法を用いた場合でも、解析に要する時間はr
L3のオーダーであシ、他の手法の中にはe%のオーダ
ーのものもある。本発明は上記問題点を解決するもので
、構文解析を行なう前に、適用可能な構文規則が一意的
に定まる箇所について、あらかじめ構文規則を適用して
、入力文の形態素数を減らし、その結果を構文解析部の
入力とする事によシ、効率のよい構文解析手法を提供す
ることを目的とする。
Ah! is said to be the most efficient! J-(Earley
), the time required for analysis is r
It is on the order of L3; some other methods are on the order of e%. The present invention solves the above-mentioned problems by applying syntactic rules in advance to locations where applicable syntactic rules are uniquely determined before performing syntactic analysis to reduce the number of morphemes in the input sentence. The purpose is to provide an efficient syntactic analysis method by using this as input to the syntax analysis section.

問題点を解決するための手段 本発明は上記目的を達成するために、入力された日本語
文を形態素解析する手段と、形態素解析結果の一部分に
適用できる構文規則が、ただ1つかどうかの判断を行な
う確定規則判別手段と、前記確定規則判別手段の出力を
用いて、形態素解析結果に構文規則を適用する確定規則
適用手段と、確定規則適用後の形態素列を構文解析する
構文解析装置とを設けたものである。
Means for Solving the Problems In order to achieve the above object, the present invention provides means for morphologically analyzing an input Japanese sentence and determining whether only one syntactic rule is applicable to a portion of the morphologically analyzed result. a deterministic rule discriminating means for applying a syntactic rule to a morphological analysis result using the output of the deterministic rule discriminating means, and a syntactic analysis device for parsing a morpheme sequence after applying the deterministic rule. It is something that

作用 本発明は上記構成によシ、形態素解析結果のうち、適用
でき石構文規則が確定できる箇所について、構文規則を
適用して形態素数を順次減らしてゆき、その結果を、構
文解析部の入力とすることによシ解析に要する時間を大
幅に短縮するようにしたものである。
According to the above-mentioned structure, the present invention applies syntactic rules to sequentially reduce the number of morphemes in the portions of the morphological analysis results for which applicable syntactic rules can be determined, and inputs the results into the syntactic analysis unit. By doing so, the time required for analysis is greatly reduced.

実施例 以下、図面を参照しながら本発明の実施例について説明
する。第1図は本発明の実施例による構文解析装置であ
る。図において、分かち書きされていない、べた書きの
漢字かな混じ9日本語文11が、形態素解析装置12に
入力される。形態素解析装置12は、形態素辞書13を
参照して、文中の形態素を同定し、隣接形態素が接続し
得るかどうかのチェックを行ない、形態素の列14を、
妥当性の高い順に出力する。確定規則判別装置16は、
形態素の列14と文脈自由文法規則15から、形態素の
部分列に対し、一意的に適用可能な文脈自由文法規則を
探索し、もし存在した場合には、それらの規則を確定構
文規則として確定規則適用装置17に出力する。確定規
則適用装置17は、確定構文規則を形態素の列14に対
して適用し、その結果、形態素の列14は確定構文規則
による変更を受けて構文要素の数が減少する。変更後の
形態素の列に対して、再び確定規則判別装置16が確定
構文規則を探索し、確定構文規則が存在する場合には、
再び確定構文規則の適用が、確定規則適用装置17で行
なわれる。
Embodiments Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 shows a parsing device according to an embodiment of the present invention. In the figure, nine Japanese sentences 11 written in solid letters, including kanji and kana, without any separation, are input to the morphological analysis device 12. The morpheme analysis device 12 refers to the morpheme dictionary 13, identifies morphemes in the sentence, checks whether adjacent morphemes can be connected, and converts the morpheme string 14 into
Output in order of validity. The confirmed rule discriminator 16 is
Search for context-free grammar rules that are uniquely applicable to the morpheme subsequence from the morpheme sequence 14 and context-free grammar rules 15, and if any exist, use these rules as definite syntax rules. It is output to the application device 17. The deterministic rule applying device 17 applies the deterministic syntax rules to the morpheme sequence 14, and as a result, the morpheme sequence 14 is modified by the deterministic syntactic rules and the number of syntactic elements is reduced. The definite rule discriminator 16 searches for a definite syntax rule again for the changed morpheme sequence, and if a definite syntax rule exists,
Application of the definite syntax rules again takes place in the definite rule application device 17.

この手続きは、確定構文規則が無くなるまで繰シ返され
、形態素の列の長さは順次減少してゆき、最終的に、短
縮された形態素列18が構文解析装置19に出力される
。構文解析装置19は短縮された形態素列18を受は取
って文脈自由文法規則15を用いて、構文解析し、構文
木2oが得られる。
This procedure is repeated until there are no more definite syntactic rules, and the length of the morpheme sequence is successively reduced.Finally, a shortened morpheme sequence 18 is output to the syntactic analysis device 19. The syntax analysis device 19 receives the shortened morpheme sequence 18 and parses it using the context-free grammar rules 15 to obtain a syntax tree 2o.

発明の効果 以上のように、本発明は構文解析する前に、確定構文規
則のみを適用して入力文の形態素数を減らし、後段の構
文解析の負担を減らすことにより、効率のよい構文解析
を行なうことができ、その効果は大きい。
Effects of the Invention As described above, the present invention reduces the number of morphemes in an input sentence by applying only definite syntactic rules before parsing, and reduces the burden of subsequent syntactic analysis, thereby achieving efficient syntactic analysis. It can be done and the effects are great.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の実施例における日本語構文解析装置の
概念図、第2図は従来の構文解析装置の概念図である。 11・・・べた書き・漢字かな混じ構文、12・・・形
態素解析装置、13・・・形態素辞書、14・・・形態
素の列、15・・・文脈自由文法規則、16・・・確定
規則判別装置、17・・・確定規則適用装置、18・・
・短縮された形態素列、19・・・構文解析装置、20
・・・構文木。 代理人の氏名 弁理士 中 尾 敏 男 ほか1名第1
図 第2図
FIG. 1 is a conceptual diagram of a Japanese language syntax analysis device according to an embodiment of the present invention, and FIG. 2 is a conceptual diagram of a conventional syntax analysis device. 11...Solid writing/Kanji-kana mixed syntax, 12...Morphological analyzer, 13...Morpheme dictionary, 14...Sequence of morphemes, 15...Context-free grammar rules, 16...Determined rules Discrimination device, 17... Determined rule application device, 18...
・Shortened morpheme sequence, 19... Syntactic analysis device, 20
...Syntax tree. Name of agent: Patent attorney Toshio Nakao and 1 other person No. 1
Figure 2

Claims (2)

【特許請求の範囲】[Claims] (1)入力文を形態素解析する手段と、形態素解析され
た文に適用可能な構文規則がただ1つに決定できる箇所
について構文規則を適用する手段とを備え、構文規則を
適用後の形態素列を構文解析手段の入力とすることを特
徴とする構文解析装置。
(1) A morpheme sequence after applying the syntactic rules, comprising a means for morphologically analyzing an input sentence and a means for applying the syntactic rules to a location where only one syntactic rule applicable to the morphologically analyzed sentence can be determined. A syntactic analysis device, characterized in that: is used as an input to a syntactic analysis means.
(2)構文規則として文脈自由文法または拡張文脈自由
文法の形式を用いることを特徴とする特許請求の範囲第
1項記載の構文解析装置。
(2) The syntactic analysis device according to claim 1, characterized in that a format of a context-free grammar or an extended context-free grammar is used as a syntax rule.
JP61286348A 1986-12-01 1986-12-01 Parser Expired - Fee Related JPH0827797B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP61286348A JPH0827797B2 (en) 1986-12-01 1986-12-01 Parser

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP61286348A JPH0827797B2 (en) 1986-12-01 1986-12-01 Parser

Publications (2)

Publication Number Publication Date
JPS63138465A true JPS63138465A (en) 1988-06-10
JPH0827797B2 JPH0827797B2 (en) 1996-03-21

Family

ID=17703222

Family Applications (1)

Application Number Title Priority Date Filing Date
JP61286348A Expired - Fee Related JPH0827797B2 (en) 1986-12-01 1986-12-01 Parser

Country Status (1)

Country Link
JP (1) JPH0827797B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8838440B2 (en) 2010-09-14 2014-09-16 International Business Machines Corporation Generating parser combination by combining language processing parsers

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59140582A (en) * 1983-01-31 1984-08-11 Hitachi Ltd Natural language translation assisting system
JPS59183469A (en) * 1983-03-31 1984-10-18 Fujitsu Ltd Machine interpreter
JPS61105671A (en) * 1984-10-29 1986-05-23 Hitachi Ltd Natural language processing device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59140582A (en) * 1983-01-31 1984-08-11 Hitachi Ltd Natural language translation assisting system
JPS59183469A (en) * 1983-03-31 1984-10-18 Fujitsu Ltd Machine interpreter
JPS61105671A (en) * 1984-10-29 1986-05-23 Hitachi Ltd Natural language processing device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8838440B2 (en) 2010-09-14 2014-09-16 International Business Machines Corporation Generating parser combination by combining language processing parsers

Also Published As

Publication number Publication date
JPH0827797B2 (en) 1996-03-21

Similar Documents

Publication Publication Date Title
US5890103A (en) Method and apparatus for improved tokenization of natural language text
JPH0242572A (en) Preparation/maintenance method for co-occurrence relation dictionary
JPS638864A (en) Translating device
US6535886B1 (en) Method to compress linguistic structures
US7302384B2 (en) Left-corner chart parsing
JPS63138465A (en) Analyzing device for syntax structure
JP2807236B2 (en) Morphological analysis method
Carter Lattice-based word identification in CLARE
JPS62139076A (en) Language analysis system
JPS63221475A (en) Analyzing method for syntax
JPH0320866A (en) Text base retrieval system
JPS6368972A (en) Unregistered word processing system
JP2954592B2 (en) Language analyzer
JP2719453B2 (en) Machine translation equipment
JP3197110B2 (en) Natural language analyzer and machine translator
JPH0785057A (en) Syntax analyzing method and device therefor
JPS6389975A (en) Language analyzer
JPH04296969A (en) Mechanical translation device
JPS62263568A (en) Word processor
JPS63143669A (en) Method for analyzing syntactical structure
JPS63255773A (en) Mechanical translation processing system
JPH01145772A (en) Word retrieving device
JPH01213750A (en) Salvage method in syntax analysis for mechanical translation
JPH0528178A (en) Japanese word input device
JPH02110771A (en) Electronic translation device

Legal Events

Date Code Title Description
LAPS Cancellation because of no payment of annual fees