CN111143691A - Joint information extraction method and device - Google Patents
Joint information extraction method and device Download PDFInfo
- Publication number
- CN111143691A CN111143691A CN201911416984.0A CN201911416984A CN111143691A CN 111143691 A CN111143691 A CN 111143691A CN 201911416984 A CN201911416984 A CN 201911416984A CN 111143691 A CN111143691 A CN 111143691A
- Authority
- CN
- China
- Prior art keywords
- data
- semantic
- type
- target
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000605 extraction Methods 0.000 title claims abstract description 44
- 108091026890 Coding region Proteins 0.000 claims abstract description 49
- 230000003993 interaction Effects 0.000 claims abstract description 13
- 230000004927 fusion Effects 0.000 claims abstract description 8
- 230000011218 segmentation Effects 0.000 claims description 11
- 238000013527 convolutional neural network Methods 0.000 claims description 9
- 230000015654 memory Effects 0.000 claims description 6
- 230000007787 long-term memory Effects 0.000 claims description 3
- 238000007418 data mining Methods 0.000 abstract description 2
- 238000000034 method Methods 0.000 description 23
- 230000006870 function Effects 0.000 description 9
- 238000005457 optimization Methods 0.000 description 7
- 239000000284 extract Substances 0.000 description 6
- 230000007547 defect Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000002787 reinforcement Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Machine Translation (AREA)
Abstract
The invention belongs to the field of data mining and information extraction, and discloses a joint information extraction method and device, which solve the problems that the traditional information extraction technology depends on the prior knowledge of experts and the performance of a characteristic extraction tool, and the semantic relation judgment between targets is influenced by the extraction error of an information target. The device includes: the code initialization module is used for carrying out semantic code initialization on the data, the predefined relationship type and the target type; the coding interaction module is used for mutual coding interaction between the data semantic coding sequence and the relationship type semantic coding sequence as well as between the target type semantic coding sequences; the coding fusion module is used for fusing the data semantic coding sequence and the relation type coding sequence and fusing the data semantic coding sequence and the target type coding sequence; the relation prediction module is used for predicting the relation type contained in the obtained data semantic code; and the target prediction module is used for predicting a first target and a second target corresponding to the relation type contained in the data.
Description
Technical Field
The invention belongs to the field of data mining and information extraction, and particularly relates to a joint information extraction method and device.
Background
With the rapid development of the internet and the accumulation of a large amount of information, people have more and more urgent needs on how to quickly and automatically extract the implicit knowledge in the information, especially the text structure information extraction function in the field of natural language processing. The information extraction function can enable people to quickly know the hidden local information structure venation in the data source, can integrate and link the information extracted from a plurality of data sources, and can comprehensively know the whole information structure, so that input data sources or auxiliary information can be provided for other tasks such as public opinion early warning, an intelligent chat system, knowledge map construction, knowledge reasoning and the like.
The purpose of information extraction is to discover objects in a data source that need attention while identifying which objects have semantic relationships between them. Currently, the mainstream methods for information extraction include three types, namely template-based methods, statistical-based methods, and representation-based methods.
The template-based method usually makes corresponding rules and templates by experts according to the characteristics of data sources, and then extracts information by adopting a pattern matching method. The method has the advantages of high accuracy and low recall rate, can only extract and identify information structures in the range of the prepared template, and is also seriously dependent on the prior knowledge of experts, time-consuming and labor-consuming.
Statistical-based methods generally extract feature information, such as part of speech, dependency analysis, n-gram features, etc., in the field of text processing according to the characteristics of the data source, and then extract information structures using statistical learning methods. The method has the advantages that the characteristics can be automatically extracted, and then the information structure can be automatically extracted by learning through a statistical learning method. The method avoids the defect that an expert is needed to make rules and templates in the template-based method, but the extraction effect is seriously dependent on the tool performance of feature extraction.
The representation-based method generally directly performs initialization coding on a data division unit of a data source, and then performs further semantic coding on the data source by adopting neural network methods such as a convolutional neural network and a long-short time memory network. Aiming at the semantic code obtained by learning, firstly extracting information targets by adopting a pipeline mode, and then judging the semantic relation between the targets. The method not only avoids the limitation of an expert to formulate a rule template, but also avoids the defects of feature selection and dependence on a feature extraction tool. Because the method adopts a pipeline extraction mode, the relation between the information target extraction and the semantic relationship judgment between the targets is ignored, and meanwhile, the error of the information target extraction is propagated backwards to lead the information target to introduce more error information, thereby influencing the semantic relationship judgment between the targets.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the method and the device for extracting the joint information solve the problems that the prior knowledge of experts and the performance of a characteristic extraction tool are depended on in the traditional information extraction technology, and the semantic relation judgment between the targets is influenced by the extraction error of the information targets.
The technical scheme adopted by the invention for solving the technical problems is as follows:
in one aspect, the present invention provides a method for extracting joint information, including the following steps:
A. initializing semantic code sequences of data, predefined relationship types and target types;
B. performing mutual interaction according to the data and the initialized semantic code sequence of the predefined relationship to obtain a data semantic code sequence with relationship type information and a relationship type semantic code sequence with data information;
C. forward propagation of information is carried out by using a data semantic coding sequence with relationship type information, and relationship types contained in the data are obtained through prediction;
D. obtaining a data semantic coding sequence after the relationship type is strengthened according to the relationship type obtained by prediction, the relationship type semantic coding sequence with the data information and the data semantic coding sequence with the relationship type information;
E. initializing a semantic code sequence according to the data semantic code sequence with the strengthened relation type and a predefined target type, and performing mutual interaction to obtain a data semantic code sequence with target type information and a target type semantic code sequence with data information;
F. forward propagation is carried out by using a data semantic coding sequence with target type information, and a first target corresponding to a relation type contained in data is obtained through prediction;
G. and predicting to obtain a second target corresponding to the data implication relation type and the first target according to the predicted first target, the target type semantic code sequence with the data information and the data semantic code sequence with the target type information.
As a further optimization, step a specifically includes:
a1, performing minimum unit segmentation and minimum unit semantic combination-based segmentation on the data, so that the data are converted into sequence-based data;
a2, respectively carrying out semantic code initialization on the two types of segmented data obtained in the step A1, then combining the segmented sequence data based on the minimum unit, and fusing the two types of initialized semantic code sequences to obtain a fused data semantic code sequence;
and step A3, respectively carrying out semantic code initialization on the predefined relationship type and the target type to obtain a predefined relationship type initialization semantic code sequence and a predefined target type initialization semantic code sequence.
As a further optimization, step B specifically includes:
b1, according to the fused data semantic code sequence obtained in the step A2 and the predefined relationship type initialization code obtained in the step A3, enabling each sequence unit of the data to be respectively aligned with the predefined relationship type initialization code sequence to obtain an alignment weight, and then fusing the predefined relationship type semantic codes added by the weights to obtain a data semantic code sequence with relationship type information;
b2, according to the fused data semantic code sequence obtained in the step A2 and the predefined relationship type initialization codes obtained in the step A3, enabling each predefined relationship type initialization code to be respectively aligned with the data semantic code sequence to obtain an alignment weight, and then fusing the data semantic codes added by the weights to obtain a relationship type semantic code sequence with data information.
As a further optimization, in step C, the forward propagation of information is performed by using a data semantic coding sequence with relationship type information, and the relationship type contained in the data is predicted, specifically: and B1, using the data semantic code sequence with the relation type information obtained in the step B1 as input, carrying out forward propagation of the information in coding networks such as a convolutional neural network or a long-term memory network and the like to obtain semantic codes of the data, and then predicting the relation types in the data according to the obtained semantic codes.
As a further optimization, step D specifically includes:
d1, inquiring the relation type semantic code sequence with data information according to the predicted relation type to obtain the corresponding relation type semantic code;
d2, carrying out coding fusion on each sequence unit of the relation type semantic code obtained in the step D1 and the data semantic code sequence with the relation type information to obtain the data semantic code sequence with the strengthened relation type.
As a further optimization, step E specifically includes:
e1, aligning the semantic code of each enhanced data sequence unit with the predefined target type initialization semantic code sequence to obtain an alignment weight according to the relationship type enhanced data semantic code sequence obtained in the step D2 and the predefined target type initialization semantic code sequence obtained in the step A3, and then fusing the predefined target type semantic codes added by the weights to obtain a data semantic code sequence with target type information;
e2, according to the relationship type reinforced data semantic code sequence obtained in the step D2 and the predefined target type initialized semantic code sequence obtained in the step A3, enabling each predefined target type initialized code to be respectively aligned with the reinforced data semantic code sequence to obtain an alignment weight, and then fusing the reinforced data semantic codes added by the weights to obtain the target type semantic code sequence with data information.
As a further optimization, in step F, the forward propagation is performed by using the data semantic code sequence with the target type information, and the first target corresponding to the relationship type contained in the data is obtained by prediction, specifically: and E1, taking the data semantic code sequence with the target type information obtained in the step E1 as input, carrying out forward propagation on the information in coding networks such as a convolutional neural network or a long-time memory network and the like to obtain the semantic code sequence of the data, predicting the target type of each unit of the sequence according to the obtained semantic code sequence, and obtaining a first target corresponding to the relation type contained in the data.
As a further optimization, step G specifically includes:
g1, inquiring a target type semantic code sequence with data information according to the predicted first target to obtain a semantic code vector corresponding to the first target;
g2, fusing the first target semantic coding vector obtained in the step G1 with each data semantic coding sequence unit with the target type information obtained in the step E2 to obtain a data semantic coding sequence with enhanced target type;
g3, using the data semantic code sequence with the strengthened target type obtained in the step G2 as input, and carrying out forward propagation on information in coding networks such as a convolutional neural network or a long-time memory network and the like to obtain the data semantic code sequence; and predicting the target type of each unit of the sequence according to the obtained semantic coding sequence to obtain a relation type contained in the data and a second target corresponding to the first target.
In another aspect, the present invention further provides a joint information extraction device, which includes:
the code initialization module is used for carrying out semantic code initialization on the data, the predefined relationship type and the target type;
the coding interaction module is used for mutual coding interaction between the data semantic coding sequence and the relationship type semantic coding sequence as well as between the target type semantic coding sequences;
the coding fusion module is used for fusing the data semantic coding sequence and the relation type coding sequence and fusing the data semantic coding sequence and the target type coding sequence;
the relation prediction module is used for predicting the relation type contained in the obtained data semantic code;
and the target prediction module is used for predicting a first target and a second target corresponding to the relation type contained in the data.
The invention has the beneficial effects that:
the method can automatically extract the information structure contained in the data source without depending on expert prior knowledge and a feature extraction tool, and can avoid the problem of wrong information transmission caused by a representation-based pipeline extraction mode.
Drawings
FIG. 1 is a block diagram of a joint information extraction device according to the present invention;
fig. 2 is a flowchart of a joint information extraction method in an embodiment.
Detailed Description
The invention aims to provide a joint information extraction method and a joint information extraction device, and solves the problems that the prior knowledge of experts and the performance of a characteristic extraction tool are depended on in the traditional information extraction technology, and the semantic relation judgment between targets can be influenced by the extraction error of an information target. As shown in fig. 1, the joint information extraction device of the present invention includes: the system comprises a coding initialization module, a coding interaction module, a coding fusion module, a relation prediction module and a target prediction module; wherein,
the code initialization module is used for carrying out semantic code initialization on the data, the predefined relationship type and the target type;
the coding interaction module is used for mutual coding interaction between the data semantic coding sequence and the relationship type semantic coding sequence as well as between the target type semantic coding sequences;
the coding fusion module is used for fusing the data semantic coding sequence and the relation type coding sequence and fusing the data semantic coding sequence and the target type coding sequence;
the relation prediction module is used for predicting the relation type contained in the obtained data semantic code;
and the target prediction module is used for predicting a first target and a second target corresponding to the relation type contained in the data.
Example (b):
the embodiment is an embodiment of an information extraction method implemented based on the joint information extraction device, and the method includes the following steps:
step 1, performing minimum unit segmentation and minimum unit semantic combination based segmentation on data, so that the data is converted into sequence-based data. Word-based segmentation and word-based segmentation are employed for the text data source. For example, for the original text data, "wife of yaoming is leaf Li", the result after minimal segmentation based on words is: tc ═ yao, mingming, wife, son, yes, leaf, li ], based on one segmentation of the word, the results were: tw is the name "Yaoming", wife, Yeli ".
Step 2, respectively carrying out semantic coding initialization on the data, the predefined relationship type and the target type after the two types of segmentation in the step 1, wherein the result after the initialization of the Tw semantic coding in the step 1 is as follows: vw=[v0,v1,v2,v2,v3]Whereind is the dimension of the initialization vector, VwRepresenting word-based semantically initialized coding sequences, equivalently using VcRepresenting a word-based semantic initialization code sequence, VrInitialising a semantic code sequence, V, representing a predefined relationship typeeThe semantic code sequence is initialized to represent a predefined target type.
Step 3, comparing V in step 2wAnd VcAnd fusing to obtain a fused data semantic coding sequence V. Wherein V ═ fm(fw(Vw),fc(Vc) Wherein f) ismThe fusion function can be represented as a maximization function, a mean function, a weighted summation function and the like; fw and fc are dimension normalization functions, which can be functions of a convolutional neural network, a long-time and short-time memory network and the like.
Step 4, according to the fused data semantic code sequence V obtained in the step 3 and the predefined relation type initialization code sequence V obtained in the step 2rAnd respectively carrying out coding alignment on each sequence unit of the data and the initialized coding sequence of the predefined relationship type to obtain alignment weight, and then fusing the semantic codes of the predefined relationship type added by the weight to obtain a data semantic coding sequence V with relationship type information.
Step 5, according to the fused data semantic code sequence V obtained in the step 3 and the predefined relation type initialization code sequence V obtained in the step 2rRespectively aligning each predefined relation type initialization code with a data semantic code sequence to obtain an alignment weight, and then fusing the data semantic codes added by the weights to obtain a relation type semantic code sequence V with data informationr。
And 6, carrying out forward propagation on the information by using a data semantic coding sequence with the relationship type information, and predicting to obtain the relationship type contained in the data. Aiming at V obtained in the step 4, a coding function f is adoptedrCarrying out information forward propagation on the V, further coding the V in the propagation process to obtain the V, and further obtaining the probability p of the semantic relationr=fr(V). The relation r ═ argmax (p) contained in the datar)。
7, according to the relationship type r obtained by prediction and the V in the step 5 of the semantic code sequence of the relationship type with the data informationrAnd V in the step 6 of the data semantic code sequence with the relationship type information, and obtaining the data semantic code sequence V after the relationship type is strengthened. Specific vr=f(VrR) denotes the semantic code vector to which the relation r corresponds, then vi=f[V,i]The semantic code after relationship reinforcement is: v. ofi=[vi,vr],viRepresenting the semantic code of the ith unit in the data.
Step 8, according to the reinforced data semantic code sequence V obtained in step 7 and the predefined target type initialization semantic code sequence V obtained in step 2eAligning the semantic code of each strengthened data sequence unit with the predefined target type initialization semantic code sequence to obtain an alignment weight, and then fusing the weighted predefined target type semanticsAnd coding to obtain a data semantic coding sequence V with target type information.
Step 9, according to the reinforced data semantic code sequence V obtained in step 7 and the predefined target type initialization semantic code sequence V obtained in step 2eRespectively aligning each predefined target type initialization code with the enhanced data semantic code sequence to obtain an alignment weight, and then fusing the enhanced data semantic codes added by the weights to obtain a target type semantic code sequence V with data informatione;
Step 10, use V in step 9eAnd carrying out forward propagation, and predicting to obtain a first target corresponding to the relation type contained in the data. The method specifically comprises the following steps: using VeAs input, the information is carried into coding networks such as a convolutional neural network or a long-term memory network and the like to carry out forward propagation of the information, and a semantic coding sequence of the data is obtained; predicting the target type of each unit of the sequence according to the obtained semantic coding sequence to obtain a first target e corresponding to the relation type of data inclusion1。
Step 11, predicting the first target e according to the step 1019, target type semantic code sequence V with data informationeAnd 8, predicting a data semantic coding sequence V with target type information to obtain types r and e of relation with data implication1Corresponding second target e2。
Claims (9)
1. A joint information extraction method is characterized by comprising the following steps:
A. initializing semantic code sequences of data, predefined relationship types and target types;
B. performing mutual interaction according to the data and the initialized semantic code sequence of the predefined relationship to obtain a data semantic code sequence with relationship type information and a relationship type semantic code sequence with data information;
C. forward propagation of information is carried out by using a data semantic coding sequence with relationship type information, and relationship types contained in the data are obtained through prediction;
D. obtaining a data semantic coding sequence after the relationship type is strengthened according to the relationship type obtained by prediction, the relationship type semantic coding sequence with the data information and the data semantic coding sequence with the relationship type information;
E. initializing a semantic code sequence according to the data semantic code sequence with the strengthened relation type and a predefined target type, and performing mutual interaction to obtain a data semantic code sequence with target type information and a target type semantic code sequence with data information;
F. forward propagation is carried out by using a data semantic coding sequence with target type information, and a first target corresponding to a relation type contained in data is obtained through prediction;
G. and predicting to obtain a second target corresponding to the data implication relation type and the first target according to the predicted first target, the target type semantic code sequence with the data information and the data semantic code sequence with the target type information.
2. The joint information extraction method according to claim 1,
the step A specifically comprises the following steps:
a1, performing minimum unit segmentation and minimum unit semantic combination-based segmentation on the data, so that the data are converted into sequence-based data;
a2, respectively carrying out semantic code initialization on the two types of segmented data obtained in the step A1, then combining the segmented sequence data based on the minimum unit, and fusing the two types of initialized semantic code sequences to obtain a fused data semantic code sequence;
and step A3, respectively carrying out semantic code initialization on the predefined relationship type and the target type to obtain a predefined relationship type initialization semantic code sequence and a predefined target type initialization semantic code sequence.
3. The joint information extraction method according to claim 2,
the step B specifically comprises the following steps:
b1, according to the fused data semantic code sequence obtained in the step A2 and the predefined relationship type initialization code obtained in the step A3, enabling each sequence unit of the data to be respectively aligned with the predefined relationship type initialization code sequence to obtain an alignment weight, and then fusing the predefined relationship type semantic codes added by the weights to obtain a data semantic code sequence with relationship type information;
b2, according to the fused data semantic code sequence obtained in the step A2 and the predefined relationship type initialization codes obtained in the step A3, enabling each predefined relationship type initialization code to be respectively aligned with the data semantic code sequence to obtain an alignment weight, and then fusing the data semantic codes added by the weights to obtain a relationship type semantic code sequence with data information.
4. The joint information extraction method according to claim 3,
in step C, the forward propagation of the information is performed by using the data semantic code sequence with the relationship type information, and the relationship type contained in the data is predicted, specifically: and B1, using the data semantic code sequence with the relation type information obtained in the step B1 as input, carrying out forward propagation of the information in coding networks such as a convolutional neural network or a long-term memory network and the like to obtain semantic codes of the data, and then predicting the relation types in the data according to the obtained semantic codes.
5. The joint information extraction method according to claim 4,
the step D specifically comprises the following steps:
d1, inquiring the relation type semantic code sequence with data information according to the predicted relation type to obtain the corresponding relation type semantic code;
d2, carrying out coding fusion on each sequence unit of the relation type semantic code obtained in the step D1 and the data semantic code sequence with the relation type information to obtain the data semantic code sequence with the strengthened relation type.
6. The joint information extraction method according to claim 5,
the step E specifically comprises the following steps:
e1, aligning the semantic code of each enhanced data sequence unit with the predefined target type initialization semantic code sequence to obtain an alignment weight according to the relationship type enhanced data semantic code sequence obtained in the step D2 and the predefined target type initialization semantic code sequence obtained in the step A3, and then fusing the predefined target type semantic codes added by the weights to obtain a data semantic code sequence with target type information;
e2, according to the relationship type reinforced data semantic code sequence obtained in the step D2 and the predefined target type initialized semantic code sequence obtained in the step A3, enabling each predefined target type initialized code to be respectively aligned with the reinforced data semantic code sequence to obtain an alignment weight, and then fusing the reinforced data semantic codes added by the weights to obtain the target type semantic code sequence with data information.
7. The joint information extraction method according to claim 6,
in step F, the forward propagation is performed by using the data semantic code sequence with the target type information, and the first target corresponding to the relationship type contained in the data is obtained through prediction, specifically: and E1, taking the data semantic code sequence with the target type information obtained in the step E1 as input, carrying out forward propagation on the information in coding networks such as a convolutional neural network or a long-time memory network and the like to obtain the semantic code sequence of the data, predicting the target type of each unit of the sequence according to the obtained semantic code sequence, and obtaining a first target corresponding to the relation type contained in the data.
8. The joint information extraction method according to claim 7,
the step G specifically comprises the following steps:
g1, inquiring a target type semantic code sequence with data information according to the predicted first target to obtain a semantic code vector corresponding to the first target;
g2, fusing the first target semantic coding vector obtained in the step G1 with each data semantic coding sequence unit with the target type information obtained in the step E2 to obtain a data semantic coding sequence with enhanced target type;
g3, using the data semantic code sequence with the strengthened target type obtained in the step G2 as input, and carrying out forward propagation on information in coding networks such as a convolutional neural network or a long-time memory network and the like to obtain the data semantic code sequence; and predicting the target type of each unit of the sequence according to the obtained semantic coding sequence to obtain a relation type contained in the data and a second target corresponding to the first target.
9. A joint information extraction device, comprising:
the code initialization module is used for carrying out semantic code initialization on the data, the predefined relationship type and the target type;
the coding interaction module is used for mutual coding interaction between the data semantic coding sequence and the relationship type semantic coding sequence as well as between the target type semantic coding sequences;
the coding fusion module is used for fusing the data semantic coding sequence and the relation type coding sequence and fusing the data semantic coding sequence and the target type coding sequence;
the relation prediction module is used for predicting the relation type contained in the obtained data semantic code;
and the target prediction module is used for predicting a first target and a second target corresponding to the relation type contained in the data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911416984.0A CN111143691B (en) | 2019-12-31 | 2019-12-31 | Joint information extraction method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911416984.0A CN111143691B (en) | 2019-12-31 | 2019-12-31 | Joint information extraction method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111143691A true CN111143691A (en) | 2020-05-12 |
CN111143691B CN111143691B (en) | 2023-04-18 |
Family
ID=70522894
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911416984.0A Active CN111143691B (en) | 2019-12-31 | 2019-12-31 | Joint information extraction method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111143691B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112115250A (en) * | 2020-09-28 | 2020-12-22 | 四川长虹电器股份有限公司 | Automatic question generation method and system |
Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107783960A (en) * | 2017-10-23 | 2018-03-09 | 百度在线网络技术(北京)有限公司 | Method, apparatus and equipment for Extracting Information |
CN107958044A (en) * | 2017-11-24 | 2018-04-24 | 清华大学 | Higher-dimension sequence data Forecasting Methodology and system based on depth space-time memory network |
CN108304911A (en) * | 2018-01-09 | 2018-07-20 | 中国科学院自动化研究所 | Knowledge Extraction Method and system based on Memory Neural Networks and equipment |
CN108416058A (en) * | 2018-03-22 | 2018-08-17 | 北京理工大学 | A kind of Relation extraction method based on the enhancing of Bi-LSTM input informations |
CN108536754A (en) * | 2018-03-14 | 2018-09-14 | 四川大学 | Electronic health record entity relation extraction method based on BLSTM and attention mechanism |
CN108776773A (en) * | 2018-05-04 | 2018-11-09 | 华南理工大学 | A kind of three-dimensional gesture recognition method and interactive system based on depth image |
CN109408812A (en) * | 2018-09-30 | 2019-03-01 | 北京工业大学 | A method of the sequence labelling joint based on attention mechanism extracts entity relationship |
US20190156253A1 (en) * | 2017-11-22 | 2019-05-23 | United Parcel Service Of America, Inc. | Automatically generating volume forecasts for different hierarchical levels via machine learning models |
CN109902145A (en) * | 2019-01-18 | 2019-06-18 | 中国科学院信息工程研究所 | A kind of entity relationship joint abstracting method and system based on attention mechanism |
CN109924977A (en) * | 2019-03-21 | 2019-06-25 | 西安交通大学 | A kind of surface electromyogram signal classification method based on CNN and LSTM |
CN109933801A (en) * | 2019-03-25 | 2019-06-25 | 北京理工大学 | Two-way LSTM based on predicted position attention names entity recognition method |
CN110188348A (en) * | 2019-05-07 | 2019-08-30 | 南京邮电大学 | A kind of Chinese language processing model and method based on deep neural network |
CN110263323A (en) * | 2019-05-08 | 2019-09-20 | 清华大学 | Keyword abstraction method and system based on the long Memory Neural Networks in short-term of fence type |
CN110400600A (en) * | 2019-08-01 | 2019-11-01 | 枣庄学院 | A kind of disease associated prediction technique of miRNA- based on rotation forest algorithm |
CN110413743A (en) * | 2019-08-09 | 2019-11-05 | 安徽科大讯飞医疗信息技术有限公司 | A kind of key message abstracting method, device, equipment and storage medium |
-
2019
- 2019-12-31 CN CN201911416984.0A patent/CN111143691B/en active Active
Patent Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107783960A (en) * | 2017-10-23 | 2018-03-09 | 百度在线网络技术(北京)有限公司 | Method, apparatus and equipment for Extracting Information |
US20190156253A1 (en) * | 2017-11-22 | 2019-05-23 | United Parcel Service Of America, Inc. | Automatically generating volume forecasts for different hierarchical levels via machine learning models |
CN107958044A (en) * | 2017-11-24 | 2018-04-24 | 清华大学 | Higher-dimension sequence data Forecasting Methodology and system based on depth space-time memory network |
CN108304911A (en) * | 2018-01-09 | 2018-07-20 | 中国科学院自动化研究所 | Knowledge Extraction Method and system based on Memory Neural Networks and equipment |
CN108536754A (en) * | 2018-03-14 | 2018-09-14 | 四川大学 | Electronic health record entity relation extraction method based on BLSTM and attention mechanism |
CN108416058A (en) * | 2018-03-22 | 2018-08-17 | 北京理工大学 | A kind of Relation extraction method based on the enhancing of Bi-LSTM input informations |
CN108776773A (en) * | 2018-05-04 | 2018-11-09 | 华南理工大学 | A kind of three-dimensional gesture recognition method and interactive system based on depth image |
CN109408812A (en) * | 2018-09-30 | 2019-03-01 | 北京工业大学 | A method of the sequence labelling joint based on attention mechanism extracts entity relationship |
CN109902145A (en) * | 2019-01-18 | 2019-06-18 | 中国科学院信息工程研究所 | A kind of entity relationship joint abstracting method and system based on attention mechanism |
CN109924977A (en) * | 2019-03-21 | 2019-06-25 | 西安交通大学 | A kind of surface electromyogram signal classification method based on CNN and LSTM |
CN109933801A (en) * | 2019-03-25 | 2019-06-25 | 北京理工大学 | Two-way LSTM based on predicted position attention names entity recognition method |
CN110188348A (en) * | 2019-05-07 | 2019-08-30 | 南京邮电大学 | A kind of Chinese language processing model and method based on deep neural network |
CN110263323A (en) * | 2019-05-08 | 2019-09-20 | 清华大学 | Keyword abstraction method and system based on the long Memory Neural Networks in short-term of fence type |
CN110400600A (en) * | 2019-08-01 | 2019-11-01 | 枣庄学院 | A kind of disease associated prediction technique of miRNA- based on rotation forest algorithm |
CN110413743A (en) * | 2019-08-09 | 2019-11-05 | 安徽科大讯飞医疗信息技术有限公司 | A kind of key message abstracting method, device, equipment and storage medium |
Non-Patent Citations (1)
Title |
---|
刘丹丹等: "词汇语义信息对中文实体关系抽取影响的比较", 《计算机应用》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112115250A (en) * | 2020-09-28 | 2020-12-22 | 四川长虹电器股份有限公司 | Automatic question generation method and system |
Also Published As
Publication number | Publication date |
---|---|
CN111143691B (en) | 2023-04-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108829722B (en) | Remote supervision Dual-Attention relation classification method and system | |
CN109918640B (en) | Chinese text proofreading method based on knowledge graph | |
CN106503231B (en) | Search method and device based on artificial intelligence | |
CN109522420B (en) | Method and system for acquiring learning demand | |
CN112035841B (en) | Intelligent contract vulnerability detection method based on expert rules and serialization modeling | |
CN111723569A (en) | Event extraction method and device and computer readable storage medium | |
CN109857846B (en) | Method and device for matching user question and knowledge point | |
CN113312922B (en) | Improved chapter-level triple information extraction method | |
CN113254507B (en) | Intelligent construction and inventory method for data asset directory | |
CN113836925A (en) | Training method and device for pre-training language model, electronic equipment and storage medium | |
CN111967267B (en) | XLNET-based news text region extraction method and system | |
CN114332519A (en) | Image description generation method based on external triple and abstract relation | |
CN117093686A (en) | Intelligent question-answer matching method, device, terminal and storage medium | |
CN111143691B (en) | Joint information extraction method and device | |
CN110263345B (en) | Keyword extraction method, keyword extraction device and storage medium | |
CN114048314B (en) | Natural language steganalysis method | |
CN115062123A (en) | Knowledge base question-answer pair generation method of conversation generation system | |
CN112215629A (en) | Multi-target advertisement generation system and method based on construction countermeasure sample | |
CN112380882B (en) | Mongolian Chinese neural machine translation method with error correction function | |
CN111382333B (en) | Case element extraction method in news text sentence based on case correlation joint learning and graph convolution | |
CN117371534B (en) | Knowledge graph construction method and system based on BERT | |
CN113283243A (en) | Entity and relation combined extraction method | |
CN113076744A (en) | Cultural relic knowledge relation extraction method based on convolutional neural network | |
CN116992880A (en) | Building name identification method, device, electronic equipment and storage medium | |
CN116467437A (en) | Automatic flow modeling method for complex scene description |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |