[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN110209839A - Agricultural knowledge map construction device, method and computer readable storage medium - Google Patents

Agricultural knowledge map construction device, method and computer readable storage medium Download PDF

Info

Publication number
CN110209839A
CN110209839A CN201910528268.5A CN201910528268A CN110209839A CN 110209839 A CN110209839 A CN 110209839A CN 201910528268 A CN201910528268 A CN 201910528268A CN 110209839 A CN110209839 A CN 110209839A
Authority
CN
China
Prior art keywords
agriculture
corpus
entity
knowledge map
agricultural knowledge
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910528268.5A
Other languages
Chinese (zh)
Other versions
CN110209839B (en
Inventor
吴良顺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuo Erzhi Lian Wuhan Research Institute Co Ltd
Original Assignee
Zhuo Erzhi Lian Wuhan Research Institute Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhuo Erzhi Lian Wuhan Research Institute Co Ltd filed Critical Zhuo Erzhi Lian Wuhan Research Institute Co Ltd
Priority to CN201910528268.5A priority Critical patent/CN110209839B/en
Publication of CN110209839A publication Critical patent/CN110209839A/en
Application granted granted Critical
Publication of CN110209839B publication Critical patent/CN110209839B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/253Grammatical analysis; Style critique
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A kind of agricultural knowledge map construction method, comprising: obtain the agriculture field data of a target area, and the agriculture field data based on acquisition construct corpus;It carries out participle to the corpus in the corpus to handle with part-of-speech tagging, to identify multiple entitative concepts;The entitative concept identified is screened according to default screening rule to obtain multiple agriculture entities;Parsing is carried out to the corpus in the corpus to extract with relationship, obtains the incidence relation between multiple agriculture entities;And according to the incidence relation between each agriculture entity and each agriculture entity, establish agricultural knowledge map.The present invention also provides a kind of agricultural knowledge map construction device and computer readable storage mediums.Above-mentioned agricultural knowledge map construction device, method and computer readable storage medium are, it can be achieved that construct agricultural knowledge map, the convenience that promotion agriculture field data management efficiency and data use for agriculture field.

Description

Agricultural knowledge map construction device, method and computer readable storage medium
Technical field
The present invention relates to technical field of data processing more particularly to a kind of agricultural knowledge map construction devices, method and meter Calculation machine readable storage medium storing program for executing.
Background technique
Knowledge mapping has powerful data descriptive power, provides technical foundation for intelligent information application, passes through Implementation of inference conceptual retrieval, while structural knowledge can be presented to user in a manner of patterned.Knowledge mapping is in multiple necks Domain has application, such as medical treatment, finance, education, investment etc. to have industry existing for relationship.But not yet there is mature agriculture at present Industry knowledge mapping building mode.
Summary of the invention
In view of this, it is necessary to provide a kind of agricultural knowledge map construction device, method and computer readable storage medium, It, which can be realized, constructs agricultural knowledge map for agriculture field, and it is convenient that promotion agriculture field data management efficiency and data use Property.
An embodiment of the present invention provides a kind of agricultural knowledge map construction method, which comprises obtains a target The agriculture field data in region, and the agriculture field data based on acquisition construct corpus;To the corpus in the corpus into Row participle is handled with part-of-speech tagging, to identify multiple entitative concepts;According to default screening rule to the entitative concept identified It is screened to obtain multiple agriculture entities;Parsing is carried out to the corpus in the corpus to extract with relationship, is obtained multiple described Incidence relation between agriculture entity;It is closed according to the association between each agriculture entity and each agriculture entity System, establishes agricultural knowledge map;It swashes from network the encyclopaedia information obtained to each agriculture entity;It is advised based on default extraction Then the encyclopaedia content of pages crawled is extracted, and be added to the corpus and/or institute for obtained content is extracted State agricultural knowledge map;Similarity analysis is carried out to the encyclopaedia content of pages crawled, obtains each agriculture entity Classification information;And the classification information of each agriculture entity is added to the agricultural knowledge map;
Wherein, the step of described pair of encyclopaedia content of pages crawled carries out similarity analysis includes: to extract each institute State multiple specific characteristics in encyclopaedia content of pages;It is calculated between any two encyclopaedia content of pages using KNN algorithm The similarity of each feature group;And the similarity for the multiple feature groups being calculated is weighted to obtain described any two The comprehensive similarity of a encyclopaedia content of pages.
Preferably, the agriculture field data include non-structured agriculture field data and semi-structured agriculture field Data.
Preferably, the corpus in the corpus carries out participle and part-of-speech tagging processing, to identify multiple realities The step of body concept includes:
Participle is carried out to the corpus in the corpus using default lexical analysis tool to handle with part-of-speech tagging;And
Entity recognition is named to the result of part-of-speech tagging, to identify multiple entitative concepts.
Preferably, the corpus in the corpus carries out parsing and relationship extraction, and it is real to obtain multiple agriculturals Incidence relation between body:
Corpus in the corpus is parsed to obtain morphological information, syntactic information and semantic information;And
Obtained morphological information, syntactic information and the semantic information that parse is input to relationship extraction mould trained in advance Type obtains the incidence relation between multiple agriculture entities.
Preferably, the incidence relation according between each agriculture entity and each agriculture entity, builds The step of vertical agricultural knowledge map includes:
Incidence relation between the name identification of each agriculture entity and each agriculture entity is directed into Preset pattern database, and carry out visualization and be converted to the agricultural knowledge map.
An embodiment of the present invention provides a kind of agricultural knowledge map construction device, the agricultural knowledge map construction device Including processor and memory, several computer programs are stored on the memory, the processor is for executing memory The step of above-mentioned agricultural knowledge map construction method is realized when the computer program of middle storage.
An embodiment of the present invention also provides a kind of computer readable storage medium, and the computer readable storage medium is deposited A plurality of instruction is contained, a plurality of described instruction can be executed by one or more processor, to realize above-mentioned agricultural knowledge map The step of construction method.
Compared with prior art, above-mentioned agricultural knowledge map construction device, method and computer readable storage medium, can be with It realizes the agricultural knowledge map in one specified region of building, promotes the convenience that agriculture field data management efficiency and data use, Peasant can be assisted to carry out crop production, enterprise procurement, for public science popularization agricultural knowledge.
Detailed description of the invention
Fig. 1 is the structural schematic diagram of the agricultural knowledge map construction device of an embodiment of the present invention.
Fig. 2 is the functional block diagram of the agricultural knowledge map construction system of an embodiment of the present invention.
Fig. 3 is the functional block diagram of the agricultural knowledge map construction system of another embodiment of the present invention.
Fig. 4 is that the feature that the agriculture entity of an embodiment of the present invention is the encyclopaedia content of pages of pea divides schematic diagram.
Fig. 5 is the reasoning schematic diagram of the agricultural knowledge question and answer of an embodiment of the present invention.
Fig. 6 is the flow chart of the agricultural knowledge map construction method of an embodiment of the present invention.
Main element symbol description
The present invention that the following detailed description will be further explained with reference to the above drawings.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, right The present invention is further elaborated.It should be appreciated that described herein, specific examples are only used to explain the present invention, not For limiting the present invention.Based on the embodiments of the present invention, those of ordinary skill in the art are not before making creative work Every other embodiment obtained is put, shall fall within the protection scope of the present invention.
Explanation is needed further exist for, herein, the terms "include", "comprise" or its any other variant are intended to contain Lid non-exclusive inclusion, so that process, method, article or device including a series of elements are not only wanted including those Element, but also including other elements that are not explicitly listed, or further include for this process, method, article or device Intrinsic element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that There is also other identical elements in process, method, article or device including the element.
Referring to Fig. 1, being the schematic diagram of agricultural knowledge map construction device preferred embodiment of the present invention.
The agricultural knowledge map construction device 100 is including memory 10, processor 20 and is stored in the memory In 10 and the computer program 30 that can be run on the processor 20, such as agricultural knowledge map construction program.The processing Device 20 realizes the step in agricultural knowledge map construction embodiment of the method when executing the computer program 30, such as shown in Fig. 6 Step S600~S608.Alternatively, the processor 20 realizes agricultural knowledge map construction when executing the computer program 30 The function of each module in system embodiment, such as the module 101~105 in Fig. 2 or the module in Fig. 3 101~107.
The computer program 30 can be divided into one or more modules, and one or more of modules are stored It is executed in the memory 10, and by the processor 20, to complete the present invention.One or more of modules can be energy The series of computation machine program instruction section of specific function is enough completed, described instruction section is for describing the computer program 30 in institute State the implementation procedure in agricultural knowledge map construction device 100.For example, the computer program 30 can be divided into Fig. 2 Acquisition module 101, processing module 102, screening module 103, parsing module 104 and establish module 105, or be divided into figure Acquisition module 101, processing module 102, screening module 103 in 3, establish module 105, categorization module 106 at parsing module 104 And adding module 107.Each module concrete function referring to each module in agricultural knowledge map construction system embodiment function.
The agricultural knowledge map construction device 100 can be computer, server etc. and calculate equipment.Those skilled in the art It is appreciated that the schematic diagram is only the example of agricultural knowledge map construction device 100, do not constitute to agricultural knowledge map structure The restriction for building device 100 may include perhaps combining certain components or different portions than illustrating more or fewer components Part, such as the agricultural knowledge map construction device 100 can also include input-output equipment, network access equipment, bus etc..
Alleged processor 20 can be central processing unit (Central Processing Unit, CPU), can also be Other general processors, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field- Programmable Gate Array, FPGA) either other programmable logic device, discrete gate or transistor logic, Discrete hardware components etc..General processor can be microprocessor or the processor 20 is also possible to any conventional processing Device etc., the processor 20 can use the various pieces of various interfaces and connection agricultural knowledge map construction device 100.
The memory 10 can be used for storing the computer program 30 and/or module, and the processor 20 passes through operation Or the computer program and/or module being stored in the memory 10 are executed, and call the number being stored in memory 10 According to realizing the various functions of the agricultural knowledge map construction device 100.The memory 10 may include high random access Memory can also include nonvolatile memory, such as hard disk, memory, plug-in type hard disk, intelligent memory card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card), at least one disk Memory device, flush memory device or other volatile solid-state parts.
Fig. 2 is the functional block diagram of agricultural knowledge map construction system preferred embodiment of the present invention.
As shown in fig.2, agricultural knowledge map construction system 40 may include obtaining module 101, processing module 102, sieve Modeling block 103, parsing module 104 and establish module 105.In one embodiment, above-mentioned module can be to be stored in described deposit In reservoir 10 and the programmable software instruction executed can be called by the processor 20.It is understood that in other implementations In mode, above-mentioned module can also be to solidify program instruction or firmware (firmware) in the processor 20.
The agriculture field data for obtaining module 101 and being used to obtain a target area, and the agriculture field based on acquisition Data construct corpus.
In one embodiment, the target area can be set according to actual use demand, than if necessary The agricultural knowledge map in a specified city is established, then the agriculture field data of the target area can be the agriculture field in the specified city If desired data establish the agricultural knowledge map in a specified county, then it is specified to can be this for the agriculture field data of the target area The agriculture field data in county.The agriculture field data can be unstructured data, semi-structured data, described unstructured Data such as can be agricultural product picture, audio, video, and it includes agricultural data that the semi-structured data, which such as can be, XML, JSON include the agriculture encyclopaedia page of agricultural data.The agriculture field data include but is not limited to vegetables data, water Fruit data, seed data, herding data, poultry data, aquatic products data, chemical fertilizer data, feed data, weather data, natural calamity Evil data etc..
In one embodiment, the mode for obtaining the acquisition agriculture field data of module 101 includes but is not limited to pass through Web crawlers technology accesses the agriculture commercial data base bought, industry research report, uses open agricultural data collection, uses Search engine etc..After the acquisition module 101 acquires the agriculture field data of target area, to the agricultural acquired FIELD Data is pre-processed to construct corpus.The pretreatment such as refers to unstructured data and semi-structured data It is handled to obtain corpus, then collects the corpus that processing obtains to construct the corpus.
The processing module 102 is used to carry out the corpus in the corpus participle and handles with part-of-speech tagging, with identification Multiple entitative concepts out.
In one embodiment, the processing module 102 can use default lexical analysis tool in the corpus Corpus carry out participle and handled with part-of-speech tagging, then Entity recognition is named to the result of part-of-speech tagging, it is multiple to identify Entitative concept.
For example, the processing module 102 is using Thulac Chinese lexical analysis kit in the corpus Corpus carries out participle and handles with part-of-speech tagging, when being identified as name entity, is marked out by default come non-when being identified as Entity is named, without mark.
The screening module 103 is used to be screened to obtain to the entitative concept identified according to default screening rule multiple Agriculture entity.
In one embodiment, the agriculture entity can be the entity for belonging to agriculture field, for example, the agricultural entity It can be vegetables entity, fruit entity, kind fructification, herding entity, poultry entity, aquatic products entity etc..Since multiple entities are general It may include non-agricultural entity in thought, the screening module 103 is also according to default screening rule to the entitative concept identified It is screened to obtain multiple agriculture entities.The default screening rule can be set according to actual use demand, for example, institute Stating default screening rule can be keyword screening, or scan obtained entity using default word combination and part-of-speech rule Concept, and then to filter out the part for not being agriculture entity.The part-of-speech rule is such as: if verb, then assert it is not agriculture Industry entity.
The parsing module 104 is used to carry out the corpus in the corpus parsing and extracts with relationship, obtains multiple institutes State the incidence relation between agriculture entity.
In one embodiment, the parsing module 104 can parse the progress sentence of the corpus in the corpus To morphological information, syntactic information and semantic information, recycle relationship trained in advance extract model to the morphological information being resolved to, Syntactic information and semantic information are analyzed, to obtain the incidence relation between multiple agriculture entities.
In one embodiment, when parsing to a corpus, the parsing module 104 can be by generating the corpus Parsing tree obtain the morphological information and syntactic information of the corpus, can be analyzed to obtain by the structure to corpus The semantic information of the corpus.
In one embodiment, it can be obtained by the way of remote supervisory for training the relationship to extract model Training sample data, for example existing knowledge can be corresponded in unstructured corpus, to generate a large amount of training sample Data are then based on training sample data and machine learning algorithm to train to obtain the relationship and extract model, the engineering Practising algorithm can be neural network algorithm, inductive algorithm.The source of those knowledge can be artificial mark, existing knowledge base or Specific sentence structure.For example, it may be considered a kind of specified, such as specified " A is B " structure for specific sentence structure For subclass relation, then A- subclass relation-B.Such as " banana is a kind of tropical fruit (tree) ", it is obtained by particular statement structure elucidation: Banana-subclass relation-tropical fruit (tree).
The module 105 of establishing is for according to the association between each agriculture entity and each agriculture entity Relationship establishes agricultural knowledge map.
In one embodiment, described to establish module after the incidence relation between each agriculture entity is established 105 can establish to obtain the agricultural according to the incidence relation between each agriculture entity and each agriculture entity Knowledge mapping.
It in one embodiment, include vegetables entity, fruit entity, herding entity, aquatic products entity with the agriculture entity And for chemical fertilizer entity, the building agricultural knowledge map can be accomplished by the following way in the module 105 of establishing: described It establishes module 105 and obtains the name identification of vegetables entity, fruit entity, herding entity, aquatic products entity and chemical fertilizer entity, and be based on The vegetables entity of acquisition, fruit entity, herding entity, aquatic products entity and chemical fertilizer entity name identification construct agricultural knowledge map Frame, then the incidence relation between each agriculture entity is filled to the agricultural knowledge map frame, it obtains the agricultural and knows Know map.
In one embodiment, it is described establish module 105 can also by the name identification of each agriculture entity and Incidence relation between each agriculture entity is directed into preset pattern database, and by the preset pattern data can The agricultural knowledge map is converted to depending on changing.For example, the preset pattern database can be Noe4j graphic data base, institute It states and establishes module 105 and lead the incidence relation between the name identification of each agriculture entity and each agriculture entity Enter to Noe4j graphic data base and visualized, the agricultural knowledge map can be generated.
Please refer to Fig. 3, compared with Fig. 2, the agricultural knowledge map construction system 40 further include categorization module 106 and Adding module 107.
The categorization module 106 is used to swash from network the encyclopaedia information obtained to each agriculture entity, and to climbing The encyclopaedia content of pages obtained carries out similarity analysis, obtains the classification information of each agriculture entity.
In one embodiment, for being identified as the entry of agriculture entity, the categorization module 106 can be in interaction hundred It is crawled to obtain the encyclopaedia information of each agriculture entity in encyclopaedia website disclosed in section/Baidupedia etc., abandons and do not deposit The page or mistake the page, then similarity analysis is carried out to the encyclopaedia content of pages that crawls, obtained each described The classification information of agriculture entity.
In one embodiment, the categorization module 106 can be extracted to obtain each encyclopaedia page according to encyclopaedia content of pages Multiple specific characteristics of face content, recycling are closed on algorithm (KNN algorithm) and are calculated between any two encyclopaedia content of pages Each feature group similarity, and the similarity for the multiple feature groups being calculated is weighted to obtain described any The comprehensive similarity of two encyclopaedia content of pages is to get to the comprehensive of two corresponding with two encyclopaedia content of pages agriculture entities Similarity is closed, realizes the classification for going out agriculture entity B by the category inferences of agriculture entity A.For example, each encyclopaedia content of pages In include first to third specific characteristic, the similarity of each feature group can be the first specified spy of two encyclopaedia content of pages The similarity between similarity, the second specific characteristic between sign, the similarity between third specific characteristic.
It is assumed that agriculture entity is pea, Fig. 4 illustrates the encyclopaedia content of pages of pea.The hundred of the pea Section's content of pages includes four specific characteristics, wherein the first specific characteristic is title, the second specific characteristic is open classification, third Specific characteristic is brief introduction, the 4th specific characteristic is essential information.The categorization module 106 calculates two by KNN algorithm The mode of similarity between encyclopaedia content of pages may is that the cosine of the term vector between a) two groups of calculating of " titles " is similar Degree;B) calculates the cosine similarity of the term vector between two groups " open classification ";C) is calculated in two groups " essential informations " and is repeated The number of item, to obtain a similarity;D) calculates the coincidence factor of two groups " brief introduction ";E) is by the operation result of above-mentioned a-d It is weighted to obtain the comprehensive similarity between two encyclopaedia content of pages.
It is understood that can corresponding weight coefficient be arranged in advance for first to fourth specific characteristic, and then can be with It is similar to the operation result of a-d to be weighted to obtain the synthesis between two encyclopaedia content of pages according to weight coefficient Degree.
The adding module 107 is used to the classification information of each agriculture entity being added to the agricultural knowledge figure Spectrum.
In one embodiment, the classification information of each agriculture entity can be added to by the adding module 107 The agricultural knowledge map, and then agricultural knowledge general view function and agriculture entity classification tree function may be implemented.
In one embodiment, the adding module 107 is also based on default extracting rule to the encyclopaedia crawled Content of pages extracts, and is added to the corpus for obtained content is extracted.The default extracting rule can be needle The specified region content of the encyclopaedia page is extracted.In other embodiments of the invention, the adding module 107 may be used also To extract based on default extracting rule to the encyclopaedia content of pages crawled, and institute is added to by obtained content is extracted State agricultural knowledge map.
In one embodiment, in order to ensure the accuracy of the agricultural knowledge map, default update can be set and advised Then the agricultural knowledge map is updated.The presupposed information updates rule can be set according to actual use demand It is fixed, for example it may include monthly updating the primary agricultural knowledge map that the presupposed information, which updates rule,.
In one embodiment, it can use the obtained agricultural knowledge map of establishing and realize following functions:
1) agricultural Entity recognition function may be implemented to identify agriculture entity and its agriculture in non-structured text Type.For example, a certain non-structured text content are as follows: " chemical fertilizer abbreviation chemical fertilizer.It is made of chemistry and (or) physical method The nutrient needed containing one or more of crop growths fertilizer, also referred to as inorganic fertilizer, including ammonia fertilizer, phosphate fertilizer, potassium Fertile, micro- fertilizer, composite chemical fertilizer etc..Pass through the available recognition result of agricultural knowledge map are as follows:Chemical fertilizerReferred to asChemical fertilizer。 Contain one or more made of chemistry and (or) physical methodCropsWhat growth neededNutrient'sFertilizer, also referred to asNothing Machine fertilizer, includingAmmonia fertilizerPhosphate fertilizerPotash fertilizerMicro- fertilizerComposite chemical fertilizerDeng.Wherein, scribing line content is the entity of identification.
2) agricultural entity encyclopaedia function can show the information such as essential information, the type of farming of agriculture entity.For example it looks into The agriculture entity of inquiry be " pea ", the agricultural knowledge map can show encyclopaedia entry " https: // Pea/822636 baike.baidu.com/item/? part/full content shown by fr=aladdin ".
3) agricultural knowledge general view function may be implemented to sort out agricultural knowledge.For example, fruit is returned Class.
Classification special topic: fruit
Higher level's classification: the edible plant of food agricultural-
Junior's classification:
Types of fruits
In one embodiment, classification can also be carried out with fruit title initial to show.Such as:
4) agriculture entity class is organized into tree and is shown by agricultural entity classification tree function, realization.Citing and Speech is that root node carries out tree displaying with agricultural, can be obtained:
Agricultural
5) associated entity query function, input entity can be inquired to obtain entity and relation information associated therewith.It lifts For example, the agriculture entity of input is banana, and following information can be obtained:
Entity 1 Relationship Entity 2
Banana Subclass relation Berry
Banana Subclass relation Fruit
Banana Subclass relation Fruit tree in many tropical and subtropical countries
Banana It is different from Plantain
Banana Color Yellow
Banana Color Brown
Banana Color It is red
Banana Color Green
Banana The natural product of taxon Musa
Banana The natural product of taxon Fruitlet open country any of several broadleaf plants hybridizes wild any of several broadleaf plants
…… …… ……
6) the agricultural knowledge problem of input is answered in agricultural knowledge question and answer function, realization.For example, problem is " suitable kind of Fuyang City Taihe County what? " reasoning foundation according to figure 5, available answer is: beech walnut broad-leaved Set Mongolian oak liana Acer.
Fig. 6 is the flow chart of agriculture knowledge mapping construction method in an embodiment of the present invention.Institute according to different requirements, The sequence for stating step in flow chart can change, and certain steps can be omitted.
Step S600 obtains the agriculture field data of a target area, and the agriculture field data based on acquisition construct language Expect library.
Step S602 carries out participle to the corpus in the corpus and handles with part-of-speech tagging, to identify multiple entities Concept.
Step S604 screens the entitative concept identified according to default screening rule to obtain multiple agriculture entities.
Step S606 carries out parsing to the corpus in the corpus and extracts with relationship, obtains multiple agriculture entities Between incidence relation.
Step S608 is established according to the incidence relation between each agriculture entity and each agriculture entity Agricultural knowledge map.
It is specified that building one may be implemented in above-mentioned agricultural knowledge map construction device, method and computer readable storage medium The agricultural knowledge map in region promotes the convenience that agriculture field data management efficiency and data use, can assist peasant into Row crop production, enterprise procurement, for public science popularization agricultural knowledge.
It will be apparent to those skilled in the art that the reality of production can be combined with scheme of the invention according to the present invention and inventive concept Border needs to make other and is altered or modified accordingly, and these change and adjustment all should belong to range disclosed in this invention.

Claims (7)

1. a kind of agricultural knowledge map construction method, which is characterized in that the described method includes:
The agriculture field data of a target area are obtained, and the agriculture field data based on acquisition construct corpus;
It carries out participle to the corpus in the corpus to handle with part-of-speech tagging, to identify multiple entitative concepts;
The entitative concept identified is screened according to default screening rule to obtain multiple agriculture entities;
It carries out parsing to the corpus in the corpus to extract with relationship, the association obtained between multiple agriculture entities is closed System;
According to the incidence relation between each agriculture entity and each agriculture entity, agricultural knowledge map is established;
It swashes from network the encyclopaedia information obtained to each agriculture entity;
The encyclopaedia content of pages crawled is extracted based on default extracting rule, and is added to obtained content is extracted The corpus and/or the agricultural knowledge map;
Similarity analysis is carried out to the encyclopaedia content of pages crawled, obtains the classification information of each agriculture entity;And
The classification information of each agriculture entity is added to the agricultural knowledge map;
Wherein, the step of described pair of encyclopaedia content of pages that crawls carries out similarity analysis include:
Extract multiple specific characteristics in each encyclopaedia content of pages;
The similarity of each feature group between any two encyclopaedia content of pages is calculated using KNN algorithm;And
The similarity for the multiple feature groups being calculated is weighted to obtain any two encyclopaedia content of pages Comprehensive similarity.
2. the method as described in claim 1, which is characterized in that the agriculture field data include non-structured agriculture field Data and semi-structured agriculture field data.
3. method according to claim 1 or 2, which is characterized in that the corpus in the corpus carry out participle with Part-of-speech tagging processing, to identify multiple entitative concepts the step of include:
Participle is carried out to the corpus in the corpus using default lexical analysis tool to handle with part-of-speech tagging;And
Entity recognition is named to the result of part-of-speech tagging, to identify multiple entitative concepts.
4. the method as described in claim 1, which is characterized in that the corpus in the corpus carries out parsing and relationship The step of extracting, obtaining the incidence relation between multiple agriculture entities include:
Corpus in the corpus is parsed to obtain morphological information, syntactic information and semantic information;And
Obtained morphological information, syntactic information and the semantic information that parse is input to relationship extraction model trained in advance, Obtain the incidence relation between multiple agriculture entities.
5. the method as described in claim 1, which is characterized in that described according to each agriculture entity and each agriculture Incidence relation between industry entity, the step of establishing agricultural knowledge map include:
Incidence relation between the name identification of each agriculture entity and each agriculture entity is directed into default Graphic data base, and carry out visualization and be converted to the agricultural knowledge map.
6. a kind of agricultural knowledge map construction device, described device includes processor and memory, is stored on the memory Several computer programs, which is characterized in that realized such as when the processor is for executing the computer program stored in memory The step of claim 1-5 described in any item agricultural knowledge map construction methods.
7. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage has a plurality of instruction, A plurality of described instruction can be executed by one or more processor, to realize that agricultural as described in any one in claim 1-5 is known The step of knowing map construction method.
CN201910528268.5A 2019-06-18 2019-06-18 Agricultural knowledge graph construction device and method and computer readable storage medium Active CN110209839B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910528268.5A CN110209839B (en) 2019-06-18 2019-06-18 Agricultural knowledge graph construction device and method and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910528268.5A CN110209839B (en) 2019-06-18 2019-06-18 Agricultural knowledge graph construction device and method and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN110209839A true CN110209839A (en) 2019-09-06
CN110209839B CN110209839B (en) 2021-07-27

Family

ID=67793313

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910528268.5A Active CN110209839B (en) 2019-06-18 2019-06-18 Agricultural knowledge graph construction device and method and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110209839B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111488741A (en) * 2020-04-14 2020-08-04 税友软件集团股份有限公司 Tax knowledge data semantic annotation method and related device
CN111507758A (en) * 2020-04-09 2020-08-07 深圳传世智慧科技有限公司 Semantic analysis-based investigation method, device, system and server
CN112199508A (en) * 2020-08-10 2021-01-08 淮阴工学院 Parameter adaptive agricultural knowledge graph recommendation method based on remote supervision
CN112329468A (en) * 2020-11-03 2021-02-05 中国平安财产保险股份有限公司 Method and device for constructing heterogeneous relation network, computer equipment and storage medium
CN112465287A (en) * 2019-09-09 2021-03-09 国际商业机器公司 Improving farm farming quality
CN113010529A (en) * 2019-12-19 2021-06-22 广州极飞科技股份有限公司 Crop management method and device based on knowledge graph
CN113220827A (en) * 2021-04-23 2021-08-06 哈尔滨工业大学 Construction method and device of agricultural corpus
CN113761971A (en) * 2020-06-02 2021-12-07 中国人民解放军战略支援部队信息工程大学 Method and device for constructing target knowledge graph of remote sensing image

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104484461A (en) * 2014-12-29 2015-04-01 北京奇虎科技有限公司 Method and system based on encyclopedia data for classifying entities
CN106250412A (en) * 2016-07-22 2016-12-21 浙江大学 The knowledge mapping construction method merged based on many source entities
CN106776711A (en) * 2016-11-14 2017-05-31 浙江大学 A kind of Chinese medical knowledge mapping construction method based on deep learning
CN106815293A (en) * 2016-12-08 2017-06-09 中国电子科技集团公司第三十二研究所 System and method for constructing knowledge graph for information analysis
US20180025008A1 (en) * 2016-07-20 2018-01-25 Baidu Usa Llc Systems and methods for homogeneous entity grouping
CN108804521A (en) * 2018-04-27 2018-11-13 南京柯基数据科技有限公司 A kind of answering method and agricultural encyclopaedia question answering system of knowledge based collection of illustrative plates
CN108984661A (en) * 2018-06-28 2018-12-11 上海海乂知信息科技有限公司 Entity alignment schemes and device in a kind of knowledge mapping
CN109062939A (en) * 2018-06-20 2018-12-21 广东外语外贸大学 A kind of intelligence towards Chinese international education leads method
CN109271530A (en) * 2018-10-17 2019-01-25 长沙瀚云信息科技有限公司 A kind of disease knowledge map construction method and plateform system, equipment, storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104484461A (en) * 2014-12-29 2015-04-01 北京奇虎科技有限公司 Method and system based on encyclopedia data for classifying entities
US20180025008A1 (en) * 2016-07-20 2018-01-25 Baidu Usa Llc Systems and methods for homogeneous entity grouping
CN106250412A (en) * 2016-07-22 2016-12-21 浙江大学 The knowledge mapping construction method merged based on many source entities
CN106776711A (en) * 2016-11-14 2017-05-31 浙江大学 A kind of Chinese medical knowledge mapping construction method based on deep learning
CN106815293A (en) * 2016-12-08 2017-06-09 中国电子科技集团公司第三十二研究所 System and method for constructing knowledge graph for information analysis
CN108804521A (en) * 2018-04-27 2018-11-13 南京柯基数据科技有限公司 A kind of answering method and agricultural encyclopaedia question answering system of knowledge based collection of illustrative plates
CN109062939A (en) * 2018-06-20 2018-12-21 广东外语外贸大学 A kind of intelligence towards Chinese international education leads method
CN108984661A (en) * 2018-06-28 2018-12-11 上海海乂知信息科技有限公司 Entity alignment schemes and device in a kind of knowledge mapping
CN109271530A (en) * 2018-10-17 2019-01-25 长沙瀚云信息科技有限公司 A kind of disease knowledge map construction method and plateform system, equipment, storage medium

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
HAORAN SUN等: ""Topic Model Based Konwledge Graph for Entity Similarity Measuring"", 《2018 IEEE 15TH INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING》 *
夏迎春: "" 基于知识图谱的农业知识服务系统研究"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
王子涵等: ""基于实体相似度信息的知识图谱补全算法"", 《第七届中国数据挖掘会议》 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112465287A (en) * 2019-09-09 2021-03-09 国际商业机器公司 Improving farm farming quality
CN113010529A (en) * 2019-12-19 2021-06-22 广州极飞科技股份有限公司 Crop management method and device based on knowledge graph
CN111507758A (en) * 2020-04-09 2020-08-07 深圳传世智慧科技有限公司 Semantic analysis-based investigation method, device, system and server
CN111507758B (en) * 2020-04-09 2023-10-31 深圳传世智慧科技有限公司 Investigation method, device, system and server based on semantic analysis
CN111488741A (en) * 2020-04-14 2020-08-04 税友软件集团股份有限公司 Tax knowledge data semantic annotation method and related device
CN113761971B (en) * 2020-06-02 2023-06-20 中国人民解放军战略支援部队信息工程大学 Remote sensing image target knowledge graph construction method and device
CN113761971A (en) * 2020-06-02 2021-12-07 中国人民解放军战略支援部队信息工程大学 Method and device for constructing target knowledge graph of remote sensing image
CN112199508A (en) * 2020-08-10 2021-01-08 淮阴工学院 Parameter adaptive agricultural knowledge graph recommendation method based on remote supervision
CN112199508B (en) * 2020-08-10 2024-01-19 淮阴工学院 Parameter self-adaptive agricultural knowledge graph recommendation method based on remote supervision
CN112329468A (en) * 2020-11-03 2021-02-05 中国平安财产保险股份有限公司 Method and device for constructing heterogeneous relation network, computer equipment and storage medium
CN112329468B (en) * 2020-11-03 2023-07-25 中国平安财产保险股份有限公司 Method and device for constructing heterogeneous relation network, computer equipment and storage medium
CN113220827B (en) * 2021-04-23 2023-03-28 哈尔滨工业大学 Construction method and device of agricultural corpus
CN113220827A (en) * 2021-04-23 2021-08-06 哈尔滨工业大学 Construction method and device of agricultural corpus

Also Published As

Publication number Publication date
CN110209839B (en) 2021-07-27

Similar Documents

Publication Publication Date Title
CN110209839A (en) Agricultural knowledge map construction device, method and computer readable storage medium
Liu et al. Tomato diseases and pests detection based on improved Yolo V3 convolutional neural network
CN113590837B (en) Deep learning-based food and health knowledge graph construction method
CN108182262A (en) Intelligent Answer System construction method and system based on deep learning and knowledge mapping
CN110597969B (en) Agricultural knowledge intelligent question and answer method and system and electronic equipment
CN112231460B (en) Construction method of question-answering system based on agricultural encyclopedia knowledge graph
CN112199508B (en) Parameter self-adaptive agricultural knowledge graph recommendation method based on remote supervision
CN111966787B (en) Knowledge graph-based construction method of intelligent fishery question-answering robot
CN117216294A (en) Agricultural knowledge graph construction method, device, equipment and storage medium
Deepa et al. An effective automated ontology construction based on the agriculture domain
CN114021546A (en) Peach production knowledge open question-answering method and device for migrating context network
Mohapatra et al. Using TF-IDF on Kisan call centre dataset for obtaining query answers
CN116010564A (en) Construction method of rice pest question-answering system based on multi-mode knowledge graph
Wang et al. A Weakly-Supervised Method for Named Entity Recognition of Agricultural Knowledge Graph.
CN114780740A (en) Construction method of tea knowledge graph
CN112231535B (en) Method for making multi-modal data set in field of agricultural diseases and insect pests, processing device and storage medium
CN113434644A (en) Agricultural technology knowledge service method and system
Gao The Application of Information Classification in Agricultural Production Based on Internet of Things and Deep Learning
Suganya et al. Ontology based information extraction-a review
Phoksawat et al. Ontology-based knowledge and optimization model for Decision Support System to intercropping
CN114996474A (en) Construction method of grape planting knowledge map database
CN114860917A (en) Agricultural knowledge question-answering method, device, electronic equipment and storage medium
Montellano Butterfly, Larvae and Pupae Defects Detection Using Convolutional Neural Network and Apriori Algorithm
Hashmi et al. Machine Learning and Deep Learning for Smart Agriculture and Applications
Alimkar et al. KISAN MITRA: AN INTELLIGENT CHATBOT

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant