CN107256260A - A kind of intelligent semantic recognition methods, searching method, apparatus and system - Google Patents
A kind of intelligent semantic recognition methods, searching method, apparatus and system Download PDFInfo
- Publication number
- CN107256260A CN107256260A CN201710440790.9A CN201710440790A CN107256260A CN 107256260 A CN107256260 A CN 107256260A CN 201710440790 A CN201710440790 A CN 201710440790A CN 107256260 A CN107256260 A CN 107256260A
- Authority
- CN
- China
- Prior art keywords
- keyword
- target
- regular expression
- keywords
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 48
- 230000014509 gene expression Effects 0.000 claims abstract description 93
- 238000010276 construction Methods 0.000 claims description 4
- 230000011218 segmentation Effects 0.000 description 13
- 239000002184 metal Substances 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000003993 interaction Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3332—Query translation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides a kind of intelligent semantic recognition methods, searching method, apparatus and system, the intelligent semantic recognition methods includes:Configuration file is built, the configuration file includes at least one keyword and each described keyword distinguishes corresponding regular expression;Obtain at least one keyword of user's input;At least one target regular expression corresponding with least one keyword that the user inputs is determined from the configuration file;According at least one described target regular expression, at least one keyword that the user inputs is converted to the target keyword of at least one setting form;The target keyword of at least one setting form is sent to the search engine of outside.This programme can improve the accuracy of search result.
Description
Technical Field
The invention relates to the technical field of computers, in particular to an intelligent semantic identification method, a search method, a device and a system.
Background
With the advent of the big data age, the amount of data has increased dramatically. How to quickly and accurately acquire useful data from massive data becomes a key point of user attention. Search engines, as a system that can provide search services to users, are becoming the first choice for addressing such problems.
The full-text search engine is the most commonly used search engine, and the working principle thereof is generally as follows: according to the predefined word segmentation rule, performing word segmentation on characters in each article, then establishing a corresponding index for each word after word segmentation, and indicating the occurrence times and positions of the word in the article. When keywords input by a user are received, the corresponding articles can be searched according to the pre-established index, and the searched articles are fed back to the user.
Because the user generally inputs the keywords according to the daily use habit, the keywords input by the user are inconsistent with the word segmentation rule of the search engine, so that the corresponding articles cannot be accurately searched by using the keywords input by the user, and the accuracy of the search result is low.
Disclosure of Invention
The embodiment of the invention provides an intelligent semantic identification method, a search method, a device and a system, which can improve the accuracy of search results.
In a first aspect, an embodiment of the present invention provides an intelligent semantic identification method, including:
constructing a configuration file, wherein the configuration file comprises at least one keyword and a regular expression corresponding to each keyword;
further comprising:
acquiring at least one keyword input by a user;
determining at least one target regular expression corresponding to at least one keyword input by the user from the configuration file;
converting at least one keyword input by the user into at least one target keyword in a set format according to the at least one target regular expression;
and sending the at least one target keyword with the set format to an external search engine.
Preferably, the first and second electrodes are formed of a metal,
the constructing of the configuration file, wherein the configuration file comprises at least one keyword and a regular expression corresponding to each keyword respectively, comprises:
constructing an extensible markup language xml document, wherein the xml document comprises at least one keyword and a regular expression corresponding to each keyword.
Preferably, the first and second electrodes are formed of a metal,
the constructing of the xml document of the extensible markup language, wherein the xml document comprises at least one keyword and a regular expression corresponding to each keyword respectively, comprises:
constructing an xml initial document;
constructing at least one node in the xml initial document, and storing at least one regular expression under each node to form the xml document; and the types of the regular expressions stored under the same node are the same as the types of the corresponding keywords under the current node.
Preferably, the first and second electrodes are formed of a metal,
the determining, from the configuration file, at least one target regular expression corresponding to the at least one keyword input by the user includes:
for each of the keywords input by the user, performing:
determining the type corresponding to the keyword according to the format of the keyword;
determining a node corresponding to the type of the keyword from the xml document;
and determining a target regular expression corresponding to the key word from at least one regular expression stored under the determined node.
In a second aspect, an embodiment of the present invention provides a search method, applied to a search engine, including:
receiving at least one target keyword in a set format;
and searching according to the at least one target keyword with the set format.
Preferably, the first and second electrodes are formed of a metal,
further comprising: the method comprises the steps of constructing an index relationship between at least one keyword in a set format and at least one document in advance;
the searching according to the at least one target keyword with a set format comprises:
and determining at least one target document corresponding to the at least one target keyword according to the index relation.
In a third aspect, an embodiment of the present invention provides an intelligent semantic recognition device based on a configuration file, including: the device comprises a construction unit, an acquisition unit, a processing unit and a sending unit; wherein,
the building unit is used for building a configuration file, and the configuration file comprises at least one keyword and a regular expression corresponding to each keyword;
the acquisition unit is used for acquiring at least one keyword input by a user;
the processing unit is used for determining at least one target regular expression corresponding to at least one keyword input by the user from the configuration file; converting at least one keyword input by the user into at least one target keyword in a set format according to the at least one target regular expression;
and the sending unit is used for sending the at least one target keyword with the set format to an external search engine.
Preferably, the first and second electrodes are formed of a metal,
the constructing unit is used for constructing an xml initial document, constructing at least one node in the xml initial document, and storing at least one regular expression under each node to form the xml document; and the types of the regular expressions stored under the same node are the same as the types of the corresponding keywords under the current node.
In a fourth aspect, an embodiment of the present invention provides a search engine, including: a receiving unit and a searching unit; wherein,
the receiving unit is used for receiving at least one target keyword with a set format;
and the searching unit is used for searching according to the at least one target keyword with the set format.
In a fifth aspect, an embodiment of the present invention provides a search system, including: the intelligent semantic recognition device provided by any of the above embodiments of the present invention, and the search engine provided by any of the above embodiments of the present invention; wherein,
the intelligent semantic recognition device is used for converting at least one keyword input by a user into at least one target keyword in a set format and sending the target keyword to the search engine;
and the search engine is used for receiving the target keywords sent by the intelligent semantic recognition device and searching according to the target keywords.
The embodiment of the invention provides an intelligent semantic identification method, a search method, a device and a system. When keywords input by a user are acquired, a target regular expression corresponding to the keywords is determined from a configuration file, the acquired keywords are converted into target keywords in a set format according to the determined target regular expression, and then the converted target keywords are sent to a search engine so that the search engine can search according to the received target keywords. The keywords input by the user are converted through the pre-constructed regular expression, so that the keywords input by the user are consistent with the word segmentation rule of the search engine, and the accuracy of the search result is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is a flow chart of an intelligent semantic recognition method provided by an embodiment of the invention;
FIG. 2 is a flow chart of a search method provided by an embodiment of the invention;
FIG. 3 is a schematic structural diagram of an intelligent semantic recognition apparatus according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a search apparatus according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of a search system according to an embodiment of the present invention;
fig. 6 is a flowchart of a method for using a search system according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer and more complete, the technical solutions in the embodiments of the present invention will be described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention, and based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without creative efforts belong to the scope of the present invention.
As shown in fig. 1, an embodiment of the present invention provides an intelligent semantic recognition method, which may include the following steps:
step 101: constructing a configuration file, wherein the configuration file comprises at least one keyword and a regular expression corresponding to each keyword;
step 102: acquiring at least one keyword input by a user;
step 103: determining at least one target regular expression corresponding to at least one keyword input by the user from the configuration file;
step 104: converting at least one keyword input by the user into at least one target keyword in a set format according to the at least one target regular expression;
step 105: and sending the at least one target keyword with the set format to an external search engine.
In the above embodiment, a configuration file including at least one keyword and a regular expression corresponding to each keyword is constructed in advance. When keywords input by a user are acquired, a target regular expression corresponding to the keywords is determined from a configuration file, the acquired keywords are converted into target keywords in a set format according to the determined target regular expression, and then the converted target keywords are sent to a search engine so that the search engine can search according to the received target keywords. The keywords input by the user are converted through the pre-constructed regular expression, so that the keywords input by the user are consistent with the word segmentation rule of the search engine, and the accuracy of the search result is improved.
In an embodiment of the present invention, the specific implementation of step 101 may include:
constructing an extensible markup language xml document, wherein the xml document comprises at least one keyword and a regular expression corresponding to each keyword.
Here, the constructed configuration file may be an Extensible Markup Language (xml) document, and the keywords and the regular expressions corresponding to the keywords in the configuration file are in an xml format. For example, the regular expression corresponding to the age-formatted keyword is: v 0-9 {2} [ -/\ years ]? V,/[ 0-9] {2} [ -/\ years ] {1} - [0-9] {2} [ -/\\ years ] {1 }? /. Therefore, each keyword and the corresponding regular expression can be directly understood by the computer, so that the conversion efficiency of the keywords input by the user can be improved, and the search efficiency is further improved.
Specifically, in an embodiment of the present invention, the constructing an xml document in an extensible markup language, where the xml document includes at least one keyword and a regular expression corresponding to each keyword, includes:
constructing an xml initial document;
constructing at least one node in the xml initial document, and storing at least one regular expression under each node to form the xml document; and the types of the regular expressions stored under the same node are the same as the types of the corresponding keywords under the current node.
Here, an xml initial document is first constructed, nodes respectively corresponding to the types of various keywords are constructed in the xml initial document, and then regular expressions corresponding to the keywords of the same type are stored under the same node. For example, if the keyword types include an age type, a date type and a license plate number type, three nodes are established in the xml initial document and respectively correspond to each keyword type, and the regular expressions corresponding to the keywords of the age type, the date type and the license plate number type are stored in the nodes corresponding to the regular expressions to form the xml document, so that the configuration file is convenient to manage.
In an embodiment of the present invention, the specific implementation of step 103 may include:
for each of the keywords input by the user, performing:
determining the type corresponding to the keyword according to the format of the keyword;
determining a node corresponding to the type of the keyword from the xml document;
and determining a target regular expression corresponding to the key word from at least one regular expression stored under the determined node.
For example, when the keywords input by the user include age a, date B, and license plate number C, since the characters included in age are numbers, the date includes numbers and chinese characters or numbers and punctuation, and the license plate number includes numbers and letters, the type corresponding to the keywords can be determined according to different formats of the keywords. Here, the types corresponding to the age a, the date B, and the license plate number C are an age type, a date type, and a license plate number type, respectively. Nodes corresponding to the types can be determined from the xml documents, and then the target regular expression corresponding to the key words is determined from the determined nodes. By means of the step-by-step determination mode, the efficiency of determining the target regular expression can be improved, and further the searching efficiency is improved.
As shown in fig. 2, an embodiment of the present invention provides a search method applied to a search engine, where the method may include the following steps:
step 201: receiving at least one target keyword in a set format;
step 202: and searching according to the at least one target keyword with the set format.
In the embodiment, the search is performed according to the received target keywords in the set format, and the target keywords in the set format conform to the word segmentation rule of the search engine, so that the accuracy of the search result is improved.
In one embodiment of the present invention, the method may further comprise: the method comprises the steps of constructing an index relationship between at least one keyword in a set format and at least one document in advance;
specific embodiments of step 202 may include:
and determining at least one target document corresponding to the at least one target keyword according to the index relation.
In the full-text search engine, an index relationship between at least one keyword in a set format and at least one article can be pre-constructed, and after the keyword in the set format is received, a target article corresponding to the keyword can be determined according to the pre-constructed index relationship. Because the index relationship between the keywords and the documents is constructed in advance, after the keywords are received, the corresponding target documents can be directly determined according to the index relationship, so that the accuracy of the search result is improved, and meanwhile, the search efficiency is also improved.
As shown in fig. 3, an embodiment of the present invention provides an intelligent semantic recognition apparatus based on a configuration file, including: a construction unit 301, an acquisition unit 302, a processing unit 303 and a transmission unit 304; wherein,
the constructing unit 301 is configured to construct a configuration file, where the configuration file includes at least one keyword and a regular expression corresponding to each keyword;
the obtaining unit 302 is configured to obtain at least one keyword input by a user;
the processing unit 303 is configured to determine, from the configuration file constructed by the construction unit 301, at least one target regular expression corresponding to the at least one keyword acquired by the acquisition unit 302; converting at least one keyword input by the user into at least one target keyword in a set format according to the at least one target regular expression;
the sending unit 304 is configured to send the at least one target keyword in the set format converted by the processing unit 303 to an external search engine.
In the above embodiment, a configuration file including at least one keyword and a regular expression corresponding to each keyword is constructed in advance. When keywords input by a user are acquired, a target regular expression corresponding to the keywords is determined from a configuration file, the acquired keywords are converted into target keywords in a set format according to the determined target regular expression, and then the converted target keywords are sent to a search engine so that the search engine can search according to the received target keywords. The keywords input by the user are converted through the pre-constructed regular expression, so that the keywords input by the user are consistent with the word segmentation rule of the search engine, and the accuracy of the search result is improved.
In an embodiment of the present invention, the constructing unit 301 is configured to construct an xml initial document, and construct at least one node in the xml initial document, where at least one regular expression is stored under each node to form the xml document; and the types of the regular expressions stored under the same node are the same as the types of the corresponding keywords under the current node.
Here, the constructed configuration file may be an Extensible Markup Language (xml) document, and the keywords and the regular expressions corresponding to the keywords in the configuration file are in an xml format. For example, the regular expression corresponding to the age-formatted keyword is: v 0-9 {2} [ -/\ years ]? V,/[ 0-9] {2} [ -/\ years ] {1} - [0-9] {2} [ -/\\ years ] {1 }? /. Therefore, each keyword and the corresponding regular expression can be directly understood by the computer, so that the conversion efficiency of the keywords input by the user can be improved, and the search efficiency is further improved.
When constructing the xml document, firstly constructing an xml initial document, constructing nodes respectively corresponding to the types of various keywords in the xml initial document, and then storing regular expressions corresponding to the keywords of the same type into the same node. For example, if the keyword types include an age type, a date type and a license plate number type, three nodes are established in the xml initial document and respectively correspond to each keyword type, and the regular expressions corresponding to the keywords of the age type, the date type and the license plate number type are stored in the nodes corresponding to the regular expressions to form the xml document, so that the configuration file is convenient to manage.
Because the information interaction, execution process, and other contents between the units in the device are based on the same concept as the method embodiment of the present invention, specific contents may refer to the description in the method embodiment of the present invention, and are not described herein again.
As shown in fig. 4, an embodiment of the present invention provides a search engine, including: a receiving unit 401 and a searching unit 402; wherein,
the receiving unit 401 is configured to receive at least one target keyword in a set format;
the searching unit 402 is configured to perform a search according to the at least one target keyword in the set format.
In the embodiment, the search is performed according to the received target keywords in the set format, and the target keywords in the set format conform to the word segmentation rule of the search engine, so that the accuracy of the search result is improved.
Because the information interaction, execution process, and other contents between the units in the device are based on the same concept as the method embodiment of the present invention, specific contents may refer to the description in the method embodiment of the present invention, and are not described herein again.
As shown in fig. 5, an embodiment of the present invention provides a search system, including: the intelligent semantic recognition device 501 provided by any of the above embodiments of the present invention, and the search engine 502 provided by any of the above embodiments of the present invention; wherein,
the intelligent semantic recognition device 501 is configured to convert at least one keyword input by a user into at least one target keyword in a set format, and send the target keyword to the search engine;
the search engine 502 is configured to receive the target keyword sent by the intelligent semantic recognition device, and perform a search according to the target keyword.
In the above embodiment, the target keyword converted into the set format is sent to a search engine, so that the search engine performs a search according to the received target keyword. The keywords with the set format are consistent with the word segmentation rule of the search engine, so that the accuracy of the search result is improved.
As shown in fig. 6, an embodiment of the present invention provides a method for using a search system, which may include the following steps:
step 601: the intelligent semantic recognition device constructs an xml initial document.
Step 602: constructing at least one node in the xml initial document, and storing at least one regular expression under each node to form the xml document; and the types of the regular expressions stored under the same node are the same as the types of the corresponding keywords under the current node.
For example, if the keyword types include an age type, a date type and a license plate number type, three nodes are established in the xml initial document and respectively correspond to each keyword type, and the regular expressions corresponding to the keywords of the age type, the date type and the license plate number type are stored in the nodes corresponding to the regular expressions to form the xml document.
Step 603: at least one keyword input by a user is acquired.
For example, the keywords entered by the user include age a, date B, and license plate number C.
Step 604: determining the type corresponding to the keyword according to the format of the keyword, and determining a node corresponding to the type of the keyword from the xml document.
Step 605: and determining a target regular expression corresponding to the key word from at least one regular expression stored under the determined node.
For example, if the characters included in the age are numbers, the date includes numbers and Chinese characters or numbers and punctuation marks, and the license plate number includes numbers and letters, the type corresponding to the keywords can be determined according to different formats of the keywords. Here, the types corresponding to the age a, the date B, and the license plate number C are an age type, a date type, and a license plate number type, respectively. Nodes corresponding to the types can be determined from the xml documents, and then the target regular expression corresponding to the key words is determined from the determined nodes.
Step 606: and converting the at least one keyword input by the user into at least one target keyword in a set format according to the at least one target regular expression.
Step 607: and sending the at least one target keyword with the set format to an external search engine.
And converting the keywords input by the user according to the regular expression so as to enable the keywords to accord with word segmentation rules of a search engine. For example, for the user-entered age A, convert it to a year-of-birth month A', the search engine may retrieve it from the year-of-birth month.
Step 608: the search engine constructs an index relation between at least one keyword in a set format and at least one document in advance.
For example, the index relationship between different birth months and corresponding documents is constructed in advance.
Step 609: and determining at least one target document corresponding to the at least one target keyword according to the index relation.
Here, the target document corresponding to the birth year and month a 'is determined based on the birth year and month a' obtained after the conversion and the index relationship constructed in advance.
In summary, the intelligent semantic recognition device converts the keywords input by the user into the keywords in the set format according to the word segmentation rule of the search engine, so that the search engine can search according to the converted keywords in the set format, and the search accuracy is improved.
The invention also provides a readable medium comprising executable instructions which, when executed by a processor of a storage controller, cause the storage controller to perform a method as provided by any of the above-described embodiments of the invention.
In addition, the present invention also provides a memory controller comprising: a processor, a memory, and a bus; the memory is used for storing execution instructions, the processor is connected with the memory through the bus, and when the storage controller runs, the processor executes the execution instructions stored in the memory, so that the storage controller executes the method provided by any one of the above embodiments of the invention.
In summary, the embodiments of the present invention have at least the following advantages:
1. in the embodiment of the invention, a configuration file comprising at least one keyword and a regular expression corresponding to each keyword is constructed in advance. When keywords input by a user are acquired, a target regular expression corresponding to the keywords is determined from a configuration file, the acquired keywords are converted into target keywords in a set format according to the determined target regular expression, and then the converted target keywords are sent to a search engine so that the search engine can search according to the received target keywords. The keywords input by the user are converted through the pre-constructed regular expression, so that the keywords input by the user are consistent with the word segmentation rule of the search engine, and the accuracy of the search result is improved.
2. In the embodiment of the present invention, the constructed configuration file may be an xml document, and the keywords in the configuration file and the regular expressions corresponding to the keywords are in an xml format. Therefore, each keyword and the corresponding regular expression can be directly understood by the computer, so that the conversion efficiency of the keywords input by the user can be improved, and the search efficiency is further improved.
3. In the embodiment of the invention, the xml initial document is firstly constructed, the nodes respectively corresponding to the types of various keywords are constructed in the xml initial document, and then the regular expressions corresponding to the keywords of the same type are stored in the same node, so that the configuration file is convenient to manage.
4. In the embodiment of the invention, the type corresponding to the keyword is determined according to the format of the keyword input by the user, the node corresponding to the keyword type is determined from the xml document, and the regular expression corresponding to the keyword input by the user is determined from at least one regular expression stored under the determined node. By means of the step-by-step determination mode, the efficiency of determining the target regular expression can be improved, and further the searching efficiency is improved.
5. In the embodiment of the invention, by pre-constructing the index relationship between at least one keyword in a set format and at least one article, after the keyword in the set format is received, the corresponding target document can be directly determined according to the index relationship, so that the accuracy of the search result is improved, and the search efficiency is also improved.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a" does not exclude the presence of other similar elements in a process, method, article, or apparatus that comprises the element.
Those of ordinary skill in the art will understand that: all or part of the steps for realizing the method embodiments can be completed by hardware related to program instructions, the program can be stored in a computer readable storage medium, and the program executes the steps comprising the method embodiments when executed; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.
Finally, it is to be noted that: the above description is only a preferred embodiment of the present invention, and is only used to illustrate the technical solutions of the present invention, and not to limit the protection scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.
Claims (10)
1. An intelligent semantic recognition method, comprising:
constructing a configuration file, wherein the configuration file comprises at least one keyword and a regular expression corresponding to each keyword;
further comprising:
acquiring at least one keyword input by a user;
determining at least one target regular expression corresponding to at least one keyword input by the user from the configuration file;
converting at least one keyword input by the user into at least one target keyword in a set format according to the at least one target regular expression;
and sending the at least one target keyword with the set format to an external search engine.
2. The method of claim 1,
the constructing of the configuration file, wherein the configuration file comprises at least one keyword and a regular expression corresponding to each keyword respectively, comprises:
constructing an extensible markup language xml document, wherein the xml document comprises at least one keyword and a regular expression corresponding to each keyword.
3. The method of claim 2,
the constructing of the xml document of the extensible markup language, wherein the xml document comprises at least one keyword and a regular expression corresponding to each keyword respectively, comprises:
constructing an xml initial document;
constructing at least one node in the xml initial document, and storing at least one regular expression under each node to form the xml document; and the types of the regular expressions stored under the same node are the same as the types of the corresponding keywords under the current node.
4. The method of claim 3,
the determining, from the configuration file, at least one target regular expression corresponding to the at least one keyword input by the user includes:
for each of the keywords input by the user, performing:
determining the type corresponding to the keyword according to the format of the keyword;
determining a node corresponding to the type of the keyword from the xml document;
and determining a target regular expression corresponding to the key word from at least one regular expression stored under the determined node.
5. A search method is applied to a search engine and comprises the following steps:
receiving at least one target keyword in a set format;
and searching according to the at least one target keyword with the set format.
6. The method of claim 5,
further comprising: the method comprises the steps of constructing an index relationship between at least one keyword in a set format and at least one document in advance;
the searching according to the at least one target keyword with a set format comprises:
and determining at least one target document corresponding to the at least one target keyword according to the index relation.
7. An intelligent semantic recognition device, comprising: the device comprises a construction unit, an acquisition unit, a processing unit and a sending unit; wherein,
the building unit is used for building a configuration file, and the configuration file comprises at least one keyword and a regular expression corresponding to each keyword;
the acquisition unit is used for acquiring at least one keyword input by a user;
the processing unit is used for determining at least one target regular expression corresponding to at least one keyword input by the user from the configuration file; converting at least one keyword input by the user into at least one target keyword in a set format according to the at least one target regular expression;
and the sending unit is used for sending the at least one target keyword with the set format to an external search engine.
8. The apparatus of claim 7,
the constructing unit is used for constructing an xml initial document, constructing at least one node in the xml initial document, and storing at least one regular expression under each node to form the xml document; and the types of the regular expressions stored under the same node are the same as the types of the corresponding keywords under the current node.
9. A search engine, comprising: a receiving unit and a searching unit; wherein,
the receiving unit is used for receiving at least one target keyword with a set format;
and the searching unit is used for searching according to the at least one target keyword with the set format.
10. A search system, comprising: the intelligent semantic recognition device of claim 7 or 8, and the search engine of claim 9; wherein,
the intelligent semantic recognition device is used for converting at least one keyword input by a user into at least one target keyword in a set format and sending the target keyword to the search engine;
and the search engine is used for receiving the target keywords sent by the intelligent semantic recognition device and searching according to the target keywords.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710440790.9A CN107256260A (en) | 2017-06-13 | 2017-06-13 | A kind of intelligent semantic recognition methods, searching method, apparatus and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710440790.9A CN107256260A (en) | 2017-06-13 | 2017-06-13 | A kind of intelligent semantic recognition methods, searching method, apparatus and system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107256260A true CN107256260A (en) | 2017-10-17 |
Family
ID=60024574
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710440790.9A Pending CN107256260A (en) | 2017-06-13 | 2017-06-13 | A kind of intelligent semantic recognition methods, searching method, apparatus and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107256260A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109284362A (en) * | 2018-11-11 | 2019-01-29 | 广东小天才科技有限公司 | Content retrieval method and system |
CN113779935A (en) * | 2021-09-10 | 2021-12-10 | 北京金堤科技有限公司 | Text information acquisition method and system |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3711710B2 (en) * | 1996-12-10 | 2005-11-02 | セイコーエプソン株式会社 | Information search and collection system and storage medium storing information search and collection program |
CN101075435A (en) * | 2007-04-19 | 2007-11-21 | 深圳先进技术研究院 | Intelligent chatting system and its realizing method |
CN103092979A (en) * | 2013-01-31 | 2013-05-08 | 中国科学院对地观测与数字地球科学中心 | Processing method and device for searching of natural language by remote sensing data |
CN103631882A (en) * | 2013-11-14 | 2014-03-12 | 北京邮电大学 | Semantization service generation system and method based on graph mining technique |
US20150293975A1 (en) * | 2013-05-30 | 2015-10-15 | Tencent Technology (Shenzhen) Company Limited | Method and device for searching for contact object, and storage medium |
-
2017
- 2017-06-13 CN CN201710440790.9A patent/CN107256260A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3711710B2 (en) * | 1996-12-10 | 2005-11-02 | セイコーエプソン株式会社 | Information search and collection system and storage medium storing information search and collection program |
CN101075435A (en) * | 2007-04-19 | 2007-11-21 | 深圳先进技术研究院 | Intelligent chatting system and its realizing method |
CN103092979A (en) * | 2013-01-31 | 2013-05-08 | 中国科学院对地观测与数字地球科学中心 | Processing method and device for searching of natural language by remote sensing data |
US20150293975A1 (en) * | 2013-05-30 | 2015-10-15 | Tencent Technology (Shenzhen) Company Limited | Method and device for searching for contact object, and storage medium |
CN103631882A (en) * | 2013-11-14 | 2014-03-12 | 北京邮电大学 | Semantization service generation system and method based on graph mining technique |
Non-Patent Citations (1)
Title |
---|
戎"码"一生: "XML中配置正则表达式的写法", 《HTTPS://WWW.CNBLOGS.COM/LUCKY_HU/ARCHIVE/2013/01/04/2845014.HTML》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109284362A (en) * | 2018-11-11 | 2019-01-29 | 广东小天才科技有限公司 | Content retrieval method and system |
CN113779935A (en) * | 2021-09-10 | 2021-12-10 | 北京金堤科技有限公司 | Text information acquisition method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210256047A1 (en) | System and method for providing technology assisted data review with optimizing features | |
JP5698222B2 (en) | Search method, apparatus and system | |
US20240143904A1 (en) | Human-Computer Interaction Method And Apparatus Thereof | |
CN110297880B (en) | Corpus product recommendation method, apparatus, device and storage medium | |
CN103425687A (en) | Retrieval method and system based on queries | |
US9971828B2 (en) | Document tagging and retrieval using per-subject dictionaries including subject-determining-power scores for entries | |
WO2015084476A1 (en) | Non-standard and standard clause detection | |
CN107679208A (en) | A kind of searching method of picture, terminal device and storage medium | |
CN108875065B (en) | Indonesia news webpage recommendation method based on content | |
CN112883030A (en) | Data collection method and device, computer equipment and storage medium | |
US20230119590A1 (en) | Automatic identification of document sections to generate a searchable data structure | |
CN117851575A (en) | Large language model question-answer optimization method and device, electronic equipment and storage medium | |
CN117149804A (en) | Data processing method, device, electronic equipment and storage medium | |
US11328005B2 (en) | Machine learning (ML) based expansion of a data set | |
CN109299238B (en) | Data query method and device | |
CN107256260A (en) | A kind of intelligent semantic recognition methods, searching method, apparatus and system | |
CN112527954A (en) | Unstructured data full-text search method and system and computer equipment | |
CN112732969A (en) | Image semantic analysis method and device, storage medium and electronic equipment | |
CN111984797A (en) | Customer identity recognition device and method | |
CN109684357B (en) | Information processing method and device, storage medium and terminal | |
CN117056477A (en) | Case data retrieval method, device, equipment and readable storage medium | |
CN103744884A (en) | Method and system for collating information fragments | |
JP2018088182A (en) | Model generation device, click-log correct-answer likelihood calculation device, document retrieval device, method, and program | |
CN103729422A (en) | Information fragment associative output method and system | |
CN110083817B (en) | Naming disambiguation method, device and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20171017 |