CN106844638B - Information retrieval method and device and electronic equipment - Google Patents
Information retrieval method and device and electronic equipment Download PDFInfo
- Publication number
- CN106844638B CN106844638B CN201710045053.9A CN201710045053A CN106844638B CN 106844638 B CN106844638 B CN 106844638B CN 201710045053 A CN201710045053 A CN 201710045053A CN 106844638 B CN106844638 B CN 106844638B
- Authority
- CN
- China
- Prior art keywords
- query
- module
- information
- user
- search term
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides an information retrieval method, an information retrieval device and electronic equipment. The method comprises the steps of obtaining keywords, combining the keywords to obtain a plurality of query indexes, establishing a query dictionary through the query indexes, carrying out matching query on a search term and the query indexes in the query dictionary after receiving the search term input by a user, and displaying information corresponding to the query indexes with the matching degree of the search term higher than a preset value and information associated with the information. According to the invention, by establishing the query dictionary and establishing the mapping relation between the search term and the query result, the searched answer can be provided more efficiently and accurately.
Description
Technical Field
The invention relates to the technical field of information, in particular to an information retrieval method, an information retrieval device and electronic equipment.
Background
With the rapid development of information technology, various data can be collected and stored, so that various massive data sources are formed, such as production environment data recorded by various sensors in industrial and agricultural production in real time; various transaction record data generated by the briskly developed internet e-commerce transaction; various image data and the like generated by a camera used in the fields of public safety, traffic monitoring and the like are becoming increasingly voluminous. The generated and ongoing data can provide reference guidance for various human decisions, for example, various data related to the operation of the company collected by the information department of the enterprise can provide reference for the operation decision of the enterprise; various resident, traffic, safety and other data collected by the public management department can provide reference for the optimization decision of the government department; health-related data such as electronic medical records generated by a hospital health department can provide decision references for insurance companies, health supervision departments and doctors.
Extracting intelligence enough for assisting decision from data, wherein a traditional data processing method is to set a fixed mode, extract data from a database and obtain an output through programming and operation, and the process usually waits for tens of minutes or longer; moreover, the interaction process is unidirectional and preset, that is, the software system or the data system executes the program to calculate the corresponding result according to a preset problem or a preset problem combination every time, so that the query efficiency and the accuracy are low.
Disclosure of Invention
In view of the above, embodiments of the present invention provide an information retrieval method, apparatus and electronic device, which enable a data system or a software system to have certain intelligence by intelligently extracting a keyword from data to set a query phrase and a matching algorithm, so as to more efficiently understand a search term and more efficiently and accurately provide an answer.
In order to achieve the above purpose, the technical solutions adopted in the embodiments of the present invention are as follows:
in a first aspect, an embodiment of the present invention provides an information retrieval method, where the method includes:
acquiring keywords, wherein the keywords are obtained by extraction from an information source or user-defined;
combining the keywords to obtain a plurality of query indexes, and establishing a query dictionary through the query indexes, wherein the query dictionary comprises the query indexes, and the query indexes at least comprise one or more types of phrases, words, morphemes and sentences;
receiving a search term input by a user;
matching the search term with the plurality of query indexes;
and displaying information corresponding to the query index with the matching degree of the search term higher than a preset value and information associated with the information.
Further, the method further comprises:
establishing a matching group corresponding to the search term and an information source corresponding to a query index with the matching degree higher than a preset value;
and backing up the matching group to a new information source according to the evaluation of the user on the matching group.
Further, the method further comprises:
and adding the matching group with the user evaluation higher than a preset evaluation value into the query dictionary.
Further, the method further comprises:
and sorting the matching groups according to the evaluation level of the user.
Further, the method further comprises:
duplicate and meaningless query indexes are removed, and the query indexes are sorted according to the frequency of matched queries.
Further, when a user inputs a search term, the method further includes:
and sequentially displaying the query indexes according to the matching degree with the search terms.
Further, the method further comprises:
and backing up the information corresponding to the query index with the matching degree higher than the preset value of the search term to a new information source.
Further, when the search term is a sentence, the method further comprises:
performing word segmentation on the search term;
calling out core data according to each participle;
displaying conclusion data with the matching degree with the core data higher than a preset value;
and responding to the user selection of the conclusion data, and adding the question data and the search term selected by the user into the query dictionary.
In a second aspect, an embodiment of the present invention provides an information retrieval apparatus, where the apparatus includes:
the acquisition module is used for acquiring keywords, and the keywords are obtained by extraction from an information source or user-defined;
the combination module is used for combining the keywords to obtain a plurality of query indexes, and establishing a query dictionary through the query indexes, wherein the query dictionary comprises the query indexes which at least comprise one or more types of phrases, words, morphemes and sentences;
the receiving module is used for receiving search terms input by a user;
the query module is used for carrying out matching query on the search terms and the plurality of query indexes;
and the display module is used for displaying information corresponding to the query index with the matching degree of the search term higher than a preset value and information associated with the information.
Further, the apparatus further comprises:
the matching group generating module is used for establishing a matching group corresponding to the search term and an information source corresponding to the query index with the matching degree higher than a preset value;
and the backup module is used for backing up the matched group to a new information source according to the evaluation of the user on the matched group.
Further, the apparatus further comprises:
and the updating module is used for adding the matching group with the evaluation higher than the preset evaluation value into the query dictionary.
Further, the apparatus further comprises:
and the sorting module is used for sorting the matching groups according to the evaluation level of the user.
Further, the apparatus further comprises:
a sift module for removing duplicate and meaningless query indexes;
and the sorting module is used for sorting and sorting the query indexes according to the frequency of the matched query.
Further, when a user inputs a search term, the display module is further used for sequentially displaying the query indexes according to the matching degree with the search term.
Further, the apparatus further comprises:
and the backup module is used for backing up the information corresponding to the query index with the matching degree higher than the preset value of the search term to a new information source.
Further, the apparatus further comprises:
the word segmentation module is used for segmenting the search terms;
the calling module is used for calling out core data according to each participle;
the display module is also used for displaying conclusion data of which the matching degree with the core data is higher than a preset value;
and the updating module is used for responding to the selection of the user on the conclusion data and adding the problem core data and the search terms selected by the user into the query dictionary.
In a third aspect, an embodiment of the present invention further provides an electronic device, including:
a processor;
a memory; and
an information retrieval device installed in the memory and including one or more software functional modules executed by the processor, the information retrieval device comprising:
the acquisition module is used for acquiring keywords, and the keywords are obtained by extraction from an information source or user-defined;
the combination module is used for combining the keywords to obtain a plurality of query indexes, and establishing a query dictionary through the query indexes, wherein the query dictionary comprises the query indexes which at least comprise one or more types of phrases, words, morphemes and sentences;
the receiving module is used for receiving search terms input by a user;
the query module is used for carrying out matching query on the search terms and the plurality of query indexes;
and the display module is used for displaying information corresponding to the query index with the matching degree of the search term higher than a preset value and information associated with the information.
Compared with the prior art, the information retrieval method, the information retrieval device and the electronic equipment provided by the invention have the advantages that the keywords are obtained and combined to obtain the plurality of query indexes, the query dictionary is established through the query indexes and comprises the plurality of query indexes, after the search terms input by the user are received, the search terms and the query indexes in the query dictionary are subjected to matching query, and the information corresponding to the query indexes with the matching degree higher than the preset value of the search terms and the information related to the information are displayed. According to the invention, by establishing the query dictionary and establishing the mapping relation between the search term and the query result, the searched answer can be provided more efficiently and accurately.
In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.
Fig. 1 is a block diagram of an electronic device according to an embodiment of the present invention.
Fig. 2 is a schematic diagram of a functional module architecture of an information retrieval apparatus according to an embodiment of the present invention.
Fig. 3 is a diagram illustrating an application example of the information retrieval method and apparatus according to the embodiment of the present invention.
Fig. 4-5 are flowcharts of an information retrieval method according to an embodiment of the present invention.
Icon: 100-an electronic device; 110-information retrieval means; 111-an acquisition module; 112-a combining module; 113-a receiving module; 114-a query module; 115-a display module; 116-a screening module; 117-matching group generation module; 118-a backup module; 119-an update module; 120-a sorting module; 121-word segmentation module; 122-calling module; 130-a memory; 150-processor.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.
The information retrieval method and the information retrieval device provided by the embodiment of the invention are applied to electronic equipment. The electronic device may be, but is not limited to, a Personal Computer (PC), a smart phone, a tablet computer, a Personal Digital Assistant (PDA), and the like.
Fig. 1 is a block diagram of the electronic device 100. The electronic device 100 comprises an information retrieval apparatus 110, a memory 130 and a processor 150.
The elements of the memory 130 and the processor 150 are electrically connected to each other, directly or indirectly, to enable data transmission or interaction. For example, the components may be electrically connected to each other via one or more communication buses or signal lines. The information retrieving means 110 includes at least one software function module which can be stored in the memory 130 in the form of software or firmware (firmware) or solidified in an Operating System (OS) of the electronic device 100. The processor 150 is used for executing executable modules stored in the memory 130, such as software functional modules and computer programs included in the information retrieval device 110.
The Memory 130 may be, but is not limited to, a Random Access Memory (RAM), a Read Only Memory (ROM), a Programmable Read-Only Memory (PROM), an Erasable Read-Only Memory (EPROM), an electrically Erasable Read-Only Memory (EEPROM), and the like. The memory 130 is used for storing a program, and the processor 150 executes the program after receiving the execution instruction.
Fig. 2 is a schematic diagram of a functional module architecture of the information retrieval device 110. The information retrieval device 110 is used for searching and extracting required information from an information source according to the search term of a user. The information retrieval apparatus 110 includes an acquisition module 111, a combination module 112, a reception module 113, a query module 114, and a display module 115.
The obtaining module 111 is configured to obtain a keyword, where the keyword is obtained by extraction from an information source or user-defined.
In this embodiment, the information source may be various databases, and the databases may include common information such as text, video, audio, charts, and the like. Keywords are extracted from the information source, representing characteristic elements of the information, and typically consist of a title, an array name, a table name in a database, a column name, or an attribute name in a column database, a custom data range, a filter, a form name, and the like. For example, referring to fig. 3, the information source includes a spreadsheet file named "financial statement," which includes "year: 2014. 2015, 2016 "," type: the keywords that can be extracted from the tv, refrigerator, and washing machine "and the corresponding sales volume are" year "," 2016 "," 2015 "," 2014 "," type "," tv "," refrigerator "," washing machine ", and" sales volume ". For a video information source, a particular frame is extracted, the particular frame is labeled, and the label is used as a keyword, and for an audio information source, a signal feature segment can be extracted, the signal feature segment is labeled, and the label is used as a keyword.
If the information source is an unstructured data set, the keyword can be extracted only if no data in the information source exists in isolation, for example, data (metadata) in the information source generally includes elements such as time, place, event, person, passage, result, and the like, and the elements can be associated through preprocessing. The user may also customize the keywords. For example, if the method is applied to the medical field, the user can preset common terms of some medical fields; or some synonyms or similar synonyms are preset according to different habits of the user. For example, in some places, say "bessored? What's "with other places? "the semantics are the same, and for example, Chinese and Japanese or other foreign languages can preset keywords through synonyms or synonyms.
The combination module 112 is configured to combine the keywords to obtain a plurality of query indexes, and establish a query dictionary through the query indexes, where the query dictionary includes the plurality of query indexes, and the query indexes at least include one or more types of phrases, words, morphemes, and sentences.
The query index contains all possible combinations of keywords in a mathematical sense. For example, in the above example, query indexes that can be combined include, but are not limited to: "2016 colorcast", "2015 colorcast", "2014 colorcast", "2016 refrigerator", "2015 refrigerator", "2014 refrigerator", "2016 washing machine", "2015 washing machine", "2014 washing machine", "2016 pin amount", "2015 pin amount", "2014 pin amount", "colorcast pin amount", "refrigerator pin amount", "washing machine pin amount", "pin amount" and the like. The query index may include one or more types of phrases, words, morphemes, or sentences, where a phrase refers to a combination of two or more words to distinguish words, such as "new society" or "old society". The word is the smallest unit of language that can be used independently, such as "kelp". Morphemes are the smallest phonetic, semantic knot and the smallest meaningful units of language, such as "run" and "jump". A sentence is a syntactically self-organizing unit consisting of a word or a group of words that are syntactically related. For example, "king this year is sixteen years old".
Preferably, after the query index is generated, the query phrase is normally subjected to specification processing to remove simple repetition and meaningless combinations, for example, for "2014 color tv" and "2014 color tv" it can be regarded as simple repetition, so as to remove one of them; and as for the "pin amount type", it can be regarded as a meaningless combination and thus removed. In this embodiment, the information retrieval device 110 includes a culling module 116 for removing duplicate and meaningless query phrases. Then, a query dictionary is established through the query indexes, and the query dictionary comprises a plurality of query indexes.
A receiving module 113, configured to receive a search term input by a user.
In this embodiment, the user may input the search term through an input device such as a screen, a key, or a microphone of the electronic device, where the search term may be in the form of text, voice, or gesture. Any entered search term is translated into a corresponding input of textual information, including complete sentences, incomplete words or phrases, morphemes, and the like.
And a query module 114 for matching the search term with a plurality of query indexes.
Sometimes, the search term input by the user is not completely consistent with the query index, and the search term input by the user needs to be matched with the query index to find the query index with the matching degree, so that the corresponding information in the information source is queried through the query index. For example, the search term input by the user is "2016 number of sales of a color tv", and the query phrase for finding that the matching degree is higher than the preset value is "2016 number of sales of a color tv", "2016 of a color tv", and the preset value is defined in advance, which is not limited in this embodiment.
And a display module 115 for displaying information corresponding to the query index having the matching degree of the search term higher than a preset value and information associated with the information.
Each query index corresponds to information in the information source, and when the matched query index is determined, the query module 114 finds corresponding information, and the display module 115 displays the information. Preferably, when the user inputs a search term, the display module 115 is further configured to sequentially display the query phrases according to the degree of matching with the search term. For example, when the user inputs "2016" during the process of inputting "2016 color tv sales", the display module 115 sequentially displays "2016 color tv", "2016 sales", and "2016 color tv sales" in the search query index according to the matching degree to guide the user to select, and when the user inputs "2016 color", the display module 115 sequentially displays "2016 color tv" and "2016 color tv sales" in the search query index according to the matching degree until the user inputs the complete "2016 color tv sales". The system is more intelligent and is convenient for users to inquire and use. In addition, the display module 115 displays information associated with information corresponding to the query index, the associated information being calculated by a specific algorithm. For example: the user searches for "zhang san", and the display module 115 not only displays the information of "zhang san" contained in the information source, but also displays the prompt information that "zhang san is a suspected person of theft", which is not contained in the original data source, and the suspicion that "zhang san" is suspected is inferred through calculation by some algorithm.
Preferably, the information retrieval apparatus 110 further comprises a matching group generation module 117 and a backup module 118. The matching group generating module 117 is configured to establish a matching group corresponding to the search term and the information source corresponding to the query index having the matching degree of the search term higher than the preset value. For example, if the search term input by the user is "2016 number of sales of color tv", and the search result includes information sources corresponding to "2016 number of sales of color tv", the matching group generation module 117 establishes a matching group for each of the information sources corresponding to "2016 number of sales of color tv" and "2016 number of sales of color tv", and the information sources corresponding to "2016 number of sales of color tv". After the user has obtained the search results, the matched groups may be evaluated, such as scored. During the search, the user may generate a context, for example, the user continuously inputs three questions such as: "sales income distributed by region in the last three years", "sales situation in Beijing area in the last three years", and "product of top10 in the Beijing area in the last three years". Then the matching group formed by the input and output of the third question should also include the top and bottom questions of the question after adding the new information source. The questions of these contexts, as well as the results and evaluations of the corresponding questions, will all make reference to the system's determination of the significance of the matched set formed by the third question. The information retrieval device 110 learns to generate individual method models based on a plurality of related contexts and matching groups, thereby establishing a weaker correlation for answering the open questions of the user.
The backup module 118 is used to backup the matchgroups to a new information source based on the user's evaluation of the matchgroups. After user evaluation, the matched set will be backed up in the new information source. When the user searches for the same or similar search term again, the query module 114 queries the new information source for corresponding information directly. The backup module 118 is further configured to backup information corresponding to the query index with the matching degree of the search term higher than the preset value to a new information source, and temporarily store the information.
Preferably, the information retrieval apparatus 110 further comprises an updating module 119, configured to add a matching group with a user's evaluation higher than a preset evaluation value into the query dictionary. In order to further improve the speed and accuracy of information retrieval by the user, when the evaluation of some of the matching groups is higher than a preset evaluation value, for example, the evaluation of the user on the matching groups is higher than 4, the matching groups are added into the query dictionary, and when the user inputs the same or similar search terms again, the query module 114 directly calls the information source of the corresponding matching group in the query dictionary, so that the query speed and the accuracy of the query result are increased.
Preferably, the information retrieving apparatus 110 further comprises a sorting module 120 for sorting the matching groups according to the evaluation level of the user. After the user inputs the search term, the query module 114 queries the information according to the evaluation order of the matching group, and the display module 115 displays the information according to the evaluation order of the matching group. Because different users have different retrieval requirements and different use habits and have different preferences for the results of retrieval and query, the embodiment of the invention ranks the retrieval results through the evaluation of the retrieval results by the users, and continuously updates the query phrases, so that the information in the query phrases has more directionality, the speed and the accuracy of the retrieval information are greatly improved, and it needs to be explained that the matching groups serving as synonyms or similar words have the same ranking weight.
Preferably, in this embodiment, the sorting module 120 is further configured to sort the query phrases according to the frequency of the matched queries, and some query phrases that are frequently matched are placed at positions that are easier to be retrieved, so as to increase the rate of retrieving queries.
Preferably, the information retrieval device 110 further comprises a word segmentation module 121 and a calling module 122. When the user inputs the search term, if the search term is a question sentence, the word segmentation module 121 is used for segmenting the search term. For example, a question sentence is "how to increase sales this year by 30%? "this word is divided into" this year "," sales "," 30% increase ".
The calling module 122 is configured to call out the core data according to each participle. Each participle corresponds to core data in the query index, conclusion data matched with the core data can be obtained according to the core data, and the conclusion data are answers corresponding to the question sentences. The display module 115 is further configured to display conclusion data with a matching degree with the core data higher than a preset value. Each conclusion data may not be the best answer to the question that the search term contains at the outset, and the update module 119 is configured to add the question core data and the search term selected by the user to the query dictionary in response to the user selecting the conclusion data, which will become increasingly accurate with continued updating, filtering, modifying, and selecting by the user.
Referring to fig. 4, a flowchart of an information retrieval method according to an embodiment of the present invention is shown, where the information retrieval method includes the following steps:
step S101, obtaining keywords, combining the keywords to obtain a plurality of query indexes, and establishing a query dictionary through the query indexes.
In the present embodiment, this step S101 may be performed by the acquisition module 111 and the combination module 112 together. The keywords are obtained by extraction from an information source or user self-definition, the query dictionary comprises a plurality of query indexes, and the query indexes at least comprise one or more types of phrases, words, morphemes and sentences.
Step S102, receiving a search term input by a user.
In the present embodiment, the step S102 may be performed by the receiving module 113.
Step S103, matching and inquiring the search terms and a plurality of inquiry indexes.
In this embodiment, this step S103 may be performed by the query module 114.
And step S104, displaying information corresponding to the query index with the matching degree of the search term higher than the preset value and information associated with the information.
In the present embodiment, this step S104 may be performed by the display module 115.
And step S105, establishing a matching group corresponding to the search term and the information source corresponding to the query index with the matching degree higher than the preset value.
In this embodiment, this step S105 may be performed by the matching group generation module 117.
Step S106, the matched group is backed up to a new information source according to the evaluation of the matched group by the user.
In this embodiment, this step S106 may be performed by the backup module 118.
And step S107, adding the matching group with the evaluation higher than the preset value into a query dictionary.
In the present embodiment, this step S107 may be performed by the updating module 119.
And S108, sorting the matching groups according to the evaluation level of the user.
In this embodiment, this step S108 may be performed by the sorting module 120.
Step S109, removing repeated and meaningless query indexes, and sorting the query indexes according to the frequency of matched queries.
In this embodiment, this step S109 may be performed by the culling module 116 and the sorting module 120 together.
Step S110, the query indexes are sequentially displayed according to the matching degree with the search terms.
In this embodiment, the step S110 may be performed by the display module 115.
Step S111, backups the information corresponding to the query index whose matching degree with the search term is higher than the preset value to a new information source.
In this embodiment, this step S111 may be performed by the backup module 118.
When a user inputs a search term, if the search term is a question sentence, please refer to fig. 5, the information retrieval method further includes the following steps:
step S112, performing word segmentation on the search term.
In the present embodiment, this step S112 may be performed by the word segmentation module 121.
And step S113, calling out core data according to each participle.
In the present embodiment, this step S113 may be performed by the calling module 122.
And step S114, displaying conclusion data with the matching degree with the core data higher than a preset value.
In this embodiment, the step S114 may be performed by the display module 115.
And step S115, responding to the selection of the user on the data of the result, and adding the question data and the search term selected by the user into a query dictionary.
In this embodiment, this step S115 may be performed by the updating module 119.
Since each step in the information retrieval method can be executed by each functional module in the information retrieval device 110, the principle thereof has been explained in the foregoing embodiments, and is not described herein again.
In summary, the embodiments of the present invention provide an information retrieval method, an information retrieval device, and an electronic device. The method comprises the steps of obtaining keywords, combining the keywords to obtain a plurality of query indexes, establishing a query dictionary through the query indexes, carrying out matching query on a search term input by a user and the query indexes in the query dictionary after receiving the search term, and displaying information corresponding to the query indexes with the matching degree of the search term higher than a preset value and information associated with the information. According to the invention, by establishing the query dictionary and establishing the mapping relation between the search term and the query result, the searched answer can be provided more efficiently and accurately.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The apparatus embodiments described above are merely illustrative, and for example, the flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In addition, the functional modules in the embodiments of the present invention may be integrated together to form an independent part, or each module may exist separately, or two or more modules may be integrated to form an independent part.
The functions, if implemented in the form of software functional modules and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes. It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention. It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.
Claims (13)
1. An information retrieval method, the method comprising:
acquiring keywords, wherein the keywords are obtained by extraction from an information source or user-defined;
combining the keywords to obtain a plurality of query indexes, and establishing a query dictionary through the query indexes, wherein the query dictionary comprises the query indexes, and the query indexes at least comprise one or more types of phrases, words, morphemes and sentences;
receiving a search term input by a user;
matching the search term with the plurality of query indexes;
displaying information corresponding to the query index with the matching degree of the search term higher than a preset value and information associated with the information;
establishing a matching group corresponding to the search term and an information source corresponding to a query index with the matching degree higher than a preset value;
backing up the matched group to a new information source according to the evaluation of the matched group by the user;
the method further comprises the following steps: performing word segmentation on the search term;
according to each participle, determining core data corresponding to each participle from the query index;
displaying conclusion data with the matching degree with the core data higher than a preset value;
and responding to the user selection of the conclusion data, and adding the conclusion data selected by the user and the search term into the query dictionary.
2. The information retrieval method of claim 1, wherein the method further comprises:
and adding the matching group with the user evaluation higher than a preset evaluation value into the query dictionary.
3. The information retrieval method of claim 2, wherein the method further comprises:
and sorting the matching groups according to the evaluation level of the user.
4. The information retrieval method of claim 1, wherein the method further comprises:
duplicate and meaningless query indexes are removed, and the query indexes are sorted according to the frequency of matched queries.
5. The information retrieval method of claim 1, wherein when a user inputs a search term, the method further comprises:
and sequentially displaying the query indexes according to the matching degree with the search terms.
6. The information retrieval method of claim 1, wherein the method further comprises:
and backing up the information corresponding to the query index with the matching degree higher than the preset value of the search term to a new information source.
7. An information retrieval apparatus, characterized in that the apparatus comprises:
the acquisition module is used for acquiring keywords, and the keywords are obtained by extraction from an information source or user-defined;
the combination module is used for combining the keywords to obtain a plurality of query indexes, and establishing a query dictionary through the query indexes, wherein the query dictionary comprises the query indexes which at least comprise one or more types of phrases, words, morphemes and sentences;
the receiving module is used for receiving search terms input by a user;
the query module is used for carrying out matching query on the search terms and the plurality of query indexes;
the display module is used for displaying information corresponding to the query index with the matching degree of the search term higher than a preset value and information associated with the information;
the matching group generating module is used for establishing a matching group corresponding to the search term and an information source corresponding to the query index with the matching degree higher than a preset value;
the backup module is used for backing up the matching group to a new information source according to the evaluation of the user on the matching group;
the device further comprises:
the word segmentation module is used for segmenting the search terms;
the calling module is used for determining core data corresponding to each participle from the query index according to each participle;
the display module is further used for displaying conclusion data of which the matching degree with the core data is higher than a preset value;
and the updating module is used for responding to the selection of the conclusion data by the user and adding the conclusion data selected by the user and the search term into the query dictionary.
8. The information retrieval device of claim 7, wherein the device further comprises:
and the updating module is used for adding the matching group with the evaluation higher than the preset evaluation value into the query dictionary.
9. The information retrieval device of claim 8, wherein the device further comprises:
and the sorting module is used for sorting the matching groups according to the evaluation level of the user.
10. The information retrieval device of claim 7, wherein the device further comprises:
a sift module for removing duplicate and meaningless query indexes;
and the sorting module is used for sorting and sorting the query indexes according to the frequency of the matched query.
11. The information retrieval device of claim 7, wherein when a user inputs a search term, the display module is further configured to display the query indexes in order according to the matching degree with the search term.
12. The information retrieval device of claim 7, wherein the device further comprises:
and the backup module is used for backing up the information corresponding to the query index with the matching degree higher than the preset value of the search term to a new information source.
13. An electronic device, comprising:
a processor;
a memory; and
an information retrieval device installed in the memory and including one or more software functional modules executed by the processor, the information retrieval device comprising:
the acquisition module is used for acquiring keywords, and the keywords are obtained by extraction from an information source or user-defined;
the combination module is used for combining the keywords to obtain a plurality of query indexes, and establishing a query dictionary through the query indexes, wherein the query dictionary comprises the query indexes which at least comprise one or more types of phrases, words, morphemes and sentences;
the receiving module is used for receiving search terms input by a user;
the query module is used for carrying out matching query on the search terms and the plurality of query indexes;
the display module is used for displaying information corresponding to the query index with the matching degree of the search term higher than a preset value and information associated with the information;
the matching group generating module is used for establishing a matching group corresponding to the search term and an information source corresponding to the query index with the matching degree higher than a preset value;
the backup module is used for backing up the matching group to a new information source according to the evaluation of the user on the matching group;
the device further comprises:
the word segmentation module is used for segmenting the search terms;
the calling module is used for determining core data corresponding to each participle from the query index according to each participle;
the display module is further used for displaying conclusion data of which the matching degree with the core data is higher than a preset value;
and the updating module is used for responding to the selection of the conclusion data by the user and adding the conclusion data selected by the user and the search term into the query dictionary.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710045053.9A CN106844638B (en) | 2017-01-19 | 2017-01-19 | Information retrieval method and device and electronic equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710045053.9A CN106844638B (en) | 2017-01-19 | 2017-01-19 | Information retrieval method and device and electronic equipment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106844638A CN106844638A (en) | 2017-06-13 |
CN106844638B true CN106844638B (en) | 2020-11-03 |
Family
ID=59120801
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710045053.9A Active CN106844638B (en) | 2017-01-19 | 2017-01-19 | Information retrieval method and device and electronic equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106844638B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110019678B (en) * | 2017-12-12 | 2023-08-29 | 北京百度网讯科技有限公司 | Information presentation and retrieval method and device |
CN109446417B (en) * | 2018-10-12 | 2021-09-21 | 湖北计研数字科技有限公司 | Intelligent retrieval method and device |
CN110162537A (en) * | 2019-04-19 | 2019-08-23 | 平安普惠企业管理有限公司 | Data query method and device, storage medium and electronic equipment |
CN111294275B (en) * | 2020-02-26 | 2022-10-14 | 上海云鱼智能科技有限公司 | User information indexing method, device, server and storage medium of IM tool |
CN111400253B (en) * | 2020-03-17 | 2023-04-21 | 北京华通人商用信息有限公司 | Statistical data query method and device, electronic equipment and storage medium |
CN111611489B (en) * | 2020-05-22 | 2022-05-20 | 北京字节跳动网络技术有限公司 | Search processing method and device, electronic equipment and storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1716244A (en) * | 2003-12-29 | 2006-01-04 | 西安迪戈科技有限责任公司 | Intelligent search, intelligent files system and automatic intelligent assistant |
CN1894689A (en) * | 2003-08-29 | 2007-01-10 | 伏泰劳普蒂克斯有限公司 | Method, device and software for querying and presenting search results |
CN101196898A (en) * | 2007-08-21 | 2008-06-11 | 新百丽鞋业(深圳)有限公司 | Method for applying phrase index technology into internet search engine |
CN101201838A (en) * | 2007-08-21 | 2008-06-18 | 新百丽鞋业(深圳)有限公司 | Method for improving searching engine based on keyword index using phrase index technique |
CN102200984A (en) * | 2010-03-24 | 2011-09-28 | 深圳市腾讯计算机系统有限公司 | Search method based on compound words and search engine server |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101963965B (en) * | 2009-07-23 | 2013-03-20 | 阿里巴巴集团控股有限公司 | Document indexing method, data query method and server based on search engine |
-
2017
- 2017-01-19 CN CN201710045053.9A patent/CN106844638B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1894689A (en) * | 2003-08-29 | 2007-01-10 | 伏泰劳普蒂克斯有限公司 | Method, device and software for querying and presenting search results |
CN1716244A (en) * | 2003-12-29 | 2006-01-04 | 西安迪戈科技有限责任公司 | Intelligent search, intelligent files system and automatic intelligent assistant |
CN101196898A (en) * | 2007-08-21 | 2008-06-11 | 新百丽鞋业(深圳)有限公司 | Method for applying phrase index technology into internet search engine |
CN101201838A (en) * | 2007-08-21 | 2008-06-18 | 新百丽鞋业(深圳)有限公司 | Method for improving searching engine based on keyword index using phrase index technique |
CN102200984A (en) * | 2010-03-24 | 2011-09-28 | 深圳市腾讯计算机系统有限公司 | Search method based on compound words and search engine server |
Also Published As
Publication number | Publication date |
---|---|
CN106844638A (en) | 2017-06-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106844638B (en) | Information retrieval method and device and electronic equipment | |
CN110968699B (en) | Logic map construction and early warning method and device based on fact recommendation | |
CN101408885B (en) | Modeling topics using statistical distributions | |
US10102254B2 (en) | Confidence ranking of answers based on temporal semantics | |
CN101872349B (en) | Method and device for treating natural language problem | |
US10127274B2 (en) | System and method for querying questions and answers | |
CN105408890B (en) | Performing operations related to listing data based on voice input | |
US9659084B1 (en) | System, methods, and user interface for presenting information from unstructured data | |
US8010524B2 (en) | Method of monitoring electronic media | |
US20180032606A1 (en) | Recommending topic clusters for unstructured text documents | |
AU2015203818B2 (en) | Providing contextual information associated with a source document using information from external reference documents | |
CN101408886A (en) | Selecting tags for a document by analyzing paragraphs of the document | |
US10460239B2 (en) | Generation of inferred questions for a question answering system | |
CN102640145A (en) | Trusted query system and method | |
CN101692223A (en) | Refining a search space inresponse to user input | |
AU2018411565B2 (en) | System and methods for generating an enhanced output of relevant content to facilitate content analysis | |
CN104239340A (en) | Search result screening method and search result screening device | |
Ojha et al. | Metadata driven semantically aware medical query expansion | |
US20080147631A1 (en) | Method and system for collecting and retrieving information from web sites | |
KR102126911B1 (en) | Key player detection method in social media using KeyplayerRank | |
US20190384812A1 (en) | Portfolio-based text analytics tool | |
Orogat et al. | CBench: Towards better evaluation of question answering over knowledge graphs | |
US20240086433A1 (en) | Interactive tool for determining a headnote report | |
CN113779981A (en) | Recommendation method and device based on pointer network and knowledge graph | |
CN115098619A (en) | Information duplication eliminating method and device, electronic equipment and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20201019 Address after: 310000 FN96, Building 5, No. 567 Jiangling Road, Xixing Street, Binjiang District, Hangzhou City, Zhejiang Province Applicant after: HANGZHOU HUISHU ZHITONG TECHNOLOGY Co.,Ltd. Address before: 310003 No. 335, Stadium Road, Xiacheng District, Zhejiang, Hangzhou Applicant before: Wang Bibo |
|
GR01 | Patent grant | ||
GR01 | Patent grant |