[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN102456054B - A kind of searching method and system - Google Patents

A kind of searching method and system Download PDF

Info

Publication number
CN102456054B
CN102456054B CN201010528444.4A CN201010528444A CN102456054B CN 102456054 B CN102456054 B CN 102456054B CN 201010528444 A CN201010528444 A CN 201010528444A CN 102456054 B CN102456054 B CN 102456054B
Authority
CN
China
Prior art keywords
search
query
search result
vertical
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201010528444.4A
Other languages
Chinese (zh)
Other versions
CN102456054A (en
Inventor
孙良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Shiji Guangsu Information Technology Co Ltd
Original Assignee
Shenzhen Shiji Guangsu Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Shiji Guangsu Information Technology Co Ltd filed Critical Shenzhen Shiji Guangsu Information Technology Co Ltd
Priority to CN201010528444.4A priority Critical patent/CN102456054B/en
Publication of CN102456054A publication Critical patent/CN102456054A/en
Application granted granted Critical
Publication of CN102456054B publication Critical patent/CN102456054B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of searching method, comprise: according to the query string identification user's of user's input query intention, in the time need to carrying out vertical search according to the query intention identifying is definite, carry out universal search and vertical search simultaneously, and vertical search result and generic search results are integrated, obtain final Search Results. The present invention also discloses a kind of search system simultaneously, adopts the present invention can fully understand user's query intention, for user provides abundanter Search Results, and realizes simply, conveniently.

Description

Searching method and system
Technical Field
The invention relates to a webpage information retrieval technology, in particular to a search method and a search system integrating various information.
Background
With the rapid development of network technology, obtaining required information data through a network has become an indispensable, even extremely important information obtaining way for people. At present, when a user needs to query a certain kind of information, a browser is mostly opened, a keyword of the information desired to be queried is input through a certain retrieval website, such as Baidu, Google and the like, then a webpage search engine searches related webpages according to the query word submitted by the user, and the searched webpage content is displayed to the user, which is a general webpage search mode and can be referred to as general search for short.
However, the general search has a large information amount, so that the query is not accurate enough, professional and deep enough, and therefore the general search gradually fails to meet the search requirements of people, and vertical search occurs to meet the requirements of users on professional, accurate and rich search results. The vertical search is a specialized search engine provided for a specific field, a specific crowd or a specific requirement, is a subdivision and extension of the search engine, and is a process of integrating specific information of a certain class in a webpage library once, extracting required data from directed subsections, and returning the data to a user in a certain form. The vertical search is characterized by being special, precise and deep, and the vertical search engine is more concentrated, concrete and deep compared with the disordering of mass information of a general search engine. The biggest difference between the vertical search engine and the general search engine is that structured information extraction is carried out on webpage information, unstructured data of a webpage are extracted into specific structured information data, the general search takes the webpage as the minimum unit, and the vertical search takes the structured data as the minimum unit.
In the prior art, when a user inputs a query word to search a webpage, only some text information can be provided, if the user wants to acquire search results such as video information, audio information, picture information and the like, the search results cannot be obtained directly, the user also needs to further query or click other links in related pages to enter a new webpage to search or acquire the new webpage, in short, the user needs to search for many times to find out the required search results, the search process is complex, and the required search results cannot be obtained directly. Also, no one has proposed a related scheme to combine the general search and the vertical search to provide a richer search result to the user in a simpler manner.
In addition, for web page search, in order to provide richer and more accurate search results for a user, the query intention of the user needs to be fully identified, and only if the query intention of the user is accurately known, a search result meeting the user requirement can be obtained, but no similar technical scheme is provided at present.
Disclosure of Invention
In view of this, embodiments of the present invention provide a search method and system, which can provide richer search results for a user, and are simple and convenient to implement.
The technical scheme of the invention is realized as follows:
the invention provides a searching method, which comprises the following steps: identifying the query intention of a user according to a query string input by the user, determining that vertical search is needed according to the identified query intention, simultaneously performing general search and vertical search, and integrating the vertical search result and the general search result to obtain a final search result.
In the above scheme, the method further comprises: and displaying the final search result.
In the above scheme, the query intention identifying the user according to the query string input by the user is as follows: receiving a query string input by a user, and preprocessing the received query string; and identifying the query intention of the user according to the preprocessing result.
In the above scheme, the method further comprises: when the vertical search is determined not to be needed according to the identified query intention, performing general search; and using the generic search result as a final search result.
In the above scheme, the method further comprises: and when the query intention is identified as the need of acquiring the character information and at least one other type of information except the character information, determining that vertical search is needed.
In the above scheme, the method further comprises: and sequencing the final search results according to the click rate condition.
The present invention also provides a search system, comprising: the system comprises a query string processing module, a universal search engine, a vertical search engine and a search result acquisition module; wherein,
the query string processing module is used for identifying the query intention of the user according to the query string input by the user and sending query requests to the general search engine and the vertical search engine when the vertical retrieval is determined to be required according to the identified query intention;
the universal search engine is used for completing universal search and sending a universal search result to the search result acquisition module;
the vertical search engine is used for completing vertical search and sending a vertical search result to the search result acquisition module;
and the search result acquisition module is used for integrating the received general search result and the vertical search result and taking the integrated search result as a final search result.
In the above scheme, the system further comprises a search result display module; the search result acquisition module is also used for sending the final search result to the search result display module; and the search result display module is used for displaying the final search result sent by the search result acquisition module.
In the foregoing solution, the query string processing module further includes: the device comprises a query string receiving unit, a preprocessing unit and an identification and judgment unit; the query string receiving unit is used for receiving a query string input by a user and sending the received query string to the preprocessing unit; the preprocessing unit is used for preprocessing the received query string and sending a preprocessing result to the identification and judgment unit; and the identification and judgment unit is used for identifying the query intention of the user according to the preprocessing result and sending a query request to the general search engine and the vertical search engine when determining that the vertical search is required.
Wherein, the identification and judgment unit is specifically configured to: and when the query intention is identified to be at least one other type of information except the character type information and the character type information according to the preprocessing result, determining that vertical search is required, and sending a query request to a general search engine and a vertical search engine.
According to the searching method and the searching system, the query intention of the user is identified according to the query string input by the user, when the vertical search is determined to be needed according to the identified query intention, the universal search and the vertical search are simultaneously carried out, and the search result of the vertical search and the search result of the universal search are integrated together and provided for the user. Therefore, various search results meeting the requirements of the user can be provided for the user, such as news, videos, pictures, blogs, forums and the like.
The invention identifies the query intention of the user according to the query string input by the user, not only can fully understand the query intention of the user, but also has simple and convenient implementation mode; the invention combines the universal search and the vertical search together, and can provide more professional, higher-precision and richer search results for users.
Drawings
FIG. 1 is a schematic flow chart of the implementation of the searching method of the present invention;
FIG. 2 is a schematic diagram of the structure of the search system according to the present invention.
Detailed Description
In the embodiment of the invention, the query intention of the user is identified according to the query string input by the user, when the vertical search is determined to be needed according to the identified query intention, the general search and the vertical search are simultaneously carried out, the vertical search result and the general search result are integrated, and the integrated search result is used as the final search result.
Here, if there is only a general search result, the general search result is directly used as a final search result; that is, if there are only generic search results, it is sufficient to process as in the prior art implementation.
Further, after the final search result is obtained, the obtained final search result can be displayed to the user.
The following describes the implementation of the present invention in detail with reference to the accompanying drawings.
The searching method of the present invention is shown in fig. 1, and comprises the following steps:
step 101: receiving a query string input by a user, and preprocessing the received query string;
here, the preprocessing mainly refers to performing word segmentation processing on a received query string, where the word segmentation processing is to identify the received query string according to phonetic meanings and segment the query string into a plurality of words, and the words can be used as query words respectively; for example, the query string received is: 26 nd fortune, the query string can be split into: 26. and the current fortune would have three words, and the 26 current fortune would have three query words. How to preprocess the query string belongs to the prior art, and is not detailed here.
Step 102: identifying the query intention of the user according to the preprocessing result;
here, the current query intention of the user can be analyzed according to the query term obtained after preprocessing, for example, if the current query term is mobile communication, the query intention is query text related information including news, noun introduction, related technology introduction, related product introduction, and the like; if the query term is Liu De Hua, the query expected by the user should include news information, video information, picture information, blog information, etc., in various combinations.
Since each user has a history search record as long as the user has searched before, in this step, the query intention of the user can be further identified by combining the preprocessing result and the history search record. Such as: when a user inputs a name for inquiry for the first time, only news information, character introduction information and the like can be provided, but the user acquires video information, blog information and picture information of the character through further searching, so that the news information, the character introduction information, the video information, the blog information and the picture information which need to be acquired after the name is input become history searching records; after the user retrieves the input query string again, if the name is confirmed, various types of information such as news information, text introduction information, video information, blog information, picture information and the like can be provided at the same time so as to meet the requirements of the user.
Step 103-105: determining whether vertical search is needed according to the identified user query intention, if vertical search is needed, performing vertical search by the search engine while completing general search, and then returning general search results and vertical search results by the search engine respectively to execute step 106 b; otherwise, the search engine only completes the general search, returns the general search result, and executes step 106 a;
for example, if it is determined that only the information of the text class needs to be acquired according to the preprocessed query word, it is determined that vertical search is not needed; and if it is confirmed that other types of information such as videos, pictures and/or blogs need to be acquired besides the character information according to the preprocessed query words, determining that vertical search is needed.
In the step, after determining whether the query intention comprises vertical search, sending a query request to a corresponding search engine, specifically, when determining that vertical search is not needed, only sending the query request to a general search engine, wherein the query string is carried in the request; and when the vertical search is determined to be needed, simultaneously sending a query request to the general search engine and the vertical search engine, wherein the query string is carried in the request.
For determining whether to perform vertical search, a query intention recognition model may be preset, and various rules or conditions of query intention are given in the query intention recognition model, so that which search is completed according to the preprocessing result or according to which rule the preprocessing result and the historical search record analysis conforms to, for example: information which only needs general search and conforms to the rule A, C, and information which needs vertical search and conforms to the rule B, D; correspondingly, the preprocessed query words are matched with rules or conditions in the query intention recognition model so as to recognize the current query intention of the user and further determine whether to perform vertical search.
Vertical search is adopted in the application, the data volume is relatively small, and quick updating can be almost achieved, so that the use experience of a user can be directly improved.
Step 106 a: taking the general search result as a final search result, and ending the current processing flow;
step 106 b: integrating the general search result and the vertical search result, and taking the integrated search result as a final search result;
here, for the case of only the general search result, the general search result is directly taken as the final search result; for the case of returning the general search result and the vertical search result at the same time, the two search results need to be integrated, that is, the two sets of search results are merged to obtain a union set, and the integrated search result is used as the final search result.
The final search results can be further sorted according to the click rate in the collected historical data, the search results which are clicked most are arranged at the top, and other search results can be sequentially arranged from the top to the bottom according to the click rate; and, because the click rate condition is changed in real time, the ranking of the search results can be adjusted in real time for the final search result obtained each time. Such as: the obtained final search results comprise seven search results A1-A7, and the seven search results can be arranged into A3, A5, A1, A2, A4, A6 and A7 according to historical click volumes; a3 may be a video search result, A5 may be an audio search result, A1 may be a picture search result, A2 may be a news search result, A4 may be a blog search result, and A6, A7 may be text search results.
Further, the method further comprises: displaying the obtained final search result to a user; wherein the final search result may be an ordered search result.
In order to implement the above method, the present invention further provides a search system, as shown in fig. 2, including: the system comprises a query string processing module, a universal search engine, a vertical search engine and a search result acquisition module; wherein,
the query string processing module is used for identifying the query intention of the user according to the query string input by the user and sending query requests to the general search engine and the vertical search engine when the vertical retrieval is determined to be required according to the identified query intention; the query request carries a query string.
The universal search engine is used for completing universal search and sending a universal search result to the search result acquisition module;
the vertical search engine is used for completing vertical search and sending a vertical search result to the search result acquisition module;
and the search result acquisition module is used for integrating the received general search result and the vertical search result and taking the integrated search result as a final search result.
Further, if the query string processing module determines that vertical retrieval is not required according to the identified query intention, the query string processing module only sends a query request to the general search engine; correspondingly, only the universal search engine sends the universal search result to the search result acquisition module; and the search result acquisition module takes the universal search result as a final search result only when receiving the universal search result.
The query string processing module further includes: the device comprises a query string receiving unit, a preprocessing unit and an identification and judgment unit; wherein,
the query string receiving unit is used for receiving a query string input by a user and sending the received query string to the preprocessing unit; the preprocessing unit is used for preprocessing the received query string and sending a preprocessing result to the identification and judgment unit;
here, the preprocessing unit may be a word segmentation unit, and the word segmentation unit may divide the received query string into one or more words according to the linguistic meaning, and then send the word segmentation result to the recognition and judgment unit;
and the identification and judgment unit is used for identifying the query intention of the user according to the preprocessing result and sending a query request to the general search engine and the vertical search engine when the vertical search is determined to be needed.
The identification and judgment unit is specifically configured to: and when the query intention is identified to be at least one other type of information except the character type information and the character type information according to the preprocessing result, determining that vertical search is required, and sending a query request to a general search engine and a vertical search engine.
In addition, the identifying and judging unit only sends a query request to the general search engine when determining that the vertical search is not required.
The system also includes a search result display module; the search result acquisition module is further configured to send the final search result to the search result display module, and the search result display module is configured to display the final search result sent by the search result acquisition module.
The above description is only exemplary of the present invention and should not be taken as limiting the scope of the present invention, and any modifications, equivalents, improvements, etc. that are within the spirit and principle of the present invention should be included in the present invention.

Claims (8)

1. A search method is characterized in that a query intention identification model is preset, and the query intention identification model comprises rules or conditions of various query intentions; the method further comprises the following steps:
receiving a query string input by a user, preprocessing the query string, identifying the query intention of the user by combining the preprocessing result and a historical search record, matching the preprocessed query word with rules or conditions in a preset query intention identification model according to the identified query intention, and determining whether to perform vertical search; when the fact that vertical search is needed is determined, general search and vertical search are conducted at the same time, and a vertical search result and a general search result are integrated to obtain a final search result; and sequencing the final search results obtained currently according to the click rate condition in the collected historical data.
2. The method of claim 1, further comprising: and displaying the final search result.
3. The method according to claim 1 or 2, characterized in that the method further comprises: when the vertical search is determined not to be needed according to the identified query intention, performing general search; and using the generic search result as a final search result.
4. A method according to claim 1 or 2, characterized in that the method further comprises: and when the query intention is identified as the need of acquiring the character information and at least one other type of information except the character information, determining that vertical search is needed.
5. A search system, the system comprising: the system comprises a query string processing module, a universal search engine, a vertical search engine and a search result acquisition module; wherein,
the query string processing module is used for receiving and preprocessing a query string input by a user, identifying the query intention of the user by combining the preprocessing result and the historical search record, matching the preprocessed query word with rules or conditions in a preset query intention identification model according to the identified query intention, and determining whether to perform vertical search; when determining that vertical retrieval is needed, sending a query request to a general search engine and a vertical search engine;
the universal search engine is used for completing universal search and sending a universal search result to the search result acquisition module;
the vertical search engine is used for completing vertical search and sending a vertical search result to the search result acquisition module;
the search result acquisition module is used for integrating the received general search result and the vertical search result and taking the integrated search result as a final search result; and the final search results are further sorted according to the click rate in the collected historical data.
6. The system of claim 5, further comprising a search result display module;
the search result acquisition module is also used for sending the final search result to the search result display module;
and the search result display module is used for displaying the final search result sent by the search result acquisition module.
7. The system of claim 5 or 6, wherein the query string processing module further comprises: the device comprises a query string receiving unit, a preprocessing unit and an identification and judgment unit; wherein,
the query string receiving unit is used for receiving a query string input by a user and sending the received query string to the preprocessing unit;
the preprocessing unit is used for preprocessing the received query string and sending a preprocessing result to the identification and judgment unit;
and the identification and judgment unit is used for identifying the query intention of the user according to the preprocessing result and sending a query request to the general search engine and the vertical search engine when determining that the vertical search is required.
8. The system of claim 7, wherein the identification and determination unit is specifically configured to: and when the query intention is identified to be at least one other type of information except the character type information and the character type information according to the preprocessing result, determining that vertical search is required, and sending a query request to a general search engine and a vertical search engine.
CN201010528444.4A 2010-10-28 2010-10-28 A kind of searching method and system Active CN102456054B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010528444.4A CN102456054B (en) 2010-10-28 2010-10-28 A kind of searching method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010528444.4A CN102456054B (en) 2010-10-28 2010-10-28 A kind of searching method and system

Publications (2)

Publication Number Publication Date
CN102456054A CN102456054A (en) 2012-05-16
CN102456054B true CN102456054B (en) 2016-05-18

Family

ID=46039251

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010528444.4A Active CN102456054B (en) 2010-10-28 2010-10-28 A kind of searching method and system

Country Status (1)

Country Link
CN (1) CN102456054B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982173B (en) * 2012-12-17 2016-10-12 北京奇虎科技有限公司 A kind of web search method and device
CN103885979B (en) 2012-12-21 2018-06-05 深圳市世纪光速信息技术有限公司 The method and apparatus of pushed information
CN104778176A (en) * 2014-01-13 2015-07-15 阿里巴巴集团控股有限公司 Data search processing method and device
CN105159898B (en) * 2014-06-12 2019-11-26 北京搜狗科技发展有限公司 A kind of method and apparatus of search
CN104615620B (en) * 2014-06-24 2018-07-24 腾讯科技(深圳)有限公司 Map search kind identification method and device, map search method and system
CN104408113A (en) * 2014-11-25 2015-03-11 百度在线网络技术(北京)有限公司 Information search method and device
CN104462510B (en) * 2014-12-22 2018-09-11 北京奇虎科技有限公司 Searching method based on user search intent and device
CN105786869B (en) * 2014-12-23 2020-05-29 北京奇虎科技有限公司 Method and device for obtaining question and answer special topic data based on search
CN104965918B (en) * 2015-07-06 2018-09-25 无锡天脉聚源传媒科技有限公司 A kind of searching method and device based on searching keyword
CN108287833A (en) * 2017-01-09 2018-07-17 北京艺鉴通科技有限公司 It is a kind of for the art work identification to scheme to search drawing method
CN114398393A (en) * 2022-01-14 2022-04-26 北京金山云网络技术有限公司 Data searching method and device and electronic equipment
CN114547474A (en) * 2022-04-21 2022-05-27 北京泰迪熊移动科技有限公司 Data searching method, system, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101052181A (en) * 2007-05-22 2007-10-10 中国移动通信集团浙江有限公司 Wireless searching system and its wireless searching method
CN101110077A (en) * 2007-08-24 2008-01-23 新诺亚舟科技(深圳)有限公司 Method for implementing associated searching on handhold learning terminal
CN101262494A (en) * 2008-01-23 2008-09-10 华为技术有限公司 Method, client, server and system for processing distributed information
CN101630330A (en) * 2009-08-14 2010-01-20 苏州锐创通信有限责任公司 Method for webpage classification
CN101807213A (en) * 2010-05-11 2010-08-18 天津大学 Method for vertical search of webpage

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070266015A1 (en) * 2006-05-12 2007-11-15 Microsoft Corporation User Created Search Vertical Control of User Interface
US20090024410A1 (en) * 2007-07-16 2009-01-22 David Burgess Vertical content searching method and apparatus

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101052181A (en) * 2007-05-22 2007-10-10 中国移动通信集团浙江有限公司 Wireless searching system and its wireless searching method
CN101110077A (en) * 2007-08-24 2008-01-23 新诺亚舟科技(深圳)有限公司 Method for implementing associated searching on handhold learning terminal
CN101262494A (en) * 2008-01-23 2008-09-10 华为技术有限公司 Method, client, server and system for processing distributed information
CN101630330A (en) * 2009-08-14 2010-01-20 苏州锐创通信有限责任公司 Method for webpage classification
CN101807213A (en) * 2010-05-11 2010-08-18 天津大学 Method for vertical search of webpage

Also Published As

Publication number Publication date
CN102456054A (en) 2012-05-16

Similar Documents

Publication Publication Date Title
CN102456054B (en) A kind of searching method and system
US9070046B2 (en) Learning-based image webpage index selection
EP2570935B1 (en) Method for personalized meta-search and application terminal thereof
CN105701254B (en) Information processing method and device for information processing
CN103678576B (en) The text retrieval system analyzed based on dynamic semantics
WO2017024884A1 (en) Search intention identification method and device
WO2017024553A1 (en) Information emotion analysis method and system
CN101986306B (en) Method and equipment for acquiring yellow page information based on query sequence
CN102968465B (en) Network information service platform and the search service method based on this platform thereof
CN103455487B (en) The extracting method and device of a kind of search term
CN109918555B (en) Method, apparatus, device and medium for providing search suggestions
JP2013531289A (en) Use of model information group in search
US12124457B2 (en) Triggering local extensions based on inferred intent
CN107861753B (en) APP generation index, retrieval method and system and readable storage medium
US9971828B2 (en) Document tagging and retrieval using per-subject dictionaries including subject-determining-power scores for entries
US8938408B1 (en) Systems and methods for classification and segmentation of browsing logs based on user's search goals
CN103744954B (en) Word relevancy network model establishing method and establishing device thereof
US10127322B2 (en) Efficient retrieval of fresh internet content
CN107992563B (en) Recommendation method and system for user browsing content
CN102609539A (en) Search method and search system
CN103425767B (en) A kind of determination method and system pointing out data
CN103064907A (en) System and method for topic meta search based on unsupervised entity relation extraction
CN106202312B (en) A kind of interest point search method and system for mobile Internet
US20170235835A1 (en) Information identification and extraction
CN109446417B (en) Intelligent retrieval method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: SHENZHEN SHIJI LIGHT SPEED INFORMATION TECHNOLOGY

Free format text: FORMER OWNER: TENGXUN SCI-TECH (SHENZHEN) CO., LTD.

Effective date: 20131024

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518044 SHENZHEN, GUANGDONG PROVINCE TO: 518057 SHENZHEN, GUANGDONG PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20131024

Address after: A Tencent Building in Shenzhen Nanshan District City, Guangdong streets in Guangdong province science and technology 518057 16

Applicant after: Shenzhen Shiji Guangsu Information Technology Co., Ltd.

Address before: Shenzhen Futian District City, Guangdong province 518044 Zhenxing Road, SEG Science Park 2 East Room 403

Applicant before: Tencent Technology (Shenzhen) Co., Ltd.

C14 Grant of patent or utility model
GR01 Patent grant