[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1060745.1060804acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

Automatic identification of user goals in Web search

Published: 10 May 2005 Publication History

Abstract

There has been recent interests in studying the "goal" behind a user's Web query, so that this goal can be used to improve the quality of a search engine's results. Previous studies have mainly focused on using manual query-log investigation to identify Web query goals. In this paper we study whether and how we can automate this goal-identification process. We first present our results from a human subject study that strongly indicate the feasibility of automatic query-goal identification. We then propose two types of features for the goal-identification task: user-click behavior and anchor-link distribution. Our experimental evaluation shows that by combining these features we can correctly identify the goals for 90% of the queries studied.

References

[1]
D. Hawking and N. Craswell. Overview of the TREC-2001 Web track. In Proceedings of the Tenth Text REtrieval Conference (TREC-10), 2001.
[2]
N. Craswell, D. Hawking, and S. Robertson. Effective site finding using link anchor information. In Proceedings of ACM SIGIR '01, 2001.
[3]
T. Westerveld, W. Kraaij, and D. Hiemstra. Retrieving web pages using content, links, URLs and anchors. In Proceedings of the Tenth Text REtrieval Conference (TREC-10), 2001.
[4]
S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. In Proceedings of the Seventh Int'l World Wide Web Conf., 1998.
[5]
J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46, 1999.
[6]
A. Broder. A taxonomy of Web search. SIGIR Forum, 36(2), 2002.
[7]
I. Kang and G. Kim. Query type classification for web document retrieval. In Proceedings of ACM SIGIR '03, 2003.
[8]
D.E. Rose and D. Levinson. Understanding user goals in Web search. In Proceedings of the Thirteenth Int'l World Wide Web Conf., 2004.
[9]
H. Zeng, Q. He, Z. Chen, W. Ma, and J. Ma. Learning to cluster web search results. In Proceedings of ACM SIGIR '04, 2004.
[10]
Oren Etzioni Oren Zamir. Grouper: a dynamic clustering interface to web search results. In Proceedings of the Eighth Int'l World Wide Web Conf., 1999.
[11]
Vivisimo search engine. http://vivisimo.com/.
[12]
C. Olston and E. H. Chi. Scenttrails: Integrating browsing and searching on the world wide web. ACM Transactions on Computer-Human Interaction, 10(3):177--197, September 2003.
[13]
M. Chen, M. Hearst, J. Hong, and J. Lin. Cha-Cha: A system for organizing intranet search results. In Proceedings of the 2nd USENIX Symposium on Internet Technologies and Systems, 1999.
[14]
U. Lee, Z. Liu, and J. Cho. Automatic identification of user goals in web search. Technical report, UCLA Computer Science, 2004.
[15]
A. Sugiura and O. Etzioni. Query routing for web search engines: Architecture and experiments. In Proceedings of the Ninth Int'l World Wide Web Conf., 2000.
[16]
C. Silverstein, M. Henzinger, H. Marais, and M. Moricz. Analysis of a very large Web search engine query log. SIGIR Forum, 33(1):6 -- 12, 1999.
[17]
Danny Sullivan. Searches per day. http://searchenginewatch.com/reports/article.php/2156461, 2003.
[18]
Z. Gyongyi, H. Garcia-Molina, and J. Pedersen. Combating web spam with trustrank. In Proceedings of VLDB '04, 2004.
[19]
T. Haveliwala. Topic-sensitive pagerank. In Proceedings of the Eleventh Int'l World Wide Web Conf., 2002.
[20]
J. Cho, N. Shivakumar, and H. Garcia-Molina. Finding replicated web collections. In Proceedings of ACM SIGMOD '00, 2000.
[21]
K. Bharat and A. Broder. Mirror, mirror, on the Web: A study of host pairs with replicated content. In Proceedings of the Eighth Int'l World Wide Web Conf., 1999.
[22]
J.L. Devore. Probability and Statistics for Engineering and the Sciences. Duxbury, 6th edition, 2004.
[23]
J.R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, 1993.
[24]
N. Cristianini and J. Shawe-Taylor. An introduciton to Support Vector Machines. Cambridge University Press, 2000.
[25]
D.D. Wackerly, W. Mendenhall III, and R.L. Scheaffer. Mathematical Statistics with Applications. Duxbury, 6th edition, 2002.
[26]
C. Hoelscher. How Internet experts search for information on the Web. In Proceedings of WebNet '98, 1998.
[27]
B.J. Jansen, A. Spink, and T. Saracevic. Real life, real users, and real needs: A study and analysis of user queries on the Web. Information Processing and Management, 36(2):207 -- 227, 2000.
[28]
A. Spink, B.J. Jansen, D. Wolfram, and T. Saracevic. From E-Sex to E-Commerce: Web search changes. IEEE Computer, 35(3):107 -- 109, 2002.
[29]
B.J. Jansen and U. Pooch. A review of Web searching studies and a framework for future research. J. of the American Society of Information Science and Technology, 52(3):235 -- 246, 2001.
[30]
D. Beeferman and A. Berger. Agglomerative clustering of a search engine query log. In Proceedings of ACM SIGKDD '00, 2000.
[31]
B.D. Davison, D.G. Deschenes, and D.B. Lewanda. Finding relevant Website queries. In Proceedings of the Twelfth Int'l World Wide Web Conf., 2003.
[32]
R. Kraft and J. Zien. Mining anchor text for query refinement. In Proceedings of the Thirteenth Int'l World Wide Web Conf., 2004.
[33]
N. Eiron and K.S. McCurley. Analysis of anchor text for Web search. In Proceedings of ACM SIGIR '03, 2003.

Cited By

View all
  • (2023)The vision of on-demand architectural knowledge systems as a decision-making companionJournal of Systems and Software10.1016/j.jss.2022.111560198:COnline publication date: 1-Apr-2023
  • (2022)Navigational, Informational or Punk-Rock? An Exploration of Search Intent in the Musical DomainProceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization10.1145/3503252.3531308(202-211)Online publication date: 4-Jul-2022
  • (2022)ORCAS-IProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531737(3057-3066)Online publication date: 6-Jul-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '05: Proceedings of the 14th international conference on World Wide Web
May 2005
781 pages
ISBN:1595930469
DOI:10.1145/1060745
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 May 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Web search
  2. query classification
  3. user goals

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)73
  • Downloads (Last 6 weeks)6
Reflects downloads up to 12 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)The vision of on-demand architectural knowledge systems as a decision-making companionJournal of Systems and Software10.1016/j.jss.2022.111560198:COnline publication date: 1-Apr-2023
  • (2022)Navigational, Informational or Punk-Rock? An Exploration of Search Intent in the Musical DomainProceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization10.1145/3503252.3531308(202-211)Online publication date: 4-Jul-2022
  • (2022)ORCAS-IProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531737(3057-3066)Online publication date: 6-Jul-2022
  • (2022)Intent Detection in Urdu Queries using Fine-tuned BERT models2022 16th International Conference on Open Source Systems and Technologies (ICOSST)10.1109/ICOSST57195.2022.10016834(1-6)Online publication date: 14-Dec-2022
  • (2022)Understanding Query Intention in Search Queries of Learners in Blended Learning Environments2022 IEEE 4th International Conference on Cybernetics, Cognition and Machine Learning Applications (ICCCMLA)10.1109/ICCCMLA56841.2022.9989024(308-315)Online publication date: 8-Oct-2022
  • (2022)Cognitive Information RetrievalAdvances in Information Retrieval10.1007/978-3-030-99739-7_58(473-479)Online publication date: 5-Apr-2022
  • (2021)Learning to Represent Human Motives for Goal-directed Web BrowsingProceedings of the 15th ACM Conference on Recommender Systems10.1145/3460231.3474260(361-371)Online publication date: 13-Sep-2021
  • (2021)Evaluation of Intent Classification Models on Frequently Asked Railway QueriesProceedings of International Conference on Data Science and Applications10.1007/978-981-16-5348-3_3(41-56)Online publication date: 23-Nov-2021
  • (2020)Closed sequential pattern mining for sitemap generationWorld Wide Web10.1007/s11280-020-00839-224:1(175-203)Online publication date: 27-Sep-2020
  • (2020)Feature selection for classifying multi-labeled past eventsInternational Journal on Digital Libraries10.1007/s00799-020-00293-5Online publication date: 8-Sep-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media