[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1321440.1321556acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Evaluation of phrasal query suggestions

Published: 06 November 2007 Publication History

Abstract

This paper evaluates the uptake and efficacy of a unified approach to phrasal query suggestions in the context of a high-precision search engine. The search engine performs ranked extended-Boolean searches with the proximity operator <scp>NEAR</scp> being the default operation. Suggestions are offered to the searcher when the length of the result list falls outside predefined bounds. If the list is too long, the engine suggests narrowing the query through the use of super phrases; if the list is too short, the engine suggests broadening the query through the use of proximal subphrases.
We evaluated uptake of phrasal query suggestions by analyzing search log data from before and after the suggestion feature was added to a commercial version of the search engine. We looked at approximately 1.5 million queries and found that, after they were added, suggestions represented nearly 30% of the total queries.
We evaluated efficacy through a controlled study of 24 participants performing nine searches using three different search engines. We found that the engine with phrase suggestions had better high-precision recall than both the same search engine without suggestions and a search engine with a similar interface but using an Okapi BM25 ranking algorithm.

References

[1]
Anick, P. G. and Tipirneni, S. The paraphrase search assistant: terminological feedback for iterative information seeking. In Proceedings of the 22nd Annual international ACM SIGIR Conference on Research and Development in information Retrieval (Berkeley, California, United States, August 15 - 19, 1999).
[2]
Anick, P. Using terminological feedback for web search refinement: a log-based study. In Proceedings of the 26th Annual international ACM SIGIR Conference on Research and Development in informaion Retrieval (Toronto, Canada, July 28 - August 01, 2003).
[3]
Bailey, P., Craswell, N., and Hawking, D. Engineering a multi-purpose test collection for web retrieval experiments. Inf. Process. Manage. 39, 6 (Nov. 2003), 853--871.
[4]
Belkin, N. J., Cool, C., Kelly, D., Lin, S., Park, S. Y., Perez-Carballo, J., and Sikora, C. Iterative exploration, design and evaluation of support for query reformulation in interactive information retrieval. Inf. Process. Manage. 37, 3 (May. 2001), 403--434.
[5]
P. D. Bruza and S. Dennis. Query ReFormulation on the Internet: Empirical Data and the Hyperindex Search Engine, Proceedings of, RIAO'97, 488--499, 1997.
[6]
Buckley, C., Dimmick, D., Soboroff, I., and Voorhees, E. Bias and the limits of pooling. In Proceedings of the 29th Annual international ACM SIGIR Conference on Research and Development in information Retrieval (Seattle, Washington, USA, August 06 - 11, 2006).
[7]
Chau, M., Fang, X., and Sheng, O. R. 2005. Analysis of the query logs of a web site search engine. J. Am. Soc. Inf. Sci. Technol. 56, 13 (Nov. 2005), 1363--1376.
[8]
http://www.clusty.com
[9]
Nick Craswell and David Hawking. Overview of the TREC-2004 Web Track. In Proceedings of the Thirteenth Text REtrieval Conference (TREC 2004).
[10]
D. R. Cutting, D. R. Karger, J. O. Pedersen and J. W. Tukey, Scatter/Gather: a cluster-based approach to browsing large document collections. In Proceedings of the 15th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'92), 1992, pp 318--329.
[11]
Gauch, S. and Smith, J. B. Search improvement via automatic query reformulation. ACM Trans. Inf. Syst. 9, 3 (Jul. 1991), 249--280.
[12]
Gutwin, C., Paynter, G., Witten, I., Nevill-Manning, C., and Frank, E. Improving browsing in digital libraries with keyphrase indexes. Decis. Support Syst. 27, 1-2 (Nov. 1999), 81--104.
[13]
Daqing He &amp; Ayşşe Göker. Detecting Session Boundaries from Web User Logs. Proceedings of the BCS-IRSG 22nd Annual Colloquium on Information Retrieval Research, 2000.
[14]
Henninger, S. and Belkin, N. Interface issues and interaction strategies for information retrieval systems. Conference Companion on Human Factors in Computing Systems (Denver, Colorado, United States, May 07 - 11, 1995).
[15]
William Hersh. TREC 2002 Interactive Track Report. In Proceedings of the Eleventh Text REtrieval Conference (TREC 2002).
[16]
Jansen, B. J., Spink, A., and Saracevic, T. Real life, real users, and real needs: a study and analysis of user queries on the web. Inf. Process. Manage. 36, 2 (Jan. 2000), 207--227.
[17]
Bernard J. Jansen and Michael D. McNeese. Evaluating the Effectiveness of and Patterns of Interactions With Automated Searching Assistance, Journal of the American Society for Information Science and Technology, 56(14), 2005.
[18]
http://www.kartoo.com
[19]
S. Khan, H. Jameel, A. Sajjad, and H. Iqbal. Evaluation of Proximity Search and Auto-Concept Expansion on a Web Newspapers Document Collection: Results Benchmarked against the Boolean and Vector Space Models. ACM Journal of Computer Documentation (accepted for publication).
[20]
Koenemann, J. and Belkin, N. J. 1996. A case for interaction: a study of interactive information retrieval behavior and effectiveness. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems: Common Ground (Vancouver, British Columbia, Canada, April 13 - 18, 1996).
[21]
http://www.northernlight.com/
[22]
Rao, R., Pedersen, J. O., Hearst, M. A., Mackinlay, J. D., Card, S. K., Masinter, L., Halvorsen, P., and Robertson, G. C. Rich Interaction in the Digital Library, Communications of the ACM, 38:4, pp. 29--39, 1995.
[23]
S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, and M. Gatford. Okapi at TREC-3, The Third Text Retrieval Conference (TREC-3) November 1994.
[24]
Salton, G. and Buckley, C. Improving retrieval performance by relevance feedback. In Readings in information Retrieval, K. Sparck Jones and P. Willett, Eds. Morgan Kaufmann Multimedia Information And Systems Series. Morgan Kaufmann Publishers, San Francisco, CA, 355--364, 1997.
[25]
Siegfried, Susan, Marcia J. Bates, and Deborah N. Wilde. A Profile of End-User Searching Behavior by Humanities Scholars: The Getty Online Searching Project Report No. 2. Journal of the American Society for Information Science 44 (June 1993): 273--291.
[26]
Silverstein, C., Marais, H., Henzinger, M., and Moricz, M. Analysis of a very large web search engine query log. SIGIR Forum 33, 1 (Sep. 1999), 6--12.
[27]
Wang, P., Berry, M. W., and Yang, Y. Mining longitudinal web queries: trends and patterns. J. Am. Soc. Inf. Sci. Technol. 54, 8 (Jun. 2003), 743--758.
[28]
Oren Eli Zamir. Zamir, O. and Etzioni, O. Grouper: a dynamic clustering interface to Web search results. Proceeding of the Eighth international Conference on World Wide Web (Toronto, Canada), 1999.
[29]
Zobel, J. How reliable are the results of large-scale information retrieval experiments? In Proceedings of the 21st Annual international ACM SIGIR Conference on Research and Development in information Retrieval (Melbourne, Australia, August 24 - 28, 1998).

Cited By

View all

Index Terms

  1. Evaluation of phrasal query suggestions

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
      November 2007
      1048 pages
      ISBN:9781595938039
      DOI:10.1145/1321440
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 06 November 2007

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. proximity search
      2. query Log analysis
      3. user study
      4. web search

      Qualifiers

      • Research-article

      Conference

      CIKM07

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)4
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 11 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)The Infinite Index: Information Retrieval on Generative Text-To-Image ModelsProceedings of the 2023 Conference on Human Information Interaction and Retrieval10.1145/3576840.3578327(172-186)Online publication date: 19-Mar-2023
      • (2023)Investigation of Bias in Web Search QueriesAdvances in Information Retrieval10.1007/978-3-031-28241-6_50(443-449)Online publication date: 2-Apr-2023
      • (2020)Query SuggestionQuery Understanding for Search Engines10.1007/978-3-030-58334-7_8(171-203)Online publication date: 2-Dec-2020
      • (2019)Disjunctive Sets of Phrase Queries for Diverse Query SuggestionIEEE/WIC/ACM International Conference on Web Intelligence10.1145/3350546.3352566(449-455)Online publication date: 14-Oct-2019
      • (2017)Large-scale Generative Query AutocompletionProceedings of the 22nd Australasian Document Computing Symposium10.1145/3166072.3166083(1-8)Online publication date: 7-Dec-2017
      • (2017)Mailbox-Based vs. Log-Based Query Completion for Mail SearchProceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3077136.3080683(937-940)Online publication date: 7-Aug-2017
      • (2017)Learning to evaluate and recommend query in restaurant search systemsInformation Systems and e-Business Management10.1007/s10257-016-0309-815:1(51-68)Online publication date: 1-Feb-2017
      • (2016)Enhancing keyword suggestion of web serach by leveraging microblog dataJournal of Web Engineering10.5555/3177210.317721115:3-4(181-202)Online publication date: 1-Jul-2016
      • (2014)Generating relevant and diverse query phrase suggestions using topical n-gramsProceedings of the 5th Symposium on Information and Communication Technology10.1145/2676585.2676601(49-56)Online publication date: 4-Dec-2014
      • (2013)Query suggestions for textual problem solution repositoriesProceedings of the 35th European conference on Advances in Information Retrieval10.1007/978-3-642-36973-5_48(569-581)Online publication date: 24-Mar-2013
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media