[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2187980.2188197acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
tutorial

Learning from users' querying experience on intranets

Published: 16 April 2012 Publication History

Abstract

Query recommendation is becoming a common feature of web search engines especially those for Intranets where the context is more restrictive. This is because of its utility for supporting users to find relevant information in less time by using the most suitable query terms. Selection of queries for recommendation is typically done by mining web documents or search logs of previous users. We propose the integration of these approaches by combining two models namely the concept hierarchy, typically built from an Intranet's documents, and the query flow graph, typically built from search logs. However, we build our concept hierarchy model from terms extracted from a subset (training set) of search logs since these are more representative of the user view of the domain than any concepts extracted from the collection. We then continually adapt the model by incorporating query refinements from another subset (test set) of the user search logs. This process implies learning from or reusing previous users' querying experience to recommend queries for a new but similar user query. The adaptation weights are extracted from a query flow graph built with the same logs. We evaluated our hybrid model using documents crawled from the Intranet of an academic institution and its search logs. The hybrid model was then compared to a concept hierarchy model and query flow graph built from the same collection and search logs respectively. We also tested various strategies for combining information in the search logs with respect to the frequency of clicked documents after query refinement. Our hybrid model significantly outperformed the concept hierarchy model and query flow graph when tested over two different periods of the academic year. We intend to further validate our experiments with documents and search logs from another institution and devise better strategies for selecting queries for recommendation from the hybrid model.

References

[1]
M.-D. Albakour, U. Kruschwitz, I. Adeyanju, D. Song, M. Fasli, and A. De Roeck. Enriching query flow graphs with click information. In AIRS'11 proceedings, pages 193--204. Springer, 2011.
[2]
M.-D. Albakour, N. Nanas, U. Kruschwitz, M. Fasli, Y. Kim, D. Song, and A. De Roeck. Autoeval: An evaluation methodology for evaluating query suggestions using query logs. In ECIR'2011 proceedings, pages 605--610, 2011.
[3]
P. Anick. Using terminological feedback for web search refinement - a log-based study. In SIGIR '03 proceedings, pages 88--95. ACM, 2003.
[4]
R. Baeza-Yates and A. Tiberi. Extracting semantic relations from query logs. In SIGKDD'07 proceedings, pages 76--85, 2007.
[5]
P. Boldi, F. Bonchi, C. Castillo, D. Donato, A. Gionis, and S. Vigna. The query-flow graph: model and applications. In CIKM '08, pages 609--618, 2008.
[6]
I. Bordino, C. Castillo, D. Donato, and A. Gionis. Query similarity by projecting the query-flow graph. In SIGIR'10 proceedings, pages 515--522. ACM, 2010.
[7]
J. Callan, B. W. Croft, and S. M. Harding. The inquery retrieval system. In Proceedings of the Third International Conference on Database and Expert Systems Applications, pages 78--83. Springer, 1992.
[8]
D. Carmel, E. Farchi, Y. Petruschka, and A. Soffer. Automatic query refinement using lexical affinities with maximal information gain. In SIGIR '02 proceedings, pages 283--290. ACM, 2002.
[9]
N. Craswell and M. Szummer. Random walks on the click graph. In SIGIR '07, pages 239--246, 2007.
[10]
S. Cucerzan and R. W. White. Query suggestion based on user landing pages. In SIGIR'07 proceedings, pages 875--876. ACM, 2007.
[11]
V. Dang and W. B. Croft. Query reformulation using anchor text. In WSDM'10, pages 41--50, 2010.
[12]
T. Joachims and F. Radlinski. Search engines that learn from implicit feedback. IEEE Computer, 40(8):34--40, 2007.
[13]
H. Joho, M. Sanderson, and M. Beaulieu. Hierarchical approach to term suggestion device. In SIGIR '02 proceedings, page 454. ACM, 2002.
[14]
R. Jones, B. Rey, and O. Madani. Generating query substitutions. In WWW '06 proceedings, pages 387--396, 2006.
[15]
R. Kraft and J. Zien. Mining anchor text for query refinement. In WWW'04, pages 666--674, 2004.
[16]
N. Nanas, V. Uren, A. De Roeck, and J. Domingue. Building and applying a concept hierarchy representation of a user profile. In SIGIR'03 proceedings, pages 198--204. ACM, 2003.
[17]
F. Radlinski, M. Kurup, and T. Joachims. How does clickthrough data reflect retrieval quality? In CIKM'08 proceedings, pages 43--52. ACM, 2008.
[18]
I. Ruthven. Re-examining the potential effectiveness of interactive query expansion. In SIGIR '03 proceedings, pages 213--220. ACM, 2003.
[19]
M. Sanderson and B. Croft. Deriving concept hierarchies from text. In SIGIR'99 proceedings, pages 206--213. ACM, 1999.
[20]
R. W. White, M. Bilenko, and S. Cucerzan. Studying the use of popular destination to enhance web search interaction. In SIGIR'07, pages 159--166, 2007.
[21]
R. W. White and I. Ruthven. A study of interface support mechanisms for interactive information retrieval. JASIST, 57(7):933--948, 2006.
[22]
J. Xu and W. B. Croft. Query expansion using local and global document analysis. In SIGIR'96 proceedings, pages 4--11. ACM, 1996.

Cited By

View all
  • (2021)Cluster Analysis of Influencing Factors of Regional Economic Growth Based on Random Walk Model2021 5th International Conference on Electronics, Communication and Aerospace Technology (ICECA)10.1109/ICECA52323.2021.9675884(1243-1246)Online publication date: 2-Dec-2021

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '12 Companion: Proceedings of the 21st International Conference on World Wide Web
April 2012
1250 pages
ISBN:9781450312301
DOI:10.1145/2187980
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Univ. de Lyon: Universite de Lyon

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 April 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. intranet
  2. query reformulation
  3. users' search experience

Qualifiers

  • Tutorial

Conference

WWW 2012
Sponsor:
  • Univ. de Lyon
WWW 2012: 21st World Wide Web Conference 2012
April 16 - 20, 2012
Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Cluster Analysis of Influencing Factors of Regional Economic Growth Based on Random Walk Model2021 5th International Conference on Electronics, Communication and Aerospace Technology (ICECA)10.1109/ICECA52323.2021.9675884(1243-1246)Online publication date: 2-Dec-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media