[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1968613.1968618acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

An approach to use query-related web context on document ranking

Published: 21 February 2011 Publication History

Abstract

With the development of Web search engines, it is considered as an important task to provide retrieved documents in a proper manner. Many search engines have used various document ranking algorithms to provide their retrieved documents in a more efficient way for users. However, even though a good algorithm is used, there are some limitations if they do not consider the characteristic of queries which is diverse depending on user intention or interest. Even if a user searches documents with the same query, he/she may want a different result depending on when he/she queries into a search engine. How can a search engine judge what way is more efficient to provide retrieved results? We suggest a simple and novel way which employs query-related Web context to answer this question. With the distribution of query-related tweets and news articles, we classify whether a query would be considered as a hot query or a cold query. And then, we extract major topic terms from the hot time slice if a query is classified as a hot query, or extract refined contents if a query is classified as a cold query. Finally, all retrieved results are re-ranked by reflecting these topic terms or refined contents according to the characteristic of the query. To show the meaningfulness of our approach, we compare our re-ranked results with original retrieved results from the commercial search engine.

References

[1]
Singhal, A. 2001. Modern information retrieval: a brief overview. IEEE Data Engineering Bulletin 24 (2001), 35--43.
[2]
Gerard Salton, G., Wong, A., and Yang, C. S. 1975. A vector space model for information retrieval. In Communications of the ACM (November 1975), Vol. 18, No. 11, 613--620.
[3]
Robertson, S. E. 1977. The probabilistic ranking principle in IR. Journal of Documentation, Vol. 33, 294--304.
[4]
Turtle, H. 1990. Inference Networks for Document Retrieval. Ph.D. thesis, Department of Computer Science, University of Massachusetts, Amherst, MA 01003, 90--92.
[5]
Beitzel, S. M., Jensen, E. C., Lewis, D. D., Chowdhury, A., and Frieder, O. 2007. Automatic classification of Web queries using very large unlabeled query logs. In ACM Transactions on Information Systems. ACM, New York, NY, USA. Vol. 25, No. 9 (April 2007).
[6]
Jansen, B. J., Spink, A., and Pederson, J. 2005. A temporal comparison of AltaVista Web searching. Journal of the American Society for Information Science and Technology. Vol. 56, Issue 6, 559--570.
[7]
Chien, S. and Immorlica, N. 2005. Semantic similarity between search engine queries using temporal correlation. In Proceedings of the 14th international conference on World Wide Web (Chiba, Japan).
[8]
Kang, I. H. and Kim, G. C. 2003. Query type classification for web document retrieval. In Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval (Toronto, Canada, July 28--August 01, 2003).
[9]
Gravano, L., Hatzivassiloglou, V., and Lichtenstein, R. 2003. Categorizing web queries according to geographical locality. In ACM International Conference on Information and Knowledge Management (New Orleans, LA, USA), 325--333.
[10]
Liu, X. and Brzeski, V. 2009. Computational community interest for ranking. In Proceeding of the 18th ACM conference on Information and knowledge management (Hong Kong, China), 245--254.
[11]
Beitzel, S. M., Jensen, E. C., Frieder, O., Grossman, D., Lewis, D. D., Chowdhury, A., and Kolcz, A. 2005. Automatic web query classification using labeled and unlabeled training data. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval (Salvador, Brazil, August 15--19, 2005). ACM, New York, NY, USA, 581--582.
[12]
Liu, Y., Zhang, L., Song, R., Nie, J., and Wen, J. 2009. Clustering queries for better document ranking. In Proceeding of the 18th ACM conference on Information and knowledge management (Hong Kong, China), 1569--1572.
[13]
Murata, T. 2006. Towards the detection of breaking news from online web search keywords. In IEEE/WIC/ACM International conference on Web Intelligence and Intelligent Agent Technology Workshops, 401--404.
[14]
Murata, T. 2008. Detection of breaking news from online web search queries. New Generation Computing, Vol. 26, No. 1, 63--73.
[15]
http://www.wikipedia.org/
[16]
http://www.dogpile.com/info.dogpl/searchspy

Cited By

View all
  • (2015)A Fuzzy Driven Reliability and Relevancy Map for Web Content Search Optimization2015 International Conference on Computational Intelligence and Communication Networks (CICN)10.1109/CICN.2015.233(1211-1215)Online publication date: Dec-2015

Index Terms

  1. An approach to use query-related web context on document ranking

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICUIMC '11: Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
    February 2011
    959 pages
    ISBN:9781450305716
    DOI:10.1145/1968613
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 February 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Twitter
    2. data mining
    3. document ranking
    4. query classification
    5. web context
    6. web page ranking
    7. web search

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    ICUIMC '11
    Sponsor:

    Acceptance Rates

    ICUIMC '11 Paper Acceptance Rate 135 of 534 submissions, 25%;
    Overall Acceptance Rate 251 of 941 submissions, 27%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2015)A Fuzzy Driven Reliability and Relevancy Map for Web Content Search Optimization2015 International Conference on Computational Intelligence and Communication Networks (CICN)10.1109/CICN.2015.233(1211-1215)Online publication date: Dec-2015

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media