[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1498759.1498825acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Integration of news content into web results

Published: 09 February 2009 Publication History

Abstract

Aggregated search refers to the integration of content from specialized corpora or verticals into web search results. Aggregation improves search when the user has vertical intent but may not be aware of or desire vertical search. In this paper, we address the issue of integrating search results from a news vertical into web search results. News is particularly challenging because, given a query, the appropriate decision---to integrate news content or not---changes with time. Our system adapts to news intent in two ways. First, by inspecting the dynamics of the news collection and query volume, we can track development of and interest in topics. Second, by using click feedback, we can quickly recover from system errors. We define several click-based metrics which allow a system to be monitored and tuned without annotator effort.

References

[1]
J. Allan, editor. Topic Detection and Tracking: Event-based Information Organization, volume 12 of The Information Retrieval Series. Springer, 2002.
[2]
H. Becker, C. Meek, and D. M. Chickering. Modeling contextual factors of click rates. In AAAI, pages 1310--1315, 2007.
[3]
D. Beeferman and A. Berger. Agglomerative clustering of a search engine query log. In KDD 2000, pages 407--416, 2000.
[4]
S. M. Beitzel, E. C. Jensen, O. Frieder, D. Grossman, D. D. Lewis, A. Chowdhury, and A. Kolcz. Automatic web query classification using labeled and unlabeled training data. In SIGIR 2005, pages 581--582, 2005.
[5]
N. J. Belkin and W. B. Croft. Information filtering and information retrieval: two sides of the same coin? CACM, 35(12):29--38, 1992.
[6]
A. Z. Broder, M. Fontoura, E. Gabrilovich, A. Joshi, V. Josifovski, and T. Zhang. Robust classification of rare queries using web knowledge. In SIGIR 2007, pages 231--238, 2007.
[7]
J. Callan. Distributed information retrieval. In W. B. Croft, editor, Advances in Information Retrieval. Kluwer Academic Publishers, 2000.
[8]
B. Carterette and R. Jones. Evaluating search engines by modeling the relationship between relevance and clicks. In NIPS, 2007.
[9]
S. Cronen-Townsend, Y. Zhou, and W. B. Croft. Predicting query performance. In SIGIR 2002, pages 299--306, 2002.
[10]
A. S. Das, M. Datar, A. Garg, and S. Rajaram. Google news personalization: scalable online collaborative filtering. In WWW 2007, pages 271--280, 2007.
[11]
F. Diaz and R. Jones. Using temporal profiles of queries for precision prediction. In SIGIR 2004, pages 18--24, 2004.
[12]
G. Dupret, V. Murdock, and B. Piwowarski. Web search engine evaluation using clickthrough data and a user model. In Query Log Analysis: Social And Technological Challenges. A workshop at the 16th International World Wide Web Conference (WWW 2007), May 2007.
[13]
G. E. Dupret and B. Piwowarski. A user browsing model to predict search engine click data from past observations. In SIGIR 2008, pages 331--338, 2008.
[14]
R. Jones and F. Diaz. Temporal profiles of queries. ACM Trans. Inf. Syst., 25(3):14, July 2007.
[15]
N. K. Jong and P. Stone. Bayesian models of nonstationary markov decision processes. In The IJCAI-2005 Workshop on Planning and Learning in A Priori Unknown or Dynamic Domains, 2005.
[16]
I.-H. Kang and G. Kim. Query type classification for web document retrieval. In SIGIR 2003, pages 64--71, 2003.
[17]
R. Kleinberg, A. Slivkins, and E. Upfal. Multi-armed bandits in metric spaces. In STOC 2008. 2008.
[18]
V. Lavrenko and W. B. Croft. Relevance based language models. In SIGIR 2001, pages 120--127, 2001.
[19]
X. Li, Y.-Y. Wang, and A. Acero. Learning query intent from regularized click graphs. In SIGIR 2008, pages 339--346, 2008.
[20]
C.-J. Lin, R. C. Weng, and S. S. Keerthi. Trust region newton method for large-scale logistic regression. Journal of Machine Learning Research, 9:627--650, 2008.
[21]
A. K. McCallum and K. Nigam. Employing EM in pool-based active learning for text classification. In ICML 1998, pages 350--358, 1998.
[22]
D. Metzler, S. T. Dumais, and C. Meek. Similarity measures for short segments of text. In ECIR, pages 16--27, 2007.
[23]
V. Murdock and M. Lalmas, editors. Proceedings of the SIGIR Workshop on Aggregated Search, 2008.
[24]
S. Pandey, D. Agarwal, D. Chakrabarti, and V. Josifovski. Bandits for taxonomies: A model-based approach. In SDM, 2007.
[25]
S. Pandey, D. Chakrabarti, and D. Agarwal. Multi-armed bandit problems with dependent arms. In ICML 2007, pages 721--728, 2007.
[26]
F. Radlinski, R. Kleinberg, and T. Joachims. Learning diverse rankings with multi-armed bandits. In ICML 2008, pages 784--791, 2008.
[27]
M. Sahami and T. D. Heilman. A web-based kernel function for measuring the similarity of short text snippets. In WWW 2006, pages 377--386, 2006.
[28]
M. J. A. Strens. A bayesian framework for reinforcement learning. In ICML 2000, pages 943--950, 2000.
[29]
T. Strohman, D. Metzler, H. Turtle, and W. B. Croft. Indri: A language model-based search engine for complex queries. In Proceedings of the International Conference on Intelligence Analysis, 2004.
[30]
R. Sutton and A. Barto. Reinforcement Learning. MIT Press, 1998.
[31]
M. Vlachos, C. Meek, Z. Vagena, and D. Gunopulos. Identifying similarities, periodicities and bursts for online search queries. In SIGMOD 2004, pages 131--142, 2004.
[32]
J.-R. Wen, J.-Y. Nie, and H.-J. Zhang. Query clustering using user logs. ACM Trans. Inf. Syst., 20(1):59--81, 2002.
[33]
X. Zhang. Fast Algorithms for Burst Detection. PhD thesis, New York University, 2006.
[34]
Y. Zhang, W. Xu, and J. P. Callan. Exploration and exploitation in adaptive filtering based on bayesian active learning. In ICML 2003, pages 896--903, 2003.

Cited By

View all
  • (2024)An overview of aggregation methods for social networks analysisKnowledge and Information Systems10.1007/s10115-024-02296-zOnline publication date: 12-Dec-2024
  • (2023)A Multi-Agent Framework for Recommendation with Heterogeneous Sources2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191154(1-8)Online publication date: 18-Jun-2023
  • (2022)A Novel Probabilistic Graphical Model-Based Click Model for Vertical SearchJournal of the Korean Institute of Industrial Engineers10.7232/JKIIE.2022.48.2.13848:2(138-150)Online publication date: 15-Apr-2022
  • Show More Cited By

Index Terms

  1. Integration of news content into web results

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining
      February 2009
      314 pages
      ISBN:9781605583907
      DOI:10.1145/1498759
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 09 February 2009

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. click prediction
      2. distributed information retrieval
      3. news search
      4. query similarity

      Qualifiers

      • Research-article

      Conference

      WSDM'09
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 498 of 2,863 submissions, 17%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)27
      • Downloads (Last 6 weeks)4
      Reflects downloads up to 10 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)An overview of aggregation methods for social networks analysisKnowledge and Information Systems10.1007/s10115-024-02296-zOnline publication date: 12-Dec-2024
      • (2023)A Multi-Agent Framework for Recommendation with Heterogeneous Sources2023 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN54540.2023.10191154(1-8)Online publication date: 18-Jun-2023
      • (2022)A Novel Probabilistic Graphical Model-Based Click Model for Vertical SearchJournal of the Korean Institute of Industrial Engineers10.7232/JKIIE.2022.48.2.13848:2(138-150)Online publication date: 15-Apr-2022
      • (2022)UNICON: Combating Label Noise Through Uniform Selection and Contrastive Learning2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52688.2022.00945(9666-9676)Online publication date: Jun-2022
      • (2021)Context-aware Target Apps Selection and Recommendation for Enhancing Personal Mobile AssistantsACM Transactions on Information Systems10.1145/344767839:3(1-30)Online publication date: 5-May-2021
      • (2020)Query ClassificationQuery Understanding for Search Engines10.1007/978-3-030-58334-7_2(15-41)Online publication date: 2-Dec-2020
      • (2019)Aggregating E-commerce Search Results from Heterogeneous Sources via Hierarchical Reinforcement LearningThe World Wide Web Conference10.1145/3308558.3313455(1771-1781)Online publication date: 13-May-2019
      • (2019)When People Change their MindProceedings of the Twelfth ACM International Conference on Web Search and Data Mining10.1145/3289600.3290958(447-455)Online publication date: 30-Jan-2019
      • (2019)An Analysis Study of Vertical Selection Task in Aggregated SearchProcedia Computer Science10.1016/j.procs.2019.01.021148(171-180)Online publication date: 2019
      • (2019)PolyRecs: Improving Page–View Rates Using Real-Time Data AnalysisReal-Time Business Intelligence and Analytics10.1007/978-3-030-24124-7_7(95-112)Online publication date: 11-Oct-2019
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media