More Web Proxy on the site http://driver.im/

research-article

Time is of the essence: improving recency ranking using Twitter data

Authors:

Ruiqiang Zhang,

Hongyuan ZhaAuthors Info & Claims

WWW '10: Proceedings of the 19th international conference on World wide web

Pages 331 - 340

https://doi.org/10.1145/1772690.1772725

Published: 26 April 2010 Publication History

Abstract

Realtime web search refers to the retrieval of very fresh content which is in high demand. An effective portal web search engine must support a variety of search needs, including realtime web search. However, supporting realtime web search introduces two challenges not encountered in non-realtime web search: quickly crawling relevant content and ranking documents with impoverished link and click information. In this paper, we advocate the use of realtime micro-blogging data for addressing both of these problems. We propose a method to use the micro-blogging data stream to detect fresh URLs. We also use micro-blogging data to compute novel and effective features for ranking fresh URLs. We demonstrate these methods improve effective of the portal web search engine for realtime web search.

References

[1]

E. Agichtein, E. Brill, and S. Dumais. Improving web search ranking by incorporating user behavior information. In Proceedings of 29th ACM SIGIR, 2006.

Digital Library

[2]

P. Bonacich. Factoring and weighting approaches to clique identification. Journal of Mathematical Sociology, 2:113--120, 1972.

[3]

K. Borau, C. Ullrich, J. Feng, and R. Shen. Microblogging for language learning: Using twitter to train communicative and cultural competence. In International Conference on Web Based Learning (ICWL) 2009, 2009.

Digital Library

[4]

S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Proceedings of International Conference on World Wide Web, 1998.

Digital Library

[5]

A. Broder. A taxonomy of web search. SIGIR Forum, 36(2):3--10, 2002.

Digital Library

[6]

C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. Learning to rank using gradient descent. Proc. of Intl. Conf. on Machine Learning, 2005.

Digital Library

[7]

Z. Cao, T. Qin, T. Liu, M. Tsai, and H. Li. Learning to rank: From pairwise approach to listwise. Proceedings of ICML conference, 2007.

Digital Library

[8]

F. Diaz. Integration of news content into web results. Proceedings of the Second ACM International Conference on Web Search and Data Mining (WSDM), pages 182--191, 2009.

Digital Library

[9]

A. Dong, Y. Chang, Z. Zheng, G. Mishne, J. Bai, R. Zhang, K. Buchner, C. Liao, and F. Diaz. Towards recency ranking in web search. In WSDM '10: Proceedings of the third ACM international conference on Web search and data mining, pages 11--20, New York, NY, USA, 2010. ACM.

Digital Library

[10]

J. C. Dunlap and P. R. Lowenthal. Tweeting the night away: Using twitter to enhance social presence. In Journal of Information Systems Education Special Issue, Impacts of Web 2.0 and Virtual World Technologies on IS Education, 2009.

[11]

Y. Freund, R. D. Iyer, R. E. Schapire, and Y. Singer. An efficient boosting algorithm for combining preferences. Proceedings of International Conference on Machine Learning, 1998.

Digital Library

[12]

J. H. Friedman. Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29(5):1189--1232, 2001.

[13]

C. Honeycutt and S. C. Herring. Beyond microblogging: Conversation and collaboration via twitter. In System Sciences, 2009. HICSS '09. 42nd Hawaii International Conference on, pages 1--10, 2009.

Digital Library

[14]

B. A. Huberman, D. M. Romero, and F. Wu. Social networks that matter: Twitter under the microscope. Dec 2008.

[15]

A. L. Hughes and L. Palen. Twitter adoption and use in mass convergence and emergency events. In Proceedings of the 6th International Conference on Information Systems for Crisis Response and Management, 2009.

[16]

B. J. Jansen, M. Zhang, K. Sobel, and A. Chowdury. Twitter power: Tweets as electronic word of mouth. Journal of the American Society for Information Science and Technology, pages 1--20, 2009.

Digital Library

[17]

K. Jarvelin and J. Kekalainen. Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems, 20:422--446, 2002.

Digital Library

[18]

A. Java, X. Song, T. Finin, and B. Tseng. Why we twitter: understanding microblogging usage and communities. In WebKDD/SNA-KDD '07: Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis, pages 56--65, New York, NY, USA, 2007. ACM.

Digital Library

[19]

T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of the ACM Conference on Knowledge Discovery and Data Mining (KDD), 2002.

Digital Library

[20]

A. C. König, M. Gamon, and Q. Wu. Click-through prediction for news queries. In SIGIR 2009, 2009.

Digital Library

[21]

B. Krishnamurthy, P. Gill, and M. Arlitt. A few chirps about twitter. In WOSP '08: Proceedings of the first workshop on Online social networks, pages 19--24, New York, NY, USA, 2008. ACM.

Digital Library

[22]

T. Y. Liu. Learning to rank for information retrieval. Tutorial on WWW conference, 2009.

Digital Library

[23]

C. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008.

Digital Library

[24]

D. Metzler, S. T. Dumais, and C. Meek. Similarity measures for short segments of text. In ECIR, pages 16--27, 2007.

Digital Library

[25]

D. Shamma, L. Kennedy, and E. Churchill. Tweet the debates: Understanding community annotation of uncollected sources. In Proceedings of the ACM International Conference on Multimedia. ACM, 2009.

Digital Library

[26]

X. Wang and C. Zhai. Learn from web search logs to organize search results. In Proceedings of the 30th ACM SIGIR, 2007.

Digital Library

[27]

D. Zhao and M. B. Rosson. How and why people twitter: the role that micro-blogging plays in informal communication at work. In GROUP '09: Proceedings of the ACM 2009 international conference on Supporting group work, pages 243--252, New York, NY, USA, 2009. ACM.

Digital Library

[28]

Z. Zheng, H. Zha, K. Chen, and G. Sun. A regression framework for learning ranking functions using relative relevance judgments. In Proceedings of the 30th ACM SIGIR conference, 2007.

Digital Library

[29]

Z. Zheng, H. Zha, T. Zhang, O. Chapelle, K. Chen, and G. Sun. A general boosting method and its application to learning ranking functions for web search. NIPS, 2007.

Cited By

Ulloa RMakhortykh MUrman AKulshrestha J(2023)Novelty in News Search: A Longitudinal Study of the 2020 US ElectionsSocial Science Computer Review10.1177/0894439323119547142:3(700-718)Online publication date: 14-Aug-2023
https://doi.org/10.1177/08944393231195471
Teodoro DFerdowsi SBorissov NKashani EVicente Alvarez DCopara JGouareb RNaderi NAmini P(2021)Information Retrieval in an Infodemic: The Case of COVID-19 PublicationsJournal of Medical Internet Research10.2196/3016123:9(e30161)Online publication date: 17-Sep-2021
https://doi.org/10.2196/30161
Han ZKong LQi H(2021)Time segment language model for microblog retrievalNeural Computing and Applications10.1007/s00521-020-05534-xOnline publication date: 3-Jan-2021
https://doi.org/10.1007/s00521-020-05534-x
Show More Cited By

Index Terms

Time is of the essence: improving recency ranking using Twitter data
1. Information systems
  1. Information retrieval

Recommendations

Towards recency ranking in web search
WSDM '10: Proceedings of the third ACM international conference on Web search and data mining

In web search, recency ranking refers to ranking documents by relevance which takes freshness into account. In this paper, we propose a retrieval system which automatically detects and responds to recency sensitive queries. The system detects recency ...
Improving recency ranking using twitter data
Special section on twitter and microblogging services, social recommender systems, and CAMRa2010: Movie recommendation in context

In Web search and vertical search, recency ranking refers to retrieving and ranking documents by both relevance and freshness. As impoverished in-links and click information is the the biggest challenge for recency ranking, we advocate the use of ...
Online learning for recency search ranking using real-time user feedback
CIKM '10: Proceedings of the 19th ACM international conference on Information and knowledge management

Traditional machine-learned ranking algorithms for web search are trained in batch mode, which assume static relevance of documents for a given query. Although such a batch-learning framework has been tremendously successful in commercial search engines,...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '10: Proceedings of the 19th international conference on World wide web

April 2010

1407 pages

ISBN:9781605587998

DOI:10.1145/1772690

General Chairs:
Michael Rappa
North Carolina State University, USA
,
Paul Jones
University of North Carolina at Chapel Hill, USA
,
Program Chairs:
Juliana Freire
University of Utah, USA
,
Soumen Chakrabarti
Indian Institute of Technology, India

Copyright © 2010 International World Wide Web Conference Committee (IW3C2).

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 April 2010

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

WWW '10

WWW '10: The 19th International World Wide Web Conference

April 26 - 30, 2010

North Carolina, Raleigh, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

134
Total Citations
View Citations
2,883
Total Downloads

Downloads (Last 12 months)11
Downloads (Last 6 weeks)2

Reflects downloads up to 10 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Ulloa RMakhortykh MUrman AKulshrestha J(2023)Novelty in News Search: A Longitudinal Study of the 2020 US ElectionsSocial Science Computer Review10.1177/0894439323119547142:3(700-718)Online publication date: 14-Aug-2023
https://doi.org/10.1177/08944393231195471
Teodoro DFerdowsi SBorissov NKashani EVicente Alvarez DCopara JGouareb RNaderi NAmini P(2021)Information Retrieval in an Infodemic: The Case of COVID-19 PublicationsJournal of Medical Internet Research10.2196/3016123:9(e30161)Online publication date: 17-Sep-2021
https://doi.org/10.2196/30161
Han ZKong LQi H(2021)Time segment language model for microblog retrievalNeural Computing and Applications10.1007/s00521-020-05534-xOnline publication date: 3-Jan-2021
https://doi.org/10.1007/s00521-020-05534-x
Goren GKurland OTennenholtz MRaiber FHuang JChang YCheng XKamps JMurdock VWen JLiu Y(2020)Ranking-Incentivized Quality Preserving Content ModificationProceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3397271.3401058(259-268)Online publication date: 25-Jul-2020
https://dl.acm.org/doi/10.1145/3397271.3401058
Son JLee JLarsen KWoo J(2020)Understanding the uncertainty of disaster tweets and its effect on retweetingJournal of the Association for Information Science and Technology10.1002/asi.2432971:10(1145-1161)Online publication date: 11-Sep-2020
https://dl.acm.org/doi/10.1002/asi.24329
Sayed MOard DPiwowarski BChevalier MGaussier EMaarek YNie JScholer F(2019)Jointly Modeling Relevance and Sensitivity for Search Among Sensitive ContentProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331256(615-624)Online publication date: 18-Jul-2019
https://dl.acm.org/doi/10.1145/3331184.3331256
Kim J(2019)A Document Ranking Method With Query-Related Web ContextIEEE Access10.1109/ACCESS.2019.29471667(150168-150174)Online publication date: 2019
https://doi.org/10.1109/ACCESS.2019.2947166
Albishre KLi YXu YHuang W(2019)Query-based unsupervised learning for improving social media searchWorld Wide Web10.1007/s11280-019-00747-0Online publication date: 27-Nov-2019
https://doi.org/10.1007/s11280-019-00747-0
Devi PGeetha RKarthika S(2019)Trendingtags—Classification & Prediction of Hashtag Popularity Using Twitter Features in Machine Learning Approach ProceedingsComputational Intelligence in Data Mining10.1007/978-981-13-8676-3_15(161-177)Online publication date: 18-Aug-2019
https://doi.org/10.1007/978-981-13-8676-3_15
(2018)Activity-based linkage and ranking methods for personal dataspaceInternational Journal of Mobile Communications10.1504/IJMC.2018.09138116:3(266-285)Online publication date: 1-Jan-2018
https://dl.acm.org/doi/10.1504/IJMC.2018.091381
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

EPUB

View this article in ePub.

Media

Figures

Other

Tables

View Table of Contents