[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2911451.2914805acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Temporal Information Retrieval

Published: 07 July 2016 Publication History

Abstract

The study of temporal dynamics and its impact can be framed within the so-called temporal IR approaches, which explain how user behavior, document content and scale vary with time, and how we can use them in our favor in order to improve retrieval effectiveness.
This half-day tutorial will outline research issues with respect to temporal dynamics, and provide a comprehensive overview of temporal IR approaches, essentially regarding processing dynamic content, temporal information extraction, temporal query analysis, and time-aware retrieval and ranking. The tutorial is structured into two sessions. During the first session, we will explain the general and wide aspects associated to temporal dynamics by focusing on the web domain, from content and structural changes to variations of user behavior and interactions. We will begin with temporal indexing and query processing. Next step, we will explain current approaches to time-aware retrieval and ranking, which can be classified into different types based on two main notions of relevance with respect to time, namely, recency-based ranking, and time-dependent ranking.
In the latter session, we will describe research issues centered on determining the temporal intent of queries, and time-aware query enhancement, e.g., temporal relevance feedback, and time-aware query reformulation. In addition, we present applications in related research areas, e.g., exploration, summarization, and clustering of search results, as well as future event retrieval and prediction. To this end, we conclude our tutorial and outline future directions.
This tutorial targets graduate students, researchers and practitioners in the field of information retrieval. The goal is to provide an overview as well as an important context that enables further research on and practical applications within this area.

References

[1]
J. Allan, R. Papka, and V. Lavrenko. On-line new event detection and tracking. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '98, pages 37--45, 1998.
[2]
O. Alonso, M. Gertz, and R. A. Baeza-Yates. Clustering and exploring search results using timeline constructions. In Proceedings of the 18th ACM conference on Information and knowledge management, CIKM '09, pages 97--106, 2009.
[3]
O. Alonso, J. Strötgen, R. A. Baeza-Yates, and M. Gertz. Temporal information retrieval: Challenges and opportunities. In Proceedings of the 1st International Temporal Web Analytics Workshop (TWAW 2011), 2011.
[4]
G. Amodeo, R. Blanco, and U. Brefeld. Hybrid models for future event prediction. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management, CIKM '11, pages 1981--1984, 2011.
[5]
A. Anand, S. Bedathur, K. Berberich, and R. Schenkel. Efficient temporal keyword search over versioned text. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, CIKM '10, pages 699--708, 2010.
[6]
A. Anand, S. Bedathur, K. Berberich, and R. Schenkel. Temporal index sharding for space-time efficiency in archive search. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '11, pages 545--554, 2011.
[7]
A. Anand, S. Bedathur, K. Berberich, and R. Schenkel. Index maintenance for time-travel text search. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '12, pages 235--244, New York, NY, USA, 2012. ACM.
[8]
C.-m. Au Yeung and A. Jatowt. Studying How the Past is Remembered: Towards computational history through large scale text mining. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management, CIKM '11, pages 1231--1240, 2011.
[9]
R. A. Baeza-Yates. Searching the future. In Proceedings of SIGIR workshop on mathematical/formal methods in information retrieval MF/IR, SIGIR '05, 2005.
[10]
K. Berberich, S. Bedathur, T. Neumann, and G. Weikum. A time machine for text search. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '07, pages 519--526, 2007.
[11]
K. Berberich, S. J. Bedathur, O. Alonso, and G. Weikum. A language modeling approach for temporal information needs. In Proceedings of the 32nd European Conference on IR Research on Advances in Information Retrieval, ECIR '10, pages 13--25, 2010.
[12]
M. Berry. Survey of Text Mining: Clustering, Classification, and Retrieval. Springer, Sep. 2003.
[13]
A. Z. Broder, N. Eiron, M. Fontoura, M. Herscovici, R. Lempel, J. McPherson, R. Qi, and E. J. Shekita. Indexing shared content in information retrieval systems. In Proceedings of the 10th International Conference on Extending Database Technology, EDBT '06, pages 313--330, 2006.
[14]
R. Campos, G. Dias, A. M. Jorge, and A. Jatowt. Survey of temporal information retrieval and related applications. ACM Comput. Surv., 47(2):15:1--15:41, Aug. 2014.
[15]
R. Campos, A. M. Jorge, G. Dias, and C. Nunes. Disambiguating implicit temporal queries by clustering top relevant dates in web snippets. In Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01, WI-IAT '12, pages 1--8, 2012.
[16]
N. Dai and B. D. Davison. Freshness matters: in flowers, food, and web authority. In Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, SIGIR '10, pages 114--121, 2010.
[17]
A. Dong, Y. Chang, Z. Zheng, G. Mishne, J. Bai, R. Zhang, K. Buchner, C. Liao, and F. Diaz. Towards recency ranking in web search. In Proceedings of the third ACM international conference on Web search and data mining, WSDM '10, pages 11--20, 2010.
[18]
M. Efron. Query representation for cross-temporal information retrieval. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '13, pages 383--392, 2013.
[19]
J. L. Elsas and S. T. Dumais. Leveraging temporal dynamics of document content in relevance ranking. In Proceedings of the third ACM international conference on Web search and data mining, WSDM '10, pages 1--10, 2010.
[20]
J. He and T. Suel. Faster temporal range queries over versioned text. In Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '11, pages 565--574, 2011.
[21]
J. He, H. Yan, and T. Suel. Compact full-text indexing of versioned document collections. In Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM '09, pages 415--424, 2009.
[22]
J. He, J. Zeng, and T. Suel. Improved index compression techniques for versioned document collections. In Proceedings of the 19th ACM Conference on Information and Knowledge Management, CIKM '10, pages 1239--1248, 2010.
[23]
A. Jatowt, É. Antoine, Y. Kawai, and T. Akiyama. Mapping temporal horizons: Analysis of collective future and past related attention in twitter. In Proceedings of the 24th International Conference on World Wide Web, WWW '15, pages 484--494, 2015.
[24]
H. Joho, A. Jatowt, and B. Roi. A survey of temporal web search experience. In Proceedings of the 22nd International Conference on World Wide Web (Companion), WWW '13, pages 1101--1108, 2013.
[25]
R. Jones and F. Diaz. Temporal profiles of queries. ACM Trans. Inf. Syst., 25, July 2007.
[26]
A. C. Kaluarachchi, A. S. Varde, S. Bedathur, G. Weikum, J. Peng, and A. Feldman. Incorporating terminology evolution for query translation in text retrieval with association rules. In Proceedings of the 19th ACM international conference on Information and knowledge management, CIKM '10, pages 1789--1792, 2010.
[27]
N. Kanhabua, R. Blanco, and M. Matthews. Ranking related news predictions. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval, SIGIR '11, pages 755--764, 2011.
[28]
N. Kanhabua, R. Blanco, and K. Nørvåg. Temporal information retrieval. Foundations and Trends in Information Retrieval, 9(2):91--208, 2015.
[29]
N. Kanhabua and K. Nørvåg. Determining time of queries for re-ranking search results. In Proceedings of the 14th European conference on Research and advanced technology for digital libraries, ECDL'10, pages 261--272, 2010.
[30]
Y. Ke, L. Deng, W. Ng, and D.-L. Lee. Web dynamics and their ramifications for the development of web search engines. Computer Networks, 50(10):1430--1447, July 2006.
[31]
M. Keikha, S. Gerani, and F. Crestani. TEMPER: A temporal relevance feedback method. In Proceedings of the 33rd European Conference on IR Research on Advances in Information Retrieval, ECIR '11, pages 436--447, 2011.
[32]
A. Kulkarni, J. Teevan, K. M. Svore, and S. T. Dumais. Understanding temporal query dynamics. In Proceedings of the Forth International Conference on Web Search and Web Data Mining, WSDM '11, pages 167--176, 2011.
[33]
X. Li and W. B. Croft. Time-based language mmdels. In Proceedings of the 12th international conference on Information and knowledge management, CIKM '03, pages 469--475, 2003.
[34]
M. Matthews, P. Tolchinsky, R. Blanco, J. Atserias, P. Mika, and H. Zaragoza. Searching through time in the new york times. In HCIR Workshop on Bridging Human-Computer Interaction and Information Retrieval, HCIR '10, 2010.
[35]
R. McCreadie, C. Macdonald, and I. Ounis. Incremental update summarization: Adaptive sentence selection based on prevalence and novelty. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, CIKM '14, pages 301--310, 2014.
[36]
T. N. Nguyen and N. Kanhabua. Leveraging dynamic query subtopics for time-aware search result diversification. In Proceedings of the 36th European Conference on Advances in Information Retrieval, ECIR '14, pages 222--234, 2014.
[37]
K. Nørvåg and A. O. Nybø. Dyst: Dynamic and scalable temporal text indexing. In Proceedings of the 13th International Symposium on Temporal Representation and Reasoning, TIME '06, pages 204--211, 2006.
[38]
S. Nunes, C. Ribeiro, and G. David. Use of temporal expressions in web search. In Proceedings of the 30th European Conference on IR Research on Advances in Information Retrieval, ECIR '08, pages 580--584, 2008.
[39]
D. Odijk, G. Santucci, M. d. Rijke, M. Angelini, and G. Granato. Time-aware exploratory search: Exploring word meaning through time. In SIGIR 2012 Workshop on Time-aware Information Access, TAIA '12, 2012.
[40]
M.-H. Peetz, E. Meij, and M. Rijke. Using temporal bursts for query modeling. Information Retrieval, 17(1):74--108, 2014.
[41]
K. Radinsky, F. Diaz, S. T. Dumais, M. Shokouhi, A. Dong, and Y. Chang. Temporal web dynamics and its application to information retrieval. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, WSDM '13, pages 781--782, 2013.
[42]
K. Radinsky and E. Horvitz. Mining the web to predict future events. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, WSDM '13, pages 255--264, 2013.
[43]
K. Radinsky, K. Svore, S. Dumais, J. Teevan, A. Bocharov, and E. Horvitz. Modeling and predicting behavioral dynamics on the web. In Proceedings of the 21st international conference on World Wide Web, WWW '12, pages 599--608, 2012.
[44]
D. Shan, W. X. Zhao, R. Chen, B. Shu, Z. Wang, J. Yao, H. Yan, and X. Li. Eventsearch: A system for event discovery and retrieval on multi-type historical data. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '12, pages 1564--1567, 2012.
[45]
M. Shokouhi. Detecting seasonal queries by time-series analysis. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '11, pages 1171--1172, 2011.
[46]
J. Singh, W. Nejdl, and A. Anand. History by diversity: Helping historians search news archives. In Proceedings of the 1st International ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR), 2016.
[47]
R. Sipos, A. Swaminathan, P. Shivaswamy, and T. Joachims. Temporal corpus summarization using submodular word coverage. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM '12, pages 754--763, 2012.
[48]
K. M. Svore, J. Teevan, S. T. Dumais, and A. Kulkarni. Creating temporally dynamic web search snippets. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '12, pages 1045--1046, 2012.
[49]
N. Tahmasebi, G. Gossen, N. Kanhabua, H. Holzmann, and T. Risse. NEER: An Unsupervised Method for Named Entity Evolution Recognition. In Proceedings the 24th International Conference on Computational Linguistics, COLING '12, pages 2553--2568. ACL, 2012.
[50]
T. A. Tran, C. Niederée, N. Kanhabua, U. Gadiraju, and A. Anand. Balancing novelty and salience: Adaptive learning to rank entities for timeline summarization of high-impact events. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM '15, pages 1201--1210, 2015.
[51]
S. Whiting, K. Zhou, J. Jose, and M. Lalmas. Temporal variance of intents in multi-faceted event-driven information needs. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '13, pages 989--992, 2013.
[52]
R. Yan, X. Wan, J. Otterbacher, L. Kong, X. Li, and Y. Zhang. Evolutionary timeline summarization: A balanced optimization framework via iterative substitution. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '11, pages 745--754, 2011.
[53]
J. Zhang and T. Suel. Efficient search in large textual collections with redundancy. In Proceedings of the 16th International Conference on World Wide Web, WWW '07, pages 411--420, 2007.
[54]
R. Zhang, Y. Konda, A. Dong, P. Kolari, Y. Chang, and Z. Zheng. Learning recurrent event queries for web search. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, EMNLP '10, pages 1129--1139, 2010.
[55]
X. W. Zhao, Y. Guo, R. Yan, Y. He, and X. Li. Timeline generation with social attention. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '13, pages 1061--1064, 2013.
[56]
K. Zhou, S. Whiting, J. M. Jose, and M. Lalmas. The impact of temporal intent variability on diversity evaluation. In Proceedings of the 35th European Conference on Advances in Information Retrieval, ECIR '13, pages 820--823, 2013.

Cited By

View all
  • (2024)Reproducible Hybrid Time-Travel Retrieval in Evolving CorporaProceedings of the 2024 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3673791.3698421(203-208)Online publication date: 8-Dec-2024
  • (2024)Temporal JSON Keyword SearchProceedings of the ACM on Management of Data10.1145/36549802:3(1-27)Online publication date: 30-May-2024
  • (2024)An Efficient Corpus Indexer for dynamic corpora retrievalExpert Systems with Applications10.1016/j.eswa.2024.124306254(124306)Online publication date: Nov-2024
  • Show More Cited By

Index Terms

  1. Temporal Information Retrieval

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '16: Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval
    July 2016
    1296 pages
    ISBN:9781450340694
    DOI:10.1145/2911451
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 July 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. adaptive crawling and caching
    2. temporal indexing
    3. temporal information extraction
    4. temporal queries
    5. time-aware ranking

    Qualifiers

    • Research-article

    Conference

    SIGIR '16
    Sponsor:

    Acceptance Rates

    SIGIR '16 Paper Acceptance Rate 62 of 341 submissions, 18%;
    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)69
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 04 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Reproducible Hybrid Time-Travel Retrieval in Evolving CorporaProceedings of the 2024 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3673791.3698421(203-208)Online publication date: 8-Dec-2024
    • (2024)Temporal JSON Keyword SearchProceedings of the ACM on Management of Data10.1145/36549802:3(1-27)Online publication date: 30-May-2024
    • (2024)An Efficient Corpus Indexer for dynamic corpora retrievalExpert Systems with Applications10.1016/j.eswa.2024.124306254(124306)Online publication date: Nov-2024
    • (2024)Temporal validity reassessment: commonsense reasoning about information obsoletenessDiscover Computing10.1007/s10791-024-09433-w27:1Online publication date: 6-May-2024
    • (2023)BiTimeBERT: Extending Pre-Trained Language Representations with Bi-Temporal InformationProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591686(812-821)Online publication date: 19-Jul-2023
    • (2022)Time Masking for Temporal Language ModelsProceedings of the Fifteenth ACM International Conference on Web Search and Data Mining10.1145/3488560.3498529(833-841)Online publication date: 11-Feb-2022
    • (2022)Semantic Modelling of Document Focus-Time for Temporal Information RetrievalCompanion Proceedings of the Web Conference 202210.1145/3487553.3524668(896-902)Online publication date: 25-Apr-2022
    • (2022)ArchivalQAProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531734(3025-3035)Online publication date: 6-Jul-2022
    • (2021)Semantically Time Tracking of Events from Web DocumentsProceedings of the Brazilian Symposium on Multimedia and the Web10.1145/3470482.3479627(141-144)Online publication date: 5-Nov-2021
    • (2021)Complex Temporal Question Answering on Knowledge GraphsProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482416(792-802)Online publication date: 26-Oct-2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media