[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2499178.2499195acmotherconferencesArticle/Chapter ViewAbstractPublication PagesictirConference Proceedingsconference-collections
research-article

Information Retrieval with Time Series Query

Published: 29 September 2013 Publication History

Abstract

We study a novel information retrieval problem, where the query is a time series for a given time period, and the retrieval task is to find relevant documents in a text collection of the same time period, which contain topics that are correlated with the query time series. This retrieval problem arises in many text mining applications where there is a need to analyze text data in order to discover potentially causal topics. To solve this problem, we propose and study multiple retrieval algorithms that use the general idea of ranking text documents based on how well their terms are correlated with the query time series. Experiment results show that the proposed retrieval algorithm can effectively help users find documents that are relevant to the time series queries, which can help users analyze the variation patterns of the time series.

References

[1]
R. Agrawal, C. Faloutsos, and A. N. Swami. Efficient similarity search in sequence databases. In Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms, FODO '93, pages 69--84, London, UK, UK, 1993. Springer-Verlag.
[2]
R. Agrawal, K.-I. Lin, H. S. Sawhney, and K. Shim. Fast similarity search in the presence of noise, scaling, and translation in time-series databases. In Proceedings of the 21th International Conference on Very Large Data Bases, VLDB '95, pages 490--501, San Francisco, CA, USA, 1995. Morgan Kaufmann Publishers Inc.
[3]
R. Agrawal, G. Psaila, E. L. Wimmers, and M. Zaït. Querying shapes of histories. In Proceedings of the 21th International Conference on Very Large Data Bases, VLDB '95, pages 502--514, San Francisco, CA, USA, 1995. Morgan Kaufmann Publishers Inc.
[4]
K.-P. Chan and A. W.-C. Fu. Efficient time series matching by wavelets. In Data Engineering, 1999. Proceedings., 15th International Conference on, pages 126 --133, mar 1999.
[5]
M. Efron. Linear time series models for term weighting in information retrieval. J. Am. Soc. Inf. Sci. Technol., 61(7):1299--1312, July 2010.
[6]
C. Faloutsos, M. Ranganathan, and Y. Manolopoulos. Fast subsequence matching in time-series databases. In Proceedings of the 1994 ACM international conference on Management of data, SIGMOD '94, pages 419--429, New York, NY, USA, 1994. ACM.
[7]
C. W. J. Granger. Essays in econometrics. chapter Investigating causal relations by econometric models and cross-spectral methods, pages 31--47. Harvard University Press, Cambridge, MA, USA, 2001.
[8]
D. Gunopulos and G. Das. Time series similarity measures. In Tutorial notes of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '00, pages 243--307, New York, NY, USA, 2000. ACM.
[9]
K. Järvelin and J. Kekäläinen. Cumulated gain-based evaluation of ir techniques. ACM Trans. Inf. Syst., 20(4):422--446, Oct. 2002.
[10]
K. S. Jones, S. Walker, and S. E. Robertson. A probabilistic model of information retrieval: development and comparative experiments. Inf. Process. Manage., 36(6):779--808, Nov. 2000.
[11]
R. Jones and F. Diaz. Temporal profiles of queries. ACM Trans. Inf. Syst., 25(3), July 2007.
[12]
J. Kacprzyk and A. Wilbik. Linguistic summarization of time series using linguistic quantifiers: Augmenting the analysis by a degree of fuzziness. In Fuzzy Systems, 2008. FUZZ-IEEE 2008. (IEEE World Congress on Computational Intelligence). IEEE International Conference on, pages 1146 --1153, june 2008.
[13]
M. Kaya and M. Karsligil. Stock price prediction using financial news articles. In Information and Financial Engineering (ICIFE), 2010 2nd IEEE International Conference on, pages 478 --482, sept. 2010.
[14]
E. Keogh. Fast similarity search in the presence of longitudinal scaling in time series databases. In Proceedings of the 9th International Conference on Tools with Artificial Intelligence, ICTAI '97, pages 578--, Washington, DC, USA, 1997. IEEE Computer Society.
[15]
E. Keogh and P. Smyth. A probabilistic approach to fast pattern matching in time series databases. page 24. AAAI Press, 1997.
[16]
E. J. Keogh and M. J. Pazzani. Relevance feedback retrieval of time series data. In Proceedings of the 22nd annual international ACM conference on Research and development in information retrieval, SIGIR '99, pages 183--190, New York, NY, USA, 1999. ACM.
[17]
E. J. Keogh and M. J. Pazzani. Derivative dynamic time warping. In In First SIAM International Conference on Data Mining (SDM'01, 2001.
[18]
H. D. Kim, C. Zhai, T. A. Rietz, D. Diermeier, M. Hsu, M. Castellanos, and C. A. Ceja Limon. Incatomi: integrative causal topic miner between textual and non-textual time series data. In Proceedings of the 21st ACM international conference on Information and knowledge management, CIKM '12, pages 2689--2691, New York, NY, USA, 2012. ACM.
[19]
A. Kulkarni, J. Teevan, K. M. Svore, and S. T. Dumais. Understanding temporal query dynamics. In Proceedings of the fourth ACM international conference on Web search and data mining, WSDM '11, pages 167--176, New York, NY, USA, 2011. ACM.
[20]
R. Liebscher and R. K. Belew. Lexical dynamics and conceptual change: Analyses and implications for information retrieval. Cognitive Science Online, pages 46--57, 2003.
[21]
C. D. Manning, P. Raghavan, and H. Schtze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008.
[22]
M. Meinard. Introduction to Information Retrieval. Springer, 2007.
[23]
G. Mitra and L. Mitra. The handbook of news analytics in finance /. Wiley;, Hoboken, N.J.:, 2011.
[24]
M. Ng and Z. Huang. Temporal data mining with a case study as astronomical data analysis. Lecture Notes in Computer Sciences, pages 2--18, 1997.
[25]
K. Radinsky, E. Agichtein, E. Gabrilovich, and S. Markovitch. A word at a time: computing word relatedness using temporal semantic analysis. In Proceedings of the 20th international conference on World wide web, WWW '11, pages 337--346, New York, NY, USA, 2011. ACM.
[26]
D. Rafiei and A. Mendelzon. Similarity-based queries for time series data. SIGMOD Rec., 26(2):13--25, June 1997.
[27]
D. Rafiei and A. Mendelzon. Efficient retrieval of similar time series. In K. Tanaka, S. Ghandeharizadeh, and Y. Kambayashi, editors, Information Organization and Databases, volume 579 of The Springer International Series in Engineering and Computer Science, pages 75--89. Springer US, 2000.
[28]
R. P. Schumaker and H. Chen. Textual analysis of stock market prediction using breaking financial news: The azfin text system. ACM Trans. Inf. Syst., 27(2):12:1--12:19, Mar. 2009.
[29]
A. Singhal. Modern information retrieval: A brief overview. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 24(4):35--42, 2001.
[30]
K. Takahashi and M. Umano. Retrieval of similar time series with similarity degree of linguistic expressions for global trend and local features. In Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on, pages 1--8, june 2012.
[31]
M. Umano, M. Okamura, and K. Seta. Improved method for linguistic expression of time series with global trend and local features. In Fuzzy Systems, 2009. FUZZ-IEEE 2009. IEEE International Conference on, pages 1169--1174, aug. 2009.
[32]
B.-K. Yi, H. V. Jagadish, and C. Faloutsos. Efficient retrieval of similar time sequences under time warping. In Proceedings of the Fourteenth International Conference on Data Engineering, ICDE '98, pages 201--208, Washington, DC, USA, 1998. IEEE.
[33]
C. Zhai. Statistical Language Models for Information Retrieval. Synthesis Lectures on Human Language Technologies. Morgan & Claypool Publishers, 2008.

Cited By

View all
  • (2017)Transductive Event Classification through Heterogeneous NetworksProceedings of the 23rd Brazillian Symposium on Multimedia and the Web10.1145/3126858.3126893(285-292)Online publication date: 17-Oct-2017
  • (2017)A new retrieval method based on time series variation using field association termsMathematical Methods in the Applied Sciences10.1002/mma.471341:15(5780-5791)Online publication date: 14-Dec-2017
  • (2016)When time meets information retrievalJournal of Information Science10.1177/016555151560727742:6(725-747)Online publication date: 1-Dec-2016
  • Show More Cited By

Recommendations

Reviews

Simon Berkovich

This work is devoted to a novel information retrieval problem. In contrast to the common action of simple retrieving of time-stamped data, this work is aimed at the discovery of time-related patterns. The query presents a time series, and the retrieval task is to find relevant documents in a text collection of the same time period. This retrieval problem arises in many applications where there is a need to analyze text data in order to discover potential causal factors. For example, reporting a negative event in the news might affect a company's stock prices. The alleged causal relationship could be suspected as soon as a correlation between the two time series is revealed. Traditional information retrieval is aimed primarily at the matching of query terms and does not consider their dynamics. The presented technique measures the correlation between the time characterization of each term and the given sequence of events in the considered time series query. The established correlation could present a decisive factor among the other issues that are used for determining the ranking of text documents. Besides the most obvious stock prices example, the developed approach can also be applied to other types of time-stamped text collections to be correlated with various time series curves. This may be related to certain signals from web traffic or from physical sensors. An important aspect of the presented concept could be the consideration of time series problems in the information processing of the brain, like in the identification of musical tunes or the recognition of gait patterns. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICTIR '13: Proceedings of the 2013 Conference on the Theory of Information Retrieval
September 2013
148 pages
ISBN:9781450321075
DOI:10.1145/2499178
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

  • Findwise: Findwise AB
  • Google Inc.
  • Spinque: Spinque
  • Univ. of Copenhagen: University of Copenhagen
  • LARM: LARM Audio Research Archive
  • Royal School of Library and Information Science: Royal School of Library and Information Science
  • Yahoo! Labs

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 September 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Information Retrieval
  2. Text Stream
  3. Time Series Query

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICTIR '13
Sponsor:
  • Findwise
  • Spinque
  • Univ. of Copenhagen
  • LARM
  • Royal School of Library and Information Science

Acceptance Rates

ICTIR '13 Paper Acceptance Rate 11 of 51 submissions, 22%;
Overall Acceptance Rate 235 of 527 submissions, 45%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)2
Reflects downloads up to 20 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Transductive Event Classification through Heterogeneous NetworksProceedings of the 23rd Brazillian Symposium on Multimedia and the Web10.1145/3126858.3126893(285-292)Online publication date: 17-Oct-2017
  • (2017)A new retrieval method based on time series variation using field association termsMathematical Methods in the Applied Sciences10.1002/mma.471341:15(5780-5791)Online publication date: 14-Dec-2017
  • (2016)When time meets information retrievalJournal of Information Science10.1177/016555151560727742:6(725-747)Online publication date: 1-Dec-2016
  • (2014)FAQProceedings of the 3rd International Conference on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications - Volume 3610.5555/2999973.2999977(29-45)Online publication date: 24-Aug-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media