[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/564691.564734acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Continually evaluating similarity-based pattern queries on a streaming time series

Published: 03 June 2002 Publication History

Abstract

In many applications, local or remote sensors send in streams of data, and the system needs to monitor the streams to discover relevant events/patterns and deliver instant reaction correspondingly. An important scenario is that the incoming stream is a continually appended time series, and the patterns are time series in a database. At each time when a new value arrives (called a time position), the system needs to find, from the database, the nearest or near neighbors of the incoming time series up to the time position. This paper attacks the problem by using Fast Fourier Transform (FFT) to efficiently find the cross correlations of time series, which yields, in a batch mode, the nearest and near neighbors of the incoming time series at many time positions. To take advantage of this batch processing in achieving fast response time, this paper uses prediction methods to predict future values. FFT is used to compute the cross correlations of the predicted series (with the values that have already arrived) and the database patterns, and to obtain predicted distances between the incoming time series at many future time positions and the database patterns. When the actual data value arrives, the prediction error together with the predicted distances is used to filter out patterns that are not possible to be the nearest or near neighbors, which provides fast responses. Experiments show that with reasonable prediction errors, the performance gain is significant.

References

[1]
R. Agrawal, C. Faloutsos, and A. N. Swami. Efficient similarity search in sequence databases. In Proceedings of the 4th FODO, pages 69-84, 1993.
[2]
R. Agrawal, K.-I. Lin, H. S. Sawhney, and K. Shim. Fast similarity search in the presence of noise, scaling, and translation in time-series databases. In The VLDB Journal, pages 490-501, 1995.
[3]
S. Babu and J. Widom. Continuous queries over data streams. In SIGMOD Record, Sept. 2001.
[4]
S. Berchtold and D. A. Keim. High-dimensional index structures, database support for next decade's applications (tutorial). In SIGMOD Conference, 1998.
[5]
C. Burrus and T. Parks. DFT/FFT and Convolution Algorithms. John Wiley and Sons, 1985.
[6]
J. Chen, D. J. DeWitt, and J. F. Naughton. Design and evaluation of alternative selection placement strategies in optimizing continuous queries. In ICDE Conference, 2002.
[7]
J. Chen, D. J. DeWitt, F. Tian, and Y. Wang. NiagaraCQ: a scalable continuous query system for Internet databases. In Proc. of the ACM SIGMOD Conference, pages 379-390, 2000.
[8]
K. K. W. Chu and M. H. Wong. Fast time-series searching with scaling and shifting. In Proc. of the 18th ACM PODS 1999, Philadelphia, pages 237-248, 1999.
[9]
C. Faloutsos, M. Ranganathan, and Y. Manolopoulos. Fast subsequence matching in time-series databases. In ACM SIGMOD Conference, pages 419-429, 1994.
[10]
M. Frigo and S. G. Johnson. FFTW: C subroutine library for computing the Discrete Fourier Transform (DFT). On-line. http://www.fftw.org/, 2001.
[11]
T. V. Gestel, J. Suykens, D.-E. Baestaens, A. Lambrechts, G. Lanckriet, B. Vandaele, D. B. Moor, and J. Vandewalle. Financial time series prediction using least squares support vector machines within the evidence framework. IEEE Transactions on Neural Networks, 12(4):809-821, 2001.
[12]
L. Gyorfi, G. Lugosi, and G. Morvai. A simple randomized algorithm for sequential prediction of ergodic time series. IEEE Transactions on Information Theory, 45(7):2642-2650, 1999.
[13]
Y.-W. Huang and P. S. Yu. Adaptive query processing for time-series data. In Proceedings of the 5th International Conference of Knowledge Discovery and Data Mining, pages 282-286, 1999.
[14]
T. Kahveci and A. K. Singh. Variable length queries for time series data. In ICDE 01, pages 273-282, 2001.
[15]
E. J. Keogh, K. Chakrabarti, S. Mehrotra, and M. J. Pazzani. Locally adaptive dimensionality reduction for indexing large time series databases. In Proceedings of ACM SIGMOD Conference on Management of Data, pages 151-162, 2001.
[16]
I. Kim and S.-R. Lee. A fuzzy time series prediction method based on consecutive values. In Fuzzy Systems Conference Proceedings, volume 2, pages 703-707, 1999.
[17]
L. Liu, C. Pu, R. S. Barga, and T. Zhou. Differential evaluation of continual queries. In International Conference on Distributed Computing Systems, pages 458-465, 1996.
[18]
L. Liu, C. Pu, and W. Tang. Continual queries for Internet scale event-driven information delivery. IEEE TKDE, 11(4):610-628, 1999.
[19]
S. Madden and M. J. Franklin. Fjording the stream: An architecture for queries over streaming sensor data. In ICDE Conference, 2002.
[20]
A. Oppenheim and R. Schafer. Digital Signal Processing. Prentice-Hall, Inc., 1975.
[21]
D. S. Parker, R. R. Muntz, and H. L. Chau. The tangram stream query processing system. In ICDE Conference, 1989.
[22]
C.-S. Perng, H. Wang, S. R. Zhang, and D. S. Parker. Landmarks: a new model for similarity-based pattern querying in time series databases. In ICDE, pages 33-42, 2000.
[23]
B. Plale and K. Schwan. Optimizations enabled by a relational data model view to querying data streams. In Proc. of 15th International Parallel and Distributed Processing Symposium, 2001.
[24]
S. Policker and A. Geva. A new algorithm for time series prediction by temporal fuzzy clustering. In Proceedings. 15th International Conference on Pattern Recognition, volume 2, pages 728-731, 2000.
[25]
A. D. Poularikas, editor. The transforms and applications handbook. CRC Press LLC, 2000.
[26]
D. Rafiei and A. Mendelzon. Similarity-based queries for time series data. In Proceedings of the ACM SIGMOD International Conference on Management of Data, pages 13-25, 1997.
[27]
D. Rafiei and A. O. Mendelzon. Querying time series data based on similarity. IEEE TKDE, 12(5):675-693, 2000.
[28]
H. Shatkay and S. B. Zdonik. Approximate queries and representations for large data sequences. In ICDE, pages 536-545, 1996.
[29]
D. Terry, D. Goldberg, D. Nichols, and B. Oki. Continuous queries over append-only databases. In Proc. of the ACM SIGMOD Conf. on Management of Data, pages 321-330, 1992.
[30]
L. Wang, K. K. Teo, and Z. Lin. Predicting time series with Wavelet packet neural networks. Proc. International Joint Conference on Neural Networks, 3:1593-1597, 2001.
[31]
S. Winograd. Some bilinear forms whose multiplicative complexity depends on the field of constants. Mathematical Systems Theory, 10:169-180, 1977.

Cited By

View all
  • (2023)Correlation Joins over Time Series Data Streams Utilizing Complementary Dimension Reduction and TransformationProceedings of the ACM on Management of Data10.1145/36267221:4(1-26)Online publication date: 12-Dec-2023
  • (2020)Multi-resolution Representation for Streaming Time Series RetrievalInternational Journal of Pattern Recognition and Artificial Intelligence10.1142/S021800142150019135:06(2150019)Online publication date: 23-Dec-2020
  • (2019)A Novel Similarity Search Approach for Streaming Time SeriesJournal of Physics: Conference Series10.1088/1742-6596/1302/2/0220841302(022084)Online publication date: 3-Sep-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '02: Proceedings of the 2002 ACM SIGMOD international conference on Management of data
June 2002
654 pages
ISBN:1581134975
DOI:10.1145/564691
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 June 2002

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

SIGMOD/PODS02

Acceptance Rates

SIGMOD '02 Paper Acceptance Rate 42 of 240 submissions, 18%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)2
Reflects downloads up to 05 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Correlation Joins over Time Series Data Streams Utilizing Complementary Dimension Reduction and TransformationProceedings of the ACM on Management of Data10.1145/36267221:4(1-26)Online publication date: 12-Dec-2023
  • (2020)Multi-resolution Representation for Streaming Time Series RetrievalInternational Journal of Pattern Recognition and Artificial Intelligence10.1142/S021800142150019135:06(2150019)Online publication date: 23-Dec-2020
  • (2019)A Novel Similarity Search Approach for Streaming Time SeriesJournal of Physics: Conference Series10.1088/1742-6596/1302/2/0220841302(022084)Online publication date: 3-Sep-2019
  • (2016)Streaming similarity self-joinProceedings of the VLDB Endowment10.14778/2977797.29778059:10(792-803)Online publication date: 1-Jun-2016
  • (2016)Optimizing Cost of Continuous Overlapping Queries over Data Streams by Filter AdaptionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2016.251654128:5(1258-1271)Online publication date: 1-May-2016
  • (2014)Similarity search in streaming time series with the support of Skyline indexInternational Journal of Business Intelligence and Data Mining10.1504/IJBIDM.2014.0628829:1(31-51)Online publication date: 1-Jun-2014
  • (2014)Safe MBR-transformation in similar sequence matchingInformation Sciences10.1016/j.ins.2014.02.127270(28-40)Online publication date: Jun-2014
  • (2013)Local correlation detection with linearity enhancement in streaming dataProceedings of the 22nd ACM international conference on Information & Knowledge Management10.1145/2505515.2505746(309-318)Online publication date: 27-Oct-2013
  • (2013)An effective method to analyze variations of high-dimensional patterns over medical streams2013 IEEE International Conference on Bioinformatics and Biomedicine10.1109/BIBM.2013.6732597(33-39)Online publication date: Dec-2013
  • (2012)Effective and Efficient Shape-Based Pattern Detection over Streaming Time SeriesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2010.22324:2(265-278)Online publication date: 1-Feb-2012
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media