[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1989323.1989416acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

E-Cube: multi-dimensional event sequence analysis using hierarchical pattern query sharing

Published: 12 June 2011 Publication History

Abstract

Many modern applications, including online financial feeds, tag-based mass transit systems and RFID-based supply chain management systems transmit real-time data streams. There is a need for event stream processing technology to analyze this vast amount of sequential data to enable online operational decision making. Existing techniques such as traditional online analytical processing (OLAP) systems are not designed for real-time pattern-based operations, while state-of-the-art Complex Event Processing (CEP) systems designed for sequence detection do not support OLAP operations. We propose a novel E-Cube model which combines CEP and OLAP techniques for efficient multi-dimensional event pattern analysis at different abstraction levels. Our analysis of the interrelationships in both concept abstraction and pattern refinement among queries facilitates the composition of these queries into an integrated E-Cube hierarchy. Based on this E-Cube hierarchy, strategies of drill-down (refinement from abstract to more specific patterns) and of roll-up (generalization from specific to more abstract patterns) are developed for the efficient workload evaluation. Our proposed execution strategies reuse intermediate results along both the concept and the pattern refinement relationships between queries. Based on this foundation, we design a cost-driven adaptive optimizer called Chase, that exploits the above reuse strategies for optimal E-Cube hierarchy execution. Our experimental studies comparing alternate strategies on a real world financial data stream under different workload conditions demonstrate the superiority of the Chase method. In particular, our Chase execution in many cases performs ten fold faster than the state-of-the art strategy for real stock market query workloads.

References

[1]
I. inetats. stock trade traces. http://www.inetats.com/.
[2]
K. S. Candan,W.-P. Hsiung, S. Chen, J. Tatemura, and D. Agrawal. AFilter: Adaptable XML filtering with prefix-caching and suffix-clustering. In VLDB, pages 559--570, 2006.
[3]
S. Chakravarthy, V. Krishnaprasad, E. Anwar, and S.-K. Kim. Composite events for active databases: Semantics, contexts and detection. In VLDB, pages 606--617, 1994.
[4]
S. Chaudhuri and U. Dayal. An overview of data warehousing and OLAP technology. SIGMOD Record, 26(1):65--74, 1997.
[5]
A. J. Demers, J. Gehrke, B. Panda, M. Riedewald, V. Sharma, and W. M.White. Cayuga: A general purpose event monitoring system. In CIDR, pages 412--422, 2007.
[6]
J. Edmonds. Optimum branchings. In J. Research of the National Bureau of Standards, pages 233--240., 1967.
[7]
S. Finkelstein. Common expression analysis in database applications. In SIGMOD, 1982.
[8]
H. N. Gabow, Z. Galil, T. H. Spencer, and R. E. Tarjan. Efficient algorithms for finding minimum spanning trees in undirected and directed graphs. Combinatorica, 6(2):109--122, 1986.
[9]
B. Gedik, K.-L. Wu, P. S. Yu, and L. Liu. Adaptive load shedding for windowed stream joins. In CIKM, pages 171--178, 2005.
[10]
H. Gonzalez, J. Han, and X. Li. Flowcube: Constructuing RFID FlowCubes for multi-dimensional analysis of commodity flows. In VLDB, pages 834--845, 2006.
[11]
A. Gupta, V. Harinarayan, and D. Quass. Aggregate-query processing in data warehousing environments. In VLDB, pages 358--369, 1995.
[12]
C. Gupta, S. Wang, I. Ari, M. C. Hao, U. Dayal, A. Mehta, M. Marwah, and R. K. Sharma. Chaos: A data stream analysis architecture for enterprise applications. In CEC, pages 33--40, 2009.
[13]
J. Han, Y. Cai, and N. Cercone. Knowledge discovery in databases: An attribute-oriented approach. In VLDB, pages 547--559, 1992.
[14]
J. Han, Y. Chen, G. Dong, J. Pei, B. W. Wah, J.Wang, and Y. D. Cai. Stream Cube: An architecture for multi-dimensional analysis of data streams. Distributed and Parallel Databases, 18(2):173--197, 2005.
[15]
V. Harinarayan, A. Rajaraman, and J. D. Ullman. Implementing data cubes efficiently. In SIGMOD, pages 205--216, 1996.
[16]
M. Hong, M. Riedewald, C. Koch, J. Gehrke, and A. J. Demers. Rule-based multi-query optimization. In EDBT, pages 120--131, 2009.
[17]
S. Krishnamurthy, C.Wu, and M. J. Franklin. On-the-fly sharing for streamed aggregation. In SIGMOD, pages 623--634, 2006.
[18]
J. Li, D. Maier, K. Tufte, V. Papadimos, and P. A. Tucker. Semantics and evaluation techniques for window aggregates in data streams. In SIGMOD, pages 311--322, 2005.
[19]
B. Liu, Y. Zhu, and E. A. Rundensteiner. Run-time operator state spilling for memory intensive long-running queries. In SIGMOD, pages 347--358, 2006.
[20]
M. Liu, M. Li, D. Golovnya, E. A. Rundensteiner, and K. T. Claypool. Sequence pattern query processing over out-of-order event streams. In ICDE, pages 784--795, 2009.
[21]
M. Liu, E. Rundensteiner, K. Greenfield, C. Gupta, S. Wang, I. Ari, and A. Mehta. E-cube: Multi-dimensional event sequence processing using concept and pattern hierarchies. In ICDE, pages 1097--1100, 2010.
[22]
M. Liu, E. Rundensteiner, K. Greenfield, C. Gupta, S. Wang, I. Ari, and A. Mehta. High-performance nested cep query processing over event streams. In ICDE, 2011.
[23]
M. Liu, E. Rundensteiner, K. Greenfield, C. Gupta, S. Wang, I. Ari, and A. Mehta. E-cube: Multi-dimensional event sequence processing using concept and pattern hierarchies. Technical Report WPI-CS-TR-09-08.
[24]
E. Lo, B. Kao, W.-S. Ho, S. D. Lee, C. K. Chui, and D. W. Cheung. OLAP on sequence data. In SIGMOD Conference, pages 649--660, 2008.
[25]
Y. Mei and S. Madden. Zstream: a cost-based query processor for adaptively detecting composite events. In SIGMOD, pages 193--206, 2009.
[26]
P. Roy, S. Seshadri, S. Sudarshan, and S. Bhobe. Efficient and extensible algorithms for multi query optimization. In SIGMOD, pages 249--260, 2000.
[27]
T. K. Sellis. Multiple-query optimization. ACM Trans. Database Syst., 13(1):23--52, 1988.
[28]
S. Wang, E. A. Rundensteiner, S. Ganguly, and S. Bhatnagar. State-slice: New paradigm of multi-query optimization of window-based stream queries. In VLDB, pages 619--630, 2006.
[29]
E. Wu, Y. Diao, and S. Rizvi. High-performance complex event processing over streams. In SIGMOD, pages 407--418, 2006.
[30]
Y. Zhu, E. A. Rundensteiner, and G. T. Heineman. Dynamic plan migration for continuous queries over data streams. In SIGMOD, pages 431--442, 2004.

Cited By

View all

Index Terms

  1. E-Cube: multi-dimensional event sequence analysis using hierarchical pattern query sharing

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
    June 2011
    1364 pages
    ISBN:9781450306614
    DOI:10.1145/1989323
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 June 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. OLAP
    2. algorithm
    3. complex event processing
    4. optimization
    5. streaming

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS '11
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)21
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 30 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Temporal Graph CubeIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.327046035:12(13015-13030)Online publication date: 1-Dec-2023
    • (2022)COREProceedings of the VLDB Endowment10.14778/3538598.353861515:9(1951-1964)Online publication date: 27-Jul-2022
    • (2022)DARLINGProceedings of the VLDB Endowment10.14778/3494124.349413715:3(541-554)Online publication date: 4-Feb-2022
    • (2022)Gloria: Graph-based Sharing Optimizer for Event Trend AggregationProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526145(1122-1135)Online publication date: 10-Jun-2022
    • (2022)HYPERSONIC: A Hybrid Parallelization Approach for Scalable Complex Event ProcessingProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3517829(1093-1107)Online publication date: 10-Jun-2022
    • (2022)Online fleet monitoring with scalable event recognition and forecastingGeoInformatica10.1007/s10707-022-00465-226:4(613-644)Online publication date: 11-May-2022
    • (2021)A Formal Framework for Complex Event RecognitionACM Transactions on Database Systems10.1145/348546346:4(1-49)Online publication date: 8-Dec-2021
    • (2021)To Share, or not to Share Online Event Trend Aggregation Over Bursty Event StreamsProceedings of the 2021 International Conference on Management of Data10.1145/3448016.3452785(1452-1464)Online publication date: 9-Jun-2021
    • (2021)Realtime Bushfire Detection with Spatial-based Complex Event Processing2021 15th International Conference on Advanced Computing and Applications (ACOMP)10.1109/ACOMP53746.2021.00007(1-8)Online publication date: Nov-2021
    • (2020)Optimizing query answering under ontological constraintsProceedings of the VLDB Endowment10.14778/3402707.34027374:11(1004-1015)Online publication date: 3-Jun-2020
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media