Abstract
Sequence pattern detection over streaming data has many real world applications. Most of the present work is aimed to process sequence queries over single data stream. Situations where streaming data arrive from multiple sources have not been explored much. In traditional approaches a single centralized machine handles and processes sequence queries over multiple data streams. While running sequence queries on a single server, even though many of the events in data streams do not lead to successful pattern detection they are still handled and processed by the server. This consumes precious network bandwidth, server’s computing resources and precious time. In this paper we focus on sequence pattern detection, where patterns are defined on chains of events that arrive from multiple distributed data streams. We propose a three layer distributed framework to avoid unnecessary event processing by the server, and to efficiently process sequence queries to detect sequence patterns relying upon chains of events. The bottom layer of data sources sends continuous data streams to the middle layer, which then performs pattern detection locally, and on the basis of the feedback received from the top layer of global server, sends events to the global server to detect complete patterns. Our present work is aimed to detect sequence patterns over multiple data streams, but, our proposed model can be extended to many other areas of distributed stream processing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Lu, H., Zhou, Y., Haustad, J.: Continuous skyline monitoring over distributed data streams. In: Gertz, M., Ludäscher, B. (eds.) SSDBM 2010. LNCS, vol. 6187, pp. 565–583. Springer, Heidelberg (2010)
Brenna, L., Gehrke, J., Hong, M., Johansen, D.: Distributed event stream processing with non-deterministic finite automata. In: Proceedings of the Third ACM International Conference on Distributed Event-Based Systems, p. 3. ACM (2009)
Golab, L., Özsu, M.T.: Issues in data stream management. ACM Sigmod Rec. 32(2), 5–14 (2003)
Liu, M., Li, M., Golovnya, D., Rundensteiner, E.A., Claypool, K.: Sequence pattern query processing over out-of-order event streams. In: 2009 IEEE 25th International Conference on Data Engineering. ICDE 2009, pp. 784–795. IEEE (2009)
Stonebraker, M., Çetintemel, U., Zdonik, S.: The 8 requirements of real-time stream processing. ACM SIGMOD Rec. 34(4), 42–47 (2005)
Kawashima, H., Kitagawa, H., Li, X.: Complex event processing over uncertain data streams. In: 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), pp. 521–526. IEEE (2010)
Ramakrishnan, R., Cheng, M., Livny, M., Seshadri, P.: What’s next? sequence queries. In: Proceedings of International Conference Management of Data. Citeseer (1994)
Wang, Y., Cao, K., Zhang, X.: Complex event processing over distributed probabilistic event streams. Comput. Math. Appl. 66(10), 1808–1821 (2013)
Jiang, Q., Chakravarthy, S.: Scheduling strategies for processing continuous queries over streams. In: Williams, H., MacKinnon, L.M. (eds.) BNCOD 2004. LNCS, vol. 3112, pp. 16–30. Springer, Heidelberg (2004)
Sharaf, M.A., Labrinidis, A., Chrysanthis, P.K.: Scheduling continuous queries in data stream management systems. Proc. VLDB Endowment 1(2), 1526–1527 (2008)
Mani, M.: Efficient event stream processing: handling ambiguous events and patterns with negation. In: Xu, J., Yu, G., Zhou, S., Unland, R. (eds.) DASFAA Workshops 2011. LNCS, vol. 6637, pp. 415–426. Springer, Heidelberg (2011)
Schultz-Møller, N.P., Migliavacca, M., Pietzuch, P.: Distributed complex event processing with query rewriting. In: Proceedings of the Third ACM International Conference on Distributed Event-Based Systems, p. 4. ACM (2009)
Agrawal, J., Diao, Y., Gyllstrom, D., Immerman, N.: Efficient pattern matching over event streams. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 147–160. ACM (2008)
Babcock, B., Babu, S., Datar, M., Motwani, R., Thomas, D.: Operator scheduling in data stream systems. VLDB J. Int. J. Very Large Data Bases 13(4), 333–353 (2004)
Seshadri, P., Livny, M., Ramakrishnan, R.: Sequence query processing. In: ACM SIGMOD Record, vol. 23, pp. 430–441. ACM (1994)
Wu, J., Tan, K.-L., Zhou, Y.: QoS-oriented multi-query scheduling over data streams. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds.) DASFAA 2009. LNCS, vol. 5463, pp. 215–229. Springer, Heidelberg (2009)
Diao, Y., Immerman, N., Gyllstrom, D.: Sase+: An Agile Language for Kleene Closure Over Event Streams. ACM Press, New York (2007)
Law, Y.N., Wang, H., Zaniolo, C.: Query languages and data models for database sequences and data streams. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases-Volume 30, VLDB Endowment, pp. 492–503 (2004)
Wu, E., Diao, Y., Rizvi, S.: High-performance complex event processing over streams. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pp. 407–418. ACM (2006)
Sadoghi, M., Singh, H., Jacobsen, H.A.: Towards highly parallel event processing through reconfigurable hardware. In: Proceedings of the Seventh International Workshop on Data Management on New Hardware, pp. 27–32. ACM (2011)
Mei, Y., Madden, S.: Zstream: a cost-based query processor for adaptively detecting composite events. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, pp. 193–206. ACM (2009)
Balkesen, C., Dindar, N., Wetter, M., Tatbul, N.: Rip: Run-based intra-query parallelism for scalable complex event processing. In: Proceedings of the 7th ACM International Conference on Distributed Event-Based Systems, pp. 3–14. ACM (2013)
Hirzel, M.: Partition and compose: Parallel complex event processing. In: Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems, pp. 191–200. ACM (2012)
Zhou, Y., Ma, C., Guo, Q., Shou, L., Chen, G.: Sequence pattern matching over time-series data with temporal uncertainty. In: EDBT, pp. 205–216 (2014)
Leghari, A.K., Wolf, M., Zhou, Y.: Efficient pattern detection over a distributed framework. In: Castellanos, M., Dayal, U., Pedersen, T.B., Tatbul, N. (eds.) BIRTE 2013 and 2014. LNBIP, vol. 206, pp. 133–149. Springer, Heidelberg (2015)
Wu, J., Zhou, Y., Aberer, K., Tan, K.L.: Towards integrated and efficient scientific sensor data processing: a database approach. In: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, pp. 922–933. ACM (2009)
http://www.infochimps.com/. 03 December 2014
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Leghari, A.K., Cao, J., Zhou, Y. (2015). Distributed Sequence Pattern Detection Over Multiple Data Streams. In: Tadeusz, M., Valduriez, P., Bellatreche, L. (eds) Advances in Databases and Information Systems. ADBIS 2015. Lecture Notes in Computer Science(), vol 9282. Springer, Cham. https://doi.org/10.1007/978-3-319-23135-8_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-23135-8_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23134-1
Online ISBN: 978-3-319-23135-8
eBook Packages: Computer ScienceComputer Science (R0)