Load shedding for multi-way stream joins based on arrival order patterns

Tae-Hyung Kwon¹,
Ki Yong Lee² &
Myoung Ho Kim³


Abstract

We address the problem of load shedding for continuous multi-way join queries over multiple data streams. When the arrival rates of tuples from data streams exceed the system capacity, a load shedding algorithm drops some subset of input tuples to avoid system overloads. To decide which tuples to drop among the input tuples, most existing load shedding algorithms determine the priority of each input tuple based on the frequency or some historical statistics of its join attribute value, and then drop tuples with the lowest priority. However, those value-based algorithms cannot determine the priorities of tuples properly in environments where join attribute values are unique and each join attribute value occurs at most once in each data stream. In this paper, we propose a load shedding algorithm specifically designed for such environments. The proposed load shedding algorithm determines the priority of each tuple based on the order of streams in which its join attribute value appears, rather than its join attribute value itself. Consequently, the priorities of tuples can be determined effectively in environments where join attribute values are unique and do not repeat. The experimental results show that the proposed algorithm outperforms the existing algorithms in such environments in terms of effectiveness and efficiency.
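The abstract gives the idea but not the mechanics, so the following is only a minimal illustrative sketch of order-based prioritization, not the authors' algorithm. It assumes each join attribute value (key) appears at most once per stream, records the order of streams in which each key has appeared so far, and estimates a tuple's priority as the fraction of previously seen keys with the same arrival order prefix that went on to appear in every stream. The names `ArrivalOrderShedder`, `observe`, `priority`, and `shed`, and the completion-ratio estimate itself, are hypothetical choices made for this example.

```python
class ArrivalOrderShedder:
    """Hypothetical sketch: rank tuples by the order of streams in which
    their join key has appeared, not by the key value itself."""

    def __init__(self, num_streams):
        self.num_streams = num_streams
        self.pattern_of = {}   # key -> tuple of stream ids seen so far, in arrival order
        self.seen = {}         # arrival-order prefix -> number of keys that reached it
        self.completed = {}    # arrival-order prefix -> number of those keys that joined

    def observe(self, key, stream_id):
        """Record one arrival and return the key's updated arrival-order pattern.

        Assumption: a key occurs at most once in each stream.
        """
        pattern = self.pattern_of.get(key, ()) + (stream_id,)
        self.pattern_of[key] = pattern
        self.seen[pattern] = self.seen.get(pattern, 0) + 1
        if len(pattern) == self.num_streams:
            # The key appeared in every stream, so credit each prefix of its pattern.
            for i in range(1, len(pattern) + 1):
                prefix = pattern[:i]
                self.completed[prefix] = self.completed.get(prefix, 0) + 1
            del self.pattern_of[key]
        return pattern

    def priority(self, pattern):
        """Estimated fraction of keys with this prefix that completed the join."""
        n = self.seen.get(pattern, 0)
        return self.completed.get(pattern, 0) / n if n else 0.0


def shed(shedder, buffered, n_drop):
    """Drop the n_drop buffered (key, pattern) entries with the lowest priority."""
    ranked = sorted(buffered, key=lambda entry: shedder.priority(entry[1]))
    return ranked[n_drop:]
```

Because the statistics are keyed on stream order rather than on attribute values, the estimate stays meaningful even when no key ever repeats, which is the setting the abstract targets. A real system would also need windowing and expiry of stale patterns, which this sketch omits.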



Acknowledgement

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MEST) (No. 2010-0018865).

Author information

Authors and Affiliations

Command and Control Directorate, Systems Division, ROK Air Force, P.O. Box, 309, Sinjang-dong, Pyeongtaek-si, Gyeonggi-do, Republic of Korea
Tae-Hyung Kwon
Department of Computer Science, Sookmyung Women’s University, 52 Hyochangwon-gil, Yongsan-gu, Seoul, 140-742, Republic of Korea
Ki Yong Lee
Department of Computer Science, Korea Advanced Institute of Science and Technology (KAIST), 373-1 Guseong-Dong, Yuseong-Gu, Daejeon, 305-701, Republic of Korea
Myoung Ho Kim

Corresponding author

Correspondence to Ki Yong Lee.

About this article

Cite this article

Kwon, TH., Lee, K.Y. & Kim, M.H. Load shedding for multi-way stream joins based on arrival order patterns. J Intell Inf Syst 37, 245–265 (2011). https://doi.org/10.1007/s10844-010-0138-z

Received: 23 December 2009
Revised: 04 October 2010
Accepted: 05 October 2010
Published: 15 October 2010
Issue Date: October 2011
DOI: https://doi.org/10.1007/s10844-010-0138-z
