[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2723372.2735357acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

CE-Storm: Confidential Elastic Processing of Data Streams

Published: 27 May 2015 Publication History

Abstract

Data Stream Management Systems (DSMS) are crucial for modern high-volume/high-velocity data-driven applications, necessitating a distributed approach to processing them. In addition, data providers often require certain levels of confidentiality for their data, especially in cases of user-generated data, such as those coming out of physical activity/health tracking devices (i.e., our motivating application). This demonstration will showcase Synefo, an infrastructure that enables elastic scaling of DSMS operators, and CryptStream, a framework that provides confidentiality and access controls for data streams while allowing computation on untrusted servers, fused as CE-Storm. We will demonstrate both systems working in tandem and also visualize their behavior over time under different scenarios.

References

[1]
IBM System S. http://researcher.watson.ibm.com/researcher/view_group_subpage.php?id=2534.
[2]
SQLstream. http://www.sqlstream.com/.
[3]
Summingbird. http://github.com/twitter/summingbird.
[4]
D. J. Abadi et al. Aurora: A new model and architecture for data stream management. The VLDB Journal, 12(2):120--139, Aug. 2003.
[5]
Y. Ahmad et al. Distributed operation in the borealis stream processing engine. In Proc. of ACM SIGMOD, pages 882--884, 2005.
[6]
D. T. T. Anh and A. Datta. Streamforce: outsourcing access control enforcement for stream data to the clouds. In CODASPY, pages 13--24. ACM, 2014.
[7]
A. Arasu et al. Stream: The stanford data stream management system. Technical Report 2004--20, Stanford InfoLab, 2004.
[8]
B. Babcock, S. Babu, R. Motwani, and M. Datar. Chain: Operator scheduling for memory minimization in data stream systems. In Proc. of ACM SIGMOD, pages 253--264, 2003.
[9]
M. Balazinska et al. Fault-tolerance in the borealis distributed stream processing system. ACM Trans. Database Syst., 33(1):3:1--3:44, Mar. 2008.
[10]
R. Castro Fernandez et al. Integrating scale out and fault tolerance in stream processing using operator state management. In Proc. of ACM SIGMOD, pages 725--736, 2013.
[11]
U. Çetintemel et al. S-store: A streaming newsql system for big velocity applications. PVLDB, 7(13):1633--1636, 2014.
[12]
H. V. Jagadish et al. Big data and its technical challenges. Comm. of the ACM, 57(7):86--94, Jul 2014.
[13]
L. A. Moakar, A. Labrinidis, and P. K. Chrysanthis. Adaptive class-based scheduling of continuous queries. In Proc. of IEEE SMDB Workshop, pages 1--6, 2012.
[14]
R. Nehme, E. A. Rundensteiner, and E. Bertino. A security punctuation framework for enforcing access control on streaming data. In Proc. of IEEE ICDE Conference, pages 406--415, 2008.
[15]
T. N. Pham, P. K. Chrysanthis, and A. Labrinidis. Self-managing load shedding for data stream management systems. In Proc. of SMDB Workshop, pages 1--7, 2013.
[16]
T. N. Pham, L. A. Moakar, P. K. Chrysanthis, and A. Labrinidis. Dilos: A dynamic integrated load manager and scheduler for continuous queries. In Proc. of SMDB Workshop, pages 10--15, 2011.
[17]
R. A. Popa et al. CryptDB: protecting confidentiality with encrypted query processing. In Proc. of ACM SOSP, pages 85--100, 2011.
[18]
M. A. Sharaf et al. Algorithms and metrics for processing multiple heterogeneous continuous queries. ACM Trans. Database Syst., 33(2):5.1--5.44, 2008.
[19]
N. Tatbul, U. Çetintemel, and S. Zdonik. Staying fit: Efficient load shedding techniques for distributed stream processing. In Proc. of VLDB, pages 159--170, 2007.
[20]
N. Tatbul et al. Load shedding in a data stream manager. In Proc. of VLDB, pages 309--320, 2003.
[21]
A. Toshniwal et al. Storm@twitter. In Proc. of ACM SIGMOD Conference, pages 147--156, 2014.
[22]
W. Yingjun and T. Kian-Lee. Chronostream: Elastic stateful stream computation in the cloud. In Proc. of IEEE ICDE Conference, 2015.
[23]
M. Zaharia, T. Das, H. Li, S. Shenker, and I. Stoica. Discretized streams: An efficient and fault-tolerant model for stream processing on large clusters. In Proc. of HotCloud Conference, pages 423--438, 2012.

Cited By

View all
  • (2021)Improving Stream Load Balance through Shedding2021 IEEE 37th International Conference on Data Engineering Workshops (ICDEW)10.1109/ICDEW53142.2021.00028(120-126)Online publication date: Apr-2021
  • (2020)SPEAr: Expediting Stream Processing with Accuracy Guarantees2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00100(1105-1116)Online publication date: Apr-2020
  • (2019)Processing of Aggregate Continuous Queries in a Distributed EnvironmentReal-Time Business Intelligence and Analytics10.1007/978-3-030-24124-7_4(45-62)Online publication date: 11-Oct-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
May 2015
2110 pages
ISBN:9781450327589
DOI:10.1145/2723372
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 May 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. confidentiality
  2. continuous queries
  3. distributed data stream management system
  4. elasticity

Qualifiers

  • Research-article

Funding Sources

Conference

SIGMOD/PODS'15
Sponsor:
SIGMOD/PODS'15: International Conference on Management of Data
May 31 - June 4, 2015
Victoria, Melbourne, Australia

Acceptance Rates

SIGMOD '15 Paper Acceptance Rate 106 of 415 submissions, 26%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Improving Stream Load Balance through Shedding2021 IEEE 37th International Conference on Data Engineering Workshops (ICDEW)10.1109/ICDEW53142.2021.00028(120-126)Online publication date: Apr-2021
  • (2020)SPEAr: Expediting Stream Processing with Accuracy Guarantees2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00100(1105-1116)Online publication date: Apr-2020
  • (2019)Processing of Aggregate Continuous Queries in a Distributed EnvironmentReal-Time Business Intelligence and Analytics10.1007/978-3-030-24124-7_4(45-62)Online publication date: 11-Oct-2019
  • (2019)Accelerating Real-Time Tracking Applications over Big Data Stream with Constrained SpaceDatabase Systems for Advanced Applications10.1007/978-3-030-18576-3_1(3-18)Online publication date: 24-Apr-2019
  • (2018)Concept-Driven Load Shedding: Reducing Size and Error of Voluminous and Variable Data Streams2018 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2018.8622265(418-427)Online publication date: Dec-2018
  • (2018)Rethinking elastic online scheduling of big data streaming applications over high-velocity continuous data streamsThe Journal of Supercomputing10.1007/s11227-017-2151-274:2(615-636)Online publication date: 1-Feb-2018
  • (2018)Improving Large-Scale Fingerprint-Based Queries in Distributed InfrastructureComputational Science – ICCS 201810.1007/978-3-319-93713-7_36(425-433)Online publication date: 11-Jun-2018
  • (2017)Fast-FFAInternational Journal of Bio-Inspired Computation10.1504/IJBIC.2017.08671710:3(205-217)Online publication date: 1-Jan-2017
  • (2017)A holistic view of stream partitioning costsProceedings of the VLDB Endowment10.14778/3137628.313763910:11(1286-1297)Online publication date: 1-Aug-2017
  • (2017)Supporting Real-Time Analytic Queries in Big and Fast Data EnvironmentsDatabase Systems for Advanced Applications10.1007/978-3-319-55699-4_29(477-493)Online publication date: 22-Mar-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media