research-article

Delta: scalable data dissemination under capacity constraints

Published: 01 December 2013

Abstract

In content-based publish-subscribe (pub/sub) systems, users express their interests as queries over a stream of publications. Scaling content-based pub/sub to very large numbers of subscriptions is challenging: users want low latency, that is, fast delivery of subscription results, while the pub/sub system provider is mostly interested in scalability, i.e., the ability to serve large numbers of subscribers with low computational resource utilization.
We present a novel approach for scalable content-based pub/sub in the presence of constraints on the available CPU and network resources, implemented within our pub/sub system Delta. We achieve scalability by off-loading some subscriptions from the pub/sub server and leveraging view-based query rewriting to feed these subscriptions from the data accumulated in others. Our main contribution is a novel algorithm for organizing views in a multi-level dissemination network, exploiting view-based rewriting and powerful linear programming capabilities to scale to many views, respect capacity constraints, and minimize latency. The efficiency and effectiveness of our algorithm are confirmed through extensive experiments and a large deployment in a WAN.
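
To make the idea of a multi-level dissemination network under capacity constraints concrete, the following is a minimal toy sketch and not Delta's algorithm: it replaces the linear programming step with a simple greedy placement, and it models each subscription as a set of conjunctive predicates rather than a full query. All names (`can_feed`, `build_tree`, `fanout`) are hypothetical. A subscription can be fed from another one when the feeder is less restrictive, and each node may serve at most `fanout` children; tree depth stands in for latency.

```python
from collections import deque


def can_feed(parent, child):
    """Assumption: a subscription is a frozenset of conjunctive predicates.
    `parent` can feed `child` if parent's predicates are a subset of child's,
    so every result of `child` is also a result of `parent`."""
    return parent <= child


def build_tree(subscriptions, fanout):
    """Greedy placement (illustrative only): attach each subscription to the
    shallowest node that can feed it and still has spare capacity."""
    root = frozenset()            # the pub/sub server sees every publication
    children = {root: []}
    depth = {root: 0}
    # Place the most general subscriptions first so they can feed the others.
    for sub in sorted(set(subscriptions), key=lambda s: (len(s), sorted(s))):
        queue = deque([root])     # breadth-first search finds the shallowest slot
        while queue:
            node = queue.popleft()
            if len(children[node]) < fanout:
                children[node].append(sub)
                children[sub] = []
                depth[sub] = depth[node] + 1
                break
            # Node is full: descend only into children able to feed `sub`.
            queue.extend(c for c in children[node] if can_feed(c, sub))
        else:
            raise ValueError(f"no feasible parent for {sorted(sub)}")
    return children, depth


subs = [frozenset(s) for s in ({"a"}, {"b"}, {"a", "b"}, {"a", "c"}, {"a", "b", "c"})]
children, depth = build_tree(subs, fanout=2)
for sub in subs:
    print(sorted(sub), "depth", depth[sub])
```

With a fan-out of 2, the server feeds only two subscriptions directly and the remaining ones are fed by those, which is the off-loading effect the abstract describes. A greedy pass like this can fail or produce deeper trees than necessary, which is one reason an optimization-based formulation is attractive.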

Published In

Proceedings of the VLDB Endowment, Volume 7, Issue 4
December 2013
112 pages
ISSN: 2150-8097

Publisher

VLDB Endowment
