[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Databases on Modern Networks: A Decade of Research That Now Comes into Practice

Published: 01 August 2023 Publication History

Abstract

Modern cloud networks are a fundamental pillar of data-intensive applications. They provide high-speed transaction (packet) rates and low overhead, enabling, for instance, truly scalable database designs. These networks, however, are fundamentally different from conventional ones. Arguably, the two key discerning technologies are RDMA and programmable network devices. Today, these technologies are not niche technologies anymore and are widely deployed across all major cloud vendors. The question is thus not if but how a new breed of data-intensive applications can benefit from modern networks, given the perceived difficulty in using and programming them. This tutorial addresses these challenges by exposing how the underlying principles changed as the network evolved and by presenting the new system design opportunities they opened. In the process, we also discuss several hard-earned lessons accumulated by making the transition first-hand.

References

[1]
Wei Bai, Shanim Sainul Abdeen, Ankit Agrawal, Krishan Kumar Attre, Paramvir Bahl, et al. 2023. Empowering Azure Storage with RDMA. In NSDI.
[2]
C. Binnig, A. Crotty, A. Galakatos, T. Kraska, and E. Zamanian. 2016. The End of Slow Networks: It's Time for a Redesign. In PVLDB.
[3]
M. Blöcher, T. Ziegler, C. Binnig, and P. Eugster. 2018. Boosting Scalable Data Analytics with Modern Programmable Networks. In DaMoN.
[4]
M. Burke, S. Dharanipragada, S. Joyner, A. Szekeres, J. Nelson, I. Zhang, and D. R. K. Ports. 2021. PRISM: Rethinking the RDMA Interface for Distributed Systems. In SOSP.
[5]
N. Gebara, A. Lerner, M. Yang, M. Yu, P. Costa, and M. Ghobadi. 2020. Challenging the Stateless Quo of Programmable Switches. In HotNets.
[6]
R. Hussein, A. Lerner, A. Ryser, L. Bürgi, A. Blarer, and P. Cudré-Mauroux. 2023. GraphINC: Graph Pattern Mining at Network Speed. In SIGMOD.
[7]
Matthias Jasny, Lasse Thostrup, and Carsten Binnig. 2023. Zero-Sided RDMA: Network-Driven Data Shuffling. In DaMoN.
[8]
M. Jasny, L. Thostrup, T. Ziegler, and C. Binnig. 2022. P4DB - The Case for In-Network OLTP. In SIGMOD.
[9]
T. Jepsen, D. Alvarez, N. Foster, C. Kim, J. Lee, M. Moshref, and R. Soulé. 2019. Fast String Searching on PISA. In SOSR.
[10]
T. Jepsen, A. Lerner, F. Pedone, R. Soulé, and P. Cudré-Mauroux. 2021. In-network Support for Transaction Triaging. In PVLDB.
[11]
Theo Jepsen, Masoud Moshref, Antonio Carzaniga, Nate Foster, and Robert Soulé. 2018. Life in the Fast Lane: A Line-Rate Linear Road. In SOSR.
[12]
A. Lerner, R. Hussein, and P. Cudré-Mauroux. 2019. The Case For Network Accelerated Query Processing. In CIDR.
[13]
A. Lerner, M. Jasny, T. Jepsen, C. Binnig, and P. Cudré-Mauroux. 2022. DBMS annihilator: a high-performance database workload generator in action. In PVLDB.
[14]
J. Li, E. Michael, N. Kr Sharma, A. Szekeres, and D.R.K. Ports. 2016. Just Say NO to Paxos Overhead: Replacing Consensus with Network Ordering. In OSDI.
[15]
F. Liu, L. Yin, and S. Blanas. 2017. Design and Evaluation of an RDMA-aware Data Shuffling Operator for Parallel Database Systems. In EuroSys.
[16]
D. R. K. Ports and J. Nelson. 2019. When Should The Network Be The Computer?. In HotOS.
[17]
A. Ryser, A. Lerner, A. Forencich, and P. Cudré-Mauroux. 2022. D-RDMA: Bringing Zero-Copy RDMA to Database Systems. In CIDR.
[18]
W. Rödiger, S. Idicula, A. Kemper, and T. Neumann. 2016. Flow-Join: Adaptive skew handling for distributed joins over high-speed networks. In ICDE.
[19]
A. Sapio, M. Canini, C.-Y. Ho, J. Nelson, P. Kalnis, C. Kim, A. Krishnamurthy, M. Moshref, D. R. K. Ports, and P. Richtarik. 2021. Scaling Distributed Machine Learning with In-Network Aggregation. In NSDI.
[20]
A. Singh et al. 2015. Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network. In SIGCOMM.
[21]
A. Singhvi et al. 2020. 1RMA: Re-envisioning Remote Memory Access for Multitenant Datacenters. In SIGCOMM.
[22]
W Richard Stevens and Thomas Narten. 1990. UNIX network programming. ACM SIGCOMM Computer Communication Review 20, 2 (1990), 8--9.
[23]
L. Thostrup, J. Skrzypczak, M. Jasny, T. Ziegler, and C. Binnig. 2021. DFI: The Data Flow Interface for High-Speed Networks. In SIGMOD.
[24]
Y. Yuan et al. 2022. Unlocking the Power of Inline Floating-Point Operations on Programmable Switches. In NSDI.
[25]
E. Zamanian, C. Binnig, T. Kraska, and T. Harris. 2017. The End of a Myth: Distributed Transaction Can Scale. In PVLDB.
[26]
L. Zeno et al. 2022. SwiSh: Distributed Shared State Abstractions for Programmable Switches. In NSDI.
[27]
H. Zhu, Z. Bai, J. Li, E. Michael, D. R. K. Ports, I. Stoica, and X. Jin. 2019. Harmonia: Near-Linear Scalability for Replicated Storage with in-Network Conflict Detection. In PVLDB.
[28]
H. Zhu, T. Wang, Y. Hong, D. R. K. Ports, A. Sivaraman, and X. Jin. 2022. NetVRM: Virtual Register Memory for Programmable Networks. In NSDI.
[29]
T. Ziegler, P. Bernstein, V. Leis, and C. Binnig. 2023. Is Scalable OLTP in the Cloud a Solved Problem?. In CIDR.
[30]
T. Ziegler, D. Mohan, V. Leis, and C. Binnig. 2022. EFA: A Viable Alternative to RDMA over InfiniBand for DBMSs?. In DaMoN.
[31]
Tobias Ziegler, Jacob Nelson-Slivon, Viktor Leis, and Carsten Binnig. 2023. Design Guidelines for Correct, Efficient, and Scalable Synchronization Using One-Sided RDMA. In SIGMOD.

Cited By

View all
  • (2025)Synchronizing Disaggregated Data Structures with One-Sided RDMA: Pitfalls, Experiments and Design GuidelinesACM Transactions on Database Systems10.1145/371637750:1(1-40)Online publication date: 14-Feb-2025
  • (2024)SIMDified Data Processing - Foundations, Abstraction, and Advanced TechniquesCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3654694(613-621)Online publication date: 9-Jun-2024
  • (2024)Demystifying Datapath Accelerator Enhanced Off-path SmartNIC2024 IEEE 32nd International Conference on Network Protocols (ICNP)10.1109/ICNP61940.2024.10858560(1-12)Online publication date: 28-Oct-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 16, Issue 12
August 2023
685 pages
ISSN:2150-8097
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 August 2023
Published in PVLDB Volume 16, Issue 12

Check for updates

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)91
  • Downloads (Last 6 weeks)14
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Synchronizing Disaggregated Data Structures with One-Sided RDMA: Pitfalls, Experiments and Design GuidelinesACM Transactions on Database Systems10.1145/371637750:1(1-40)Online publication date: 14-Feb-2025
  • (2024)SIMDified Data Processing - Foundations, Abstraction, and Advanced TechniquesCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3654694(613-621)Online publication date: 9-Jun-2024
  • (2024)Demystifying Datapath Accelerator Enhanced Off-path SmartNIC2024 IEEE 32nd International Conference on Network Protocols (ICNP)10.1109/ICNP61940.2024.10858560(1-12)Online publication date: 28-Oct-2024
  • (2024)On-The-Fly Data Distribution to Accelerate Query Processing in Heterogeneous Memory SystemsAdvances in Databases and Information Systems10.1007/978-3-031-70626-4_12(170-183)Online publication date: 28-Aug-2024

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media