[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3342195.3387519acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article

StRoM: smart remote memory

Published: 17 April 2020 Publication History

Abstract

Big data applications often incur large costs in I/O, data transfer and copying overhead, especially when operating in cloud environments. Since most such computations are distributed, data processing operations offloaded to the network card (NIC) could potentially reduce the data movement overhead by enabling near-data processing at several points of a distributed system. Following this idea, in this paper we present StRoM, a programmable, FPGA-based RoCE v2 NIC supporting the offloading of application level kernels. These kernels can be used to perform memory access operations directly from the NIC such as traversal of remote data structures as well as filtering or aggregation over RDMA data streams on both the sending or receiving sides. StRoM bypasses the CPU entirely and extends the semantics of RDMA to enable multi-step data access operations and in-network processing of RDMA streams. We demonstrate the versatility and potential of StRoM with four different kernels extending one-sided RDMA commands: 1) Traversal of remote data structures through pointer chasing, 2) Consistent retrieval of remote data blocks, 3) Data shuffling on the NIC by partitioning incoming data to different memory regions or CPU cores, and 4) Cardinality estimation on data streams.

References

[1]
G. Alonso, C. Binnig, I. Pandis, K. Salem, J. Skrzypczak, R. Stutsman, L. Thostrup, T. Wang, Z. Wang, and T. Ziegler. Dpi: The data processing interface for modern networks. In CIDR, 2019.
[2]
Altera. Programming FPGAs with OpenCL. https://www.altera.com/content/dam/altera-www/global/en_US/pdfs/literature/wp/wp-01173-opencl.pdf, 2013.
[3]
C. Balkesen, J. Teubner, G. Alonso, and M. T. Özsu. Main-memory hash joins on multi-core CPUs: Tuning to the underlying hardware. In ICDE, 2013.
[4]
B. W. Barrett, R. Brightwell, R. E. Grant, S. Hemmert, K. Pedretti, K. Wheeler, K. Underwood, R. Riesen, A. B. Maccabe, and T. Hudson. The portals 4.0.2 network programming interface. http://www.cs.sandia.gov/Portals/portals402.pdf, 2017.
[5]
C. Barthels, S. Loesing, G. Alonso, and D. Kossmann. Rack-scale in-memory join processing using RDMA. In SIGMOD, 2015.
[6]
C. Barthels, I. Müller, T. Schneider, G. Alonso, and T. Hoefler. Distributed join algorithms on thousands of cores. PVLDB, 2017.
[7]
C. Binnig, A. Crotty, A. Galakatos, T. Kraska, and E. Zamanian. The end of slow networks: It's time for a redesign. PVLDB, 2016.
[8]
S. Blanas, Y. Li, and J. M. Patel. Design and evaluation of main memory hash join algorithms for multi-core CPUs. In SIGMOD, 2011.
[9]
P. Bosshart, D. Daly, G. Gibb, M. Izzard, N. McKeown, J. Rexford, C. Schlesinger, D. Talayco, A. Vahdat, G. Varghese, and D. Walker. P4: Programming protocol-independent packet processors. SIGCOMM Comput. Commun. Rev., 2014.
[10]
P. Bosshart, G. Gibb, H.-S. Kim, G. Varghese, N. McKeown, M. Izzard, F. Mujica, and M. Horowitz. Forwarding metamorphosis: Fast programmable match-action processing in hardware for sdn. SIGCOMM Comput. Commun. Rev., 2013.
[11]
J. R. Corbin. Rpcgen. In The Art of Distributed Applications, pages 179--206. Springer, 1991.
[12]
H. T. Dang, M. Canini, F. Pedone, and R. Soulé. Paxos made switch-y. SIGCOMM Comput. Commun. Rev., 2016.
[13]
A. Dragojević, D. Narayanan, M. Castro, and O. Hodson. FaRM: Fast remote memory. In NSDI, 2014.
[14]
D. Firestone, A. Putnam, S. Mundkur, D. Chiou, et al. Azure accelerated networking: SmartNICs in the public cloud. In NSDI, 2018.
[15]
P. Flajolet, E. Fusy, O. Gandouet, and F. Meunier. HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm. Discrete Mathematics & Theoretical Computer Science, 2007.
[16]
S. Heule, M. Nunkesser, and A. Hall. Hyperloglog in practice: Algorithmic engineering of a state of the art cardinality estimation algorithm. In EDBT, 2013.
[17]
T. Hoefler, S. D. Girolamo, K. Taranov, R. E. Grant, and R. Brightwell. sPIN: High-performance streaming Processing in the Network. In SC, 2017.
[18]
Z. István, D. Sidler, and G. Alonso. Caribou: Intelligent distributed storage. PVLDB, 2017.
[19]
Z. István, D. Sidler, G. Alonso, and M. Vukolic. Consensus in a box: Inexpensive coordination in hardware. In NSDI, 2016.
[20]
Z. István, L. Woods, and G. Alonso. Histograms as a side effect of data movement for big data. In SIGMOD, 2014.
[21]
X. Jin, X. Li, H. Zhang, N. Foster, J. Lee, R. Soulé, C. Kim, and I. Stoica. Netchain: Scale-free sub-rtt coordination. In NSDI, 2018.
[22]
X. Jin, X. Li, H. Zhang, R. Soulé, J. Lee, N. Foster, C. Kim, and I. Stoica. Netcache: Balancing key-value stores with fast in-network caching. In SOSP, 2017.
[23]
L. Jose, L. Yan, G. Varghese, and N. McKeown. Compiling packet programs to reconfigurable switches. In NSDI, 2015.
[24]
A. Kalia, M. Kaminsky, and D. Andersen. Datacenter rpcs can be general and fast. In NSDI, 2019.
[25]
A. Kalia, M. Kaminsky, and D. G. Andersen. Using RDMA efficiently for key-value services. SIGCOMM Comput. Commun. Rev., 2014.
[26]
A. Kalia, M. Kaminsky, and D. G. Andersen. Fasst: Fast, scalable and simple distributed transactions with two-sided (RDMA) datagram RPCs. In OSDI, 2016.
[27]
K. Kara and G. Alonso. Fast and robust hashing for database operators. In FPL, 2016.
[28]
K. Kara, J. Giceva, and G. Alonso. Fpga-based data partitioning. In SIGMOD, 2017.
[29]
K. Keeton, D. A. Patterson, and J. M. Hellerstein. A case for intelligent disks (idisks). SIGMOD Rec., 1998.
[30]
A. Lerner, R. Hussein, and P. Cudre-Mauroux. The case for network accelerated query processing. In CIDR, 2019.
[31]
B. Li, Z. Ruan, W. Xiao, Y. Lu, Y. Xiong, A. Putnam, E. Chen, and L. Zhang. KV-Direct: High-performance in-memory key-value store with programmable NIC. In SOSP, 2017.
[32]
X. Li, R. Sethi, M. Kaminsky, D. G. Andersen, and M. J. Freedman. Be fast, cheap and in control with switchkv. In NSDI, 2016.
[33]
F. Liu, L. Yin, and S. Blanas. Design and evaluation of an RDMA-aware data shuffling operator for parallel database systems. In EuroSys, 2017.
[34]
M. Liu, T. Cui, H. Schuh, A. Krishnamurthy, S. Peter, and K. Gupta. Offloading Distributed Applications onto SmartNICs Using IPipe. In SIGCOMM, 2019.
[35]
Mellanox. Mellanox asap2 accelerated switching and packet processing. https://www.mellanox.com/related-docs/products/SB_asap2.pdf, 2019.
[36]
C. Mitchell, Y. Geng, and J. Li. Using one-sided RDMA reads to build a fast, CPU-efficient key-value store. In ATC, 2013.
[37]
B. Networks. Tofino: World's fastest p4-programmable ethernet switch asics. https://barefootnetworks.com/products/brief-tofino/, 2019.
[38]
M. Oskin, F. T. Chong, and T. Sherwood. Active pages: a computation model for intelligent memory. In ISCA, 1998.
[39]
P. M. Phothilimthana, M. Liu, A. Kaufmann, S. Peter, R. Bodik, and T. Anderson. Floem: A Programming System for NIC-Accelerated Network Applications. In OSDI, 2018.
[40]
O. Polychroniou and K. A. Ross. A comprehensive study of main-memory partitioning and its application to large-scale comparison- and radix-sort. In SIGMOD, 2014.
[41]
Y. Qi, J. Fong, W. Jiang, B. Xu, J. Li, and V. Prasanna. Multi-dimensional packet classification on FPGA: 100 gbps and beyond. In FPT, 2010.
[42]
E. Riedel, G. A. Gibson, and C. Faloutsos. Active storage for large-scale data mining and multimedia. In VLDB, 1998.
[43]
W. Rödiger, T. Mühlbauer, A. Kemper, and T. Neumann. High-speed query processing over high-speed networks. PVLDB, 2015.
[44]
M. Ruiz, D. Sidler, G. Sutter, G. Alonso, and S. López-Buedo. Limago: An fpga-based open-source 100 gbe TCP/IP stack. In I. Sourdis, C. Bouganis, C. Álvarez, L. Toledo, P. Valero-Lara, and X. Martorell, editors, FPL, 2019.
[45]
D. Sidler, G. Alonso, M. Blott, K. Karras, K. A. Vissers, and R. Carley. Scalable 10gbps TCP/IP stack architecture for reconfigurable hardware. In FCCM, 2015.
[46]
D. Sidler, Z. István, and G. Alonso. Low-latency TCP/IP stack for data center applications. In FPL, 2016.
[47]
H. Song. Protocol-oblivious forwarding: Unleash the power of SDN through a future-proof forwarding plane. In HotSDN, 2013.
[48]
M. Su, M. Zhang, K. Chen, Z. Guo, and Y. Wu. RFP: When RPC is faster than server-bypass with RDMA. In EuroSys, 2017.
[49]
Y. Taleb, R. Stutsman, G. Antoniu, and T. Cortes. Tailwind: Fast and atomic RDMA-based replication. In ATC, 2018.
[50]
J. Teubner and L. Woods. Data Processing on FPGAs. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2013.
[51]
D. Tiwari, S. Boboila, S. Vazhkudai, Y. Kim, X. Ma, P. Desnoyers, and Y. Solihin. Active flash: Towards energy-efficient, in-situ data analytics on extreme-scale machines. In FAST, 2013.
[52]
S.-Y. Tsai and Y. Zhang. LITE kernel RDMA support for datacenter applications. In SOSP, 2017.
[53]
Z. Wang, B. He, and W. Zhang. A study of data partitioning on opencl-based fpgas. In FPL, 2015.
[54]
J. Weerasinghe, F. Abel, C. Hagleitner, and A. Herkersdorf. Enabling FPGAs in hyperscale data centers. In UIC-ATC-ScalCom, 2015.
[55]
L. Woods, Z. István, and G. Alonso. Ibex - an intelligent storage engine with support for advanced SQL off-loading. PVLDB, 2014.
[56]
M. Wu, F. Yang, J. Xue, W. Xiao, Y. Miao, L. Wei, H. Lin, Y. Dai, and L. Zhou. GraM: Scaling graph computation to the trillions. In SoCC, 2015.
[57]
Xilinx. Vivado HLS. http://www.xilinx.com/products/design-tools/vivado/integration/esl-design.html, 2019.
[58]
N. Zilberman, Y. Audzevich, et al. NetFPGA SUME: Toward 100 gbps as research commodity. IEEE Micro, 2014.

Cited By

View all
  • (2024)ORNIC: A High-Performance RDMA NIC with Out-of-Order Packet Direct Write Method for Multipath TransmissionElectronics10.3390/electronics1401008814:1(88)Online publication date: 28-Dec-2024
  • (2024)TianMen: a DPU-based storage network offloading structure for disaggregated datacentersProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698528(689-703)Online publication date: 20-Nov-2024
  • (2024)A Survey of RDMA Distributed StorageProceedings of the 2024 5th International Conference on Computing, Networks and Internet of Things10.1145/3670105.3670199(534-539)Online publication date: 24-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
EuroSys '20: Proceedings of the Fifteenth European Conference on Computer Systems
April 2020
49 pages
ISBN:9781450368827
DOI:10.1145/3342195
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 April 2020

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

EuroSys '20
Sponsor:
EuroSys '20: Fifteenth EuroSys Conference 2020
April 27 - 30, 2020
Heraklion, Greece

Acceptance Rates

EuroSys '20 Paper Acceptance Rate 43 of 234 submissions, 18%;
Overall Acceptance Rate 241 of 1,308 submissions, 18%

Upcoming Conference

EuroSys '25
Twentieth European Conference on Computer Systems
March 30 - April 3, 2025
Rotterdam , Netherlands

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)243
  • Downloads (Last 6 weeks)29
Reflects downloads up to 03 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)ORNIC: A High-Performance RDMA NIC with Out-of-Order Packet Direct Write Method for Multipath TransmissionElectronics10.3390/electronics1401008814:1(88)Online publication date: 28-Dec-2024
  • (2024)TianMen: a DPU-based storage network offloading structure for disaggregated datacentersProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698528(689-703)Online publication date: 20-Nov-2024
  • (2024)A Survey of RDMA Distributed StorageProceedings of the 2024 5th International Conference on Computing, Networks and Internet of Things10.1145/3670105.3670199(534-539)Online publication date: 24-May-2024
  • (2024)A Memory-Disaggregated Radix TreeACM Transactions on Storage10.1145/366428920:3(1-41)Online publication date: 6-Jun-2024
  • (2024)RR-Compound: RDMA-Fused gRPC for Low Latency, High Throughput, and Easy InterfaceIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.340439435:8(1488-1505)Online publication date: Aug-2024
  • (2024)Effortless Locality on Data Systems Using Relational FabricIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338682736:12(7410-7422)Online publication date: Dec-2024
  • (2024)DPU-Direct: Unleashing Remote Accelerators via Enhanced RDMA for Disaggregated DatacentersIEEE Transactions on Computers10.1109/TC.2024.340408973:8(2081-2095)Online publication date: Aug-2024
  • (2024)FlexCross: High-Speed and Flexible Packet Processing via a Crosspoint-Queued Crossbar2024 27th Euromicro Conference on Digital System Design (DSD)10.1109/DSD64264.2024.00022(98-105)Online publication date: 28-Aug-2024
  • (2024)A Lossless-Ethernet-Based Interconnect for FPGA Clusters Toward FTQC2024 IEEE International Conference on Cluster Computing Workshops (CLUSTER Workshops)10.1109/CLUSTERWorkshops61563.2024.00040(164-165)Online publication date: 24-Sep-2024
  • (2024)Front-end RDMA over Converged Ethernet, real-time firmware simulationJournal of Instrumentation10.1088/1748-0221/19/03/C0303819:03(C03038)Online publication date: 19-Mar-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media