[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2658260.2658265acmconferencesArticle/Chapter ViewAbstractPublication PagesancsConference Proceedingsconference-collections
research-article

Efficient software packet processing on heterogeneous and asymmetric hardware architectures

Published: 20 October 2014 Publication History

Abstract

Heterogeneous and asymmetric computing systems are composed by a set of different processing units, each with its own unique performance and energy characteristics. Still, the majority of current network packet processing frameworks targets only a single device (the CPU or some accelerator), leaving other processing resources idle. In this paper, we propose an adaptive scheduling approach that supports heterogeneous and asymmetric hardware, tailored for network packet processing applications. Our scheduler is able to respond quickly to dynamic performance fluctuations that occur at real-time, such as traffic bursts, application overloads and system changes. The experimental results show that our system is able to match the peak throughput of a diverse set of packet processing workloads, while consuming up to 3.5x less energy.

References

[1]
1018 2 - PhidgetInterfaceKit 8/8/8. http://www.phidgets.com/.
[2]
1122 0 - 30 Amp Current Sensor AC/DC. http://www.phidgets.com/.
[3]
OpenCL. http://www.khronos.org/opencl/.
[4]
OpenSSL Project. http://www.openssl.org/.
[5]
The Snort IDS/IPS. http://www.snort.org/.
[6]
Intel 82599 10 GbE Controller Datasheet, Revision 2.0, 2009.
[7]
Intel HD Graphics DirectX Developer's Guide, 2010.
[8]
Intel SDK for OpenCL Applications 2013: Optimization Guide, 2013.
[9]
B. Aggarwal, A. Akella, A. Anand, A. Balachandran, P. Chitnis, C. Muthukrishnan, R. Ramjee, and G. Varghese. EndRE: an end-system redundancy elimination service for enterprises. In NSDI, 2010.
[10]
A. V. Aho and M. J. Corasick. E_cient string matching: an aid to bibliographic search. Communications of the ACM, 18(6):333--340, 1975.
[11]
A. Anand, A. Gupta, A. Akella, S. Seshan, and S. Shenker. Packet caches on routers: the implications of universal redundant tra_c elimination. In SIGCOMM, 2008.
[12]
T. Benson, A. Anand, A. Akella, and M. Zhang. Understanding Data Center Tra_c Characteristics. SIGCOMM CCR, 40(1), 2010.
[13]
M. Boyer, K. Skadron, S. Che, and N. Jayasena. Load Balancing in a Changing World: Dealing with Heterogeneity and Performance Variability. In ACM Computing Frontiers, 2013.
[14]
S. A. Crosby and D. S. Wallach. Denial of service via algorithmic complexity attacks. In USENIX Security, 2003.
[15]
G. F. Diamos and S. Yalamanchili. Harmony: An Execution Model and Runtime for Heterogeneous Many Core Systems. In HPDC, 2008.
[16]
M. Dobrescu, K. Argyraki, and S. Ratnasamy. Toward Predictable Performance in Software Packet-Processing Platforms. In NSDI, 2012.
[17]
M. Dobrescu, N. Egi, K. Argyraki, B.-G. Chun, K. Fall, G. Iannaccone, A. Knies, M. Manesh, and S. Ratnasamy. RouteBricks: Exploiting Parallelism to Scale Software Routers. In SOSP, 2009.
[18]
A. Ghodsi, V. Sekar, M. Zaharia, and I. Stoica. Multi-Resource Fair Queueing for Packet Processing. In SIGCOMM, 2012.
[19]
S. Han, K. Jang, K. Park, and S. Moon. PacketShader: a GPU-accelerated software router. In SIGCOMM, 2010.
[20]
O. Harrison and J. Waldron. Practical Symmetric Key Cryptography on Modern Graphics Hardware. In USENIX Security, 2008.
[21]
S. Hong and H. Kim. An integrated gpu power and performance model. In SIGARCH, 2010.
[22]
M. Jamshed, J. Lee, S. Moon, I. Yun, D. Kim, S. Lee, Y. Yi, and K. Park. Kargus: a Highly-scalable Software-based Intrusion Detection System. In CCS, 2012.
[23]
K. Jang, S. Han, S. Han, S. Moon, and K. Park. SSLShader: Cheap SSL Acceleration with Commodity Processors. In NSDI, 2011.
[24]
J. Kim, H. Kim, J. H. Lee, and J. Lee. Achieving a single compute device image in OpenCL for multiple GPUs. In PPoPP, 2011.
[25]
P. Kulkarni, F. Douglis, J. LaVoie, and J. M. Tracey. Redundancy elimination within large collections of _les. In USENIX ATC, 2004.
[26]
M. D. Linderman, J. D. Collins, H. Wang, and T. H. Meng. Merge: A Programming Model for Heterogeneous Multi-core Systems. In ASPLOS, 2008.
[27]
C.-K. Luk, S. Hong, and H. Kim. Qilin: Exploiting Parallelism on Heterogeneous Multiprocessors with Adaptive Mapping. In MICRO, 2009.
[28]
G. Maier, A. Feldmann, V. Paxson, and M. Allman. On dominant characteristics of residential broadband internet tra_c. In IMC, 2009.
[29]
L. Niccolini, G. Iannaccone, S. Ratnasamy, J. Chandrashekar, and L. Rizzo. Building a Power-Proportional Software Router. In USENIX ATC, 2012.
[30]
NVIDIA. CUDA C Programming Guide, Version 5.0, 2012.
[31]
R. Rivest. The MD5 message-digest algorithm. 1992.
[32]
L. Rizzo. netmap: A Novel Framework for Fast Packet I/O. In USENIX ATC, 2012.
[33]
J. Shen, J. Fang, H. Sips, and A. L. Varbanescu. Performance Traps in OpenCL for CPUs. In PDP, 2013.
[34]
R. Smith, N. Goyal, J. Ormont, K. Sankaralingam, and C. Estan. Evaluating GPUs for Network Packet Signature Matching. In ISPASS, 2009.
[35]
E. Sun, D. Schaa, R. Bagley, N. Rubin, and D. Kaeli. Enabling Task-Level Scheduling on Heterogeneous Platforms. In GPGPU, 2012.
[36]
W. Sun and R. Ricci. Fast and Flexible: Parallel Packet Processing with GPUs and Click. In ANCS, 2013.
[37]
G. Vasiliadis, S. Antonatos, M. Polychronakis, E. P. Markatos, and S. Ioannidis. Gnort: High Performance Network Intrusion Detection Using Graphics Processors. In RAID, 2008.
[38]
G. Vasiliadis, L. Koromilas, M. Polychronakis, and S. Ioannidis. GASPP: A GPU-Accelerated Stateful Packet Processing Framework. In USENIX ATC, 2014.
[39]
G. Vasiliadis, M. Polychronakis, S. Antonatos, E. P. Markatos, and S. Ioannidis. Regular Expression Matching on Graphics Hardware for Intrusion Detection. In RAID, 2009.
[40]
G. Vasiliadis, M. Polychronakis, and S. Ioannidis. MIDeA: a multi-parallel intrusion detection architecture. In CCS, 2011.
[41]
G. Wang and X. Ren. Power-e_cient work distribution method for cpu-gpu heterogeneous system. In ISPA, 2010.

Cited By

View all
  • (2022)The Best of Many Worlds: Scheduling Machine Learning Inference on CPU-GPU Integrated Architectures2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW55747.2022.00017(55-64)Online publication date: May-2022
  • (2022)Scheduling of multiple network packet processing applications using PythiaComputer Networks10.1016/j.comnet.2022.109006212(109006)Online publication date: Jul-2022
  • (2019)FlowShader: a Generalized Framework for GPU-accelerated VNF Flow Processing2019 IEEE 27th International Conference on Network Protocols (ICNP)10.1109/ICNP.2019.8888129(1-12)Online publication date: Oct-2019
  • Show More Cited By

Index Terms

  1. Efficient software packet processing on heterogeneous and asymmetric hardware architectures

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ANCS '14: Proceedings of the tenth ACM/IEEE symposium on Architectures for networking and communications systems
      October 2014
      274 pages
      ISBN:9781450328395
      DOI:10.1145/2658260
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 20 October 2014

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. heterogeneous processing
      2. opencl
      3. packet processing
      4. packet scheduling

      Qualifiers

      • Research-article

      Conference

      ANCS '14

      Acceptance Rates

      ANCS '14 Paper Acceptance Rate 19 of 57 submissions, 33%;
      Overall Acceptance Rate 88 of 314 submissions, 28%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)5
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 15 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)The Best of Many Worlds: Scheduling Machine Learning Inference on CPU-GPU Integrated Architectures2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW55747.2022.00017(55-64)Online publication date: May-2022
      • (2022)Scheduling of multiple network packet processing applications using PythiaComputer Networks10.1016/j.comnet.2022.109006212(109006)Online publication date: Jul-2022
      • (2019)FlowShader: a Generalized Framework for GPU-accelerated VNF Flow Processing2019 IEEE 27th International Conference on Network Protocols (ICNP)10.1109/ICNP.2019.8888129(1-12)Online publication date: Oct-2019
      • (2018)Recent Advancements in Event ProcessingACM Computing Surveys10.1145/317043251:2(1-36)Online publication date: 13-Feb-2018
      • (2018)Enabling Efficient Network Service Function Chain Deployment on Heterogeneous Server Platform2018 IEEE International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA.2018.00013(27-39)Online publication date: Feb-2018
      • (2017)Efficient Software Packet Processing on Heterogeneous and Asymmetric Hardware ArchitecturesIEEE/ACM Transactions on Networking10.1109/TNET.2016.264233825:3(1593-1606)Online publication date: 1-Jun-2017
      • (2017)Design and Implementation of a Stateful Network Packet Processing Framework for GPUsIEEE/ACM Transactions on Networking10.1109/TNET.2016.259716325:1(610-623)Online publication date: 1-Feb-2017
      • (2017)System architecture for deep packet inspection in high-speed networks2017 Siberian Symposium on Data Science and Engineering (SSDSE)10.1109/SSDSE.2017.8071958(27-32)Online publication date: Apr-2017
      • (2017)Traffic Analysis with Off-the-Shelf HardwareIEEE Communications Magazine10.1109/MCOM.2017.1600756CM55:3(163-169)Online publication date: 1-Mar-2017
      • (2017)No Sugar but All the Taste! Memory Encryption Without Architectural SupportComputer Security – ESORICS 201710.1007/978-3-319-66399-9_20(362-380)Online publication date: 12-Aug-2017
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media