More Web Proxy on the site http://driver.im/

article

Toward a transparent and efficient GPU cloudification architecture

Authors:

Juan Gutiérrez-Aguado,

Jose M. Claver,

Raúl Peña-OrtizAuthors Info & Claims

The Journal of Supercomputing, Volume 75, Issue 7

Pages 3640 - 3672

https://doi.org/10.1007/s11227-018-2720-z

Published: 01 July 2019 Publication History

Abstract

The cloud model allows the access to a vast amount of computational resources, alleviating the need for acquisition and maintenance costs on a pay-per-use basis. However, other resources, such as (GPUs), have not been fully adapted to this model. Many areas would benefit from suitable cloud solutions based on GPUs: video encoding, sequencing in bioinformatics, scene rendering in remote gaming, or machine learning. Cloud providers offer local and exclusive access to GPUs by using PCI passthrough. This limitation can be overcome by integrating new virtual GPUs (vGPUs) in cloud infrastructures or by providing mechanisms to cloudify existing GPUs, cloudified GPUs (cGPUs), which do not support native virtualization. The proposed architecture enables an effective and transparent integration of cGPUs in public cloud infrastructures. Our solution offers several access modes (local/remote and exclusive/shared) and configures autonomously its components by integrating with the message middleware of the cloud infrastructure. A prototype of the proposed architecture has been evaluated in a real cloud deployment. Experiments assess overhead in the infrastructure and performance of GPU-based applications by considering three different programs: matrix multiplication, sequencing read alignment, and Monte-Carlo on multiple GPUs. Results show that our solution introduces low impact both on the infrastructure and the performance of applications.

References

[1]

Michael A, Armando F, Rean G, Joseph Anthony D, Randy K, Andy K, Gunho L, David P, Ariel R, Ion S, Matei Z (2010) A view of cloud computing. Commun ACM 53(4):50---58.

Digital Library

[2]

Mastelic T, Oleksiak A, Claussen H, Brandic I, Pierson J-M, Vasilakos AV (2014) Cloud computing: survey on energy efficiency. ACM Comput Surv 47(2):33:1---33:36. ISSN 0360-0300

Digital Library

[3]

Mell P, Grance T (2011) The NIST definition of cloud computing. NIST Pubs (800-154).

Digital Library

[4]

Che S, Li J, Sheaffer JW, Skadron K, Lach J (2008) Accelerating compute-intensive applications with GPUs and FPGAs. In: Symposium on Application Specific Processors, pp 101---107.

Digital Library

[5]

Rodríguez-Sánchez R, Martínez JL, Fernández-Escribano G, Sánchez JL, Claver JM, Diaz P (2012) Optimizing H.264/AVC interprediction on a GPU-based framework. Concurr Comput Pract Exp 24(14):1607---1624.

Digital Library

[6]

Yongchao L, Bertil S, Maskell Douglas L (2012) CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows---Wheeler transform. Bioinformatics 28(14):1830---1837.

Digital Library

[7]

Wei C, Ryan S, Chun-Ying H, Kuan-Ta C, Jiangchuan L, Leung Victor CM, Cheng-Hsin H (2016) A survey on cloud gaming--future of computer games. IEEE Access 4:7605---7620.

[8]

Temam O (2016) Enabling future progress in machine-learning. In: IEEE Symposium on VLSI Circuits, Digest of Technical Papers, pp 1---3.

[9]

Amazon Web Services: EC2. http://aws.amazon.com/ec2. {Cited 2018-05-25}

[10]

Microsoft Azure: GPU optimized virtual machine sizes. https://docs.microsoft.com/en-us/azure/virtual-machines/windows/sizes-gpu/. {Cited 2018-05-25}

[11]

Google Cloud: GPUs on Compute Engine. https://cloud.google.com/compute/docs/gpus/. {Cited 2018-05-25}

[12]

NVIDIA GPU Cloud: GPU-Accelerated Containers. https://www.nvidia.com/en-us/gpu-cloud/. {Cited 2018-05-25}

[13]

Walters JP, Younge AJ, Kang DI, Yao KT, Kang M, Crago SP, Fox G (2014) GPU passthrough performance: a comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL applications. In: IEEE 7th International Conference on Cloud Computing (CLOUD), pp 636---643. IEEE.

Digital Library

[14]

Amazon EC2 Elastic GPUs. https://aws.amazon.com/ec2/elastic-gpus/. {Cited 2018-05-25}

[15]

Hong C-H, Spence I, Nikolopoulos DS (2017) GPU virtualization and scheduling methods--a comprehensive survey. ACM Comput Surv 1(1).

Digital Library

[16]

OpenStack: The Open Source Cloud Operating System. http://www.openstack.org/software/. {Cited 2018-05-25}

[17]

Vogel A, Griebler D, Maron CAF, Schepke C, Fernandes LG (2016) Private IaaS clouds: a comparative analysis of OpenNebula, CloudStack and OpenStack. In: Proceedings of the 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, pp 672---679.

[18]

Chirivella-Perez E, Gutierrez-Aguado J, Claver JM, Alcaraz-Calero JM (2015) Hybrid and extensible architecture for cloud infrastructure deployment. In: 15th IEEE International Conference on Computer and Information Technology, pp 611---617.

[19]

Habib I (2008) Virtualization with KVM. Linux J 2008(166). http://www.linuxjournal.com/article/9764. ISSN 1075-3583

Digital Library

[20]

NVIDIA NVLink Fabric, 2017. https://www.nvidia.com/en-us/data-center/nvlink/. {Cited 2018-05-25}

[21]

NVIDIA. NVIDIA GRID Technology, 2015. http://www.nvidia.com/object/grid-technology.html. {Cited 2018-05-25}

[22]

Song J, Lv Z, Tian K (2014) KVMGT: a full GPU virtualization solution. https://www.linux-kvm.org/images/f/f3/01x08b-KVMGT-a.pdf. {Cited 2018-05-25}

[23]

Intel Graphics Virtualization Technology (Intel GVT), 2017. https://01.org/igvt-g/blogs/wangbo85/2017/intel-gvt-g-kvmgt-public-release-q22017. {Cited 2018-05-25}

[24]

Qi Z, Yao J, Zhang C, Yu M, Yang Z, Guan H (2014) VGRIS: virtualized GPU resource isolation and scheduling in cloud gaming. ACM Trans Archit Code Optim 11(2):17:1---17:25. ISSN 1544-3566

Digital Library

[25]

Liang T-Y, Chang Y-W (2011) GridCUDA: A grid-enabled CUDA programming toolkit. In: 25th IEEE International Conference on Advanced Information Networking and Applications Workshops, pp 141---146.

Digital Library

[26]

Oikawa M, Kawai A, Nomura K, Yasuoka K, Yoshikawa K, Narumi T (Nov 2012) DS-CUDA: a middleware to use many GPUs in the cloud environment. In: High Performance Computing, Networking, Storage and Analysis (SCC), pp 1207---1214.

Digital Library

[27]

Shi L, Chen H, Sun J (May 2009) vCUDA: GPU accelerated high performance computing in virtual machines. In: IEEE International Symposium on Parallel Distributed Processing, pp 1---11.

Digital Library

[28]

Giunta G, Montella R, Agrillo G, Coviello G (2010) A GPGPU transparent virtualization component for high performance computing clouds. In: European Conference on Parallel Processing, pp 379---391. Springer.

Digital Library

[29]

Reaño Crlos, Silla F, Shainer G, Schultz S (2015) Local and remote GPUs perform similar with EDR 100G InfiniBand. In: 16th International Middleware Conference, Middleware Industry'15, pp 4:1---4:7. ACM. ISBN 978-1-4503-3727-4

Digital Library

[30]

Reaño C, Silla F (2016) Reducing the performance gap of remote GPU virtualization with infiniband connect-IB. In: 21st IEEE Symposium on Computers and Communications, ISCC'16, pp 920---925.

[31]

Silla F, Iserte S, Reaño C, Prades J (2017) On the benefits of the remote GPU virtualization mechanism: the rCUDA case. Concurrency and Computation: Practice and Experience, pp e4072---e4089. ISSN 1532-0634

[32]

Hong CH, Spence I, Nikolopoulos DS (Dec 2017b) Fairgv: fair and fast gpu virtualization. IEEE Trans Parallel Distrib Syst 28(12):3472---3485. ISSN 1045-9219

[33]

Pérez F, Reaño C, Silla F (2016) Providing CUDA acceleration to KVM virtual machines in infiniband clusters with rCUDA. In: 16th IFIP International Conference on Distributed Applications and Interoperable Systems, DAIS'16, pp 82---95. Springer. ISBN 978-3-319-39577-7

Digital Library

[34]

Prades J, Reaño C, Silla F (2016) CUDA acceleration for Xen virtual machines in infiniband clusters with rCUDA. In: Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'16, pp 35:1---35:2. ACM, New York, NY, USA. ISBN 978-1-4503-4092-2

Digital Library

[35]

Raffaele M, Giulio G, Giuliano L, Marco L, Carlo P, Carmine F, Valentina P, Cheol-Ho H, Spence Ivor TA, Nikolopoulos Dimitrios S (2017) On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework. Int J Parallel Program 45(5):1142---1163.

Digital Library

[36]

Diab KM, Rafique MM, Hefeeda M (2013) Dynamic sharing of GPUs in cloud systems. In: IEEE Parallel and Distributed Processing Symposium Workshops PhD Forum (IPDPSW), pp 947---954.

Digital Library

[37]

Jun TJ, Dung VQ, Yoo MH, Kim D, Cho H, Hahm J (2014) GPGPU enabled HPC cloud platform based on OpenStack. In: The International Conference for High Performance Computing, Networking, Storage and Analysis. http://hdl.handle.net/10203/211249

[38]

Iserte S, Clemente-Castelló FJ, Castelló A, Mayo R, Quintana-Ortí ES (2016) Enabling GPU virtualization in cloud environments. In: Proceedings of the 6th International Conference on Cloud Computing and Services Science, pp 249---256.

Digital Library

[39]

Popa L, Ratnasamy S, Iannaccone G, Krishnamurthy A, Stoica I (2010) A cost comparison of datacenter network architectures. In: Proceedings of the 6th International Conference, Co-NEXT'10, pp 16:1---16:12. New York, NY, USA. ISBN 978-1-4503-0448-1

Digital Library

[40]

Al-Fares M, Loukissas A, Vahdat A (2008) A scalable, commodity data center network architecture. In: Proceedings of the ACM SIGCOMM 2008 Conference on Data Communication, SIGCOMM'08, pp 63---74. ACM, New York, NY, USA. ISBN 978-1-60558-175-0

Digital Library

[41]

Calero JMA, Aguado JG (2015) MonPaaS: an adaptive Monitoring Platform as a Service for cloud computing infrastructures and services. IEEE Trans Serv Comput 8(1):65---78. ISSN 1939-1374

[42]

Lilja David J (2004) Measuring computer performance. A practitioner's guide. Cambridge University Press, Cambridge

[43]

Peña AJ, Reaño C, Silla F, Mayo R, Quintana-Ortí ES, Duato J (2014) A complete and efficient CUDA-sharing solution for HPC clusters. Parallel Comput 40(10):574---588. ISSN 0167-8191

Digital Library

Cited By

Salcedo-Navarro APeña-Ortiz RClaver JGarcia-Pineda MGutiérrez-Aguado J(2025)Towards GPU-enabled serverless cloud edge platforms for accelerating HEVC video codingCluster Computing10.1007/s10586-024-04692-028:1Online publication date: 1-Feb-2025
https://dl.acm.org/doi/10.1007/s10586-024-04692-0
Suplatov DShegay MSharapova YTimokhin IPopova NVoevodin VŠvedas V(2021)Co-designing HPC-systems by computing capabilities and management flexibility to accommodate bioinformatic workflows at different complexity levelsThe Journal of Supercomputing10.1007/s11227-021-03691-x77:11(12382-12398)Online publication date: 1-Nov-2021
https://dl.acm.org/doi/10.1007/s11227-021-03691-x

Toward a transparent and efficient GPU cloudification architecture

Recommendations

Towards an emerging cloudware paradigm for transparent computing
UCC '16: Proceedings of the 9th International Conference on Utility and Cloud Computing

Transparent computing is an implementation of ubiquitous computing that is aimed at providing active services for users. In transparent computing, the execution (computation) of computer instructions and data is temporally and spatially separated from ...
Supporting GPU sharing in cloud environments with a transparent runtime consolidation framework
HPDC '11: Proceedings of the 20th international symposium on High performance distributed computing

Driven by the emergence of GPUs as a major player in high performance computing and the rapidly growing popularity of cloud environments, GPU instances are now being offered by cloud providers. The use of GPUs in a cloud environment, however, is still ...
Cloud architecture: a preliminary look
iiWAS '11: Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services

Cloud computing has started taking root. Many vendors provide Infrastructure as a Service (IaaS), Software as a Service (SaaS), and Platform as a Service (PaaS). SaaS and PaaS are provided on top of an IaaS infrastructure. Different vendors have ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image The Journal of Supercomputing

The Journal of Supercomputing Volume 75, Issue 7

July 2019

628 pages

ISSN:0920-8542

Issue’s Table of Contents

Copyright © Copyright © 2019 Springer Science+Business Media, LLC, part of Springer Nature.

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 01 July 2019

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 13 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Salcedo-Navarro APeña-Ortiz RClaver JGarcia-Pineda MGutiérrez-Aguado J(2025)Towards GPU-enabled serverless cloud edge platforms for accelerating HEVC video codingCluster Computing10.1007/s10586-024-04692-028:1Online publication date: 1-Feb-2025
https://dl.acm.org/doi/10.1007/s10586-024-04692-0
Suplatov DShegay MSharapova YTimokhin IPopova NVoevodin VŠvedas V(2021)Co-designing HPC-systems by computing capabilities and management flexibility to accommodate bioinformatic workflows at different complexity levelsThe Journal of Supercomputing10.1007/s11227-021-03691-x77:11(12382-12398)Online publication date: 1-Nov-2021
https://dl.acm.org/doi/10.1007/s11227-021-03691-x

View Options

View options

Media

Figures

Other

Tables

View Issue’s Table of Contents