[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3542929.3563499acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article
Open access

CoSpot: a cooperative VM allocation framework for increased revenue from spot instances

Published: 07 November 2022 Publication History

Abstract

Most large cloud operators offer a lower-priced, lower-priority alternative to regular (on-demand or reserved) virtual machines, commonly referred to as spot instances. Spot instances are opportunistically allocated to servers in order to utilize any residual cloud capacity, but are evicted whenever regular virtual machines need to use that capacity. This paper proposes CoSpot, a lightweight framework for cooperative allocation of regular virtual machines and spot instances, which allows for easy integration of arbitrary virtual machine and spot allocators. In our experiments, employing the framework achieves up to 245% improvement (average 34% improvement) in spot revenue, with no loss in virtual machine revenue, compared to the baseline VM and spot allocation without using our framework. We also derive and release a reusable workload with both virtual machines and spot instances, based on data previously shared by Microsoft Azure.

References

[1]
Orna Agmon Ben-Yehuda, Muli Ben-Yehuda, Assaf Schuster, and Dan Tsafrir. 2013. Deconstructing Amazon EC2 Spot Instance Pricing. ACM Trans. Econ. Comput. 1, 3, Article 16 (2013), 20 pages.
[2]
Sarah Alkharif, Kyungyong Lee, and Hyeokman Kim. 2018. Time-Series Analysis for Price Prediction of Opportunistic Cloud Computing Resources. In 7th Intl Conf on Emerging Databases. 221--229.
[3]
Amazon Web Services 2022. Amazon Compute Service Level Agreement. https://aws.amazon.com/compute/sla/. Accessed: 2022-09-23.
[4]
Amazon Web Services 2022. Billing for interrupted Spot Instances --- Amazon Elastic Compute Cloud. https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/billing-for-interrupted-spot-instances.html. Accessed: 2022-09-23.
[5]
Pradeep Ambati, Inigo Goiri, Felipe Frujeri, Alper Gun, Ke Wang, Brian Dolan, Brian Corell, Sekhar Pasupuleti, Thomas Moscibroda, Sameh Elnikety, Marcus Fontoura, and Ricardo Bianchini. 2020. Providing SLOs for Resource-Harvesting VMs in Cloud Platforms. In 14th USENIX Symp on Operating Systems Design and Implementation (OSDI 20). 735--751.
[6]
Shane Bergsma, Timothy Zeyl, Arik Senderovich, and J. Christopher Beck. 2021. Generating Complex, Realistic Cloud Workloads Using Recurrent Neural Networks. In Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles (SOSP '21). Association for Computing Machinery, 376--391.
[7]
Eric Boutin, Jaliya Ekanayake, Wei Lin, Bing Shi, Jingren Zhou, Zhengping Qian, Ming Wu, and Lidong Zhou. 2014. Apollo: Scalable and coordinated scheduling for cloud-scale computing. In 11th USENIX Symp on Operating Systems Design and Implementation (OSDI 14). 285--300.
[8]
Marcus Carvalho, Walfredo Cirne, Franciso Brasileiro, and John Wilkes. 2014. Long-term SLOs for reclaimed cloud computing resources. In Proceedings of the Fifth ACM Symposium on Cloud Computing (SoCC '14).
[9]
Mohan Baruwal Chhetri, Markus Lumpe, Quoc Bao Vo, and Ryszard Kowalczyk. 2017. On Estimating Bids for Amazon EC2 Spot Instances Using Time Series Forecasting. In 2017 IEEE Intl Conf on Services Computing (SCC). 44--51.
[10]
Eli Cortez, Anand Bonde, Alexandre Muzio, Mark Russinovich, Marcus Fontoura, and Ricardo Bianchini. 2017. Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms. In 26th Symp on Operating Systems Principles (SOSP '17). 153--167.
[11]
Carlo Curino, Djellel E Difallah, Chris Douglas, Subru Krishnan, Raghu Ramakrishnan, and Sriram Rao. 2014. Reservation-based scheduling: If you're late don't blame us!. In Proceedings of the Fifth ACM Symposium on Cloud Computing. 1--14.
[12]
Wesam Dawoud, Ibrahim Takouna, and Christoph Meinel. 2012. Increasing Spot Instances Reliability Using Dynamic Scalability. In 2012 IEEE Fifth Intl Conf on Cloud Computing. 959--961.
[13]
Pamela Delgado, Diego Didona, Florin Dinu, and Willy Zwaenepoel. 2016. Job-aware Scheduling in Eagle: Divide and Stick to Your Probes. In 7th Symp on Cloud Computing (SoCC '16).
[14]
Pamela Delgado, Diego Didona, Anne-Marie Kermarrec, and Willy Zwaenepoel. 2015. Hawk: Hybrid Datacenter Scheduling. In USENIX Annual Technical Conference (ATC 15).
[15]
Christina Delimitrou and Christos Kozyrakis. 2013. Paragon: QoS-Aware Scheduling for Heterogeneous Datacenters. In Proceedings of the Eighteenth International Conference on Architectural Support for Programming Languages and Operating Systems (Houston, Texas, USA) (ASPLOS '13). Association for Computing Machinery, New York, NY, USA, 77âĂŞ88.
[16]
Christina Delimitrou and Christos Kozyrakis. 2014. Quasar: Resource-Efficient and QoS-Aware Cluster Management. In Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems (Salt Lake City, Utah, USA) (ASPLOS '14). Association for Computing Machinery, New York, NY, USA, 127âĂş144.
[17]
Shridhar G. Domanal and G. Ram Mohana Reddy. 2018. An efficient cost optimized scheduling for spot instances in heterogeneous cloud environment. Future Generation Computer Systems 84 (2018), 11--21.
[18]
Panagiotis Garefalakis, Konstantinos Karanasos, Peter Pietzuch, Arun Suresh, and Sriram Rao. 2018. Medea: Scheduling of Long Running Applications in Shared Production Clusters. In 13th European Conf on Computer Systems (EuroSys'18).
[19]
Gareth George, Rich Wolski, Chandra Krintz, and John Brevik. 2019. Analyzing AWS Spot Instance Pricing. In 2019 IEEE Intl Conf on Cloud Engineering (IC2E). 222--228.
[20]
Ionel Gog, Malte Schwarzkopf, Adam Gleave, Robert N. M. Watson, and Steven Hand. 2016. Firmament: Fast, Centralized Cluster Scheduling at Scale. In 12th USENIX Symp on Operating Systems Design and Implementation (OSDI 16).
[21]
Bhavesh N. Gohil, Sachin Gamit, and Dhiren R. Patel. 2021. Fair Fit---A Load Balance Aware VM Placement Algorithm in Cloud Data Centers. In Advances in Communication and Computational Technology. Springer, 437--451.
[22]
Robert Grandl, Ganesh Ananthanarayanan, Srikanth Kandula, Sriram Rao, and Aditya Akella. 2014. Multi-Resource Packing for Cluster Schedulers. In 2014 ACM Conf on SIGCOMM (SIGCOMM '14). 455--466.
[23]
Weichao Guo, Kang Chen, Yongwei Wu, and Weimin Zheng. 2015. Bidding for Highly Available Services with Low Price in Spot Instance Market. In 24th Intl Symp on High-Performance Parallel and Distributed Computing (HPDC '15). 191--202.
[24]
Gurobi Optimization, LLC. 2022. Gurobi Optimizer Reference Manual. https://www.gurobi.com
[25]
Ori Hadary, Luke Marshall, Ishai Menache, Abhisek Pan, Esaias E Greeff, David Dion, Star Dorminey, Shailesh Joshi, Yang Chen, Mark Russinovich, and Thomas Moscibroda. 2020. Protean: VM Allocation Service at Scale. In 14th USENIX Symp on Operating Systems Design and Implementation (OSDI 20). 845--861.
[26]
Jaeung Han, Seungheun Jeon, Young-ri Choi, and Jaehyuk Huh. 2016. Interference Management for Distributed Parallel Applications in Consolidated Clusters. In Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems (Atlanta, Georgia, USA) (ASPLOS '16). Association for Computing Machinery, New York, NY, USA, 443âĂŞS456.
[27]
Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy Katz, Scott Shenker, and Ion Stoica. 2011. Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center. In 8th USENIX Symp on Networked Systems Design and Implementation (NSDI 11).
[28]
Michael Isard, Vijayan Prabhakaran, Jon Currey, Udi Wieder, Kunal Talwar, and Andrew Goldberg. 2019. Quincy: Fair Scheduling for Distrubuted Computing Clusters. In 27th ACM Symp on Operating System Principles (SOSP'19).
[29]
Bahman Javadi, Ruppa K. Thulasiramy, and Rajkumar Buyya. 2011. Statistical Modeling of Spot Instance Prices in Public Cloud Environments. In 2011 Fourth IEEE Intl Conf on Utility and Cloud Computing. 219--228.
[30]
Qin Jia, Zhiming Shen, Weijia Song, Robbert van Renesse, and Hakim Weatherspoon. 2016. Smart Spot Instances for the Supercloud. In 3rd Workshop on CrossCloud Infrastructures & Platforms (CrossCloud '16). Article 5, 6 pages.
[31]
Tatiana Jin, Zhenkun Cai, Boyang Li, Chengguang Zheng, Guanxian Jiang, and James Cheng. 2020. Improving Resource Utilization by Timely Fine-Grained Scheduling. In Proceedings of the Fifteenth European Conference on Computer Systems (Heraklion, Greece) (EuroSys '20). Association for Computing Machinery, New York, NY, USA, Article 20, 16 pages.
[32]
Daeyong Jung, JongBeom Lim, and Heonchang Yu. 2014. Estimated Interval-Based Checkpointing (EIC) on Spot Instances in Cloud Computing. J of Applied Mathematics (5 2014).
[33]
JCS Kadupitiya, Vikram Jadhao, and Prateek Sharma. 2020. Modeling The Temporally Constrained Preemptions of Transient Cloud VMs. In 29th Intl Symp on High-Performance Parallel and Distributed Computing (HPDC '20). 41--52.
[34]
BogumiåĆ KamiåĎski and PrzemysåĆaw Szufel. 2015. On optimization of simulation execution on Amazon EC2 spot market. Simulation Modelling Practice and Theory 58 (2015), 172--187. Special issue on Cloud Simulation.
[35]
Konstantinos Karanasos, Sriram Rao, Carlo Curino, Chris Douglas, Kishore Chaliparambil, Giovanni Matteo Fumarola, Solom Heddaya, Raghu Ramakrishnan, and SarveshSakalanaga. 2015. Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters. In USENIX Annual Technical Conference (ATC 15).
[36]
Xiaodi Ke, Cong Guo, Siqi Ji, Shane Bergsma, Zhenhua Hu, and Lei Guo. 2021. Fundy: A Scalable and Extensible Resource Manager for Cloud Resources. In IEEE Intl Conf on Cloud Computing.
[37]
Sunirmal Khatua and Nandini Mukherjee. 2013. Application-Centric Resource Provisioning for Amazon EC2 Spot Instances. In Euro-Par 2013 Parallel Processing. Springer Berlin Heidelberg, 267--278.
[38]
Mikhail Khodak, Liang Zheng, Andrew S. Lan, Carlee Joe-Wong, and Mung Chiang. 2018. Learning Cloud Dynamics to Optimize Spot Instance Bidding Strategies. In IEEE INFOCOM 2018 - IEEE Conf on Computer Communications. 2762--2770.
[39]
Alvaro López García, Enol Fernández del Castillo, and Isabel Campos Plasencia. 2019. An efficient cloud scheduler design supporting preemptible instances. Future Generation Computer Systems 95 (2019), 68--78.
[40]
Markus Lumpe, Mohan Baruwal Chhetri, Quoc Bao Vo, and Ryszard Kowalcyk. 2017. On Estimating Minimum Bids for Amazon EC2 Spot Instances. In 2017 17th IEEE/ACM Intl Symp on Cluster, Cloud and Grid Computing (CCGRID). 391--400.
[41]
Jason Mars and Lingjia Tang. 2013. Whare-Map: Heterogeneity in "Homogeneous" Warehouse-Scale Computers. In Proceedings of the 40th Annual International Symposium on Computer Architecture (Tel-Aviv, Israel) (ISCA '13). Association for Computing Machinery, New York, NY, USA, 619âĂŞ630.
[42]
Jason Mars, Lingjia Tang, Robert Hundt, Kevin Skadron, and Mary Lou Soffa. 2011. Bubble-Up: Increasing Utilization in Modern Warehouse Scale Computers via Sensible Co-Locations. In Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture (Porto Alegre, Brazil) (MICRO-44). Association for Computing Machinery, New York, NY, USA, 248âĂŞ259.
[43]
Michele Mazzucco and Marlon Dumas. 2011. Achieving Performance and Availability Guarantees with Spot Instances. In 2011 IEEE Intl Conf on High Performance Computing and Communications. 296--303.
[44]
Ishai Menache, Ohad Shamir, and Navendu Jain. 2014. On-demand, Spot, or Both: Dynamic Resource Allocation for Executing Batch Jobs in the Cloud. In 11th Intl Conf on Autonomic Computing (ICAC 14). 177--187.
[45]
Phillip Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I. Jordan, and Ion Stoica. 2018. Ray: A Distributed Framework for Emerging AI Applications. In 13th USENIX Symp on Operating Systems Design and Implementation (OSDI 18).
[46]
Kay Ousterhout, Christopher Canel, Sylvia Ratnasamy, and Scott Shenker. 2017. Monotasks: Architecting for Performance Clarity in Data Analytics Frameworks. In Proceedings of the 26th Symposium on Operating Systems Principles (Shanghai, China) (SOSP '17). Association for Computing Machinery, New York, NY, USA, 184âĂŞ200.
[47]
Kay Ousterhout, Patrick Wendell, Matei Zaharia, and Ion Stoica. 2013. Sparrow: Distributed, Low Latency Scheduling. In 24th ACM Symp on Operating System Principles (SOSP'13).
[48]
Rina Panigrahy, Kunal Talwar, Lincoln Uyeda, and Udi Wieder. 2011. Heuristics for Vector Bin Packing. https://www.microsoft.com/en-us/research/wp-content/uploads/2011/01/VBPackingESA11.pdf.
[49]
Jeff Rasley, Konstantinos Karanasos, Srikanth Kandula, Rodrigo Fonseca, Milan Vojnovic, and Sriram Rao. 2016. Efficient Queue Management for Cluster Scheduling. In 11th European Conf on Computer Systems (EuroSys'16).
[50]
Malte Schwarzkopf, AndyKonwinski, Michael Abd-El-Malek, and John Wilkes. 2013. Omega: flexible, scalable schedulers for large compute clusters. In SIGOPS European Conf on Computer Systems (EuroSys). 351--364.
[51]
Omar Sefraoui, Mohammed Aissaoui, and M. Eleuldj. 2012. OpenStack: Toward an Open-source Solution for Cloud Computing. Intl J of Computer Applications 55 (2012), 38--42.
[52]
Prateek Sharma, David Irwin, and Prashant Shenoy. 2016. How Not to Bid the Cloud. In 8th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 16).
[53]
Supreeth Shastri and David Irwin. 2017. HotSpot: Automated Server Hopping in Cloud Spot Markets. In 2017 Symp on Cloud Computing (SoCC '17). 493--505.
[54]
Supreeth Shastri, Amr Rizk, and David Irwin. 2016. Transient Guarantees: Maximizing the Value of Idle Cloud Capacity. In SC '16: Intl Conf for High Performance Computing, Networking, Storage and Analysis. 992--1002.
[55]
Jiyuan Shi, Fang Dong, Jinghui Zhang, Junzhou Luo, and Ding Ding. 2015. Two-Phase Online Virtual Machine Placement in Heterogeneous Cloud Data Center. In 2015 IEEE Intl Conf on Systems, Man, and Cybernetics. IEEE Press, 1369--1374.
[56]
Vivek Kumar Singh and Kaushik Dutta. 2015. Dynamic Price Prediction for Amazon Spot Instances. In 2015 48th Hawaii Intl Conf on System Sciences. 1513--1520.
[57]
Yang Song, Murtaza Zafer, and Kang-Won Lee. 2012. Optimal bidding in spot instance market. In 2012 IEEE INFOCOM. 190--198.
[58]
Supreeth Subramanya, Tian Guo, Prateek Sharma, David Irwin, and Prashant Shenoy. 2015. SpotOn: A Batch Computing Service for the Spot Market. In Proceedings of the Sixth ACM Symposium on Cloud Computing (SoCC '15). 329--341.
[59]
Chunqiang Tang, Kenny Yu, Kaushik Veeraraghavan, Jonathan Kaldor, Scott Michelson, Thawan Kooburat, Aravind Anbudurai, Matthew Clark, Kabir Gogia, Long Cheng, Ben Christensen, Alex Gartrell, Maxim Khutornenko, Sachin Kulkarni, Marcin Pawlowski, Tuomas Pelkonen, Andre Rodrigues, Rounak Tibrewal, Vaishnavi Venkatesan, and Peter Zhang. 2020. Twine: A Unified Cluster Management System for Shared Infrastructure. In 14th USENIX Symp on Operating Systems Design and Implementation (OSDI 20). 787--803.
[60]
ShaoJie Tang, Jing Yuan, and Xiang-Yang Li. 2012. Towards Optimal Bidding Strategy for Amazon EC2 Cloud Spot Instance. In 2012 IEEE Fifth Intl Conf on Cloud Computing. 91--98.
[61]
Muhammad Tirmazi, Adam Barker, Nan Deng, Md Ehtesam Haque, Zhijing Gene Qin, Steven Hand, Mor Harchol-Balter, and John Wilkes. 2020. Borg: the Next Generation. In EuroSys'20.
[62]
Vinod Kumar Vavilapalli, Arun C Murthy, Chris Douglas, Sharad Agarwal, Mahadev Konar, Robert Evans, Thomas Graves, Jason Lowe, Hitesh Shah, Siddharth Seth, et al. 2013. Apache hadoop yarn: Yet another resource negotiator. In 4th annual Symp on Cloud Computing. 1--16.
[63]
Abhishek Verma, Luis Pedrosa, Madhukar R. Korupolu, David Oppenheimer, Eric Tune, and John Wilkes. 2015. Large-scale cluster management at Google with Borg. In European Conf on Computer Systems (EuroSys).
[64]
William Voorsluys, Saurabh Kumar Garg, and Rajkumar Buyya. 2011. Provisioning Spot Market Cloud Resources to Create Cost-Effective Virtual Clusters. In Algorithms and Architectures for Parallel Processing. Springer, 395--408.
[65]
Rich Wolski, John Brevik, Ryan Chard, and Kyle Chard. 2017. Probabilistic Guarantees of Execution Duration for Amazon Spot Instances. In Intl Conf for High Performance Computing, Networking, Storage and Analysis (SC '17). Article 18,11 pages.
[66]
Xiaohu Wu, Francesco De Pellegrini, Guanyu Gao, and Giuliano Casale. 2019. A Framework for Allocating Server Time to Spot and On-Demand Services in Cloud Computing. ACM Trans. Model. Perform. Eval. Comput. Syst. 4, 4, Article 20 (2019), 31 pages.
[67]
Zichen Xu, Christopher Stewart, Nan Deng, and Xiaorui Wang. 2016. Blending on-demand and spot instances to lower costs for in-memory storage. In IEEE INFOCOM 2016 - The 35th Annual IEEE Intl Conf on Computer Communications. 1--9.
[68]
Hailong Yang, Alex Breslow, Jason Mars, and Lingjia Tang. 2013. Bubble-Flux: Precise Online QoS Management for Increased Utilization in Warehouse Scale Computers. In Proceedings of the 40th Annual International Symposium on Computer Architecture (Tel-Aviv, Israel) (ISCA '13). Association for Computing Machinery, New York, NY, USA, 607âĂŞ618.
[69]
Sangho Yi, Artur Andrzejak, and Derrick Kondo. 2012. Monetary Cost-Aware Checkpointing and Migration on Amazon Cloud Spot Instances. IEEE Trans. on Services Computing 5, 4 (2012), 512--524.
[70]
Murtaza Zafer, Yang Song, and Kang-Won Lee. 2012. Optimal Bids for Spot VMs in a Cloud for Deadline Constrained Jobs. In 2012 IEEE Fifth Intl Conf on Cloud Computing. 75--82.
[71]
Matei Zaharia, Dhruba Borthakur, Joydeep Sen Sarma, Khaled Elmeleegy, Scott Shenker, and Ion Stoica. 2010. Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling. In 5th European Conf on Computer Systems (EuroSys'10).
[72]
Qi Zhang, Quanyan Zhu, and Raouf Boutaba. 2011. Dynamic Resource Allocation for Spot Markets in Cloud Computing Environments. In 2011 Fourth IEEE Intl Conf on Utility and Cloud Computing. 178--185.
[73]
Yunqi Zhang, George Prekas, Giovanni Matteo Fumarola, Marcus Fontoura, Inigo Goiri, and Ricardo Bianchini. 2016. History-Based Harvesting of Spare Cycles and Storage in Large-Scale Datacenters. In Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation (Savannah, GA, USA) (OSDI'16). USENIX Association, USA, 755ÂĂŞ770.
[74]
Han Zhao, Miao Pan, Xinxin Liu, Xiaolin Li, and Yuguang Fang. 2012. Optimal Resource Rental Planning for Elastic Applications in Cloud Market. In 2012 IEEE 26th Intl Parallel and Distributed Processing Symp. 808--819.
[75]
Liang Zheng, Carlee Joe-Wong, Chee Wei Tan, Mung Chiang, and Xinyu Wang. 2015. How to Bid the Cloud. In 2015 ACM Conf on Special Interest Group on Data Communication (SIGCOMM '15). 71--84.

Cited By

View all
  • (2024)Can't be lateProceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation10.5555/3691825.3691836(185-203)Online publication date: 16-Apr-2024
  • (2024)Towards SLO-Compliant and Cost-Effective Serverless Computing on Emerging GPU ArchitecturesProceedings of the 25th International Middleware Conference10.1145/3652892.3700760(211-224)Online publication date: 2-Dec-2024
  • (2023)spotDNN: Provisioning Spot Instances for Predictable Distributed DNN Training in the Cloud2023 IEEE/ACM 31st International Symposium on Quality of Service (IWQoS)10.1109/IWQoS57198.2023.10188717(1-10)Online publication date: 19-Jun-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SoCC '22: Proceedings of the 13th Symposium on Cloud Computing
November 2022
574 pages
ISBN:9781450394147
DOI:10.1145/3542929
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2022

Check for updates

Author Tags

  1. cloud computing
  2. low priority VM
  3. preemptible VM
  4. resource allocation
  5. spot instance
  6. virtual machine

Qualifiers

  • Research-article

Funding Sources

Conference

SoCC '22
Sponsor:
SoCC '22: ACM Symposium on Cloud Computing
November 7 - 11, 2022
California, San Francisco

Acceptance Rates

Overall Acceptance Rate 169 of 722 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)231
  • Downloads (Last 6 weeks)37
Reflects downloads up to 15 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Can't be lateProceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation10.5555/3691825.3691836(185-203)Online publication date: 16-Apr-2024
  • (2024)Towards SLO-Compliant and Cost-Effective Serverless Computing on Emerging GPU ArchitecturesProceedings of the 25th International Middleware Conference10.1145/3652892.3700760(211-224)Online publication date: 2-Dec-2024
  • (2023)spotDNN: Provisioning Spot Instances for Predictable Distributed DNN Training in the Cloud2023 IEEE/ACM 31st International Symposium on Quality of Service (IWQoS)10.1109/IWQoS57198.2023.10188717(1-10)Online publication date: 19-Jun-2023

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media