Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleDecember 2024
Towards SLO-Compliant and Cost-Effective Serverless Computing on Emerging GPU Architectures
- Vivek M. Bhasi,
- Aakash Sharma,
- Rishabh Jain,
- Jashwant Raj Gunasekaran,
- Ashutosh Pattnaik,
- Mahmut Taylan Kandemir,
- Chita Das
Middleware '24: Proceedings of the 25th International Middleware ConferencePages 211–224https://doi.org/10.1145/3652892.3700760Serverless platforms are supporting an increasing variety of applications (apps). Among these, apps such as Machine Learning (ML) inference serving can benefit significantly from leveraging accelerators like GPUs. Yet, major serverless providers, despite ...
SpotVerse: Optimizing Bioinformatics Workflows with Multi-Region Spot Instances in Galaxy and Beyond
Middleware '24: Proceedings of the 25th International Middleware ConferencePages 74–87https://doi.org/10.1145/3652892.3700750As demand for cloud computing in bioinformatics increases, various studies have explored options for running large-scale workloads with reduced costs, often leveraging spot instances in multi-region deployments. For example, spot instances offer lower ...
- research-articleOctober 2023
AsyFunc: A High-Performance and Resource-Efficient Serverless Inference System via Asymmetric Functions
SoCC '23: Proceedings of the 2023 ACM Symposium on Cloud ComputingPages 324–340https://doi.org/10.1145/3620678.3624664Recent advances in deep learning (DL) have spawned various intelligent cloud services with well-trained DL models. Nevertheless, it is nontrivial to maintain the desired end-to-end latency under bursty workloads, raising critical challenges on high-...
- research-articleNovember 2022
Owl: performance-aware scheduling for resource-efficient function-as-a-service cloud
SoCC '22: Proceedings of the 13th Symposium on Cloud ComputingPages 78–93https://doi.org/10.1145/3542929.3563470This work documents our experience of improving the scheduler in Alibaba Function Compute, a public FaaS platform. It commences with our observation that memory and CPU are under-utilized in most FaaS sandboxes. A natural solution is to overcommit VM ...
- research-articleNovember 2022
Cypress: input size-sensitive container provisioning and request scheduling for serverless platforms
SoCC '22: Proceedings of the 13th Symposium on Cloud ComputingPages 257–272https://doi.org/10.1145/3542929.3563464The growing popularity of the serverless platform has seen an increase in the number and variety of applications (apps) being deployed on it. The majority of these apps process user-provided input to produce the desired results. Existing work in the ...
- research-articleNovember 2021
Kraken: Adaptive Container Provisioning for Deploying Dynamic DAGs in Serverless Platforms
- Vivek M. Bhasi,
- Jashwant Raj Gunasekaran,
- Prashanth Thinakaran,
- Cyan Subhra Mishra,
- Mahmut Taylan Kandemir,
- Chita Das
SoCC '21: Proceedings of the ACM Symposium on Cloud ComputingPages 153–167https://doi.org/10.1145/3472883.3486992The growing popularity of microservices has led to the proliferation of online cloud service-based applications, which are typically modelled as Directed Acyclic Graphs (DAGs) comprising of tens to hundreds of microservices. The vast majority of these ...
- research-articleJanuary 2021
Implications of Public Cloud Resource Heterogeneity for Inference Serving
- Jashwant Raj Gunasekaran,
- Cyan Subhra Mishra,
- Prashanth Thinakaran,
- Mahmut Taylan Kandemir,
- Chita R. Das
WoSC '20: Proceedings of the 2020 Sixth International Workshop on Serverless ComputingPages 7–12https://doi.org/10.1145/3429880.3430093We are witnessing an increasing trend towards using Machine Learning (ML) based prediction systems, spanning across different application domains, including product recommendation systems, personal assistant devices, facial recognition, etc. These ...
Fifer: Tackling Resource Underutilization in the Serverless Era
- Jashwant Raj Gunasekaran,
- Prashanth Thinakaran,
- Nachiappan C. Nachiappan,
- Mahmut Taylan Kandemir,
- Chita R. Das
Middleware '20: Proceedings of the 21st International Middleware ConferencePages 280–295https://doi.org/10.1145/3423211.3425683Datacenters are witnessing a rapid surge in the adoption of serverless functions for microservices-based applications. A vast majority of these microservices typically span less than a second, have strict SLO requirements, and are chained together as ...