HAS: Hybrid auto-scaler for resource scaling in cloud environment

Authors:

Bibal Benifa J.V.,

Dejey Dharma

Volume 120, Issue C

Pages 1 - 15

https://doi.org/10.1016/j.jpdc.2018.04.016

Published: 01 October 2018

Abstract

Auto-scaling is a crucial mechanism that supports autonomic provisioning and de-provisioning of computing resources in response to fluctuating demand in a cloud environment. The success of autonomic provisioning depends on two performance metrics: resource utilization and response time. The existing literature focuses on either reactive or predictive auto-scaling mechanisms; when these mechanisms are employed in isolation, the computing system is unable to scale proportionally under the Slashdot effect or abrupt traffic bursts. Predictive methods attempt to forecast future computational needs and acquire or release resources in advance; however, this can lead to under-utilization. Hence, a Hybrid Auto-Scaler (HAS) is proposed to automatically adjust the resources allocated to an application according to demand. HAS forecasts the future behaviour of the system using a time series method and deploys the anticipated resources by computing the required capacity through a queuing model. Further, it uses a reactive approach to scale out when the provisioned resources are insufficient to meet current needs. HAS also balances the load efficiently by employing a Continuous Time Markov Model (CTMM). HAS is validated with several benchmark workloads and achieves significant improvement in CPU utilization and response time.
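The predictive-then-reactive flow described in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the abstract does not give the queuing equations, so the sketch assumes each server behaves as an independent M/M/1 queue with an even load split and sizes the pool to a target utilization. The function names, the `target_utilization` parameter and its default value are hypothetical.

```python
import math


def servers_for_rate(arrival_rate, service_rate, target_utilization=0.8):
    """Smallest server count that keeps per-server utilization at or below target.

    Assumption (not from the paper): each server is an independent M/M/1 queue
    and arrivals are split evenly, so utilization = arrival_rate / (n * service_rate).
    """
    if service_rate <= 0 or not 0 < target_utilization < 1:
        raise ValueError("service_rate must be > 0 and target_utilization in (0, 1)")
    if arrival_rate <= 0:
        return 1  # keep one server running even when idle
    return max(1, math.ceil(arrival_rate / (service_rate * target_utilization)))


def hybrid_decision(forecast_rate, observed_rate, service_rate, target_utilization=0.8):
    """Combine the predictive plan with a reactive correction."""
    # Predictive step: capacity for the forecast arrival rate.
    planned = servers_for_rate(forecast_rate, service_rate, target_utilization)
    # Reactive step: capacity the currently observed load actually requires.
    needed_now = servers_for_rate(observed_rate, service_rate, target_utilization)
    # Scale out beyond the plan only when the plan falls short of current needs.
    return max(planned, needed_now)
```

With a forecast of 100 requests/s, an observed burst of 150 requests/s and a service rate of 10 requests/s per server, the predictive plan alone would deploy 13 servers, while the reactive correction raises this to 19.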

Highlights

• A Hybrid Auto-Scaler (HAS) framework is developed for automated resource scaling in a cloud environment. It combines predictive and reactive methods for effective auto-scaling.

• HAS employs auto-regression of order one, AR(1), to estimate the future arrival rate. A novel set of equations is proposed to compute the future resource requirement.

• The reactive method is used only when the computed resources are insufficient to handle the workload.

• A Continuous Time Markov Model is employed to allocate resources and balance the load.

• The HAS framework is validated in a real cloud environment to demonstrate its efficiency in terms of resource utilization, response time and scalability.
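The auto-regression of order one mentioned in the highlights can be sketched as follows. This is a generic AR(1) fit by ordinary least squares, x[t] = c + phi * x[t-1] + noise, offered as an illustration only; the paper's own estimation procedure and resource equations are not shown in this excerpt, and the function names `fit_ar1` and `forecast_next` are hypothetical.

```python
def fit_ar1(series):
    """Least-squares estimate of (c, phi) in x[t] = c + phi * x[t-1] + noise."""
    prev, curr = series[:-1], series[1:]
    n = len(prev)
    if n < 2:
        raise ValueError("need at least three observations")
    mean_prev = sum(prev) / n
    mean_curr = sum(curr) / n
    var_prev = sum((p - mean_prev) ** 2 for p in prev)
    if var_prev == 0:
        # Constant history: no slope can be estimated, so forecast the mean.
        return mean_curr, 0.0
    cov = sum((p - mean_prev) * (q - mean_curr) for p, q in zip(prev, curr))
    phi = cov / var_prev
    return mean_curr - phi * mean_prev, phi


def forecast_next(series):
    """One-step-ahead forecast of the next value (e.g. the arrival rate)."""
    c, phi = fit_ar1(series)
    return c + phi * series[-1]
```

For example, `forecast_next(arrival_rates)` applied to a list of recent per-interval arrival rates yields the estimate for the next interval, which could then be passed to a capacity calculation.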
