Abstract
In cloud computing, resources are dynamically provisioned and delivered to users in a transparent manner automatically on-demand. Task execution failure is no longer accidental but a common characteristic of cloud computing environment. In recent times, a number of intelligent scheduling techniques have been used to address task scheduling issues in cloud without much attention to fault tolerance. In this research article, we proposed a dynamic clustering league championship algorithm (DCLCA) scheduling technique for fault tolerance awareness to address cloud task execution which would reflect on the current available resources and reduce the untimely failure of autonomous tasks. Experimental results show that our proposed technique produces remarkable fault reduction in task failure as measured in terms of failure rate. It also shows that the DCLCA outperformed the MTCT, MAXMIN, ant colony optimization and genetic algorithm-based NSGA-II by producing lower makespan with improvement of 57.8, 53.6, 24.3 and 13.4 % in the first scenario and 60.0, 38.9, 31.5 and 31.2 % in the second scenario, respectively. Considering the experimental results, DCLCA provides better quality fault tolerance aware scheduling that will help to improve the overall performance of the cloud environment.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Gital AY, Ismail AS, Chen M, Chiroma H (2014) A framework for the design of cloud based collaborative virtual environment architecture. In: Proceedings of the international multi conference of engineers and computer scientists
Lu K, Yahyapour R, Wieder P, Yaqub E, Abdullah M, Schloer B, Kotsokalis C (2016) Fault-tolerant service level agreement lifecycle management in clouds using actor system. Future Gener Comput Syst 54:247–259
Moon Y-H, Youn C-H (2015) Multihybrid job scheduling for fault-tolerant distributed computing in policy-constrained resource networks. Comput Netw 82:81–95
He J, Dong M, Ota K, Fan M, Wang G (2014) NetSecCC: a scalable and fault-tolerant architecture for cloud computing security. Peer-to-Peer Netw Appl 9(1):67–81
Nawi NM, Khan A, Rehman MZ, Chiroma H, Herawan T (2015) Weight optimization in recurrent neural networks with hybrid metaheuristic Cuckoo search techniques for data classification. Math Probl Eng. doi:10.1155/2015/868375
Mills B, Znati T, Melhem R (2014) Shadow computing: an energy-aware fault tolerant computing model. In: 2014 International conference on computing, networking and communications (ICNC), IEEE, pp 73–77
Kashan HA (2009) League championship algorithm: a new algorithm for numerical function optimization. In: International conference of soft computing and pattern recognition, 2009. SOCPAR’09, IEEE, pp 43–48
Kashan HA, Karimi B (2012) A new algorithm for constrained optimization inspired by the sport league championships. In: 2010 IEEE congress on evolutionary computation (CEC), IEEE, pp 1–8
Abdulhamid SM, Latiff MSA, Ismaila I (2014) Tasks scheduling technique using league championship algorithm for makespan minimization in IAAS cloud. ARPN J Eng Appl Sci 9(12):2528–2533
Abdulhamid SM, Latiff MSA, Madni SHH, Oluwafemi O (2015) A survey of league championship algorithm: prospects and challenges. Indian Jo Sci Technol 8(S3):101–110
Yang Y-G, Tian J, Lei H, Zhou Y-H, Shi W-M (2016) Novel quantum image encryption using one-dimensional quantum cellular automata. Inf Sci 345:257–270
Dondi R, El-Mabrouk N, Swenson KM (2014) Gene tree correction for reconciliation and species tree inference: complexity and algorithms. J Discrete Algorithms 25:51–65
Abdulhamid SM, Latiff MSA, Bashir MB (2014) On-demand grid provisioning using cloud infrastructures and related virtualization tools: a survey and taxonomy. Int J Adv Stud Comput Sci Eng IJASCSE 3(1):49–59
Kushwah VS, Goyal SK, Narwariya P (2014) A survey on various fault tolerant approaches for cloud environment during load balancing. Int J Comput Netw Wirel Mobile Commun 4(6):25–34
Yang W, Zhang C, Shao Y, Shi Y, Li H, Khan M, Hussain F, Khan I, Cui L-J, He H (2014) A hybrid particle swarm optimization algorithm for service selection problem in the cloud. Int J Grid Distrib Comput 7(4):1–10
Hussin M, Lee YC, Zomaya AY (2010) Dynamic job-clustering with different computing priorities for computational resource allocation. In: Proceedings of the 2010 10th IEEE/ACM international conference on cluster, cloud and grid computing, IEEE Computer Society, pp 589–590
Vidhate D, Patil A, Guleria D (2010) Dynamic cluster resource allocations for jobs with known memory demands. In: Proceedings of the international conference and workshop on emerging trends in technology, ACM, pp 64–69
SiM Abdulhamid, Latiff SMA, Bashir MB (2014) Scheduling techniques in on-demand grid as a service cloud: a review. J Theor Appl Inf Technol 63(1):10–19
Abdullahi M, Ngadi MA (2016) Symbiotic organism search optimization based task scheduling in cloud computing environment. Future Gener Comput Syst 56:640–650
Madni SHH, Latiff MSA, Coulibaly Y (2016) An appraisal of meta-heuristic resource allocation techniques for IaaS cloud. Indian J Sci Technol 9(4):1–14. doi:10.17485/ijst/2016/v9i4/80561
Chiroma H, Shuib NLM, Muaz SA, Abubakar AI, Ila LB, Maitama JZ (2015) A review of the applications of bio-inspired flower pollination algorithm. Procedia Comput Sci 62:435–441
Boru D, Kliazovich D, Granelli F, Bouvry P, Zomaya AY (2015) Energy-efficient data replication in cloud computing datacenters. Clust Comput 18(1):385–402. doi:10.1007/s10586-014-0404-x
Gangeshwari R, Subbiah J, Malathy K, Miriam D (2012) HPCLOUD: a novel fault tolerant architectural model for hierarchical MapReduce. In: 2012 international conference on recent trends in information technology (ICRTIT), IEEE, pp 179–184
G\({\c a}\)sior J, Seredyński F (2013) Multi-objective parallel machines scheduling for fault-tolerant cloud systems. In: Joanna K, Di Martino B, Talia D, Xiong K (eds) Algorithms and architectures for parallel processing. Springer, Switzerland, pp 247–256. doi:10.1007/978-3-319-03859-9_21
Ling Y, Ouyang Y, Luo Z (2012) A novel fault-tolerant scheduling algorithm with high reliability in cloud computing systems. J Converg Inf Technol 7(15):107–115. doi:10.4156/jcit.vol7.issue15.13
Tawfeek M, El-Sisi A, Keshk A, Torkey F (2015) Cloud task scheduling based on ant colony optimization. Int Arab J Inf Technol (IAJIT) 12(2):129–137
Ganga K, Karthik S (2013) A fault tolerent approach in scientific workflow systems based on cloud computing. In: 2013 international conference on pattern recognition, informatics and mobile engineering (PRIME), IEEE, pp 387–390
Bala A, Chana I (2015) Autonomic fault tolerant scheduling approach for scientific workflows in Cloud computing. Concurr Eng 23(1):27–39. doi:10.1177/1063293X14567783
Bonvin N, Papaioannou TG, Aberer K (2010) A self-organized, fault-tolerant and scalable replication scheme for cloud storage. In: Proceedings of the 1st ACM symposium on cloud computing, ACM, pp 205–216
Sampaio AM, Barbosa JG (2015) A performance enforcing mechanism for energy-and failure-aware cloud systems. In: 2014 international green computing conference, IGCC 2014. doi:10.1109/IGCC.2014.7039151
Patra PK, Singh H, Singh G (2013) Fault tolerance techniques and comparative implementation in cloud computing. Int J Comput Appl 64(14):37–41
Nawi NM, Khan A, Rehman M, Chiroma H, Herawan T (2015) Weight optimization in recurrent neural networks with hybrid metaheuristic Cuckoo search techniques for data classification. Math Probl Eng 501:868375
Xu H, Yang B, Qi W, Ahene E (2016) A multi-objective optimization approach to workflow scheduling in clouds considering fault recovery. KSII Trans Internet Inf Syst 10(3):976–995. doi:10.3837/tiis.2016.03.002
Kumar VS, Aramudhan M (2014) Hybrid optimized list scheduling and trust based resource selection in cloud computing. J Theor Appl Inf Technol 69(3):434–442
Choi S, Chung K, Yu H (2014) Fault tolerance and QoS scheduling using CAN in mobile social cloud computing. Clust Comput 17(3):911–926
Kaveh A (2014) Particle swarm optimization. In: Advances in metaheuristic algorithms for optimal design of structures. Springer, Switzerland, pp 9–40. doi:10.1007/978-3-319-05549-7
Yuan H, Li C, Du M (2014) Optimal virtual machine resources scheduling based on improved particle swarm optimization in cloud computing. J Softw 9(3):705–708
Verma A, Kaushal S (2014) Bi-criteria priority based particle swarm optimization workflow scheduling algorithm for cloud. In: 2014 recent advances in engineering and computational sciences (RAECS), IEEE, pp 1–6
Ramezani F, Lu J, Hussain FK (2014) Task-based system load balancing in cloud computing using particle swarm optimization. Int J Parallel Program 42(5):739–754
Yang W, Zhang C, Shao Y, Shi Y, Li H, Khan M, Hussain F, Khan I, Cui L-J, He H (2014) A hybrid particle swarm optimization algorithm for service selection problem in the cloud. Int J Grid Distrib Comput 7(4):1–10. doi:10.14257/ijgdc.2014.7.4.01
Wu K (2014) A tunable workflow scheduling algorithm based on particle swarm optimization for cloud computing. Master’s Projects, Paper 358. San José State University, USA
Zhang W, Xie H, Cao B, Cheng AM (2014) Energy-aware real-time task scheduling for heterogeneous multiprocessors with particle swarm optimization algorithm. Math Probl Eng 2014: 1–9. doi:10.1155/2014/287475
Gao Y, Gupta SK, Wang Y, Pedram M (2014) An energy-aware fault tolerant scheduling framework for soft error resilient cloud computing systems. In: Design, automation and test in Europe conference and exhibition (DATE), 2014, IEEE, pp 1–6
Hu Y, Gong B, Wang F (2010) Cloud model-based security-aware and fault-tolerant job scheduling for computing grid. In: ChinaGrid conference (ChinaGrid), 2010 fifth annual, IEEE, pp 25–30
Qiang W, Jiang C, Ran L, Zou D, Jin H (2015) CDMCR: multi-level fault-tolerant system for distributed applications in cloud. Secur Commun Netw 2015:SCN-SI-077. doi:10.1002/sec.1187
Urgaonkar R, Wang SQ, He T, Zafer M, Chan K, Leung KK (2015) Dynamic service migration and workload scheduling in edge-clouds. Perform Eval 91:205–228. doi:10.1016/j.peva.2015.06.013
Vobugari S, Somayajulu D, Subaraya BM (2015) Dynamic replication algorithm for data replication to improve system availability: a performance engineering approach. IETE J Res 61(2):132–141. doi:10.1080/03772063.2014.988757
Rathore N, Chana I (2014) Load balancing and job migration techniques in grid: a survey of recent trends. Wirel Pers Commun 79(3):2089–2125
Yadav S, Nanda SJ (2015) League championship algorithm for clustering. In: 2015 IEEE power, communication and information technology conference (PCITC), IEEE, pp 321–326
Xu W, Wang R, Yang J (2015) An improved league championship algorithm with free search and its application on production scheduling. J Intell Manuf 1–10. doi:10.1007/s10845-015-1099-4
Buyya R, Ranjan R, Calheiros RN (2009) Modeling and simulation of scalable cloud computing environments and the CloudSim toolkit: challenges and opportunities. In: International conference on high performance computing & simulation, 2009. HPCS’09, IEEE, pp 1–11
Parallel Workload Archive - SDSC-SP2-1998-4.swf (2015). http://www.cs.huji.ac.il/labs/parallel/workload/l_sdsc_sp2/index.html. Accessed 30 Jan 2015
Ramakrishnan L, Reed DA (2008) Performability modeling for scheduling and fault tolerance strategies for scientific workflows. In: Proceedings of the 17th international symposium on High performance distributed computing, ACM, pp 23–34
Garg R, Singh AK (2014) Fault tolerant task scheduling on computational grid using checkpointing under transient faults. Arabian J Sci Eng 39(12):8775–8791
Acknowledgments
The authors would like to acknowledge and appreciate the support of Universiti Teknologi Malaysia (UTM), Research University Grant Q. J130000.2528.05H87 and the Nigerian Tertiary Education Trust Fund (TetFund) for their support.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Abdulhamid, S.M., Abd Latiff, M., Madni, S.H.H. et al. Fault tolerance aware scheduling technique for cloud computing environment using dynamic clustering algorithm. Neural Comput & Applic 29, 279–293 (2018). https://doi.org/10.1007/s00521-016-2448-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-016-2448-8