[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Formulating Criticality-Based Cost-Effective Fault Tolerance Strategies for Multi-Tenant Service-Based Systems

Published: 01 March 2018 Publication History

Abstract

The proliferation of cloud computing has fueled the rapid growth of multi-tenant service-based systems (SBSs), which serve multiple tenants simultaneously by composing existing services in the form of business processes. In a distributed and volatile operating environment, runtime anomalies may occur to the component services of an SBS and cause end-to-end quality violations. Engineering multi-tenant SBSs that can quickly handle runtime anomalies cost effectively has become a significant challenge. Different approaches have been proposed to formulate fault tolerance strategies for engineering SBSs. However, none of the existing approaches has sufficiently considered the service criticality based on multi-tenancy where multiple tenants share the same SBS instance with different multi-dimensional quality preferences. In this paper, we propose Criticality-based Fault Tolerance for Multi-Tenant SBSs (CFT4MTS), a novel approach that formulates cost-effective fault tolerance strategies for multi-tenant SBSs by providing redundancy for the critical component services. First, the criticality of each component service is evaluated based on its multi-dimensional quality and multiple tenants sharing the component service with differentiated quality preferences. Then, the fault tolerance problem is modelled as an Integer Programming problem to identify the optimal fault tolerance strategy. The experimental results show that, compared with three existing representative approaches, CFT4MTS can alleviate degradation in the quality of multi-tenant SBSs in a much more effective and efficient way.

References

[1]
L. Baresi and S. Guinea, “Self-supervising BPEL processes,” IEEE Trans. Softw. Eng., vol. Volume 37, no. Issue 2, pp. 247–263, 2011.
[2]
R. Calinescu, L. Grunske, M. Kwiatkowska, R. Mirandola, and G. Tamburrelli, “Dynamic QoS management and optimization in service-based systems,” IEEE Trans. Softw. Eng., vol. Volume 37, no. Issue 3, pp. 387–409, May/2011.
[3]
Q. He, J. Han, Y. Yang, H. Jin, J.-G. Schneider, and S. Versteeg, “Formulating cost-effective monitoring strategies for service-based systems,” IEEE Trans. Softw. Eng., vol. Volume 40, no. Issue 5, pp. 461–482, 2014.
[4]
, “Growth in web APIs from 2005 to 2013,” 2014. {Online}. Available: http://www.programmableweb.com/api-research
[5]
Q. He, J. Yan, H. Jin, and Y. Yang, “Quality-aware service selection for service-based systems based on iterative multi-attribute combinatorial auction,” IEEE Trans. Softw. Eng., vol. Volume 40, no. Issue 2, pp. 192–215, 2014.
[6]
Q. He, J. Han, Y. Yang, J. Grundy, and H. Jin, “QoS-driven service selection for multi-tenant SaaS,” in Proc. 5th IEEE Int. Conf. Cloud Comput., 2012, pp. 566–573.
[7]
M. Armbrust, et al., “A view of cloud computing,” Commun. ACM, vol. Volume 53, no. Issue 4, pp. 50–58, 2010.
[8]
B. P. Rimal, E. Choi, and I. Lumb, “A taxonomy and survey of cloud computing systems,” in Proc. 5th Int. Joint Conf. INC IMS IDC, 2009, pp. 44–51.
[9]
N. Poggi, D. Carrera, R. Gavalda, and E. Ayguadé, “Non-intrusive estimation of QoS degradation impact on e-commerce user satisfaction,” in Proc. 10th IEEE Int. Symp. Netw. Comput. Appl., 2011, pp. 179–186.
[10]
J. Barr, A. Narin, and J. Varia, “Building fault-tolerant applications on AWS,” 2011. {Online}. Available: http://media.amazonwebservices.com/AWS_Building_Fault_Tolerant_ Applications.pdf
[11]
L. Zeng, B. Benatallah, A. H. Ngu, M. Dumas, J. Kalagnanam, and H. Chang, “QoS-aware middleware for web services composition,” IEEE Trans. Softw. Eng., vol. Volume 30, no. Issue 5, pp. 311–327, 2004.
[12]
D. Ardagna and B. Pernici, “Adaptive service composition in flexible processes,” IEEE Trans. Softw. Eng., vol. Volume 33, no. Issue 6, pp. 369–384, 2007.
[13]
V. Nallur and R. Bahsoon, “A decentralized self-adaptation mechanism for service-based applications in the cloud,” IEEE Trans. Softw. Eng., vol. Volume 39, no. Issue 5, pp. 591–612, 2013.
[14]
K. Birman, R. van Renesse, and W. Vogels, “Adding high availability and autonomic behavior to web services,” in Proc. 26th Int. Conf. Softw. Eng., 2004, pp. 17–26.
[15]
Z. Zheng and M. R. Lyu, “A distributed replication strategy evaluation and selection framework for fault tolerant web services,” in Proc. 6th IEEE Int. Conf. Web Serv., 2008, pp. 145–152.
[16]
W. Zhao, P. Melliar-Smith, and L. E. Moser, “Fault tolerance middleware for cloud computing,” in Proc. 3rd IEEE Int. Conf. Cloud Comput., 2010, pp. 67–74.
[17]
P. A. Bonatti and P. Festa, “On optimal service selection,” in Proc. 14th Int. Conf. World Wide Web, 2005, pp. 530–538.
[18]
Z. Zheng, T. C. Zhou, M. R. Lyu, and I. King, “Component ranking for fault-tolerant cloud applications,” IEEE Trans. Serv. Comput., vol. Volume 5, no. Issue 4, pp. 540–550, –Dec. 2012.
[19]
Q. He, et al., “QoS-aware service selection for customisable multi-tenant service-based systems: Maturity and approaches,” in Proc. 8th IEEE Int. Conf. Cloud Comput., 2015, pp. 237–244.
[20]
Y. Wang, Q. He, and Y. Yang, “QoS-aware service recommendation for multi-tenant SaaS on the cloud,” in Proc. 12th IEEE Int. Conf. Service Comput., 2015, pp. 178–185.
[21]
T. Yu, Y. Zhang, and K.-J. Lin, “Efficient algorithms for web services selection with end-to-end QoS constraints,” ACM Trans. Web, vol. Volume 1, no. Issue 1, 2007, Art. no. 6.
[22]
M. Alrifai, D. Skoutas, and T. Risse, “Selecting skyline services for QoS-based web service composition,” in Proc. 19th Int. Conf. World Wide Web, 2010, pp. 11–20.
[23]
M. Zeleny and J. L. Cochrane, Multiple Criteria Decision Making . Columbia, SC, USA: Univ. South Carolina Press, 1973.
[24]
A. Saltelli, et al., Global Sensitivity Analysis: The Primer . Hoboken, NJ, USA: Wiley, 2008.
[25]
G. N. Rodrigues, D. S. Rosenblum, and S. Uchitel, “Sensitivity analysis for a scenario-based reliability prediction model,” in Proc. Workshop Archit. Depend. Syst., 2005, vol. Volume 30, no. Issue 4, pp. 1–5.
[26]
M. Harman, J. Krinke, I. Medina-Bulo, F. Palomo-Lozano, J. Ren, and S. Yoo, “Exact scalable sensitivity analysis for the next release problem,” ACM Trans. Softw. Eng. Methodology, vol. Volume 23, no. Issue 2, 2014, Art. no. 19.
[27]
F. C. Meng, “Comparing the importance of system components by some structural characteristics,” IEEE Trans. Rel., vol. Volume 45, no. Issue 1, pp. 59–65, 1996.
[28]
J. Freixas and M. Pons, “Identifying optimal components in a reliability system,” IEEE Trans. Rel., vol. Volume 57, no. Issue 1, pp. 163–170, 2008.
[29]
H. Peng, D. W. Coit, and Q. Feng, “Component reliability criticality or importance measures for systems with degrading components,” IEEE Trans. Rel., vol. Volume 61, no. Issue 1, pp. 4–12, 2012.
[30]
A. Avizienis, “The N-version approach to fault-tolerant software,” IEEE Trans. Softw. Eng., vol. Volume SE-11, no. Issue 12, pp. 1491–1501, 1985.
[31]
T. J. Shimeall and N. G. Leveson, “An empirical comparison of software fault tolerance and fault elimination,” IEEE Trans. Softw. Eng., vol. Volume 17, no. Issue 2, pp. 173–182, 1991.
[32]
B. Randell, “System structure for software fault tolerance,” ACM SIGPLAN Notices, vol. Volume 10, no. Issue 6, pp. 437–449, 1975.
[33]
V. Cardellini, E. Casalicchio, V. Grassi, S. Iannucci, F. L. Presti, and R. Mirandola, “MOSES: A framework for QoS driven runtime adaptation of service-oriented systems,” IEEE Trans. Softw. Eng., vol. Volume 38, no. Issue 5, pp. 1138–1159, 2012.
[34]
Q. He, J. Han, Y. Yang, J.-G. Schneider, H. Jin, and S. Versteeg, “Probabilistic critical path identification for cost-effective monitoring of service-based systems,” in Proc. 9th IEEE Int. Conf. Service Comput., 2012, pp. 178–185.
[35]
E. Al-Masri andQ. H. Mahmoud, “Discovering the best web service,” in Proc. 16th Int. Conf. World Wide Web, 2007, pp. 1257–1258.
[36]
D. P. Bertsekas, Nonlinear Programming . Belmont, MA, USA: Athena Scientific, 1999.
[37]
P. T. Boggs and J. W. Tolle, “Sequential quadratic programming,” Acta Numerica, vol. Volume 4, pp. 1–51, 1995.
[38]
S. Boyd and L. Vandenberghe, Convex Optimization . Cambridge, U.K.: Cambridge Univ. Press, 2004.
[39]
M. Salehie and L. Tahvildari, “Self-adaptive software: Landscape and research challenges,” ACM Trans. Auton. Adaptive Syst., vol. Volume 4, no. Issue 2, 2009, Art. no. 14.
[40]
G. Cugola, L. S. Pinto, and G. Tamburrelli, “QoS-aware adaptive service orchestrations,” in Proc. 19th IEEE Int. Conf. Web Serv., 2012, pp. 440–447.
[41]
R. Calinescu, C. Ghezzi, M. Kwiatkowska, and R. Mirandola, “Self-adaptive software needs quantitative verification at runtime,” Commun. ACM, vol. Volume 55, no. Issue 9, pp. 69–77, 2012.
[42]
G. A. Moreno, J. Cámara, D. Garlan, and B. Schmerl, “Proactive self-adaptation under uncertainty: A probabilistic model checking approach,” in Proc. 10th Joint Meet. Found. Softw. Eng., 2015, pp. 1–12.
[43]
M. Alrifai and T. Risse, “Combining global optimization with local selection for efficient QoS-aware service composition,” in Proc. 18th Int. Conf. World Wide Web, 2009, pp. 881–890.
[44]
D. E. Eckhardt, et al., “An experimental evaluation of software redundancy as a strategy for improving reliability,” IEEE Trans. Softw. Eng., vol. Volume 17, no. Issue 7, pp. 692–702, 1991.
[45]
A. Gorbenko, V. Kharchenko, and A. Romanovsky, “Using inherent service redundancy and diversity to ensure web services dependability,” in Methods, Models and Tools for Fault Tolerance . Berlin, Germany: Springer, 2009, pp. 324–341.
[46]
Z. Zheng and M. R. Lyu, “Selecting an optimal fault tolerance strategy for reliable service-oriented systems with local and global constraints,” IEEE Trans. Comput., vol. Volume 64, no. Issue 1, pp. 219–232, 2015.

Cited By

View all
  • (2023)FSPLO: a fast sensor placement location optimization method for cloud-aided inspection of smart buildingsJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-023-00410-012:1Online publication date: 6-Mar-2023
  • (2023)Enhancing Fault Injection Testing of Service Systems via Fault-Tolerance BottleneckIEEE Transactions on Software Engineering10.1109/TSE.2023.328535749:8(4097-4114)Online publication date: 1-Aug-2023
  • (2023)Modular models for systems based on multi-tenant servicesJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2023.10167135:8Online publication date: 1-Sep-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Software Engineering
IEEE Transactions on Software Engineering  Volume 44, Issue 3
March 2018
106 pages

Publisher

IEEE Press

Publication History

Published: 01 March 2018

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)FSPLO: a fast sensor placement location optimization method for cloud-aided inspection of smart buildingsJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-023-00410-012:1Online publication date: 6-Mar-2023
  • (2023)Enhancing Fault Injection Testing of Service Systems via Fault-Tolerance BottleneckIEEE Transactions on Software Engineering10.1109/TSE.2023.328535749:8(4097-4114)Online publication date: 1-Aug-2023
  • (2023)Modular models for systems based on multi-tenant servicesJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2023.10167135:8Online publication date: 1-Sep-2023
  • (2022)An Early Predictive and Recovery Mechanism for Scheduled Outages in Service-Based Systems (SBS)International Journal of Software Innovation10.4018/IJSI.30701610:1(1-35)Online publication date: 5-Aug-2022
  • (2022)DIMA: Distributed cooperative microservice caching for internet of things in edge computing by deep reinforcement learningWorld Wide Web10.1007/s11280-021-00939-725:5(1769-1792)Online publication date: 1-Sep-2022
  • (2021)Embedding app-library graph for neural third party library recommendationProceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3468264.3468552(466-477)Online publication date: 20-Aug-2021
  • (2021)SALDEFT: Self-Adaptive Learning Differential Evolution Based Optimal Physical Machine Selection for Fault Tolerance Problem in CloudWireless Personal Communications: An International Journal10.1007/s11277-021-08089-9118:2(1453-1480)Online publication date: 1-May-2021
  • (2020)Learning deep networks with crowdsourcing for relevance evaluationEURASIP Journal on Wireless Communications and Networking10.1186/s13638-020-01697-22020:1Online publication date: 25-Apr-2020
  • (2020)Study QoS Optimization and Energy Saving Techniques in Cloud, Fog, Edge, and IoTComplexity10.1155/2020/89641652020Online publication date: 1-Jan-2020
  • (2020)A New Multiple-Distribution GAN Model to Solve Complexity in End-to-End Chromosome KaryotypingComplexity10.1155/2020/89238382020Online publication date: 1-Jan-2020
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media