research-article

Modelling and developing conflict-aware scheduling on large-scale data centres

Authors:

Bo Gao,

Chang-Tsun LiAuthors Info & Claims

Volume 86, Issue C

Pages 995 - 1007

https://doi.org/10.1016/j.future.2017.07.043

Published: 01 September 2018 Publication History

Abstract

Large-scale data centres are the growing trend for modern computing systems. Since a large-scale data centre has to manage a large number of machines and jobs, deploying multiple independent schedulers (termed as distributed schedulers in literature) to make scheduling decisions simultaneously has been shown as an effective way to speed up the processing of large quantity of submitted jobs and data. The key drawback of distributed schedulers is that since these schedulers schedule different jobs independently, the scheduling decisions made by different schedulers may conflict with each other due to the possibility that different scheduling decisions refer to the same subset of the resources in the data centre. Conflicting scheduling decisions cause additional scheduling attempts and consequently increase the scheduling cost. More resources each scheduler demands, higher scheduling cost may incur and longer job response times the users may experience. It is useful to investigate the balanced points in terms of resource demands for each of independent schedulers, so that the distributed schedulers can all achieve decent job performance without experiencing undesired resource competition. To address this issue, we model distributed scheduling and resource conflict using the game theory and conduct the quantitative analysis about scheduling cost and job performance. Further, based on the analysis, we develop the conflict-aware scheduling strategies to reduce the scheduling cost and improve job performance. We have conducted the simulation experiments with workload trace and also real experiments on Amazon Web Services(AWS). The experimental results verify the effectiveness of the proposed modelling approach and scheduling strategies.

Highlights

•

Propose a method to quantify the relation between the scheduling conflicts and the resource demands.

•

Develop a game-theoretical solution for the distributed schedulers in large scale data centres.

•

Design and conduct both simulation experiments and real experiments on Amazon Web Service.

References

[1]

Verma A., Pedrosa L., Korupolu M., Oppenheimer D., Tune E., Wilkes J., Large-scale cluster management at Google with Borg, in: Proceedings of the Tenth European Conference on Computer Systems, ACM, 2015, p. 18.

Abstract

Highlights

References

Index Terms

Recommendations

Scheduling of deteriorating jobs with release dates to minimize the maximum lateness

Single machine parallel-batch scheduling with deteriorating jobs

Scheduling jobs under decreasing linear deterioration

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

View options

Share

Share this Publication link

Share on social media

Affiliations