Abstract
In server for cluster, the number of running programs is increasing. The benefit of consolidating multiple programs is good for server utilization, but it also leads to the program’s performance degradation. Severe performance degradation can result in significant losses. Therefore, it is essential to divide the shared resource to support consolidation. Our experiment showed that even some shared resources such as CPU cores and memory had been divided, the performance of programs still drop down significantly compared to the program running alone, then we found out the primary reason was the contention for LLC. In this paper, we proposed the LLC partitioning method to improve the performance for consolidation programs. We classify the LLC usage type of the program by analyzing the LLC behavior, then allocate reasonable LLC ways according to the LLC usage type. Meanwhile, we monitor the program’s performance in real-time and allocate the LLC ways dynamically. The experiment found that compared with the default LLC allocation method, our method reduced the performance loss by an average of 6.73% and improved the fairness by 0.03. Compared with the CPA method, our method reduced the performance loss by an average of 4.86%.
D. Li and L. Wang—Contributed equally to this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Nikas, K., Papadopoulou, N., Giantsidi, D., Karakostas, V., Goumas, G., Koziris, N.: DICER: diligent cache partitioning for efficient workload consolidation. In: Proceedings of the 48th International Conference on Parallel Processing, pp. 1–10 (2019)
Sfakianakis, Y., Kozanitis, C., Kozyrakis, C., Bilas, A.: QuMan: profile-based improvement of cluster utilization. ACM Trans. Archit. Code Optim. (TACO) 15(3), 1–25 (2018)
CGroups (2021). https://www.kernel.org/doc/Documentation/cgroup-v1/cgroups.txt
Intel(R) Resource Director Technology (2021). https://github.com/intel/intel-cmt-cat
SPECCPU2006 (2021). https://www.spec.org/cpu2006/
PARSEC (2021). https://parsec.cs.princeton.edu/
Tang, L., Mars, J., Soffa, M.L.: Contentiousness vs. sensitivity: improving contention aware runtime systems on multicore architectures. In: Proceedings of the 1st International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era, pp. 12–21 (2011)
Qureshi, M.K.: Adaptive spill-receive for robust high-performance caching in CMPs. In: 2009 IEEE 15th International Symposium on High Performance Computer Architecture, pp. 45–54. IEEE (2009)
Pons, L., Sahuquillo, J., Selfa, V., Petit, S., Pons, J.: Phase-aware cache partitioning to target both turnaround time and system performance. IEEE Trans. Parallel Distrib. Syst. 31(11), 2556–2568 (2020)
Selfa, V., Sahuquillo, J., Eeckhout, L., Petit, S., Gomez, M.E.: Application clustering policies to address system fairness with intel’s cache allocation technology. In: 2017 26th International Conference on Parallel Architectures and Compilation Techniques (PACT), pp. 194–205. IEEE (2017)
Park, J., Park, S., Baek, W.: CoPart: coordinated partitioning of last-level cache and memory bandwidth for fairness-aware workload consolidation on commodity servers. In: Proceedings of the Fourteenth EuroSys Conference 2019, pp. 1–16 (2019)
Aupy, G., Benoit, A., Goglin, B., Pottier, L., Robert, Y.: Co-scheduling HPC workloads on cache-partitioned CMP platforms. Int. J. High Perform. Comput. Appl. 33(6), 1221–1239 (2019)
Chen, W., Rao, J., Zhou, X.: Preemptive, low latency datacenter scheduling via lightweight virtualization. In: 2017 USENIX Annual Technical Conference (USENIXATC 17), pp. 251–263 (2017)
Xiang, Y., Wang, X., Huang, Z., Wang, Z., Luo, Y., Wang, Z.: DCAPS: dynamic cache allocation with partial sharing. In: Proceedings of the Thirteenth EuroSys Conference, pp. 1–15 (2018)
Lo, D., Cheng, L., Govindaraju, R., Ranganathan, P., Kozyrakis, C.: Heracles: improving resource efficiency at scale. In: Proceedings of the 42nd Annual International Symposium on Computer Architecture, pp. 450–462 (2015)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, D., Wang, L., Huang, T., Zhu, X., Geng, S. (2022). PFA: Performance and Fairness-Aware LLC Partitioning Method. In: Lai, Y., Wang, T., Jiang, M., Xu, G., Liang, W., Castiglione, A. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2021. Lecture Notes in Computer Science(), vol 13157. Springer, Cham. https://doi.org/10.1007/978-3-030-95391-1_44
Download citation
DOI: https://doi.org/10.1007/978-3-030-95391-1_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-95390-4
Online ISBN: 978-3-030-95391-1
eBook Packages: Computer ScienceComputer Science (R0)