PFA: Performance and Fairness-Aware LLC Partitioning Method

Donghua Li¹⁴,
Lin Wang¹⁴,
Tianyuan Huang¹⁴,
Xiaomin Zhu¹⁵ &
…
Shichao Geng¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 13157))

Included in the following conference series:

International Conference on Algorithms and Architectures for Parallel Processing

1816 Accesses

Abstract

In server for cluster, the number of running programs is increasing. The benefit of consolidating multiple programs is good for server utilization, but it also leads to the program’s performance degradation. Severe performance degradation can result in significant losses. Therefore, it is essential to divide the shared resource to support consolidation. Our experiment showed that even some shared resources such as CPU cores and memory had been divided, the performance of programs still drop down significantly compared to the program running alone, then we found out the primary reason was the contention for LLC. In this paper, we proposed the LLC partitioning method to improve the performance for consolidation programs. We classify the LLC usage type of the program by analyzing the LLC behavior, then allocate reasonable LLC ways according to the LLC usage type. Meanwhile, we monitor the program’s performance in real-time and allocate the LLC ways dynamically. The experiment found that compared with the default LLC allocation method, our method reduced the performance loss by an average of 6.73% and improved the fairness by 0.03. Compared with the CPA method, our method reduced the performance loss by an average of 4.86%.

D. Li and L. Wang—Contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 79.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 99.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

When Partitioning Works and When It Doesn’t: An Empirical Study on Cache Way Partitioning

Min_c: Heterogeneous concentration policy for energy-aware scheduling of jobs with resource contention

Article 26 May 2017

A novel index system describing program runtime characteristics for workload consolidation

Article 13 November 2018

References

Nikas, K., Papadopoulou, N., Giantsidi, D., Karakostas, V., Goumas, G., Koziris, N.: DICER: diligent cache partitioning for efficient workload consolidation. In: Proceedings of the 48th International Conference on Parallel Processing, pp. 1–10 (2019)
Google Scholar
Sfakianakis, Y., Kozanitis, C., Kozyrakis, C., Bilas, A.: QuMan: profile-based improvement of cluster utilization. ACM Trans. Archit. Code Optim. (TACO) 15(3), 1–25 (2018)
Article Google Scholar
CGroups (2021). https://www.kernel.org/doc/Documentation/cgroup-v1/cgroups.txt
Intel(R) Resource Director Technology (2021). https://github.com/intel/intel-cmt-cat
SPECCPU2006 (2021). https://www.spec.org/cpu2006/
PARSEC (2021). https://parsec.cs.princeton.edu/
Tang, L., Mars, J., Soffa, M.L.: Contentiousness vs. sensitivity: improving contention aware runtime systems on multicore architectures. In: Proceedings of the 1st International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era, pp. 12–21 (2011)
Google Scholar
Qureshi, M.K.: Adaptive spill-receive for robust high-performance caching in CMPs. In: 2009 IEEE 15th International Symposium on High Performance Computer Architecture, pp. 45–54. IEEE (2009)
Google Scholar
Pons, L., Sahuquillo, J., Selfa, V., Petit, S., Pons, J.: Phase-aware cache partitioning to target both turnaround time and system performance. IEEE Trans. Parallel Distrib. Syst. 31(11), 2556–2568 (2020)
Article Google Scholar
Selfa, V., Sahuquillo, J., Eeckhout, L., Petit, S., Gomez, M.E.: Application clustering policies to address system fairness with intel’s cache allocation technology. In: 2017 26th International Conference on Parallel Architectures and Compilation Techniques (PACT), pp. 194–205. IEEE (2017)
Google Scholar
Park, J., Park, S., Baek, W.: CoPart: coordinated partitioning of last-level cache and memory bandwidth for fairness-aware workload consolidation on commodity servers. In: Proceedings of the Fourteenth EuroSys Conference 2019, pp. 1–16 (2019)
Google Scholar
Aupy, G., Benoit, A., Goglin, B., Pottier, L., Robert, Y.: Co-scheduling HPC workloads on cache-partitioned CMP platforms. Int. J. High Perform. Comput. Appl. 33(6), 1221–1239 (2019)
Article Google Scholar
Chen, W., Rao, J., Zhou, X.: Preemptive, low latency datacenter scheduling via lightweight virtualization. In: 2017 USENIX Annual Technical Conference (USENIXATC 17), pp. 251–263 (2017)
Google Scholar
Xiang, Y., Wang, X., Huang, Z., Wang, Z., Luo, Y., Wang, Z.: DCAPS: dynamic cache allocation with partial sharing. In: Proceedings of the Thirteenth EuroSys Conference, pp. 1–15 (2018)
Google Scholar
Lo, D., Cheng, L., Govindaraju, R., Ranganathan, P., Kozyrakis, C.: Heracles: improving resource efficiency at scale. In: Proceedings of the 42nd Annual International Symposium on Computer Architecture, pp. 450–462 (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

Shandong Normal University, Jinan, China
Donghua Li, Lin Wang, Tianyuan Huang & Shichao Geng
Shandong Institute of Big Data, Jinan, China
Xiaomin Zhu

Authors

Donghua Li
View author publications
You can also search for this author in PubMed Google Scholar
Lin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Tianyuan Huang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaomin Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Shichao Geng
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Xiamen University, Xiamen, China
Yongxuan Lai
Beijing Normal University, Zhuhai, China
Tian Wang
Xiamen University, Xiamen, China
Min Jiang
Tianjin University, Tianjin, China
Guangquan Xu
Hunan University, Changsha, China
Wei Liang
University of Naples Parthenope, Naples, Italy
Aniello Castiglione

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, D., Wang, L., Huang, T., Zhu, X., Geng, S. (2022). PFA: Performance and Fairness-Aware LLC Partitioning Method. In: Lai, Y., Wang, T., Jiang, M., Xu, G., Liang, W., Castiglione, A. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2021. Lecture Notes in Computer Science(), vol 13157. Springer, Cham. https://doi.org/10.1007/978-3-030-95391-1_44

Download citation

DOI: https://doi.org/10.1007/978-3-030-95391-1_44
Published: 23 February 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-95390-4
Online ISBN: 978-3-030-95391-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

PFA: Performance and Fairness-Aware LLC Partitioning Method

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

When Partitioning Works and When It Doesn’t: An Empirical Study on Cache Way Partitioning

Min_c: Heterogeneous concentration policy for energy-aware scheduling of jobs with resource contention

A novel index system describing program runtime characteristics for workload consolidation

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

PFA: Performance and Fairness-Aware LLC Partitioning Method

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

When Partitioning Works and When It Doesn’t: An Empirical Study on Cache Way Partitioning

Min_c: Heterogeneous concentration policy for energy-aware scheduling of jobs with resource contention

A novel index system describing program runtime characteristics for workload consolidation

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation