[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

T-CAT: Dynamic Cache Allocation for Tiered Memory Systems With Memory Interleaving

Published: 01 July 2023 Publication History

Abstract

New memory interconnect technology, such as Intel&#x0027;s Compute Express Link (CXL), helps to expand memory bandwidth and capacity by adding CPU-less NUMA nodes to the main memory system, addressing the growing memory wall challenge. Consequently, modern computing systems embrace the heterogeneity in memory systems, composing the memory systems with a tiered memory system with near and far memory (e.g., local DRAM and CXL-DRAM). However, adopting NUMA interleaving, which can improve performance by exploiting node-level parallelism and utilizing aggregate bandwidth, to the tiered memory systems can face challenges due to differences in the access latency between the two types of memory, leading to potential performance degradation for memory-intensive workloads. By tackling the challenges, we first investigate the effects of the NUMA interleaving on the performance of the tiered memory systems. We observe that while NUMA interleaving is essential for applications demanding high memory bandwidth, it can negatively impact the performance of applications demanding low memory bandwidth. Next, we propose a dynamic cache management, called <monospace>T-CAT</monospace>, which partitions the last-level cache between near and far memory, aiming to mitigate performance degradation by accessing far memory. <monospace>T-CAT</monospace> attempts to reduce the difference in the average access latency between near and far memory by re-sizing the cache partitions. Through dynamic cache management, <monospace>T-CAT</monospace> can preserve the performance benefits of NUMA interleaving while mitigating performance degradation by the far memory accesses. Our experimental results show that <monospace>T-CAT</monospace> improves performance by up to 17&#x0025; compared to cases with NUMA interleaving without the cache management.

References

[1]
K. Kim et al., “SMT: Software-defined memory tiering for heterogeneous computing systems with CXL memory expander,” IEEE Micro, vol. 43, no. 2, pp. 20–29, Mar./Apr. 2023.
[2]
M. Krause and H. Fellow, “Gen-Z: Communication at the speed of memory,” IEEE Supercomputing, 2017.
[3]
S. Park et al., “Scaling of memory performance and capacity with CXL memory expander,” in Proc. IEEE Hot Chips Symp., 2022, pp. 1–27.
[4]
“CXL,” Compute Express Link, 2023. Accessed: Jun. 25, 2023. [Online]. Available: https://www.computeexpresslink.org
[5]
Z. Yan, D. Lustig, D. Nellans, and A. Bhattacharjee, “Nimble page management for tiered memory systems,” in Proc. ACM Int. Conf. Architectural Support Program. Lang. Operating Syst., 2019, pp. 331–345.
[6]
A. Maruf, A. Ghosh, J. Bhimani, D. Campello, A. Rudoff, and R. Rangaswami, “MULTI-CLOCK: Dynamic tiering for hybrid memory systems,” in Proc. IEEE Int. Symp. High Perform. Comput. Architecture, 2022, pp. 925–937.
[7]
H. Li et al., “Pond: CXL-based memory pooling systems for cloud platforms,” in Proc. ACM Int. Conf. Architectural Support Program. Lang. Operating Syst., 2023, pp. 574–587.
[8]
J. Kim, W. Choe, and J. Ahn, “Exploring the design space of page management for multi-tiered memory systems,” in Proc. USENIX Annu. Tech. Conf., 2021, pp. 715–728.
[9]
H. A. Maruf et al., “TPP: Transparent page placement for CXL-enabled tiered-memory,” in Proc. ACM Int. Conf. Architectural Support Program. Lang. Operating Syst., 2023, pp. 742–755.
[10]
J. Weiner et al., “TMO: Transparent memory offloading in datacenters,” in Proc. ACM Int. Conf. Architectural Support Program. Lang. Operating Syst., 2022, pp. 609–621.
[11]
S. Lee et al., “3D-Xpath: High-density managed dram architecture with cost-effective alternative paths for memory transactions,” in Proc. IEEE Int. Conf. Parallel Architectures Compilation Techn., 2018, pp. 1–12.
[12]
J. L. Henning, “SPECCPU2006 benchmark descriptions,” ACM SIGARCH Comput. Archit. News, vol. 34, no. 4, pp. 1–17, 2006.
[13]
S. Beamer, K. Asanović, and D. Patterson, “The gap benchmark suite,” 2015,.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Computer Architecture Letters
IEEE Computer Architecture Letters  Volume 22, Issue 2
July-Dec. 2023
120 pages

Publisher

IEEE Computer Society

United States

Publication History

Published: 01 July 2023

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media