research-article

T-CAT: Dynamic Cache Allocation for Tiered Memory Systems With Memory Interleaving

Authors:

Hwanjun Lee,

Seunghak Lee,

Yeji Jung,

Daehoon Kim

IEEE Computer Architecture Letters, Volume 22, Issue 2

Pages 73 - 76

https://doi.org/10.1109/LCA.2023.3290197

Published: 01 July 2023

Abstract

New memory interconnect technologies, such as Intel's Compute Express Link (CXL), help expand memory bandwidth and capacity by adding CPU-less NUMA nodes to the main memory system, addressing the growing memory wall challenge. Consequently, modern computing systems embrace heterogeneity in memory, organizing main memory as a tiered system with near and far memory (e.g., local DRAM and CXL-DRAM). However, applying NUMA interleaving, which can improve performance by exploiting node-level parallelism and utilizing aggregate bandwidth, to tiered memory systems is challenging because the two types of memory differ in access latency, which can degrade performance for memory-intensive workloads. To tackle these challenges, we first investigate the effects of NUMA interleaving on the performance of tiered memory systems. We observe that while NUMA interleaving is essential for applications demanding high memory bandwidth, it can negatively impact the performance of applications demanding low memory bandwidth. Next, we propose a dynamic cache management scheme, called T-CAT, which partitions the last-level cache between near and far memory, aiming to mitigate the performance degradation caused by accessing far memory. T-CAT attempts to reduce the difference in average access latency between near and far memory by resizing the cache partitions. Through dynamic cache management, T-CAT can preserve the performance benefits of NUMA interleaving while mitigating the performance degradation caused by far memory accesses. Our experimental results show that T-CAT improves performance by up to 17% compared to NUMA interleaving without cache management.
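The abstract does not spell out the resizing policy, so the following is only a minimal sketch of the general idea of shrinking the latency gap between tiers by moving last-level cache ways from one partition to the other. It is not the authors' algorithm. The function name `resize_partitions`, the one-way step size, the `tolerance_ns` threshold, and the `min_ways` floor are all assumptions made for illustration.

```python
def resize_partitions(near_ways, far_ways, near_latency_ns, far_latency_ns,
                      tolerance_ns=5.0, min_ways=1):
    """Return a new (near_ways, far_ways) split after one adjustment step.

    Hypothetical policy: the tier with the higher measured average access
    latency receives one extra cache way, taken from the other tier, so that
    more of its accesses hit in the last-level cache.
    """
    gap = far_latency_ns - near_latency_ns

    # Latencies are already close enough: keep the current split.
    if abs(gap) <= tolerance_ns:
        return near_ways, far_ways

    # Far memory is slower on average: grow the far partition if possible.
    if gap > 0 and near_ways > min_ways:
        return near_ways - 1, far_ways + 1

    # Near memory is slower on average: grow the near partition if possible.
    if gap < 0 and far_ways > min_ways:
        return near_ways + 1, far_ways - 1

    # The donor partition is already at its minimum size.
    return near_ways, far_ways
```

Under these assumptions, an even 8/8 split with measured averages of 100 ns (near) and 180 ns (far) becomes a 7/9 split after one step, while a gap of 3 ns leaves the split unchanged. A real controller would also need a way to measure per-tier latency and to apply the new way masks in hardware, neither of which is modeled here.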



Published In

IEEE Computer Architecture Letters, Volume 22, Issue 2

July-Dec. 2023

120 pages

ISSN: 1556-6056

Publisher

IEEE Computer Society

United States
