[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/2555692.2555693acmconferencesArticle/Chapter ViewAbstractPublication PagesesweekConference Proceedingsconference-collections
research-article

Reducing inter-core cache contention with an adaptive bank mapping policy in DRAM cache

Published: 29 September 2013 Publication History

Abstract

On-chip DRAM cache has the advantage of increased cache capacity that may alleviate the memory bandwidth problem. Recent research has demonstrated the benefits of high capacity on-chip DRAM cache that leads to reduced off-chip accesses. However, state-of-the-art has not taken into consideration the cache access patterns of concurrently running heterogeneous applications that can cause inter-core cache contention. We therefore propose an adaptive bank mapping policy in response to the diverse requirements of applications with different cache access behaviors that -- as a result -- reduces inter-core cache contention in DRAM-based cache architectures. On average, our adaptive bank mapping policy increases the harmonic mean instruction-per-cycle throughput by 19.3% (max. 71%) compared to state-of-the-art bank mapping policies.

References

[1]
Standard Performance Evaluation Corporation. http://www.spec.org.
[2]
R. X. Arroyo, R. J. Harrington, S. P. Hartman, and T. Nguyen. IBM POWER7 Systems. IBM Journal of Research and Development, 55(3): 2:1--2:13, 2011.
[3]
B. Black, M. Annavaram, N. Brekelbaum, J. DeVale, and L. Jiang. Die-Stacking (3D) Microarchitecture. In Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pages 469--479, December 2006.
[4]
S. Cho and L. Jin. Managing Distributed, Shared L2 Caches through OS-Level Page Allocation. In Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pages 455--468, December 2006.
[5]
F. Hameed, L. Bauer, and J. Henkel. Dynamic Cache Management in Multi-Core Architectures through Run-time Adaptation. In Proceedings of the 14th conference on Design, Automation and Test in Europe (DATE), pages 485--490, March 2012.
[6]
F. Hameed, L. Bauer, and J. Henkel. Adaptive Cache Management for a Combined SRAM and DRAM Cache Hierarchy for Multi-Cores. In Proceedings of the 15th conference on Design, Automation and Test in Europe (DATE), pages 77--82, March 2013.
[7]
A. Jaleel, W. Hasenplaugh, M. K. Qureshi, J. Sebot, S. Steely Jr., and J. Emer. Adaptive Insertion Policies for Managing Shared Caches. In Int'l Conference on Parallel Architectures and Compilation Techniques (PACT), pages 208--219, 2008.
[8]
Gabriel H. Loh. Extending the Effectiveness of 3D--stacked Dram Caches with an Adaptive Multi-Queue Policy. In IEEE/ACM International Symposium on Microarchitecture (MICRO), pages 174--183, 2009.
[9]
Gabriel H. Loh and Mark D. Hill. Efficiently Enabling Conventional Block Sizes for Very Large Die-stacked DRAM Caches. In IEEE/ACM International Symposium on Microarchitecture (MICRO), pages 454--464, 2011.
[10]
Gabriel H. Loh and Mark D. Hill. Supporting Very Large DRAM Caches with Compound Access Scheduling and MissMaps. In IEEE Micro Magazine, Special Issue on Top Picks in Computer Architecture Conferences, 2012.
[11]
Gabriel H. Loh, S. Subramaniam, and Y. Xie. Zesto: A Cycle-Level Simulator for Highly Detailed Microarchitecture Exploration. In Int'l Symposium on Performance Analysis of Systems and Software (ISPASS), 2009.
[12]
M. K. Qureshi and Y. N. Patt. Utility-Based Cache Partitioning: A Low-Overhead, High-performance, Runtime mechanism to Partition Shared Caches. In Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pages 423--432, 2006.
[13]
B. M. Rogers, A. Krishna., G. B. Bell, K. Vu, X. Jiang, and Y. Solihin. Scaling the Bandwidth Wall: Challenges in and Avenues for CMP Scaling. SIGARCH Computer Architecture News, 37(3):371--382, 2009.
[14]
M. Shah, J. Barreh, J. Brooks, R. Golla, G. Grohoski, R. Hetherington, P. Jordan, M. Luttrel, O. Chistopher, B. Saha, D. Sheahan, L. Spracklen, and A. Wynn. UltraSPARC T2: A Highly-Threaded, PowerEfficient, SPARC SOC. In IEEE Asian Solid State Circuit Conference, pages 22--25, 2007.
[15]
D. Wendel, R. Kalla, R. Cargoni, J. Clables, J. Friedrich, R. Frech, J. Kahle, B. Sinharoy, W. Starke, S. Taylor, S. Weitzel, S. G. Chu, S. Islam, and V. Zyuban. The Implementation of Power7TM: A Highly Parallel and Scalable Multi-core High-end Server Processor. In Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2010 IEEE International, pages 102--103, February 2010.
[16]
Y. Xie and G. H. Loh. Dynamic Classification of Program Memory Behaviors in CMPs. In 2nd Workshop on Chip Multiprocessor Memory Systems and Interconnects (CMP-MSI), June 2008.
[17]
Y. Xie and G. H. Loh. PIPP: Promotion/Insertion Pseudo-Partitioning of Multi-Core Shared Caches. In Proceedings of the 36th International Symposium on Computer Architecture (ISCA), pages 174--183, 2009.

Cited By

View all
  • (2017)Efficient STT-RAM last-level-cache architecture to replace DRAM cacheProceedings of the International Symposium on Memory Systems10.1145/3132402.3132414(141-151)Online publication date: 2-Oct-2017
  • (2015)Cooperatively managing dynamic writeback and insertion policies in a last-level DRAM cacheProceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition10.5555/2755753.2755794(187-192)Online publication date: 9-Mar-2015
  1. Reducing inter-core cache contention with an adaptive bank mapping policy in DRAM cache

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CODES+ISSS '13: Proceedings of the Ninth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis
    September 2013
    335 pages
    ISBN:9781479914173

    Sponsors

    Publisher

    IEEE Press

    Publication History

    Published: 29 September 2013

    Check for updates

    Qualifiers

    • Research-article

    Conference

    ESWEEK'13
    ESWEEK'13: Ninth Embedded System Week
    September 29 - October 4, 2013
    Quebec, Montreal, Canada

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 22 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2017)Efficient STT-RAM last-level-cache architecture to replace DRAM cacheProceedings of the International Symposium on Memory Systems10.1145/3132402.3132414(141-151)Online publication date: 2-Oct-2017
    • (2015)Cooperatively managing dynamic writeback and insertion policies in a last-level DRAM cacheProceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition10.5555/2755753.2755794(187-192)Online publication date: 9-Mar-2015

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media