[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/3145862.3145876acmconferencesArticle/Chapter ViewAbstractPublication PagessbcciConference Proceedingsconference-collections
research-article

Architectural exploration of last-level caches targeting homogeneous multicore systems

Published: 29 August 2017 Publication History

Abstract

The Last-Level Cache (LLC) influences the overall system performance and power dissipation in multicore systems significantly. This paper evaluates five LLC architectures targeting execution time, dynamic and static power dissipation, and area consumption. They are measured using the widely adopted PARSEC benchmark suite for parallel shared-memory systems. Employing Gem5 full-system simulator and 32 nm technology characterization of the McPAT framework, this work had two interesting findings: (i) the shared LLC has the overall best performance under the PARSEC parallel workload, even for applications with less than 20% of shared data. (ii) A privately accessed cache can reduce up to 20 times the dynamic power dissipation on 32 nm technology and 25 times the area consumption when compared to shared-accessed caches.

References

[1]
A. Asaduzzaman; F. Sibai; M. Rani. Impact of Level-2 Cache Sharing on the Performance and Power Requirements of Homogeneous Multicore Embedded Systems. Microprocessors and Microsystems, vol. 33, Issues 5--6, pp. 388--397, Aug. 2009.
[2]
H. Esmaeilzadeh et al. Dark Silicon and the End of Multicore Scaling. International Symposium on Computer Architecture (ISCA), pp. 365--376, 2011.
[3]
M. Hajkazemi; M. Tavana; H. Homayoun. Wide I/O or LPDDR? Exploration and Analysis of Performance, Power and Temperature Trade-offs of Emerging DRAM Technologies in Embedded MPSoCs. International Conference on Computer Design (ICCD), pp. 62--69, 2015.
[4]
M. Sabry; M. Ruggiero; P. Valle. Performance and Energy Trade-Offs Analysis of L2 on-chip Cache Architectures for Embedded MPSoCs. Great Lake Symposium on VLSI (GLVLSI), pp. 305--310, 2010,
[5]
H. Yun; P. Valsan. Evaluating the Isolation Effect of Cache Partitioning on COTS Multicore Platforms. Workshop on Operating Systems Platforms for Embedded Real-time Applications (OSPERT), pp. 45--50, 2015.
[6]
PARSEC team. The PARSEC Benchmark suite. parsec.cs.princeton.edu/.
[7]
McPAT research team. McPAT. www.hpl.hp.com/research/mcpat/.
[8]
Altera Corporation. Meeting the Low Power Imperative at 28 nm. Whitepaper, pp. 1--12, Sep. 2012.
[9]
H.-Y. Cheng et al. EECache: A Comprehensive Study on the Architectural Design for Energy-Efficient Last-Level Caches in Chip Multiprocessors. ACM Transactions on Architecture and Code Optimization (TACO), vol. 12, Issue 2, pp. 1--22, Jul. 2015.
[10]
SureCore Technology. Technology Whitepaper. White Papers Repository, pp 1--6, 2013.
[11]
H. Luo; P. Li; C. Ding. Parallel Data Sharing in Cache: Theory, Measurement, and Analysis. Technical Report TR-994, pp. 1--25, Mar. 2015.
[12]
U. Wiener. Modeling and Analysis of a Cache Coherent Interconnect. Thesis Report, Eindhoven University of Technology, pp. 1--83, Aug. 2012.
[13]
R. Sivaramakrishnan; S. Jairath. Next Generation SPARC Processor Cache Hierarchy. Presentation at Hot Chips (HC), pp. 1--28, 2014.
[14]
D. Woo; N. Seong; D. Lewis; H.-H. Lee. An Optimized 3D-Stacked Memory Architecture by Exploiting Excessive, High-Density TSV Bandwidth. International Symposium on High Performance Computer Architecture (HPCA), pp 1--12, 2010.
[15]
ARM. Cortex A-15. Technical Reference Manual, pp. 1--364, 2011.
[16]
N. Binkert et al. The gem5 Simulator. ACM SIGARCH Computer Architecture News, vol. 39, Issue 2, pp. 1--7, May 2011.
[17]
C. Bienia et al. The PARSEC Benchmark Suite: Characterization and Architectural Implications. International Conference on Parallel Architectures and Compilation Techniques (PACT), pp. 72--81, 2008.
[18]
G. Southern; J. Renau. Deconstructing PARSEC Scalability. Annual Workshop on Duplicating, Deconstructing and Debunking of International Symposium on Computer Architecture (ISCAWDDD), pp. 1--10, 2015.
  1. Architectural exploration of last-level caches targeting homogeneous multicore systems

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SBCCI '16: Proceedings of the 29th Symposium on Integrated Circuits and Systems Design: Chip on the Mountains
    August 2016
    250 pages
    ISBN:9781509027361

    Sponsors

    Publisher

    IEEE Press

    Publication History

    Published: 29 August 2017

    Check for updates

    Author Tags

    1. gem5
    2. last-level cache
    3. multicore system

    Qualifiers

    • Research-article

    Conference

    SBCCI '16
    Sponsor:
    SBCCI '16: 29th Symposium on Integrated Circuits and Systems Design
    August 29 - September 3, 2016
    Belo Horizonte, Brazil

    Acceptance Rates

    Overall Acceptance Rate 133 of 347 submissions, 38%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 73
      Total Downloads
    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 12 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media