research-article

Architectural exploration of last-level caches targeting homogeneous multicore systems

Authors:

Rodrigo Cataldo,

Guilherme Korol,

Ramon Fernandes,

César Marcon

SBCCI '16: Proceedings of the 29th Symposium on Integrated Circuits and Systems Design: Chip on the Mountains

Article No.: 14, Pages 1 - 6

Published: 29 August 2017

Abstract

The Last-Level Cache (LLC) significantly influences overall system performance and power dissipation in multicore systems. This paper evaluates five LLC architectures with respect to execution time, dynamic and static power dissipation, and area consumption. The architectures are measured using the widely adopted PARSEC benchmark suite for parallel shared-memory systems. Employing the Gem5 full-system simulator and the 32 nm technology characterization of the McPAT framework, this work reports two findings: (i) the shared LLC has the best overall performance under the PARSEC parallel workload, even for applications with less than 20% of shared data; and (ii) a privately accessed cache can reduce dynamic power dissipation by up to 20 times on 32 nm technology and area consumption by up to 25 times when compared to shared-accessed caches.

Published In

SBCCI '16: Proceedings of the 29th Symposium on Integrated Circuits and Systems Design: Chip on the Mountains

August 2016

250 pages

ISBN:9781509027361

General Chair: Davies William de Lima Monteiro (DEE UFMG, Brazil)

Program Chairs: Frank Sill Torres (DELT, UFMG, Brazil), Leandro Soares Indrusiak (University of York, United Kingdom)

Sponsors

SBC: Brazilian Computer Society
IEEE Circuits and Systems Society
SIGDA: ACM Special Interest Group on Design Automation
SBMICRO: Brazilian Microelectronics Society

Publisher

IEEE Press

Conference

SBCCI '16: 29th Symposium on Integrated Circuits and Systems Design

August 29 - September 3, 2016

Belo Horizonte, Brazil

Acceptance Rates

Overall Acceptance Rate 133 of 347 submissions, 38%
