[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
column

Improving power efficiency of D-NUCA caches

Published: 01 September 2007 Publication History

Abstract

D-NUCA caches are cache memories that, thanks to banked organization, broadcast search and promotion/demotion mechanism, are able to tolerate the increasing wire delay effects introduced by technology scaling. As a consequence, they will outperform conventional caches (UCA, Uniform Cache Architectures) in future generation cores.
Due to the promotion/demotion mechanism, we have found that, in a D-NUCA cache, the distribution of hits on the ways varies across applications as well as across different execution phases within a single application. In this paper, we show how such a behavior can be utilized to improve D-NUCA power efficiency as well as to decrease its access latencies. In particular, we propose a new D-NUCA structure, called Way Adaptable D-NUCA cache, in which the number of active (i.e. powered-on) ways is dynamically adapted to the need of the running application. Our initial evaluation shows that a consistent reduction of both the average number of active ways (42% in average) and the number of bank access requests (29% in average) is achieved, without significantly affecting the IPC.

References

[1]
V. Agarwal, M. S. Hrishikesh, S. Keckler. D. Burger. Clock Rate versus IPC: The End of the Road for Conventional Microarchitectures. Proc. 27th Int. Symp. on Computer Architecture, pp.248--259, Vancouver, Canada, June 2000
[2]
C. Kim, D. Burger, S. W. Keckler. An Adaptive, Non-Uniform Cache Structure for Wire-Delay Dominated On-Chip Caches. Proc. 10th Conf. on Architectural Support for Programming Languages and Operating Systems, pp. 211--222, San Jose, CA, Oct. 2002.
[3]
C. Kim, D. Burger, S. W. Keckler. Nonuniform cache architectures for wire-delay dominated on-chip caches. IEEE Micro, vol. 23(6), pp. 99--107, Nov./Dec., 2003
[4]
V. Venkatachalam, M. Franz. Power Reduction Techniques For Microprocessor Systems. ACM Computing Surveys, vol. 37(3), pp. 195--237, Sept. 2005
[5]
The International Technology roadmap for Semiconductors. Semiconductor Industrial Association, 2005.
[6]
T. Sherwood, B. Calder. Time varying behavior of programs. UC San Diego Technical Report UCSD-CS99--630, 1999
[7]
Standard Performance Evaluation Corporation http://www.spec.org/
[8]
R. Desikan et al., Sim-Alpha: A Validated Execution-Driven Alpha2164 Simulator, tech report TR-01-23, Dept. of Computer Sciences, Univ. Texas at Austin, 2001.
[9]
M. Powell, S. Yangh, B. Falsafi, K. Roy, T. N. Vijaykumar. Gated-Vdd: A Circuit Technique to Reduce Leakage in Deep-Submicron Cache Memories. Proc. 2000 Int. Symp. Low Power Electronics and Design, pp. 90--95, Rapallo, Italy, July 2000.
[10]
Z. Chisti, M. D. Powell, T. N. Vijaykumar. Distance Associativity for High-Performance Energy-Efficient Non-Uniform Cache Architectures. Proc. 36th Int. Symp. on Microarchitecture, pp.55--66, San Diego, CA, Dec. 2003
[11]
P. Foglia, D. Mangano, C. A. Prete. A Cache Design for High Performance Embedded Systems. Journal of Embedded Computing, vol. 1(4), pp. 587--598, 2005
[12]
H. Kobayashi, I. Kotera, H. Takizawa. Locality Analysis to Control Dynamically Way-Adaptable Caches. ACM SIGARCH Computer Architecture News, vol. 33(3), pp. 25--32, June 2005
[13]
D. H. Albonesi. Selective Cache Ways: On-Demand Cache Resource Allocation. Proc. 32nd Int. Symp on Microarchitecture, pp.248--259, Israel, November 1999
[14]
R. Balasubramonian, et al: Memory hierarchy reconfiguration for energy and performance in general purpose processor architectures. Proc. 33rd Int. Symp. On Microarchitecture, pp.245--257, Monterey, CA, Dec. 2000.
[15]
D. Dropsho, et al. Integrating Adaptive On-Chip Storage Structures for Reduced Dynamic Power. Proc. 2002 Int. Conf. on Parallel Architectures and Compilation Techniques, pp.141--152, Charlottesville, CA, Sep.2002.
[16]
Z. Hu, S. Kaxiras, M. Martonosi. Let caches decay: reducing leakage energy via exploitation of cache generational behavior. ACM Trans. on Computer Systems, vol. 20(2), pp. 161--190, May 2002.
[17]
H. Hanson, et al. Static energy reduction techniques for microprocessor caches. IEEE Trans. on VLSI, vol. 11(3), pp. 303--313, June 2003
[18]
K. Flautner, N. S. Kim, S. Martin D. Blaauw, T. Mudge. Drowsy Caches: Simple Techniques for Reducing Leakage Power. Proc. 29th Int. Symp. on Computer Architecture, pp.148--157, Anchorage, AK, May 2002
[19]
Y. Meng, T. Sherwood, Kastner. Exploring the limits of leakage power reduction in caches. ACM Trans. on Architectures and Code Optimization, vol. 2(3), pp.221--246, Sep. 2005
[20]
N. Mohyuddin, R. Bhatti, M. Dubois. Controlling Leakage Power with the Replacement Policy in Slumberous Cache. Proc. 2nd Conf. on Computing Frontiers, pp 161--170, Ischia, Italy, May 2005.
[21]
A. S. Dhodapkar, J. E. Smith. Comparing Program Phase Detection Techniques. Proc. of the 36th Int. Symp. on Microarchitecture, pp. 217--227, San Diego, CA, Dec. 2003.

Cited By

View all
  • (2022)Process variation aware DRAM-Cache resizingJournal of Systems Architecture: the EUROMICRO Journal10.1016/j.sysarc.2021.102364123:COnline publication date: 1-Feb-2022
  • (2016)Last level cache size heterogeneity in embedded systemsThe Journal of Supercomputing10.1007/s11227-015-1576-872:2(503-544)Online publication date: 1-Feb-2016
  • (2014)Remapping NUCAProceedings of the 2014 IEEE Intl Conf on High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS)10.1109/HPCC.2014.13(38-41)Online publication date: 20-Aug-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM SIGARCH Computer Architecture News
ACM SIGARCH Computer Architecture News  Volume 35, Issue 4
September 2007
59 pages
ISSN:0163-5964
DOI:10.1145/1327312
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 September 2007
Published in SIGARCH Volume 35, Issue 4

Check for updates

Qualifiers

  • Column

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)1
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Process variation aware DRAM-Cache resizingJournal of Systems Architecture: the EUROMICRO Journal10.1016/j.sysarc.2021.102364123:COnline publication date: 1-Feb-2022
  • (2016)Last level cache size heterogeneity in embedded systemsThe Journal of Supercomputing10.1007/s11227-015-1576-872:2(503-544)Online publication date: 1-Feb-2016
  • (2014)Remapping NUCAProceedings of the 2014 IEEE Intl Conf on High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS)10.1109/HPCC.2014.13(38-41)Online publication date: 20-Aug-2014
  • (2012)Cache Latency Control for Application Fairness or Differentiation in Power-Constrained Chip MultiprocessorsIEEE Transactions on Computers10.1109/TC.2011.18761:10(1371-1385)Online publication date: 1-Oct-2012
  • (2010)Achieving Fair or Differentiated Cache Sharing in Power-Constrained Chip MultiprocessorsProceedings of the 2010 39th International Conference on Parallel Processing10.1109/ICPP.2010.9(1-10)Online publication date: 13-Sep-2010
  • (2008)Modeling of cache access behavior based on Zipf's lawProceedings of the 9th workshop on MEmory performance: DEaling with Applications, systems and architecture10.1145/1509084.1509086(9-15)Online publication date: 26-Oct-2008
  • (2008)Performance Sensitivity of NUCA Caches to On-Chip Network ParametersProceedings of the 2008 20th International Symposium on Computer Architecture and High Performance Computing10.1109/SBAC-PAD.2008.20(167-174)Online publication date: 29-Oct-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media