[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Energy-efficient cache design using variable-strength error-correcting codes

Published: 04 June 2011 Publication History

Abstract

Voltage scaling is one of the most effective mechanisms to improve microprocessors' energy efficiency. However, processors cannot operate reliably below a minimum voltage, Vccmin, since hardware structures may fail. Cell failures in large memory arrays (e.g., caches) typically determine Vccmin for the whole processor. We observe that most cache lines exhibit zero or one failures at low voltages. However, a few lines, especially in large caches, exhibit multi-bit failures and increase Vccmin. Previous solutions either significantly reduce cache capacity to enable uniform error correction across all lines, or significantly increase latency and bandwidth overheads when amortizing the cost of error-correcting codes (ECC) over large lines.
In this paper, we propose a novel cache architecture that uses variable-strength error-correcting codes (VS-ECC). In the common case, lines with zero or one failures use a simple and fast ECC. A small number of lines with multi-bit failures use a strong multi-bit ECC that requires some additional area and latency. We present a novel dynamic cache characterization mechanism to determine which lines will exhibit multi-bit failures. In particular, we use multi-bit correction to protect a fraction of the cache after switching to low voltage, while dynamically testing the remaining lines for multi-bit failures. Compared to prior multi-bit-correcting proposals, VS-ECC significantly reduces power and energy, avoids significant reductions in cache capacity, incurs little area overhead, and avoids large increases in latency and bandwidth.

Supplementary Material

JPG File (isca_9b_2.jpg)
MP4 File (isca_9b_2.mp4)

References

[1]
Jaume Abellà, Javier Carretero, Pedro Chaparro, Xavier Vera and Antonio González, "Low Vccmin Fault-Tolerant Cache with Highly Predictable Performance", International Symposium on Microarchitecture, pp. 111--121, Dec. 2009.
[2]
Amin Ansari, Shantanu Gupta, Shuguang Feng and Scott Mahlke, "ZerehCache: Armoring Cache Architectures in High Defect Density Technologies", International Symposium on Microarchitecture, pp. 100--110, Dec. 2009.
[3]
Elwyn. R. Berlekamp, Algebraic coding theory, New York: McGraw-Hill, chapter 7, 1968.
[4]
Hannes Brunner, Andreas Curiger and Max Hofstetter, "On computing multiplicative inverses in GF(2m)", IEEE Transactions on Computers, vol. 42, pp. 1010--1015, Aug. 1993.
[5]
Douglas Bossen, Joel Tendler and Kevin Reick, "Power4 System Design for High Reliability", IEEE Micro, vol. 22, No. 2, pp. 16--24, Mar. 2002.
[6]
Herbert O. Burton, "Inversionless decoding of binary BCH codes", IEEE Transactions on Information Theory, vol. IT-17, pp. 464--466, 1971.
[7]
C. L. Chen and M. Y. Hsiao, "Error-correcting codes for semiconductor memory applications: A state-of-the-art-review", IBM Journal of Research Development, vol. 28, no. 2, pp. 124--134, Mar. 1984.
[8]
Robert T. Chien, "Cyclic decoding procedures for Bose-Chaudhuri-Hocquenghem codes", IEEE Transactions on Information Theory, vol. 10, no. 4, pp. 357--363, Oct. 1964.
[9]
Zeshan Chishti, Alaa R. Alameldeen, Chris Wilkerson, Wei Wu and Shih-Lien Lu, "Improving Cache Lifetime Reliability at Ultra-low Voltages", International Symposium on Microarchitecture, pp. 89--99, Dec. 2009.
[10]
Eiji Fujiwara, Code Design for Dependable Systems: Theory and Practical Applications, Wiley-Interscience, 2006.
[11]
Ad J. van de Goor, "Testing Semiconductor Memories: Theory and Practice", John Wiley & Sons, Inc., NY 1991.
[12]
Hideki Imai and Y. Kamiyanagi, "A construction method for double error correcting codes for application to main memories", Transactions of the IECE Japan, vol. J60-D, pp. 861--868, Oct. 1977.
[13]
Niraj K. Jha and Sandeep Gupta, "Testing of Digital Systems", Cambridge University Press, New York, NY 2002
[14]
Jangwoo Kim, Nikos Hardavellas, Ken Mai, Babak Falsafi and James Hoe, "Multi-bit Error Tolerant Caches Using Two-Dimensional Error Coding", International Symposium on Microarchitecture, pp. 197--209, Dec. 2007.
[15]
Seongwoo Kim and Arun K. Somani, "Area Efficient Architectures for Information Integrity in Cache Memories," International Symposium on Computer Architecture, pp. 246--255, Jun. 1999.
[16]
Jaydeep Kulkarni, Keejong Kim and Kaushik Roy, "A 160 mV Robust Schmitt Trigger Based Subthreshold SRAM", IEEE Journal of Solid-State Circuits, vol. 42, no. 10, pp. 2303--2313, Oct. 2007.
[17]
James Lee Massey, "Step-by-step decoding of the Bose-Chaudhuri-Hocquenghem codes", IEEE Transactions on Information Theory, vol. 11, no. 4, pp. 580--585, Apr. 1965.
[18]
Tomako Matsushima, Toshiyasu Matsushima, and Shigeichi Hirasawa, "Parallel encoder and decoder architecture for cyclic codes", IEICE Trans. on Fundamentals, vol. E79A, no. 9, pp. 1313--1323, 1996.
[19]
Saibal Mukhopadhyay, Hamid Mahmoodi and Kaushik Roy, "Modeling of failure probability and statistical design of SRAM array for yield enhancement in nanoscaled CMOS", IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.24, no.12, pp. 1859--1880, Dec. 2005
[20]
Thammavarapu R.N. Rao, Eiji Fujiwara, Error-control coding for computer systems, Prentice-Hall, Inc., NJ, 1989.
[21]
Irving S. Reed, Ming-Tang Shih, and Trieu-Kien Truong, "VLSI design of inverse-free Berlekamp--Massey algorithm", Proc. IEE Proceedings E on Computers and Digital Techniques, vol. 138, pp. 295--298, Sept. 1991.
[22]
David Roberts, Nam Sung Kim and Trevor Mudge, "On-Chip Cache Device Scaling Limits and Effective Fault Repair Techniques in Future Nanoscale Technology", Digital System Design Architectures, Methods and Tools, pp. 570--578, Aug. 2007.
[23]
Stefan Rusu, Harry Muljono and Brian Cherkauer, "Itanium 2 Processor 6M: Higher Frequency and Larger L3 Cache", IEEE Micro, vol. 24, No. 2, pp. 10--18, Mar. 2004.
[24]
Stanley Schuster, "Multiple Word/Bit Line Redundancy for Semiconductor Memories", IEEE Journal of Solid-State Circuits, vol. 13, no. 5, pp. 698--703, Oct. 1978.
[25]
Dmitri Strukov, "The area and latency tradeoffs of binary bit-parallel BCH decoders for prospective nanoelectronic memories", in Asilomar Conference on Signals Systems and Computers, pp. 1183--1187, Oct. 2006.
[26]
Chris Wilkerson, Alaa R. Alameldeen, Zeshan Chishti, Wei Wu, Dinesh Somasekhar and Shih-Lien Lu, "Reducing Cache Power with Low Cost, Multi-bit Error-Correcting Codes", International Symposium on Computer Architecture, pp. 83--93, Jun. 2010.
[27]
Chris Wilkerson, Hongliang Gao, Alaa R. Alameldeen, Zeshan Chishti, Muhammad Khellah and Shih-Lien Lu, "Trading of Cache Capacity for Reliability to Enable Low Voltage Operation", International Symposium on Computer Architecture, pp. 203--214, Jun. 2008.
[28]
Doe Hyun Yoon and Mattan Erez, "Memory Mapped ECC: Low-Cost Error Protection for Last Level Caches", International Symposium on Computer Architecture, pp. 116--127, Jun. 2009.

Cited By

View all
  • (2024)Enhancing Neural Network Reliability: Insights From Hardware/Software Collaboration With Neuron Vulnerability QuantizationIEEE Transactions on Computers10.1109/TC.2024.339849273:8(1953-1966)Online publication date: 1-Aug-2024
  • (2024)Self-Managing DRAM: A Low-Cost Framework for Enabling Autonomous and Efficient DRAM Maintenance Operations2024 57th IEEE/ACM International Symposium on Microarchitecture (MICRO)10.1109/MICRO61859.2024.00074(949-965)Online publication date: 2-Nov-2024
  • (2024)Designing a Deep Neural Network engine for LLC block reuse prediction to mitigate Soft Error in MulticoreMicroelectronics Reliability10.1016/j.microrel.2024.115377156(115377)Online publication date: May-2024
  • Show More Cited By

Index Terms

  1. Energy-efficient cache design using variable-strength error-correcting codes

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM SIGARCH Computer Architecture News
      ACM SIGARCH Computer Architecture News  Volume 39, Issue 3
      ISCA '11
      June 2011
      462 pages
      ISSN:0163-5964
      DOI:10.1145/2024723
      Issue’s Table of Contents
      • cover image ACM Conferences
        ISCA '11: Proceedings of the 38th annual international symposium on Computer architecture
        June 2011
        488 pages
        ISBN:9781450304726
        DOI:10.1145/2000064
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 04 June 2011
      Published in SIGARCH Volume 39, Issue 3

      Check for updates

      Author Tags

      1. cache design
      2. error-correcting codes
      3. low-voltage design
      4. variable-strength codes

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)15
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 05 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Enhancing Neural Network Reliability: Insights From Hardware/Software Collaboration With Neuron Vulnerability QuantizationIEEE Transactions on Computers10.1109/TC.2024.339849273:8(1953-1966)Online publication date: 1-Aug-2024
      • (2024)Self-Managing DRAM: A Low-Cost Framework for Enabling Autonomous and Efficient DRAM Maintenance Operations2024 57th IEEE/ACM International Symposium on Microarchitecture (MICRO)10.1109/MICRO61859.2024.00074(949-965)Online publication date: 2-Nov-2024
      • (2024)Designing a Deep Neural Network engine for LLC block reuse prediction to mitigate Soft Error in MulticoreMicroelectronics Reliability10.1016/j.microrel.2024.115377156(115377)Online publication date: May-2024
      • (2023)Impact of Voltage Scaling on Soft Errors Susceptibility of Multicore Server CPUsProceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/3613424.3614304(957-971)Online publication date: 28-Oct-2023
      • (2023)Realizing Strong Determinism Contract on Log-Structured Merge Key-Value StoresACM Transactions on Storage10.1145/358269519:2(1-29)Online publication date: 25-Mar-2023
      • (2023)Review of Memory RAS for Data CentersIEEE Access10.1109/ACCESS.2023.332998411(124782-124796)Online publication date: 2023
      • (2023)Energy efficiency in multicore shared cache by fault tolerance using a genetic algorithm based block reuse predictorMicroprocessors and Microsystems10.1016/j.micpro.2023.104864101(104864)Online publication date: Sep-2023
      • (2022)Temperature Estimation of HBM2 Channels with Tail Distribution of Retention Errors in FPGA-HBM2 PlatformElectronics10.3390/electronics1201003212:1(32)Online publication date: 22-Dec-2022
      • (2022)Self-Reinforcing Memoization for Cryptography Calculations in Secure Memory SystemsProceedings of the 55th Annual IEEE/ACM International Symposium on Microarchitecture10.1109/MICRO56248.2022.00055(678-692)Online publication date: 1-Oct-2022
      • (2021)Predicting and mitigating single-event upsets in DRAM using HOTHMicroelectronics Reliability10.1016/j.microrel.2020.114024117(114024)Online publication date: Feb-2021
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media