[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Towards scalable arithmetic units with graceful degradation

Published: 10 March 2014 Publication History

Abstract

This article presents a new family of scalable arithmetic units (ScAUs) targeting resource-constrained, embedded devices. We, first, study the performance, power, area and scalability properties of general adders. Next, suitable error-detection schemes for low-power embedded systems are discussed. As a result, our ScAUs are enhanced with a suitable error-detection scheme, resulting in a Parity-Checked ScAU (PCScAU) design. The PCScAU strikes a flexible trade-off between space and time redundancy, offering dependability similar to high-end techniques for the area and power cost of low-end approaches. An alternative design, the Precision-Scalable Arithmetic Unit (PScAU) maintains throughput with degraded precision in case of hardware failures. The PScAU is targeting dependable applications where latency rather than numerical accuracy is more important. The PScAU's downscaled mode is also interesting for runtime thermal management due to its advantageous power consumption. We implemented and synthesized the PCScAU, PScAU and a few important reference designs (double-, triple- and quadruple-modular-redundancy adders with/without input gating) in 90-nm UMC technology. Overall, the PC-ScAU ranks first in 9 out of 10 power-delay-area (PDA)-product variants. It exhibits 16% area savings and 12% performance speedup for 7% increase in total power consumption, compared to the cheapest form of conventional hardware replication with the same fault coverage. The PDA product of the PCScAU is, thus, reduced by 21%. It is interesting that, while total power slightly increases, the PCScAU static power in fact decreases by 14%. Therefore, for newer technology nodes where the static power component is significant, the PCScAU can also achieve—next to performance and area -- significant power improvements.

References

[1]
R. Amirtharajah, Thucydides Xanthopoulos, and A. Chandrakasan. 1999. Power scalable processing using distributed arithmetic. In Proceedings of the International Symposium on Low-Power Electronics and Design. ACM, 170--175.
[2]
S. Borkar. 2005. Designing reliable systems from unreliable components: the challenges of transistor variability and degradation. IEEE Micro 25, 6, 10--16.
[3]
T. K. Callaway and E. E. Swartzlander. 1992. Optimizing arithmetic elements for signal processing. In Proceedings of the Workshop on VLSI Sig. Processing. 91--100.
[4]
G.-C. Cardarilli, M. Ottavi, S. Pontarelli, M. Re, and A. Salsano. 2006. Fault Localization, error correction, and graceful degradation in Radix 2 signed digit-based adders. IEEE Trans. Comput. 55, 534--540.
[5]
R. A. Davis. 1965. A checking arithmetic unit. In Proceedings of the American Federation of Information Processing Societies National Semi-Annual Computer Conference. ACM, 705--713.
[6]
S. S. Gorshe and B. Bose. 1996. A self-checking ALU design with efficient codes. In Proceedings of the IEEE VLSI Test Symposium. 157--161.
[7]
P. Hazucha, T. Karnik, J. Maiz, S. Walstra, B. Bloechel, J. Tschanz, G. Dermer, S. Hareland, P. Armstrong, and S. Borkar. 2003. Neutron soft error rate measurements in a 90-nm CMOS process and scaling trends in SRAM from 0.25-μ m to 90-nm generation. In IEEE International Electron Devices Meeting (IEDM'03) Technical Digest. 21.5.1--21.5.4.
[8]
J. L. Hennessy and D. A. Patterson. 2012. Computer Architecture: A Quantitative Approach Morgan Kaufmann Publishers Inc.
[9]
M. Y. Hsiao and F. F. Sellers. 1963. The carry-dependent sum adder. IEEE Trans. Electronic Computers 12, 3, 265--268.
[10]
ITRS. 2011. International Technology Roadmap for Semiconductors. www.itrs.net/Links/2011ITRS/Home 2011.htm. (2011).
[11]
A. Iyer and D. Marculescu. 2001. Power aware microarchitecture resource scaling. In Proceedings of the Conference and Exhibition on Design, Automation and Test in Europe (DATE'01). IEEE, 190--196.
[12]
S. Kaxiras. 2008. Computer Architecture Techniques for Power-Efficiency. Morgan and Claypool Publishers.
[13]
C. Khedhiri, M. Karmani, B. Hamdi, and K. L. Man. 2012. A fault tolerant adder based on alternative computation. Int. J. Design, Analysis Tools Integrated Circuits Systems 3, 14--18.
[14]
S. Kumar and A. Aggarwal. 2006. Reducing resource redundancy for concurrent error detection techniques in high performance microprocessors. In Proceedings of the International Symposium on High-Performance Computer Architecture (HPCA'06). 212--221.
[15]
Sumeet Kumar, Prateek Pujara, and Aneesh Aggarwal. 2005. Bit-sliced datapath for energy-efficient high performance microprocessors. In Power-Aware Computer Systems, Lecture Notes in Computer Science, vol. 3471, Springer, 30--45.
[16]
Eren Kursun, Glenn Reinman, Suleyman Sair, Anahita Shayesteh, and Tim Sherwood. 2005. Low-overhead core swapping for thermal management. In Power-Aware Computer Systems, Lecture Notes in Computer Science, vol. 3471, Springer, 46--60.
[17]
G. G. Langdon and C. K. Tang. 1970. Concurrent error detection for group look-ahead binary adders. IBM J. Res. Dev. 14, 5, 563--573.
[18]
H. Lee. 2005. Power-aware scalable pipelined booth multiplier. IEICE Trans. 88, 11, 3230--3234.
[19]
Ting-Wei Lin, Ming-Chung Lee, Fang-Ju Lin, and Herming Chiueh. 2005. A low power ALU cluster design for media streaming architecture. In Proceedings of the 48th Midwest Symposium on Circuits and Systems. 51--54.
[20]
J. C. Lo, S. Thanawastien, T. R. N. Rao, and M. Nicolaidis. 1992. An SFS Berger check prediction ALU and its application to self-checking processor designs. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 11, 4, 525--540.
[21]
S. Mitra and E. J. McCluskey. 2000. Which concurrent error detection scheme to choose? In Proceedings of the International Test Conference. 985--994.
[22]
J. Monteiro, S Devadas, P. Ashar, and A. Mauskar. 1996. Scheduling techniques to enable power management. In Proceedings of the 33rd IEEE/ACM Design Automation Conference (DAC'96). 349--352.
[23]
T. Mudge. 2001. Power: A first-class architectural design constraint. Computer 34, 4, 52--58.
[24]
C. Nagendra, M. J. Irwin, and R. M. Owens. 1996. Area-time-power tradeoffs in parallel adders. IEEE Trans. Circuits Syst. Express Briefs 43, 10, 689--702.
[25]
A. Namazi, Y. Sedaghat, S. G. Miremadi, and A. Ejlali. 2009. A low-cost fault-tolerant technique for Carry Look-Ahead adder. In Proceedings of the 15th IEEE International On-Line Testing Symposium (IOLTS'09). 217--222.
[26]
B. Parhami. 2000. Computer Arithmetic: Algorithms and Hardware Designs. Oxford University Press.
[27]
S. Peng and R. Manohar. 2005. Fault tolerant asynchronous adder through dynamic self-reconfiguration. In Proceedings of the IEEE International Conference on Computer Design (ICCD'05). IEEE, 171--179.
[28]
O. A. Pfänder, R. Nopper, H.-J. Pfleiderer, S. Zhou, and A. Bermak. 2008. Comparison of reconfigurable structures for flexible word-length multiplication. Adv Radio Sci. 6, 113--118.
[29]
S. Purohit, S. R. Chalamalasetti, and M. Margala. 2010. Low overhead soft error detection and correction scheme for reconfigurable pipelined data paths. In Proceedings of the NASA/ESA Conference on Adaptive Hardware and Systems. 59--65.
[30]
J. M. Rabaey and M. Pedram. 1996. Low Power Design Methodologies. Kluwer Academic Publishers.
[31]
R. M. Seepers, C. Strydis, and G. N. Gaydadjiev. 2012. Architecture-level fault-tolerance for biomedical implants. In Proceedings of the International Conference on Embedded Computer Systems. 104--112.
[32]
S. Segars. 1997. ARM7TDMI power consumption. IEEE Micro 17, 4, 12--19.
[33]
Frederick F. Sellers, Muyue Xiao, and Leroy W. Bearnson. 1968. Error Detecting Logic for Digital Computers. McGraw-Hill.
[34]
A. Shrivastava, D. Kannan, S. Bhardwaj, and S. Vrudhula. 2010. Reducing functional unit power consumption and its variation using leakage sensors. IEEE Trans. VLSI Syst. 18, 6, 988--997.
[35]
C. W. Slayman. 2005. Cache and memory error detection, correction, and reduction techniques for terrestrial servers and workstations. IEEE Trans. Device Mater. Reliab. 5, 3, 397--404.
[36]
C. Strydis. 2011. Universal processor architecture for biomedical implants: The SiMS Project. Ph.D. dissertation, Delft University of Technology, Delft, The Netherlands.
[37]
Christos Strydis, Georgi N. Gaydadjiev, and Stamatis Vassiliadis. 2006. A new digital architecture for reliable, ultra-low-power systems. In Proceedings of the 17th Annual Workshop on Circuits, Systems and Signal Processing. 350--355.
[38]
V. Tiwari, S. Malik, and P. Ashar. 1998. Guarded evaluation: pushing power management to logic synthesis/design. IEEE Trans. Comput-aided Des. Integr. Circuits Syst. 17, 10, 1051--1060.
[39]
W. J. Townsend, J. A. Abraham, and E. E. Swartzlander. 2003. Quadruple Time redundancy adders. In Proceedings of the IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems. 250--256.
[40]
M. Vratonjic, B. R. Zeydel, and V. G. Oklobdzija. 2005. Low- and ultra low-power arithmetic units: design and comparison. in Proceedings of the IEEE International Conference on Computer Design. 249--252.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Embedded Computing Systems
ACM Transactions on Embedded Computing Systems  Volume 13, Issue 4
Regular Papers
November 2014
647 pages
ISSN:1539-9087
EISSN:1558-3465
DOI:10.1145/2592905
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Published: 10 March 2014
Accepted: 01 June 2013
Revised: 01 May 2013
Received: 01 February 2013
Published in TECS Volume 13, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Computer arithmetic
  2. embedded systems
  3. error correction
  4. error detection
  5. fault tolerance
  6. graceful degradation
  7. low power consumption
  8. scalable design

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 306
    Total Downloads
  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Dec 2024

Other Metrics

Citations

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media