DOI: 10.5555/647882.738083
Article

Software-Implemented Fault Injection Methodology for Design and Validation of System Fault Tolerance

Published: 01 July 2001

Abstract

In this paper, we present our experience developing a methodology and tool at the Jet Propulsion Laboratory (JPL) for Software-Implemented Fault Injection (SWIFI) into a parallel processing supercomputer that is being designed for use in next-generation space exploration missions. The fault injector uses software-based strategies to emulate the effects of radiation-induced transients occurring in the system hardware components. JPL's SWIFI tool set, called JIFI (JPL's Implementation of a Fault Injector), is being used, in conjunction with an appropriate system fault model, to evaluate candidate hardware and software fault tolerance architectures, determine the sensitivity of applications to faults, and measure the effectiveness of fault detection, isolation, and recovery strategies. JIFI has been validated to inject faults into user-specified CPU registers and memory regions with a uniform random distribution in location and time. Together with verifiers, classifiers, and run scripts, JIFI enables massive fault injection campaigns and statistical data analysis.
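To make the idea of injecting faults "with a uniform random distribution in location and time" concrete, the following is a minimal illustrative sketch. It is not JIFI's implementation, which the abstract does not describe. It assumes a single-bit flip as the fault model, a plain byte buffer standing in for a memory region, and abstract "ticks" as the time unit. The names `plan_injection`, `flip_bit`, and `classify` are hypothetical.

```python
import random


def plan_injection(rng, region_size_bytes, run_duration_ticks):
    """Pick a uniformly random bit location and injection time (hypothetical helper)."""
    bit_index = rng.randrange(region_size_bytes * 8)  # uniform over every bit in the region
    tick = rng.randrange(run_duration_ticks)          # uniform over the duration of the run
    return bit_index, tick


def flip_bit(memory, bit_index):
    """Emulate a transient upset by inverting one bit of a bytearray in place.

    Assumption: a single-bit flip stands in for a radiation-induced transient.
    """
    byte_offset, bit_in_byte = divmod(bit_index, 8)
    memory[byte_offset] ^= 1 << bit_in_byte


def classify(golden, observed):
    """Toy classifier: compare the observed state against a fault-free reference."""
    return "no effect" if golden == observed else "corrupted"


rng = random.Random(2001)          # fixed seed so a campaign run can be repeated
memory = bytearray(64)             # stand-in for a user-specified memory region
golden = bytes(memory)             # fault-free reference copy

bit_index, tick = plan_injection(rng, len(memory), run_duration_ticks=1000)
flip_bit(memory, bit_index)        # a real injector would wait until `tick` first
outcome = classify(golden, bytes(memory))
```

In a campaign, a run script would repeat these steps many times with fresh random draws and tally the classifier outcomes for statistical analysis.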

Cited By

  • (2010) HIFsuite. EURASIP Journal on Embedded Systems, vol. 2010, pp. 1-20. DOI: 10.1155/2010/436328. Online publication date: 1-Jan-2010
  • (2008) AFID. Proceedings of the 2008 International Symposium on Software Testing and Analysis, pp. 179-188. DOI: 10.1145/1390630.1390653. Online publication date: 20-Jul-2008

Published In

DSN '01: Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
July 2001
500 pages
ISBN: 0769511015

Publisher

IEEE Computer Society

United States
