[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article
Open access

Evaluation of Histogram of Oriented Gradients Soft Errors Criticality for Automotive Applications

Published: 15 November 2016 Publication History

Abstract

Pedestrian detection reliability is a key problem for autonomous or aided driving, and methods that use Histogram of Oriented Gradients (HOG) are very popular. Embedded Graphics Processing Units (GPUs) are exploited to run HOG in a very efficient manner. Unfortunately, GPUs architecture has been shown to be particularly vulnerable to radiation-induced failures. This article presents an experimental evaluation and analytical study of HOG reliability. We aim at quantifying and qualifying the radiation-induced errors on pedestrian detection applications executed in embedded GPUs.
We analyze experimental results obtained executing HOG on embedded GPUs from two different vendors, exposed for about 100 hours to a controlled neutron beam at Los Alamos National Laboratory. We consider the number and position of detected objects as well as precision and recall to discriminate critical erroneous computations. The reported analysis shows that, while being intrinsically resilient (65% to 85% of output errors only slightly impact detection), HOG experienced some particularly critical errors that could result in undetected pedestrians or unnecessary vehicle stops.
Additionally, we perform a fault-injection campaign to identify HOG critical procedures. We observe that Resize and Normalize are the most sensitive and critical phases, as about 20% of injections generate an output error that significantly impacts HOG detection. With our insights, we are able to find those limited portions of HOG that, if hardened, are more likely to increase reliability without introducing unnecessary overhead.

References

[1]
Advanced Micro Devices AMD. 2015. BIOS and kernel developer guide for AMD family 16h models 00h-0Fh processors. Technical doc 48751 Rev 3.03.
[2]
ARM. 2016. ARM Cortex-A73 MPCore Processor Technical Reference Manual Revision: r0p2. Retrieved from http://infocenter.arm.com/help/topic/com.arm.doc.100048_0002_04_en/cortex_a73_trm_100048_0002_04_en.pdf.
[3]
M. Bagatin, S. Gerardin, A. Paccagnella, A. Visconti, L. Chiavarone, M. Calabrese, and C. D. Frost. 2014. Sensitivity of NOR flash memories to wide-energy spectrum neutrons during accelerated tests. In Proceedings of the 2014 IEEE International Reliability Physics Symposium. 5F.3.1--5F.3.6.
[4]
R. C. Baumann. 2005. Radiation-induced soft errors in advanced semiconductor technologies. IEEE Trans. Device Mater. Reliabil. 5, 3 (Sept. 2005), 305--316.
[5]
G. Bradski. 2000. The open source computer vision (OpenCV) library. Dr. Dobb’s J. Softw. Tools 1, 1 (2000), 1.
[6]
M. A. Breuer, S. K. Gupta, and T. M. Mak. 2004. Defect and error tolerance in the presence of massive numbers of defects. IEEE Design Test Comput. 21, 3 (May 2004), 216--227. 10.1109/MDT.2004.8
[7]
George Candea and Armando Fox. 2001. Recursive restartability: Turning the reboot sledgehammer into a scalpel. In Proceedings of the 8th Workshop on Hot Topics in Operating Systems (HOTOS’01). IEEE Computer Society, Washington, DC, 125.
[8]
C. Constantinescu. 2002. Impact of deep submicron technology on dependability of VLSI circuits. In Proceedings of the International Conference on Dependable Systems and Networks, 2002 (DSN 2002). 205--209.
[9]
N. Dalal and B. Triggs. 2005. Histograms of oriented gradients for human detection. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005 (CVPR’05), Vol. 1. 886--893 vol. 1.
[10]
D. A. G. de Oliveira, L. L. Pilla, T. Santini, and P. Rech. 2016. Evaluation and mitigation of radiation-induced soft errors in graphics processing units. IEEE Trans. Comput. 65, 3 (March 2016), 791--804.
[11]
Nathan DeBardeleben, Sean Blanchard, Laura Monroe, Phil Romero, Daryl Grunau, Craig Idler, and Cornwell Wright. 2013. GPU behavior on a large HPC cluster. In Proceedings of the 6th Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grids in Conjunction with the 19th International European Conference on Parallel and Distributed Computing (Euro-Par 2013).
[12]
P. Dollar, C. Wojek, B. Schiele, and P. Perona. 2012. Pedestrian detection: An evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34, 4 (Apr. 2012), 743--761.
[13]
J. J. Dongarra, H. W. Meuer, and E. Strohmaier. 2013. National Highway Traffic Safety Administration report. Retrieved from https://www.nhtsa.gov.
[14]
J. J. Dongarra, H. W. Meuer, and E. Strohmaier. 2015. ISO26262 Standard. Retrieved from https://www.iso.org/obp/ui/#iso:std:iso:26262:-1:ed-1:v1:en.
[15]
Fernando Fernandes dos Santos, Paolo Rech, Daniel Oliveira, Lucas Weigel, Philippe Navaux, Luigi Carro, and Claudio Rogito Jung. 2016. taco2016-log-data. Retrieved from https://github.com/UFRGS-CAROL/taco2016-log-data.
[16]
European New Car Assessment Programme. 2012. Euro NCAP Rating Review, Report from the Ratings Group. Retrieved from http://www.euroncap.com.
[17]
B. Fang, K. Pattabiraman, M. Ripeanu, and S. Gurumurthi. 2014. GPU-qin: A methodology for evaluating the error resilience of GPGPU applications. In Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). 221--230.
[18]
Tom Fawcett. 2006. An introduction to ROC analysis. Pattern Recogn. Lett. 27, 8 (2006), 861--874.
[19]
A. D. Fogle, Don Darling, R. C. Blish, and E. Daszko. 2004. Flash memory under cosmic and alpha irradiation. IEEE Trans. Device Mater. Reliabil. 4, 3 (Sept. 2004), 371--376.
[20]
L. A. Bautista Gomez, Franck Cappello, L. Carro, N. DeBardeleben, B. Fang, S. Gurumurthi, K. Pattabiraman, R. Rech, and M. Sonza Reorda. 2014. GPGPUs: How to combine high computational power with high reliability. In Proceedings of the 2014 Design Automation and Test in Europe Conference and Exhibition. Dresden, Germany.
[21]
Google. 2016a. Google Self-Driving Car Project Montly Report. Retrieved June 1, 2016, from https://www.google.com/selfdrivingcar/reports/.
[22]
Google. 2016b. Wired Google’s Self-Driving Car Caused Its First Crash. Retrieved April 5, 2016, http://www.wired.com/2016/02/googles-self-driving-car-may-caused-first-crash/.
[23]
S. Hari, R. Venkatagiri, S. Adve, and H. Naeimi. 2014. TGangES: Gang error simulation for hardware resiliency evaluation. In Proceeding of the 41st Annual International Symposium on Computer Architecuture (ISCA’14).
[24]
John L. Hennessy and David A. Patterson. 2011. Computer Architecture, Fifth Edition: A Quantitative Approach (5th ed.). Morgan Kaufmann Publishers Inc., San Francisco, CA.
[25]
I. Herrera-Alzu and M. Lopez-Vallejo. 2014. System design framework and methodology for Xilinx virtex FPGA configuration scrubbers. IEEE Trans. Nucl. Sci. 61, 1 (Feb. 2014), 619--629.
[26]
Intel. 2014. Intel Xeon Phi Coprocessor System Software Developers Guide. (2014). Retrieved from https://software.intel.com/sites/default/files/managed/09/07/xeon-phi-coprocessor-system-software-developers-guide.pdf.
[27]
Intel. 2016. Intel 64 and IA-32 architectures software developers manual combined volumes: 1, 2A, 2B, 2C, 2D, 3A, 3B, 3C and 3D. Retrieved from http://www.intel.com/content/dam/www/public/us/en/documents/ manuals/64-ia-32-architectures-software-developer-manual-325462.pdf.
[28]
JEDEC. 2006. Measurement and Reporting of Alpha Particle and Terrestrial Cosmic Ray-Induced Soft Errors in Semiconductor Devices. Technical Report JESD89A. JEDEC Standard.
[29]
T. Joachims. 1999. Making large-scale SVM learning practical. In Advances in Kernel Methods - Support Vector Learning, B. Schölkopf, C. Burges, and A. Smola (Eds.). MIT Press, Cambridge, MA, 169--184.
[30]
G. Just, J. L. Autran, S. Serre, D. Munteanu, S. Sauze, A. Regnier, J. L. Ogier, P. Roche, and G. Gasiot. 2013. Soft errors induced by natural radiation at ground level in floating gate flash memories. In Proceedings of the 2013 IEEE International Reliability Physics Symposium (IRPS). 3D.4.1--3D.4.8.
[31]
Minsung Kang and Young Chul Lim. 2014. Pedestrian detection using HOG-based block selection. In Proceedings of the 2014 11th International Conference on Informatics in Control, Automation and Robotics (ICINCO), Vol. 02. 783--787.
[32]
Saeng-Hwan Kim, Won-Oh Lee, Jung-Ho Kim, Seong-Seop Lee, Sun-Young Hwang, Chang-Il Kim, Tae-Woo Kwon, Bong-Seok Han, Sung-Kwon Cho, Dae-Hui Kim, Jae-Keun Hong, Min-Yung Lee, Sung-Wook Yin, Hyeon-Gon Kim, Jin-Hong Ahn, Yong-Tark Kim, Yo-Hwan Koh, and Joong-Sik Kih. 2007. A low power and highly reliable 400Mbps mobile DDR SDRAM with on-chip distributed ECC. In Proceedings of the IEEE Asian Solid-State Circuits Conference, 2007 (ASSCC’07). 34--37.
[33]
J. C. Laprie. 1995. Dependable computing and fault tolerance: Concepts and terminology. In Proceedings of the 25 International Symposium on Fault-Tolerant Computing, 1995, Highlights from Twenty-Five Years. 2.
[34]
K. Lee and L. Sha. 2005. Process resurrection: A fast recovery mechanism for real-time embedded systems. In Proceedings of the 11th IEEE Real Time and Embedded Technology and Applications Symposium. 292--301.
[35]
Man-Lap Li, Pradeep Ramachandran, Swarup Kumar Sahoo, Sarita V. Adve, Vikram S. Adve, and Yuanyuan Zhou. 2008. Understanding the propagation of hard errors to software and implications for resilient system design. SIGOPS Oper. Syst. Rev. 42, 2 (March 2008), 265--276.
[36]
X. Li and X. Guo. 2013. A HOG feature and SVM based method for forward vehicle detection with single camera. In Proceedings of the 2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), Vol. 1. 263--266.
[37]
Joseph J. Lim, C. L. Zitnick, and Piotr Dollar. 2013. Sketch tokens: A learned mid-level representation for contour and object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[38]
R. R. Lutz. 1993. Analyzing software requirements errors in safety-critical, embedded systems. In Proceedings of IEEE International Symposium on Requirements Engineering, 1993. 126--133.
[39]
Subhasish Mitra. 2010. System-Level Single-Event Effects. IEEE Nuclear and Space Radiation Effects Conference, NSREC 2012 Short Course. (2010).
[40]
Shubhendu S. Mukherjee, Christopher Weaver, Joel Emer, Steven K. Reinhardt, and Todd Austin. 2003. A systematic methodology to compute the architectural vulnerability factors for a high-performance microprocessor. In Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, Washington, DC, USA, 29--.
[41]
Nithin Nakka, Giacinto Paolo Saggese, Zbigniew Kalbarczyk, and Ravishankar K. Iyer. 2005. An Architectural Framework for Detecting Process Hangs/Crashes. Springer, Berlin, 103--121.
[42]
M. Nicolaidis. 1999. Time redundancy based soft-error tolerance to rescue nanometer technologies. In Proceedings of the 17th IEEE VLSI Test Symposium, 1999. 86--94.
[43]
Nvidia. 2014. Tegra K1 techinical reference manual. DP-06905-001-v03p (2014).
[44]
NVIDIA. 2015. NVIDIAs Next Generation CUDA Compute Architecture: Kepler GK110. Retrieved from http://www.nvidia.com/content/PDF/kepler/NVIDIA-Kepler-GK110-Architecture-Whitepaper.pdf.
[45]
Nvidia. 2016. Tegra X1 techinical reference manual. TRM DP 07225 001-v1.1p (2016).
[46]
D. A. G. Oliveira, P. Rech, H. M. Quinn, T. D. Fairbanks, L. Monroe, S. E. Michalak, C. Anderson-Cook, P. O. A. Navaux, and L. Carro. 2014. Modern GPUs radiation sensitivity evaluation and mitigation through duplication with comparison. IEEE Trans. Nucl. Sci. 61, 6 (Dec. 2014), 3115--3122.
[47]
K. Pattabiraman, G. P. Saggese, D. Chen, Z. Kalbarczyk, and R. K. Iyer. 2006. Dynamic derivation of application-specific error detectors and their implementation in hardware. In Proceedings of the 2006 6th European Dependable Computing Conference, 97--108.
[48]
P. Rech, T. D. Fairbanks, H. M. Quinn, and L. Carro. 2013. Threads distribution effects on graphics processing units neutron sensitivity. IEEE Trans. Nucl. Sci. 60, 6 (Dec. 2013), 4220--4225.
[49]
Xiaofeng Ren and Deva Ramanan. 2013. Histograms of sparse codes for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[50]
G. P. Saggese, N. J. Wang, Z. T. Kalbarczyk, S. J. Patel, and R. K. Iyer. 2005. An experimental study of soft errors in microprocessors. IEEE Micro 25, 6 (Nov. 2005), 30--39.
[51]
Nirmal R. Saxena. 2016. Autonomous car is the new driver for resilient computing and design-for-test. In Silicon Errors in Logic—System Effects (SELSE) Keyonte. Retrieved from http://www.selse.org.
[52]
Bianca Schroeder, Eduardo Pinheiro, and Wolf-Dietrich Weber. 2009. DRAM errors in the wild: A large-scale field study. In SIGMETRICS.
[53]
Norbert Seifert, Xiaowei Zhu, and Lloyd W. Massengill. 2002. Impact of scaling on soft-error rates in commercial microprocessors. IEEE Trans. Nucl. Sci. 49, 6 (2002), 3100--3106.
[54]
S. Sivaraman and M. M. Trivedi. 2013. Looking at vehicles on the road: A survey of vision-based vehicle detection, tracking, and behavior analysis. IEEE Trans. Intell. Transport. Syst. 14, 4 (Dec 2013), 1773--1795.
[55]
Jingweijia Tan, N. Goswami, Tao Li, and Xin Fu. 2011. Analyzing soft-error vulnerability on GPGPU microarchitecture. In Proceedings of the 2011 IEEE International Symposium on Workload Characterization (IISWC). 226--235.
[56]
M. Violante, L. Sterpone, A. Manuzzato, S. Gerardin, P. Rech, M. Bagatin, A. Paccagnella, C. Andreani, G. Gorini, A. Pietropaolo, G. Cardarilli, S. Pontarelli, and C. Frost. 2007. A new hardware/software platform and a new 1/E neutron source for soft error studies: Testing fpgas at the ISIS facility. IEEE Trans. Nucl. Sci. 54, 4 (2007), 1184--1189.
[57]
Chin-Hsien Wu, Tei-Wei Kuo, and Li-Pin Chang. 2006. The design of efficient initialization and crash recovery for log-based file systems over flash memory. Trans. Stor. 2, 4 (Nov. 2006), 449--467.
[58]
H. J. Wunderlich, C. Braun, and S. Halder. 2013. Efficacy and efficiency of algorithm-based fault-tolerance on GPUs. In Proceedings of the 2013 IEEE 19th International On-Line Testing Symposium (IOLTS). 240--243.
[59]
Xiaodong Yang, Chenyang Zhang, and YingLi Tian. 2012. Recognizing actions using depth motion maps-based histograms of oriented gradients. In Proceedings of the 20th ACM International Conference on Multimedia (MM’12). ACM, New York, NY, 1057--1060.
[60]
Qiang Zhu, Mei-Chen Yeh, Kwang-Ting Cheng, and S. Avidan. 2006. Fast human detection using a cascade of histograms of oriented gradients. In Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2. 1491--1498.

Cited By

View all
  • (2024)Investigating the impact of transient hardware faults on deep learning neural network inferenceSoftware Testing, Verification and Reliability10.1002/stvr.187334:4Online publication date: Feb-2024
  • (2023)Fast and Accurate Error Simulation for CNNs Against Soft ErrorsIEEE Transactions on Computers10.1109/TC.2022.318427472:4(984-997)Online publication date: 1-Apr-2023
  • (2022)Characterizing Deep Learning Neural Network Failures Between Algorithmic Inaccuracy and Transient Hardware Faults2022 IEEE 27th Pacific Rim International Symposium on Dependable Computing (PRDC)10.1109/PRDC55274.2022.00020(54-67)Online publication date: Nov-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Architecture and Code Optimization
ACM Transactions on Architecture and Code Optimization  Volume 13, Issue 4
December 2016
648 pages
ISSN:1544-3566
EISSN:1544-3973
DOI:10.1145/3012405
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 November 2016
Accepted: 01 September 2016
Revised: 01 August 2016
Received: 01 June 2016
Published in TACO Volume 13, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. HOG
  2. pedestrian detection

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)91
  • Downloads (Last 6 weeks)18
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Investigating the impact of transient hardware faults on deep learning neural network inferenceSoftware Testing, Verification and Reliability10.1002/stvr.187334:4Online publication date: Feb-2024
  • (2023)Fast and Accurate Error Simulation for CNNs Against Soft ErrorsIEEE Transactions on Computers10.1109/TC.2022.318427472:4(984-997)Online publication date: 1-Apr-2023
  • (2022)Characterizing Deep Learning Neural Network Failures Between Algorithmic Inaccuracy and Transient Hardware Faults2022 IEEE 27th Pacific Rim International Symposium on Dependable Computing (PRDC)10.1109/PRDC55274.2022.00020(54-67)Online publication date: Nov-2022
  • (2022)TensorFI+: A Scalable Fault Injection Framework for Modern Deep Learning Neural Networks2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)10.1109/ISSREW55968.2022.00074(246-251)Online publication date: Oct-2022
  • (2022)Dependability of Alternative Computing Paradigms for Machine Learning: hype or hope?2022 25th International Symposium on Design and Diagnostics of Electronic Circuits and Systems (DDECS)10.1109/DDECS54261.2022.9770138(7-13)Online publication date: 6-Apr-2022
  • (2021)An Accurate HOG based Exemplar Pyramid Method for Image Classification of Astragalus L. TaxaBitlis Eren University Journal of Science and Technology10.17678/beuscitech.94840111:2(22-28)Online publication date: 20-Dec-2021
  • (2020)Lightweight Fault Detection and Management for Image Restoration2020 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT)10.1109/DFT50435.2020.9250844(1-6)Online publication date: 19-Oct-2020
  • (2019)Detecting Errors in Convolutional Neural Networks Using Inter Frame Spatio-Temporal Correlation2019 IEEE 25th International Symposium on On-Line Testing and Robust System Design (IOLTS)10.1109/IOLTS.2019.8854431(310-315)Online publication date: Jul-2019
  • (2018)Kernel and layer vulnerability factor to evaluate object detection reliability in GPUsIET Computers & Digital Techniques10.1049/iet-cdt.2018.502613:3(178-186)Online publication date: 4-Dec-2018
  • (2017)Analyzing the criticality of transient faults-induced SDCS on GPU applicationsProceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems10.1145/3148226.3148228(1-7)Online publication date: 12-Nov-2017
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media