[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

A Multi-Level-Optimization Framework for FPGA-Based Cellular Neural Network Implementation

Published: 28 November 2018 Publication History

Abstract

Cellular Neural Network (CeNN) is considered as a powerful paradigm for embedded devices. Its analog and mix-signal hardware implementations are proved to be applicable to high-speed image processing, video analysis, and medical signal processing with its efficiency and popularity limited by smaller implementation size and lower precision. Recently, digital implementations of CeNNs on FPGA have attracted researchers from both academia and industry due to its high flexibility and short time-to-market. However, most existing implementations are not well optimized to fully utilize the advantages of FPGA platform with unnecessary design and computational redundancy that prevents speedup. We propose a multi-level-optimization framework for energy-efficient CeNN implementations on FPGAs. In particular, the optimization framework is featured with three level optimizations: system-, module-, and design-space-level, with focus on computational redundancy and attainable performance, respectively. Experimental results show that with various configurations our framework can achieve an energy-efficiency improvement of 3.54× and up to 3.88× speedup compared with existing implementations with similar accuracy.

References

[1]
Cheng Zhuo, Kassan Unda, Yiyu Shi, and Wei Kai Shih. 2018. From layout to system: Early stage power delivery and architecture co-exploration. IEEE Trans. Comput.-Aided Design Integr. Circ. Syst. PP, 99 (2018), 1--1.
[2]
Cheng Zhuo, Gustavo Wilke, Ritochit Chakraborty, Alaeddin A. Aydiner, Sourav Chakravarty, and Wei-Kai Shih. 2015. Silicon-validated power delivery modeling and analysis on a 32-nm DDR I/O interface. IEEE Trans. Very Large Scale Integr. Syst. 23, 9 (2015), 1760--1771.
[3]
Huaqing Li, Xiaofeng Liao, Chuandong Li, Hongyu Huang, and Chaojie Li. 2011. Edge detection of noisy images based on cellular neural networks. Commun. Nonlin. Sci. Numer. Simul. 16, 9 (2011), 3746--3759.
[4]
Osama Basil Gazi, Mohamed Belal, and Hala Abdel-Galil. 2014. Edge detection in satellite image using cellular neural network. System 8 (2014), 9.
[5]
Jeremy Hills and Yongmin Zhong. 2014. Cellular neural network-based thermal modelling for real-time robotic path planning. Int. J. Agile Syst. Manage. 20 7, 3--4 (2014), 261--281.
[6]
M. Duraisamy and F. Mary Magdalene Jane. 2014. Cellular neural network based medical image segmentation using artificial bee colony algorithm. In Proceedings of the International Conference on Green Computing Communication and Electrical Engineering (ICGCCEE’14). IEEE, 1--6.
[7]
Zhongyang Liu, Cheng Zhuo, and Xiaowei Xu. 2018. Efficient Segmentation Method Using Quantised and Non-linear CeNN for Breast Tumour Classification. Electronics Letters.
[8]
Fadi Al Machot, Mouhannad Ali, Ahmad Haj Mosa, Christopher Schwarzlmüller, Markus Gutmann, and Kyandoghere Kyamakya. 2016. Real-time raindrop detection based on cellular neural networks for ADAS. J. Real-Time Image Process. (2016), 1--13.
[9]
Nerhun Yildiz, Evren Cesur, and Vedat Tavsanoglu. 2016. On the way to a third generation real-time cellular neural network processor. In Proceedings of the International Workshop on Cellular Neural Networks and their Applications (CNNA’16).
[10]
Dilan Manatunga, Hyesoon Kim, and Saibal Mukhopadhyay. 2015. SP-CNN: A scalable and programmable CNN-based accelerator. IEEE Micro 35, 5 (2015), 42--50.
[11]
Hubert Harrer, Josef A. Nossek, Tams Roska, and Leon O. Chua. 1994. A current-mode DTCNN universal chip. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS’94), Vol. 4. IEEE, 135--138.
[12]
Angel Rodrguez-Vzquez, Gustavo Lin-Cembrano, L. Carranza, Elisenda Roca-Moreno, Ricardo Carmona-Galn, Francisco Jimnez-Garrido, Rafael Domnguez-Castro, and S. Espejo Meana. 2004. ACE16k: The third generation of mixed-signal SIMD-CNN ACE chips toward VSoCs. IEEE Trans. Circ. Syst. I: Reg. Papers 51, 5 (2004), 851--863.
[13]
Gabriele Manganaro, Paolo Arena, and Luigi Fortuna. 2012. Cellular Neural Networks: Chaos, Complexity and VLSI Processing. Vol. 1. Springer Science 8 Business Media.
[14]
Stephen J. Carey, David R. W. Barr, Bin Wang, Alexey Lopich, and Piotr Dudek. 2013. Mixed signal SIMD processor array vision chip for real-time image processing. Analog Integr. Circ. Signal Process. 77, 3 (2013), 385--399.
[15]
Nerhun Yildiz, Evren Cesur, Kamer Kayaer, Vedat Tavsanoglu, and Murathan Alpay. 2015. Architecture of a fully pipelined real-time cellular neural network emulator. IEEE Trans. Circ. Syst. I: Reg. Papers 62, 1 (2015), 130--138.
[16]
Seungjin Lee, Minsu Kim, Kwanho Kim, Joo-Young Kim, and Hoi-Jun Yoo. 2011. 24-GOPS 4.5-mm<sup>2</sup> digital cellular neural network for rapid visual attention in an object-recognition SoC. IEEE Trans. Neural Netw. 22, 1 (2011), 64--73.
[17]
Sasanka Potluri, Alireza Fasih, Laxminand Kishore Vutukuru, Fadi Al Machot, and Kyandoghere Kyamakya. 2011. CNN based high performance computing for real time image processing on GPU. In Proceedings of the Nonlinear Dynamics and Synchronization (INDS) 8 16th International Symposium on Theoretical Electrical Engineering (ISTET), 2011 Joint 3rd International Workshop. IEEE, 1--7.
[18]
Hsin-Chieh Chen, Yung-Ching Hung, Chang-Kuo Chen, Teh-Lu Liao, and Chun-Kuo Chen. 2006. Image-processing algorithms realized by discrete-time cellular neural networks and their circuit implementations. Chaos, Solitons Fract. 29, 5 (2006), 1100--1108.
[19]
Reid Porter, Jan Frigo, Al Conti, Neal Harvey, Garrett Kenyon, and Maya Gokhale. 2007. A reconfigurable computing framework for multi-scale cellular image processing. Microprocess. Microsyst. 31, 8 (2007), 546--563.
[20]
J. Javier Martnez, Javier Garrigs, Javier Toledo, and J. Manuel Fernandez. 2013. An efficient and expandable hardware implementation of multilayer cellular neural networks. Neurocomputing 114 (2013), 54--62.
[21]
Jens Muller, Robert Wittig, Jan Muller, and Ronald Tetzlaff. 2016. An improved cellular nonlinear network architecture for binary and greyscale image processing. IEEE Trans. Circ. Syst. II: Express Briefs 65, 8 (2016), 1084--1088.
[22]
Qian Wang, Youjie Li, Botang Shao, Siddhartha Dey, and Peng Li. 2017. Energy-efficient parallel neuromorphic architectures with approximate arithmetic on FPGA. Neurocomputing 221 (2017).
[23]
Qian Wang, Yingyezhe Jin, and Peng Li. 2015. General-purpose LSM learning processor architecture and theoretically guided design space exploration. In Proceedings of the Biomedical Circuits and Systems Conference (BioCAS’15). 1--4.
[24]
Bruno da Silva, An Braeken, Eril H. D’Hollander, and Abdellah Touhafi. 2013. Performance modeling for FPGAs: Extending the roofline model with high-level synthesis tools. Int. J. Reconfig. Comput. 2013, Article 7 (2013).
[25]
Rahimeh Rouhi, Mehdi Jafari, Shohreh Kasaei, and Peiman Keshavarzian. 2015. Benign and malignant breast tumors classification based on region growing and CNN segmentation. Expert Syst. Appl. 42, 3 (2015), 990--1002.
[26]
Wei Wang, Li-Jun Yang, Yu-Ting Xie, and You-wei An. 2014. Edge detection of infrared image with CNN_DGA algorithm. Optik-Int. J. Light Electron Optics 125, 1 (2014), 464--467.
[27]
Xiaoming Liu and Jinshan Tang. 2014. Mass classification in mammograms using selected geometry and texture features, and a new SVM-based feature selection method. IEEE Syst. J. 8, 3 (2014), 910--920.
[28]
Yu Zhang, Noriko Tomuro, Jacob Furst, and Daniela Stan Raicu. 2012. Building an ensemble system for diagnosing masses in mammograms. Int. J. Comput. Assisted Radiol. Surg. 7, 2 (2012), 323--329.
[29]
Brijesh Verma, Peter McLeod, and Alan Klevansky. 2010. Classification of benign and malignant patterns in digital mammograms for the diagnosis of breast cancer. Expert Syst. Appl. 37, 4 (2010), 3344--3351.
[30]
Leon O. Chua and Tamas Roska. 2002. Cellular Neural Networks and Visual Computing: Foundations and Applications. Cambridge University Press.
[31]
K. Karacs, G. Y. Cserey, Zarndy, P. Szolgay, C. S. Rekeczky, L. Kek, V. Szab, G. Pazienza, and T. Roska. 2010. Software library for cellular wave computing engines. Cellular Sensory and Wave Computing Laboratory of the Computer and Automation Research Institute. Hungarian Academy of Sciences (MTA SZTAKI), and the Jedlik Laboratories of the Pazmany University.
[32]
M. Heath, K. Bowyer, D. Kopans, R. Moore, and P. Kegelmeyer. 2001. The Digital Database for Screening Mammography. Springer, Netherlands, 457--460.
[33]
Chen Zhang, Peng Li, Guangyu Sun, Yijin Guan, Bingjun Xiao, and Jason Cong. 2015. Optimizing fpga-based accelerator design for deep convolutional neural networks. In Proceedings of the ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. ACM, 161--170.

Cited By

View all
  • (2024)Diffusion-based Wasserstein generative adversarial network for blood cell image augmentationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.108221133(108221)Online publication date: Jul-2024
  • (2023)Interface-type tunable oxygen ion dynamics for physical reservoir computingNature Communications10.1038/s41467-023-42993-x14:1Online publication date: 7-Nov-2023
  • (2022)Dynamic Effects of a MCNN-CS Under Electromagnetic Induction and Its ApplicationInternational Journal of Bifurcation and Chaos10.1142/S021812742250165632:11Online publication date: 26-Sep-2022
  • Show More Cited By

Index Terms

  1. A Multi-Level-Optimization Framework for FPGA-Based Cellular Neural Network Implementation

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Journal on Emerging Technologies in Computing Systems
      ACM Journal on Emerging Technologies in Computing Systems  Volume 14, Issue 4
      Special Issue on Neuromorphic Computing
      October 2018
      164 pages
      ISSN:1550-4832
      EISSN:1550-4840
      DOI:10.1145/3294068
      • Editor:
      • Yuan Xie
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Journal Family

      Publication History

      Published: 28 November 2018
      Accepted: 01 August 2018
      Revised: 01 May 2018
      Received: 01 December 2017
      Published in JETC Volume 14, Issue 4

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Cellular neural network
      2. FPGA
      3. acceleration

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)7
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 01 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Diffusion-based Wasserstein generative adversarial network for blood cell image augmentationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.108221133(108221)Online publication date: Jul-2024
      • (2023)Interface-type tunable oxygen ion dynamics for physical reservoir computingNature Communications10.1038/s41467-023-42993-x14:1Online publication date: 7-Nov-2023
      • (2022)Dynamic Effects of a MCNN-CS Under Electromagnetic Induction and Its ApplicationInternational Journal of Bifurcation and Chaos10.1142/S021812742250165632:11Online publication date: 26-Sep-2022
      • (2022)Joint Coprime Weights Optimization for Sub-Nyquist Tensor Beamforming2022 IEEE Radar Conference (RadarConf22)10.1109/RadarConf2248738.2022.9764278(1-6)Online publication date: 21-Mar-2022
      • (2022)BackboneAnalysis: Structured Insights into Compute Platforms from CNN Inference Latency2022 IEEE Intelligent Vehicles Symposium (IV)10.1109/IV51971.2022.9827260(1801-1809)Online publication date: 5-Jun-2022
      • (2022)An efficient Industrial Internet of Things video data processing system for protocol identification and quality enhancementIET Cyber-Physical Systems: Theory & Applications10.1049/cps2.120358:2(63-75)Online publication date: 22-Sep-2022
      • (2021)A Reconfigurable Multiplier for Signed Multiplications with Asymmetric Bit-WidthsACM Journal on Emerging Technologies in Computing Systems10.1145/344621317:4(1-16)Online publication date: 30-Jun-2021
      • (2021)Regularization-Free Structural Pruning for GPU Inference Acceleration2021 22nd International Symposium on Quality Electronic Design (ISQED)10.1109/ISQED51717.2021.9424299(149-153)Online publication date: 7-Apr-2021
      • (2021)Sample Fourth-order Cumulant Tensor Denoising for DOA Estimation with Coprime L-shaped Array2021 55th Asilomar Conference on Signals, Systems, and Computers10.1109/IEEECONF53345.2021.9723340(1668-1672)Online publication date: 31-Oct-2021
      • (2021)A Comprehensive Review of Machine Learning in Multi-objective Optimization2021 IEEE 4th International Conference on Big Data and Artificial Intelligence (BDAI)10.1109/BDAI52447.2021.9515233(7-14)Online publication date: 2-Jul-2021
      • Show More Cited By

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media