[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/3437539.3437799acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article

Automated hardware generation of CNN models on FPGAs: late breaking results

Published: 18 November 2020 Publication History

Abstract

In this paper, we propose an automated framework that takes as input a TensorFlow inference graph and generates high-performance accelerators on FPGA by assembling CNN pre-implemented components as a puzzle, based on the graph topology. Using pre-implemented components allows us the only use the minimum of resources necessary, predict the performance and a gain in productivity We adopt a unified representation based on systolic array to perform the computational-hungry operations of the model and provide novel analysis of design trade-offs for FPGA CNN accelerators. Experimental results show the great performance, low latency and flexibility provided by the proposed framework.

References

[1]
D. Tchuinkou and C. Bobda, "R-covnet: Recurrent neural convolution network for 3d object recognition," in 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, 2018, pp. 331--335.
[2]
C. Lavin and A. Kaviani, "Rapidwright: Enabling custom crafted implementations for fpgas," in 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM). IEEE, 2018, pp. 133--140.
[3]
C. Zhang, G. Sun, Z. Fang, P. Zhou, P. Pan, and J. Cong, "Caffeine: Towards uniformed representation and acceleration for deep convolutional neural networks," IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2018.
[4]
J. Zhang and J. Li, "Improving the performance of opencl-based fpga accelerator for convolutional neural network," in Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. ACM, 2017, pp. 25--34.

Cited By

View all
  • (2024)Efficient SpMM Accelerator for Deep Learning: Sparkle and Its Automated GeneratorACM Transactions on Reconfigurable Technology and Systems10.1145/366589617:3(1-30)Online publication date: 7-Jun-2024
  • (2022)Deploying Multi-tenant FPGAs within Linux-based Cloud InfrastructureACM Transactions on Reconfigurable Technology and Systems10.1145/347405815:2(1-31)Online publication date: 30-Jun-2022

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
DAC '20: Proceedings of the 57th ACM/EDAC/IEEE Design Automation Conference
July 2020
1545 pages
ISBN:9781450367257
  • General Chair:
  • Zhuo Li

Sponsors

In-Cooperation

  • IEEE-CEDA

Publisher

IEEE Press

Publication History

Published: 18 November 2020

Check for updates

Author Tags

  1. CNN
  2. FPGA
  3. systolic array
  4. tensorflow

Qualifiers

  • Research-article

Conference

DAC '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

Upcoming Conference

DAC '25
62nd ACM/IEEE Design Automation Conference
June 22 - 26, 2025
San Francisco , CA , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)4
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Efficient SpMM Accelerator for Deep Learning: Sparkle and Its Automated GeneratorACM Transactions on Reconfigurable Technology and Systems10.1145/366589617:3(1-30)Online publication date: 7-Jun-2024
  • (2022)Deploying Multi-tenant FPGAs within Linux-based Cloud InfrastructureACM Transactions on Reconfigurable Technology and Systems10.1145/347405815:2(1-31)Online publication date: 30-Jun-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media