[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1109/FCCM.2015.30guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Accelerating SpMV on FPGAs by Compressing Nonzero Values

Published: 02 May 2015 Publication History

Abstract

Sparse matrix vector multiplication (SpMV) is an important kernel in many areas of scientific computing, especially as a building block for iterative linear system solvers. We study how loss less nonzero compression can be used to overcome memory bandwidth limitations in FPGA-based SpMV implementations. We introduce a dictionary-based compression algorithm which reduces redundant nonzero values to improve memory bandwidth without reducing computation efficiency by making use of spare FPGA resources. We show how a sparse matrix in the CSR format can be converted to the proposed storage format on the CPU and that average compression ratios of 1.14 - 1.40 and up to 2.65 times can be achieved, over CSR, for relevant matrices in our benchmarks.

Cited By

View all
  • (2024)SPADES: A 0.54-GFLOPS/W Sparse Matrix Vector Multiplication Accelerator Featuring On-the-Fly GZIP Decompression for 3.36X Reduction in Off-Chip Data MovementProceedings of the 29th ACM/IEEE International Symposium on Low Power Electronics and Design10.1145/3665314.3670811(1-6)Online publication date: 5-Aug-2024
  • (2024)HiSpMV: Hybrid Row Distribution and Vector Buffering for Imbalanced SpMV Acceleration on FPGAsProceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays10.1145/3626202.3637557(154-164)Online publication date: 1-Apr-2024
  • (2022)High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLSProceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/3490422.3502368(54-64)Online publication date: 13-Feb-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
FCCM '15: Proceedings of the 2015 IEEE 23rd Annual International Symposium on Field-Programmable Custom Computing Machines
May 2015
239 pages
ISBN:9781479999699

Publisher

IEEE Computer Society

United States

Publication History

Published: 02 May 2015

Author Tags

  1. FPGA
  2. Sparse matrix-vector multiplication
  3. nonzero compression

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)SPADES: A 0.54-GFLOPS/W Sparse Matrix Vector Multiplication Accelerator Featuring On-the-Fly GZIP Decompression for 3.36X Reduction in Off-Chip Data MovementProceedings of the 29th ACM/IEEE International Symposium on Low Power Electronics and Design10.1145/3665314.3670811(1-6)Online publication date: 5-Aug-2024
  • (2024)HiSpMV: Hybrid Row Distribution and Vector Buffering for Imbalanced SpMV Acceleration on FPGAsProceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays10.1145/3626202.3637557(154-164)Online publication date: 1-Apr-2024
  • (2022)High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLSProceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/3490422.3502368(54-64)Online publication date: 13-Feb-2022
  • (2021)A reduced-precision streaming SpMV architecture for Personalized PageRank on FPGAProceedings of the 26th Asia and South Pacific Design Automation Conference10.1145/3394885.3431548(378-383)Online publication date: 18-Jan-2021
  • (2019)Compiling KB-sized machine learning models to tiny IoT devicesProceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation10.1145/3314221.3314597(79-95)Online publication date: 8-Jun-2019
  • (2017)Efficient Assembly for High-Order Unstructured FEM Meshes (FPL 2015)ACM Transactions on Reconfigurable Technology and Systems10.1145/302406410:2(1-22)Online publication date: 6-Apr-2017
  • (2017)FPGA Acceleration for Computational Glass-Free DisplaysProceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/3020078.3021728(267-274)Online publication date: 22-Feb-2017
  • (2016)CASKProceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/2847263.2847338(179-184)Online publication date: 21-Feb-2016
  • (2016)Reducing Memory Requirements for High-Performance and Numerically Stable Gaussian EliminationProceedings of the 2016 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/2847263.2847281(244-253)Online publication date: 21-Feb-2016
  • (2016)A Multi-codec Framework to Enhance Data Channels in FPGA Streaming SystemsProceedings of the 12th International Symposium on Applied Reconfigurable Computing - Volume 962510.1007/978-3-319-30481-6_17(207-219)Online publication date: 22-Mar-2016

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media