[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2909437.2909442acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiwoclConference Proceedingsconference-collections
extended-abstract

clSPARSE: A Vendor-Optimized Open-Source Sparse BLAS Library

Published: 19 April 2016 Publication History

Abstract

Sparse linear algebra is a cornerstone of modern computational science. These algorithms ignore the zero-valued entries found in many domains in order to work on much larger problems at much faster rates than dense algorithms. Nonetheless, optimizing these algorithms is not straightforward. Highly optimized algorithms for multiplying a sparse matrix by a dense vector, for instance, are the subject of a vast corpus of research and can be hundreds of times longer than naïve implementations. Optimized sparse linear algebra libraries are thus needed so that users can build applications without enormous effort.
Hardware vendors release proprietary libraries that are highly optimized for their devices, but they limit interoperability and promote vendor lock-in. Open libraries often work across multiple devices and can quickly take advantage of new innovations, but they may not reach peak performance. The goal of this work is to provide a sparse linear algebra library that offers both of these advantages.
We thus describe clSPARSE, a permissively licensed open-source sparse linear algebra library that offers state-of-the-art optimized algorithms implemented in OpenCL™. We test clSPARSE on GPUs from AMD and Nvidia and show performance benefits over both the proprietary cuSPARSE library and the open-source ViennaCL library.

References

[1]
H. M. Aktulga, A. Buluç, S. Williams, and C. Yang. Optimizing Sparse Matrix-Multiple Vectors Multiplication for Nuclear Configuration Interaction Calculations. In Proc. of the Int'l Parallel and Distributed Processing Symposium (IPDPS), 2014.
[2]
M. Daga and J. L. Greathouse. Structural Agnostic SpMV: Adapting CSR-Adaptive for Irregular Matrices. In Proc. of the Int'l Conf. on High Performance Computing (HiPC), 2015.
[3]
I. S. Duff, M. A. Heroux, and R. Pozo. An Overview of the Sparse Basic Linear Algebra Subprograms: The New Standard from the BLAS Technical Forum. Trans. on Mathematical Software, 28(2):239--267, 2002.
[4]
J. L. Greathouse and M. Daga. Efficient Sparse Matrix-Vector Multiplication on GPUs Using the CSR Storage Format. In Proc. of the Int'l Conf. for High Performance Computing, Networking, Storage and Analysis (SC), 2014.
[5]
W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith. Towards Realistic Performance Bounds for Implicit CFD Codes. In Proc. of the Int'l Parallel Computational Fluid Dynamics Conf. (PARCFD), 1999.
[6]
A. Kaiser, S. Williams, K. Madduri, K. Ibrahim, D. H. Bailey, J. W. Demmel, and E. Strohmaier. TORCH Computational Reference Kernels: A Testbed for Computer Science Research. Technical Report LBNL-4172E, Lawrence Berkeley National Laboratory, 2010.
[7]
M. Kreutzer, G. Hager, G. Wellein, H. Fehske, and A. R. Bishop. A Unified Sparse Matrix Data Format for Modern Processors with Wide SIMD Units. SIAM Journal on Scientific Computing, 36(5):C401--C423, 2014.
[8]
D. Langr and P. Tvrdík. Evaluation Criteria for Sparse Matrix Storage Formats. IEEE Trans. on Parallel and Distributed Systems, 27(2):428--440, Feb. 2016.
[9]
W. Liu and B. Vinter. An Efficient GPU General Sparse Matrix-Matrix Multiplication for Irregular Data. In Proc. of the Int'l Parallel and Distributed Processing Symp. (IPDPS), 2014.
[10]
K. Rupp, F. Rudolf, and J. Weinbub. ViennaCL - A High Level Linear Algebra Library for GPUs and Multi-Core CPUs. In Int'l Workshop on GPUs and Scientific Applications (GPUScA), 2010.
[11]
K. Rupp, J. Weinbub, A. Jüngel, and T. Grasser. Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units. CoRR, abs/1410.4054, 2014.
[12]
B.-Y. Su and K. Keutzer. clSpMV: A Cross-Platform OpenCL SpMV Framework on GPUs. In Proc. of the Int'l Conf. on Supercomputing (ICS), 2012.

Cited By

View all
  • (2024)Automatically optimized component model computation for power system simulation on GPUElectric Power Systems Research10.1016/j.epsr.2024.110740235(110740)Online publication date: Oct-2024
  • (2023)An Efficient Eigenvalue Bounding Method: Cfl Condition RevisitedSSRN Electronic Journal10.2139/ssrn.4353590Online publication date: 2023
  • (2022)SPbLA: The Library of GPGPU-powered Sparse Boolean Linear Algebra OperationsJournal of Open Source Software10.21105/joss.037437:76(3743)Online publication date: Aug-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
IWOCL '16: Proceedings of the 4th International Workshop on OpenCL
April 2016
131 pages
ISBN:9781450343381
DOI:10.1145/2909437
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

In-Cooperation

  • The University of Bristol: The University of Bristol

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 April 2016

Check for updates

Author Tags

  1. GPGPU
  2. OpenCL
  3. Sparse Linear Algebra
  4. clSPARSE

Qualifiers

  • Extended-abstract
  • Research
  • Refereed limited

Conference

IWOCL '16
IWOCL '16: The 4th International Workshop on OpenCL
April 19 - 21, 2016
Vienna, Austria

Acceptance Rates

Overall Acceptance Rate 84 of 152 submissions, 55%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Automatically optimized component model computation for power system simulation on GPUElectric Power Systems Research10.1016/j.epsr.2024.110740235(110740)Online publication date: Oct-2024
  • (2023)An Efficient Eigenvalue Bounding Method: Cfl Condition RevisitedSSRN Electronic Journal10.2139/ssrn.4353590Online publication date: 2023
  • (2022)SPbLA: The Library of GPGPU-powered Sparse Boolean Linear Algebra OperationsJournal of Open Source Software10.21105/joss.037437:76(3743)Online publication date: Aug-2022
  • (2022)The Linear Algebra Mapping Problem. Current State of Linear Algebra Languages and LibrariesACM Transactions on Mathematical Software10.1145/354993548:3(1-30)Online publication date: 20-Jul-2022
  • (2021)Exploiting Activation Sparsity for Fast CNN Inference on Mobile GPUsACM Transactions on Embedded Computing Systems10.1145/347700820:5s(1-25)Online publication date: 17-Sep-2021
  • (2021)GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight SparsityIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2021.3089687(1-1)Online publication date: 2021
  • (2021)SPbLA: The Library of GPGPU-Powered Sparse Boolean Linear Algebra Operations2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW52791.2021.00049(272-275)Online publication date: Jun-2021
  • (2021)On the implementation of flux limiters in algebraic frameworksComputer Physics Communications10.1016/j.cpc.2021.108230(108230)Online publication date: Nov-2021
  • (2021)A GPU Architecture Aware Fine-Grain Pruning Technique for Deep Neural NetworksEuro-Par 2021: Parallel Processing10.1007/978-3-030-85665-6_14(217-231)Online publication date: 25-Aug-2021
  • (2020)Flexible Group-Level Pruning of Deep Neural Networks for On-Device Machine Learning2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)10.23919/DATE48585.2020.9116287(79-84)Online publication date: Mar-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media