[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3318170.3318179acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiwoclConference Proceedingsconference-collections
poster

Sparse-Matrix Compression Primitives with OpenCL Framework to Support Halide

Published: 13 May 2019 Publication History

Abstract

Halide and OpenCL now play important roles for heterogeneous multi-core computing. OpenCL provides vendor-level support and Halide provides domain-specific support such as vision processing and AI model (TVM Halide IR). Halide also provides flexible scheduling for applications on target machines. OpenCL plays a supporting role for Halide environments. In this work, we investigate the research issues in supporting sparse computation with Halide and their corresponding OpenCL support. We present sparse matrix compression primitives on Halide for sparse matrix matrix (SpMM) multiplication with OpenCL framework. Halide is a programming language designed to process image and array from numerous algorithms and scheduling primitives to achieve state-of-art performance including SIMD and heterogeneous computation. This paper proposed the implementation of sparse matrix compression for Halide scheduling primitives including COO, CSR, and hybrid CSR. The design of experiments includes Halide primitives for sparse matrix compression and matrix computations. The experimental result of computation with compressing matrix shows the performance are improved by up to 85% compared to the baseline without compression.

References

[1]
Rong-Guey Chang, Tyng-Ruey Chuang, and Jenq Kuen Lee. 2004. Support and optimization for parallel sparse programs with array intrinsics of Fortran 90. Parallel Comput. 30, 4 (2004), 527--550.
[2]
Changwan Hong, Aravind Sukumaran-Rajam, Bortik Bandyopadhyay, Jinsung Kim, Süreyya Emre Kurt, Israt Nisa, Shivani Sabhlok, Ümit V Çatalyürek, Srinivasan Parthasarathy, and P Sadayappan. 2018. Efficient sparse-matrix multi-vector product on GPUs. In Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing. ACM, 66--79.
[3]
Jonathan Ragan-Kelley, Andrew Adams, Sylvain Paris, Marc Levoy, Saman Amarasinghe, and Frédo Durand. 2012. Decoupling algorithms from schedules for easy optimization of image processing pipelines. (2012).
[4]
Jonathan Ragan-Kelley, Connelly Barnes, Andrew Adams, Sylvain Paris, Frédo Durand, and Saman Amarasinghe. 2013. Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines. ACM SIGPLAN Notices 48, 6 (2013), 519--530.

Cited By

View all
  • (2021)Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM50th International Conference on Parallel Processing Workshop10.1145/3458744.3473352(1-7)Online publication date: 9-Aug-2021
  • (2020)Devise Sparse Compression Schedulers to Enhance FastText MethodsWorkshop Proceedings of the 49th International Conference on Parallel Processing10.1145/3409390.3409394(1-8)Online publication date: 17-Aug-2020
  • (2020)Accelerating NNEF Framework on OpenCL Devices Using clDNNProceedings of the International Workshop on OpenCL10.1145/3388333.3388655(1-2)Online publication date: 27-Apr-2020

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
IWOCL '19: Proceedings of the International Workshop on OpenCL
May 2019
102 pages
ISBN:9781450362306
DOI:10.1145/3318170
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

In-Cooperation

  • Khronos: Khronos Group
  • Northeastern University
  • Codeplay: Codeplay Software Ltd.
  • Intel: Intel
  • The University of Bristol: The University of Bristol

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2019

Check for updates

Author Tags

  1. Halide
  2. OpenCL
  3. Sparse Matrix

Qualifiers

  • Poster
  • Research
  • Refereed limited

Conference

IWOCL'19
IWOCL'19: International Workshop on OpenCL
May 13 - 15, 2019
MA, Boston, USA

Acceptance Rates

IWOCL '19 Paper Acceptance Rate 13 of 33 submissions, 39%;
Overall Acceptance Rate 84 of 152 submissions, 55%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)1
Reflects downloads up to 20 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Support Convolution of CNN with Compression Sparse Matrix Multiplication Flow in TVM50th International Conference on Parallel Processing Workshop10.1145/3458744.3473352(1-7)Online publication date: 9-Aug-2021
  • (2020)Devise Sparse Compression Schedulers to Enhance FastText MethodsWorkshop Proceedings of the 49th International Conference on Parallel Processing10.1145/3409390.3409394(1-8)Online publication date: 17-Aug-2020
  • (2020)Accelerating NNEF Framework on OpenCL Devices Using clDNNProceedings of the International Workshop on OpenCL10.1145/3388333.3388655(1-2)Online publication date: 27-Apr-2020

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media