DOI: 10.1109/XSW.2013.7
Article

Kokkos: Enabling Performance Portability Across Manycore Architectures

Published: 15 August 2013

Abstract

The manycore revolution in computational hardware can be characterized by increasing thread counts, decreasing memory per thread, and architecture-specific performance constraints for memory access patterns. High performance computing (HPC) on emerging manycore architectures requires codes to exploit every opportunity for thread-level parallelism and to satisfy conflicting performance constraints. We developed the Kokkos C++ library to provide scientific and engineering codes with a user-accessible, performance-portable manycore programming model. The two foundational abstractions of Kokkos are (1) dispatching work to a manycore device for parallel execution and (2) managing multidimensional arrays with polymorphic layouts. The integration of these abstractions enables users' code to satisfy multiple architecture-specific memory access pattern performance constraints without modification of the source code. In this paper we describe the Kokkos abstractions, summarize the Kokkos application programmer interface (API), and present performance results for a molecular dynamics computational kernel and a finite element mini-application.
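
The two abstractions named in the abstract can be illustrated with a minimal, self-contained C++ sketch. This is not the Kokkos API: the names `RowMajor`, `ColMajor`, `Array2D`, `dispatch`, and `fill` are hypothetical and chosen only for illustration, and the dispatch stand-in runs serially where a real backend would run iterations in parallel. The sketch assumes that a "polymorphic layout" can be modeled as a compile-time policy that maps an index pair (i, j) to a memory offset, so that a kernel written once against the (i, j) interface works unchanged for either layout.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical layout policies (illustrative names, not the Kokkos API).
// Each maps a 2-D index (i, j) to an offset in a flat buffer.
struct RowMajor {
    static std::size_t offset(std::size_t i, std::size_t j,
                              std::size_t /*rows*/, std::size_t cols) {
        return i * cols + j;  // j is the fastest-varying index
    }
};

struct ColMajor {
    static std::size_t offset(std::size_t i, std::size_t j,
                              std::size_t rows, std::size_t /*cols*/) {
        return i + j * rows;  // i is the fastest-varying index
    }
};

// Hypothetical 2-D array whose memory layout is a compile-time parameter.
template <typename T, typename Layout>
class Array2D {
public:
    Array2D(std::size_t rows, std::size_t cols)
        : rows_(rows), cols_(cols), data_(rows * cols) {}

    // Element access goes through the layout policy.
    T& operator()(std::size_t i, std::size_t j) {
        assert(i < rows_ && j < cols_);
        return data_[Layout::offset(i, j, rows_, cols_)];
    }

    std::size_t rows() const { return rows_; }
    std::size_t cols() const { return cols_; }

    // Read-only view of the underlying flat storage, for inspection.
    const std::vector<T>& raw() const { return data_; }

private:
    std::size_t rows_;
    std::size_t cols_;
    std::vector<T> data_;
};

// Stand-in for work dispatch: calls functor(i) for every i in [0, n).
// Serial here; a real backend would distribute iterations across threads.
template <typename Functor>
void dispatch(std::size_t n, Functor functor) {
    for (std::size_t i = 0; i < n; ++i) {
        functor(i);
    }
}

// User kernel written once against the (i, j) interface.
// The same source compiles for either layout.
template <typename Layout>
void fill(Array2D<double, Layout>& a) {
    dispatch(a.rows(), [&a](std::size_t i) {
        for (std::size_t j = 0; j < a.cols(); ++j) {
            a(i, j) = static_cast<double>(10 * i + j);
        }
    });
}
```

Calling `fill` on an `Array2D<double, RowMajor>` and on an `Array2D<double, ColMajor>` of the same shape yields the same logical values at every (i, j), while the order of elements in the underlying buffer differs. Under the assumptions above, this is the property that lets the layout be chosen per architecture without touching the kernel source.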

Published In

XSW '13: Proceedings of the 2013 Extreme Scaling Workshop (XSW 2013)
August 2013
72 pages
ISBN: 9781479936915

Publisher

IEEE Computer Society

United States

Qualifiers

  • Article

Cited By

  • (2024) Accelerating atmospheric physics parameterizations using graphics processing units. International Journal of High Performance Computing Applications 38:4 (282-296). DOI: 10.1177/10943420241238711. Online publication date: 1-Jul-2024
  • (2024) OP-PIC - an Unstructured-Mesh Particle-in-Cell DSL for Developing Nuclear Fusion Simulations. Proceedings of the 53rd International Conference on Parallel Processing (294-304). DOI: 10.1145/3673038.3673130. Online publication date: 12-Aug-2024
  • (2024) Collection skeletons. Journal of Systems and Software 213:C. DOI: 10.1016/j.jss.2024.112042. Online publication date: 1-Jul-2024
  • (2024) Enabling performance portability on the LiGen drug discovery pipeline. Future Generation Computer Systems 158:C (44-59). DOI: 10.1016/j.future.2024.03.045. Online publication date: 1-Sep-2024
  • (2023) Analysis of MURaM, a Solar Physics Application, for Scalability, Performance and Portability. Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis (1929-1938). DOI: 10.1145/3624062.3624606. Online publication date: 12-Nov-2023
  • (2023) View-aware Message Passing Through the Integration of Kokkos and ExaMPI. Proceedings of the 30th European MPI Users' Group Meeting (1-10). DOI: 10.1145/3615318.3615321. Online publication date: 11-Sep-2023
  • (2023) MOD2IR: High-Performance Code Generation for a Biophysically Detailed Neuronal Simulation DSL. Proceedings of the 32nd ACM SIGPLAN International Conference on Compiler Construction (203-215). DOI: 10.1145/3578360.3580268. Online publication date: 17-Feb-2023
  • (2023) A Case Study on DaCe Portability & Performance for Batched Discrete Fourier Transforms. Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region (55-63). DOI: 10.1145/3578178.3578239. Online publication date: 27-Feb-2023
  • (2023) Supporting efficient overlapping of host-device operations for heterogeneous programming with CtrlEvents. Journal of Parallel and Distributed Computing 179:C. DOI: 10.1016/j.jpdc.2023.04.009. Online publication date: 1-Sep-2023
  • (2022) On the Compilation Performance of Current SYCL Implementations. Proceedings of the 10th International Workshop on OpenCL (1-12). DOI: 10.1145/3529538.3529548. Online publication date: 10-May-2022
