Highlights
- Pro
Stars
GPU-modules
5 repositories
Tutorials for the Kokkos C++ Performance Portability Programming Ecosystem
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ⚡