CUDA Kernel Benchmarking Library
-
Updated
Nov 20, 2024 - Cuda
CUDA Kernel Benchmarking Library
From zero to hero CUDA for accelerating maths and machine learning on GPU.
Spiking Neural Networks in C++ with strong GPU acceleration through CUDA
CUDA kernel author's tools
High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline.
A tool for examining GPU scheduling behavior.
Speed up image preprocess with cuda when handle image or tensorrt inference
Using custom CUDA kernels with Open CV Mat objects.
CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable version of the Jacobi eigenvalue algorithm.
This is a Lattice-Boltzmann simulation using CUDA GPU graphics optimization.
CUDA Programming Practices
Get started with CUDA programming
Some common CUDA kernel implementations (Not the fastest).
Cahn Hilliard CUDA (Phase-Field Simulation of Spinodal Decomposition)
Allen Cahn CUDA (Phase-Field Simulation of Dendritic Solidification)
An extesnion for PHP allowing it to access GPU operations on CUDA graphics cards (NVIDIA)
🍕 Massively parallel DBSCAN algorithm implemented in CUDA.
Playing with CUDA and GPUs in Google Colab
Add a description, image, and links to the cuda-kernels topic page so that developers can more easily learn about it.
To associate your repository with the cuda-kernels topic, visit your repo's landing page and select "manage topics."