[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

CUDA implementation of vector additon, matrix multiplication, reduction and sorting

Notifications You must be signed in to change notification settings

denyskryvytskyi/capgemini-cuda

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 

Repository files navigation

Capgemini CUDA tasks

Tasks were implemented and tested in Windows 10 with the Visual Studio CUDA Integration tool and NVCC compiler.

Tasks list:

  • vectors addition;
  • matrix multiplication using tiles, GPU shared memory, and matrix transposition;
  • reduction (sum) with a custom kernel and Nvidia Thrust library for performance comparison;
  • sorting using the Nvidia Thrust library.

Getting Started

  • Install CUDA Toolkit.
  • [Windows] Install Visual Studio 2022 (we need MSVC compiler to compile host code and link with device code for the final executable).
  • [Linux] Install gcc compiler.

Compile code using nvcc compiler:

nvcc <program_name>.cu -o <program_name> -O3

Run: ./<program_name>