OpenCL implementation of vector addition, matrix multiplication, reduction, and sorting

denyskryvytskyi/capgemini-opencl

Capgemini OpenCL tasks

Tasks were implemented and tested on:

  • Windows laptop: GPU: NVIDIA GTX 1050; CPU: Intel Core i7-7700HQ.
  • Linux AWS instance: GPU: NVIDIA Tesla M60; CPU: Intel Xeon CPU E5-2686.

Task list:

  • vector addition;
  • matrix multiplication using tiles, GPU shared memory, and matrix transposition;
  • reduction (sum);
  • sorting using a custom implementation of the Bitonic sort algorithm.
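
As an illustration of the last task, here is a minimal CPU-side sketch of a bitonic sorting network in C++. It is not the code from this repository: the function names `bitonic_step` and `bitonic_sort` are hypothetical, and the input length is assumed to be a power of two. Each iteration of the inner loop over `i` corresponds to what a single OpenCL work-item would do in one kernel launch, with `i` standing in for `get_global_id(0)`.

```cpp
#include <cstddef>
#include <stdexcept>
#include <utility>
#include <vector>

// One compare-exchange pass of the bitonic network (hypothetical helper).
// k is the size of the bitonic sequences being merged, j is the distance
// between the two elements compared in this pass.
void bitonic_step(std::vector<int>& data, std::size_t k, std::size_t j) {
    for (std::size_t i = 0; i < data.size(); ++i) {  // one "work-item" per i
        const std::size_t partner = i ^ j;
        if (partner <= i) {
            continue;  // each pair is handled once, by its lower index
        }
        // Blocks of size k alternate between ascending and descending order.
        const bool ascending = (i & k) == 0;
        const bool out_of_order = ascending ? (data[i] > data[partner])
                                            : (data[i] < data[partner]);
        if (out_of_order) {
            std::swap(data[i], data[partner]);
        }
    }
}

// Sorts data in ascending order. Assumption: data.size() is a power of two.
void bitonic_sort(std::vector<int>& data) {
    const std::size_t n = data.size();
    if (n == 0 || (n & (n - 1)) != 0) {
        throw std::invalid_argument("size must be a power of two");
    }
    for (std::size_t k = 2; k <= n; k <<= 1) {          // merge size
        for (std::size_t j = k >> 1; j > 0; j >>= 1) {  // compare distance
            // On a GPU, each call here would be a separate kernel launch.
            bitonic_step(data, k, j);
        }
    }
}
```

Within one pass every pair is independent of the others, which is why the algorithm maps well onto GPU work-items; the passes themselves have to run one after another.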

Getting Started

Both machines have NVIDIA GPUs with the CUDA Toolkit installed, so I used the OpenCL SDK that ships with the CUDA Toolkit.

  • Install the CUDA Toolkit, or install an OpenCL SDK separately.
  • [optional] Install the Intel CPU Runtime for OpenCL to enable OpenCL on Intel CPUs.
  • Check that the OpenCL .lib and header paths in the Linux makefile and the Windows solution are correct for linking.
  • Build and run the programs with Make on Linux or with Visual Studio 2022 on Windows.