OpenACC CUDA Vector Add

GPU Teaching Kit – Accelerated Computing

Objective

Implement a vector addition using OpenACC directives.
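As a rough illustration of the objective, the sketch below shows one way a vector addition loop might be annotated with OpenACC. It is not the lab's template code. The function name vector_add and its signature are hypothetical, and the explicit copyin/copyout data clauses are one reasonable choice among several.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not taken from the lab template.
 * Computes c[i] = a[i] + b[i] for i in [0, n). */
void vector_add(const float *a, const float *b, float *c, int n) {
    assert(a != NULL && b != NULL && c != NULL && n >= 0);

    /* Ask the compiler to offload the loop. copyin moves the inputs to
     * the device before the loop; copyout moves the result back after. */
    #pragma acc parallel loop copyin(a[0:n], b[0:n]) copyout(c[0:n])
    for (int i = 0; i < n; ++i) {
        c[i] = a[i] + b[i];
    }
}
```

If the compiler is not invoked with OpenACC support enabled, the pragma is ignored and the loop runs serially on the host, which makes the same source easy to check on a machine without a GPU.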

Local Setup Instructions

The most recent version of the source code for this lab, along with the build scripts, can be found on the Bitbucket repository. A description of how to use the CMake tool and how to build the labs for local development can be found in the README document in the root of the repository.

The executable generated as a result of compiling the lab can be run using the following command:

./OpenAccVectorAdd_Template -e <expected.raw> \
  -i <input0.raw>,<input1.raw> -o <output.raw> -t vector

where <expected.raw> is the expected output, <input0.raw>,<input1.raw> are the two input datasets (comma-separated), and <output.raw> is an optional path to store the results. The datasets can be generated using the dataset generator built as part of the compilation process.

Local Development & Obtaining a PGI Compiler License

Using OpenACC directives requires access to the PGI OpenACC compiler. Please follow the instructions on the Bitbucket repository to download the tools, generate the license file, and install the license.