CUDA Examples

This repo contains a collection of CUDA examples that were first used for a talk at the Melbourne C++ Meetup.

Listing

00-hello-world.cu - Vector addition on a CPU; the hello world of the parallel computing
01-cuda-hello-world.cu - Vector addition using CUDA
02-cuda-hello-world-faster.cu - Vector addition using CUDA, with a CPU bottleneck removed
03-templates-device-ptr.cu - Using C++ templates to make device pointers safer
04-templates-device-mem.cu - More templates, allowing for scoped device memory
05-thrust-rand-vectors.cu - Using Thrust to add up random numbers

Compiling

All examples can be compiled with nvcc. Only 02-cuda-hello-world-faster.cu requires an additional compiler option --expt-relaxed-constexpr (at least, when compiled on Linux).

A Makefile has been included, so all examples can built using make.

Credits

These are all based on examples found in the wild. 03 and 04, in particular, are based on code from Michael Gopshtein's CppCon talk, CUDA Kernels in C++. And examples 01 and 02 are based on the Vector Addition sample code included in the CUDA Toolkit.

License

This source code has been released into the public domain.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CUDA Examples

Listing

Compiling

Credits

License

About

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
00-hello-world.cu		00-hello-world.cu
01-cuda-hello-world.cu		01-cuda-hello-world.cu
02-cuda-hello-world-faster.cu		02-cuda-hello-world-faster.cu
03-templates-device-ptr.cu		03-templates-device-ptr.cu
04-templates-device-mem.cu		04-templates-device-mem.cu
05-thrust-rand-vectors.cu		05-thrust-rand-vectors.cu
Makefile		Makefile
README.md		README.md

tristanpenman/cuda-examples

Folders and files

Latest commit

History

Repository files navigation

CUDA Examples

Listing

Compiling

Credits

License

About

Topics

Resources

Stars

Watchers

Forks

Languages