|
Weighted Vector Addition on Nvidia Cuda framework
|
|
High Level Overview of the code to follow (heterogeneous programming)
- Prepare memory by copying data from the CPU memory to the GPU memory
- Run the GPU code (kernel) on the GPU until it completes
- Copy back over the results to CPU memory