CUDA (Compute Unified Device Architecture) is a parallel computing platform and programming model for NVIDIA GPUs (Graphics Processing Units).
I am new to CUDA programming, and I am working on a problem that requires multiple GPUs in one machine. …
cuda gpu sliI installed the tensorflow-0.8.0 GPU version, tensorflow-0.8.0-cp27-none-linux_x86_64.whl. It says it requires CUDA toolkit 7.5 and CuDNN v4. # …
tensorflow cuda gpu cudnnI have following code using loop unrolling: #pragma unroll for (int i=0;i<n;i++) { .... } here if n is …
loops cuda unrollUnder what circumstances should you use the volatile keyword with a CUDA kernel's shared memory? I understand that volatile tells …
compiler-construction cuda gpu gpgpu volatileFor using atomic operations in CUDA, is it necessary to include some CUDA header file? The CUDA programming guide seems …
cuda gpu-atomicsI have an application that requires processing multiple images in parallel in order to maintain real-time speed. It is my …
opencv parallel-processing cuda gpgpuI'm new to CUDA programming and I was wondering how the performance of pyCUDA is compared to programs implemented in …
c cuda pycuda