Top "Cuda" questions

CUDA (Compute Unified Device Architecture) is a parallel computing platform and programming model for NVIDIA GPUs (Graphics Processing Units).

SLI for multiple GPUs

I am new to CUDA programming, and I am working on a problem that requires multiple GPUs in one machine. …

cuda gpu sli
TensorFlow GPU: is cudnn optional? Couldn't open CUDA library libcudnn.so

I installed the tensorflow-0.8.0 GPU version, tensorflow-0.8.0-cp27-none-linux_x86_64.whl. It says it requires CUDA toolkit 7.5 and CuDNN v4. # …

tensorflow cuda gpu cudnn
Why does the performance of my #pragma-unrolled loop degrade if the trip count is not constant?

I have following code using loop unrolling: #pragma unroll for (int i=0;i<n;i++) { .... } here if n is …

loops cuda unroll
When to use volatile with shared CUDA Memory

Under what circumstances should you use the volatile keyword with a CUDA kernel's shared memory? I understand that volatile tells …

compiler-construction cuda gpu gpgpu volatile
Why does cudaMalloc() use pointer to pointer?

For example, cudaMalloc((void**)&device_array, num_bytes); This question has been asked before, and the reply was "because …

c++ c pointers cuda
Gcc versions later than 7 are not supported by CUDA 10 - Qt Error in Arch Linux

I am running Arch Linux and trying to build a project in Qt however, Qt spits the following error: /opt/…

qt gcc cuda g++ archlinux
error : conflicting declaration for uint32_t

When I try to compile my program first I got this error: error: ‘uint32_t’ does not name a type …

c++ cuda typedef uint32 uint32-t
Atomic Operations in CUDA? Which header file to include?

For using atomic operations in CUDA, is it necessary to include some CUDA header file? The CUDA programming guide seems …

cuda gpu-atomics
Parallel GPU computing using OpenCV

I have an application that requires processing multiple images in parallel in order to maintain real-time speed. It is my …

opencv parallel-processing cuda gpgpu
pyCUDA vs C performance differences?

I'm new to CUDA programming and I was wondering how the performance of pyCUDA is compared to programs implemented in …

c cuda pycuda