Top "Cuda" questions

CUDA (Compute Unified Device Architecture) is a parallel computing platform and programming model for NVIDIA GPUs (Graphics Processing Units).

Implementing a critical section in CUDA

I'm trying to implement a critical section in CUDA using atomic instructions, but I ran into some trouble. I have …

cuda synchronization locking critical-section
cudaStreamSynchronize vs CudaDeviceSynchronize vs cudaThreadSynchronize

What is the difference between these three functions, especially the last two? The library manual says Note that this function …

cuda
cuda shared library linking: undefined reference to cudaRegisterLinkedBinary

Goal: create a shared library containing my CUDA kernels that has a CUDA-free wrapper/header. create a test executable for …

c++ cuda makefile shared-libraries nvcc
efficient way of cuda file organization: .cpp .h .cu .cuh .curnel files

What is the most easy to understand/efficient etc. code organization for cuda. After some investigation i found that cuda …

c++ header cuda project organization
CUDA linking error - Visual Express 2008 - nvcc fatal due to (null) configuration file

I've been searching extensively for a possible solution to my error for the past 2 weeks. I have successfully installed the …

c++ visual-c++ cuda windows-xp linker-errors
What is CUDA like? What is it for? What are the benefits? And how to start?

I am interested in developing under some new technology and I was thinking in trying out CUDA. Now... their documentation …

cuda nvidia gpu-programming
How can I get number of Cores in cuda device?

I am looking for a function that count number of core of my cuda device. I know each microprocessor have …

c cuda nvidia
CUDA and nvcc: using the preprocessor to choose between float or double

The problem: Having a .h, I want to define real to be double if compiling for c/c++ or for …

cuda preprocessor nvcc
Compile cuda code for CPU

I'm study cuda 5.5 but i don't have any Nvidia GPU. In old version of nvcc have a flag --multicore to …

cuda nvidia nvcc
CUDA Block and Grid size efficiencies

What is the advised way of dealing with dynamically-sized datasets in cuda? Is it a case of 'set the block …

optimization cuda gpgpu