Top "Cuda" questions

CUDA (Compute Unified Device Architecture) is a parallel computing platform and programming model for NVIDIA GPUs (Graphics Processing Units).

printf inside CUDA __global__ function

I am currently writing a matrix multiplication on a GPU and would like to debug my code, but since I …

c++ c cuda gpu-programming
CUDA determining threads per block, blocks per grid

I'm new to the CUDA paradigm. My question is in determining the number of threads per block, and blocks per …

cuda dimensions nvidia matrix-multiplication
How do CUDA blocks/warps/threads map onto CUDA cores?

I have been using CUDA for a few weeks, but I have some doubts about the allocation of blocks/warps/…

cuda gpgpu nvidia warp-scheduler
Difference between global and device functions

Can anyone describe the differences between __global__ and __device__ ? When should I use __device__, and when to use __global__?.

cuda
ImportError: libcublas.so.9.0: cannot open shared object file

currently I have cuda 8.0 and cuda 9.0 installed in Gpu support system. I ran into this error while importing from keras …

python-3.x tensorflow cuda keras
Error compiling CUDA from Command Prompt

I'm trying to compile a cuda test program on Windows 7 via Command Prompt, I'm this command: nvcc test.cu But …

cuda nvidia
What is the difference between cuda vs tensor cores?

I am completely new to terms related to HPC computing, but I just saw that EC2 released its new type …

cuda gpu nvidia
Different CUDA versions shown by nvcc and NVIDIA-smi

I am very confused by the different CUDA versions shown by running which nvcc and nvidia-smi. I have both cuda9.2 …

cuda
Streaming multiprocessors, Blocks and Threads (CUDA)

What is the relationship between a CUDA core, a streaming multiprocessor and the CUDA model of blocks and threads? What …

cuda nvidia
Coding CUDA with C#?

I've been looking for some information on coding CUDA (the nvidia gpu language) with C#. I have seen a few …

c# cuda