Popular "cuda" questions | Page 18

I have developed a Windows application that captures video from an external device using DirectShow. The image resolution is 640x480 …

encoding directshow cuda real-time opencl

I have quite a good understanding of how to allocate and copy linear memory with cudaMalloc() and cudaMemcpy(). However, when …

c++ cuda

I've googled for a while, and the only useful resources I found are github.com/barnex/cuda5 and mumax.github.io/. Unfortunately, the …

go cuda opencl archlinux hpc

I know that a CUDA GPU has multiprocessors, each of which contains CUDA cores. In my workplace I …

caching memory cuda textures

The CUDA documentation does not specify how many CUDA processes can share one GPU. For example, if I launch more than …

cuda gpu gpgpu nvidia

I have a kernel which uses 17 registers; reducing it to 16 would bring me 100% occupancy. My question is: are there methods …

optimization cuda gpgpu

I want to measure the time spent inside a GPU kernel. How can I measure it in NVIDIA CUDA? E.g. __global__ …

cuda gpu gpgpu nvidia

I use cudaMemcpy() one time to copy exactly 1GB of data to the device. This takes 5.9s. The other way …

cuda bus

CUDA provides built-in vector data types like uint2, uint4 and so on. Are there any advantages to using these data …

cuda abstract-data-type

The __shared__ memory in CUDA seems to require a known size at compile time. However, in my problem, the __shared__ …

cuda gpu-shared-memory
