CUDA (Compute Unified Device Architecture) is a parallel computing platform and programming model for NVIDIA GPUs (Graphics Processing Units).
When I run nvidia-smi, it gives this error: Failed to initialize NVML: GPU access blocked by the operating system other …
cuda gpu nvidia

I am still not sure how to properly specify the architectures for code generation when building with nvcc. I am …
cuda nvcc ptx fat-binaries

I'm working on a statistical application containing approximately 10 to 30 million floating-point values in an array. Several methods performing different, …
c++ c cuda parallel-processing gpu

Can someone please help me with a very simple example on how to use shared memory? The example included in …
c cuda gpu

How do I allocate and transfer (to and from the host) 2D arrays in device memory in CUDA?
multidimensional-array memory-management 2d cuda device

When I go to /usr/local/cuda/samples/1_Utilities/deviceQuery and execute moose@pc09 /usr/local/cuda/samples/1_Utilities/deviceQuery $ …
cuda ubuntu-14.04 nvidia linux-mint

Does CUDA support double-precision floating-point numbers? Also, what are the reasons for this?
floating-point cuda gpu gpgpu

Is there a way in CUDA to allocate memory dynamically in device-side functions? I could not find any examples of …
memory-management cuda dynamic-memory-allocation

I've noticed that CUDA applications tend to have a rough maximum run-time of 5-15 seconds before they will fail and …
cuda timeout gpgpu gpu-programming