Top "Gpgpu" questions

GPGPU is an acronym for the field of computer science known as "General Purpose computing on the Graphics Processing Unit (GPU)"

Why is the constant memory size limited in CUDA?

According to "CUDA C Programming Guide", a constant memory access benefits only if a multiprocessor constant cache is hit (Section 5.3.2.4)1. …

cuda gpgpu gpu-constant-memory
Should I unify two similar kernels with an 'if' statement, risking performance loss?

I have 2 very similar kernel functions, in the sense that the code is nearly the same, but with a slight …

c++ c optimization cuda gpgpu
When to use volatile with shared CUDA Memory

Under what circumstances should you use the volatile keyword with a CUDA kernel's shared memory? I understand that volatile tells …

compiler-construction cuda gpu gpgpu volatile
Getting started with PyOpenCL

I have recently discovered the power of GP-GPU (general purpose graphics processing unit) and want to take advantage of it …

python opencl gpgpu pyopencl
Parallel GPU computing using OpenCV

I have an application that requires processing multiple images in parallel in order to maintain real-time speed. It is my …

opencv parallel-processing cuda gpgpu
GPGPU vs. Multicore?

What are the key practical differences between GPGPU and regular multicore/multithreaded CPU programming, from the programmer's perspective? Specifically: What …

multithreading performance multicore gpgpu parallel-processing
cpu vs gpu - when cpu is better

I know many examples when GPU is much faster than CPU. But exists algorithms (problems) which are very hard to …

cpu gpgpu cpu-architecture
GPU Shared Memory Bank Conflict

I am trying to understand how bank conflicts take place. if i have an array of size 256 in global memory …

c++ cuda gpgpu bank-conflict
What is the context switching mechanism in GPU?

As I know, GPUs switch between warps to hide the memory latency. But I wonder in which condition, a warp …

cuda opencl gpu gpgpu
In OpenCL, what does mem_fence() do, as opposed to barrier()?

Unlike barrier() (which I think I understand), mem_fence() does not affect all items in the work group. The OpenCL …

opencl gpgpu memory-barriers barrier memory-fences