Top "Ptx" questions

Parallel Thread Execution (PTX) is a low-level virtual machine and instruction set architecture (ISA) used in NVIDIA's CUDA programming environment.

CUDA: How to use -arch and -code and SM vs COMPUTE

I am still not sure how to properly specify the architectures for code generation when building with nvcc. I am …

cuda nvcc ptx fat-binaries
What is the purpose of using multiple "arch" flags in Nvidia's NVCC compiler?

I've recently gotten my head around how NVCC compiles CUDA device code for different compute architectures. From my understanding, when …

cuda nvcc ptx
How to compile PTX code

I need to modify the PTX code and compile it directly. The reason is that I want to have some …

cuda nvcc ptx
Funnel shift - what is it?

When reading through the CUDA 5.0 Programming Guide I stumbled on a feature called "funnel shift", which is present in devices of compute capability 3.5, …

cuda intrinsics ptx