Top "Simd" questions

Single instruction, multiple data (SIMD) is the concept of having each instruction operate on a small chunk or vector of data elements.

How to Calculate single-vector Dot Product using SSE intrinsic functions in C

I am trying to multiply two vectors together where each element of one vector is multiplied by the element in …

c optimization vectorization sse simd
print a __m128i variable

I'm trying to learn to code using intrinsics and below is a code which does addition compiler used: icc #include&…

c assembly sse simd intrinsics
SSE instructions to add all elements of an array

I am new to SSE2 instructions. I have found an instruction _mm_add_epi8 which can add two array elements. …

c++ arrays sse simd sse2
Fastest way to do horizontal vector sum with AVX instructions

I have a packed vector of four 64-bit floating-point values. I would like to get the sum of the vector's …

x86 sse simd avx vector-processing
How to move 128-bit immediates to XMM registers

There already is a question on this, but it was closed as "ambiguous" so I'm opening a new one - …

assembly x86 sse simd
Getting started with Intel x86 SSE SIMD instructions

I want to learn more about using the SSE. What ways are there to learn, besides the obvious reading the …

c gcc x86 sse simd
Why ARM NEON not faster than plain C++?

Here is a C++ code: #define ARR_SIZE_TEST ( 8 * 1024 * 1024 ) void cpp_tst_add( unsigned* x, unsigned* y ) { for ( register int …

c++ arm simd neon cortex-a8
CPU SIMD vs GPU SIMD?

GPU uses the SIMD paradigm, that is, the same portion of code will be executed in parallel, and applied to …

parallel-processing gpu cpu simd
Difference between MOVDQA and MOVAPS x86 instructions?

I'm looking Intel datasheet: Intel® 64 and IA-32 Architectures Software Developer’s Manual and I can't find the difference between MOVDQA: …

assembly x86 sse simd mov
Should I use SIMD or vector extensions or something else?

I'm currently develop an open source 3D application framework in c++ (with c++11). My own math library is designed like …

c++ gcc sse simd