Top "sse" questions

SSE (Streaming SIMD Extensions) was the first of many similarly named vector extensions to the x86 instruction set.

SSE instructions to add all elements of an array

I am new to SSE2 instructions. I have found an instruction _mm_add_epi8 which can add two array elements. …

c++ arrays sse simd sse2

Efficient 4x4 matrix multiplication (C vs assembly)

I'm looking for a faster and trickier way to multiply two 4x4 matrices in C. My current research is focused …

c optimization assembly sse matrix-multiplication

Fastest way to do horizontal vector sum with AVX instructions

I have a packed vector of four 64-bit floating-point values. I would like to get the sum of the vector's …

x86 sse simd avx vector-processing

How to move 128-bit immediates to XMM registers

There already is a question on this, but it was closed as "ambiguous" so I'm opening a new one - …

assembly x86 sse simd

Getting started with Intel x86 SSE SIMD instructions

I want to learn more about using SSE. What ways are there to learn, besides the obvious reading the …

c gcc x86 sse simd

How to divide floating-point number in x86 assembly?

When I try to write Heron's algorithm to compute the square root of the value in the ECX register, it doesn't work. It looks like the …

assembly x86 floating-point sse x87

Difference between MOVDQA and MOVAPS x86 instructions?

I'm looking at the Intel datasheet (Intel® 64 and IA-32 Architectures Software Developer’s Manual) and I can't find the difference between MOVDQA: …

assembly x86 sse simd mov

Should I use SIMD or vector extensions or something else?

I'm currently developing an open-source 3D application framework in C++ (with C++11). My own math library is designed like …

c++ gcc sse simd

Using SSE in C#: is it possible?

I was reading a question about C# code optimization and one solution was to use C++ with SSE. Is it …

c# sse

SSE, intrinsics, and alignment

I've written a 3D vector class using a lot of SSE compiler intrinsics. Everything worked fine until I started to …

c++ alignment sse intrinsics