Top "sse" questions

SSE (Streaming SIMD Extensions) was the first of many similarly named vector extensions to the x86 instruction set.

Speed up float 5x5 matrix * vector multiplication with SSE

I need to run a matrix-vector multiplication 240000 times per second. The matrix is 5x5 and is always the same, whereas …

c++ vectorization matrix-multiplication sse simd

Why is this SSE code 6 times slower without VZEROUPPER on Skylake?

I've been trying to figure out a performance problem in an application and have finally narrowed it down to a …

performance x86 intel sse avx

Detect the availability of SSE/SSE2 instruction set in Visual Studio

How can I check in code whether SSE/SSE2 is enabled or not by the Visual Studio compiler? I have …

c++ visual-studio x86 sse sse2

Fast counting the number of set bits in __m128i register

I need to count the number of set bits in a __m128i register. In particular, I need to write two functions …

c sse simd sse2 hammingweight

SSE multiplication 16 x uint8_t

I want to use SSE4 to multiply a __m128i object holding 16 unsigned 8-bit integers, but I could only find an …

x86 sse simd sse4

Using SSE instructions with gcc without inline assembly

I am interested in using the SSE vector instructions of x86-64 with gcc and don't want to use any …

c x86-64 sse simd intrinsics

Fast Vector Math in .NET - What are the options?

My 3D graphics software, written in C# using SlimDX, does a lot of vector operations on the CPU. (In this …

c# .net sse simd slimdx

Why is strcmp not SIMD optimized?

I've tried to compile this program on an x64 computer: #include <cstring> int main(int argc, char* argv[]) { …

c++ sse simd strcmp sse2

Fast method to copy memory with translation - ARGB to BGR

Overview: I have an image buffer that I need to convert to another format. The origin image buffer is four …

c x86 rgb sse micro-optimization

Can one construct a "good" hash function using CRC32C as a base?

Given that SSE 4.2 (Intel Core i7 & i5 parts) includes a CRC32 instruction, it seems reasonable to investigate whether one …

hash intel sse crc32