Top "Sse" questions

SSE (Streaming SIMD Extensions) was the first of many similarly-named vector extensions to the x86 instruction set.

Websocket transport reliability (Socket.io data loss during reconnection)

Used NodeJS, Socket.io Problem Imagine there are 2 users U1 & U2, connected to an app via Socket.io. The …

node.js websocket socket.io sse eventsource
Using AVX CPU instructions: Poor performance without "/arch:AVX"

My C++ code uses SSE and now I want to improve it to support AVX when it is available. So …

c++ performance visual-studio-2010 sse avx
GCC SSE code optimization

This post is closely related to another one I posted some days ago. This time, I wrote a simple code …

c optimization sse compiler-optimization hpc
How is a vector's data aligned?

If I want to process data in a std::vector with SSE, I need 16 byte alignment. How can I achieve …

c++ vector sse memory-alignment allocator
What's the difference among cflgs sse options of -msse, -msse2, -mssse3, -msse4 rtc..? and how to determine?

For the GCC CFLAGS options: -msse, -msse2, -mssse3, -msse4, -msse4.1, -msse4.2. Are they exclusive in their use or can they …

gcc g++ gnu sse compiler-flags
Do any JVM's JIT compilers generate code that uses vectorized floating point instructions?

Let's say the bottleneck of my Java program really is some tight loops to compute a bunch of vector dot …

java floating-point jit sse vectorization
Intel SSE and AVX Examples and Tutorials

Is there any good C/C++ tutorials or examples for learning Intel SSE and AVX instructions? I found few on …

intel sse vectorization avx
How to Calculate single-vector Dot Product using SSE intrinsic functions in C

I am trying to multiply two vectors together where each element of one vector is multiplied by the element in …

c optimization vectorization sse simd
print a __m128i variable

I'm trying to learn to code using intrinsics and below is a code which does addition compiler used: icc #include&…

c assembly sse simd intrinsics
Using AVX intrinsics instead of SSE does not improve speed -- why?

I've been using Intel's SSE intrinsics for quite some time with good performance gains. Hence, I expected the AVX intrinsics …

c++ performance gcc sse avx