Search code examples
_mm_load_si128 loads data in reverse order...


c, sse, simd, sse2

How to convert scalar code of the double version of VDT's Pade Exp fast_ex() approx into SSE2?...


c++, sse, intrinsics, sse2, exp

Better way to store or extract scalar int result using SSE2 intrinsic...


c, sse, intrinsics, sse2

SSE2 registers in x86 assembly...


assembly, x86, sse2

SSE4.1 unsigned integer comparison with overflow...


assembly, x86, simd, sse2, sse4

How to add to variable using SSE2?...


c++, sse2

Simulating packusdw functionality with SSE2...


x86, sse, intrinsics, sse2, sse4

How to load two packed 64-bit quadwords into a 128-bit xmm register...


assembly, x86, language-agnostic, sse2

How do you do signed 32bit widening multiplication on SSE2?...


assembly, sse, simd, webassembly, sse2

how would you optimize this vectorized sum of harmonics?...


c++, optimization, vectorization, simd, sse2

How to copy X bytes or bits from an __m128i into standard memory...


c++, sse, simd, intrinsics, sse2

Is it possible to use SSE and SSE2 to make a 128-bit wide integer?...


assembly, sse, sse2

Test if any byte in an xmm register is 0...


string, assembly, x86, simd, sse2

How to divide 16-bit integer by 255 with using SSE?...


c++, image-processing, sse, simd, sse2

How to declare __m128i constant in MASM?...


assembly, x86-64, masm, sse2

SSE2 test xmm bitmask directly without using 'pmovmskb'...


assembly, bitmask, sse2

latency for 'pcmpeqb' - memory vs xmm register...


assembly, optimization, sse, micro-optimization, sse2

Penalty for switching from SSE to AVX?...


c++, sse, avx, sse2

How can a SSE2 function be missing from the header it is supposed to be in?...


c++, visual-studio, visual-c++, simd, sse2

Detect the availability of SSE/SSE2 instruction set in Visual Studio...


c++, visual-studio, x86, sse, sse2

SSE2 and inline assembly insert structur...


c, gcc, simd, inline-assembly, sse2

How to store the contents of a __m128d simd vector as doubles without accessing it as a union?...


c, x86, simd, intrinsics, sse2

Fast counting the number of equal bytes between two arrays...


c++, c, sse, simd, sse2

How to convert a ps vector of 4 float to 4 doubles and store to a pd array?...


c++, type-conversion, simd, sse2

How to floor/int in double using only SSE2?...


c++, simd, truncate, intrinsics, sse2

How do you process exp() with SSE2?...


c++, simd, intrinsics, sse2, exp

What the method to manage Cleanup Code loop for a cumulative (single) value packed into two values u...


c++, loops, simd, sse2

Why can't I use _mm_sin_pd?...


c++, sse, simd, trigonometry, sse2

What's wrong in this SSE2 transposition?...


c++, vectorization, intrinsics, sse2

Using % with SSE2?...


c++, intrinsics, sse2
