Search code examples
Store four 16bit integers with SSE intrinsics...


c++ sse intrinsics sse2

Intel intrinsics support for Atom cloverview processor...


intel simd sse2 sse3 intel-atom

GDB is reporting EXC_BAD_ACCESS, when manipulating SSE2 registers...


c++ x86 gdb sse2 aes-ni

Using XMM0 register and memory fetches (C++ code) is twice as fast as ASM only using XMM registers -...


c++ performance optimization assembly sse2

accelerate rgb planar to rgba interleaved conversion using sse or mmx...


image-processing sse simd intrinsics sse2

Exception in statically linked msvcrt using Visual Studio 2012...


c++ visual-studio-2012 cpu-architecture msvcrt sse2

Moving a quadword number to xmm registers...


assembly x86-64 nasm sse2

Shift a __m128i of n bits...


c x86 sse simd sse2

How to optimize a cycle?...


c++ optimization assembly intrinsics sse2

How to vectorize a distance calculation using SSE2...


c++ visual-c++ optimization vectorization sse2

SSE2 instruction to convert a 8x16 register to two 4x32 registers having the even and odd indexed el...


c x86 intrinsics sse2

SSE2 instruction to typecast an integer register to short register and vice-versa...


x86 sse simd sse2

#error "SSE2 instruction set not enabled" when including <emmintrin.h>...


c++ linux cmake intel sse2

converting four floats in xmm3 to four ints in memory...


assembly x86 sse sse2

Best way to load a 64-bit integer to a double precision SSE2 register?...


assembly double sse sse2 int64

SIMD code runs slower than scalar code...


c optimization sse simd sse2

SQRT vs RSQRT vs SSE _mm_rsqrt_ps Benchmark...


c++ benchmarking simd sse2

SSE2 double multiplication slower than with standard multiplication...


c++ visual-c++ sse sse2

SSE: cast double** to _m128d**...


c++ sse sse2

Vectorized extraction of a specific pattern of shorts from an array, and also insertion into a new a...


c++ algorithm visual-c++ vectorization sse2

What's the most efficient way to multiply 4 floats by 4 floats using SSE?...


c gcc assembly sse sse2

SSE2 movddup Not Moving Values...


d sse sse2

SSE2, Visual Studio 2010, and Debug Build...


c++ visual-studio-2010 optimization sse2

How to align 16-bit ints for use with SSE intrinsics...


c sse simd memory-alignment sse2

SSE 2 function execution timing not constant and is more than normal...


c sse2

SSE instructions to add all elements of an array...


c++ arrays sse simd sse2

How to optimize this Delphi function with SSE2?...


delphi assembly x86 sse2 basm

What does the following assembly instruction do addsd -8(%rbp), %xmm0?...


x86 x86-64 sse2 assembly memory-access

Adding two __m128 types via Accelerate framework...


sse2 accelerate-framework

SIMD array add for arbitrary array lengths...


c arrays sse simd sse2
