So if it's a vector math op (SSE, etc.), it's really two 64-bit doubles and not a "long double". A long double would mean more than 64 bits of precision in a single value. What you have is "packed data", not a "long double". Does your CPU have 128-bit FP math or not? Does your compiler emulate 128-bit FP math (GCC does, as __float128)?
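To make the packed-vs-wide distinction concrete, here is a rough sketch in C. It assumes GCC or Clang, since it uses their `vector_size` extension rather than real SSE intrinsics, and all the function names are made up for illustration. The 16-byte type holds two separate doubles, so each lane still has only the precision of a plain double.

```c
#include <assert.h>
#include <float.h>
#include <stddef.h>

/* GCC/Clang vector extension (an assumption, not standard C):
   16 bytes holding two packed 64-bit doubles. */
typedef double v2df __attribute__((vector_size(16)));

/* Adds two packed vectors lane by lane and returns one lane of the sum. */
double packed_add_lane(double a0, double a1, double b0, double b1, int lane)
{
    assert(lane == 0 || lane == 1);
    v2df a = {a0, a1};
    v2df b = {b0, b1};
    v2df sum = a + b;  /* one vector add, two independent 64-bit results */
    return sum[lane];
}

/* Total size of the packed type in bytes. */
size_t packed_size_bytes(void) { return sizeof(v2df); }

/* Significand bits available in each lane: same as a plain double. */
int packed_lane_mantissa_bits(void) { return DBL_MANT_DIG; }

/* Significand bits of long double: depends on the platform and compiler. */
int long_double_mantissa_bits(void) { return LDBL_MANT_DIG; }
```

Comparing `packed_lane_mantissa_bits()` with `long_double_mantissa_bits()` on your own machine should show whether long double actually gives you anything beyond double there; the 128-bit width of the packed type says nothing about precision.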