Intel 64 and IA-32 Architectures Software Developers Manual Volume 1, Basic Architecture

Vol. 1 11-7

PROGRAMMING WITH STREAMING SIMD EXTENSIONS 2 (SSE2)

The scalar double-precision floating-point instructions operate on the low (least

significant) quadwords of two source operands (X0 and Y0), as shown in Figure 11-4.

The high quadword (X1) of the first source operand is passed through to the destina-

tion. The scalar operations are similar to the floating-point operations performed in

x87 FPU data registers with the precision control field in the x87 FPU control word set

for double precision (53-bit significand), except that x87 stack operations use a

15-bit exponent range for the result while SSE2 operations use an 11-bit exponent

range.

See Section 11.6.8, “Compatibility of SIMD and x87 FPU Floating-Point Data Types,”

for more information about obtaining compatible results when performing both

scalar double-precision floating-point operations in XMM registers and in x87 FPU

data registers.

Figure 11-3. Packed Double-Precision Floating-Point Operations

Figure 11-4. Scalar Double-Precision Floating-Point Operations

X1 X0

X1 OP Y1 X0 OP Y0

Y1 Y0

X1 X0

X1 X0 OP Y0

Y1 Y0