Intel 64 and IA-32 Architectures Software Developer's Manual, Volume 2B: Instruction Set Reference, N-Z

SQRTPS—Compute Square Roots of Packed Single-Precision Floating-
Point Values
Description
Performs a SIMD computation of the square roots of the four packed single-precision
floating-point values in the source operand (second operand) and stores the packed
single-precision floating-point results in the destination operand. The source operand
can be an XMM register or a 128-bit memory location. The destination operand is an
XMM register. See Figure 10-5 in the Intel® 64 and IA-32 Architectures Software
Developer’s Manual, Volume 1, for an illustration of a SIMD single-precision
floating-point operation.
In 64-bit mode, using a REX prefix in the form of REX.R permits this instruction to
access additional registers (XMM8-XMM15).
Operation
DEST[31:0] ← SQRT(SRC[31:0]);
DEST[63:32] ← SQRT(SRC[63:32]);
DEST[95:64] ← SQRT(SRC[95:64]);
DEST[127:96] ← SQRT(SRC[127:96]);
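The four assignments above are independent of one another. As a minimal sketch (not part of the manual), the C code below models the operation lane by lane with the scalar square-root intrinsic and compares the result with the packed form. The function names sqrtps_model and sqrtps_matches_model are hypothetical, and the code assumes an x86 compiler that provides <xmmintrin.h>, with array element 0 mapping to bits 31:0.

```c
#include <string.h>
#include <xmmintrin.h>   /* SSE intrinsics */

/* Hypothetical lane-by-lane model of the Operation section:
   element i of the arrays corresponds to bits [32*i+31 : 32*i]. */
void sqrtps_model(const float src[4], float dest[4])
{
    int lane;
    for (lane = 0; lane < 4; lane++) {
        /* DEST[lane] <- SQRT(SRC[lane]), computed with scalar SQRTSS */
        dest[lane] = _mm_cvtss_f32(_mm_sqrt_ss(_mm_set_ss(src[lane])));
    }
}

/* Returns 1 if the packed SQRTPS result is bit-identical to the model. */
int sqrtps_matches_model(const float src[4])
{
    float packed[4], model[4];

    _mm_storeu_ps(packed, _mm_sqrt_ps(_mm_loadu_ps(src)));
    sqrtps_model(src, model);
    return memcmp(packed, model, sizeof packed) == 0;
}
```

Comparing bit patterns rather than float values is a design choice of this sketch: it also catches differences in rounding of the last bit.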
Intel C/C++ Compiler Intrinsic Equivalent
SQRTPS __m128 _mm_sqrt_ps(__m128 a)
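As an illustration of how the intrinsic might be used, the sketch below takes the square root of an array of floats four elements at a time and handles any leftover elements with the scalar intrinsic. The function name sqrt_array is hypothetical. Unaligned loads and stores are used so that the caller's buffers need no particular alignment, and whether the compiler emits SQRTPS with a register or a memory source operand is left to the compiler.

```c
#include <stddef.h>
#include <xmmintrin.h>   /* _mm_sqrt_ps and friends */

/* Hypothetical helper: out[i] = sqrt(in[i]) for i in [0, n). */
void sqrt_array(const float *in, float *out, size_t n)
{
    size_t i = 0;

    /* Main loop: one packed square root per group of four floats. */
    for (; i + 4 <= n; i += 4)
        _mm_storeu_ps(out + i, _mm_sqrt_ps(_mm_loadu_ps(in + i)));

    /* Tail: remaining 0 to 3 elements, one scalar square root each. */
    for (; i < n; i++)
        out[i] = _mm_cvtss_f32(_mm_sqrt_ss(_mm_set_ss(in[i])));
}
```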
SIMD Floating-Point Exceptions
Invalid, Precision, Denormal.
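One way to observe these conditions from C is to read the sticky exception flags in the MXCSR register after the operation. The sketch below is an illustration only: the function name sqrtps_exception_flags is hypothetical, and it assumes the SIMD floating-point exceptions are masked (the usual default), so the flags are set without raising a fault. The volatile copies are there only to keep the compiler from evaluating the square root at compile time or reordering it around the MXCSR accesses.

```c
#include <xmmintrin.h>   /* SSE intrinsics and MXCSR helper macros */

/* Hypothetical helper: runs one packed square root on in[0..3] and
   returns the MXCSR exception flags that the operation left set. */
unsigned sqrtps_exception_flags(const float in[4])
{
    volatile float vin[4];   /* forces the inputs to be read at run time */
    volatile float sink;     /* forces the result to be computed */
    float tmp[4];
    int i;

    for (i = 0; i < 4; i++)
        vin[i] = in[i];

    _MM_SET_EXCEPTION_STATE(0);          /* clear the sticky flags */

    for (i = 0; i < 4; i++)
        tmp[i] = vin[i];
    _mm_storeu_ps(tmp, _mm_sqrt_ps(_mm_loadu_ps(tmp)));
    sink = tmp[0];
    (void)sink;

    return _MM_GET_EXCEPTION_STATE();    /* read the flags back */
}
```

The returned value can be tested against _MM_EXCEPT_INVALID, _MM_EXCEPT_INEXACT (the Precision flag), and _MM_EXCEPT_DENORM. For example, a negative input would be expected to set the Invalid flag, and an input such as 2.0, whose square root is not exactly representable, the Precision flag.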
Protected Mode Exceptions
#GP(0)            For an illegal memory operand effective address in the CS, DS,
                  ES, FS or GS segments.
                  If a memory operand is not aligned on a 16-byte boundary,
                  regardless of segment.
#SS(0)            For an illegal address in the SS segment.
#PF(fault-code)   For a page fault.
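The 16-byte alignment requirement on the memory operand is the condition most likely to surprise C programmers. As a hedged sketch, the code below checks a pointer's alignment and falls back to an unaligned load when the check fails. The names is_aligned16 and sqrt4_any_alignment are hypothetical. Note that _mm_load_ps carries its own 16-byte alignment requirement, and that the compiler, not the programmer, decides whether the load is folded into the memory operand of SQRTPS.

```c
#include <stdint.h>
#include <xmmintrin.h>   /* SSE intrinsics */

/* Hypothetical helper: returns 1 if p lies on a 16-byte boundary. */
int is_aligned16(const void *p)
{
    return ((uintptr_t)p & 15u) == 0;
}

/* Hypothetical helper: square roots of p[0..3] for any alignment of p. */
__m128 sqrt4_any_alignment(const float *p)
{
    /* Aligned load only when it is safe; unaligned load otherwise. */
    __m128 v = is_aligned16(p) ? _mm_load_ps(p) : _mm_loadu_ps(p);
    return _mm_sqrt_ps(v);
}
```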
Opcode     Instruction              64-Bit Mode   Compat/Leg Mode   Description
0F 51 /r   SQRTPS xmm1, xmm2/m128   Valid         Valid             Computes square roots of the packed
                                                                    single-precision floating-point values
                                                                    in xmm2/m128 and stores the results
                                                                    in xmm1.
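To make the opcode entry concrete, the sketch below assembles the bytes for the register-to-register form of the instruction. It is an illustration under stated assumptions, not an assembler: the function name encode_sqrtps_rr is hypothetical, memory operands are not handled, and register numbers 8 to 15 are only meaningful in 64-bit mode. The sketch sets REX.R for a high destination register and also sets REX.B for a high source register, which follows the general REX encoding rules rather than anything stated in this section.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: encodes "SQRTPS xmm<dst>, xmm<src>".
   Writes 3 or 4 bytes to out and returns the count, or 0 if a
   register number is out of range. */
size_t encode_sqrtps_rr(unsigned dst, unsigned src, uint8_t out[4])
{
    size_t n = 0;
    uint8_t rex = 0x40;                 /* REX prefix with no bits set */

    if (dst > 15 || src > 15)
        return 0;
    if (dst >= 8)
        rex |= 0x04;                    /* REX.R extends ModRM.reg */
    if (src >= 8)
        rex |= 0x01;                    /* REX.B extends ModRM.r/m */
    if (rex != 0x40)
        out[n++] = rex;                 /* emit REX only when needed */

    out[n++] = 0x0F;                    /* two-byte opcode 0F 51 */
    out[n++] = 0x51;
    /* ModRM: mod = 11 (register), reg = dst, r/m = src */
    out[n++] = (uint8_t)(0xC0 | ((dst & 7u) << 3) | (src & 7u));
    return n;
}
```

With this sketch, SQRTPS xmm1, xmm2 comes out as 0F 51 CA, and SQRTPS xmm8, xmm1 as 44 0F 51 C1.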