HP-UX Floating-Point Guide



Floating-Point Principles and the IEEE Standard for Binary Floating-Point Arithmetic

Floating-Point Formats

Table 2-2 Minimum and Maximum Positive Denormalized Values

Infinity

Values that are larger in magnitude than the maximum-magnitude
normalized values are approximated by special bit patterns that
represent positive and negative infinity.

According to the IEEE standard, an infinity is represented by setting all
the bits in the exponent field to 1 (an exponent field value of 255 for
single-precision, 2047 for double-precision, and 32767 for quad-precision)
and setting all the fraction bits to 0. There are two infinity values:
negative infinity if the sign bit is 1 and positive infinity if the sign
bit is 0.
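The encoding described above can be checked by building the bit patterns directly. The following is a minimal sketch in Python, not code from this guide; it assumes the host uses IEEE single and double formats for the `struct` codes `f` and `d`, and the helper names (`bits_to_single`, `single_fields`, and so on) are illustrative only. Quad-precision is omitted because Python has no built-in 128-bit floating-point type.

```python
import math
import struct


def bits_to_single(bits: int) -> float:
    """Reinterpret a 32-bit pattern as an IEEE single-precision value."""
    return struct.unpack(">f", struct.pack(">I", bits))[0]


def bits_to_double(bits: int) -> float:
    """Reinterpret a 64-bit pattern as an IEEE double-precision value."""
    return struct.unpack(">d", struct.pack(">Q", bits))[0]


def single_fields(bits: int) -> tuple:
    """Split a 32-bit pattern into (sign, exponent field, fraction)."""
    return bits >> 31, (bits >> 23) & 0xFF, bits & 0x7FFFFF


def double_fields(bits: int) -> tuple:
    """Split a 64-bit pattern into (sign, exponent field, fraction)."""
    return bits >> 63, (bits >> 52) & 0x7FF, bits & 0xFFFFFFFFFFFFF


# Exponent field all ones, fraction all zeros; only the sign bit differs.
SINGLE_POS_INF = 0x7F800000          # sign 0, exponent 255, fraction 0
SINGLE_NEG_INF = 0xFF800000          # sign 1, exponent 255, fraction 0
DOUBLE_POS_INF = 0x7FF0000000000000  # sign 0, exponent 2047, fraction 0
DOUBLE_NEG_INF = 0xFFF0000000000000  # sign 1, exponent 2047, fraction 0

if __name__ == "__main__":
    for bits in (SINGLE_POS_INF, SINGLE_NEG_INF):
        print(f"{bits:08X}", single_fields(bits), bits_to_single(bits))
    for bits in (DOUBLE_POS_INF, DOUBLE_NEG_INF):
        print(f"{bits:016X}", double_fields(bits), bits_to_double(bits))
```

Decoding the pattern rather than calling a library constant makes the role of each field visible: changing any fraction bit in these patterns would no longer produce an infinity.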

The IEEE standard defines the properties of infinities. For example, it
defines what happens when you add a number to an infinity or subtract
one infinity from another. Table 2-3 shows some of these properties. The
term "finite value" in the table refers to any floating-point value other
than infinity or NaN (see "Not-a-Number (NaN)" on page 45 for
information about NaN values). For the multiplication and division
operators, the sign of the result is determined by the usual arithmetic
rules.
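A few of these properties can be observed directly. The sketch below is an illustration in Python, assuming its `float` type is an IEEE double; it shows standard IEEE results and is not a reproduction of Table 2-3. The variable names are illustrative only.

```python
import math
import sys

INF = math.inf
LARGEST_DOUBLE = sys.float_info.max   # maximum-magnitude normalized double

# A result too large to represent is approximated by infinity.
overflow = LARGEST_DOUBLE * 2.0

# Adding a finite value to an infinity leaves it unchanged.
inf_plus_finite = INF + 1.0e10

# Subtracting one infinity from another has no meaningful result: NaN.
inf_minus_inf = INF - INF

# For multiplication and division, the usual sign rules apply.
neg_product = INF * -2.0      # signs differ, so the result is negative
pos_product = -INF * -2.0     # signs agree, so the result is positive
neg_zero = -3.0 / INF         # a finite value over infinity is a signed zero

if __name__ == "__main__":
    print(overflow, inf_plus_finite, inf_minus_inf)
    print(neg_product, pos_product, neg_zero)
```

Division of a finite value by zero is deliberately left out: Python raises `ZeroDivisionError` for it instead of returning an infinity, which is a property of the language and not of the IEEE standard.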

Precision  Values                Hexadecimal Representation   Value

Single     Minimum denormalized  0000 0001                    2^−149
           Maximum denormalized  007F FFFF                    2^−149 * (2^23 − 1)
           Minimum normalized    0080 0000                    2^−126

Double     Minimum denormalized  0000 0000 0000 0001          2^−1074
           Maximum denormalized  000F FFFF FFFF FFFF          2^−1074 * (2^52 − 1)
           Minimum normalized    0010 0000 0000 0000          2^−1022

Quad       Minimum denormalized  (24 zeros)…0000 0001         2^−16494
           Maximum denormalized  0000 FFFF…(24 more F's)      2^−16494 * (2^112 − 1)
           Minimum normalized    0001 0000…(24 more zeros)    2^−16382
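The single-precision and double-precision rows of Table 2-2 can be checked by decoding each hexadecimal pattern and comparing it with the corresponding power of two (for example, 2^−149 for the minimum single-precision denormalized value). The following is a minimal sketch in Python, assuming the host uses IEEE formats for the `struct` codes `f` and `d`; the names `bits_to_single`, `bits_to_double`, and `rows_match` are illustrative and do not come from this guide. The quad-precision rows are not checked because Python has no built-in 128-bit floating-point type.

```python
import math
import struct


def bits_to_single(bits: int) -> float:
    """Reinterpret a 32-bit pattern as an IEEE single-precision value."""
    return struct.unpack(">f", struct.pack(">I", bits))[0]


def bits_to_double(bits: int) -> float:
    """Reinterpret a 64-bit pattern as an IEEE double-precision value."""
    return struct.unpack(">d", struct.pack(">Q", bits))[0]


# (description, bit pattern, expected value); ldexp(m, e) computes m * 2**e.
SINGLE_ROWS = [
    ("minimum denormalized", 0x00000001, math.ldexp(1.0, -149)),
    ("maximum denormalized", 0x007FFFFF, math.ldexp(2.0**23 - 1, -149)),
    ("minimum normalized", 0x00800000, math.ldexp(1.0, -126)),
]
DOUBLE_ROWS = [
    ("minimum denormalized", 0x0000000000000001, math.ldexp(1.0, -1074)),
    ("maximum denormalized", 0x000FFFFFFFFFFFFF, math.ldexp(2.0**52 - 1, -1074)),
    ("minimum normalized", 0x0010000000000000, math.ldexp(1.0, -1022)),
]


def rows_match(rows, decode) -> bool:
    """Return True if every bit pattern decodes to its expected value."""
    return all(decode(bits) == value for _, bits, value in rows)


if __name__ == "__main__":
    for name, bits, value in SINGLE_ROWS:
        print(f"single {name}: {bits:08X} -> {bits_to_single(bits)!r}")
    for name, bits, value in DOUBLE_ROWS:
        print(f"double {name}: {bits:016X} -> {bits_to_double(bits)!r}")
    print(rows_match(SINGLE_ROWS, bits_to_single),
          rows_match(DOUBLE_ROWS, bits_to_double))
```

The comparisons are exact because every single-precision value, including the denormalized ones, is exactly representable as a double.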