Arithmetic underflow explained

The term arithmetic underflow (also floating point underflow, or just underflow) is a condition in a computer program where the result of a calculation is a number of more precise absolute value than the computer can actually represent in memory on its central processing unit (CPU).

Arithmetic underflow can occur when the true result of a floating point operation is smaller in magnitude (that is, closer to zero) than the smallest value representable as a normal floating point number in the target datatype.^[1] Underflow can in part be regarded as negative overflow of the exponent of the floating point value. For example, if the exponent part can represent values from -128 to 127, then a result with a value less than -128 may cause underflow.

For integers, the term "integer underflow" typically refers to a special kind of integer overflow or integer wraparound condition whereby the result of subtraction would result in a value less than the minimum allowed for a given integer type, i.e. the ideal result was closer to negative infinity than the output type's representable value closest to negative infinity.^[2] ^[3] ^[4] ^[5] ^[6]

Underflow gap

The interval between - zero and fminN, where fminN is the smallest positive normal floating point value, is called the underflow gap. This is because the size of this interval is many orders of magnitude larger than the distance between adjacent normal floating point values just outside the gap. For instance, if the floating point datatype can represent 20 bits, the underflow gap is 2²¹ times larger than the absolute distance between adjacent floating point values just outside the gap.^[7]

In older designs, the underflow gap had just one usable value, zero. When an underflow occurred, the true result was replaced by zero (either directly by the hardware, or by system software handling the primary underflow condition). This replacement is called "flush to zero".

The 1984 edition of IEEE 754 introduced subnormal numbers. The subnormal numbers (including zero) fill the underflow gap with values where the absolute distance between adjacent values is the same as for adjacent values just outside the underflow gap. This enables "gradual underflow", where a nearest subnormal value is used, just as a nearest normal value is used when possible. Even when using gradual underflow, the nearest value may be zero.^[8]

The absolute distance between adjacent floating point values just outside the gap is called the machine epsilon, typically characterized by the largest value whose sum with the value 1 will result in the answer with value 1 in that floating point scheme.^[9] This can be written as

fl(1+\epsilon)=fl(1)

, where

is a function which converts the real value into the floating point representation. While the machine epsilon is not to be confused with the underflow level (assuming subnormal numbers), it is closely related. The machine epsilon is dependent on the number of bits which make up the significand, whereas the underflow level depends on the number of digits which make up the exponent field. In most floating point systems, the underflow level is smaller than the machine epsilon.

Handling of underflow

The occurrence of an underflow may set a ("sticky") status bit, raise an exception, at the hardware level generate an interrupt, or may cause some combination of these effects.

As specified in IEEE 754, the underflow condition is only signaled if there is also a loss of precision. Typically this is determined as the final result being inexact.However, if the user is trapping on underflow, this may happen regardless of consideration for loss of precision. The default handling in IEEE 754 for underflow (as well as other exceptions) is to record as a floating point status that underflow has occurred. This is specified for the application-programming level, but often also interpreted as how to handle it at the hardware level.

Notes and References

Coonen. Jerome T. 206445847. An implementation guide to a proposed standard for floating-point arithmetic. Computer. 1980. 13. 1. 68–79. 10.1109/mc.1980.1653344.
Web site: CWE - CWE-191: Integer Underflow (Wrap or Wraparound) (3.1) . cwe.mitre.org.
Web site: Overflow And Underflow of Data Types in Java - DZone Java . dzone.com.
Web site: Integer Overflow/Underflow and Floating Point Imprecision . Mir . Tabish . 4 April 2017 . medium.com.
Web site: Integer underflow and buffer overflow processing MP4 metadata in libstagefright . Mozilla.
Web site: Avoiding Buffer Overflows and Underflows . developer.apple.com.
Book: Sun Microsystems. Numerical Computation Guide. 2005. Oracle. 21 April 2018.
Demmel. James. Underflow and the Reliability of Numerical Software. SIAM Journal on Scientific and Statistical Computing. 1984. 5. 4. 887–919. 10.1137/0905062.
Book: Heath. Michael T.. Scientific Computing. 2002. McGraw-Hill. New York. 0-07-239910-4. 20. Second.

Arithmetic underflow explained

Underflow gap

Handling of underflow

See also

Notes and References