F16C explained

The F16C[1] (previously/informally known as CVT16) instruction set is an x86 instruction set architecture extension which provides support for converting between half-precision and standard IEEE single-precision floating-point formats.

History

The CVT16 instruction set, announced by AMD on May 1, 2009,[2] is an extension to the 128-bit SSE core instructions in the x86 and AMD64 instruction set.

CVT16 is a revision of part of the SSE5 instruction set proposal announced on August 30, 2007, which is supplemented by the XOP and FMA4 instruction sets. This revision makes the binary coding of the proposed new instructions more compatible with Intel's AVX instruction extensions, while the functionality of the instructions is unchanged.

In recent documents, the name F16C is formally used in both Intel and AMD x86-64 architecture specifications.

Technical information

There are variants that convert four floating-point values in an XMM register or 8 floating-point values in a YMM register.

The instructions are abbreviations for "vector convert packed half to packed single" and vice versa:

The 8-bit immediate argument to VCVTPS2PH selects the rounding mode. Values 0–4 select nearest, down, up, truncate, and the mode set in MXCSR.RC.

Support for these instructions is indicated by bit 29 of ECX after CPUID with EAX=1.

CPUs with F16C

External links

Notes and References

  1. Web site: DirectXMath: F16C and FMA. Chuck Walbourn. September 11, 2012.
  2. News: 128-Bit and 256-Bit XOP, FMA4 and CVT16 Instructions . https://web.archive.org/web/20090520223024/http://support.amd.com/us/Processor_TechDocs/43479.pdf . 2009-05-20 . 2022-07-05 . AMD64 Architecture Programmer's Manual . 6 . 2009-05-01 .