It's interesting today to see people act as though half (f16) is a completely normal, obvious type, whereas when I was first writing code nobody had this type; it was unheard of, and it was this weird new special case when the 3DFX Voodoo used it (hardware-accelerated 3D for the PC; as a special case, the "pass-through" for 2D video was, at first, a physical cable). The giveaway, if you're younger, is that it's sometimes called half precision. That's because single precision (32-bit floating point) was seen as the starting point years before.
I remember this whenever somebody says f128 will never be a thing, because if you'd asked me in 1990 whether f16 would be a thing I'd have laughed. What use is a 16-bit floating point number? It's not precise enough to be much use in the narrow range it can represent at all. In hindsight the application is obvious of course, but that's hindsight for you.
It's very popular in hardware-accelerated computer graphics. It has much more range, and a bit more precision, than the traditional integer 8-bits-per-channel representation of colour, so it is used for High Dynamic Range framebuffers and textures.
It's also ubiquitous as an arithmetic type in mobile GPU shaders, where it's used for things (like colours) that need to be floats but don't need full 32-bit precision. In many cases it doesn't just save memory bandwidth and register space, but also the shader core may have higher throughput for half precision.
Makes me wonder if there's a use case for "dynamic fixed point" numbers: say, for a 16-bit value, the upper 2 bits are one of four values that say where the point sits in the remaining 14. Say 0 (essentially an int), two spots in the middle, and 14 (all fraction). The CPU arithmetic for any operation is (bitshift + math), which should be an order of magnitude faster than any float operation. The range isn't nearly as dynamic, but it would allow for fractional influence. Maybe such a system would lack enough precision to be accurate?
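A rough C sketch of the idea (the type name, the 2-bit selector layout, and the 0/4/8/14 shift amounts are all just made-up choices for illustration, and overflow is ignored):

    #include <stdint.h>
    #include <stdio.h>

    /* Hypothetical "dynamic fixed point": top 2 bits select one of four
       scales for the remaining 14-bit two's-complement value. */
    static const int frac_bits[4] = {0, 4, 8, 14};

    typedef uint16_t dynfix;

    static dynfix dynfix_make(int sel, int16_t raw) {
        return (dynfix)(((sel & 0x3) << 14) | ((uint16_t)raw & 0x3FFF));
    }

    static int16_t dynfix_raw(dynfix x) {        /* sign-extend the low 14 bits */
        return (int16_t)(x << 2) >> 2;
    }

    static double dynfix_to_double(dynfix x) {
        return (double)dynfix_raw(x) / (1 << frac_bits[x >> 14]);
    }

    /* multiply is an integer multiply plus a shift by one operand's scale;
       the result keeps the other operand's scale */
    static dynfix dynfix_mul(dynfix a, dynfix b) {
        int32_t prod = ((int32_t)dynfix_raw(a) * dynfix_raw(b)) >> frac_bits[b >> 14];
        return dynfix_make(a >> 14, (int16_t)prod);
    }

    int main(void) {
        dynfix six     = dynfix_make(1, 6 << 4);  /* 6.0 with 4 fraction bits   */
        dynfix quarter = dynfix_make(3, 1 << 12); /* 0.25 with 14 fraction bits */
        printf("%g * %g = %g\n", dynfix_to_double(six), dynfix_to_double(quarter),
               dynfix_to_double(dynfix_mul(six, quarter)));   /* 6 * 0.25 = 1.5 */
        return 0;
    }

The frac_bits selector here is effectively playing the role of a (very coarse) exponent, which is what the reply below is pointing out.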
What you just described is exactly floating point numbers, you're just using a different split for the exponent and mantissa and not using the "integer part is zero" simplification.
Yeah, floating point is nothing more than your standard scientific notation of numbers, e.g.
digit.xyz... * 10 ^^ +/- some exponent
The exponent is simply shifting where the decimal point is. The only difference for floating point is that everything is base 2, because computers :D
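For a concrete example in C (frexp splits a double into a significand and a power of two; note it normalizes to the 0.x convention rather than the 1.x convention IEEE storage uses, so the exponent comes out one higher):

    #include <math.h>
    #include <stdio.h>

    int main(void) {
        int exp;
        /* 6.25 = 1.5625 * 2^2 in binary scientific notation;
           frexp reports the equivalent 0.78125 * 2^3 */
        double sig = frexp(6.25, &exp);
        printf("6.25 = %g * 2^%d\n", sig, exp);
        return 0;
    }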
Interestingly, you're right that a bunch of fp functions can be faster than their integer equivalents (although I'm still not convinced that this isn't simply due to the reduced number of bits involved), and, more fun, the relative performance of operations can actually change vs what it would be in integers. Also, this is in the context of doing it in software vs hardware, where again the perf costs of things change.
1. Since the normalized significand will always be 1.bbbb, the '1' bit is stripped from the significand representation, except:
2. To extend the range, the lowest 'zero' value of the exponent drops the leading '1'. This is referred to as the subnormal range
3. The highest exponent value, when the significand is zero, is used to represent positive and negative infinity
4. The highest exponent value with a non-zero significand is used to represent NaN
5. There are many different values usable for NaN by software, including a differentiation between 'quiet' NaNs and a (I believe implementation optional) 'signaling' variant, which will raise an interrupt when used. The idea is that these can be used to convey additional information, and that the signaling variant as well as the right interrupt handlers can be used to add additional functionality such as variable substitution.
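To make those rules concrete, here's a sketch of a binary16 decoder in C (bias 15, 10 stored significand bits; not a production converter, and it folds all NaN payloads into a single NAN):

    #include <math.h>
    #include <stdint.h>
    #include <stdio.h>

    static double half_to_double(uint16_t h) {
        int sign = h >> 15;
        int exp  = (h >> 10) & 0x1F;   /* 5-bit exponent field */
        int frac = h & 0x3FF;          /* 10 stored significand bits */
        double s = sign ? -1.0 : 1.0;

        if (exp == 0x1F)                       /* rules 3 and 4 */
            return frac == 0 ? s * INFINITY : NAN;
        if (exp == 0)                          /* rule 2: subnormal, no implicit 1 */
            return s * ldexp(frac / 1024.0, -14);
        return s * ldexp(1.0 + frac / 1024.0, exp - 15);   /* rule 1 */
    }

    int main(void) {
        printf("%g %g %g %g\n",
               half_to_double(0x3C00),   /* 1.0 */
               half_to_double(0x0001),   /* smallest subnormal, 2^-24 */
               half_to_double(0x7C00),   /* +inf */
               half_to_double(0x7C01));  /* NaN */
        return 0;
    }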
Yes, I was giving a simplified description to convey how to translate what floating point is to something people are more familiar with.
The technical details of how it handles _every_ case weren’t particularly relevant.
However, just to address 1 and 2 with some hilariousness (autocorrect wants this to be "hilarious mess", which may be more correct).
IEEE 754's 80-bit format was the first widely deployed format, and was largely used by Intel to get the other manufacturers to stop trying to reduce the functionality of IEEE floating point because "it couldn't be implemented, couldn't be implemented efficiently, etc.". However, because it came first, it has a quirk that was fixed for fp32, fp64, etc.
FP80 uses an explicit bit for the leading 1. That means it can encode 1.0 * 2 ^^ N, or 0.1 * 2 ^^ N; it should hopefully be immediately obvious why this could be a problem :)
Not only do the multiple representations of a single value result in sadness, they also give us a variety of concepts like pseudo-denormals, pseudo-normals, pseudo-infinities, pseudo-NaNs, etc., all of which cause their own problems.
Mercifully, the only hardware fp80 implementation now defaults (since the 387, maybe?) to just treating them as invalid and converting them to NaN. But you can set a flag to make it treat them as it did originally.
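A toy decode of the 80-bit layout (sign, 15-bit exponent with bias 16383, 64-bit significand whose top bit is the explicit integer bit) shows the redundancy; this just applies the original value mapping and ignores how modern hardware classifies these patterns, and fp80_value is a made-up helper, not anything the hardware exposes:

    #include <math.h>
    #include <stdint.h>
    #include <stdio.h>

    /* Value an fp80 bit pattern denotes under the original interpretation. */
    static double fp80_value(int sign, int biased_exp, uint64_t sig) {
        double m = (double)sig / 0x1p63;   /* bit 63, the explicit integer bit, has weight 1 */
        return (sign ? -1.0 : 1.0) * ldexp(m, biased_exp - 16383);
    }

    int main(void) {
        /* normal encoding of 1.0: 1.0 * 2^0, integer bit set */
        printf("%g\n", fp80_value(0, 16383, 0x8000000000000000ull));
        /* "unnormal" encoding of 1.0: 0.5 * 2^1, integer bit clear */
        printf("%g\n", fp80_value(0, 16384, 0x4000000000000000ull));
        return 0;
    }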
ML applications apparently don't need the full precision, but they do need very large amounts of them, and process enough of them that the perf win from fewer bits is meaningful and the cost of f16->f32 is large enough to also be meaningful.
Presumably they do benefit from the dynamic range as otherwise you'd think int16 would be sufficient, and not suffer the conversion costs.
float16 and float128 (as well as longer formats) were standardized in IEEE 754 in 2008. Half is now built into C# and many other languages, and it's gaining ground in hardware support.
I definitely don't think half precision is a completely normal, obvious type, except maybe in certain circles. None of the current mainstream languages support it as a primitive type, for one. No mainstream CPUs have hardware support for half precision.
C# supports it. CUDA supports it. ARM and related C/C++ compilers support it. Intel is adding hardware support in upcoming chips, so expect a lot more languages to add it.
Julia also supports it. It led to a really funny graph when doing performance tests on Fujitsu, because Julia is the only language that supports fp16 and is fast enough to write BLAS in, so there were some graphs where it was the only entry, because C and Fortran didn't bother showing up.
Most Intel and AMD CPUs produced in the last ~10 years have hardware support for converting 16-bit floats to and from 32-bit. That's not full hardware support of course, but you don't need it to be able to make good use of them. https://en.wikipedia.org/wiki/F16C
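For example, with GCC or Clang on x86 you can use the F16C intrinsics directly (compile with -mf16c); the usual pattern is to store as 16 bits and compute in 32:

    #include <immintrin.h>
    #include <stdio.h>

    int main(void) {
        unsigned short h = _cvtss_sh(3.14159f, 0);  /* 0 = round to nearest even */
        float back = _cvtsh_ss(h);
        printf("0x%04x -> %f\n", h, back);          /* 3.140625: only ~3 decimal digits survive */
        return 0;
    }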