Knowing what portion of the FLOPs are in the tensor cores isn't quite the right ...

smallmancontrov · 2024-05-13T14:05:18 1715609118

Mind the Dark Silicon Fraction.

Some fraction of your transistors MUST go unused on average or you melt the silicon. This was already a thing in the 20nm days and I'm sure it has only gotten worse. 100% TDP utilization might correspond to 60% device utilization.

wtallis · 2024-05-13T14:32:10 1715610730

That's true for CPUs. Does it really apply to GPUs and other accelerators for embarrassingly parallel problems where going slower but wider is always a valid option?

incrudible · 2024-05-13T10:03:50 1715594630

There is not a lot of fixed function left in the modern graphics pipeline, economics of scale dictate that there is no net benefit in trimming it.

wtallis · 2024-05-13T14:43:27 1715611407

And yet, even NVIDIA does trim it from chips like the H100, which has no display outputs, RT cores, or video encoders (though they keep the decoders), and only has ROPs for two of the 72 TPCs.