Setting temperature to 0 does not make it completely deterministic, from their documentation:
> OpenAI models are non-deterministic, meaning that identical inputs can yield different outputs. Setting temperature to 0 will make the outputs mostly deterministic, but a small amount of variability may remain.
My understanding of LLMs is sub-par at best; could someone explain where the randomness comes from when the model temperature is 0?
I guess I was imagining that if temperature was 0, and the model was not being continuously trained, the weights wouldn’t change, and the output would be deterministic.
Is this a feature of LLMs more generally or has OpenAI more specifically introduced some other degree of randomness in their models?
It's not the LLM, but the hardware. GPU operations generally involve concurrency that makes them non-deterministic, unless you give up some speed to make them deterministic.
Specifically, as I understand it, the accumulated rounding error depends on the order in which floating-point operations complete and intermediate aggregates are combined. You can fix the aggregation order with wait conditions so it stays the same even when the completion order varies, but that trades efficient use of the available compute cores for determinism.
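To illustrate why order matters at all: floating-point addition is not associative, so the same numbers summed with different groupings can produce different results. This is a minimal sketch in plain Python (not GPU code), just demonstrating the underlying arithmetic property that makes reduction order visible in the output:

```python
# Floating-point addition is not associative: regrouping the same
# operands can change the result. A GPU reduction whose accumulation
# order varies between runs can therefore return slightly different
# sums for identical inputs.

# Classic rounding example: grouping changes the last bits.
a = (0.1 + 0.2) + 0.3   # 0.6000000000000001
b = 0.1 + (0.2 + 0.3)   # 0.6
print(a == b)           # False

# A more dramatic case: a small addend is absorbed by a large one.
c = (1e16 + 1.0) - 1e16  # the 1.0 is lost when added to 1e16 -> 0.0
d = (1e16 - 1e16) + 1.0  # cancellation happens first -> 1.0
print(c == d)            # False
```

In a parallel reduction the grouping is decided by which threads finish first, so tiny differences like these can accumulate across billions of operations and occasionally flip the argmax over logits, changing the sampled token even at temperature 0.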