Where are you getting the non-determinism part from? It would seem surprising fo...

TeMPOraL · on Feb 16, 2024

Large ML models tend to be uncorrectably non-deterministic simply from doing lots of floating point math in parallel. Addition and multiplication of floats is neither commutative nor associative - you may get different results depending on the order in which you add/multiply numbers.

eapriv · on Feb 17, 2024

Addition and multiplication of floats are commutative.

Gormo · on Feb 16, 2024

> It would seem surprising for there to be anything non-deterministic about an ML model like this

I think there may be some confusion of ideas going in here. Machine learning is fundamentally stochastic, so it is non-deterministic almost by definition.