The whole premise rests on the assumption that over-investing in GPUs and models is a good thing here because it yields more 'intelligence'.
As it turned out, this was not true for railroads: building more and more railroads isn't a good thing.
The real dilemma facing the model producers is that all this money invested in a general model, targeting general intelligence, is a disaster, and the investment in existing assets is essentially a write-off. On top of that, if this is true, you've got data centres full of compute that isn't being used.