A more banal explanation is that compute is expensive, so they are tweaking the models to get more for less, and it isn't always working out. Scaling by itself is a hard problem; scaling while also improving efficiency (margins) is doubly so.
I don't think current trends are necessarily related to my root comment, but it raised the question of whether absolute secrecy of capability would be a good route forward.