Probably. There's good work showing that small models can perform impressively when trained on more "textbook"-like data rather than just loads of text, where the "textbooks" were either wholly or largely created by another AI model.
Using models to generate/score/rank/modify data to be more useful as training data is a very interesting angle.
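To make the score/rank part concrete, here is a minimal sketch of that kind of filtering step. Everything in it is an assumption for illustration: `toy_quality_score` is a hypothetical keyword heuristic standing in for a real model-based scorer, and `rank_and_filter` is just a made-up helper name, not anything from a specific paper or library.

```python
from typing import Callable, List, Tuple


def toy_quality_score(text: str) -> float:
    """Hypothetical stand-in for a model-based quality scorer.

    A real pipeline would ask a model how "textbook-like" the text is;
    here we only count a few explanatory words so the example runs offline.
    """
    words = text.split()
    if not words:
        return 0.0
    explanatory = {"because", "therefore", "example", "means"}
    hits = sum(1 for w in words if w.lower().strip(".,:;") in explanatory)
    return hits / len(words)


def rank_and_filter(
    docs: List[str], scorer: Callable[[str], float], keep: int
) -> List[Tuple[float, str]]:
    """Score every document, sort best-first, and keep the top `keep`."""
    scored = [(scorer(d), d) for d in docs]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:keep]


docs = [
    "click here buy now best price",
    "A loop repeats a block of code. For example, a for loop runs once "
    "per item because it iterates over a sequence.",
    "",
]

# Keep only the single highest-scoring document as training data.
selected = rank_and_filter(docs, toy_quality_score, keep=1)
for score, doc in selected:
    print(f"{score:.3f}  {doc[:40]}")
```

Swapping the heuristic for a call to an actual model is the interesting part; the ranking and filtering around it stays the same.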