The parts of the model and how to go about training it are well known (e.g. ViT-VQGAN, autoregressive transformers, etc.). They also use LAION-400M for training, which is an openly available dataset. It can be replicated, but that will take time, patience, and compute.

This is not true of some of the other recent headline papers (DALL-E 2, PaLM, Imagen, etc.). For those, the datasets are the primary deterrent; the model details are well known.