These models are not repositories or archives of others work that they simply stitch together to create output. It's more accurate to say that they view work and then create an algorithm that can output the essence of that work.
For image models, people are often pretty surprised to learn that they are only a few gigabytes in size, despite training on petabytes of images.
For image models, people are often pretty surprised to learn that they are only a few gigabytes in size, despite training on petabytes of images.