> the exception rather than the rule [...] . Girl with a Pearl Earring is so famous that there are countless copies of it on the internet
I think that's part of the point, though.
Even if it's working like some kind of feature detection - "this is what 'nose' looks like [nose1,nose2,...,nose9999]" and "this is what 'girl with a pearl earring vermeer' looks like [vermeer1,vermeer2,...,]"
Clearly it is storing copies of those images internally, and constant reinforcement from finding noses and Vermeer paintings in the training data pushes the priority of those features up until it effectively stores them.
We can't trust these models, as they exist right now, to reliably generate new works, rather than spit out something that already exists.
Technically speaking, there can be only so many such reproducible works in the model due to the information-theoretic constraint. But yes, I agree that this is a close enough reproduction of Vermeer's work, and while this one is indeed in the public domain, you are right that other copyrighted works may be present in the model verbatim, extractable with the right, currently unknown, prompt.
The photos of the painting on the internet aren’t original works either. This system is learning from everyone taking pictures. If you paid a very good artist to paint that painting, they’d work from a picture. No artist emerges fully formed without being shaped by what they’ve seen.
There’s a difference between an artist and a machine (for now), but it’s not unreasonable to assume that some internalisation of others’ work is fair.
Not reliably, no. But at least for this size of model, its capacity to memorize seems to be limited to works that are so popular that 'everyone' knows what they look like. And it also seems to generate them only when asked. So in any situation where a human is reviewing the output, it doesn't seem like there's a big risk of plagiarizing something by accident. Larger models may differ.
I feel like this is one of those occasions where perfect is the enemy of the good. Yes, there might be some very, very famous artworks stored inside it. But unless you think Stability AI has completely blown past the current Hutter Prize winner, there is no way this thing is storing many things. The things it might be storing are too public to really be that big of a deal. It will spit out a Vermeer, but your submission to DeviantArt in 2007 is definitely safe.
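The compression argument above can be made concrete with a back-of-envelope calculation. The parameter count and dataset size below are rough public figures for Stable Diffusion v1 and its LAION training set, used here only as illustrative assumptions:

```python
# Back-of-envelope: how much weight budget exists per training image?
# Assumed figures (approximate, public): ~890M U-Net parameters,
# fp16 weights, ~2.3B training images from LAION.
params = 890e6
bytes_per_param = 2          # fp16
model_bytes = params * bytes_per_param

training_images = 2.3e9

bytes_per_image = model_bytes / training_images
print(f"~{bytes_per_image:.2f} bytes of weight budget per training image")

# Well under one byte per image: verbatim storage of most of the
# training set is information-theoretically impossible, so only a
# small number of heavily duplicated works can be memorized.
```

Under these assumptions the budget works out to well under a byte per image, which is why memorization is expected only for works that appear thousands of times in the training data.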
> there is no way this thing is storing many things.
I didn't claim that it was storing all of the images.
It is, however, quite clearly storing at least some of the original training images. Where exactly the line falls, I don't know.
To go back to the original point of this thread:
> The thing is, people have wild misconceptions about what this technology does. Many clearly think it's a photocopier. As if StableDiffusion is hiding copies of all these works inside.
I don't think it's a wild misconception, or even an unreasonable concern, to worry that it could output some of its training images. As an end-user of the technology, you have no idea.