> I think you are on the right track but for me personally it really depends on how difficult it was to produce the result. Like if you enter "spit out harry potter and the philosophers stone" and it does, that's black and white. But if you have to torture a repeated prompt to force the model to ignore its constraints, that's not exactly using the system as intended.
Let me offer a different perspective. Having an LLM that is trained on copyrighted material, memorized (or lossily compressed) it, and then has some "safety" machinery that tries to avoid near-verbatim outputs of copyrighted material is fundamentally not distinguishable from simply having a plaintext database of copyrighted material with machinery for "fuzzy" data extraction from said material.
Suppose a company stores the whole of stack exchange in plaintext, then implements a chat-like interface that fuzzy matches on question, extracts answers from plain-text database, fuzzes top-rated/accepted answers together and outputs something, not necessarily quoting one distinct answer, but pretty damn close.
How much "fuzziness" is required for this to stop being a copyright violation? LLM advocates say that LLMs are "fuzzy enough" without clearly defining what "enough" means.
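To make the thought experiment concrete, here is a minimal sketch of such a system: a "chat" interface that is really just fuzzy retrieval over a plaintext answer store. The corpus, the helper name, and the similarity cutoff are all made up for illustration; the point is only that the fuzzy step in the middle does not change what comes out.

```python
import difflib

# Hypothetical plaintext store of question -> accepted answer.
CORPUS = {
    "how do i reverse a list in python": "Use lst[::-1] or lst.reverse().",
    "how do i read a file line by line": "Iterate over the file object: for line in f: ...",
}

def fuzzy_answer(question, cutoff=0.6):
    """Return the stored answer whose question best matches the input."""
    matches = difflib.get_close_matches(question.lower(), CORPUS, n=1, cutoff=cutoff)
    return CORPUS[matches[0]] if matches else None

print(fuzzy_answer("How do I reverse a list in Python?"))
# -> Use lst[::-1] or lst.reverse().
```

However "fuzzy" the matching is, the output is a near-verbatim copy of the stored answer, which is exactly the distinction the argument above is questioning.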
>Let me offer a different perspective. Having an LLM that is trained on copyrighted material, memorized (or lossily compressed) it, and then has some "safety" machinery that tries to avoid near-verbatim outputs of copyrighted material is fundamentally not distinguishable from simply having a plaintext database of copyrighted material with machinery for "fuzzy" data extraction from said material.
Right, so sort of like a search engine that caches thumbnails of copyrighted images to display quick search results? Something I have been using for years and have no issue with, where the legal arguments are framed more around where the links go, and how easy the search engine makes it for me to acquire the original image?
Would your argument be the same if it was a human? If a person memorizes a book verbatim, but uses common sense not to transcribe the book for others because that would be copyright infringement, do we disallow him from using the memorized information at all because he could duplicate it?
I’m saying that it doesn’t matter what humans do; this machine isn’t a human.
There is no reason to believe that humans and machines should be the same under the law.
The clearest example of this is that in the US it’s already been decided that AI-generated art can’t be copyrighted, because it was made by a computer rather than a person. Same as with the monkey selfie.