There's plenty of public domain material out there to train on. The problem is just that having to filter out copyrighted material from the training dataset would be prohibitively expensive.

These AIs do not need to learn from your work in order to function; they just need to learn from some work. All that extending copyright in this manner would do is create a bunch of unnecessary busywork and slow the progress of technological development. You're not going to get paid $5 for letting a proprietary AI train on your drawing, blog post, or GitHub repo when they could get the same benefit from paying someone else $0.05 to find a public domain image, blog post, or repo elsewhere.

Basically, the amount of value your individual work contributes to generative AI is minuscule, but the amount of effort the entire industry would need to expend in order to compensate you for that value is massive. Unless stifling progress is your explicit goal, it's not worth it.


