Hacker News new | past | comments | ask | show | jobs | submit login

I’m wondering whether another motivation for this could be trying to keep the data set as clean as possible for future model training.

Creating videos takes quite a bit of time. If AI video generation becomes widely available, pretty soon, there could be more AI content being uploaded to YouTube than human-made stuff.

Presumably, training on AI generated stuff magnifies any artefacts/hallucinations present in the training set, reducing the quality of the model.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: