Given the already huge cost of training, and the evident lack of concern the LLM...

ChatGTP · on Sept 5, 2023

It's more difficult to scrape pay walled content no?

Clearly, places like Reddit have wised up to this and are making API usages non-free for example, so while it's not impossible, you can see the limitations being put into place already. Twitter is another one.

It seems like all this data is now considered gold and people lock up gold?