ChatGPT lists clickable sources in a lot of nontrivial queries. Those sites don’...

croes · 2025-07-24T06:12:30 1753337550

How many people click the links? What happens to LLMs if people don’t provide training data anymore because nobody visits their sites?

esnard · 2025-07-24T12:23:22 1753359802

Cloudflare publishes a "crawl-to-refer" ratio, which can be used to estimate the traffic from LLMs:

https://radar.cloudflare.com/ai-insights#crawl-to-refer-rati...

robryan · 2025-07-24T09:48:45 1753350525

They will either pay for it to be generated or get good enough at producing synthetic data that actually improves LLM quality.

croes · 2025-07-24T10:06:56 1753351616

So either even higher costs and hope that a bug problem of LLMs get solved somehow.

Given how much data they need that will be pretty expensive, I mean really really expensive. How many people can write good training data and how much per day?

Doesn’t sound sustainable.