
I learned that lesson from GPT-5, where the preview was weeks long and the models kept changing during that period.

This Claude preview lasted from Friday to Monday so I was less worried about major model changes. I made sure to run the pelican benchmark against the model after 10am on Monday (the official release date) just to be safe.

The only thing I published that I ran against the preview model was the Claude code interpreter example.

I continue not to worry about models having been trained to ace my pelican benchmark, because the models still suck at it. You really think Anthropic deliberately cheated on my benchmark and still only managed to produce this? https://static.simonwillison.net/static/2025/claude-sonnet-4...





Testing this, it's way more aggressive about throttling back than the previous model, and about message token lengths. It constantly stops in the middle of an action if it's not a simple request. I presume you did not have resource limitations during the preview?

No, the preview was effectively unlimited usage (for two days).

Well, if they produced a really, really good image of pelicans on bicycles and nothing else, their cheating would be obvious, so it makes sense to cheat just a little bit across the board (if we want to assume they're cheating).

Yesterday someone posted an example of the same prompt but with the pelican changed to a human, and it was basically trash; the example you've posted actually looks good, all things considered. So yeah, I do think it's something they train on, the same way they train on things in the benchmarks.

The easy way to tell is to try it yourself - run "Generate an SVG of a pelican riding a bicycle" and then try "Generate an SVG of an otter riding a skateboard" and see if the quality of the images seems similar.
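If you'd rather script that comparison than paste prompts into the UI, here's a minimal sketch using the Anthropic Python SDK (the model ID is an assumption; swap in whichever Claude model you're testing):

    # Minimal sketch: run the same SVG prompt template for two subjects
    # and save each result so the outputs can be compared side by side.
    # Assumes ANTHROPIC_API_KEY is set in the environment.
    import anthropic

    client = anthropic.Anthropic()

    prompts = {
        "pelican-bicycle": "Generate an SVG of a pelican riding a bicycle",
        "otter-skateboard": "Generate an SVG of an otter riding a skateboard",
    }

    for name, prompt in prompts.items():
        message = client.messages.create(
            model="claude-sonnet-4-5",  # assumed model ID, replace as needed
            max_tokens=4096,
            messages=[{"role": "user", "content": prompt}],
        )
        # Save the raw response; the SVG can be extracted from it by hand.
        with open(f"{name}.txt", "w") as f:
            f.write(message.content[0].text)

If the pelican output comes back dramatically better than the otter one, that would point toward benchmark-specific training; comparable quality suggests it doesn't.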

How about a narwhal spacewalking from the ISS, with Earth visible below (specifically the Niger delta)?

https://claude.ai/public/artifacts/f3860a8a-2c7d-404f-978b-e...

Requesting an ‘extravagantly detailed’ version produces something quite impressive in effort, if not quite in execution:

https://claude.ai/public/artifacts/f969805a-2635-4e30-8278-4...



