Post actual results, make a blog post. Don't just say "this sucks" without tangi...

thorum · 2026-04-08T21:27:14 1775683634

I have the opposite experience: random HN/Reddit comments saying “this sucks” or “whoa this is a huge improvement” are the only benchmark that means anything. Standard benchmarks are all gamed and don’t capture the complexity of the real world.

titanomachy · 2026-04-08T23:22:58 1775690578

Then your internal benchmarks will be in the post-training set and you’ll have to make new ones.

_2d30 · 2026-04-08T23:54:55 1775692495

I may already have, but I'm pseudonymous on this website.