
Probably because the benchmark gains from larger models are, at this time, negligible. Scaling transformers up and iterating on attention might hit a dead end for more capable models beyond 2T parameters. But I'm not sure.


