From what I've seen, using these huge models for inference at any kind of scale ...

Voloskaya · on June 23, 2022

Those models aren't trained with the objective of being deployed in production. They are trained to be used as teachers during distillation into smaller models that fit the cost/latency requirements for whatever scenario those big companies have. That's where the real value is.
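
For readers unfamiliar with distillation, here is a minimal sketch of the usual soft-target loss, assuming the common temperature-scaled KL formulation. Nothing in this thread says which recipe any particular company uses, and the names `softmax`, `distillation_loss`, and `temperature` are hypothetical, chosen only for illustration.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits (numerically stable)."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Illustrative only: assumes moderate logits so no student probability
    underflows to zero.
    """
    p = softmax(teacher_logits, temperature)  # soft targets from the big model
    q = softmax(student_logits, temperature)  # predictions from the small model
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical logits for a 3-class problem
teacher = [4.0, 1.0, 0.5]
student = [2.5, 1.5, 0.2]
print(distillation_loss(teacher, student))
```

In practice this term is typically mixed with the ordinary cross-entropy loss on the true labels, and the student is trained to minimize the combination.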

f311a · on June 23, 2022

Yandex uses it for search and its voice assistant.