That's just with the smallest 124M model though; on short form content especially, I'm not convinced of the value of larger models.
That's just with the smallest 124M model though; on short form content especially, I'm not convinced of the value of larger models.