groodt on April 8, 2023 \| on: Cerebras-GPT: Open Compute-Optimal Language Models...

Thanks for publishing this. I quickly skimmed the paper and saw the impressive linear scaling up to 16 nodes. How long did it take to train the various models in wall-clock time?