"The fastest way to feel the magic is to run the speedrun script speedrun.sh, wh...

simonw · 2025-10-13T22:36:29 1760394989

H100s are expensive NVIDIA GPUs, each costing about $30,000. 8XH100 means you have 8 of those wired together in a big server in a data center somewhere, so around a quarter of a million dollars worth of hardware in a single box.

You need that much hardware because each H100 provides 80GB of GPU-accessible RAM, but to train this model you need to hold a LOT of model weights and training data in memory at once. 8 × 80GB = 640GB.

~$24/hour is how much it costs to rent that machine from various providers.
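
If it helps, here is the same back-of-the-envelope arithmetic as a tiny Python sketch. The numbers are just the rough figures quoted above (about $30,000 per GPU, 80GB each, about $24/hour for the whole box), not exact prices, and the variable names are made up for illustration.

```python
# Rough figures from the comment above; all of them are approximations.
NUM_GPUS = 8
PRICE_PER_GPU_USD = 30_000   # approximate purchase price of one H100
MEMORY_PER_GPU_GB = 80       # GPU-accessible RAM per H100
RENTAL_USD_PER_HOUR = 24     # approximate rental price for the whole 8-GPU machine

hardware_cost_usd = NUM_GPUS * PRICE_PER_GPU_USD      # cost of the GPUs in one box
total_memory_gb = NUM_GPUS * MEMORY_PER_GPU_GB        # combined GPU memory
rental_per_gpu_hour = RENTAL_USD_PER_HOUR / NUM_GPUS  # implied price per GPU-hour

print(f"Hardware cost: ~${hardware_cost_usd:,}")
print(f"Total GPU memory: {total_memory_gb}GB")
print(f"Rental: ~${rental_per_gpu_hour:.2f} per GPU-hour")
```

So the "quarter of a million dollars" is 8 × $30,000 = $240,000 rounded up, and $24/hour for the node works out to roughly $3 per GPU-hour.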

calmoo · 2025-10-13T22:41:00 1760395260

Perfectly explained, thanks!

JKCalhoun · 2025-10-13T23:00:19 1760396419

Thank you.

llleeeooo · 2025-10-13T22:34:32 1760394872

Renting 8 H100s would cost you about $24/hour.