How do you handle hardware failures or screw-ups that take the server offline? W...

logotype · on May 2, 2024

Uptime for the last 3 years has been >99%.

1. Keep spares (esp. fans and drives). Hot swap if needed.
2. Dual ISP (1 Gbit/s primary, automatic switch to Starlink if the primary link fails).
3. Keep a separate server for testing. Actually I lied, I test things in prod, sue me.
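For anyone curious what the automatic ISP switch can look like in logic terms, here is a minimal sketch. It is not the setup described above (the comment doesn't say how the switch is implemented, and in practice a router usually does this). The names `FailoverState`, `next_state`, `simulate` and both thresholds are hypothetical, chosen only for illustration. The idea: switch to the backup link after a few consecutive failed probes of the primary, and switch back only after a longer run of successful ones, so a flapping link doesn't bounce traffic back and forth.

```python
from dataclasses import dataclass


@dataclass
class FailoverState:
    active: str = "primary"  # link currently carrying traffic: "primary" or "backup"
    failures: int = 0        # consecutive failed probes of the primary link
    successes: int = 0       # consecutive successful probes of the primary link


def next_state(state, primary_ok, fail_threshold=3, recover_threshold=5):
    """Return the new state after one health probe of the primary link.

    Thresholds are illustrative assumptions, not values from the post.
    """
    if primary_ok:
        failures, successes = 0, state.successes + 1
    else:
        failures, successes = state.failures + 1, 0

    active = state.active
    if active == "primary" and failures >= fail_threshold:
        active = "backup"    # primary looks dead: fail over
    elif active == "backup" and successes >= recover_threshold:
        active = "primary"   # primary has been stable long enough: fail back
    return FailoverState(active, failures, successes)


def simulate(probes, **thresholds):
    """Feed a sequence of probe results (True = primary reachable) through the logic."""
    state = FailoverState()
    history = []
    for ok in probes:
        state = next_state(state, ok, **thresholds)
        history.append(state.active)
    return history
```

Feeding it one good probe, three bad ones, then five good ones fails over on the third bad probe and fails back on the fifth good one. A real deployment would replace the boolean with an actual reachability check and change the default route when `active` changes.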

The monthly AWS bill is roughly 5 USD and consists of Route 53 only!