Hacker News new | past | comments | ask | show | jobs | submit login

> Mechanical engineering has specifications for "bug" tolerance.

That's what these days is referred to as an SLO.

Golden signals, USE, RED are ways of measuring them on a high level.

You program your systems to be resilient to a certain level of failure (retries, graceful failure), and you program them to be able to cope without a malfunctioning part by isolating it from the rest of the system (circuit breakers, bulkheads).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: