> Mechanical engineering has specifications for "bug" tolerance.
That's what these days is referred to as an SLO.
Golden signals, USE, RED are ways of measuring them on a high level.
You program your systems to be resilient to a certain level of failure (retries, graceful failure), and you program them to be able to cope without a malfunctioning part by isolating it from the rest of the system (circuit breakers, bulkheads).
That's what these days is referred to as an SLO.
Golden signals, USE, RED are ways of measuring them on a high level.
You program your systems to be resilient to a certain level of failure (retries, graceful failure), and you program them to be able to cope without a malfunctioning part by isolating it from the rest of the system (circuit breakers, bulkheads).