Thoughts on Reliability

Reliability is a process. Things will go wrong. Start with empathy and understanding. Help your team develop the skills to stress the system in a controlled environment and monitor how it behaves. Finally, make the system more resilient by enabling it to recover on its own.