Abstract Chaos Engineering is a helpful tool for understanding your system’s unknowns, but it is not, by itself, the means of achieving resilience. Instead, it instills higher confidence in the ability to cope and stay resilient in the face of inevitable failures.
In this talk, I’ll go over lessons learned at Netflix and the impact that Chaos Engineering has had on this confidence. As John Allspaw has said, "Resilience is the story of the outage that didn’t happen". I’ll share such stories from the vulnerabilities our team has found through Chaos Engineering, along with how we follow up on those vulnerabilities and how Chaos Engineering is incorporated into our day-to-day culture.
Bio Nora is a Senior Software Engineer at Netflix and a student of Human Factors and Systems Safety at Lund University. She is passionate about resilient software, people, and the intersection of those two worlds.
She recently co-wrote the book on Chaos Engineering and gave a keynote at AWS re:Invent, to an audience of over 40,000 people, on the benefits and business case for implementing Chaos Engineering.