7/10 10:40 - 11:20
Reliability is a Question
One thing that we all share—big environments, small environments, and everything in between, no matter what your organization does for a living—is a thirst for reliability. We spend (or perhaps should spend) large swaths of our time asking our systems to provide the appropriate level of reliability. And when they don’t, we then try to ask why. Sometimes we even get the hint of an answer. The problem is that the obvious questions don’t always lead to the right answers, relationships can be counter-intuitive, and learning from failure is not easy even if, or perhaps especially if, you approach things using a Site Reliability Engineering (SRE) mindset. Let’s talk about these questions and how they can lead you astray no matter how experienced you are with SRE. We will get you back on to the right path with better questions (and maybe even an answer or two!). You will be taking away some concrete approaches to your challenges around failures, communication, organizational structure, monitoring, and a few other surprise topics. Come join us for a chance to question your answers and answer your questions around reliability!

the editor/curator of Seeking SRE, and the author of Becoming SRE
David Blank-Edelman
David Blank-Edelman is the editor/curator of Seeking SRE: Conversations About Running Production Systems at Scale (O’Reilly) and the author of Becoming SRE: First Steps Toward Reliability for You and Your Organization (O’Reilly). David is a co-founder of the SREcon conference and has roughly 40 years of experience in the operations space.







































.png)























