Site Reliability Engineering (SRE) is what you get when you treat operations as a software problem. Good products are reliable but most engineers don’t think about how to maintain their services until after initial development.
Using error budgets and SRE best practices can improve the reliability, maintainability, and even feature velocity of products. This talk is an introduction into what SRE is, why it’s important to think about early in the development of software and how to measure the reliability of your service.