Site Reliability Engineering (SRE)
Understanding Risk in Reliability
Acceptable Risk Levels
Risk Mitigation Strategies
Purpose of SLOs
SLOs as Communication Tool
SLOs and Business Alignment
Identifying Toil
Impact of Toil on Productivity
Toil Reduction as Core Value
Benefits of Automation
Identifying Automation Opportunities
Automation Best Practices
Principles of Reliable Releases
Release Process Automation
Rollback and Rollforward Strategies
Value of Simplicity in Systems
Techniques for Achieving Simplicity
Avoiding Unnecessary Complexity