Useful Links

Site Reliability Engineering (SRE)

1. Introduction to Site Reliability Engineering
2. Core Principles of SRE
3. Service Level Management
4. Observability and Monitoring
5. Incident Management and On-Call
6. Toil Management and Automation
7. Change and Release Management
8. System Design for Reliability
9. SRE Organization and Culture
10. Advanced SRE Practices

7. Change and Release Management
  1. Safe Change Management Principles
    1. Change Approval Processes
    2. Risk Assessment for Changes
    3. Change Coordination
    4. Change Freeze Periods
  2. Progressive Delivery Techniques
    1. Gradual Rollouts
    2. Feature Flags
    3. Canary Releases
    4. Blue-Green Deployments
    5. A/B Testing for Reliability
  3. Continuous Integration and Delivery
    1. CI/CD Pipeline Design
    2. Automated Testing Gates
    3. Deployment Automation
    4. Pipeline Security
    5. Artifact Management
  4. Rollback and Recovery
    1. Designing for Quick Reversals
    2. Rollback Procedures
    3. Monitoring for Rollback Triggers
    4. Post-Rollback Analysis
    5. Forward Fix vs Rollback Decisions


© 2025 Useful Links. All rights reserved.
