Useful Links
Computer Science
DevOps and SRE
Site Reliability Engineering (SRE)
1. Introduction to Site Reliability Engineering
2. Core Principles of SRE
3. Service Level Management
4. Observability and Monitoring
5. Incident Management and On-Call
6. Toil Management and Automation
7. Change and Release Management
8. System Design for Reliability
9. SRE Organization and Culture
10. Advanced SRE Practices
SRE Organization and Culture
SRE Team Structures
Embedded SRE Model
Centralized SRE Team Model
Hybrid Models
Team Size and Composition
SRE Career Paths
SRE Engagement Models
Partnership with Development Teams
SRE Onboarding Processes
Production Readiness Reviews
SRE Consulting and Support
Service Handoff Criteria
Building SRE Culture
Blameless Culture
Continuous Improvement Mindset
Data-Driven Decision Making
Psychological Safety
Knowledge Sharing Practices
Documentation Standards
SRE Skills and Competencies
Technical Skills
Soft Skills
Cross-Functional Collaboration
Mentoring and Knowledge Transfer
Previous
8. System Design for Reliability
Go to top
Next
10. Advanced SRE Practices