UsefulLinks
Computer Science
DevOps and SRE
Site Reliability Engineering (SRE)
1. Introduction to Site Reliability Engineering
2. Core Principles of SRE
3. Service Level Management
4. Observability and Monitoring
5. Incident Management and On-Call
6. Toil Management and Automation
7. Change and Release Management
8. System Design for Reliability
9. SRE Organization and Culture
10. Advanced SRE Practices
9.
SRE Organization and Culture
9.1.
SRE Team Structures
9.1.1.
Embedded SRE Model
9.1.2.
Centralized SRE Team Model
9.1.3.
Hybrid Models
9.1.4.
Team Size and Composition
9.1.5.
SRE Career Paths
9.2.
SRE Engagement Models
9.2.1.
Partnership with Development Teams
9.2.2.
SRE Onboarding Processes
9.2.3.
Production Readiness Reviews
9.2.4.
SRE Consulting and Support
9.2.5.
Service Handoff Criteria
9.3.
Building SRE Culture
9.3.1.
Blameless Culture
9.3.2.
Continuous Improvement Mindset
9.3.3.
Data-Driven Decision Making
9.3.4.
Psychological Safety
9.3.5.
Knowledge Sharing Practices
9.3.6.
Documentation Standards
9.4.
SRE Skills and Competencies
9.4.1.
Technical Skills
9.4.2.
Soft Skills
9.4.3.
Cross-Functional Collaboration
9.4.4.
Mentoring and Knowledge Transfer
Previous
8. System Design for Reliability
Go to top
Next
10. Advanced SRE Practices