Distributed Systems

  1. Fault Tolerance and Reliability
    1. Fault Tolerance Concepts
      1. Faults vs. Failures vs. Errors
        1. Fault Classification
          1. Hardware Faults
            1. Software Faults
              1. Network Faults
              2. Fault Tolerance Techniques
                1. Fault Prevention
                  1. Fault Removal
                    1. Fault Forecasting
                      1. Fault Tolerance
                    2. Failure Detection
                      1. Failure Detectors
                        1. Perfect Failure Detectors
                          1. Eventually Perfect Failure Detectors
                            1. Unreliable Failure Detectors
                            2. Detection Mechanisms
                              1. Heartbeat Protocols
                                1. Timeout-Based Detection
                                  1. Gossip-Based Detection
                                  2. Detection Accuracy
                                    1. False Positives
                                      1. False Negatives
                                        1. Detection Time
                                      2. Redundancy and Replication
                                        1. Types of Redundancy
                                          1. Hardware Redundancy
                                            1. Software Redundancy
                                              1. Information Redundancy
                                                1. Time Redundancy
                                                2. Replication Strategies
                                                  1. Active Replication
                                                    1. Passive Replication
                                                      1. Semi-Active Replication
                                                      2. Replica Management
                                                        1. Replica Placement
                                                          1. Replica Consistency
                                                            1. Replica Recovery
                                                          2. Process Resilience
                                                            1. Process Groups
                                                              1. Group Formation
                                                                1. Group Communication
                                                                  1. Group Membership
                                                                  2. Failure Masking
                                                                    1. Voting Mechanisms
                                                                      1. Majority Consensus
                                                                        1. Byzantine Fault Masking
                                                                        2. Process Recovery
                                                                          1. Restart Mechanisms
                                                                            1. State Recovery
                                                                              1. Rollback Recovery
                                                                            2. Reliable Communication
                                                                              1. Point-to-Point Reliability
                                                                                1. Acknowledgments
                                                                                  1. Retransmission
                                                                                    1. Duplicate Detection
                                                                                    2. Reliable Multicast
                                                                                      1. Atomic Multicast
                                                                                        1. Causal Multicast
                                                                                          1. Total Order Multicast
                                                                                          2. RPC Reliability
                                                                                            1. Failure Semantics
                                                                                              1. Exception Handling
                                                                                                1. Timeout Management
                                                                                              2. Recovery Techniques
                                                                                                1. Checkpointing
                                                                                                  1. Independent Checkpointing
                                                                                                    1. Coordinated Checkpointing
                                                                                                      1. Communication-Induced Checkpointing
                                                                                                      2. Message Logging
                                                                                                        1. Pessimistic Logging
                                                                                                          1. Optimistic Logging
                                                                                                            1. Causal Logging
                                                                                                            2. Recovery Protocols
                                                                                                              1. Rollback Recovery
                                                                                                                1. Forward Recovery
                                                                                                                  1. Hybrid Recovery