Useful Links
Computer Science
Distributed Systems
Parallel and Distributed Computing
1. Introduction to Parallel and Distributed Computing
2. Parallel Computing Fundamentals
3. Parallel Algorithms and Applications
4. Distributed Computing Fundamentals
5. Time and Coordination in Distributed Systems
6. Replication and Consistency
7. Fault Tolerance in Distributed Systems
8. Distributed Algorithms
9. Large-Scale Data Processing
10. Cloud Computing
11. High-Performance Computing
12. Emerging Paradigms and Technologies
13. Performance Analysis and Optimization
14. Security in Parallel and Distributed Systems
Fault Tolerance in Distributed Systems
Failure Detection
Timeout-based Detection
Heartbeat Mechanisms
Failure Detectors
Perfect Failure Detectors
Eventually Perfect Failure Detectors
Strong Failure Detectors
Weak Failure Detectors
Fault Tolerance Techniques
Redundancy
Hardware Redundancy
Software Redundancy
Information Redundancy
Checkpointing and Recovery
Independent Checkpointing
Coordinated Checkpointing
Communication-Induced Checkpointing
Log-based Recovery
Replication-based Fault Tolerance
Active Replication
Passive Replication
Semi-active Replication
Recovery Strategies
Backward Recovery
Forward Recovery
Rollback Recovery
Message Logging
Reliable Communication
At-most-once Semantics
At-least-once Semantics
Exactly-once Semantics
Reliable Multicast
Previous
6. Replication and Consistency
Go to top
Next
8. Distributed Algorithms