Distributed Systems Core Concepts Cheat Sheet

Updated 2026-05-28

Next Topic: Domain-Driven Design (DDD) Cheat Sheet

Distributed systems enable computation and data storage across multiple independent nodes that coordinate to appear as a single coherent system, powering everything from cloud databases and microservices to content delivery networks and blockchain platforms. As systems scale beyond single machines and geographic boundaries, understanding the fundamental tradeoffs — between consistency, availability, partition tolerance, and latency — becomes critical for architects building resilient global infrastructure. The key mental model: every distributed system is a collection of tradeoffs, not a collection of solutions, and mastering these tradeoffs is what separates reliable production systems from fragile ones.

What This Cheat Sheet Covers

This topic spans 20 focused tables and 158 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: CAP Theorem and Fundamental TradeoffsTable 2: Consistency ModelsTable 3: Consensus AlgorithmsTable 4: Time and Ordering in Distributed SystemsTable 5: Distributed TransactionsTable 6: Replication StrategiesTable 7: Data Partitioning and ShardingTable 8: Failure Detection and RecoveryTable 9: Conflict Resolution StrategiesTable 10: Gossip and Epidemic ProtocolsTable 11: Coordination and Distributed LockingTable 12: Distributed Hash TablesTable 13: Quorum SystemsTable 14: Data Integrity and VerificationTable 15: Message Delivery SemanticsTable 16: Broadcast PrimitivesTable 17: Retry and Resilience PatternsTable 18: Advanced ConceptsTable 19: Load Balancing StrategiesTable 20: Service Discovery Patterns

Table 1: CAP Theorem and Fundamental Tradeoffs

The CAP theorem is the starting point for reasoning about any distributed system, but PACELC extends it by addressing the latency-consistency tradeoff that occurs even in the absence of network partitions — which is the more common operating condition. Understanding BASE versus ACID, and knowing which quadrant a system occupies in the PACELC taxonomy, directly drives architectural decisions.

Concept	Example	Description
CAP Theorem	Choose 2 of 3: C+P, A+P	Only two of three guarantees simultaneously possible when a network partition occurs: Consistency, Availability, Partition Tolerance.
PACELC Theorem	PA/EL (Cassandra) vs PC/EC (Spanner)	• Extends CAP: during Partition → choose A vs C; Else → choose L (Latency) vs C (Consistency) • more practical than CAP for normal operations.
Consistency (CAP)	All nodes see same data at same time	• Every read receives the most recent write or an error • all replicas must agree before responding
Availability (CAP)	Every request gets non-error response	• System responds to every request even if some nodes are down • may return stale data during partition
Partition Tolerance (CAP)	System continues despite network split	System operates correctly even when network messages are lost or delayed between nodes.

Table 1: CAP Theorem and Fundamental Tradeoffs

Concept	Example	Description
CAP Theorem	Choose 2 of 3: C+P, A+P	Only two of three guarantees simultaneously possible when a network partition occurs: Consistency, Availability, Partition Tolerance.
PACELC Theorem	PA/EL (Cassandra) vs PC/EC (Spanner)	• Extends CAP: during Partition → choose A vs C; Else → choose L (Latency) vs C (Consistency) • more practical than CAP for normal operations.
Consistency (CAP)	All nodes see same data at same time	• Every read receives the most recent write or an error • all replicas must agree before responding
Availability (CAP)	Every request gets non-error response	• System responds to every request even if some nodes are down • may return stale data during partition
Partition Tolerance (CAP)	System continues despite network split	System operates correctly even when network messages are lost or delayed between nodes.