Skip to main content

5 docs tagged with "part2-distributed-data"

View all tags

Chapter 5: Replication

**Replication** means keeping a copy of the same data on multiple machines (connected via a network). Reasons to replicate:

Chapter 6: Partitioning

For very large datasets or very high query throughput, a single machine is not enough. **Partitioning** (also called sharding) breaks the data into.

Chapter 7: Transactions

Real applications are messy — the database can crash, network connections can drop, multiple clients write concurrently, and partial reads of partially updated.

Chapter 9: Consistency and Consensus

Chapter 8 cataloged everything that can go wrong in distributed systems. This chapter asks: **given all those failure modes, what guarantees can we actually.