Replication & Partitioning

Why Replicate?

High availability: if primary fails, promote a replica
Read scalability: distribute read queries across replicas
Geographic distribution: serve users from nearby region
Disaster recovery: off-site replicas for backup

Replication Architectures

Leader–Follower (Primary–Replica)

The most common model.

          Writes
Client ──────────→ [Leader / Primary]
                         │
                    WAL / binlog
                    ↙         ↘
             [Replica 1]  [Replica 2]
                 │               │
           Read queries    Read queries

All writes go to leader
Leader ships changes to followers via log (MySQL binlog, PostgreSQL WAL)
Followers are read-only (can serve SELECT queries)
On leader failure → failover promotes a replica

Replication lag: followers may be seconds behind the leader → eventual consistency for reads.

-- MySQL: check replication lag
SHOW SLAVE STATUS\G
-- Seconds_Behind_Master: N

Synchronous vs Asynchronous Replication

	Synchronous	Asynchronous
Durability	Higher (follower confirmed)	Leader can't wait
Write latency	Higher (waits for follower ack)	Lower
Risk	Blocks if follower is slow	Data loss if leader crashes before replica syncs
PostgreSQL	`synchronous_commit = on`	`synchronous_commit = off`

Semi-synchronous (MySQL): at least one follower must ack before commit returns.

Multi-Leader (Multi-Primary)

Multiple nodes accept writes. Used for:

Multi-datacenter deployments (each DC has a leader)
Offline-capable apps (each client is a "leader")

Challenge: write conflicts — two leaders can accept conflicting writes.

Conflict resolution strategies:

Last-write-wins (LWW): by timestamp (risk of data loss)
Merge: application-level merging (CRDTs)
Custom logic: application decides which wins

Leaderless (Dynamo-style)

Any node accepts reads and writes. Used by: Cassandra, DynamoDB, Riak.

Client writes to W nodes
Client reads from R nodes
Quorum: W + R > N → at least one response has latest write

No single point of failure
Eventual consistency by default
Tunable consistency: adjust W, R, N

Partitioning / Sharding

Deep Dive: Sharding & Partitioning

The detailed guide on Horizontal Partitioning, Sharding Strategies (Hash, Range, Consistent Hashing), and Cross-Shard complexity has been moved to its own centralized page. Please see the Database Sharding & Partitioning guide.

CAP Theorem

A distributed system can guarantee at most 2 of 3:

Property	Description
Consistency (C)	Every read receives the latest write (or an error)
Availability (A)	Every request receives a response (no error)
Partition tolerance (P)	System continues despite network partitions

Network partitions are inevitable in distributed systems, so the real choice is CP vs AP:

CP (Consistent + Partition tolerant): Returns error or waits when partitioned. Examples: HBase, Zookeeper, etcd, MongoDB (strong consistency mode)
AP (Available + Partition tolerant): Returns potentially stale data when partitioned. Examples: Cassandra (eventual consistency), CouchDB, DynamoDB

CAP is a simplification

The PACELC model extends CAP: even without partitions, there's a trade-off between latency and consistency.

PACELC Model

If Partition:    Availability   vs  Consistency     (AP vs CP)
Else (no part):  Latency        vs  Consistency     (EL vs EC)

System	Partition	Else
DynamoDB (default)	AP	EL
Cassandra	AP	EL
MongoDB	CP	EC
PostgreSQL (single node)	—	EC

Read Replicas and Replication Lag — Practical Issues

Read-Your-Writes Consistency

After a user submits a form, they must see their own update even if reads go to replicas:

Read from leader for N seconds after write
Track last write timestamp, only read from replica if it has caught up
Route user's reads to leader for their own data; replica for others

Monotonic Reads

A user should not read older data after reading newer data (by reading different replicas):

Assign user → specific replica (consistent routing by user_id)

Spring Boot Configuration for Read/Write Split

// Route reads to replica, writes to primary
@Configuration
public class DataSourceConfig {

    @Bean
    @Primary
    public DataSource routingDataSource(
            @Qualifier("primaryDs") DataSource primary,
            @Qualifier("replicaDs") DataSource replica) {

        Map<Object, Object> targets = new HashMap<>();
        targets.put("primary", primary);
        targets.put("replica", replica);

        AbstractRoutingDataSource routing = new AbstractRoutingDataSource() {
            @Override
            protected Object determineCurrentLookupKey() {
                return TransactionSynchronizationManager.isCurrentTransactionReadOnly()
                    ? "replica" : "primary";
            }
        };
        routing.setTargetDataSources(targets);
        routing.setDefaultTargetDataSource(primary);
        return routing;
    }
}

// Use @Transactional(readOnly = true) to route to replica
@Transactional(readOnly = true)
public List<Order> getRecentOrders() { ... }

🎯 Interview Questions

Q1. What is the difference between replication and sharding?

Replication copies the same data to multiple nodes — for high availability and read scaling. Sharding splits different data across nodes — for write scaling and handling datasets too large for one server. They're often combined: multiple shards, each with its own replicas.

Q2. What is replication lag and how do you handle it?

Replication lag is the delay between a write on the leader and its appearance on replicas. Handle it by: routing writes and immediate re-reads to the leader; using read-your-writes consistency; tracking replication positions; or accepting eventual consistency where appropriate.

Q3. Explain consistent hashing. Why is it better than regular hash modulo for sharding?

Consistent hashing maps both data and nodes onto a virtual ring. When a node is added/removed, only ~1/N of keys need to be remapped, versus N-1/N for hash-modulo. This makes it much cheaper to scale the cluster.

Q4. What is the CAP theorem? How does it apply to choosing a database?

CAP states a distributed system can have at most 2 of: Consistency (reads always see latest write), Availability (every request gets a response), and Partition tolerance. Since partitions are inevitable, the real choice is between CP (consistency over availability) and AP (availability with eventual consistency).

Q7. What is leader election and why is it critical for replication?

When the leader fails, a new leader must be elected from replicas. Common algorithms: Raft, Paxos, Zab (ZooKeeper). The new leader must have the most up-to-date data. Split-brain (two leaders simultaneously) is a dangerous failure mode — fencing mechanisms and quorums prevent it.

Q8. How does multi-leader replication handle write conflicts?

Conflicts occur when two leaders accept conflicting writes to the same row. Strategies: last-write-wins (LWW) by timestamp (risks data loss); per-record conflict avoidance (route each user to one leader); application-level merge (CRDTs); reporting conflict to user. There's no perfect automatic solution — it depends on the use case.

Replication & Partitioning

Why Replicate?

Replication Architectures

Leader–Follower (Primary–Replica)

Synchronous vs Asynchronous Replication

Multi-Leader (Multi-Primary)

Leaderless (Dynamo-style)

Partitioning / Sharding

CAP Theorem

PACELC Model

Read Replicas and Replication Lag — Practical Issues

Read-Your-Writes Consistency

Monotonic Reads

Spring Boot Configuration for Read/Write Split

🎯 Interview Questions

Advanced Editorial Pass: Replication and Partitioning in Distributed Reality

Senior Engineering Focus

Failure Modes to Anticipate

Practical Heuristics

Compare Next

Why Replicate?​

Replication Architectures​

Leader–Follower (Primary–Replica)​

Synchronous vs Asynchronous Replication​

Multi-Leader (Multi-Primary)​

Leaderless (Dynamo-style)​

Partitioning / Sharding​

CAP Theorem​

PACELC Model​

Read Replicas and Replication Lag — Practical Issues​

Read-Your-Writes Consistency​

Monotonic Reads​

Spring Boot Configuration for Read/Write Split​

🎯 Interview Questions​

Advanced Editorial Pass: Replication and Partitioning in Distributed Reality​

Senior Engineering Focus​

Failure Modes to Anticipate​

Practical Heuristics​

Compare Next​

Why Replicate?

Replication Architectures

Leader–Follower (Primary–Replica)

Synchronous vs Asynchronous Replication

Multi-Leader (Multi-Primary)

Leaderless (Dynamo-style)

Partitioning / Sharding

CAP Theorem

PACELC Model

Read Replicas and Replication Lag — Practical Issues

Read-Your-Writes Consistency

Monotonic Reads

Spring Boot Configuration for Read/Write Split

🎯 Interview Questions

Advanced Editorial Pass: Replication and Partitioning in Distributed Reality

Senior Engineering Focus

Failure Modes to Anticipate

Practical Heuristics

Compare Next