13 docs tagged with "ddia"

Chapter 1: Reliable, Scalable, and Maintainable Applications

Most modern applications are **data-intensive**, not compute-intensive. The bottleneck is rarely the CPU — it's the amount of data, how fast it changes, and.

Chapter 10: Batch Processing

So far the book has focused on systems that handle requests as they arrive (OLTP) or read/write in real-time. But some of the most important data processing.

Chapter 11: Stream Processing

Batch processing has one problem: **latency**. A job that runs once a day means insights that are 24 hours stale. Stream processing is like a continuous batch.

Chapter 12: The Future of Data Systems

The final chapter synthesizes everything in the book and looks forward. It addresses two questions:

Chapter 2: Data Models and Query Languages

Data models are probably the most important part of developing software — they shape not just how we write the code, but how we *think about the problem*. Each.

Chapter 3: Storage and Retrieval

As an application developer, you usually just call your database and trust it to do the right thing. But to choose the right database and tune it properly, you.

Chapter 4: Encoding and Evolution

Applications change over time — requirements evolve, new features are added, bugs are fixed. Your data model must evolve too. But in large systems, you can't.

Chapter 5: Replication

**Replication** means keeping a copy of the same data on multiple machines (connected via a network). Reasons to replicate:

Chapter 6: Partitioning

For very large datasets or very high query throughput, a single machine is not enough. **Partitioning** (also called sharding) breaks the data into.

Chapter 7: Transactions

Real applications are messy — the database can crash, network connections can drop, multiple clients write concurrently, and partial reads of partially updated.

Chapter 8: The Trouble with Distributed Systems

Working with distributed systems requires a fundamentally different mindset than single-machine programming. In a single process, if something works once, it.

Chapter 9: Consistency and Consensus

Chapter 8 cataloged everything that can go wrong in distributed systems. This chapter asks: **given all those failure modes, what guarantees can we actually.

Designing Data-Intensive Applications

Modern applications are not **compute-intensive** (CPU is rarely the bottleneck) — they are **data-intensive**. The real challenges are: