Skip to main content

🗄️ Database Knowledge Base

A structured guide covering everything you need to know about databases — from foundational SQL concepts to distributed NoSQL systems — with interview questions for each topic.

Topics Covered

#TopicDescription
1Relational FundamentalsSQL, joins, keys, constraints
2Database Design & NormalizationERD, 1NF–BCNF, schema patterns
3Advanced SQLWindow functions, CTEs, recursive queries
4Schema MigrationsFlyway, Liquibase, zero-downtime
5Indexing & Query OptimizationB-Tree, covering indexes, EXPLAIN
6Query Planner & OptimizerCBO, statistics, join algorithms
7Transactions & ConcurrencyACID, isolation levels, MVCC, deadlocks
8Storage Engines & Data StructuresInnoDB, LSM trees, WAL, buffer pool
9Replication & PartitioningLeader-follower, sharding, CAP
10NoSQL & Distributed DatabasesDocument, key-value, wide-column, graph
11Caching StrategiesRedis, eviction, cache patterns, pitfalls
12Performance & MonitoringSlow queries, profiling, connection pooling
13Full-Text SearchInverted index, tsvector, Elasticsearch
14Data Warehousing & OLAPStar schema, ETL/ELT, materialized views
15Database Patterns for MicroservicesOutbox, Saga, CQRS, Event Sourcing
16Time-Series DatabasesTimescaleDB, InfluxDB, Prometheus
17Backup & RecoveryRPO/RTO, PITR, DR checklist
18Database SecuritySQL injection, encryption, auditing
Java / Spring Tip

Throughout this guide, Java and Spring Data / JPA notes are included where relevant to bridge theory and real-world usage.


Advanced Editorial Pass: Database Decision-Making Under Real Constraints

Senior Engineering Focus

  • Choose patterns by access shape, consistency needs, and scaling envelope.
  • Treat schema, indexing, and query behavior as a single design unit.
  • Plan operability early: migration, observability, backup, and recovery.

Failure Modes to Anticipate

  • Tool-first choices that ignore data-model and workload realities.
  • Performance optimization without plan-level evidence.
  • Operational blind spots in replication, failover, and restore flows.

Practical Heuristics

  1. Document data contracts and change strategy before shipping.
  2. Validate assumptions with production-like cardinality and skew.
  3. Define ownership for schema evolution and incident response.

Compare Next