Indexing & Query Optimization

In large-scale database systems and system design interviews, "just add an index" is not a complete answer. Engineers must understand how data is physically structured on disk, how different data structures interact with memory hierarchies, how to analyze query paths using EXPLAIN, and the precise trade-offs of various indexing algorithms.

1. What is an Index & The Hardware Constraint

An index is a separate data structure that stores a subset of a table's columns in organized memory and disk pages so the database can locate records without performing a full table scan.

Trade-offs:

✅ Speeds up SELECT, JOIN, ORDER BY, WHERE
❌ Slows down INSERT, UPDATE, DELETE (index structures must be updated)
❌ Consumes additional disk and RAM storage

The Hardware Constraint: Why We Index

Before discussing tree structures, we must understand the core problem: Disk I/O.

Data in a relational database is continuously serialized onto persistent storage (SSD/HDD). To optimize block storage, databases divide data into fixed-size contiguous chunks called Pages (commonly 8KB in size).

The Memory Hierarchy

┌─────────────────────────────────────────────────────────────┐
│                    Memory Hierarchy                         │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  CPU Registers                                              │
│  ┌──────────────┐                                           │
│  │  < 1 KB      │  Speed: ~0.5 ns                           │
│  │  Direct CPU  │  Access: Instant                          │
│  └──────────────┘                                           │
│         │                                                    │
│         ▼                                                    │
│  CPU Cache (L1/L2/L3)                                       │
│  ┌──────────────┐                                           │
│  │  32 KB - 12 MB│ Speed: 1-10 ns                           │
│  │  Very Fast   │  Access: Very Fast                        │
│  └──────────────┘                                           │
│         │                                                    │
│         ▼                                                    │
│  Main Memory (RAM)                                          │
│  ┌──────────────┐                                           │
│  │  8 - 64 GB   │  Speed: 100 ns                            │
│  │  Fast        │  Access: Fast                            │
│  └──────────────┘                                           │
│         │                                                    │
│         ▼                                                    │
│  SSD Storage                                                │
│  ┌──────────────┐                                           │
│  │  256 GB - 4 TB│ Speed: 100 μs                            │
│  │  Medium      │  Access: Medium                          │
│  └──────────────┘                                           │
│         │                                                    │
│         ▼                                                    │
│  HDD Storage                                                │
│  ┌──────────────┐                                           │
│  │  1 - 10 TB   │  Speed: 10 ms                            │
│  │  Slow        │  Access: Slow                            │
│  └──────────────┘                                           │
│                                                              │
│  Key Insight: Disk I/O is 100,000x slower than RAM!         │
│                                                              │
└─────────────────────────────────────────────────────────────┘

The Sequential Scan Bottleneck

When querying a database without an index, the engine executes a Full Table Scan:

Fetches a page from disk into RAM.
Scans the ~100 rows within that page for the target value.
Evicts the page and fetches the next one.

The Math Behind the Bottleneck

If you have 100 million users, and each 8KB page holds 100 rows, your table spans 1 million pages. At approximately 100 microseconds per SSD-to-RAM round trip, scanning 1 million pages takes ~100 seconds in the worst case. Even with sequential pre-fetching reducing this to 3–5 seconds, this latency is catastrophic for synchronous application flows.

Indexes are auxiliary data structures stored on disk that act as a map. They allow the database engine to resolve a key to a specific memory offset/page pointer in logarithmic or constant time, fetching only the specific page required into memory.

Visual Example

Without Index (Full Table Scan):
┌─────────────────────────────────────────────────────────────┐
│  Query: SELECT * FROM users WHERE id = 999999               │
│                                                              │
│  Page 1: [1-100]      → Scan → Not found                    │
│  Page 2: [101-200]    → Scan → Not found                    │
│  Page 3: [201-300]    → Scan → Not found                    │
│  ...                                                        │
│  Page 10000: [999901-1000000] → Scan → Found! ✓            │
│                                                              │
│  Result: 10,000 disk I/O operations (~1 second)              │
└─────────────────────────────────────────────────────────────┘

With Index (B-Tree Lookup):
┌─────────────────────────────────────────────────────────────┐
│  Query: SELECT * FROM users WHERE id = 999999               │
│                                                              │
│  Index Page 1: [1-1000000] → Navigate to right child        │
│  Index Page 2: [500001-1000000] → Navigate to right child   │
│  Index Page 3: [750001-1000000] → Navigate to right child   │
│  Data Page: [999901-1000000] → Found! ✓                     │
│                                                              │
│  Result: 4 disk I/O operations (~0.4 ms)                    │
│                                                              │
│  Speedup: 2,500x faster!                                    │
└─────────────────────────────────────────────────────────────┘

B-Trees (B+ Trees): The Industry Workhorse

By far the most common index implementation across modern databases (PostgreSQL, MySQL, InnoDB) is the B-Tree (specifically the B+ Tree).

Structure and Memory Mapping

A B-Tree node is a sorted array of keys and pointers.

Internal Nodes: Contain routing keys and pointers to child nodes (other index pages).
Leaf Nodes: Contain pointers directly to the physical Data Pages (or cluster the data directly in the case of clustered indexes).

Because a node is specifically designed to fit within a single disk page (e.g., 8KB), B-Trees exhibit a massive "fan-out" (branching factor). This keeps the tree remarkably shallow—usually 3 or 4 levels deep—meaning it only takes 3-4 disk I/O operations to find any record among billions.

B-Tree Structure

┌─────────────────────────────────────────────────────────────┐
│                    B-Tree Structure                          │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│                        Root Node                             │
│                    ┌───────────────┐                         │
│                    │  50 | 100    │                         │
│                    └───────┬───────┘                         │
│                            │                                 │
│              ┌─────────────┼─────────────┐                   │
│              │             │             │                   │
│              ▼             ▼             ▼                   │
│        ┌──────────┐ ┌──────────┐ ┌──────────┐              │
│        │  25      │ │  75      │ │  150     │              │
│        └────┬─────┘ └────┬─────┘ └────┬─────┘              │
│             │            │            │                     │
│      ┌──────┴──────┐ ┌──┴──┐ ┌──────┴──────┐              │
│      │             │ │     │ │             │              │
│      ▼             ▼ ▼     ▼ ▼             ▼              │
│   ┌─────┐ ┌─────┐ ┌─────┐ ┌─────┐ ┌─────┐ ┌─────┐        │
│   │ 10  │ │ 40  │ │ 60  │ │ 90  │ │ 125 │ │ 175 │        │
│   └─────┘ └─────┘ └─────┘ └─────┘ └─────┘ └─────┘        │
│     │       │       │       │       │       │             │
│     ▼       ▼       ▼       ▼       ▼       ▼             │
│  [Data]  [Data]  [Data]  [Data]  [Data]  [Data]           │
│                                                              │
│  Key Properties:                                            │
│  - Balanced tree (all leaves at same depth)                │
│  - Sorted keys for range queries                            │
│  - High fan-out (100+ children per node)                   │
│  - Shallow depth (3-4 levels for billions of records)      │
│                                                              │
└─────────────────────────────────────────────────────────────┘

B+ Tree vs B-Tree

B+ Tree (used in most databases):

All data stored in leaf nodes
Internal nodes only contain keys for navigation
Leaf nodes linked together for range queries
Better for disk I/O (larger fan-out)

B-Tree:

Data stored in all nodes
Smaller fan-out
Deeper tree
Less efficient for range queries

Why B-Trees Dominate

While B-Trees provide $O(\log N)$ exact lookups, their true superpower is Range Queries and Sorting.

Because keys are stored in strictly sorted order, a query like SELECT * FROM users WHERE age > 50 traverses the tree to find the node for 50, and simply follows the linked list of leaf nodes horizontally, pulling only the relevant pages into memory.

B-Tree Operations

Insertion:

def insert(btree, key, value):
    # Find appropriate leaf node
    leaf = find_leaf(btree, key)

    # Insert key into leaf
    if leaf.has_space():
        leaf.insert(key, value)
    else:
        # Split leaf node
        split_leaf(leaf, key, value)

        # Propagate split up if necessary
        propagate_split(leaf.parent)

Deletion:

def delete(btree, key):
    # Find leaf node containing key
    leaf = find_leaf(btree, key)

    # Remove key from leaf
    leaf.remove(key)

    # Rebalance if necessary
    if leaf.is_underflow():
        rebalance(leaf)

Search:

def search(btree, key):
    node = btree.root

    while not node.is_leaf():
        # Find appropriate child
        child = find_child(node, key)
        node = child

    # Search in leaf node
    return leaf.find(key)

Clustered vs Non-Clustered Indexes

Clustered Index:

Data stored directly in leaf nodes
Only one clustered index per table
Usually on primary key
Faster lookups (no extra I/O)

Non-Clustered Index:

Leaf nodes contain pointers to data
Multiple non-clustered indexes per table
Extra I/O to fetch data
More flexible

Clustered Index:
┌─────────────────────────────────────────────────────────────┐
│  Leaf Node: [10, data] [20, data] [30, data]               │
│  Data stored directly in index                              │
│  One I/O to get both index and data                         │
└─────────────────────────────────────────────────────────────┘

Non-Clustered Index:
┌─────────────────────────────────────────────────────────────┐
│  Leaf Node: [10, ptr] [20, ptr] [30, ptr]                  │
│  Pointers to data pages                                     │
│  Two I/Os: one for index, one for data                      │
└─────────────────────────────────────────────────────────────┘

Composite Indexes & Column Order

CREATE INDEX idx_user_date ON orders (user_id, created_at, status);

The left-prefix rule: the index is usable only if queries filter on columns from left to right without skipping.

Query	Uses Index?
`WHERE user_id = 1`	✅ (leftmost)
`WHERE user_id = 1 AND created_at > '2024-01-01'`	✅
`WHERE user_id = 1 AND status = 'active'`	✅ (user_id only, status skips)
`WHERE created_at > '2024-01-01'`	❌ (skips user_id)
`WHERE status = 'active'`	❌

Rule of thumb: Place equality columns first, range columns last, high cardinality columns first.

Covering Index

A covering index includes all columns needed by a query — no heap (table) lookup required.

-- Query
SELECT user_id, total FROM orders WHERE status = 'pending';

-- Covering index
CREATE INDEX idx_covering ON orders (status, user_id, total);
-- status in WHERE, user_id + total in SELECT — all in index

MySQL EXPLAIN will show Using index (not Using index condition) for a true covering index.

Hash Indexes: Fast but Inflexible

A Hash Index uses a standard hash map stored on disk. The search key is passed through a hash function to map directly to a bucket that contains the disk pointer.

Hash Index Structure

┌─────────────────────────────────────────────────────────────┐
│                    Hash Index Structure                      │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  Hash Function: hash(key) % bucket_count                   │
│                                                              │
│  ┌─────────────────────────────────────────────────────┐   │
│  │ Bucket 0: [key1, ptr1] [key2, ptr2]                │   │
│  │ Bucket 1: [key3, ptr3]                               │   │
│  │ Bucket 2: [key4, ptr4] [key5, ptr5] [key6, ptr6]    │   │
│  │ Bucket 3: []                                         │   │
│  │ ...                                                  │   │
│  └─────────────────────────────────────────────────────┘   │
│                                                              │
│  Lookup: O(1) average case                                 │
│  Insert: O(1) average case                                 │
│  Delete: O(1) average case                                 │
│                                                              │
└─────────────────────────────────────────────────────────────┘

Hash Index Operations

Insertion:

def insert(hash_index, key, value):
    bucket = hash_function(key) % bucket_count
    hash_index[bucket].append((key, value))

Search:

def search(hash_index, key):
    bucket = hash_function(key) % bucket_count
    for k, v in hash_index[bucket]:
        if k == key:
            return v
    return None

Deletion:

def delete(hash_index, key):
    bucket = hash_function(key) % bucket_count
    for i, (k, v) in enumerate(hash_index[bucket]):
        if k == key:
            del hash_index[bucket][i]
            return

Why are Hash Indexes rarely used in relational databases?

Despite offering $O(1)$ lookups, Hash Indexes explicitly destroy ordering. If you use a Hash Index, queries like WHERE age > 50 or ORDER BY created_at cannot utilize the index, resulting in a full table scan.

When to use them: Hash indexes are relegated to in-memory key-value stores (like Redis or Memcached), where the primary access pattern is strictly direct-key lookup and disk I/O paging is irrelevant.

Hash Index vs B-Tree

Feature	Hash Index	B-Tree
Lookup Time	O(1) average	O(log N)
Range Queries	Not supported	Supported
Sorting	Not supported	Supported
Prefix Searches	Not supported	Supported
Memory Usage	Higher	Lower
Collision Handling	Required	Not required
Best For	Exact lookups	General purpose

Geospatial Indexing: Solving Multi-Dimensional Data

B-trees natively index 1-dimensional data. They fail spectacularly when indexing 2-dimensional data (like Latitude and Longitude).

If you run a bounding box query (WHERE lat BETWEEN 100 AND 400 AND long BETWEEN 20 AND 200), a B-Tree forces the engine to fetch massive continuous strips of latitude pages, massive strips of longitude pages, and perform an extremely expensive set-intersection (merge) in memory.

To solve this, we use specialized algorithms that reduce 2D space into 1D space, or cluster data dynamically.

The Problem with 2D Data

┌─────────────────────────────────────────────────────────────┐
│                    2D Space Problem                          │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  Latitude                                                   │
│    ▲                                                         │
│    │                                                         │
│  400│     ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●    │
│    │     ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●    │
│  300│     ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●    │
│    │     ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●    │
│  200│     ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●    │
│    │     ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●    │
│  100│     ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●    │
│    │     ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●    │
│    └─────────────────────────────────────────────────────►  │
│      20   40   60   80  100  120  140  160  180  200       │
│                         Longitude                           │
│                                                              │
│  Query: Find points in rectangle [100-400, 20-200]         │
│                                                              │
│  B-Tree approach:                                           │
│  - Fetch all points with lat in [100-400]                   │
│  - Fetch all points with long in [20-200]                   │
│  - Intersect results in memory                              │
│  - Result: Massive I/O and memory usage                     │
│                                                              │
└─────────────────────────────────────────────────────────────┘

4.1 Geohashing (Z-Order Curves)

Geohashing recursively subdivides the map into four quadrants (0, 1, 2, 3), continuing downwards to increase precision.

By appending the quadrant numbers (e.g., 3 -> 31 -> 310), spatial proximity is converted into string prefix proximity.
These strings are Base-32 encoded.
Implementation: Once spatial data is converted to 1D strings, the database simply stores these strings inside a standard B-Tree. Finding nearby drivers in a radius becomes a simple string prefix match in the B-Tree. Redis uses Geohashing natively.

Geohashing Example

┌─────────────────────────────────────────────────────────────┐
│                    Geohashing Process                        │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  Level 1:                                                   │
│  ┌───────────────┐                                          │
│  │  0  │  1     │                                          │
│  ├─────┼─────┤  │                                          │
│  │  2  │  3     │                                          │
│  └───────────────┘                                          │
│                                                              │
│  Level 2 (for quadrant 3):                                  │
│  ┌───────────────┐                                          │
│  │ 30 │ 31      │                                          │
│  ├─────┼─────┤  │                                          │
│  │ 32 │ 33      │                                          │
│  └───────────────┘                                          │
│                                                              │
│  Level 3 (for quadrant 31):                                 │
│  ┌───────────────┐                                          │
│  │ 310 │ 311    │                                          │
│  ├─────┼─────┤  │                                          │
│  │ 312 │ 313    │                                          │
│  └───────────────┘                                          │
│                                                              │
│  Result: "313" represents a specific geographic region       │
│  Nearby regions share prefixes: "310", "311", "312"        │
│                                                              │
└─────────────────────────────────────────────────────────────┘

Geohashing Implementation

import base64

def geohash(latitude, longitude, precision=5):
    """Convert lat/long to geohash"""
    BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"
    lat_range = [-90.0, 90.0]
    long_range = [-180.0, 180.0]
    geohash = []
    bits = 0
    bit = 0
    even = True

    while len(geohash) < precision:
        if even:
            mid = (long_range[0] + long_range[1]) / 2
            if longitude > mid:
                bits |= (1 << bit)
                long_range[0] = mid
            else:
                long_range[1] = mid
        else:
            mid = (lat_range[0] + lat_range[1]) / 2
            if latitude > mid:
                bits |= (1 << bit)
                lat_range[0] = mid
            else:
                lat_range[1] = mid

        even = not even
        bit += 1

        if bit == 5:
            geohash.append(BASE32[bits])
            bits = 0
            bit = 0

    return ''.join(geohash)

def nearby_geohashes(geohash):
    """Get nearby geohashes for radius search"""
    neighbors = []
    # Implementation depends on precision level
    # For precision 5, check 8 surrounding cells
    return neighbors

4.2 Quad Trees

Quad trees split the world recursively similar to Geohashes, but they map the space to a tree structure and utilize a density constant k. A grid is only subdivided into four smaller grids if the number of items inside it exceeds k. This ensures memory is not wasted on empty oceanic spaces and depth is focused on high-density areas (like Manhattan).

Quad Tree Structure

┌─────────────────────────────────────────────────────────────┐
│                    Quad Tree Structure                        │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  Root (entire map)                                          │
│  ┌─────────────────────────────────────────────────────┐    │
│  │                                                     │    │
│  │  ┌──────────┐  ┌──────────┐                        │    │
│  │  │  NW      │  │  NE      │                        │    │
│  │  │  ● ●     │  │  ● ● ●   │                        │    │
│  │  │  ● ●     │  │  ● ● ●   │                        │    │
│  │  └──────────┘  └──────────┘                        │    │
│  │                                                     │    │
│  │  ┌──────────┐  ┌──────────┐                        │    │
│  │  │  SW      │  │  SE      │                        │    │
│  │  │  ●       │  │  ● ● ● ● │                        │    │
│  │  │  ●       │  │  ● ● ● ● │                        │    │
│  │  └──────────┘  └──────────┘                        │    │
│  │                                                     │    │
│  └─────────────────────────────────────────────────────┘    │
│                                                              │
│  Adaptive subdivision based on point density                │
│  Empty areas not subdivided                                │
│  Dense areas subdivided more deeply                        │
│                                                              │
└─────────────────────────────────────────────────────────────┘

Quad Tree Implementation

class QuadTreeNode:
    def __init__(self, bounds, capacity=4):
        self.bounds = bounds  # (x1, y1, x2, y2)
        self.capacity = capacity
        self.points = []
        self.children = None

    def insert(self, point):
        if not self.contains(point):
            return False

        if len(self.points) < self.capacity or self.children is None:
            self.points.append(point)
            return True

        if self.children is None:
            self.subdivide()

        return any(child.insert(point) for child in self.children)

    def subdivide(self):
        x1, y1, x2, y2 = self.bounds
        mid_x = (x1 + x2) / 2
        mid_y = (y1 + y2) / 2

        self.children = [
            QuadTreeNode((x1, y1, mid_x, mid_y), self.capacity),
            QuadTreeNode((mid_x, y1, x2, mid_y), self.capacity),
            QuadTreeNode((x1, mid_y, mid_x, y2), self.capacity),
            QuadTreeNode((mid_x, mid_y, x2, y2), self.capacity)
        ]

    def query(self, bounds):
        results = []
        if not self.intersects(bounds):
            return results

        for point in self.points:
            if self.point_in_rect(point, bounds):
                results.append(point)

        if self.children:
            for child in self.children:
                results.extend(child.query(bounds))

        return results

4.3 R-Trees (Region Trees)

R-trees (predecessors derived from Quad Trees) represent the modern standard for disk-based geospatial indexes. Instead of rigidly dividing the map into quadrants, R-Trees dynamically cluster objects into overlapping Minimum Bounding Rectangles (MBRs).

Implementation: When searching for a point, the engine traverses down intersecting bounding boxes. This is the underlying algorithm powering PostGIS (the spatial extension for PostgreSQL).

R-Tree Structure

┌─────────────────────────────────────────────────────────────┐
│                    R-Tree Structure                          │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  Root Node                                                  │
│  ┌─────────────────────────────────────────────────────┐    │
│  │                                                     │    │
│  │  ┌──────────────┐  ┌──────────────┐               │    │
│  │  │  MBR A       │  │  MBR B       │               │    │
│  │  │  ┌────────┐  │  │  ┌────────┐  │               │    │
│  │  │  │ ● ● ●  │  │  │  │ ● ●    │  │               │    │
│  │  │  │ ● ● ●  │  │  │  │ ● ●    │  │               │    │
│  │  │  └────────┘  │  │  └────────┘  │               │    │
│  │  └──────────────┘  └──────────────┘               │    │
│  │                                                     │    │
│  │  ┌──────────────┐  ┌──────────────┐               │    │
│  │  │  MBR C       │  │  MBR D       │               │    │
│  │  │  ┌────────┐  │  │  ┌────────┐  │               │    │
│  │  │  │ ● ●    │  │  │  │ ● ● ●  │  │               │    │
│  │  │  │ ● ●    │  │  │  │ ● ● ●  │  │               │    │
│  │  │  └────────┘  │  │  └────────┘  │               │    │
│  │  └──────────────┘  └──────────────┘               │    │
│  │                                                     │    │
│  └─────────────────────────────────────────────────────┘    │
│                                                              │
│  Overlapping MBRs for efficient spatial queries              │
│  Dynamic clustering based on data distribution               │
│                                                              │
└─────────────────────────────────────────────────────────────┘

R-Tree Implementation

class RTreeNode:
    def __init__(self, is_leaf=False, capacity=4):
        self.is_leaf = is_leaf
        self.capacity = capacity
        self.children = []
        self.mbr = None  # Minimum Bounding Rectangle

    def insert(self, item):
        if self.is_leaf:
            self.children.append(item)
            self.update_mbr()
            return True

        # Find best child to insert
        best_child = self.find_best_child(item)
        if best_child.insert(item):
            self.update_mbr()
            return True

        # Split if necessary
        if len(self.children) >= self.capacity:
            self.split()

        return False

    def find_best_child(self, item):
        # Find child with minimum MBR enlargement
        best_child = None
        min_enlargement = float('inf')

        for child in self.children:
            enlargement = self.calculate_enlargement(child.mbr, item.mbr)
            if enlargement < min_enlargement:
                min_enlargement = enlargement
                best_child = child

        return best_child

    def query(self, bounds):
        results = []
        if not self.intersects(bounds):
            return results

        if self.is_leaf:
            for item in self.children:
                if self.intersects(item.mbr, bounds):
                    results.append(item)
        else:
            for child in self.children:
                results.extend(child.query(bounds))

        return results

Inverted Indexes: Full-Text Search

If you need to query partial string matches (e.g., WHERE business_name LIKE '%pizza%'), a B-Tree is rendered useless. Because B-trees are sorted lexicographically, they can optimize prefix searches ('pizza%') but wildcard suffix searches force a Full Table Scan.

The Solution: The Inverted Index. Instead of mapping a Row ID to a string, an Inverted Index tokenizes text into discrete words and maps each word to a list of Document IDs.

Token	Posting List (Document IDs/Pointers)
`fast`	`[doc1, doc2]`
`reliable`	`[doc1]`
`range`	`[doc3]`

When a user searches for "fast", the engine simply looks up the token fast in a hash map or B-Tree, retrieves the pointers [doc1, doc2], and pulls those specific disk pages into RAM.

Implementation: This architecture is the core engine behind full-text search technologies like Elasticsearch, Lucene, and PostgreSQL's tsvector/tsquery.

Inverted Index Structure

┌─────────────────────────────────────────────────────────────┐
│                    Inverted Index Structure                  │
├─────────────────────────────────────────────────────────────┤
│                                                              │
│  Documents:                                                  │
│  Doc 1: "The quick brown fox jumps over the lazy dog"       │
│  Doc 2: "A fast brown fox runs quickly"                     │
│  Doc 3: "The lazy dog sleeps"                               │
│                                                              │
│  Inverted Index:                                             │
│  ┌─────────────────────────────────────────────────────┐    │
│  │ Token         │ Posting List                        │    │
│  ├───────────────┼─────────────────────────────────────┤    │
│  │ "the"         │ [1, 3]                              │    │
│  │ "quick"       │ [1]                                 │    │
│  │ "brown"       │ [1, 2]                              │    │
│  │ "fox"         │ [1, 2]                              │    │
│  │ "jumps"       │ [1]                                 │    │
│  │ "over"        │ [1]                                 │    │
│  │ "lazy"        │ [1, 3]                              │    │
│  │ "dog"         │ [1, 3]                              │    │
│  │ "a"           │ [2]                                 │    │
│  │ "fast"        │ [2]                                 │    │
│  │ "runs"        │ [2]                                 │    │
│  │ "quickly"     │ [2]                                 │    │
│  │ "sleeps"      │ [3]                                 │    │
│  └─────────────────────────────────────────────────────┘    │
│                                                              │
│  Query: "brown fox"                                          │
│  - Lookup "brown": [1, 2]                                    │
│  - Lookup "fox": [1, 2]                                     │
│  - Intersect: [1, 2]                                        │
│  - Result: Doc 1, Doc 2                                     │
│                                                              │
└─────────────────────────────────────────────────────────────┘

Inverted Index Implementation

import re
from collections import defaultdict

class InvertedIndex:
    def __init__(self):
        self.index = defaultdict(list)
        self.documents = {}

    def add_document(self, doc_id, text):
        self.documents[doc_id] = text
        tokens = self.tokenize(text)

        for token in tokens:
            if doc_id not in self.index[token]:
                self.index[token].append(doc_id)

    def tokenize(self, text):
        # Simple tokenization
        text = text.lower()
        tokens = re.findall(r'\b\w+\b', text)
        return tokens

    def search(self, query):
        query_tokens = self.tokenize(query)

        if not query_tokens:
            return []

        # Get posting lists for all tokens
        posting_lists = [self.index.get(token, []) for token in query_tokens]

        # Intersect posting lists
        result = set(posting_lists[0])
        for posting_list in posting_lists[1:]:
            result.intersection_update(posting_list)

        return list(result)

    def search_with_ranking(self, query):
        query_tokens = self.tokenize(query)
        scores = defaultdict(int)

        for token in query_tokens:
            for doc_id in self.index.get(token, []):
                scores[doc_id] += 1

        # Sort by score
        return sorted(scores.items(), key=lambda x: x[1], reverse=True)

Advanced Features

Stemming:

from nltk.stem import PorterStemmer

class InvertedIndexWithStemming(InvertedIndex):
    def __init__(self):
        super().__init__()
        self.stemmer = PorterStemmer()

    def tokenize(self, text):
        tokens = super().tokenize(text)
        # Apply stemming
        return [self.stemmer.stem(token) for token in tokens]

Stop Words:

class InvertedIndexWithStopWords(InvertedIndex):
    STOP_WORDS = {'the', 'a', 'an', 'and', 'or', 'in', 'on', 'at', 'to'}

    def tokenize(self, text):
        tokens = super().tokenize(text)
        # Remove stop words
        return [token for token in tokens if token not in self.STOP_WORDS]

Phrase Search:

def phrase_search(index, phrase):
    tokens = index.tokenize(phrase)
    if len(tokens) < 2:
        return index.search(phrase)

    # Get documents containing all tokens
    candidate_docs = set(index.index.get(tokens[0], []))
    for token in tokens[1:]:
        candidate_docs.intersection_update(index.index.get(token, []))

    # Check phrase proximity
    results = []
    for doc_id in candidate_docs:
        text = index.documents[doc_id]
        if phrase in text.lower():
            results.append(doc_id)

    return results

Analyzing Queries with EXPLAIN

EXPLAIN SELECT * FROM orders WHERE user_id = 5 AND status = 'pending';

Key EXPLAIN fields (MySQL)

Field	Look For
`type`	`const` > `eq_ref` > `ref` > `range` > `index` > `ALL` (worst)
`key`	Which index was chosen (NULL = no index)
`rows`	Estimated rows scanned
`Extra`	`Using index` (good), `Using filesort` (bad), `Using temporary` (bad)

EXPLAIN ANALYZE SELECT ...;  -- PostgreSQL: actual execution stats

PostgreSQL EXPLAIN example

Seq Scan on orders  (cost=0.00..4320.00 rows=85000 width=50)
  Filter: (status = 'pending')
→ Bad: full scan. Add index on status.

Index Scan using idx_status on orders  (cost=0.43..8.45 rows=1 width=50)
  Index Cond: (status = 'pending')
→ Good: index used.

Common Query Optimization Techniques

1. Avoid functions on indexed columns

-- ❌ Bad: index on created_at not used
WHERE YEAR(created_at) = 2024

-- ✅ Good: range on indexed column
WHERE created_at >= '2024-01-01' AND created_at < '2025-01-01'

2. Avoid implicit type conversion

-- ❌ Bad: user_id is INT but compared to VARCHAR
WHERE user_id = '42'

-- ✅ Good
WHERE user_id = 42

3. Use EXISTS instead of IN for subqueries

-- ❌ Potentially slow for large subquery
SELECT * FROM users WHERE id IN (SELECT user_id FROM orders WHERE total > 100);

-- ✅ Better: stops at first match
SELECT * FROM users u WHERE EXISTS (
    SELECT 1 FROM orders o WHERE o.user_id = u.id AND o.total > 100
);

4. Pagination: avoid OFFSET on large tables

-- ❌ Bad: scans and discards 100,000 rows
SELECT * FROM orders ORDER BY id LIMIT 20 OFFSET 100000;

-- ✅ Good: keyset pagination
SELECT * FROM orders WHERE id > 100000 ORDER BY id LIMIT 20;

5. SELECT only needed columns

-- ❌ Bad: fetches all columns
SELECT * FROM users;

-- ✅ Good: allows covering index
SELECT id, email FROM users WHERE active = true;

Index Selectivity & Cardinality

Cardinality = number of distinct values in a column.

High cardinality (e.g. email, user_id) → indexes very effective
Low cardinality (e.g. status with 3 values, boolean) → indexes often not used by optimizer (full scan cheaper)

-- Check cardinality in MySQL
SELECT COUNT(DISTINCT status) / COUNT(*) AS selectivity FROM orders;
-- > 0.01 = low selectivity, index may not help

Index Maintenance

-- MySQL: analyze table statistics
ANALYZE TABLE orders;

-- PostgreSQL: rebuild bloated index
REINDEX INDEX idx_orders_status;

-- Find unused indexes (PostgreSQL)
SELECT * FROM pg_stat_user_indexes WHERE idx_scan = 0;

-- Find missing indexes (MySQL slow query log + pt-query-digest)

Spring / JPA Notes

// Add indexes via JPA annotations
@Entity
@Table(name = "orders", indexes = {
    @Index(name = "idx_user_status", columnList = "user_id, status"),
    @Index(name = "idx_created_at",  columnList = "created_at")
})
public class Order { ... }

// Enable SQL logging to spot bad queries

## How Database Indexes Work Internally

### Index Storage

**On-Disk Storage:**

┌─────────────────────────────────────────────────────────────┐ │ Index Storage on Disk │ ├─────────────────────────────────────────────────────────────┤ │ │ │ Disk Layout: │ │ ┌─────────────────────────────────────────────────────┐ │ │ │ Index Header │ │ │ │ - Root page pointer │ │ │ │ - Index metadata │ │ │ ├─────────────────────────────────────────────────────┤ │ │ │ Index Pages (8KB each) │ │ │ │ - Internal nodes │ │ │ │ - Leaf nodes │ │ │ ├─────────────────────────────────────────────────────┤ │ │ │ Data Pages (8KB each) │ │ │ │ - Actual table data │ │ │ └─────────────────────────────────────────────────────┘ │ │ │ │ Page Structure: │ │ ┌─────────────────────────────────────────────────────┐ │ │ │ Page Header (96 bytes) │ │ │ │ - Page number │ │ │ │ - Page type (index/data) │ │ │ │ - Free space offset │ │ │ ├─────────────────────────────────────────────────────┤ │ │ │ Index Entries │ │ │ │ - Key value │ │ │ │ - Row ID / Page pointer │ │ │ ├─────────────────────────────────────────────────────┤ │ │ │ Free Space │ │ │ └─────────────────────────────────────────────────────┘ │ │ │ └─────────────────────────────────────────────────────────────┘

### Index Creation Process

```sql
-- Create index
CREATE INDEX idx_users_email ON users(email);

-- What happens internally:
-- 1. Scan entire table
-- 2. Extract email values and row IDs
-- 3. Sort by email
-- 4. Build B-Tree structure
-- 5. Write to disk

Internal Process:

def create_index(table, column):
    # Step 1: Scan table
    rows = scan_table(table)

    # Step 2: Extract key-value pairs
    key_values = [(row[column], row.id) for row in rows]

    # Step 3: Sort by key
    key_values.sort(key=lambda x: x[0])

    # Step 4: Build B-Tree
    btree = BTree()
    for key, row_id in key_values:
        btree.insert(key, row_id)

    # Step 5: Write to disk
    write_to_disk(btree)

    return btree

Index Maintenance

Insert Operation:

def insert_with_index(table, row):
    # Insert into table
    table.insert(row)

    # Update all indexes
    for index in table.indexes:
        key = row[index.column]
        index.insert(key, row.id)

Update Operation:

def update_with_index(table, row_id, new_values):
    # Get old row
    old_row = table.get(row_id)

    # Update table
    table.update(row_id, new_values)

    # Update indexes
    for index in table.indexes:
        old_key = old_row[index.column]
        new_key = new_values[index.column]

        if old_key != new_key:
            index.delete(old_key, row_id)
            index.insert(new_key, row_id)

Delete Operation:

def delete_with_index(table, row_id):
    # Get row
    row = table.get(row_id)

    # Delete from table
    table.delete(row_id)

    # Update indexes
    for index in table.indexes:
        key = row[index.column]
        index.delete(key, row_id)

Index Statistics

Statistics Collection:

class IndexStatistics:
    def __init__(self, index):
        self.index = index
        self.stats = {}

    def collect_statistics(self):
        # Number of distinct values
        self.stats['distinct_values'] = self.count_distinct()

        # Null values
        self.stats['null_values'] = self.count_nulls()

        # Average key length
        self.stats['avg_key_length'] = self.avg_key_length()

        # Index depth
        self.stats['depth'] = self.index.depth()

        # Page count
        self.stats['page_count'] = self.index.page_count()

        return self.stats

    def count_distinct(self):
        # Count distinct values in index
        distinct = set()
        for key in self.index.keys():
            distinct.add(key)
        return len(distinct)

    def count_nulls(self):
        # Count null values
        null_count = 0
        for key in self.index.keys():
            if key is None:
                null_count += 1
        return null_count

Query Optimization

Query Planner:

class QueryPlanner:
    def __init__(self, database):
        self.database = database

    def plan_query(self, query):
        # Parse query
        parsed = self.parse_query(query)

        # Identify usable indexes
        usable_indexes = self.find_usable_indexes(parsed)

        # Estimate costs
        costs = self.estimate_costs(parsed, usable_indexes)

        # Choose best plan
        best_plan = min(costs, key=lambda x: x['cost'])

        return best_plan

    def find_usable_indexes(self, query):
        usable_indexes = []

        for table in query.tables:
            for index in table.indexes:
                if self.can_use_index(query, index):
                    usable_indexes.append(index)

        return usable_indexes

    def can_use_index(self, query, index):
        # Check if index can be used for WHERE clause
        for condition in query.where_conditions:
            if condition.column == index.column:
                if condition.operator in ['=', '>', '<', '>=', '<=']:
                    return True

        # Check if index can be used for ORDER BY
        if query.order_by.column == index.column:
            return True

        return False

    def estimate_costs(self, query, indexes):
        costs = []

        # Full table scan cost
        full_scan_cost = self.estimate_full_scan_cost(query)
        costs.append({
            'plan': 'full_scan',
            'cost': full_scan_cost
        })

        # Index scan costs
        for index in indexes:
            index_cost = self.estimate_index_cost(query, index)
            costs.append({
                'plan': f'index_scan_{index.name}',
                'cost': index_cost
            })

        return costs

Real-World Implementations

PostgreSQL Indexes

B-Tree Index:

-- Create B-Tree index (default)
CREATE INDEX idx_users_email ON users(email);

-- Create unique index
CREATE UNIQUE INDEX idx_users_username ON users(username);

-- Create composite index
CREATE INDEX idx_users_name_age ON users(last_name, first_name, age);

-- Create partial index
CREATE INDEX idx_active_users ON users(email) WHERE is_active = true;

-- Create expression index
CREATE INDEX idx_users_lower_email ON users(LOWER(email));

Hash Index:

-- Create hash index (only for equality)
CREATE INDEX idx_users_id_hash ON users USING HASH(id);

-- Note: Hash indexes are not WAL-logged and not replicated

GIN Index (Generalized Inverted Index):

-- Create GIN index for array columns
CREATE INDEX idx_users_tags ON users USING GIN(tags);

-- Create GIN index for full-text search
CREATE INDEX idx_documents_content ON documents USING GIN(to_tsvector('english', content));

-- Full-text search query
SELECT * FROM documents
WHERE to_tsvector('english', content) @@ to_tsquery('english', 'fast');

GiST Index (Generalized Search Tree):

-- Create GiST index for geometric data
CREATE INDEX idx_locations_point ON locations USING GIST(point);

-- Create GiST index for full-text search
CREATE INDEX idx_documents_content_gist ON documents USING GIST(to_tsvector('english', content));

-- Range queries
SELECT * FROM locations
WHERE point <@ box '((0,0),(10,10))';

BRIN Index (Block Range Index):

-- Create BRIN index for large tables
CREATE INDEX idx_orders_created_at ON orders USING BRIN(created_at);

-- BRIN is very space-efficient but less precise
-- Good for time-series data

MySQL Indexes

B-Tree Index:

-- Create B-Tree index (default in InnoDB)
CREATE INDEX idx_users_email ON users(email);

-- Create unique index
CREATE UNIQUE INDEX idx_users_username ON users(username);

-- Create composite index
CREATE INDEX idx_users_name_age ON users(last_name, first_name, age);

-- Create full-text index
CREATE FULLTEXT INDEX idx_articles_content ON articles(content);

Hash Index (Memory Engine):

-- Hash indexes only available in Memory engine
CREATE TABLE users (
    id INT PRIMARY KEY,
    email VARCHAR(255),
    INDEX (email) USING HASH
) ENGINE=Memory;

MongoDB Indexes

Single Field Index:

// Create single field index
db.users.createIndex({ email: 1 });

// Create unique index
db.users.createIndex({ username: 1 }, { unique: true });

// Create sparse index
db.users.createIndex({ email: 1 }, { sparse: true });

Compound Index:

// Create compound index
db.users.createIndex({ lastName: 1, firstName: 1, age: -1 });

// Note: Order matters for queries

Text Index:

// Create text index
db.articles.createIndex({ content: "text", title: "text" });

// Text search
db.articles.find({ $text: { $search: "fast reliable" } });

Geospatial Index:

// Create 2dsphere index
db.locations.createIndex({ location: "2dsphere" });

// Geospatial query
db.locations.find({
    location: {
        $near: {
            $geometry: { type: "Point", coordinates: [-73.9667, 40.78] },
            $maxDistance: 1000
        }
    }
});

Hash Index:

// Create hash index (sharding key)
db.users.createIndex({ _id: "hashed" });

Redis Indexes

Sorted Sets (ZSET):

## Integration Patterns

### Pattern 1: Covering Index

Create an index that includes all columns needed for a query.

```sql
-- Query
SELECT id, name, email FROM users WHERE age > 30;

-- Covering index
CREATE INDEX idx_users_age_name_email ON users(age, name, email);

-- Benefits:
-- - No need to access table data
-- - All data available in index
-- - Faster query execution

Pattern 2: Composite Index

Create an index on multiple columns for complex queries.

-- Query
SELECT * FROM orders
WHERE user_id = 123
AND status = 'completed'
AND created_at > '2024-01-01';

-- Composite index
CREATE INDEX idx_orders_user_status_date
ON orders(user_id, status, created_at);

-- Note: Column order matters!
-- Most selective column first

Pattern 3: Partial Index

Create an index for a subset of data.

-- Query
SELECT * FROM users WHERE is_active = true AND email LIKE '%@gmail.com';

-- Partial index
CREATE INDEX idx_active_gmail_users
ON users(email)
WHERE is_active = true AND email LIKE '%@gmail.com';

-- Benefits:
-- - Smaller index size
-- - Faster maintenance
-- - Better query performance

Pattern 4: Expression Index

Create an index on computed expressions.

-- Query
SELECT * FROM users WHERE LOWER(email) = '[email protected]';

-- Expression index
CREATE INDEX idx_users_lower_email ON users(LOWER(email));

-- Benefits:
-- - Index on computed values
-- - Case-insensitive searches

Pattern 5: Functional Index

Create an index on function results.

-- Query
SELECT * FROM products WHERE price * quantity > 1000;

-- Functional index (PostgreSQL)
CREATE INDEX idx_products_total ON products((price * quantity));

-- Benefits:
-- - Index on computed values
-- - Faster complex queries

Pros and Cons

B-Tree Indexes

Pros: ✅ Versatile: Supports exact match, range, and sort operations ✅ Efficient: O(log N) lookup time ✅ Balanced: Consistent performance regardless of data distribution ✅ Widely Supported: Available in most databases ✅ Predictable: Stable performance characteristics

Cons: ❌ Size: Can be 20-30% of table size ❌ Write Overhead: Slows down INSERT/UPDATE/DELETE operations ❌ Memory: Requires memory for caching ❌ Maintenance: Needs periodic rebuilding

Hash Indexes

Pros: ✅ Fast: O(1) average lookup time ✅ Simple: Easy to implement ✅ Memory Efficient: Smaller than B-Tree for exact lookups

Cons: ❌ Limited: Only supports equality operations ❌ No Range: Cannot support range queries ❌ No Sorting: Cannot support ORDER BY ❌ Collisions: Need collision handling

Geospatial Indexes

Pros: ✅ Spatial: Efficient for location-based queries ✅ Radius: Supports radius searches ✅ Bounding Box: Supports bounding box queries

Cons: ❌ Complex: More complex to implement ❌ Size: Can be large for dense data ❌ Precision: Trade-off between precision and performance

Inverted Indexes

Pros: ✅ Full-Text: Efficient for text search ✅ Flexible: Supports various search operations ✅ Ranking: Can support relevance ranking

Cons: ❌ Size: Can be very large ❌ Maintenance: Complex to maintain ❌ Updates: Expensive to update

📋 System Design Cheatsheet: Choosing the Right Index

In a system design interview, your choice of index explicitly dictates the performance of your system. Use this decision matrix:

Do you need fast lookups?
- No ➡️ Full Table Scan (Sequential).
- Yes ➡️ Proceed to step 2.
Is your dataset small (e.g., configurations)?
- Yes ➡️ Full Table Scan (fits entirely in RAM).
- No ➡️ Proceed to step 3.
What is the geometric access pattern of the query?
- Full-Text / Wildcard Search: ➡️ Inverted Index (Elasticsearch, Lucene).
- Spatial / Radius / Location: ➡️ Geospatial Index (Geohashing in Redis, R-Trees in PostGIS).
- Strictly Exact-Match Key-Value (In-Memory): ➡️ Hash Index (Redis, Memcached).
- Relational / Sorting / Range / Everything Else: ➡️ B-Tree (Default RDBMS Index).
- Bonus: If you have a read-heavy workload with infrequent writes, consider a Clustered Index to store data directly in the leaf nodes of the B-Tree for even faster access.

🎯 Best Practices

Index Strategically: Only create indexes for frequently queried columns
Monitor Usage: Track index usage and remove unused indexes
Consider Write Overhead: Balance read performance with write performance
Use Covering Indexes: Include all needed columns in the index
Maintain Indexes: Regularly rebuild or reorganize fragmented indexes
Test Thoroughly: Test index performance with realistic data volumes
Document Decisions: Maintain architecture decision records for index choices
Plan for Growth: Design indexes that scale with data growth
Use Partial Indexes: Index only the data you need
Monitor Statistics: Keep index statistics up to date for query optimization

🎯 Interview Questions

Database & Query Execution Questions

Q1. What is a B-Tree index and how does it work?

A B-Tree is a self-balancing tree where each node holds multiple keys and pointers to child nodes. Searches traverse from root to leaf in O(log n). In B+ Trees (used in practice), only leaf nodes hold data pointers and leaves are linked for efficient range scans.

Q2. What is a covering index?

An index that includes all columns required by a query (in WHERE, SELECT, ORDER BY), so the DB can answer the query entirely from the index without reading the actual table row. Results in Using index in MySQL EXPLAIN.

Q3. Explain the left-prefix rule for composite indexes.

A composite index on (a, b, c) is only useful if the query's WHERE clause includes a, then optionally b, then optionally c — without skipping. A query filtering on b alone won't use the index.

Q4. When would you NOT add an index?

Low-cardinality columns (boolean, enum with few values)

Tables with heavy write loads where index maintenance cost outweighs read gains

Very small tables where a full scan is cheaper than index traversal

Columns never used in WHERE, JOIN, or ORDER BY

Q5. What does Using filesort mean in EXPLAIN and how do you fix it?

The DB cannot use an index to satisfy the ORDER BY and must sort results in memory or on disk. Fix by creating an index that matches the ORDER BY clause (and its sort direction), potentially as part of a composite index with WHERE columns.

Q6. What is the difference between a clustered and non-clustered index?

A clustered index determines the physical order of data on disk — the table IS the index (InnoDB uses the PK as clustered). There can be only one. A non-clustered index is a separate structure with pointers back to the heap rows. Secondary indexes in InnoDB store the PK value (not a direct pointer) to look up the row.

Q7. How do you find slow queries in MySQL/PostgreSQL?

MySQL: Enable slow_query_log, set long_query_time; analyze with pt-query-digest

PostgreSQL: pg_stat_statements extension for query stats; auto_explain for plans

Both: EXPLAIN ANALYZE to see actual vs. estimated row counts

Q8. What is index bloat and how do you handle it?

Over time, updates/deletes leave dead index entries ("bloat"). In PostgreSQL, run VACUUM (autovacuum handles this) or REINDEX. In MySQL, OPTIMIZE TABLE rebuilds the table and indexes. Bloat increases index size and slows scans.

System Design & Architectural Index Questions

Beginner Level

Q1: What is a database index? A: A database index is a data structure that improves the speed of data retrieval operations on a database table at the cost of additional writes and storage space. Indexes are used to quickly locate data without having to search every row in a database table.

Q2: Why do we need indexes? A: Indexes are needed to improve query performance. Without indexes, the database must perform a full table scan to find data, which is slow for large tables. Indexes allow the database to find data quickly using logarithmic or constant time lookups.

Q3: What is a B-Tree index? A: A B-Tree index is a balanced tree data structure that keeps data sorted and allows searches, sequential access, insertions, and deletions in logarithmic time. It's the most common type of index used in databases.

Q4: What is the difference between clustered and non-clustered indexes? A: A clustered index determines the physical order of data in a table. There can be only one clustered index per table. A non-clustered index is a separate structure that contains a pointer to the actual data. There can be multiple non-clustered indexes per table.

Q5: What is a composite index? A: A composite index is an index on multiple columns. It can improve performance for queries that filter on multiple columns. The order of columns in a composite index matters for query optimization.

Intermediate Level

Q6: How does a B-Tree index work? A: A B-Tree index works by storing keys in a balanced tree structure. Each node contains multiple keys and pointers to child nodes. The tree is balanced, meaning all leaf nodes are at the same depth. To find a key, the database traverses the tree from the root to the appropriate leaf node.

Q7: What is a hash index and when would you use it? A: A hash index uses a hash function to map keys to buckets. It provides O(1) average lookup time but only supports equality operations. Hash indexes are useful for exact lookups where range queries are not needed.

Q8: What is a covering index? A: A covering index is an index that contains all the columns needed for a query. When a query can be satisfied entirely from the index, the database doesn't need to access the table data, which improves performance.

Q9: How do indexes affect write performance? A: Indexes slow down write operations because the database must update the index structures for every INSERT, UPDATE, and DELETE operation. The more indexes a table has, the slower the write performance.

Q10: What is a partial index? A: A partial index is an index created on a subset of table rows that satisfy a predicate condition. Partial indexes are smaller and faster to maintain than full indexes, and they're useful for queries that only need to index a subset of data.

Advanced Level

Q11: Explain how geospatial indexing works. A: Geospatial indexing uses specialized data structures like R-Trees, Quad Trees, or Geohashing to index multi-dimensional spatial data. These structures reduce 2D space to 1D space or cluster data dynamically to efficiently answer spatial queries like radius searches and bounding box queries.

Q12: What is an inverted index and how is it used for full-text search? A: An inverted index maps words to document IDs. For each word in a document, the index stores a list of documents containing that word. When searching for a phrase, the database looks up each word in the index and intersects the posting lists to find documents containing all the words.

Q13: How does the query optimizer choose which index to use? A: The query optimizer uses statistics about the data distribution and index characteristics to estimate the cost of different query plans. It considers factors like selectivity, index depth, and I/O costs to choose the most efficient plan.

Q14: What is index fragmentation and how do you fix it? A: Index fragmentation occurs when index pages become disorganized due to insertions, updates, and deletions. Fragmentation can be fixed by rebuilding or reorganizing indexes, which reorganizes the index pages to improve performance.

Q15: How do you design indexes for a read-heavy vs. write-heavy workload? A: For read-heavy workloads, create more indexes to optimize query performance. For write-heavy workloads, minimize indexes to reduce write overhead. Use covering indexes to avoid table lookups, and consider partial indexes for frequently queried subsets of data.

Senior Level

Q16: Design an indexing strategy for a social media platform. A: Implement a combination of indexes: B-Tree indexes for user profiles and relationships, inverted indexes for content search, geospatial indexes for location-based features, and time-series indexes for activity feeds. Use covering indexes for common queries and partial indexes for active users.

Q17: How do you handle index maintenance in a distributed database? A: Implement distributed index maintenance with consistent hashing for index distribution, use gossip protocols for index synchronization, implement version vectors for conflict resolution, and use background processes for index rebuilding and optimization.

Q18: How do you optimize indexes for high-throughput systems? A: Use memory-optimized indexes, implement index partitioning, use covering indexes to avoid table lookups, implement index caching strategies, use read replicas for read-heavy workloads, and implement index pre-warming for critical queries.

Q19: How do you monitor and tune index performance? A: Monitor index usage statistics, track query performance metrics, analyze index fragmentation, monitor index size growth, track index maintenance overhead, and implement automated index tuning based on workload patterns.

Q20: How do you design indexes for multi-tenant applications? A: Implement tenant-specific indexes, use partial indexes with tenant filters, implement index partitioning by tenant, use composite indexes with tenant ID, and implement index isolation strategies to prevent cross-tenant index contention.

Senior Deep Dive: Advanced Topics: Advanced Topics

Topic 1: Index Partitioning

Horizontal Partitioning:

-- Partition index by range
CREATE INDEX idx_orders_date
ON orders(created_at)
PARTITION BY RANGE (created_at);

-- Partition index by hash
CREATE INDEX idx_users_id
ON users(id)
PARTITION BY HASH(id) PARTITIONS 16;

Vertical Partitioning:

-- Split index across multiple tables
CREATE TABLE users_hot (
    id INT PRIMARY KEY,
    name VARCHAR(100),
    email VARCHAR(255),
    INDEX idx_email (email)
);

CREATE TABLE users_cold (
    id INT PRIMARY KEY,
    bio TEXT,
    preferences JSON
);

Topic 2: Index Compression

Prefix Compression:

class CompressedBTree:
    def __init__(self):
        self.nodes = []

    def compress_keys(self, keys):
        compressed = []
        prev_key = ""

        for key in keys:
            # Find common prefix
            common_prefix = 0
            while (common_prefix < len(prev_key) and
                   common_prefix < len(key) and
                   prev_key[common_prefix] == key[common_prefix]):
                common_prefix += 1

            # Store prefix length and suffix
            compressed.append((common_prefix, key[common_prefix:]))
            prev_key = key

        return compressed

    def decompress_keys(self, compressed):
        keys = []
        prev_key = ""

        for prefix_len, suffix in compressed:
            key = prev_key[:prefix_len] + suffix
            keys.append(key)
            prev_key = key

        return keys

Topic 3: Index Caching

Buffer Pool Management:

class IndexBufferPool:
    def __init__(self, size):
        self.size = size
        self.cache = {}
        self.lru = []

    def get_page(self, page_id):
        if page_id in self.cache:
            # Update LRU
            self.lru.remove(page_id)
            self.lru.append(page_id)
            return self.cache[page_id]

        # Load from disk
        page = self.load_from_disk(page_id)

        # Evict if necessary
        if len(self.cache) >= self.size:
            evicted = self.lru.pop(0)
            del self.cache[evicted]

        # Add to cache
        self.cache[page_id] = page
        self.lru.append(page_id)

        return page

    def load_from_disk(self, page_id):
        # Load page from disk
        return disk.read(page_id)

Topic 4: Index Statistics

Histogram Statistics:

class IndexStatistics:
    def __init__(self, index):
        self.index = index
        self.histogram = {}

    def collect_histogram(self, buckets=100):
        # Collect value distribution
        values = self.index.get_all_values()

        # Create histogram
        min_val = min(values)
        max_val = max(values)
        bucket_size = (max_val - min_val) / buckets

        for value in values:
            bucket = int((value - min_val) / bucket_size)
            self.histogram[bucket] = self.histogram.get(bucket, 0) + 1

        return self.histogram

    def estimate_selectivity(self, min_val, max_val):
        # Estimate selectivity for range query
        total = sum(self.histogram.values())
        matching = 0

        for bucket, count in self.histogram.items():
            bucket_min = self.get_bucket_min(bucket)
            bucket_max = self.get_bucket_max(bucket)

            if bucket_min >= min_val and bucket_max <= max_val:
                matching += count
            elif bucket_min < max_val and bucket_max > min_val:
                # Partial match
                matching += count * self.get_overlap_ratio(
                    bucket_min, bucket_max, min_val, max_val
                )

        return matching / total

Topic 5: Index Rebuilding

Online Index Rebuilding:

class OnlineIndexRebuilder:
    def __init__(self, database):
        self.database = database

    def rebuild_index(self, table, index_name):
        # Create new index
        new_index = self.create_new_index(table, index_name)

        # Copy data from old index
        self.copy_index_data(table, index_name, new_index)

        # Switch indexes atomically
        self.switch_indexes(table, index_name, new_index)

        # Drop old index
        self.drop_old_index(table, index_name)

    def create_new_index(self, table, index_name):
        # Get index definition
        definition = self.get_index_definition(table, index_name)

        # Create new index with temporary name
        temp_name = f"{index_name}_temp"
        self.create_index(table, temp_name, definition)

        return temp_name

    def copy_index_data(self, table, old_index, new_index):
        # Copy data in batches
        batch_size = 1000
        offset = 0

        while True:
            batch = self.get_index_batch(table, old_index, offset, batch_size)

            if not batch:
                break

            # Insert into new index
            for item in batch:
                self.insert_into_index(table, new_index, item)

            offset += batch_size

    def switch_indexes(self, table, old_index, new_index):
        # Atomically switch indexes
        with self.database.transaction():
            self.database.execute(f"""
                ALTER TABLE {table}
                DROP INDEX {old_index}
            """)

            self.database.execute(f"""
                ALTER TABLE {table}
                RENAME INDEX {new_index} TO {old_index}
            """)

1. What is an Index & The Hardware Constraint​

Trade-offs:​

The Hardware Constraint: Why We Index​

The Memory Hierarchy​

The Sequential Scan Bottleneck​

Visual Example​

B-Trees (B+ Trees): The Industry Workhorse​

Structure and Memory Mapping​

B-Tree Structure​

B+ Tree vs B-Tree​

Why B-Trees Dominate​

B-Tree Operations​

Clustered vs Non-Clustered Indexes​

Composite Indexes & Column Order​

Covering Index​

Hash Indexes: Fast but Inflexible​

Hash Index Structure​

Hash Index Operations​

Hash Index vs B-Tree​

Geospatial Indexing: Solving Multi-Dimensional Data​

The Problem with 2D Data​

4.1 Geohashing (Z-Order Curves)​

Geohashing Example​

Geohashing Implementation​

4.2 Quad Trees​

Quad Tree Structure​

Quad Tree Implementation​

4.3 R-Trees (Region Trees)​

R-Tree Structure​

R-Tree Implementation​

Inverted Indexes: Full-Text Search​

Inverted Index Structure​

Inverted Index Implementation​

Advanced Features​

Analyzing Queries with EXPLAIN​

Key EXPLAIN fields (MySQL)​

PostgreSQL EXPLAIN example​

Common Query Optimization Techniques​

1. Avoid functions on indexed columns​

2. Avoid implicit type conversion​

3. Use EXISTS instead of IN for subqueries​

4. Pagination: avoid OFFSET on large tables​

5. SELECT only needed columns​

Index Selectivity & Cardinality​

Index Maintenance​

Spring / JPA Notes​

Index Maintenance​

Index Statistics​

Query Optimization​

Real-World Implementations​

PostgreSQL Indexes​

MySQL Indexes​

MongoDB Indexes​

Redis Indexes​

Pattern 2: Composite Index​

Pattern 3: Partial Index​

Pattern 4: Expression Index​

Pattern 5: Functional Index​

Pros and Cons​

B-Tree Indexes​

Hash Indexes​

Geospatial Indexes​

Inverted Indexes​

📋 System Design Cheatsheet: Choosing the Right Index​

🎯 Best Practices​

🎯 Interview Questions​

Database & Query Execution Questions​

System Design & Architectural Index Questions​

Beginner Level​

Intermediate Level​

Advanced Level​

Senior Level​

Senior Deep Dive: Advanced Topics: Advanced Topics​

Topic 1: Index Partitioning​

Topic 2: Index Compression​

Topic 3: Index Caching​

Topic 4: Index Statistics​

Topic 5: Index Rebuilding​

Advanced Editorial Pass: Index Strategy as Cost Engineering​

Senior Engineering Focus​

1. What is an Index & The Hardware Constraint

Trade-offs:

The Hardware Constraint: Why We Index

The Memory Hierarchy

The Sequential Scan Bottleneck

Visual Example

B-Trees (B+ Trees): The Industry Workhorse

Structure and Memory Mapping

B-Tree Structure

B+ Tree vs B-Tree

Why B-Trees Dominate

B-Tree Operations

Clustered vs Non-Clustered Indexes

Composite Indexes & Column Order

Covering Index

Hash Indexes: Fast but Inflexible

Hash Index Structure

Hash Index Operations

Hash Index vs B-Tree

Geospatial Indexing: Solving Multi-Dimensional Data

The Problem with 2D Data

4.1 Geohashing (Z-Order Curves)

Geohashing Example

Geohashing Implementation

4.2 Quad Trees

Quad Tree Structure

Quad Tree Implementation

4.3 R-Trees (Region Trees)

R-Tree Structure

R-Tree Implementation

Inverted Indexes: Full-Text Search

Inverted Index Structure

Inverted Index Implementation

Advanced Features

Analyzing Queries with EXPLAIN

Key EXPLAIN fields (MySQL)

PostgreSQL EXPLAIN example

Common Query Optimization Techniques

1. Avoid functions on indexed columns

2. Avoid implicit type conversion

3. Use EXISTS instead of IN for subqueries

4. Pagination: avoid OFFSET on large tables

5. SELECT only needed columns

Index Selectivity & Cardinality

Index Maintenance

Spring / JPA Notes

Index Maintenance

Index Statistics

Query Optimization

Real-World Implementations

PostgreSQL Indexes

MySQL Indexes

MongoDB Indexes

Redis Indexes

Pattern 2: Composite Index

Pattern 3: Partial Index

Pattern 4: Expression Index

Pattern 5: Functional Index

Pros and Cons

B-Tree Indexes

Hash Indexes

Geospatial Indexes

Inverted Indexes

📋 System Design Cheatsheet: Choosing the Right Index

🎯 Best Practices

🎯 Interview Questions

Database & Query Execution Questions

System Design & Architectural Index Questions

Beginner Level

Intermediate Level

Advanced Level

Senior Level

Senior Deep Dive: Advanced Topics: Advanced Topics

Topic 1: Index Partitioning

Topic 2: Index Compression

Topic 3: Index Caching

Topic 4: Index Statistics

Topic 5: Index Rebuilding

Advanced Editorial Pass: Index Strategy as Cost Engineering

Senior Engineering Focus