CAP Theorem Explained via Global Databases
In the era of hyper-scale applications, the dream of a "global database" that is simultaneously fast, always available, and perfectly consistent everywhere is the holy grail of engineering. However, as any staff engineer who has managed production outages at 3 AM knows, distributed systems are governed by the unforgiving laws of physics and logic. The CAP theorem—Consistency, Availability, and Partition Tolerance—remains the most critical framework for making these architectural trade-offs.
When we design systems for global giants like Netflix or Stripe, we aren't just choosing a database; we are choosing which failure modes we can live with. A global database distributed across AWS regions in US-East-1, EU-West-1, and AP-Southeast-1 must navigate the reality that network partitions are inevitable. When the undersea cable between New York and London is cut, your system must decide: do I stop accepting writes to ensure data integrity (CP), or do I keep the lights on and sort out the mess later (AP)?
This post explores the practical application of CAP theorem through the lens of a Global Inventory Management System—a classic high-stakes environment where a double-sold item results in a failed customer experience, but a system outage results in millions of dollars in lost revenue.
System Requirements
To design a production-grade global inventory system, we must balance the need for real-time stock accuracy with the requirement that users across the globe experience sub-second latency.
The following table estimates the capacity required to support a platform at the scale of a global retailer during a peak event like Black Friday.
| Metric | Scale Requirement |
|---|---|
| Total Managed SKUs | 500 Million |
| Peak Write Throughput | 50,000 requests/sec |
| Peak Read Throughput | 2,000,000 requests/sec |
| Data Retention | 5 Years (Audit Logs) |
| Global Latency Target | < 150ms (p99) |
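Two things fall straight out of these numbers. First, the read-to-write ratio is roughly 40:1 (2,000,000 / 50,000), so the read path can lean heavily on regional replicas and caches while the write path carries the consistency burden. Second, assuming traffic splits roughly evenly across the three regions mentioned above, each regional cell would still need to absorb on the order of 650,000 to 700,000 reads per second at peak.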
High-Level Architecture
For a global inventory system, we utilize a "Cell-based Architecture." Each region operates its own localized stack, but they synchronize via a global backbone. We have two primary choices: a CP system like Google Spanner (using TrueTime for external consistency) or an AP system like Amazon DynamoDB (using global tables and last-writer-wins).
In this architecture, we prioritize Consistency (C) for inventory deductions to prevent overselling. This means during a network partition (P), the system may sacrifice Availability (A) for cross-region writes to maintain the "Source of Truth."
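One way to make that policy explicit is to tag each operation with the consistency mode it requires. The sketch below uses hypothetical operation names rather than any vendor API; the point is that stock lookups can tolerate staleness while inventory deductions cannot:

```go
package cells

// ConsistencyMode captures the CAP trade-off we accept for a given operation.
type ConsistencyMode int

const (
	// LocalRead serves from the regional cell: fast and available, possibly stale (AP-leaning).
	LocalRead ConsistencyMode = iota
	// GlobalQuorum requires a cross-region majority: correct, but it may reject
	// writes during a partition (CP-leaning).
	GlobalQuorum
)

// ModeFor encodes the policy from this section: stock lookups tolerate staleness,
// inventory deductions must never oversell.
func ModeFor(operation string) ConsistencyMode {
	switch operation {
	case "view_stock", "browse_catalog":
		return LocalRead
	case "deduct_stock", "reserve_stock":
		return GlobalQuorum
	default:
		return GlobalQuorum // fail safe: prefer consistency when unsure
	}
}
```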
Detailed Design: Implementing Quorum-Based Writes
To achieve consistency in a distributed environment, we often implement a Quorum-based approach. In a system with $N$ replicas, a write is successful only if $W$ nodes acknowledge it, and a read is successful only if $R$ nodes respond. To ensure consistency we require $W + R > N$, which guarantees that every read set overlaps the most recent write set in at least one node. For example, with $N = 3$, choosing $W = 2$ and $R = 2$ satisfies the inequality: any two replicas a reader contacts must include one that acknowledged the latest write.
The following Go snippet demonstrates a simplified coordinator pattern for a global write that enforces a strict quorum, ensuring we don't commit an inventory change unless the majority of nodes agree.
```go
package inventory

import (
	"errors"
	"sync"
)

// InventoryNode is a regional replica holding a copy of the stock count for a SKU.
type InventoryNode struct {
	ID    string
	Mu    sync.Mutex
	Stock int
}

// PrepareWrite simulates a distributed lock or atomic increment on the replica.
func (n *InventoryNode) PrepareWrite(skuID string, delta int) error {
	n.Mu.Lock()
	defer n.Mu.Unlock()
	n.Stock += delta
	return nil
}

// Coordinator fans writes out to the replicas that own a given SKU.
type Coordinator struct {
	Replicas map[string][]*InventoryNode
}

func (s *Coordinator) GetNodesForSKU(skuID string) []*InventoryNode { return s.Replicas[skuID] }

// CommitWrite is the second phase that makes a prepared change durable (stubbed here).
func (s *Coordinator) CommitWrite(skuID string) error { return nil }

// UpdateStock applies an inventory change only if a strict majority of replicas
// acknowledge the prepare phase; otherwise it fails rather than risk divergence.
func (s *Coordinator) UpdateStock(skuID string, delta int) error {
	nodes := s.GetNodesForSKU(skuID)
	requiredQuorum := (len(nodes) / 2) + 1

	var wg sync.WaitGroup
	results := make(chan bool, len(nodes))
	for _, node := range nodes {
		wg.Add(1)
		go func(n *InventoryNode) {
			defer wg.Done()
			if err := n.PrepareWrite(skuID, delta); err == nil {
				results <- true
			}
		}(node)
	}
	wg.Wait()
	close(results)

	successCount := 0
	for range results {
		successCount++
	}
	if successCount < requiredQuorum {
		return errors.New("insufficient nodes for quorum: consistency at risk")
	}
	return s.CommitWrite(skuID)
}
```

This implementation favors Consistency. If the network between regions is degraded and the coordinator cannot reach a majority of nodes, the write fails, preserving the integrity of the inventory count at the cost of availability.
Database Schema and Indexing
For global scale, we use a partitioned schema. We shard by sku_id so that all operations for a specific SKU land on the same consensus group, while the regional_stock table below is further partitioned by region_id for localized reads and writes.
The SQL implementation uses optimistic locking (a version_token column) to handle concurrent updates without holding heavy row-level locks that hurt throughput.
```sql
-- Partitioned by region_id for localized performance
CREATE TABLE regional_stock (
sku_id VARCHAR(64) NOT NULL,
region_id VARCHAR(16) NOT NULL,
current_quantity INT NOT NULL,
version_token INT NOT NULL,
PRIMARY KEY (sku_id, region_id)
) PARTITION BY LIST (region_id);
CREATE INDEX idx_sku_lookup ON regional_stock (sku_id);
-- Atomic update with version check (Optimistic Locking)
UPDATE regional_stock
SET current_quantity = current_quantity - 1,
version_token = version_token + 1
WHERE sku_id = 'IPHONE-15-PRO'
AND region_id = 'US-EAST'
AND current_quantity > 0
AND version_token = :last_seen_version;
```
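On the application side, this conditional UPDATE is usually wrapped in a read-compare-retry loop: fetch the current version_token, attempt the update, and try again if a concurrent writer bumped the version first. Below is a minimal sketch using Go's database/sql package, assuming a PostgreSQL-compatible driver and the regional_stock table above (the helper name and retry policy are illustrative, not a fixed API):

```go
package inventory

import (
	"database/sql"
	"errors"
	"fmt"
)

// ErrOutOfStock signals that no retry can succeed because the quantity is exhausted.
var ErrOutOfStock = errors.New("sku out of stock in region")

// DecrementStock reads the current version_token, attempts the conditional UPDATE,
// and retries when another writer has already bumped the version.
func DecrementStock(db *sql.DB, skuID, regionID string, maxRetries int) error {
	for attempt := 0; attempt < maxRetries; attempt++ {
		var qty, version int
		err := db.QueryRow(
			`SELECT current_quantity, version_token
			   FROM regional_stock
			  WHERE sku_id = $1 AND region_id = $2`,
			skuID, regionID,
		).Scan(&qty, &version)
		if err != nil {
			return err
		}
		if qty <= 0 {
			return ErrOutOfStock
		}

		res, err := db.Exec(
			`UPDATE regional_stock
			    SET current_quantity = current_quantity - 1,
			        version_token    = version_token + 1
			  WHERE sku_id = $1 AND region_id = $2
			    AND current_quantity > 0
			    AND version_token = $3`,
			skuID, regionID, version,
		)
		if err != nil {
			return err
		}
		if n, _ := res.RowsAffected(); n == 1 {
			return nil // our version matched; the decrement committed
		}
		// Version changed underneath us: a concurrent writer won the race. Retry.
	}
	return fmt.Errorf("giving up after %d conflicting attempts", maxRetries)
}
```

Because the WHERE clause re-checks both the quantity and the version, a lost race costs one extra round trip rather than a lock held across regions.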
Scaling Strategy
Scaling from 1,000 to 1,000,000+ users requires moving from a single primary database to a multi-master or globally distributed consensus model.
As we scale, the "P" in CAP becomes a daily reality rather than a theoretical edge case. To handle 1M+ users, we implement Geo-Partitioning, where data for European customers is pinned to EU regions to reduce latency, while still allowing global visibility through a unified API layer.
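A sketch of that routing layer, with hypothetical region names and endpoints (real deployments would typically drive this from a placement service or the database's native geo-partitioning features), might pin each customer's rows to a home region and route requests accordingly:

```go
package geo

import "fmt"

// homeRegion pins a customer's data to the region closest to them; EU customers
// stay in EU cells for latency and data-residency reasons.
var homeRegion = map[string]string{
	"DE": "eu-west-1",
	"FR": "eu-west-1",
	"US": "us-east-1",
	"SG": "ap-southeast-1",
}

// RegionFor returns the cell that owns a customer's rows, falling back to a
// default region when the country is unmapped.
func RegionFor(countryCode string) string {
	if r, ok := homeRegion[countryCode]; ok {
		return r
	}
	return "us-east-1"
}

// Endpoint resolves the regional API endpoint behind the unified global API layer.
func Endpoint(countryCode string) string {
	return fmt.Sprintf("https://inventory.%s.internal.example.com", RegionFor(countryCode))
}
```

Requests that need global visibility (for example, an operations dashboard summing stock across regions) still go through the unified API layer rather than a single cell.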
Failure Modes and Resilience
In a global system, failure is the default state. We use the Circuit Breaker pattern to keep a single slow or unreachable region from triggering cascading failures across the entire global network.
When a network partition occurs, the circuit breakers guarding cross-region writes in our CP-based inventory system trip to the "Open" state. This prevents the application from hanging while waiting for a quorum that cannot be reached; instead it returns a "Service Temporarily Unavailable" or "Retry Later" message to the user, a classic trade-off where we sacrifice Availability to prevent data corruption.
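A hand-rolled sketch of that breaker (illustrative only; production systems often use a library such as sony/gobreaker) trips to Open after a run of consecutive cross-region failures and fails fast until a cool-down elapses:

```go
package resilience

import (
	"errors"
	"sync"
	"time"
)

// ErrCircuitOpen is returned immediately while the breaker is Open, so callers
// can surface a "Retry Later" response instead of hanging on an unreachable quorum.
var ErrCircuitOpen = errors.New("cross-region writes unavailable: retry later")

type CircuitBreaker struct {
	mu          sync.Mutex
	failures    int
	maxFailures int           // consecutive failures before tripping Open
	openUntil   time.Time     // while in the future, the breaker is Open
	cooldown    time.Duration // how long to stay Open before probing again
}

func NewCircuitBreaker(maxFailures int, cooldown time.Duration) *CircuitBreaker {
	return &CircuitBreaker{maxFailures: maxFailures, cooldown: cooldown}
}

// Call wraps a cross-region operation. While Open, it fails fast; otherwise it
// runs the operation and updates the consecutive-failure count.
func (cb *CircuitBreaker) Call(op func() error) error {
	cb.mu.Lock()
	if time.Now().Before(cb.openUntil) {
		cb.mu.Unlock()
		return ErrCircuitOpen
	}
	cb.mu.Unlock()

	err := op()

	cb.mu.Lock()
	defer cb.mu.Unlock()
	if err != nil {
		cb.failures++
		if cb.failures >= cb.maxFailures {
			cb.openUntil = time.Now().Add(cb.cooldown) // trip to Open
			cb.failures = 0
		}
		return err
	}
	cb.failures = 0 // success closes the breaker again
	return nil
}
```

Wrapping the quorum write from the earlier snippet in cb.Call(...) gives exactly the behavior described above: during a partition the breaker trips quickly and the user receives an immediate "Retry Later" instead of a hung request.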
Conclusion
The CAP theorem isn't a barrier; it's a design guide. For global databases, the choice between CP and AP depends entirely on the business domain.
- CP (Consistency/Partition Tolerance): Essential for financial ledgers and inventory (e.g., Google Spanner, TigerBeetle). You cannot afford to be "eventually consistent" when someone is spending money.
- AP (Availability/Partition Tolerance): Ideal for social feeds, metrics, and shopping carts (e.g., Cassandra, DynamoDB). It is better to show a slightly outdated "Like" count than to bring down the entire platform.
By utilizing regional sharding, quorum-based writes, and robust circuit breakers, we can build systems that feel "CA" most of the time, while gracefully degrading when the inevitable network partition strikes.
References
- https://research.google/pubs/pub39966/
- https://www.allthingsdistributed.com/2017/06/amazon-dynamodb-design-patterns.html
- https://db-engines.com/en/blog_post/8
- https://vldb.org/pvldb/vol13/p3372-verbitski.pdf