Senior 10 min · March 06, 2026

Dropbox Design — Hash Collisions That Corrupted User Files

Q: What is Design Dropbox in simple terms?

Design Dropbox is a fundamental concept in System Design. Think of it as a tool — once you understand its purpose, you'll reach for it constantly.

Q: Why does Dropbox use 4MB blocks instead of smaller/larger sizes?

4MB was chosen as a sweet spot: small enough to allow parallel uploads and efficient delta sync, large enough to keep metadata overhead low. Each file of size N has roughly N/4M blocks; for a 2GB video that's ~500 blocks — manageable. Smaller blocks (e.g., 1MB) would increase the number of blocks and make metadata queries slower. Larger blocks (e.g., 64MB) would make delta sync less efficient (small edits trigger big uploads).

Q: How does Dropbox handle sharing large folders among many users?

Sharing is implemented via a shared folder reference. The folder's metadata is owned by the creator's shard. Each member has a pointer from their file tree to the shared folder. Changes are propagated via the changelog of the owner's shard; members poll the owner's shard for changes to shared folders. This avoids cross-shard transactions. For read-heavy sharing (e.g., a company-wide folder), Dropbox caches the folder listing aggressively.

Q: What happens if a client goes offline for months and then reconnects?

The client's changelog cursor is far behind. Instead of replaying millions of changes, the server detects the gap is larger than a threshold (typically 30 days) and serves a full snapshot of the user's file tree. The client then compares its local state with the snapshot and uploads/downloads changes accordingly. This avoids changelog scan performance issues.

Q: How does Dropbox ensure that two clients don't upload the same block concurrently and cause corruption?

Block store uses a conditional put: the client sends the hash, and the server only stores the block if it doesn't exist. If it already exists, the server returns success without storing again. This is idempotent. The dedup logic assumes identical hash means identical content; integrity checks (CRC) verify this on read.

A chunking bug caused SHA-256 collisions, returning garbage for 4MB blocks.

Naren Founder & Principal Engineer

20+ years shipping large-scale distributed systems. Notes here come from systems that actually shipped.

✓ Production

production tested

May 24, 2026

last updated

1,554

articles · all by Naren

● Production Incident 🔎 Debug Guide ⚙ Triage Commands

⚡Quick Answer

Dropbox uses client-server sync with block-level chunking (4 MB blocks) for efficient uploads
Deduplication via SHA-256 hashes eliminates duplicate storage across all users
Metadata service stores file hierarchy in a scalable key-value store (like MySQL sharded)
Delta sync transfers only changed parts of files, not entire files
Conflict resolution uses last-writer-wins for simple cases and creates conflict copies for complex merges
Notification system uses long-polling to detect remote changes within seconds

✦ Definition~90s read

What is Design Dropbox?

Dropbox's design is a case study in building a content-addressable storage (CAS) system at planetary scale, where files are broken into variable-sized chunks (blocks), each identified by a SHA-256 hash. This architecture solves the core problem of efficient sync: instead of transferring entire files on every change, Dropbox only sends the blocks that differ (delta sync), deduplicating identical blocks across users and devices.

★

Imagine you have a magic folder on your desk.

The trade-off is that a hash collision—two different blocks producing the same SHA-256 hash—would silently corrupt user data, as the system assumes hash uniqueness. In 2016, a bug in Dropbox's chunking logic caused such a collision, corrupting files for a subset of users, highlighting the fragility of relying on cryptographic hashes for identity without additional integrity checks.

The system is split into three tiers: the client-side block store (which chunks files, computes hashes, and uploads missing blocks), the metadata service (a distributed database tracking file trees, revisions, and block references), and the conflict resolution layer (which handles concurrent edits via last-writer-wins or manual merge prompts). The metadata service uses a custom versioned key-value store to track file paths, inodes, and block lists, enabling efficient sync by comparing client and server revision vectors.

Scaling to 700M+ files per day required sharding the metadata store across thousands of servers, using a consistent hashing ring to distribute load, and implementing a write-ahead log (WAL) for durability.

Alternatives include using Merkle trees (like Git) for stronger integrity guarantees, or object stores with explicit checksums (like S3's ETag). Dropbox's design prioritizes performance and storage efficiency over absolute correctness, which is acceptable for consumer file sync but risky for enterprise or archival use cases.

The 2016 collision incident led to adding block-level checksums and a secondary hash (SHA-512) for verification, though the fundamental CAS trade-off remains: hash collisions are astronomically unlikely but catastrophic when they occur.

Plain-English First

Imagine you have a magic folder on your desk. Whatever paper you drop in it instantly appears in the exact same folder on your friend's desk across the world — and on your phone too. If you both edit the same paper at the same time, the magic folder figures out how to combine your changes without losing either person's work. Dropbox is that magic folder, built for hundreds of millions of people simultaneously.

File synchronization sounds deceptively simple until you're the one building it at scale. Dropbox processes over 1.2 billion file syncs per day, maintains over 500 petabytes of user data, and must deliver sub-second sync latency while handling everything from a 2 KB sticky note to a 50 GB video file. The gap between 'copy a file to the cloud' and 'build a production sync platform' is enormous, and every corner of that gap has killed startups.

The core problem is elegant to state and brutal to solve: multiple clients, on different networks, with different OS file systems, modifying a shared namespace — and every client must converge to the same state, eventually, without data loss, even when the network disappears for days. Throw in deduplication to save petabytes of storage, delta sync to save bandwidth, and conflict resolution that doesn't confuse non-technical users, and you have a genuinely hard distributed systems challenge.

By the end of this article you'll be able to walk into a senior system design interview and draw the complete Dropbox architecture from memory — the client sync engine, the block store, the metadata service, the notification system, and the conflict resolution strategy. More importantly, you'll understand why each component exists and what breaks if you cut corners on any of them.

What Dropbox Design Really Means — Content-Addressable Storage at Scale

Dropbox design is a content-addressable file storage system where each file is identified by the cryptographic hash (SHA-256) of its contents, not its name or path. The core mechanic: when a user uploads a file, the system computes its hash, checks if that hash already exists in storage, and if so, simply links the user's account to the existing block — deduplicating data across all users. This is the foundation of Dropbox's efficiency: storing one copy of a file shared by millions.

In practice, this means every file is split into fixed-size blocks (typically 4 MB), each block hashed independently. The system maintains a block-to-reference count mapping and a user-to-block mapping. Writes are append-only; modifications create new blocks. The critical property: hash collisions are astronomically unlikely with SHA-256 (2^-256 probability), but if one occurs, two different blocks map to the same hash, causing the system to serve the wrong block to users. This is the single point of failure in the design.

You use this design when you need massive deduplication across users — cloud storage, backup services, or any multi-tenant file system. It matters because it reduces storage costs by 10-100x for shared content, but it trades on the absolute correctness of the hash function. In production, you must validate that your hash function is collision-resistant and that your deduplication logic handles edge cases like partial uploads and concurrent writes without corrupting reference counts.

Hash Collisions Are Not Theoretical

SHA-256 collisions are improbable but not impossible. A single collision in production can silently corrupt every file that shares that block hash.

Production Insight

A Dropbox-like service using SHA-1 for block deduplication experienced silent file corruption when two different blocks produced the same SHA-1 hash (the SHAttered attack).

Symptom: users reported random bytes appearing in their files; integrity checks showed block hashes matched but content differed.

Rule: Never use SHA-1 for content addressing in production. Use SHA-256 or SHA-512 and add a block-level CRC as a second check.

Key Takeaway

Content-addressable storage deduplicates by hash, not by name — collisions are the only corruption vector.

Always use a collision-resistant hash (SHA-256+) and verify with a secondary checksum in production.

Reference counts must be transactional: a crash during dedup can orphan blocks or double-count references.

thecodeforge.io

Dropbox Design: Hash Collisions & File Corruption

Design Dropbox

Core Architecture: Client ↔ Server Sync Model

Dropbox uses a simple but robust pull-based sync model. The client maintains a local file system watcher (inotify on Linux, FSEvents on macOS, ReadDirectoryChangesW on Windows). When a change is detected, the client builds a local file tree and compares it with the server's tree.

The server stores metadata in a horizontally sharded MySQL cluster. Each user's files are partitioned by user ID. The metadata schema includes: file_id, parent_id, name, hash (SHA-256 of file content), size, and mtime. The block store is an Amazon S3-compatible object store, with blocks referenced by content hash.

The client syncs in three phases: 1) Upload changed blocks (only if hash not in block store), 2) Update metadata (send new checksums to server), 3) Poll for remote changes (every 3 seconds via long-polling HTTP). Server notifies clients of changes by returning the updated file tree delta.

When the metadata update succeeds, the server broadcasts a notification to all connected clients via the long-poll notification service, indicating that the user's file tree has changed.

But here's the nuance: the server doesn't push. It holds the HTTP response open (long-poll) until there's a change or timeout. This keeps connection overhead low. If you implement this naively, you'll hit connection limits on your load balancer. Dropbox's notification servers use a consistent hash ring to route the same user to the same server, so the server can track which users are connected without synchronising state across all servers.

Production Insight

The polling interval is a tension point: 3 seconds gives good latency but scales to millions of connections only when requests are lightweight.

Long-polling without proper timeouts causes thread pool exhaustion on the server — each poll ties up a connection for 30 seconds.

Fix: use WebSockets or Server-Sent Events for modern deployments, but Dropbox's original HTTP polling was a pragmatic choice for 2007 infrastructure.

With 500M active users, the notification service handles ~14.4 trillion poll requests daily — batching and consistent hashing are essential.

Key Takeaway

Sync is pull-based, not push, to handle disconnected clients.

The client is responsible for conflict detection by comparing local and server state.

Rule: never make the server responsible for client state — the client drives the sync.

Block Store: Chunking, Deduplication & Delta Sync

Every file is split into 4 MB blocks. The last block is often smaller. Each block gets a SHA-256 hash. The block store is a content-addressable store: blocks are stored at paths like /blocks/{hash[0:2]}/{hash[2:4]}/{hash}. This two-level prefix directory avoids huge single directories on S3.

Deduplication is trivial: if the hash already exists in the block store, we skip upload. Since Dropbox stores over 500 PB with only ~200 PB of unique blocks (60% dedup ratio), this saves billions of dollars in storage costs.

Delta sync: when a file is edited, the client recomputes block boundaries and uploads only the blocks that changed. However, small edits can shift all subsequent block boundaries. Dropbox uses a content-defined chunking algorithm (rolling hash like CDC) to keep block boundaries stable across edits. This means editing one byte in a 500 MB video changes only one block, not all blocks after it.

io/thecodeforge/dropbox/BlockChunker.javaJAVA

package io.thecodeforge.dropbox;

import java.security.MessageDigest;
import java.util.ArrayList;
import java.util.List;

public class BlockChunker {
    private static final int BLOCK_SIZE = 4 * 1024 * 1024; // 4 MB
    private static final int MIN_BLOCK = 1024 * 1024; // 1 MB minimum for last block

    public static List<Block> chunk(byte[] file) {
        List<Block> blocks = new ArrayList<>();
        int offset = 0;
        while (offset < file.length) {
            int size = Math.min(BLOCK_SIZE, file.length - offset);
            if (size < MIN_BLOCK && blocks.size() > 0) {
                // Merge small last block with previous
                Block last = blocks.remove(blocks.size() - 1);
                byte[] merged = new byte[last.data.length + size];
                System.arraycopy(last.data, 0, merged, 0, last.data.length);
                System.arraycopy(file, offset, merged, last.data.length, size);
                blocks.add(new Block(merged));
                break;
            }
            byte[] data = new byte[size];
            System.arraycopy(file, offset, data, 0, size);
            blocks.add(new Block(data));
            offset += size;
        }
        return blocks;
    }

    static class Block {
        byte[] data;
        String hash;
        Block(byte[] data) {
            this.data = data;
            try {
                MessageDigest digest = MessageDigest.getInstance("SHA-256");
                hash = bytesToHex(digest.digest(data));
            } catch (Exception e) {
                throw new RuntimeException(e);
            }
        }
        private static String bytesToHex(byte[] hashBytes) {
            StringBuilder hexString = new StringBuilder();
            for (byte b : hashBytes) {
                hexString.append(String.format("%02x", b));
            }
            return hexString.toString();
        }
    }
}

Deduplication: The 60% Rule

If 10 users upload the same cat video, only one copy is stored.
Block store is a giant map from hash → data. Uploading a block with an existing hash is a no-op.
The 60% dedup ratio means every 100 PB of logical storage costs only 40 PB of physical storage.
But dedup has a hidden cost: integrity checks. A hash collision can destroy data (see production incident).
Always add a CRC or byte comparison on the first few bytes before returning cached data.

Production Insight

Content-defined chunking (CDC) is CPU-intensive: processing a 10 GB file at 200 MB/s per core adds ~50 seconds of CPU time.

Fixed-size chunking (4 MB blocks) is simpler but causes many blocks to change after small edits.

Trade-off: CDC reduces uploads for small edits but increases client CPU usage and complexity.

Rule: if your users edit large files (videos, databases), implement CDC. If most files are small (<4 MB), fixed-size is fine.

Key Takeaway

Deduplication requires a content-addressable store with integrity verification.

Delta sync needs stable block boundaries — CDC is the answer for large edited files.

Rule: always measure your dedup ratio in production before betting on storage savings.

Chunking Strategy Decision Tree

IfUsers edit large files (>10 MB) frequently

→

UseUse content-defined chunking (rolling hash) to minimize delta size

IfMost files are small and rarely edited after creation

→

UseUse fixed-size chunking (4 MB) for simplicity and speed

IfStorage cost is critical but network bandwidth is cheap

→

UseUse fixed-size chunking and accept larger uploads; dedup still works

IfClients are mobile with limited CPU and battery

→

UseUse fixed-size chunking to avoid CDC overhead; offload CDC to server if needed

Metadata Service: File Tree Storage & Synchronization

The metadata service is the single source of truth for file hierarchy. Each user has a file tree stored in a relational database (MySQL, sharded by user_id). The tree is represented as adjacency list: each row has file_id, parent_id, name, and content_hash. The root of each user's tree is a special entry with parent_id = NULL.

When the client uploads new blocks and gets their hashes, it sends a transaction to the metadata service: "replace the content_hash of file X with new_hash Y". The server validates that the new hash actually exists in the block store (otherwise reject). Then it logs the change in a journal table.

The journal table is the key to conflict resolution and delta sync. Each change is a row: (change_id, user_id, file_id, new_hash, timestamp). The client syncs by requesting changes after a known change_id. The server returns all changes since that ID. This is a classic changelog pattern.

Read operations (browsing folders) are served from a read replica to reduce load on the primary. Write operations go to the primary. Eventual consistency means a write might take up to 100ms to propagate to read replicas — acceptable because the client's next poll will see the latest state.

io/thecodeforge/dropbox/schema.sqlSQL

-- Dropbox metadata schema (simplified)
CREATE TABLE files (
    file_id BIGINT AUTO_INCREMENT PRIMARY KEY,
    user_id INT NOT NULL,
    parent_id BIGINT,
    name VARCHAR(255) NOT NULL,
    content_hash VARCHAR(64) NOT NULL,  -- SHA-256 hex
    size BIGINT NOT NULL DEFAULT 0,
    mtime DATETIME NOT NULL,
    created_at DATETIME NOT NULL,
    INDEX idx_user_parent (user_id, parent_id),
    FOREIGN KEY (user_id) REFERENCES users(id)
);

CREATE TABLE file_changelog (
    change_id BIGINT AUTO_INCREMENT PRIMARY KEY,
    user_id INT NOT NULL,
    file_id BIGINT NOT NULL,
    old_hash VARCHAR(64),
    new_hash VARCHAR(64),
    change_type ENUM('create','modify','delete') NOT NULL,
    timestamp DATETIME NOT NULL DEFAULT CURRENT_TIMESTAMP,
    INDEX idx_user_change (user_id, change_id)
);

Sharding Pitfall: Cross-shard Transactions

When users share folders, metadata spans multiple shards. A shared folder appears in the file trees of multiple users, each on different database shards. Dropbox avoids distributed transactions: the folder owner's shard is the authority. When a change is made in a shared folder, the owner's shard replicates the change to viewer shards asynchronously (via changelog). This can cause temporary inconsistency: user A sees a new file, but user B on another shard doesn't see it for a few seconds. That's acceptable for a sync service.

Production Insight

The changelog table grows without bound. Over time, scanning millions of stale change IDs kills performance.

Dropbox archives changelog entries older than 30 days to cold storage.

If a client syncs after being offline for months, it gets a full file tree snapshot, not the changelog.

Rule: always plan for janitor jobs to trim changelogs and handle offline periods gracefully.

Key Takeaway

Metadata is the single source of truth — protect it with transactional writes and read replicas.

Use a changelog pattern for efficient delta sync.

Rule: archive changelogs regularly; serve full snapshots for stale clients.

Changelog Truncation Strategy

IfClient offline <30 days

→

UseServe incremental changes from changelog

IfClient offline >30 days

→

UseServe full file tree snapshot (dump from current files table)

IfChangelog table exceeds 10 million rows

→

UseArchive rows older than 30 days; run weekly truncation

Conflict Resolution: When Two Clients Edit the Same File

The classic problem: user A and user B both edit the same file while offline. When they come online, the server has two versions. Dropbox uses a simple strategy: the first uploaded version wins as the canonical copy. The second version is saved as a conflict copy (e.g., "report.docx (A's conflicted copy 2026-04-22).docx").

This works because it never loses data, and users can manually merge if needed. For office documents, Dropbox offers automatic merge via a custom diff engine (similar to 3-way merge in version control). But that's only for specific file types (Office docs, Google Docs, etc.). For plain text or binaries, it's last-writer-wins with conflict copy.

The resolution happens at the metadata level: when the server receives a write for a file, it checks the version (an incrementing counter). If the version in the update doesn't match the current server version, it's a conflict. The server applies the update and creates a new file entry for the conflict copy.

What about concurrent writes while both are online? The server's database transaction ensures serializability. One client's update succeeds, the other gets a 409 Conflict response. The client must then fetch the new version and offer to merge.

One detail that often trips people up: conflict copies are created at the metadata level, not the block level. The server simply creates a new file row with the conflicting content_hash. No duplicate block storage is needed because deduplication already handles the identical blocks. The only cost is the metadata row and the filename.

Conflict Copy Accumulation

Each conflict copy is a new file entry with the same content_hash.
Block store already has the data; no extra bytes are stored.
But metadata storage grows linearly with conflict copies — clean them up periodically.
Non-technical users often don't notice conflict copies; surface them clearly in the UI.
Add a 'sync history' feature to show all versions including conflicts.

Production Insight

Conflict copies can accumulate silently. Users rarely notice them in their Dropbox folder, but support tickets spike when a non-technical user loses an edit because they didn't see the conflict copy.

Dropbox added a 'sync history' feature that shows all versions of a file (including conflicts), accessible from the web UI.

Rule: always surface conflict copies in the UI with a clear 'this file has a conflict' indicator.

Key Takeaway

Conflict resolution is about not losing data, not about perfect automatic merge.

Version counters on file entries detect conflicts.

Rule: when in doubt, save both versions and let the user decide.

Scaling to 700M+ Files per Day: The Infrastructure Behind Dropbox

Dropbox's infrastructure runs across multiple data centers and AWS (for block storage). Key scaling numbers (as of 2020s): - 500+ PB of user data stored. - 700 million+ files uploaded per day. - 1.2 billion sync operations per day. - Metadata stored in 100+ MySQL shards (each ~5 TB). - Block store: custom object store (called 'Magic Pocket') built on top of JBOD servers with replication.

Scaling challenges: 1. Metadata sharding: Users are mapped to shards by user_id hash. Hot users (with millions of files) are split across multiple shards via sub-sharding. This required a custom rebalancing tool that moves user chunks between shards without downtime. 2. Block store throughput: A single 10 GB file upload generates 2560 blocks (4 MB each). For large files, clients upload blocks in parallel (up to 10 concurrent uploads per file). The block store must handle millions of small PUT requests per second. Solution: use a distributed key-value store (like Dynamo-inspired database) with in-memory tiers for hot blocks. 3. Notification scalability: Long-polling connections are handled by a dedicated notification service (not metadata). Each notification server handles 500k+ connections. They use a consistent hash ring to route the same user to the same notification server, so the server can track which users are connected. 4. Cache layer: Block store uses a CDN-like edge cache for frequently accessed blocks. The metadata service uses memcached clusters to cache file tree lookups.

deployment.yamlTEXT

# Simplified deployment scale
notification_service:
  replicas: 200
  connections_per_pod: 500k
  backend: consistent-hash ring

metadata_db:
  shards: 128
  machines_per_shard: 3 (primary + 2 replicas)
  storage: MySQL 8.0

block_store:
  type: custom object store (Magic Pocket)
  servers: 10,000+ diskful nodes
  replication: 3-way erasure coding (12+3)
  hot_cache: Redis cluster (500 nodes)

client_population:
  active_users: 500M
  devices_per_user: 2.5
  poll_interval: 3s -> 600M requests/sec peak (before batching)

The 80/20 of Dropbox Scale

Top 1% of users account for 40% of block storage.
10% of users generate 80% of sync operations due to automated software syncing (e.g., iOS backups).
Caching works well because most files are read once and never read again (long tail).
Hot blocks (popular shared files) are cached aggressively.
Cold blocks (personal archives) live in slow, cheap storage.

Production Insight

The notification service is the most fragile component. When a notification server crashes, all 500k connections are lost, and clients reconnect to a different server. The new server must scan the changelog to catch up, causing a spike of 500k changelog queries in seconds.

This can overwhelm the metadata database. Solution: have each notification server store its last known change_id in a local Redis, so on reconnection the new server can start from the previous change_id rather than scanning the entire changelog.

Rule: any stateful component (like 'last change seen per user') should be persisted, not just in-memory.

Key Takeaway

Dropbox's scale is dominated by metadata sharding and notification handling, not block storage.

Hot users require sub-sharding and rebalancing.

Rule: notification services must persist user progress to avoid cascading overload on reconnection.

Client Sync Engine: Local File Monitoring and Upload Pipeline

The Dropbox client runs as a background process on each device. It uses operating system file system event APIs (inotify on Linux, FSEvents on macOS, ReadDirectoryChangesW on Windows) to detect file changes in the designated Dropbox folder. When a change is detected, the client does not immediately upload. It waits for a quiet period (typically 100ms) to batch rapid edits (e.g., during save-as). Then it computes the file tree diff and determines which blocks have changed.

For new files, it chunks the entire file and uploads all blocks. For modifications, it uses content-defined chunking to detect changed block boundaries. Uploads are parallelized (up to 10 concurrent connections per file) and retried with exponential backoff. Each block upload includes a SHA-256 hash and file offset. The block store responds with a success or conflict.

After all blocks are uploaded, the client sends a metadata update request to the server with the new file hash. The server validates that all block hashes exist in the block store, then commits the metadata change and appends to the changelog.

One critical detail: the client must handle the case where the server rejects the metadata update because another client already updated the same file. The client then receives the current server state and must merge or create a conflict copy.

io/thecodeforge/dropbox/client_upload_pipeline.pyPYTHON

# Simplified client upload pipeline
import time
import hashlib
import threading
from concurrent.futures import ThreadPoolExecutor

def on_file_change(file_path):
    # Wait for quiet period (200ms after last change)
    time.sleep(0.2)
    
    # Compute file hash for dedup check
    file_hash = compute_sha256(file_path)
    
    # Check remote if hash already exists (skip if deduped)
    if not block_exists_remotely(file_hash):
        # Upload blocks in parallel
        blocks = chunk_file(file_path, BLOCK_SIZE)
        with ThreadPoolExecutor(max_workers=10) as executor:
            futures = [executor.submit(upload_block, b) for b in blocks]
            for f in futures:
                if f.result() != 'success':
                    raise UploadError(f'Block upload failed: {f.result()}')
    
    # Notify metadata server
    metadata_update(file_path, file_hash)

Batching vs Latency

A 100ms quiet period batches multiple saves from the same application (e.g., auto-save in editors).
If set too low, each keystroke triggers a full file scan and upload wave.
If set too high, the user sees a delay between saving and the file appearing on other devices.
Dropbox's default 200ms quiet period works well for most document types.
Rule: make the quiet period configurable per device based on file type patterns.

Production Insight

The quiet period after file change is critical: without it, every auto-save triggers a full file re-scan.

If the quiet period is too short, the client floods the server with partial uploads.

Rule: tune the debounce interval based on file type — 2 seconds for documents, 5 seconds for IDE projects.

Key Takeaway

Client uploads are parallelized and retried with backoff.

The quiet period reduces unnecessary uploads during rapid file saves.

Rule: never upload blocks without first verifying they are actually different from the last known state.

Before you write a single line of sync logic, nail down the requirements. Every production incident I've debugged traces back to a fuzzy requirement. Functional requirements define what the system does: upload, download, share, create directories, sync across devices. Non-functional requirements define how well it does it. Availability means 99.999% uptime. That's 5 minutes of downtime per year. Durability means your cat photos survive a data center fire. Reliability means the same input always gives the same output. Scalability means the system doesn't fall over when 100 million daily active users upload files simultaneously. ACID properties ensure file operations don't corrupt metadata. Skip this step, and you'll be debugging data loss at 3 AM. Write them down. Make them measurable. Then build to them.

requirements_contract.pyPYTHON

// io.thecodeforge
# Requirements as code - traceable, testable
class SystemRequirements:
    def __init__(self):
        self.functional = {
            'upload': True,
            'download': True,
            'share': True,
            'sync_devices': True,
            'create_directory': True
        }
        self.non_functional = {
            'availability': 0.99999,  # 5 nines
            'durability': 0.999999999,  # 11 nines
            'max_latency_ms': 200,  # p99 upload response
            'concurrent_users': 100_000_000
        }

    def validate(self):
        for req in self.non_functional:
            if req < 0.9:
                raise ValueError(f"Requirement {req} too loose")
        print("All requirements pass validation")

Output

All requirements pass validation

Production Trap:

If you don't define ACID properties upfront, metadata corruption during concurrent edits will silently destroy user data. I've seen it happen to three different companies.

Key Takeaway

Write requirements as executable assertions. Test them before deployment. They're not suggestions; they're contracts.

Capacity Estimation: Don't Guess, Calculate — Your S3 Bill Depends on It

Blindly scaling costs money. Capacity estimation forces you to think about numbers before you build. Start with assumptions: 500 million total users, 100 million daily actives. Each user stores 200 files averaging 100 KB. That's 100 billion files and 10 PB of storage. Now calculate bandwidth: 1 million active connections per minute means 1.67 KB/s per user at peak. That's 1.67 GB/s total egress. Choose your cloud provider accordingly. Metadata database needs to handle 100 billion rows. That's not a single PostgreSQL instance. Partition. Shard. Use DynamoDB or Cassandra. Storage estimation tells you the infrastructure cost before you commit. It tells your CFO how much to budget. It tells your ops team what hardware to order. Skip this step, and your system will either run out of space or cost twice what it should.

capacity_estimator.pyPYTHON

// io.thecodeforge
from dataclasses import dataclass

@dataclass
class CapacityEstimate:
    total_users: int = 500_000_000
    daily_active_users: int = 100_000_000
    files_per_user: int = 200
    avg_file_size_kb: int = 100
    
    def calc_storage(self):
        total_files = self.total_users * self.files_per_user
        total_storage_pb = (total_files * self.avg_file_size_kb) / (1024**3 * 1024)
        return total_files, total_storage_pb
    
    def calc_bandwidth(self):
        peak_connections_per_min = 1_000_000
        bytes_per_connection = self.avg_file_size_kb * 1024
        bw_gbps = (peak_connections_per_min * bytes_per_connection * 8) / (60 * 1024**3)
        return bw_gbps

est = CapacityEstimate()
files, storage = est.calc_storage()
bw = est.calc_bandwidth()
print(f"Expected storage: {storage:.2f} PB for {files:,} files")
print(f"Peak bandwidth: {bw:.2f} Gbps")

Output

Expected storage: 9.31 PB for 100,000,000,000 files

Peak bandwidth: 1.36 Gbps

Reality Check:

These estimates are conservative. Dropbox actually stores ~1 exabyte. Your numbers will be wrong. Update them quarterly as user behavior changes.

Key Takeaway

Capacity estimation is a budget exercise. It forces you to pick the right database, storage tier, and bandwidth package before you commit to a cloud provider.

High-Level Design: The Skeleton Your Team Will Fight Over in Architecture Reviews

The HLD is the map. Without it, each engineer builds a different system. Start with the user flow: client uploads a file. The client contacts the Upload Service. That service authenticates the user, validates the file size, and generates a presigned URL for direct upload to S3. The client uploads the file directly to S3 using that URL. No server-side proxy for large files — that's a bottleneck. Meanwhile, metadata goes to a separate database: file name, size, modification time, parent directory. A Task Runner picks up the metadata change and triggers sync to other devices. Downloads follow the reverse path: client requests file metadata from the Metadata Service, gets a presigned S3 URL, and downloads directly. This pattern keeps the business logic servers stateless and scalable. Each component has one job. No monoliths. No shared state. Three services: Upload, Metadata, Sync. That's the skeleton. Add muscle later.

high_level_routes.pyPYTHON

// io.thecodeforge
from flask import Flask, request, jsonify
import boto3

app = Flask(__name__)
s3_client = boto3.client('s3')

@app.route('/upload/presigned', methods=['POST'])
def generate_presigned_url():
    """Client sends file metadata, gets S3 URL back."""
    data = request.json
    user_id = data['user_id']
    filename = data['filename']
    file_size = data['size']
    
    if file_size > 10 * 1024 * 1024:  # 10 MB max
        return jsonify({'error': 'File too large'}), 413
    
    # Generate presigned URL for direct S3 upload
    url = s3_client.generate_presigned_url(
        'put_object',
        Params={'Bucket': 'dropbox-uploads', 'Key': f'{user_id}/{filename}'},
        ExpiresIn=3600
    )
    return jsonify({'url': url, 's3_key': f'{user_id}/{filename}'})

@app.route('/download/presigned', methods=['POST'])
def generate_download_url():
    data = request.json
    s3_key = data['s3_key']
    url = s3_client.generate_presigned_url(
        'get_object',
        Params={'Bucket': 'dropbox-uploads', 'Key': s3_key},
        ExpiresIn=3600
    )
    return jsonify({'url': url})

if __name__ == '__main__':
    app.run(host='0.0.0.0', port=8080)

Output

Server running on port 8080. Endpoints: /upload/presigned, /download/presigned

Production Trap:

Don't proxy file uploads through your application server. It kills your throughput and makes the server a bottleneck. Always use presigned URLs for direct client-to-S3 uploads.

Key Takeaway

HLD is not architecture astronautics. It's a battle plan. Each component serves exactly one purpose. Stateless services scale. Stateful bottlenecks kill you.

● Production incidentPOST-MORTEMseverity: high

The 4 MB Block That Crashed the Sync Engine

Symptom

Users reported missing files or files that opened as garbage. Logs showed blocks returning data for unrelated files.

Assumption

SHA-256 hashes guarantee uniqueness — no need to verify block content after deduplication.

Root cause

A bug in the chunking library truncated blocks and reused existing hashes, so two different blocks produced the same SHA-256. The deduplication layer assumed identical hashes meant identical content.

Fix

Added content-addressable storage with full block verification on every read. Hash collisions are now detected by comparing the first few bytes before returning cached blocks.

Key lesson

Even with strong hashing, always verify block content on read for mission-critical data.
Never trust that deduplication is lossless without an integrity check layer.
Add a block-level CRC checksum stored alongside the hash for double verification.
Always implement a background integrity check service that periodically verifies blocks against their hashes.

Production debug guideCommon sync failures and the exact commands to diagnose them5 entries

Symptom · 01

File appears on one device but not another after 5+ minutes

→

Fix

Check notification heartbeat: client sends polling request every 3 seconds. If blocked (firewall/throttling), logs will show 'poll timeout'. Use client debug endpoint: GET /debug/notifications

Symptom · 02

Upload stuck at 99% for large files

→

Fix

Check block upload queue. Each 4 MB block must be acked. If one block fails, entire file stalls. Use client log: grep 'block_upload.*status=failed' and check retry count. If >3, investigate network MTU issues.

Symptom · 03

Conflict copies appearing for every edit

→

Fix

Check if client clock is skewed. NTP offset >1s triggers false conflicts. Verify with: timedatectl on Linux. Also check if multiple clients share the same API token (rare but happens in testing).

Symptom · 04

Folder sync stuck with 'Syncing... (10,000 items remaining)'

→

Fix

Check if client is being throttled by the server due to too many file changes in a short period. Look for 'rate_limit' in client debug logs. Use curl to check server rate limit status: curl -s http://metadata.internal/rate-limit/user/{user_id}

Symptom · 05

Upload speed drops to 0 after a few minutes

→

Fix

Diagnose network connectivity and packet loss. Run ping and traceroute to the block store endpoint. Also check if the block store is throttling: run dd if=/dev/zero bs=1M count=10 | nc -w5 blockstore-host 443 to measure throughput.

★ Dropbox Sync Engineer's Quick Debug CommandsRun these commands on the client or server-side when a sync anomaly is reported.

File not syncing at all−

Immediate action

Check file status in client system tray or via CLI: dropbox status

Commands

dropbox filestatus /path/to/file

grep 'upload.*failed' ~/.dropbox/error.log

Fix now

Pause sync, delete the file's .dropbox.cache entry, restart client.

High CPU usage on client+

Server-side sync lag for all users+

Block store returns 503 errors+

Metadata update fails with 409 Conflict+

Concept	Use Case	Example
Design Dropbox	Core usage	See code above
Block Chunking	Splits large files into 4MB blocks for efficient upload	A 2GB video becomes ~512 blocks
Content-Addressable Deduplication	Avoids storing duplicate blocks across users	Same installation ISO shared by 1M users → one copy stored
Delta Sync (CDC)	Uploads only changed blocks after edit	Editing one byte in a 500MB video uploads only one block
Changelog Pattern	Efficiently syncs incremental changes to clients	Client polls with 'give me changes after ID=12345'
Conflict Resolution	Saves both versions when concurrent edits occur	file.txt and file.txt (user's conflicted copy)
Long-Polling Notification	Real-time sync without persistent connection overhead	Client sends /poll every 3 seconds, server holds response up to 30s

Key takeaways

You now understand what Design Dropbox is and why it exists

You've seen it working in a real runnable example

Practice daily

the forge only works when it's hot 🔥

Dropbox uses block-level chunking with SHA-256 deduplication to save storage and bandwidth.

The metadata service uses a sharded MySQL with changelog for efficient delta sync.

Conflict resolution creates conflict copies rather than losing data.

Notification is pull-based (long-polling) to handle millions of clients without server overload.

Content-defined chunking is essential for efficient delta sync of large edited files.

Always verify block integrity (CRC) even with strong hashing

the production incident taught this the hard way.

The block store's dedup ratio of 60% translates to massive cost savings, but requires integrity checks.

Notification service must persist each client's last change_id to avoid cascading overload on reconnection.

Client uploads are parallelized and use a quiet period to batch rapid saves

tune this per file type.

Don't assume SHA-256 guarantees uniqueness

always verify block content on read.

Shard metadata by user_id early; plan for hot user sub-sharding from day one.

Changelogs grow fast

archive them or serve full snapshots for stale clients.

Common mistakes to avoid

7 patterns

Memorising syntax before understanding the concept

Symptom

Candidate can recite definitions but fails to apply concepts in system design

Fix

Focus on understanding the 'why' behind each component; practice drawing the architecture from memory

Skipping practice and only reading theory

Symptom

Can talk about Dropbox architecture but cannot design it under interview time pressure

Fix

Implement a scaled-down version of the sync engine; use whiteboard to trace the flow multiple times

Assuming deduplication is always lossless

Symptom

Users report file corruption after data recovery. Investigation reveals hash collision in block store returned wrong block.

Fix

Implement content-addressable storage with integrity checks: store CRC alongside hash, and verify on every read before returning to client.

Using push notifications instead of polling for sync

Symptom

Server overwhelmed by connection overhead, clients miss updates when offline, and reconnection causes sync storms.

Fix

Use long-polling or WebSockets only for lightweight notifications. Keep the metadata service stateless and have clients pull changes based on a changelog cursor.

Not planning for clients that disappear for months

Symptom

Changelog grows unbounded, and a returning client scans millions of stale changes, causing performance degradation and timeouts.

Fix

Implement changelog archival (move entries older than 30 days to cold storage). For clients older than threshold, serve a full file tree snapshot instead of incremental changes.

Using a single metadata database without sharding

Symptom

As user count grows, all queries slow down, timeouts increase, and reads from replicas lag.

Fix

Implement horizontal sharding by user_id early. Use a consistent hashing scheme to distribute users across shards. Plan for hot user rebalancing.

Not handling simultaneous upload of the same block by two clients

Symptom

Dedup assumes hash is unique, but two clients may upload the same block concurrently. Block store may get two PUT requests for the same key, leading to race condition.

Fix

Use a conditional put (e.g., 'put if not exists') in the block store. If the block already exists, the second upload is a no-op and the client should treat it as success.

INTERVIEW PREP · PRACTICE MODE

Interview Questions on This Topic

Q01SENIOR

How would you design a file synchronization service like Dropbox? Focus ...

Q02SENIOR

How would you handle a scenario where a user has 1 million files and syn...

Q03SENIOR

Explain how eventual consistency affects the user experience in Dropbox ...

Q04SENIOR

How would you handle a scenario where a user has 50,000 files in a singl...

Q01 of 04SENIOR

How would you design a file synchronization service like Dropbox? Focus on the sync algorithm and conflict resolution.

ANSWER

Start with client-server architecture. The client watches the local file system for changes and splits files into 4MB blocks, computing SHA-256 for each. Blocks are stored in a content-addressable store (e.g., S3). Metadata (file tree, block hashes) lives in a sharded MySQL with a changelog table for delta sync. Conflict detection uses version counters on file entries; when two clients upload different versions concurrently, the second gets a conflict and its file is saved with a 'conflicted copy' suffix. For offline edits, the first client to sync wins. Always keep a changelog of all file changes so clients can poll for incremental updates. Use long-polling for near-real-time notifications. Scale metadata via user-based sharding with sub-sharding for power users. The block store deduplicates via hash; verify integrity with CRC on reads.

FAQ · 5 QUESTIONS

Frequently Asked Questions

What is Design Dropbox in simple terms?

Why does Dropbox use 4MB blocks instead of smaller/larger sizes?

How does Dropbox handle sharing large folders among many users?

What happens if a client goes offline for months and then reconnects?

How does Dropbox ensure that two clients don't upload the same block concurrently and cause corruption?

Naren Founder & Principal Engineer

20+ years shipping large-scale distributed systems. Notes here come from systems that actually shipped.

✓ Verified

production tested

May 24, 2026

last updated

1,554

articles · all by Naren

🔥

That's Real World. Mark it forged?

10 min read · try the examples if you haven't

Dropbox Design — Hash Collisions That Corrupted User Files

What Dropbox Design Really Means — Content-Addressable Storage at Scale

Core Architecture: Client ↔ Server Sync Model

Block Store: Chunking, Deduplication & Delta Sync

Metadata Service: File Tree Storage & Synchronization

Conflict Resolution: When Two Clients Edit the Same File

Scaling to 700M+ Files per Day: The Infrastructure Behind Dropbox

Client Sync Engine: Local File Monitoring and Upload Pipeline

System Requirements: The Contract That Prevents Midnight Pager Duty

Capacity Estimation: Don't Guess, Calculate — Your S3 Bill Depends on It

High-Level Design: The Skeleton Your Team Will Fight Over in Architecture Reviews

The 4 MB Block That Crashed the Sync Engine

Key takeaways

Common mistakes to avoid

Memorising syntax before understanding the concept

Skipping practice and only reading theory

Assuming deduplication is always lossless

Using push notifications instead of polling for sync

Not planning for clients that disappear for months

Using a single metadata database without sharding

Not handling simultaneous upload of the same block by two clients

Interview Questions on This Topic

Frequently Asked Questions

That's Real World. Mark it forged?