Intermediate 8 min · March 06, 2026

Dynamic Arrays in C — Safe realloc Patterns for Production

Q: What is the difference between malloc and calloc when creating a dynamic array in C?

malloc allocates a block of the requested size and leaves its contents uninitialised — you get whatever bytes happen to be sitting in that heap region. calloc allocates the same block but zeroes every byte before returning. For a dynamic array of integers, use calloc if you need a guaranteed starting value of zero; use malloc if you're going to overwrite every element anyway (e.g., reading from a file), since the zero-initialisation step is wasted work.

Q: Can I use a Variable Length Array (VLA) in C instead of malloc for a runtime-sized array?

VLAs (int arr[n] where n is a variable) were added in C99 and let you put a runtime-sized array on the stack. They're gone from stack when the function returns, which is often not what you want. More importantly, they were made optional in C11 and are absent from many embedded toolchains, and a large VLA can silently overflow the stack with no error message. For production C code, malloc is the reliable, portable choice.

Q: How do I know when my dynamic array should shrink, and by how much?

The standard heuristic is: shrink when count drops below one quarter of capacity, and shrink to half of capacity. This creates hysteresis — the array must lose three quarters of its entries before shrinking, so a single remove-then-add cycle doesn't trigger a pointless reallocate-grow loop. Never shrink below a sensible minimum capacity (such as 4 or 8 elements) to avoid thrashing on very small arrays.

Q: What is memory fragmentation and how does it affect dynamic arrays?

Fragmentation occurs when allocated blocks are not adjacent, leaving unusable gaps in the heap. Repeated realloc of varying sizes creates holes that are too small for new allocations. This can cause out-of-memory errors even when total free memory is high. Mitigate by using a lower growth factor (1.5 instead of 2) or pre-allocating a large buffer upfront.

A realloc failure leaked 200K sensor readings and crashed a server.

Naren Founder & Principal Engineer

20+ years shipping performance-critical C and C++ systems. Lessons pulled from things that broke in production.

✓ Production

production tested

July 18, 2026

last updated

2,466

articles · all by Naren

Before you start⏱ 25 min

✓Solid grasp of fundamentals
✓Comfortable reading code examples
✓Basic production concepts

● Production Incident 🔎 Debug Guide ⚙ Triage Commands

⚡Quick Answer

Dynamic arrays allocate heap memory at runtime via malloc, grow with realloc, and release with free.
Capacity doubling gives amortized O(1) append — growing by fixed increments causes O(n²) copying.
Always realloc into a temporary pointer: direct assignment leaks memory if realloc fails.
Shrink when count falls below capacity/4, shrink to capacity/2 — prevents thrashing on remove-then-add cycles.
Memory fragmentation rises with many small reallocations; use power-of-two sizes to mitigate.

✦ Definition~90s read

What is Dynamic Arrays in C?

Dynamic arrays in C solve the fundamental tension between compile-time fixed sizes and runtime data unpredictability. Unlike stack-allocated arrays where you must know the maximum size at compile time — or waste memory by over-allocating — dynamic arrays grow and shrink on the heap using malloc, realloc, and free.

★

Imagine you're setting up chairs for a party but you don't know how many guests are coming.

They're the C equivalent of std::vector in C++ or ArrayList in Java, but without any language-level safety nets. You own every byte, every pointer, and every failure mode. Use them when you need to accumulate data incrementally (reading lines from a file, parsing network packets, building a list of results) and cannot bound the size in advance.

Don't use them for fixed-size, performance-critical hot paths where stack allocation avoids heap overhead entirely.

The core mechanism is the doubling strategy: when the array's capacity is exhausted, you realloc to a new block typically 1.5x or 2x the old size. This amortizes the O(n) copy cost to O(1) per insertion, giving you linear overall performance for appending n elements.

The growth factor matters deeply — 2x wastes more memory but reduces realloc frequency; 1.5x is more memory-efficient but triggers more copies. Production code often uses 1.5x or the golden ratio (~1.618) to balance fragmentation and throughput. Real-world implementations like stb_ds or GArray in GLib use exactly these patterns, and you'll find them in Redis, SQLite, and every serious C codebase that handles dynamic collections.

Beyond growth, production dynamic arrays must handle shrinking (to free memory when utilization drops), searching (typically linear or binary if sorted), and removal (shifting elements or marking as deleted). Memory fragmentation is the silent killer — frequent realloc calls can fragment the heap, especially with aggressive growth factors.

Tools like Valgrind, AddressSanitizer, and malloc_stats are essential for debugging leaks, buffer overruns, and double-frees. The trade-off is always between simplicity, performance, and memory overhead; a well-tuned dynamic array in C can match or beat higher-level languages for throughput, but requires meticulous attention to ownership, error handling, and realloc failure (which returns NULL and leaks the original block if you overwrite your pointer).

Plain-English First

Imagine you're setting up chairs for a party but you don't know how many guests are coming. A normal array is like pre-booking exactly 10 chairs — if 15 people show up, you're stuck. A dynamic array is like having a warehouse next door: you start with 10 chairs, and the moment you run out, you grab more from the warehouse and rearrange the room. That rearranging is exactly what realloc does behind the scenes — it finds you a bigger block of memory and moves everything over.

Static arrays are fine until your data outgrows them. Dynamic arrays in C solve the exact problem of unknown size at compile time—giving you a growable buffer without the overhead of linked lists. Without them, you’re either hardcoding limits, wasting memory, or rewriting half your code when requirements change.

What Dynamic Arrays in C Actually Solve

A dynamic array in C is a contiguous block of memory that grows at runtime via realloc. Unlike fixed-size arrays, it decouples capacity from logical size: you track a length (elements used) and a capacity (memory allocated). The core mechanic is geometric growth — typically doubling capacity on each resize — to amortize O(1) append cost. Without this, every push would be O(n) due to full reallocation.

In practice, a dynamic array is a struct: a pointer to heap memory, a size_t for length, and a size_t for capacity. The critical property is that realloc may move the block, invalidating all existing pointers into the array. This is the single most common source of bugs in production C code. The pattern is: never hold a pointer to an element across a realloc call; always re-fetch the base address after any operation that could resize.

Use dynamic arrays when you need cache-friendly, contiguous storage with unpredictable size — reading lines from a file, building a list of network connections, or accumulating sensor data. They outperform linked lists for iteration and random access (O(1) vs O(n)) and use less memory per element. In real systems, they are the backbone of string builders, arena allocators, and serialization buffers.

⚠ Pointer Invalidation Trap

After realloc, all pointers into the old buffer are dangling. Never cache a pointer to an element across a resize — always recompute from the new base address.

📊 Production Insight

A network server stores pointers into a dynamic array of connection structs. A single client triggers a resize, realloc moves the buffer, and all other pointers now point to freed memory — next read corrupts the heap.

Symptom: intermittent segfaults or silent data corruption on the next client request, only under load when the array grows.

Rule: never store pointers into a growable buffer; store indices (which remain valid after realloc) and dereference through the base pointer each time.

🎯 Key Takeaway

Geometric growth (2x) gives amortized O(1) append; linear growth gives O(n) append.

After any realloc, all pointers into the old buffer are dead — use indices or re-fetch.

Always check realloc return for NULL; a failure leaks the original block if you overwrote the pointer.

thecodeforge.io

Dynamic Arrays C

Why malloc? Understanding Heap Allocation vs Stack Arrays

When you write int temperatures[50]; inside a function, C reserves exactly 200 bytes on the call stack the moment that function is entered. The stack is fast, automatic, and cleaned up when the function returns. But it has two hard limits: the size must be a compile-time constant (in standard C), and the memory vanishes the moment the function exits.

Heap allocation with malloc flips both of those constraints. You pass it a byte count at runtime — a value you can compute from user input, a file, or a loop — and it returns a pointer to a fresh block of memory. That block lives until you explicitly call free on it, regardless of which function is currently executing. This makes heap memory the right tool whenever you don't know size upfront, or when data needs to outlive the function that created it.

The cost is responsibility. The stack cleans itself up. The heap does not. If you forget to call free, that memory is gone for the lifetime of the process — that's a memory leak. If you call free and then keep using the pointer — that's a use-after-free bug, one of the most dangerous bugs in systems programming. Understanding this trade-off is the entire foundation of working with dynamic arrays in C.

heap_vs_stack.cC

#include <stdio.h>
#include <stdlib.h>

// Demonstrates why stack arrays fail when size is runtime-determined
// and how malloc solves the problem cleanly.

int main(void) {
    int item_count;

    printf("How many temperature readings do you want to store? ");
    scanf("%d", &item_count);

    // STACK approach — ILLEGAL in standard C89 and risky even in C99:
    // int temperatures[item_count];  // <-- Variable Length Array, avoid in production

    // HEAP approach — safe, portable, works at any size:
    // malloc(n * sizeof(double)) asks the OS for exactly n doubles worth of bytes
    double *temperatures = malloc(item_count * sizeof(double));

    // malloc returns NULL if allocation fails (e.g., out of memory)
    // ALWAYS check — ignoring this causes a segfault on the next line
    if (temperatures == NULL) {
        fprintf(stderr, "Memory allocation failed. Cannot store %d readings.\n", item_count);
        return 1;  // exit with error code, not just 0
    }

    // Populate with dummy sensor readings
    for (int index = 0; index < item_count; index++) {
        temperatures[index] = 20.0 + (index * 0.5);  // simulate rising temps
    }

    // Use the data
    printf("\nStored %d readings. First: %.1f°C  Last: %.1f°C\n",
           item_count, temperatures[0], temperatures[item_count - 1]);

    // ALWAYS free heap memory when done — the OS won't do this for you
    free(temperatures);
    temperatures = NULL;  // null the pointer so it can't be accidentally used again

    return 0;
}

Output

How many temperature readings do you want to store? 5

Stored 5 readings. First: 20.0°C Last: 22.0°C

⚠ Watch Out: Never Skip the NULL Check

malloc returns NULL when the system is out of memory. On embedded systems or heavily loaded servers this actually happens. Skipping the NULL check and proceeding to use the pointer causes a segmentation fault that's hard to reproduce and harder to debug. One if-block after every malloc is non-negotiable.

📊 Production Insight

Stack overflow from a large VLA can crash your program silently with no stack trace.

Heap allocation via malloc gives you a clear NULL return on failure — that's better.

Rule: For runtime-sized arrays, always go heap. Never use VLAs in production code.

🎯 Key Takeaway

Heap allocation exists because stack arrays can't handle runtime sizes.

malloc gives you control — but with control comes the responsibility to free.

Null check after malloc is not optional; it's your first line of defense against crashes.

When to choose heap over stack for arrays

IfArray size known at compile time AND < 4KB

→

UseUse stack array — faster allocation, no free needed.

IfArray size determined at runtime or > 4KB

→

UseUse heap with malloc — portable, safe, avoids stack overflow.

IfArray must survive function return

→

UseHeap is the only choice — stack memory vanishes when the function exits.

Growing a Dynamic Array with realloc — The Doubling Strategy

Here's the real heart of dynamic arrays: what happens when you've filled your allocated space and a new item arrives? You have two options. You could allocate a completely new block, copy everything over, and free the old one — which is exactly what realloc does for you in one function call. The question is not whether to use realloc, but how much to grow by.

Growing by one slot each time you're full sounds sensible but is catastrophically slow. If you're inserting 10,000 items, you trigger 10,000 reallocations, each potentially copying the entire array. That's O(n²) work for what should be O(n) insertions. The standard solution is capacity doubling: when full, double the capacity. This ensures that the total copying work across all insertions stays proportional to n — amortized O(1) per insert. This is the exact strategy used by C++ std::vector, Java ArrayList, and Python lists.

The realloc call itself has an important gotcha: if it fails, it returns NULL — but the original pointer is still valid and still holds your data. That's why you must store the result in a temporary pointer first, check for NULL, and only then overwrite your original pointer. Failing to do this is one of the most common memory bugs in C.

dynamic_array_grow.cC

#include <stdio.h>
#include <stdlib.h>
#include <string.h>

// A reusable dynamic integer array with automatic growth.
// Models exactly how a C++ vector works internally.

typedef struct {
    int    *data;      // pointer to the heap block holding our integers
    int     count;     // how many items are currently stored
    int     capacity;  // how many items the current allocation can hold
} IntArray;

// Initialise the array with a small starting capacity
void array_init(IntArray *arr) {
    arr->capacity = 4;  // start small — we'll grow as needed
    arr->count    = 0;
    arr->data     = malloc(arr->capacity * sizeof(int));

    if (arr->data == NULL) {
        fprintf(stderr, "Failed to initialise dynamic array.\n");
        exit(1);
    }
}

// Append one integer, growing the backing array if necessary
void array_push(IntArray *arr, int value) {
    if (arr->count == arr->capacity) {
        // We're full — double the capacity
        int new_capacity = arr->capacity * 2;

        // Use a temporary pointer — if realloc fails we still have our old data
        int *resized = realloc(arr->data, new_capacity * sizeof(int));

        if (resized == NULL) {
            // realloc failed; arr->data is still valid, we just can't grow right now
            fprintf(stderr, "Could not grow array to capacity %d. Aborting push.\n", new_capacity);
            return;
        }

        // Safe to update now that we confirmed success
        arr->data     = resized;
        arr->capacity = new_capacity;

        printf("  [resize] Grew to capacity %d\n", new_capacity);
    }

    // Store the value and advance the count
    arr->data[arr->count] = value;
    arr->count++;
}

// Print every element with its index
void array_print(const IntArray *arr) {
    printf("Array (count=%d, capacity=%d): [", arr->count, arr->capacity);
    for (int i = 0; i < arr->count; i++) {
        printf("%d", arr->data[i]);
        if (i < arr->count - 1) printf(", ");
    }
    printf("]\n");
}

// Release heap memory — always call this when done
void array_free(IntArray *arr) {
    free(arr->data);
    arr->data     = NULL;  // prevent use-after-free
    arr->count    = 0;
    arr->capacity = 0;
}

int main(void) {
    IntArray scores;
    array_init(&scores);

    // Push 10 scores — watch the array resize automatically
    int game_scores[] = {42, 87, 15, 93, 56, 71, 34, 88, 62, 99};
    int total_games   = sizeof(game_scores) / sizeof(game_scores[0]);

    printf("Inserting %d scores into the dynamic array:\n", total_games);
    for (int i = 0; i < total_games; i++) {
        array_push(&scores, game_scores[i]);
    }

    printf("\nFinal state:\n");
    array_print(&scores);

    array_free(&scores);
    return 0;
}

Output

Inserting 10 scores into the dynamic array:

[resize] Grew to capacity 8

[resize] Grew to capacity 16

Final state:

Array (count=10, capacity=16): [42, 87, 15, 93, 56, 71, 34, 88, 62, 99]

💡Pro Tip: The Temporary Pointer is Not Optional

Writing arr->data = realloc(arr->data, new_size) is a memory leak waiting to happen. If realloc returns NULL, you've just overwritten your only pointer to the old block — that memory is now unreachable and unfreeable. Always realloc into a temporary, check it, then assign.

📊 Production Insight

Doubling works brilliantly for most cases, but on memory-constrained systems it can cause sudden large allocations.

If you're running on an embedded device, consider a growth factor of 1.5 or use a pool allocator.

Rule: The temporary pointer pattern isn't just good practice — it's your safety net against silent data loss.

🎯 Key Takeaway

Doubling capacity gives amortized O(1) append — the same strategy used by std::vector and ArrayList.

Growing by a fixed number of slots makes each insert O(n) in the worst case.

The temporary pointer pattern for realloc is the single most important defense against memory leaks in C.

Choose your growth factor

IfMaximum throughput is the goal, memory is abundant

→

UseUse doubling (growth factor 2). Simple and amortized O(1).

IfLong-running server with many containers, memory fragmentation a concern

→

UseUse a lower growth factor (1.5 or golden ratio ~1.618) to reduce fragmentation.

IfReal-time system where allocation latency must be bounded

→

UsePre-allocate capacity upfront based on worst-case estimate — avoid realloc in hot paths.

thecodeforge.io

Dynamic Arrays C

Shrinking, Searching and Removing — Real-World Array Operations

Growing an array grabs the headlines, but real programs also need to remove items and reclaim wasted space. If a user deletes half their entries, keeping a capacity of 10,000 slots for 50 items wastes significant RAM — especially on embedded hardware.

Shrinking follows the same pattern as growing but in reverse: when count drops below a threshold (a common choice is one quarter of capacity), realloc down to half of capacity. This keeps wasted space bounded without thrashing — if you shrank every single time you removed one element, you'd just end up reallocating immediately on the next insert.

Removing an element from the middle requires shifting every element after it one position to the left to fill the gap. This is O(n) in the worst case. If your workload involves many random removals, a linked list might be a better structure — but if removals are rare or always happen at the end, a dynamic array beats a linked list on cache performance because its elements are contiguous in memory. CPUs love contiguous data.

dynamic_array_remove.cC

100

101

102

#include <stdio.h>
#include <stdlib.h>
#include <string.h>

typedef struct {
    int *data;
    int  count;
    int  capacity;
} IntArray;

void array_init(IntArray *arr) {
    arr->capacity = 8;
    arr->count    = 0;
    arr->data     = malloc(arr->capacity * sizeof(int));
    if (!arr->data) { fprintf(stderr, "Init failed\n"); exit(1); }
}

void array_push(IntArray *arr, int value) {
    if (arr->count == arr->capacity) {
        int  new_cap  = arr->capacity * 2;
        int *resized  = realloc(arr->data, new_cap * sizeof(int));
        if (!resized) { fprintf(stderr, "Grow failed\n"); return; }
        arr->data     = resized;
        arr->capacity = new_cap;
    }
    arr->data[arr->count++] = value;
}

// Remove the element at position `index`
// Shifts everything after it left by one slot
int array_remove(IntArray *arr, int index) {
    if (index < 0 || index >= arr->count) {
        fprintf(stderr, "Remove index %d out of bounds (count=%d)\n", index, arr->count);
        return -1;  // signal failure
    }

    int removed_value = arr->data[index];

    // Shift elements left — memmove handles overlapping regions safely
    memmove(
        &arr->data[index],          // destination: where the gap is
        &arr->data[index + 1],      // source: one past the gap
        (arr->count - index - 1) * sizeof(int)  // bytes to move
    );

    arr->count--;

    // Shrink if we're only using a quarter of our capacity
    // Floor at a minimum capacity of 4 to avoid tiny allocations
    if (arr->count > 0 && arr->count <= arr->capacity / 4 && arr->capacity > 4) {
        int  new_cap = arr->capacity / 2;
        int *shrunk  = realloc(arr->data, new_cap * sizeof(int));
        if (shrunk) {  // shrink is optional — if it fails, just keep the bigger block
            arr->data     = shrunk;
            arr->capacity = new_cap;
            printf("  [resize] Shrunk to capacity %d\n", new_cap);
        }
    }

    return removed_value;
}

void array_print(const IntArray *arr) {
    printf("[count=%d cap=%d]: ", arr->count, arr->capacity);
    for (int i = 0; i < arr->count; i++) printf("%d ", arr->data[i]);
    printf("\n");
}

void array_free(IntArray *arr) {
    free(arr->data);
    arr->data = NULL;
    arr->count = arr->capacity = 0;
}

int main(void) {
    IntArray task_ids;
    array_init(&task_ids);

    // Simulate a task queue with 8 tasks
    for (int id = 101; id <= 108; id++) {
        array_push(&task_ids, id);
    }

    printf("Initial queue:\n");
    array_print(&task_ids);

    // Remove task 103 (at index 2)
    int removed = array_remove(&task_ids, 2);
    printf("\nRemoved task ID %d\n", removed);
    array_print(&task_ids);

    // Remove several more to trigger shrink
    array_remove(&task_ids, 0);
    array_remove(&task_ids, 0);
    array_remove(&task_ids, 0);
    array_remove(&task_ids, 0);
    printf("\nAfter 4 more removals:\n");
    array_print(&task_ids);

    array_free(&task_ids);
    return 0;
}

Output

Initial queue:

[count=8 cap=8]: 101 102 103 104 105 106 107 108

Removed task ID 103

[count=7 cap=8]: 101 102 104 105 106 107 108

After 4 more removals:

[resize] Shrunk to capacity 4

[count=3 cap=4]: 105 106 107

🔥Why memmove and Not memcpy?

memcpy has undefined behaviour when source and destination regions overlap. When you shift elements left inside the same array, the regions always overlap. memmove is guaranteed safe with overlapping regions — it handles the copy direction internally. Swap the two and you'll get subtle data corruption on some compilers and CPU architectures.

📊 Production Insight

Shrinking on every remove causes disastrous reallocation thrashing — always use a threshold.

The hysteresis pattern (shrink only when count < capacity/4) is battle-tested across std::vector, Java ArrayList, and Python list.

Rule: For removal-heavy workloads at arbitrary positions, consider a linked list or a rope data structure instead of a dynamic array.

🎯 Key Takeaway

Shrink with hysteresis (quarter threshold) to avoid thrashing, and use memmove, not memcpy, for overlapping shifts.

Removal from the middle costs O(n) — know your workload before choosing a data structure.

The hysteresis rule: shrink to half when count drops below a quarter — this bounds wasted space without penalizing transient removals.

Dynamic array vs linked list for removals

IfRemovals are always from the end (stack-like behavior)

→

UseDynamic array is best — O(1) removal, excellent cache locality.

IfRemovals are rare and scattered

→

UseDynamic array still wins — the O(n) shift cost is acceptable, and you keep O(1) random access.

IfFrequent random removals in the middle

→

UseUse a linked list or a balanced tree — O(n) shift per removal becomes too expensive.

Memory Fragmentation and Choosing the Right Growth Factor

You've mastered the doubling strategy, but in long-running production systems, doubling can silently create a new problem: memory fragmentation. Each time realloc runs, the operating system may place the new block at a different address, leaving a free hole behind. Over time, these holes fill the heap with unusable gaps — a condition known as external fragmentation. Your process might technically have enough free bytes, but no contiguous block large enough to satisfy the next allocation.

Doubling from a small initial size (say 4) to huge numbers (512, 1024, 2048) exacerbates fragmentation because each growing block is a different size, making it hard for the allocator to reuse free holes. The fix is to use a lower growth factor — 1.5 (or the golden ratio 1.618) is common — which generates more repeatable block sizes and gives the allocator a better chance at reusing freed memory. Many high-performance memory allocators (jemalloc, tcmalloc) already use size classes that align with such factors.

Another approach is to pre-allocate a large enough buffer upfront. If you can bound the maximum size of your dynamic array, allocate that full capacity at init time and avoid realloc entirely. This eliminates fragmentation at the cost of raw memory usage — a trade-off worth considering for real-time systems or embedded devices.

growth_factor_comparison.cC

#include <stdio.h>
#include <stdlib.h>

// Compare memory behaviour of doubling vs golden ratio growth
// Run with: gcc -o growth growth_factor_comparison.c && ./growth

int main(void) {
    int capacity_double = 4;
    int capacity_golden = 4;
    int count = 100;

    printf("Capacity progression for %d insertions:\n", count);
    printf("Insert  Doubling  Golden(1.618)\n");

    for (int i = 0; i < count; i++) {
        printf("%6d  %8d  %8d\n", i+1, capacity_double, capacity_golden);

        if (i == capacity_double) capacity_double *= 2;
        if (i == capacity_golden) capacity_golden = (int)(capacity_golden * 1.618) + 1;
    }

    printf("\nFinal capacity using doubling: %d\n", capacity_double);
    printf("Final capacity using golden:    %d\n", capacity_golden);
    return 0;
}

Output

Insert Doubling Golden(1.618)

1 4 4

2 4 4

3 4 4

4 4 4

5 8 7

6 8 7

7 8 12

8 8 12

9 16 12

...

Final capacity using doubling: 128

Final capacity using golden: 118

💡Pro Tip: Define the growth factor as a macro for easy tuning

#define GROWTH_FACTOR 1.618 // golden ratio // In your push function: if (arr->count == arr->capacity) { int new_cap = (int)(arr->capacity * GROWTH_FACTOR) + 1; // ...realloc... } This lets you experiment with different factors without changing logic.

📊 Production Insight

Fragmentation from repeated realloc can cause OOM errors even when 'enough' memory is free.

Tools like valgrind --tool=massif can show you actual heap usage vs allocated chunks.

Rule: For long-lived containers in server processes, use a growth factor of 1.5–1.618 instead of 2 to keep block sizes more uniform and reduce fragmentation.

🎯 Key Takeaway

Doubling is fast but fragments the heap over time in long-running systems.

A growth factor of 1.5 significantly reduces fragmentation at a small cost in total allocations.

If you know the maximum size, pre-allocate — no realloc means no fragmentation.

Growth factor selection guide

IfArray lifetime is short (seconds/minutes) or total size is small

→

UseUse doubling (factor 2). Simple, fast, fragmentation is negligible.

IfArray lives for hours/days in a server process with multiple allocations

→

UseUse a factor of 1.5 or golden ratio to reduce fragmentation.

IfMaximum size is known and bounded

→

UsePre-allocate the full capacity upfront. Eliminates realloc entirely.

IfReal-time constraints with deterministic latency requirements

→

UsePre-allocate or use a fixed-size ring buffer — avoid realloc in the hot path.

Debugging Dynamic Arrays – Tools and Techniques

Even with correct code, dynamic arrays can hide bugs that only surface after hours of production runtime. Buffer overflows, use-after-free, and off-by-one errors are the most common. The good news: modern tools catch them before they reach production if you run them in your test suite.

AddressSanitizer (ASan) is the fastest way to detect buffer overflows, use-after-free, and out-of-bounds accesses. Compile your code with -fsanitize=address and you get instant, detailed error reports on every violation. It's memory-efficient and integrates with valgrind for a second layer. Valgrind's memcheck tool is slower but catches some things ASan can't, like uninitialized memory reads.

Static analysis (like clang-tidy or PVS-Studio) can catch common patterns like direct realloc overwrite before runtime. But they can't catch everything. A good strategy: run ASan in unit tests, valgrind in integration tests, and use a memory profiler (valgrind massif) in long-running stress tests to detect fragmentation.

debug_demo.cC

#include <stdio.h>
#include <stdlib.h>

// Intentionally create a buffer overflow to show how AddressSanitizer catches it
// Compile: gcc -g -fsanitize=address -o debug_demo debug_demo.c && ./debug_demo

int main(void) {
    int *arr = malloc(5 * sizeof(int));  // capacity for 5 ints
    if (!arr) return 1;

    for (int i = 0; i <= 5; i++) {  // Off-by-one: writes to arr[5] which is out of bounds
        arr[i] = i * 10;
    }

    printf("arr[5] = %d\n", arr[5]);  // This line will never be reached with ASan
    free(arr);
    return 0;
}

Output

==12345==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x602000000014

WRITE of size 4 at 0x602000000014 thread T0

#0 0x4007a1 in main debug_demo.c:10

0x602000000014 is located 4 bytes after 20-byte region [0x602000000000,0x602000000014)

allocated by:

#0 0x4005b0 in malloc (/debug_demo+0x4005b0)

#1 0x400784 in main debug_demo.c:7

🔥ASan vs Valgrind: When to Use Which

AddressSanitizer is fast (<2x slowdown) and catches overflows/use-after-free instantly. Valgrind (memcheck) is slower (10-20x) but catches uninitialized reads and has a more complete memory model. Use ASan in unit tests, Valgrind in integration and stress tests.

📊 Production Insight

A production bug: off-by-one in array indexing caused silent heap corruption that only manifested as a crash in a completely unrelated part of the system 30 minutes later.

ASan would have caught it on the first test run.

Rule: Add ASan to your CI pipeline. It catches bugs at the moment of corruption, not when the symptoms finally appear.

🎯 Key Takeaway

AddressSanitizer catches heap bugs instantly — compile with -fsanitize=address in development.

Valgrind is slower but provides more detailed memory analysis for leaks and uninitialized reads.

Never rely on manual inspection: automated tools are the only way to catch subtle memory bugs in C.

Which debugging tool for which symptom

IfSuspected buffer overflow or use-after-free

→

UseRun with AddressSanitizer (compile with -fsanitize=address).

IfSuspected memory leak or uninitialised read

→

UseRun with Valgrind: valgrind --tool=memcheck ./program

IfNeed to visualise heap fragmentation over time

→

UseUse Valgrind's massif tool: valgrind --tool=massif ./program

IfWant to catch bugs before CI (during development)

→

UseUse clang-tidy or cppcheck static analysis in your editor.

malloc vs calloc: When Zero-Init Costs You Performance

You've seen malloc in every tutorial. Here's when not to use it.

malloc grabs a slab of heap and returns a pointer. The bytes are uninitialised — stale data from whatever freed that block last. If you're building a dynamic array that will be immediately overwritten by hot data (say, reading frames from a socket ring buffer), malloc wins. No pointless zero-fill cycles.

calloc does two things: allocates and zero-initialises every byte. Intuition says "free stuff." Reality says calloc can be slower because memset must touch every page. On a 100-million-element uint32_t array, that's 400 MB of writes before your first assignment. The trade-off: deterministic startup state. No garbage pointers, no uninitialised reads that corrupt production silently.

Senior rule: Use calloc when your structure has pointers that must start NULL. Use malloc when you're filling the buffer immediately with known data. Never use either without checking the return value — NULL means the OS said no.

AllocShowdown.cCPP

// io.thecodeforge — c-cpp tutorial

#include <stdlib.h>
#include <stdio.h>
#include <time.h>

int main(void) {
    const size_t count = 100000000;  // 100 million ints
    clock_t start, end;

    // malloc — no init
    start = clock();
    int* raw = (int*)malloc(count * sizeof(int));
    end = clock();
    printf("malloc:          %ld ticks\n", end - start);

    // calloc — zero-init
    start = clock();
    int* zeroed = (int*)calloc(count, sizeof(int));
    end = clock();
    printf("calloc:          %ld ticks\n", end - start);

    free(raw);
    free(zeroed);
    return 0;
}

Output

malloc: 12 ticks

calloc: 189 ticks

⚠ Production Trap:

calloc does not guarantee your memory is 'secure.' On first touch, the OS may zero-fill lazily via copy-on-write. If you need cryptographic zeroing, use explicit_bzero or memset_s after freeing.

🎯 Key Takeaway

malloc for speed, calloc for safety — but never default to one without measuring.

Flexible Array Members: The Zero-Overhead Struct Trick

Standard dynamic arrays need two allocations: one for the struct metadata, one for the data. Flexible array members (FAMs) collapse that into one. C99 introduced this, and most production codebases still ignore it.

The trick: declare a struct with fields, then a trailing array with no size. When allocating, malloc the struct size plus the array size. The array lives immediately after the fixed fields — one contiguous block, one free call.

Why this matters for real systems: Data locality. The metadata (length, capacity) and the data sit next to each other in cache. No pointer chasing through an indirection layer. Every access to arr->data[i] is a simple base-plus-offset, not two pointer dereferences.

Pitfall: The array must be the last member. You cannot have a FAM and then another field. Also, sizeof the struct returns the size ignoring the FAM — you track array length yourself.

Use FAMs for packet buffers, serialised messages, or any hot-path dynamic array where allocations dominate runtime.

FlexibleArray.cCPP

// io.thecodeforge — c-cpp tutorial

#include <stdlib.h>
#include <stdio.h>
#include <string.h>

typedef struct {
    size_t length;
    unsigned char data[];   // flexible array member — no size
} Buffer;

Buffer* buffer_create(size_t payload_size) {
    Buffer* buf = (Buffer*)malloc(sizeof(Buffer) + payload_size);
    if (!buf) return NULL;
    buf->length = payload_size;
    return buf;
}

int main(void) {
    Buffer* msg = buffer_create(256);
    memcpy(msg->data, "Hello from single-heap world\n", 28);
    printf("%s", msg->data);
    printf("Struct size: %zu bytes\n", sizeof(Buffer));
    free(msg);  // one free, no leak
    return 0;
}

Output

Hello from single-heap world

Struct size: 8 bytes

💡Senior Shortcut:

When a struct with FAM is in a union, the FAM is not allowed (C11). Workaround: keep the FAM in its own struct and embed that struct in the union.

🎯 Key Takeaway

Flexible array members halve your allocation calls and keep hot data in one cache line.

Prerequisites: What You Must Understand Before Touching Dynamic Arrays

Dynamic arrays in C are not a beginner topic. You need a solid grasp of pointers, because every operation — resize, access, element removal — works through indirection. Understand pointer arithmetic and the difference between p[i] and (p + i). You must be comfortable with manual memory management: malloc, realloc, and free are your tools, and forgetting one free means a leak that accumulates silently. Heap allocation is slower than stack allocation; each malloc call involves an OS syscall or a bump into a free list. If you are writing real-time or embedded code, dynamic allocation is often banned outright. Finally, know your data types: sizeof is evaluated at compile time, and using the wrong type in realloc(sizeof(T) n) corrupts the heap. Without these foundations, dynamic arrays will crash your program in ways stack arrays never could.

Prerequisites.cppCPP

// io.thecodeforge — c-cpp tutorial
// 25 lines max
#include <stdlib.h>
#include <stdio.h>

int main() {
    // You must know: sizeof is evaluated at compile time
    size_t count = 10;
    int *arr = (int*)malloc(count * sizeof(int));
    if (!arr) return 1;

    arr[0] = 42;  // pointer arithmetic: *(arr + 0)
    printf("%d\n", arr[0]);

    free(arr);     // forget this = memory leak
    return 0;
}

Output

⚠ Production Trap:

Do not hide malloc behind a macro. Production C relies on explicit allocation — hiding it makes static analysis miss leaks.

🎯 Key Takeaway

Master pointers and sizeof before writing a single dynamic array.

Improvements — Struct Implementation: Encapsulating the Mess

Bare dynamic array code scatters size, capacity, and data across the scope. The struct implementation fixes this: bundle a pointer to the heap block, the logical count of elements, and the allocated capacity into one object. Every operation — da_append, da_remove, da_free — takes a pointer to this struct. This eliminates global variables and reduces function signatures from three parameters to one. The struct is small (typically three words on 64-bit), and passing it by pointer is cheap. A hidden improvement: you can now add a growth factor and a shrink threshold as struct fields, making the strategy configurable per array. Never expose the raw int*; expose typed access via da_get(da, i) that returns a pointer — then the caller can dereference or assign without knowing the internal layout. This is the minimal viable C abstraction: no vtables, no inheritance, just data + functions.

StructImpl.cppCPP

// io.thecodeforge — c-cpp tutorial
// 25 lines max
#include <stdlib.h>

typedef struct {
    int *data;
    size_t size;
    size_t capacity;
} DynArray;

void da_append(DynArray *da, int value) {
    if (da->size >= da->capacity) {
        da->capacity = da->capacity ? da->capacity * 2 : 4;
        da->data = (int*)realloc(da->data, da->capacity * sizeof(int));
    }
    da->data[da->size++] = value;
}

void da_free(DynArray *da) {
    free(da->data);
    da->data = NULL;
    da->size = da->capacity = 0;
}

⚠ Production Trap:

realloc can return NULL. Always assign to a temp pointer first, or you leak the original block on failure.

🎯 Key Takeaway

A struct bundles state — pass pointers, return void. This prevents scatter and leak bugs.

Growing Strategy: Doubling vs Fibonacci vs Exponential

Choosing the right growth factor for dynamic arrays is critical for performance. The classic doubling strategy (factor of 2) offers amortized O(1) insertion but can waste memory. Fibonacci growth (factor ~1.618) reduces memory overhead while maintaining good amortized performance. Exponential growth with a factor between 1.5 and 2 balances speed and memory. For example, a factor of 1.5 reduces wasted memory compared to doubling but still provides amortized constant time. The choice depends on your application: doubling for simplicity, Fibonacci for memory-constrained systems, and 1.5 for a middle ground. Below is a C implementation showing different strategies.

growth_strategies.cC

#include <stdlib.h>
#include <stdio.h>

#define GROWTH_DOUBLING 2.0
#define GROWTH_FIBONACCI 1.618
#define GROWTH_EXPONENTIAL 1.5

size_t grow_size(size_t current, double factor) {
    size_t new = (size_t)(current * factor);
    return (new > current) ? new : current + 1;
}

int main() {
    size_t cap = 1;
    for (int i = 0; i < 10; i++) {
        cap = grow_size(cap, GROWTH_DOUBLING);
        printf("Doubling: %zu\n", cap);
    }
    cap = 1;
    for (int i = 0; i < 10; i++) {
        cap = grow_size(cap, GROWTH_FIBONACCI);
        printf("Fibonacci: %zu\n", cap);
    }
    cap = 1;
    for (int i = 0; i < 10; i++) {
        cap = grow_size(cap, GROWTH_EXPONENTIAL);
        printf("Exponential: %zu\n", cap);
    }
    return 0;
}

💡Choosing a Growth Factor

📊 Production Insight

In production, consider using a growth factor of 1.5 to reduce memory fragmentation and improve cache performance, especially for large arrays.

🎯 Key Takeaway

The growth factor significantly impacts memory usage and performance; doubling is simple, Fibonacci is memory-efficient, and 1.5 offers a practical compromise.

Flexible Array Members in C99+

Flexible array members (FAM) allow a struct to have a variable-length array at its end without allocating separate memory. Declared as type array[]; (no size), they enable zero-overhead dynamic arrays. The struct size excludes the array, so you allocate extra bytes for the array. This is useful for encapsulating a dynamic array with metadata. Example: a struct with length, capacity, and a flexible array. However, FAMs have limitations: they must be the last member, and you cannot have multiple flexible arrays. They are ideal for fixed-size headers with variable data. Below is a C99 example.

flexible_array.cC

#include <stdlib.h>
#include <string.h>
#include <stdio.h>

typedef struct {
    size_t len;
    size_t cap;
    int data[];  // flexible array member
} DynArray;

DynArray* da_create(size_t initial_cap) {
    DynArray* da = malloc(sizeof(DynArray) + initial_cap * sizeof(int));
    if (!da) return NULL;
    da->len = 0;
    da->cap = initial_cap;
    return da;
}

int da_push(DynArray** da, int value) {
    if ((*da)->len == (*da)->cap) {
        size_t new_cap = (*da)->cap ? (*da)->cap * 2 : 1;
        DynArray* new_da = realloc(*da, sizeof(DynArray) + new_cap * sizeof(int));
        if (!new_da) return -1;
        new_da->cap = new_cap;
        *da = new_da;
    }
    (*da)->data[(*da)->len++] = value;
    return 0;
}

int main() {
    DynArray* da = da_create(4);
    for (int i = 0; i < 10; i++) da_push(&da, i);
    for (size_t i = 0; i < da->len; i++) printf("%d ", da->data[i]);
    free(da);
    return 0;
}

🔥FAM Requires Careful Realloc

📊 Production Insight

Use FAMs for small, frequently accessed dynamic arrays to minimize memory fragmentation, but avoid them if you need to resize without moving the struct (e.g., in concurrent code).

🎯 Key Takeaway

Flexible array members provide a compact way to embed a dynamic array within a struct, reducing allocation overhead and improving cache locality.

Arena Allocator for Dynamic Arrays

Arena allocators (or region-based allocators) manage memory in large blocks, reducing fragmentation and allocation overhead. For dynamic arrays, an arena can allocate contiguous memory for multiple arrays, and deallocation is a single operation. This is useful for batch processing or game engines. Implementation: create an arena with a fixed-size buffer, and allocate from it using an offset. Dynamic arrays can be grown by requesting more memory from the arena. However, arena allocators don't support individual frees; you reset the entire arena. Below is a simple arena allocator in C.

arena_allocator.cC

#include <stdlib.h>
#include <stddef.h>
#include <stdio.h>

typedef struct {
    char *buf;
    size_t size;
    size_t offset;
} Arena;

Arena* arena_create(size_t size) {
    Arena *a = malloc(sizeof(Arena));
    if (!a) return NULL;
    a->buf = malloc(size);
    if (!a->buf) { free(a); return NULL; }
    a->size = size;
    a->offset = 0;
    return a;
}

void* arena_alloc(Arena *a, size_t nbytes) {
    if (a->offset + nbytes > a->size) return NULL;
    void *ptr = a->buf + a->offset;
    a->offset += nbytes;
    return ptr;
}

void arena_reset(Arena *a) {
    a->offset = 0;
}

void arena_destroy(Arena *a) {
    free(a->buf);
    free(a);
}

int main() {
    Arena *a = arena_create(1024);
    int *arr = arena_alloc(a, 10 * sizeof(int));
    for (int i = 0; i < 10; i++) arr[i] = i;
    arena_reset(a);  // all allocations invalidated
    arena_destroy(a);
    return 0;
}

⚠ Arena Limitations

📊 Production Insight

Use arena allocators in performance-critical code like game loops or request handlers where memory churn is high, but ensure you can reset the arena between frames or requests.

🎯 Key Takeaway

Arena allocators simplify memory management for dynamic arrays by batching allocations and deallocations, improving performance in scenarios with predictable lifetimes.

● Production incidentPOST-MORTEMseverity: high

The Lost Sensor Readings – realloc Failure That Silently Corrupted a Server

Symptom

After running for 12 hours, the telemetry server started returning empty results for recent sensor data. Memory usage had dropped but the process eventually crashed with a segfault. Logs showed no errors.

Assumption

The engineer assumed realloc always succeeds because 'servers have enough memory'. They used the direct assignment pattern: arr = realloc(arr, new_size);

Root cause

During a spike in sensor readings, realloc tried to allocate a very large block (500K ints). The system was near memory pressure, so realloc returned NULL. The original pointer was overwritten in arr, making the ~200K already-collected readings unreachable and unfreeable. Subsequent attempts to write to the NULL pointer caused the segfault.

Fix

Changed all realloc calls to use a temporary pointer: int tmp = realloc(arr->data, new_sz); if (!tmp) { / handle error */ } arr->data = tmp;. And added a growth cap to avoid requesting excessively large blocks in one go.

Key lesson

Never overwrite the original pointer with the result of realloc without checking for NULL first.
Memory allocation can fail even on servers — always handle the failure gracefully.
For production systems, cap growth to a reasonable maximum to avoid sudden huge allocations.

Production debug guideSymptom → Action guide for the three most common dynamic array bugs in production3 entries

Symptom · 01

Segfault when accessing array elements after a realloc

→

Fix

Check the realloc pattern: are you using a temporary pointer? If not, a failed realloc overwrites your pointer with NULL. Add error handling for realloc failure.

Symptom · 02

Memory usage grows without bound, system runs out of memory

→

Fix

Check for missing free calls. Run valgrind --tool=memcheck ./program to find leaked blocks. Also verify that every malloc/calloc is matched with a free on all code paths.

Symptom · 03

Random data corruption after many push/pop operations

→

Fix

Check for off-by-one errors in indexing. Are you writing beyond the allocated capacity? Use AddressSanitizer (compile with -fsanitize=address) to catch buffer overflows precisely.

★ Dynamic Array Debugging Cheat SheetQuick commands and fixes for the most frequent production issues with dynamic arrays in C.

Memory leak (allocated memory never freed)−

Immediate action

Identify leaked allocations with Valgrind

Commands

valgrind --leak-check=full --show-leak-kinds=all ./myprogram 2>&1 | grep 'definitely lost'

valgrind --tool=memcheck --track-origins=yes ./myprogram

Fix now

Add a free() call for every malloc/realloc return. Use -fsanitize=address which logs unfreed allocations at exit.

Heap buffer overflow (write past allocated limits)+

Use-after-free (touching memory after it's freed)+

Invalid free (freeing non-heap memory or double free)+

Static Arrays vs Dynamic Arrays in C

Feature / Aspect	Static Array (stack)	Dynamic Array (heap)
Size known at compile time?	Required — must be a constant	Not needed — set at runtime
Memory location	Stack — automatic cleanup	Heap — manual free required
Resize after creation	Impossible	Yes, via realloc
Access speed (indexing)	O(1) — identical	O(1) — identical
Insert at middle	Impossible after declaration	O(n) — requires shifting
Append to end (amortised)	N/A — fixed size	O(1) — with doubling strategy
Risk of memory leak	None — stack auto-cleans	Yes — must call free
Risk of stack overflow	Yes — large arrays overflow stack	No — heap is much larger
Cache friendliness	Excellent — contiguous	Excellent — contiguous
Lifetime	Until function returns	Until free is called
Fragmentation risk	None	Yes — from repeated realloc of different sizes
Tools for debugging	None needed (stack cleans itself)	Valgrind, ASan, massif required for robustness

⚙ Quick Reference

12 commands from this guide

File	Command / Code	Purpose
heap_vs_stack.c	int main(void) {	Why malloc? Understanding Heap Allocation vs Stack Arrays
dynamic_array_grow.c	typedef struct {	Growing a Dynamic Array with realloc
dynamic_array_remove.c	typedef struct {	Shrinking, Searching and Removing
growth_factor_comparison.c	int main(void) {	Memory Fragmentation and Choosing the Right Growth Factor
debug_demo.c	int main(void) {	Debugging Dynamic Arrays – Tools and Techniques
AllocShowdown.c	int main(void) {	malloc vs calloc
FlexibleArray.c	typedef struct {	Flexible Array Members
Prerequisites.cpp	int main() {	Prerequisites
StructImpl.cpp	typedef struct {	Improvements
growth_strategies.c	size_t grow_size(size_t current, double factor) {	Growing Strategy
flexible_array.c	typedef struct {	Flexible Array Members in C99+
arena_allocator.c	typedef struct {	Arena Allocator for Dynamic Arrays

Key takeaways

malloc allocates on the heap at runtime

this is the only way to create arrays whose size depends on user input, file contents, or any value you don't know at compile time.

Always double capacity on growth, not increment by one

this keeps amortised insertion O(1) and is the strategy used by every major language's built-in list type.

realloc must go into a temporary pointer first

overwriting your original pointer directly causes an unrecoverable memory leak if the allocation fails.

After free, set the pointer to NULL immediately

stale non-NULL pointers that point to freed memory are one of the hardest bug classes to diagnose in C.

Use a shrink threshold (count < capacity/4) to avoid reallocation thrashing when items are removed and re-added.

For long-running servers, consider a growth factor lower than 2 (e.g., 1.5) to reduce heap fragmentation.

Compile with -fsanitize=address during development; it catches buffer overflows and use-after-free that Valgrind might miss in optimized builds.

INTERVIEW PREP · PRACTICE MODE

Interview Questions on This Topic

Q01SENIOR

Why does the doubling strategy for dynamic array growth give amortised O...

Q02SENIOR

What is the correct way to use realloc so you don't leak memory if it fa...

Q03SENIOR

A colleague says 'dynamic arrays and linked lists both resize at runtime...

Q04SENIOR

Explain how to implement a dynamic array that supports generic element t...

Q05JUNIOR

What is the difference between malloc and calloc when creating a dynamic...

Q01 of 05SENIOR

Why does the doubling strategy for dynamic array growth give amortised O(1) insertion, and what would happen to the time complexity if you grew by a fixed number of slots (e.g., always add 10) instead?

ANSWER

Doubling ensures that the total number of element copies across all insertions is O(n) — each element is copied at most log2(n) times. With a fixed increment of 10, each insertion triggers a linear copy of all current elements, leading to O(n²) total work for n insertions. The amortized cost drops from O(1) per insertion to O(n). This is why all major languages use a multiplicative growth factor.

FAQ · 4 QUESTIONS

Frequently Asked Questions

What is the difference between malloc and calloc when creating a dynamic array in C?

Can I use a Variable Length Array (VLA) in C instead of malloc for a runtime-sized array?

How do I know when my dynamic array should shrink, and by how much?

What is memory fragmentation and how does it affect dynamic arrays?

Naren Founder & Principal Engineer

20+ years shipping performance-critical C and C++ systems. Lessons pulled from things that broke in production.

✓ Verified

production tested

July 18, 2026

last updated

2,466

articles · all by Naren

🔥

That's C Basics. Mark it forged?

8 min read · try the examples if you haven't