Python itertools Module: Lazy Iterators, Combinatorics and Performance Secrets
Python lists are great until they're not. The moment your data grows to millions of rows — log files, sensor streams, combinatoric search spaces — loading everything into memory stops being clever and starts being catastrophic. itertools is the standard library's answer to that problem: a collection of memory-efficient, blazingly fast iterator-building blocks that the CPython team wrote in C. Serious Python engineers reach for it constantly, yet most tutorials only scratch its surface.
The real problem itertools solves is the hidden cost of materialisation. Every time you write [item for item in range(10_000_000)] you're allocating a list in heap memory. With itertools you get a lazy pipeline — a chain of generator-like objects that produce values on demand, one at a time, with near-zero overhead. This isn't just academic niceness; it's the difference between a data pipeline that fits in 4 MB of RAM and one that crashes a container.
By the end of this article you'll understand how itertools works internally, know exactly which tool to reach for in each scenario, avoid the three gotchas that trip up even experienced engineers, and be able to answer the interview questions that separate itertools users from itertools understanders.
How itertools Actually Works: Lazy Evaluation Under the Hood
Every object returned by itertools implements Python's iterator protocol — __iter__ and __next__. When you call itertools.chain(a, b) you don't get a new list; you get a tiny C object that holds references to a and b and advances a pointer each time __next__ is called. No data is copied.
This is fundamentally different from list comprehensions or list(map(...)). Those are eager — they compute everything immediately and store it. itertools objects are lazy — they compute nothing until forced to by a for loop, next(), or a consuming function like list() or sum().
The performance implication is dramatic. itertools.chain runs in O(1) space regardless of input size. The equivalent list_a + list_b runs in O(n) space because it allocates a brand-new list. For infinite sequences like itertools.count(), eager evaluation would simply never finish.
One nuance that bites people: because these objects are lazy, they can only be consumed once. After you've iterated through an itertools.chain object it's exhausted — calling list() on it a second time gives you an empty list. Keep that in mind any time you pass an iterator to multiple consumers.
```python
import itertools
import sys

# --- Eager approach: builds the entire list in memory ---
eager_numbers = list(range(1_000_000))        # Allocates ~8 MB on 64-bit Python
eager_size = sys.getsizeof(eager_numbers)     # Size of the list object itself

# --- Lazy approach: just a tiny C iterator object ---
lazy_numbers = itertools.islice(itertools.count(0), 1_000_000)  # Describes the sequence
lazy_size = sys.getsizeof(lazy_numbers)       # Size of the iterator object

print(f"Eager list size : {eager_size:>10,} bytes")
print(f"Lazy iterator   : {lazy_size:>10,} bytes")
print(f"Memory ratio    : {eager_size // lazy_size:>10,}x more memory for the eager list")

# --- Proving single-consumption behaviour ---
letters = iter(['a', 'b', 'c'])                  # A plain iterator (same rule applies)
chained = itertools.chain(letters, ['d', 'e'])   # Wrap it in chain
first_pass = list(chained)                       # Consumes the chain object completely
second_pass = list(chained)                      # chained is now exhausted, returns []
print(f"\nFirst pass:  {first_pass}")
print(f"Second pass: {second_pass} <-- exhausted, not a bug, expected behaviour")

# --- Proof that the C iterator holds no data copy ---
big_range = range(50_000_000)                    # range object, not a list
chain_obj = itertools.chain(big_range, big_range)
print(f"\nchain object type : {type(chain_obj)}")
print(f"chain object size : {sys.getsizeof(chain_obj)} bytes (constant, not proportional to input)")
```
Lazy iterator : 48 bytes
Memory ratio : 176,015x more memory for the eager list
First pass: ['a', 'b', 'c', 'd', 'e']
Second pass: [] <-- exhausted, not a bug, expected behaviour
chain object type : <class 'itertools.chain'>
chain object size : 48 bytes (constant, not proportional to input)
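To make the protocol concrete, here is a minimal pure-Python sketch of what `chain` does conceptually. The name `py_chain` is hypothetical; the real `itertools.chain` is a C object rather than a generator, but the observable behaviour is the same:

```python
def py_chain(*iterables):
    """Minimal pure-Python sketch of itertools.chain.

    Holds only references to the inputs; values are produced one
    at a time through the iterator protocol (__iter__/__next__).
    """
    for iterable in iterables:     # Advance to the next input lazily
        for item in iterable:      # Delegates to the input's own __next__
            yield item

merged = py_chain([1, 2], (3, 4), range(5, 7))
print(list(merged))   # [1, 2, 3, 4, 5, 6]
print(list(merged))   # [] -- generators are single-consumption too
```

Because it is a generator, `py_chain` exhibits exactly the single-consumption behaviour shown above: the second `list()` call finds nothing left.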
Infinite Iterators, Slicing and Chaining: Building Real Data Pipelines
itertools gives you three infinite iterators: count, cycle, and repeat. These sound dangerous but they're the backbone of patterns like round-robin load balancing (cycle), default-value filling (repeat), and auto-incrementing IDs (count). The key is always pairing them with islice or a takewhile to impose a finite boundary.
chain and chain.from_iterable deserve special attention because they're often the glue between pipeline stages. chain(*list_of_lists) unpacks at call time (so all sub-iterables must exist upfront), but chain.from_iterable(generator_of_lists) is fully lazy — it doesn't even touch the next sub-iterable until the previous one is exhausted. That distinction matters enormously when sub-iterables are themselves expensive generators.
islice is your lazy version of Python's slice syntax. It doesn't support negative indices (it can't — it doesn't know the total length), but for extracting a window from a stream it's indispensable. Combine it with dropwhile and takewhile to express SQL-like WHERE clauses over any iterable without materialising it first.
```python
import itertools
from datetime import datetime, timedelta

# ── 1. count() + islice(): generate timestamped event IDs lazily ──────────────
start_time = datetime(2024, 1, 1, 9, 0, 0)

def timestamped_events(start: datetime, interval_minutes: int):
    """Infinite generator of (event_id, timestamp) tuples."""
    for event_id in itertools.count(start=1001):   # Starts at 1001, never stops
        yield event_id, start
        start += timedelta(minutes=interval_minutes)

# Pull only the first 5 events -- islice imposes a safe boundary on the infinite stream
first_five_events = list(itertools.islice(timestamped_events(start_time, 15), 5))
print("First 5 events:")
for eid, ts in first_five_events:
    print(f"  Event {eid} at {ts.strftime('%H:%M')}")

# ── 2. cycle(): round-robin assignment across servers ─────────────────────────
servers = ['server-A', 'server-B', 'server-C']
server_carousel = itertools.cycle(servers)              # Loops forever: A, B, C, A, B, C ...
incoming_requests = [f"req-{i:03d}" for i in range(8)]  # 8 incoming requests

print("\nRound-robin routing:")
for request in incoming_requests:
    assigned_server = next(server_carousel)             # Pulls one from the cycle
    print(f"  {request} → {assigned_server}")

# ── 3. chain.from_iterable() vs chain(*args): lazy vs eager unpacking ─────────
def load_log_chunk(day: int):
    """Simulates reading a log file for a given day (a generator, not a list)."""
    print(f"  [Loading chunk for day {day}]")
    yield from [f"day{day}-line{i}" for i in range(3)]

log_days = range(1, 4)   # Days 1, 2, 3

# chain.from_iterable: each chunk is loaded ONLY when the previous is exhausted
print("\nLazy chaining with from_iterable:")
lazy_pipeline = itertools.chain.from_iterable(
    load_log_chunk(day) for day in log_days   # Generator expression -- not evaluated yet
)
for log_line in itertools.islice(lazy_pipeline, 4):   # Only read first 4 lines
    print(f"  → {log_line}")
# Notice: day 3 never even loads because islice stops at 4 items

# ── 4. dropwhile + takewhile: extract a specific window without loading all ───
sensor_readings = [0.1, 0.2, 0.5, 3.8, 4.1, 4.7, 4.2, 0.8, 0.3]

# Skip readings below threshold, then take readings until they drop back below
threshold = 1.0
abnormal_window = list(
    itertools.takewhile(
        lambda reading: reading >= threshold,                          # Stop when reading drops
        itertools.dropwhile(lambda r: r < threshold, sensor_readings)  # Skip until first spike
    )
)
print(f"\nAbnormal sensor window: {abnormal_window}")
```
First 5 events:
Event 1001 at 09:00
Event 1002 at 09:15
Event 1003 at 09:30
Event 1004 at 09:45
Event 1005 at 10:00
Round-robin routing:
req-000 → server-A
req-001 → server-B
req-002 → server-C
req-003 → server-A
req-004 → server-B
req-005 → server-C
req-006 → server-A
req-007 → server-B
Lazy chaining with from_iterable:
[Loading chunk for day 1]
→ day1-line0
→ day1-line1
→ day1-line2
[Loading chunk for day 2]
→ day2-line0
Abnormal sensor window: [3.8, 4.1, 4.7, 4.2]
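As noted above, `islice` is one way to bound an infinite iterator; `takewhile` is the other, stopping on a predicate rather than after a fixed count. A small sketch (the squares-under-100 example is illustrative, not part of the pipeline above):

```python
import itertools

# Bound the infinite count() with a predicate instead of a length:
# keep taking squares while they stay under 100.
small_squares = list(
    itertools.takewhile(
        lambda sq: sq < 100,                    # Stop at the first square >= 100
        (n * n for n in itertools.count(1))     # Infinite stream of squares
    )
)
print(small_squares)   # [1, 4, 9, 16, 25, 36, 49, 64, 81]
```

Prefer `islice` when you know how many items you want, and `takewhile` when you know the condition under which you want to stop.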
Combinatoric Iterators: When Brute-Force Needs to Be Elegant
The combinatoric family — product, permutations, combinations, combinations_with_replacement — is where itertools earns its keep in search algorithms, test generation, and constraint solving. Understanding the mathematical difference between them isn't optional; calling the wrong one silently gives you wrong results.
product is the Cartesian product — like nested for loops flattened. permutations gives every ordered arrangement — the order of selection matters. combinations gives unordered selections where each element appears at most once. combinations_with_replacement allows repeats.
All four are implemented in C and generate tuples one at a time. They're still lazy. But here's the gotcha: the number of results grows factorially. permutations(range(12)) produces 479,001,600 tuples. Even at 48 bytes each that's 23 GB if you materialise it. Always profile the count before consuming.
Use math.perm and math.comb to calculate the expected output size before deciding whether to iterate or materialise. In production, combinatoric iterators often feed directly into a filter() or itertools.filterfalse() call so that only valid combinations ever reach memory.
```python
import itertools
import math

# ── 1. product(): test all combinations of config flags ───────────────────────
debug_modes = [True, False]
log_levels = ['INFO', 'WARNING', 'ERROR']
cache_sizes_mb = [128, 256]

print("All configuration combinations (product):")
config_space = itertools.product(debug_modes, log_levels, cache_sizes_mb)
for debug, level, cache in config_space:
    print(f"  debug={str(debug):<5} log={level:<7} cache={cache}MB")

expected_count = len(debug_modes) * len(log_levels) * len(cache_sizes_mb)
print(f"Total: {expected_count} configs (math: {math.prod([2, 3, 2])})")

# ── 2. permutations vs combinations: the classic confusion ────────────────────
task_names = ['deploy', 'migrate', 'notify']

# permutations: order matters (deploy→migrate is different from migrate→deploy)
ordered_plans = list(itertools.permutations(task_names, r=2))
print(f"\nPermutations (ordered pairs, r=2): {len(ordered_plans)} results")
for plan in ordered_plans:
    print(f"  {plan[0]} then {plan[1]}")

# combinations: order doesn't matter (a team of 2, not a sequence)
unordered_pairs = list(itertools.combinations(task_names, r=2))
print(f"\nCombinations (unordered pairs, r=2): {len(unordered_pairs)} results")
for pair in unordered_pairs:
    print(f"  {pair}")

# ── 3. Always check size before materialising ─────────────────────────────────
items_count = 12
r_choose = 4
perm_count = math.perm(items_count, r_choose)   # n! / (n-r)!
comb_count = math.comb(items_count, r_choose)   # n! / (r! * (n-r)!)
print(f"\nWith n={items_count}, r={r_choose}:")
print(f"  permutations → {perm_count:,} tuples (~{perm_count * 48 / 1e6:.1f} MB if materialised)")
print(f"  combinations → {comb_count:,} tuples (~{comb_count * 48 / 1e6:.1f} MB if materialised)")

# ── 4. combinations_with_replacement: sampling WITH replacement ───────────────
dice_faces = [1, 2, 3, 4, 5, 6]
two_dice_rolls = list(itertools.combinations_with_replacement(dice_faces, r=2))
print(f"\nUnique unordered two-dice outcomes: {len(two_dice_rolls)}")
print(f"  First five: {two_dice_rolls[:5]}")
print(f"  Last five:  {two_dice_rolls[-5:]}")

# ── 5. Lazy combinatorics with filter: only materialise valid items ───────────
password_chars = 'abc'
four_char_passwords = itertools.product(password_chars, repeat=4)

# Only keep passwords that start AND end with 'a' -- constraint applied on-the-fly
valid_passwords = [
    ''.join(pwd)
    for pwd in four_char_passwords
    if pwd[0] == 'a' and pwd[-1] == 'a'
]
print(f"\n4-char passwords from 'abc' starting and ending with 'a':")
print(f"  {valid_passwords}")
print(f"  {len(valid_passwords)} valid out of {len(password_chars)**4} total")
```
All configuration combinations (product):
debug=True log=INFO cache=128MB
debug=True log=INFO cache=256MB
debug=True log=WARNING cache=128MB
debug=True log=WARNING cache=256MB
debug=True log=ERROR cache=128MB
debug=True log=ERROR cache=256MB
debug=False log=INFO cache=128MB
debug=False log=INFO cache=256MB
debug=False log=WARNING cache=128MB
debug=False log=WARNING cache=256MB
debug=False log=ERROR cache=128MB
debug=False log=ERROR cache=256MB
Total: 12 configs (math: 12)
Permutations (ordered pairs, r=2): 6 results
deploy then migrate
deploy then notify
migrate then deploy
migrate then notify
notify then deploy
notify then migrate
Combinations (unordered pairs, r=2): 3 results
('deploy', 'migrate')
('deploy', 'notify')
('migrate', 'notify')
With n=12, r=4:
permutations → 11,880 tuples (~0.6 MB if materialised)
combinations → 495 tuples (~0.0 MB if materialised)
Unique unordered two-dice outcomes: 21
First five: [(1, 1), (1, 2), (1, 3), (1, 4), (1, 5)]
Last five: [(4, 5), (4, 6), (5, 5), (5, 6), (6, 6)]
4-char passwords from 'abc' starting and ending with 'a':
['aaaa', 'aaba', 'aaca', 'abaa', 'abba', 'abca', 'acaa', 'acba', 'acca']
9 valid out of 81 total
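The `filterfalse` pattern mentioned earlier works like the password filter above but inverts the predicate: it keeps the items for which the predicate is False. A sketch with a made-up constraint (the port values and the "adjacent ports are invalid" rule are purely illustrative):

```python
import itertools

ports = [8000, 8001, 8080, 9000]
pairs = itertools.combinations(ports, 2)   # Lazy: nothing materialised yet

# filterfalse KEEPS items where the predicate is False --
# here: drop any pair whose ports differ by exactly 1.
valid_pairs = list(
    itertools.filterfalse(lambda p: abs(p[0] - p[1]) == 1, pairs)
)
print(valid_pairs)   # (8000, 8001) is the only pair filtered out
```

Because `filterfalse` is itself lazy, the rejected pairs are discarded one at a time; only the survivors ever reach the final list.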
groupby, accumulate and tee: The Power Tools Most Devs Underuse
groupby is the most misused function in itertools. It groups consecutive identical keys — not all matching keys across the entire iterable. If your data isn't sorted by the key first, you get multiple groups for the same key value. This surprises almost everyone on their first encounter.
accumulate is a lazy running aggregation. Where sum(data) gives one final total, accumulate(data) gives every intermediate total as it builds. This is invaluable for running totals, rolling maximums, and computing cumulative probability distributions without loading everything into pandas.
tee splits one iterator into n independent iterators. It sounds perfect for the multiple-consumer problem mentioned earlier, but it has a hidden cost: internally it buffers values that have been consumed by one branch but not yet by the other. If the two branches advance unevenly, memory grows. Use tee only when branches advance roughly in lockstep; otherwise materialise to a list.
Understanding these three tools separates engineers who use itertools as a convenience from those who deploy it as a deliberate architectural choice.
```python
import itertools
import operator
from collections import namedtuple

Transaction = namedtuple('Transaction', ['date', 'category', 'amount'])
transactions = [
    Transaction('2024-01-05', 'food',   12.50),
    Transaction('2024-01-06', 'food',    8.75),
    Transaction('2024-01-06', 'travel', 45.00),   # travel after food -- the food run ends here
    Transaction('2024-01-07', 'food',    6.00),   # food again -- NOT consecutive with earlier food
    Transaction('2024-01-08', 'travel', 120.00),
    Transaction('2024-01-08', 'travel', 30.00),
]

# ── 1. groupby GOTCHA: data must be sorted by key first ───────────────────────
print("=== groupby WITHOUT sorting (wrong) ===")
for category, group_iter in itertools.groupby(transactions, key=lambda t: t.category):
    items = list(group_iter)
    total = sum(t.amount for t in items)
    print(f"  {category:<8}: {len(items)} items, total={total:.2f}")   # 'food' appears TWICE!

print("\n=== groupby WITH sorting (correct) ===")
sorted_transactions = sorted(transactions, key=lambda t: t.category)   # Sort first!
for category, group_iter in itertools.groupby(sorted_transactions, key=lambda t: t.category):
    items = list(group_iter)               # Materialise the group -- group_iter is also lazy
    total = sum(t.amount for t in items)
    print(f"  {category:<8}: {len(items)} items, total={total:.2f}")

# ── 2. accumulate: running totals and rolling maximums ────────────────────────
daily_sales = [320, 415, 290, 560, 480, 390, 610]

running_total   = list(itertools.accumulate(daily_sales))                      # Default: add
running_max     = list(itertools.accumulate(daily_sales, func=max))            # Rolling max
running_product = list(itertools.accumulate(daily_sales, func=operator.mul))   # Running product

print("\nDaily sales  :", daily_sales)
print("Running total:", running_total)
print("Running max  :", running_max)

# accumulate with initial value (Python 3.8+): prepend a starting accumulator
cumulative_with_carry = list(
    itertools.accumulate(daily_sales, initial=1000)   # Start at 1000 carry-over
)
print("With 1000 carry:", cumulative_with_carry)       # First value is 1000 itself

# ── 3. tee: splitting a stream for two consumers advancing in lockstep ────────
def sensor_stream():
    """Simulates a live sensor stream -- we can only iterate it once."""
    for reading in [1.2, 0.8, 2.4, 1.9, 0.5, 3.1]:
        yield reading

raw_stream = sensor_stream()
stream_for_avg, stream_for_max = itertools.tee(raw_stream, 2)   # Split into 2 independent copies

# Both consumers advance together via zip -- tee's buffer stays small.
# Note: this "running avg" is an exponentially smoothed average (each step
# halves the weight of the old value), not the arithmetic mean.
print("\nStep-by-step: running avg vs running max")
avg_accumulator = itertools.accumulate(stream_for_avg, func=lambda running, new: (running + new) / 2)
max_accumulator = itertools.accumulate(stream_for_max, func=max)

for step, (avg_val, max_val) in enumerate(zip(avg_accumulator, max_accumulator), start=1):
    print(f"  Step {step}: running_avg={avg_val:.3f}  running_max={max_val:.1f}")
```
=== groupby WITHOUT sorting (wrong) ===
food : 2 items, total=21.25
travel : 1 items, total=45.00
food : 1 items, total=6.00
travel : 2 items, total=150.00
=== groupby WITH sorting (correct) ===
food : 3 items, total=27.25
travel : 3 items, total=195.00
Daily sales : [320, 415, 290, 560, 480, 390, 610]
Running total: [320, 735, 1025, 1585, 2065, 2455, 3065]
Running max : [320, 415, 415, 560, 560, 560, 610]
With 1000 carry: [1000, 1320, 1735, 2025, 2585, 3065, 3455, 4065]
Step-by-step: running avg vs running max
Step 1: running_avg=1.200 running_max=1.2
Step 2: running_avg=1.000 running_max=1.2
Step 3: running_avg=1.700 running_max=2.4
Step 4: running_avg=1.800 running_max=2.4
Step 5: running_avg=1.150 running_max=2.4
Step 6: running_avg=2.125 running_max=3.1
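One concrete use of the cumulative-distribution idea mentioned earlier: build a CDF with `accumulate`, then binary-search it with `bisect` for weighted sampling. The outcome names and weights below are invented for illustration:

```python
import bisect
import itertools
import random

outcomes = ['cache_hit', 'cache_miss', 'error']   # Hypothetical event types
weights = [70, 25, 5]                             # Relative weights
cumulative = list(itertools.accumulate(weights))  # [70, 95, 100] -- the CDF

def weighted_pick(rng: random.Random) -> str:
    """Draw a point in [0, total) and binary-search the CDF for its bucket."""
    point = rng.random() * cumulative[-1]
    return outcomes[bisect.bisect_right(cumulative, point)]

rng = random.Random(42)                            # Seeded for reproducibility
sample = [weighted_pick(rng) for _ in range(1000)]
print({name: sample.count(name) for name in outcomes})
```

This is essentially what `random.choices(outcomes, weights=weights)` does internally; building the CDF yourself is worthwhile when you draw many samples against the same weights.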
| Tool | Lazy? | Memory Usage | Output Type | Best For | Watch Out For |
|---|---|---|---|---|---|
| itertools.chain | Yes | O(1) | Values from inputs in order | Concatenating iterables without copying | Single consumption — can't reuse |
| itertools.product | Yes | O(r) per tuple | Tuples (Cartesian product) | Config matrix, test grid generation | Exponential output count — measure first |
| itertools.groupby | Yes | O(1) | Key + sub-iterator pairs | Grouping pre-sorted streams | Must sort by key first or groups split |
| itertools.tee | Yes | O(buffer) | n independent iterators | Splitting one stream for n consumers | Diverging consumers grow internal buffer |
| itertools.accumulate | Yes | O(1) | Running aggregate values | Running totals, rolling max/min | `initial` kwarg only in Python 3.8+ |
| list comprehension | No (eager) | O(n) | Full list in memory | Small, reusable collections | Crashes on huge data; no laziness |
⚠ Common Mistakes to Avoid
- ✕ Mistake 1: Calling `groupby` on unsorted data. Symptom: the same key appears multiple times as separate groups (e.g., the 'food' group appears at position 1 AND position 3). Fix: always call `sorted(data, key=your_key_func)` immediately before the `groupby` call; they're almost always used as a pair.
- ✕ Mistake 2: Consuming an itertools object twice. Symptom: `list(my_iterator)` returns a full list the first time, but `list(my_iterator)` immediately after returns `[]` silently; there's no error. Fix: either store `result = list(iterator)` and reuse `result`, or, if memory is a concern, recreate the iterator from the original source for the second pass.
- ✕ Mistake 3: Using `tee` when consumers advance at wildly different rates. Symptom: memory usage grows without bound because tee's internal deque buffers every value that branch 1 has consumed but branch 2 hasn't yet seen. Fix: if one consumer needs to skip far ahead, materialise the full iterator to a list once and share the list. Use `tee` only for true lockstep consumers like the zip-based pattern shown above.
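Mistake 3 is easy to demonstrate: race one `tee` branch 100,000 items ahead and the other branch still starts from the beginning, which is only possible because every skipped value is sitting in tee's internal buffer. A minimal sketch:

```python
import itertools

stream = iter(range(1_000_000))
fast, slow = itertools.tee(stream, 2)

# The fast consumer races 100,000 items ahead...
for _ in range(100_000):
    next(fast)

# ...and all 100,000 of those values are now held in tee's internal
# buffer, because the slow branch hasn't consumed them yet.
print(next(slow))   # 0 -- still at the very start of the stream
print(next(fast))   # 100000
```

If the slow branch will never need those values, dropping `tee` and re-creating the iterator (or materialising once to a list) is the safer design.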
Interview Questions on This Topic
- Q: What's the difference between `itertools.chain(a, b)` and `itertools.chain.from_iterable([a, b])`, and when does the distinction actually matter in production?
- Q: If I have a list of 10 items and I call `itertools.permutations(items, r=10)`, how many results will I get? Would you ever call `list()` on that? How would you safely process it?
- Q: Someone on your team writes `it = itertools.groupby(records, key=lambda r: r.department)` and complains that the same department appears multiple times in the output. What's the bug and how do you fix it without changing the `groupby` call itself?
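For the first question, the distinction can be shown in a few lines: `chain(*gen)` exhausts the outer generator at call time, while `chain.from_iterable(gen)` defers it. The `chunks` helper below is a hypothetical stand-in for expensive per-chunk work:

```python
import itertools

def chunks():
    for day in (1, 2, 3):
        print(f"building chunk {day}")   # Side effect marks when the work happens
        yield [day * 10, day * 10 + 1]

# chain(*...) unpacks the generator NOW: all three "building" lines
# print before a single value has been consumed.
eager = itertools.chain(*chunks())

# chain.from_iterable defers: nothing has been built at this point.
lazy = itertools.chain.from_iterable(chunks())

first = next(lazy)   # Only now does chunk 1 get built
print(first)         # 10
```

In production the distinction matters whenever each sub-iterable is expensive to produce (a file read, an API call): `from_iterable` pays that cost only for chunks you actually reach.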
Written and reviewed by senior developers with real-world experience across enterprise, startup and open-source projects. Every article on TheCodeForge is written to be clear, accurate and genuinely useful — not just SEO filler.