Senior 8 min · March 06, 2026

Graph Interview Problems - Topological Sort Cycle Bug

Q: When should I use BFS over DFS in a graph interview?

Use BFS when you need the shortest path in an unweighted graph or need to explore nodes in layers (closest first). Use DFS when you need to explore every possible path, detect cycles, or perform a topological sort via recursion.

Q: What is the best way to represent a graph: Adjacency Matrix or Adjacency List?

Adjacency Lists are usually preferred ($O(V+E)$ space) because most interview graphs are 'sparse.' Adjacency Matrices ($O(V^2)$ space) are only efficient for very 'dense' graphs where nearly every node is connected to every other node.

Q: Can Dijkstra's algorithm handle negative weights?

No. Dijkstra's algorithm uses a greedy approach that assumes adding an edge never decreases the total path cost. For graphs with negative weights, you must use the Bellman-Ford algorithm.

Q: What is the difference between topological sort using Kahn's algorithm and using DFS?

Kahn's algorithm uses in-degree decrementation with a queue, and produces a valid order directly. DFS-based topological sort does a post-order traversal and then reverses the finishing times. Both work on DAGs. Kahn's is often easier to reason about and directly detects cycles.

Q: Is Union-Find suitable for path queries between two nodes?

Yes, Union-Find can answer connectivity queries ('is node A connected to node B?') efficiently after all unions are performed. However, it cannot find the actual path — only whether such a path exists.

A single undetected cycle in a topological sort OOM-killed our build server.

Naren Founder & Principal Engineer

20+ years shipping production code across the stack, with years spent interviewing engineers. Written from production experience, not tutorials.

✓ Production

production tested

May 23, 2026

last updated

1,554

articles · all by Naren

● Production Incident 🔎 Debug Guide ⚙ Triage Commands

⚡Quick Answer

Graph problems appear in ~30% of FAANG coding rounds because real systems (Maps, Social Nets, Dispatch) are graphs under the hood.
Core patterns: BFS for shortest paths (unweighted), DFS for cycles/connectivity, Topological Sort for dependencies, Union-Find for dynamic connectivity.
Choosing the wrong traversal (DFS for shortest path) is the #1 mistake that costs an offer.
Performance trap: Adjacency List (O(V+E) space) beats Adjacency Matrix (O(V²) space) for sparse interview graphs.
Production insight: Forgetting to mark visited nodes causes infinite loops — same failure pattern in dependency resolution at scale.

✦ Definition~90s read

What is Top 10 Graph Interview Problems?

Graph problems are a staple of technical interviews because they test your ability to model relationships and reason about connectivity, dependencies, and traversal order. Unlike arrays or trees, graphs have no inherent root or linear structure — you must choose the right algorithm for the question at hand.

★

Imagine a city map.

This article focuses on five core patterns: BFS for shortest paths in unweighted graphs, DFS for cycle detection and exhaustive exploration, topological sort for dependency resolution (Kahn's algorithm), and Union-Find for dynamic connectivity and redundant edge detection. The most common bug in topological sort is failing to detect cycles, which makes the sort impossible — this article walks through exactly how that manifests in both BFS and DFS implementations.

These patterns cover roughly 80% of graph interview problems at companies like Google, Meta, and Amazon. BFS is your go-to for shortest path in unweighted graphs (e.g., word ladder, rotting oranges). DFS excels at detecting cycles in directed graphs and solving connectivity problems (e.g., number of islands, course schedule).

Topological sort handles task scheduling and build order problems, but only works on DAGs — if you skip cycle detection, you'll produce garbage output or infinite loops. Union-Find is ideal for problems involving disjoint sets and redundant connections (e.g., friend circles, redundant connection in a tree).

When not to use these: BFS is wasteful for weighted graphs (use Dijkstra's). DFS can blow the stack on large graphs (use iterative stack or BFS). Topological sort is meaningless if dependencies have cycles — you must detect them first. Union-Find doesn't help with pathfinding or shortest paths.

The first 60 seconds of any graph problem should be spent on representation: adjacency list for sparse graphs, adjacency matrix for dense ones, and edge list for Union-Find. Misidentifying the pattern early is the most common mistake — this article gives you the heuristics to avoid it.

Plain-English First

Imagine a city map. Every intersection is a dot, and every road connecting two intersections is a line. A graph is exactly that — dots (nodes) connected by lines (edges). Graph problems ask you things like: 'What's the shortest route from home to school?', 'Can I get from any city to any other city?', or 'Is there a road that, if it breaks, cuts two towns off from each other?' Every GPS app, social network, and package delivery system solves graph problems thousands of times per second.

Graph problems are the great filter of software engineering interviews. They appear in roughly 30% of FAANG-level coding rounds not because interviewers love theory, but because real systems — Google Maps, LinkedIn's friend-of-a-friend suggestions, Uber's dispatch engine, Kubernetes dependency resolution — are fundamentally graphs under the hood. If you can't navigate a graph, you can't reason about the systems that run the modern internet.

The challenge with graph interviews isn't memorizing algorithms. BFS and DFS are straightforward once you've seen them twice. The real difficulty is pattern recognition: knowing within 60 seconds whether a problem is a shortest-path problem, a cycle detection problem, a topological sort problem, or a connected-components problem — and then executing flawlessly under pressure without missing the edge cases that separate a 'hire' from a 'no hire'.

By the end of this article you'll have a mental framework for classifying any graph problem on sight, complete runnable solutions for the 10 most common interview problems, and the internal mechanics — why BFS gives shortest paths in unweighted graphs, why DFS finds cycles, why Union-Find beats DFS for dynamic connectivity — explained at the level an interviewer expects when they ask 'can you walk me through your reasoning?'

What Topological Sort Cycle Detection Actually Reveals

Topological sort is a linear ordering of vertices in a directed graph such that for every directed edge u→v, u appears before v. The core mechanic is simple: repeatedly pick nodes with zero in-degree (Kahn's algorithm) or perform DFS with a post-order stack. But the real interview challenge — and the production bug — is cycle detection. A cycle makes topological ordering impossible, and the bug surfaces when your algorithm silently returns a partial order or infinite loops.

Kahn's algorithm works by counting in-degrees, enqueuing zero-in-degree nodes, and decrementing neighbors. If the final sorted list length is less than the total node count, a cycle exists. DFS-based approaches track three states: unvisited, visiting (on current path), and visited. A cycle is detected when you encounter a node in the 'visiting' state. This is O(V+E) time and O(V) space — efficient enough for graphs with millions of nodes.

Use topological sort for dependency resolution (build systems, package managers), task scheduling, or course prerequisite ordering. In production, the cycle bug bites hardest in dependency graphs that evolve dynamically — like microservice startup order or data pipeline DAGs. A single cycle can cascade into deadlocked deployments or stuck batch jobs. Always validate acyclic property before trusting the order.

Partial Order ≠ Valid Order

A topological sort that returns fewer nodes than expected is not a 'partial result' — it's a cycle. Treat missing nodes as a hard failure, not a warning.

Production Insight

Microservice startup ordering: a circular dependency between service A (needs B) and B (needs A) causes both to hang on health-check timeouts.

Symptom: deployment completes but services never become healthy — logs show repeated connection refused errors in a loop.

Rule of thumb: always run a cycle detection pass before any topological sort in production — treat the graph as untrusted input.

Key Takeaway

Cycle detection is not optional — a topological sort without it is a bug waiting to happen.

Kahn's algorithm is production-friendly: O(V+E) time, no recursion depth issues, easy to detect cycles by counting sorted nodes.

DFS-based topological sort with three-state coloring is elegant but risks stack overflow on deep graphs — prefer Kahn for large-scale systems.

thecodeforge.io

Topological Sort Cycle Detection Bug

Breadth-First Search (BFS): The Shortest Path Pattern

Breadth-First Search is the definitive algorithm for finding the shortest path in an unweighted graph. It explores the graph in 'layers'—visiting all neighbors at distance 1, then distance 2, and so on. This guarantees that the first time you reach a target node, you've found the shortest route. In interviews, this pattern is frequently disguised as 'minimum number of moves to solve a puzzle' or 'nearest exit in a maze.'

Here's how the BFS pattern works in practice: you maintain a queue and a visited set. Each layer corresponds to one iteration over the current queue size. For each node popped, you examine its neighbors. If a neighbor hasn't been visited, mark it visited and enqueue it. The distance increments only after processing a full layer. This gives you the correct minimum distance by construction.

A common variant is the 'multi-source BFS' where you start from several nodes simultaneously — used in problems like 'walls and gates' or 'spreading fire.'

io/thecodeforge/graphs/BfsNavigator.javaJAVA

package io.thecodeforge.graphs;

import java.util.*;

public class BfsNavigator {
    /**
     * io.thecodeforge implementation of Shortest Path in an Adjacency List
     * Time Complexity: O(V + E), Space Complexity: O(V)
     */
    public int findShortestPath(int nodes, List<List<Integer>> adj, int start, int target) {
        Queue<Integer> queue = new LinkedList<>();
        boolean[] visited = new boolean[nodes];
        int distance = 0;

        queue.add(start);
        visited[start] = true;

        while (!queue.isEmpty()) {
            int size = queue.size();
            for (int i = 0; i < size; i++) {
                int current = queue.poll();
                if (current == target) return distance;

                for (int neighbor : adj.get(current)) {
                    if (!visited[neighbor]) {
                        visited[neighbor] = true;
                        queue.add(neighbor);
                    }
                }
            }
            distance++;
        }
        return -1; // No path found
    }
}

Output

// Example: Input 0 -> 2, Distance: 2 moves.

Forge Tip:

Type this code yourself rather than copy-pasting. The muscle memory of writing it will help it stick. Remember: For BFS, always use a Queue. For DFS, use a Stack (or recursion).

Production Insight

BFS is your go-to for shortest paths in unweighted graphs, but it fails when edge weights differ.

In production, systems like network routing or map directions use weighted graphs — BFS would give wrong results.

Rule: BFS = unweighted shortest path; Dijkstra = weighted (non-negative); Bellman-Ford = negative weights.

Key Takeaway

BFS gives shortest paths in unweighted graphs by exploring level by level.

The key invariant: mark visited when enqueued, not when dequeued — preventing duplicate processing.

If you need minimum steps in a grid or moves in a puzzle, BFS is your only first-choice pattern.

Depth-First Search (DFS): Cycle Detection, Connectivity, and Exhaustive Paths

Depth-First Search explores as far as possible along each branch before backtracking. It's the algorithm of choice when you need to explore all possibilities — detecting cycles, finding connected components, or solving problems like 'clone a graph' or 'path exists between two nodes.' DFS can be implemented recursively (simpler) or iteratively with a stack (avoids stack overflow for deep graphs).

Cycle detection using DFS is a classic pattern. For undirected graphs, you need to check if you encounter a neighbor that is visited and is NOT the parent of the current node. For directed graphs, you also need a recursion stack (set) to detect a back edge — a node that is currently in the recursion stack means a cycle.

DFS also forms the backbone of backtracking problems on graphs, like finding all paths from source to target in a DAG. In those cases, you don't mark nodes as visited globally; instead, you mark them on the current path and unmark after recursion.

io/thecodeforge/graphs/DfsCycleDetector.javaJAVA

package io.thecodeforge.graphs;

import java.util.*;

public class DfsCycleDetector {
    /**
     * Detects cycle in a directed graph using DFS with recursion stack.
     * O(V+E) time, O(V) space.
     */
    public boolean hasCycle(int nodes, List<List<Integer>> adj) {
        int[] state = new int[nodes]; // 0=unvisited, 1=in current stack, 2=processed
        for (int i = 0; i < nodes; i++) {
            if (state[i] == 0) {
                if (dfs(i, adj, state)) return true;
            }
        }
        return false;
    }

    private boolean dfs(int u, List<List<Integer>> adj, int[] state) {
        state[u] = 1;
        for (int v : adj.get(u)) {
            if (state[v] == 1) return true; // back edge => cycle
            if (state[v] == 0 && dfs(v, adj, state)) return true;
        }
        state[u] = 2;
        return false;
    }
}

Output

// Returns true if a cycle exists, false otherwise.

Three-Color DFS Model

White (0): node not visited yet.
Gray (1): node is currently on the recursion stack (part of current path).
Black (2): node and all descendants fully explored.
If we encounter a gray node during DFS, a cycle exists.

Production Insight

Recursive DFS can cause StackOverflowError on deep graphs (>10k depth).

In production, always use iterative DFS with an explicit stack for large graphs.

Rule: recursion depth > 1000 => switch to iterative stack.

Key Takeaway

DFS explores all paths — use it for cycle detection and exhaustive search.

Three-color state array avoids infinite loops and detects cycles correctly.

For undirected graphs, pass parent reference to avoid false positive cycles.

Topological Sort: Handling Task Dependencies (Kahn's Algorithm)

Topological sorting is only applicable to Directed Acyclic Graphs (DAGs). It provides a linear ordering of vertices such that for every directed edge $u \to v$, vertex $u$ comes before $v$. This is the foundation of build systems (like Gradle or Maven) and task scheduling. Kahn's Algorithm is the preferred iterative approach using in-degrees.

The algorithm works by maintaining an in-degree array for each node. Initialize a queue with all nodes that have in-degree 0. Then repeatedly pop a node, append it to result, and for each neighbor, decrement its in-degree. If a neighbor's in-degree becomes 0, push it into the queue. If at the end the result size doesn't equal the number of nodes, the graph has a cycle.

The alternative is recursive topological sort using DFS with a list of nodes in post-order — reverse of finishing times gives the topological order.

io/thecodeforge/graphs/KahnAlgorithm.javaJAVA

package io.thecodeforge.graphs;

import java.util.*;

public class DependencyResolver {
    /**
     * Resolves task order using Kahn's Algorithm (Topological Sort)
     */
    public int[] resolveOrder(int numTasks, int[][] dependencies) {
        int[] inDegree = new int[numTasks];
        List<List<Integer>> adj = new ArrayList<>();
        for (int i = 0; i < numTasks; i++) adj.add(new ArrayList<>());

        for (int[] dep : dependencies) {
            adj.get(dep[1]).add(dep[0]);
            inDegree[dep[0]]++;
        }

        Queue<Integer> queue = new LinkedList<>();
        for (int i = 0; i < numTasks; i++) {
            if (inDegree[i] == 0) queue.add(i);
        }

        int[] order = new int[numTasks];
        int index = 0;
        while (!queue.isEmpty()) {
            int curr = queue.poll();
            order[index++] = curr;

            for (int neighbor : adj.get(curr)) {
                inDegree[neighbor]--;
                if (inDegree[neighbor] == 0) queue.add(neighbor);
            }
        }

        return index == numTasks ? order : new int[0]; // Empty if cycle detected
    }
}

Output

// Resolves build order: Task 0 -> Task 1 -> Task 2

Edge Case Alert:

Always check for cycles! If the final ordered list doesn't contain all nodes, the graph has a cycle, meaning a circular dependency exists.

Production Insight

Kahn's algorithm with queue is O(V+E) but consumes O(V) memory for in-degree array.

In production dependency resolvers (e.g., Gradle, Maven), cycles cause build failures that waste hours.

Rule: always check result length against node count — a missing node means a cycle.

Key Takeaway

Topological sort = linear ordering respecting edge direction.

Kahn's algorithm uses in-degrees and a queue — simpler than DFS-based topo.

Run cycle detection first if there's any chance of circular dependencies.

Union-Find (Disjoint Set Union): Dynamic Connectivity & Redundant Edges

Union-Find is designed for problems where you need to efficiently merge sets and answer connectivity queries under dynamic edge additions. It excels in scenarios like 'find redundant connection' (detect where an edge connects two already-connected nodes) or 'number of components after adding edges online.'

The core operations are find() — with path compression — and union() — using union by rank or size. Optimized versions run in almost O(1) amortized time (inverse Ackermann function). Union-Find does not handle graph traversal or shortest paths; it's strictly for connectivity.

A classic interview problem: given a list of edges in a graph, find which edge creates a cycle (redundant connection). Using Union-Find, you process edges one by one. For each edge (u, v), if find(u) == find(v), the edge is redundant. Otherwise, union(u, v).

io/thecodeforge/graphs/UnionFind.javaJAVA

package io.thecodeforge.graphs;

public class UnionFind {
    private int[] parent, rank;

    public UnionFind(int n) {
        parent = new int[n];
        rank = new int[n];
        for (int i = 0; i < n; i++) parent[i] = i;
    }

    public int find(int x) {
        if (parent[x] != x) parent[x] = find(parent[x]); // path compression
        return parent[x];
    }

    public boolean union(int x, int y) {
        int rootX = find(x);
        int rootY = find(y);
        if (rootX == rootY) return false; // already connected

        // union by rank
        if (rank[rootX] < rank[rootY]) parent[rootX] = rootY;
        else if (rank[rootX] > rank[rootY]) parent[rootY] = rootX;
        else {
            parent[rootY] = rootX;
            rank[rootX]++;
        }
        return true;
    }

    public boolean isConnected(int x, int y) {
        return find(x) == find(y);
    }
}

Output

// Usage: UnionFind uf = new UnionFind(5); uf.union(0,1); uf.isConnected(0,2); // false

Union-Find as a Forest of Trees

find() climbs up the parent pointers to the root, compressing along the way.
union() attaches the smaller tree root to the larger tree root (by rank).
Time complexity: amortized O(α(n)) where α is the inverse Ackermann function.
Useful for: detecting cycles in undirected graphs, number of islands (dynamic version), Kruskals MST algorithm.

Production Insight

Union-Find is memory-efficient (just two arrays) but not suitable for dynamic disconnection.

In production, systems like Kubernetes node connectivity or load balancer health checks use more dynamic methods.

Rule: if you only need connectivity queries after edge additions, Union-Find beats BFS/DFS hands down.

Key Takeaway

Union-Find solves dynamic connectivity problems with near-constant time.

Path compression and union by rank are mandatory for performance — omit them and you're O(n).

Not for paths — only for connectivity checks.

Graph Representation & Pattern Recognition: The First 60 Seconds

The most critical skill in graph interviews is recognising the pattern within the first minute. Here's a quick decision tree:

Need shortest path in unweighted graph? → BFS.
Need all paths or detect cycles? → DFS.
Need to process tasks in order? → topological sort (Kahn/DFS).
Need dynamic connectivity or detect redundant edges? → Union-Find.
Weighted graph, shortest path, no negative edges? → Dijkstra.
Weighted graph with negative edges? → Bellman-Ford.
Need all-pairs shortest paths? → Floyd-Warshall.

Graph representation also matters. Adjacency List (List<List<Integer>>) is the default for sparse graphs (V up to 10^5, E up to 10^5). Adjacency Matrix (int[][]) only if V is small (< 1000) or graph is dense (E ~ V²). Edge List (List<int[]>) is useful when you need to process edges without adjacency queries.

In interviews, you'll rarely need to implement Dijkstra from scratch — but you must be able to reason about its behavior and write BFS/DFS/Kahn without hesitation.

io/thecodeforge/graphs/GraphRepresentation.javaJAVA

package io.thecodeforge.graphs;

import java.util.*;

public class GraphRepresentation {
    // Adjacency List (preferred)
    public List<List<Integer>> buildAdjList(int n, int[][] edges) {
        List<List<Integer>> adj = new ArrayList<>();
        for (int i = 0; i < n; i++) adj.add(new ArrayList<>());
        for (int[] e : edges) {
            adj.get(e[0]).add(e[1]);
            adj.get(e[1]).add(e[0]); // for undirected
        }
        return adj;
    }

    // Adjacency Matrix (for dense/small)
    public int[][] buildAdjMatrix(int n, int[][] edges) {
        int[][] matrix = new int[n][n];
        for (int[] e : edges) {
            matrix[e[0]][e[1]] = 1;
            matrix[e[1]][e[0]] = 1;
        }
        return matrix;
    }

    // Edge List (for edge processing)
    public int[][] buildEdgeList(int[][] edges) {
        return edges; // already edge list
    }
}

Output

// Choose based on constraints: List for 10^5, matrix for <1000 nodes.

Common Trap:

Don't start coding representation until you've identified the pattern. Many candidates dive into matrix because it looks simpler, but it wastes time and memory for large graphs.

Production Insight

Real-world systems often use adjacency lists with weighted edges in a HashMap per node for sparse graphs.

In production, graph representations must balance memory and query speed — adjacency list wins for most.

Rule: if V > 10^4, use adjacency list. If V < 1000, matrix is fine.

Key Takeaway

Pattern recognition determines success: BFS for shortest, DFS for cycles, Kahn for deps, Union-Find for connectivity.

Adjacency list is your default — it's O(V+E) space and fast traversal.

First 60 seconds: identify pattern and choose the right data structure.

Fundamentals Problems: The Non-Negotiable Baseline

Every interviewer starts here. Not because they're sadists, but because these problems strip away the syntax noise and reveal whether you actually understand graphs. If you can't implement BFS on an adjacency list in your sleep, you're not ready for anything harder.

The classics: Clone Graph, Course Schedule, Number of Islands. These aren't just practice — they're vocabulary. Clone Graph tests your grasp of graph traversal with a twist: you're building the graph as you walk it. Course Schedule is topological sort disguised as a college metaphor. Number of Islands is BFS/DFS with a grid wrapper.

Here's the real test: can you explain WHY the visited set must be checked before enqueuing, not after dequeuing? That single detail separates people who copy-paste from people who understand trade-offs. If you get this wrong in an interview, you've already lost.

Master these. Not by memorizing solutions, but by internalizing the traversal mechanics. Your interview won't ask you to solve Number of Islands — it'll ask you to solve something that reduces to Number of Islands. Recognize the pattern, not the problem statement.

CountIslands.pyPYTHON

// io.thecodeforge — interview tutorial

def count_islands(grid):
    if not grid:
        return 0
    
    rows, cols = len(grid), len(grid[0])
    visited = [[False] * cols for _ in range(rows)]
    islands = 0
    
    def bfs(start_row, start_col):
        queue = [(start_row, start_col)]
        visited[start_row][start_col] = True
        while queue:
            r, c = queue.pop(0)
            for dr, dc in [(1,0), (-1,0), (0,1), (0,-1)]:
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols:
                    if grid[nr][nc] == '1' and not visited[nr][nc]:
                        visited[nr][nc] = True
                        queue.append((nr, nc))
    
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] == '1' and not visited[r][c]:
                islands += 1
                bfs(r, c)
    return islands

Output

count_islands([['1','1','0','0'],['1','0','0','1'],['0','0','1','1']])

=> 3

Production Trap:

Marking visited on dequeue instead of enqueue doubles memory usage and can cause infinite loops in cyclic graphs. Always mark when you add to the queue, not when you pop.

Key Takeaway

Every graph problem is a traversal problem. Master BFS and DFS on adjacency lists, then recognize which problems reduce to them.

Intermediate Problems: Where They Probe Your Edge Cases

Once you've proven you can walk a graph, interviewers start throwing wrenches. Directed vs undirected? Weighted vs unweighted? Dense vs sparse? These aren't academic distinctions — they determine whether your solution runs in milliseconds or crashes the production server.

Problems like Word Ladder and Network Delay Time hit this zone. Word Ladder isn't just BFS — it's BFS on an implicit graph where nodes are words and edges exist if words differ by one character. The naive approach builds the entire adjacency list upfront, O(N² K) where N is word count and K is word length. The production solution generates neighbors on the fly, O(N K²). That's the difference between passing and timing out.

Network Delay Time tests Dijkstra's algorithm, but more importantly, it tests whether you know why Dijkstra fails on negative weights. If you can't articulate that — and the fix (Bellman-Ford) — you're not ready for follow-ups.

These problems separate candidates who've seen graphs from those who've shipped graph code. The patterns here: build the graph efficiently, handle disconnected components, and know when shortest path means unweighted BFS vs weighted priority queue.

WordLadderBFS.pyPYTHON

// io.thecodeforge — interview tutorial

from collections import deque

def word_ladder(start, end, word_list):
    word_set = set(word_list)
    if end not in word_set:
        return 0
    
    queue = deque([(start, 1)])
    while queue:
        current_word, steps = queue.popleft()
        if current_word == end:
            return steps
        
        # Generate neighbors on the fly — O(K²) per word
        for i in range(len(current_word)):
            for c in 'abcdefghijklmnopqrstuvwxyz':
                next_word = current_word[:i] + c + current_word[i+1:]
                if next_word in word_set:
                    word_set.remove(next_word)  # avoid cycles
                    queue.append((next_word, steps + 1))
    return 0

Output

word_ladder('hit', 'cog', ['hot','dot','dog','lot','log','cog'])

=> 5

Senior Shortcut:

For implicit graphs (words, state spaces), never pre-build the adjacency list. Generate neighbors on demand — it's often faster and always uses less memory.

Key Takeaway

Implicit graphs are waiting to ambush you. Always ask: 'Can I generate edges on the fly?' before committing to an adjacency list implementation.

Advanced Problems: The Production Systems Interview

This is where FAANG+ pays its rent. These aren't toy problems — they're system design prototypes in disguise. Minimum Spanning Tree, strongly connected components, and maximum flow feel academic until your service needs to detect communities of fraudulent accounts or route traffic across regions.

Problems like Accounts Merge and Critical Connections in a Network test advanced Union-Find and Tarjan's algorithm. Accounts Merge looks like a connectivity problem, but the twist is you're merging sets based on shared emails. It's not enough to union — you must represent each person as a node, each email as an attribute, and then detect which persons actually form a group. The naive approach is O(N²) and gets slaughtered by test cases with 10,000 accounts.

Critical Connections tests Tarjan's algorithm for bridges in a graph. Most candidates freeze because they've never seen DFS with discovery times and low-link values outside of CLRS. But if you strip away the jargon: you're assigning each node a 'timestamp' on first visit, then tracking the smallest timestamp reachable via back edges. If a child can't reach an ancestor without passing through the parent, that edge is critical.

Master these and you've proven you can handle graphs at scale. Not because you'll implement Tarjan's in production, but because you understand the trade-offs between Union-Find, DFS, and BFS for real-world connectivity problems.

CriticalConnections.pyPYTHON

// io.thecodeforge — interview tutorial

def critical_connections(n, connections):
    graph = [[] for _ in range(n)]
    for u, v in connections:
        graph[u].append(v)
        graph[v].append(u)
    
    discovery = [-1] * n
    low = [0] * n
    visited = [False] * n
    result = []
    timer = [0]
    
    def dfs(node, parent):
        visited[node] = True
        timer[0] += 1
        discovery[node] = low[node] = timer[0]
        for neighbor in graph[node]:
            if neighbor == parent:
                continue
            if not visited[neighbor]:
                dfs(neighbor, node)
                low[node] = min(low[node], low[neighbor])
                if low[neighbor] > discovery[node]:
                    result.append([node, neighbor])
            else:
                low[node] = min(low[node], discovery[neighbor])
    
    for i in range(n):
        if not visited[i]:
            dfs(i, -1)
    return result

Output

critical_connections(4, [[0,1],[1,2],[2,0],[1,3]])

=> [[1, 3]]

Algorithm Deep Dive:

Tarjan's bridge detection runs in O(V+E) — same as DFS. The low-link value trick works because it exploits the fact that back edges create alternative paths. No alternative path means a critical edge.

Key Takeaway

Advanced graph problems reward understanding of graph invariants (discovery time, low-link values, rank). Memorize the algorithm once, but internalize why the invariant captures the property you need.

● Production incidentPOST-MORTEMseverity: high

The Build Pipeline That Looped Forever

Symptom

Build jobs for a microservice monorepo never completed. The scheduler kept recompiling the same modules in different orders, producing no output but consuming CPU and memory until the build server OOM-killed the runner.

Assumption

The team assumed the build failure was due to missing artifacts or stale caches. They restarted the pipeline multiple times, losing hours.

Root cause

The build graph contained a cycle — module A depended on B, B depended on C, and C depended on A. The topological sort algorithm (Kahn's) wasn't detecting cycles because the team had disabled cycle detection to 'speed up' the build. The scheduler pushed jobs into an infinite resolution loop.

Fix

Re‑enabled cycle detection in the build scheduler. Added a pre‑check: run a DFS-based cycle detection before invoking Kahn's algorithm. If a cycle exists, fail immediately with the cycle path in the error message.

Key lesson

A topological sort without cycle detection is not a topological sort — it's a ticking bomb.
Every dependency resolution system must fail fast on cycles. Silent retries are dangerous.
Add a dedicated cycle detection pass (DFS + recursion stack) before any DAG processing.

Production debug guideSymptom → Action cheat sheet for common graph failures5 entries

Symptom · 01

Infinite loop during traversal

→

Fix

Check visited[] initialisation. Ensure every node is marked visited when enqueued (BFS) or when explored (DFS). Inspect recursion depth if using recursion.

Symptom · 02

Wrong shortest path distance returned

→

Fix

Verify you are using BFS, not DFS. If using Dijkstra, confirm no negative weights exist. Check that distance array is initialised to INF and updated only when a shorter path is found.

Symptom · 03

Cycle detected where there is none

→

Fix

For directed graphs, use DFS with recursion stack (gray set) — visited alone isn't enough. For undirected graphs, check parent reference to skip the immediate back edge.

Symptom · 04

Topological sort returns empty or partial order

→

Fix

Check for cycles using Kahn's algorithm: if in-degree queue empties before all nodes processed, a cycle exists. Print the cycle path via DFS.

Symptom · 05

Union-Find operation returns wrong connectivity

→

Fix

Verify path compression and union by rank are implemented correctly. Test on a small graph with known connectivity. Check that all edges are processed.

★ Graph Interview Debugging Cheat SheetQuick reference for the three graph patterns you'll see most often in interviews.

Need shortest path in unweighted graph−

Immediate action

Switch to BFS immediately.

Commands

Queue<Integer> queue = new LinkedList<>();

int[] dist = new int[n]; Arrays.fill(dist, INF); dist[start]=0;

Fix now

Mark visited when enqueuing, not when polling.

Need to detect a cycle in a directed graph+

Need to find connected components dynamically+

Graph Algorithm Comparison

Algorithm	Core Pattern	Use Case	Time Complexity	Space Complexity
BFS	Level-order traversal	Shortest path (unweighted), nearest exit	O(V+E)	O(V)
DFS	Exhaustive recursion / stack	Cycle detection, connectivity, all paths	O(V+E)	O(V) (recursion stack or explicit stack)
Kahn's (Topo Sort)	In-degree processing	Dependency resolution, scheduling	O(V+E)	O(V)
Union-Find	Disjoint sets with path compression	Dynamic connectivity, redundant edges	O(α(V)) amortized	O(V)
Dijkstra	Greedy / Priority Queue	Shortest path in weighted (non-negative) graphs	O((V+E) log V)	O(V)
Bellman-Ford	Edge relaxation (V-1 iterations)	Shortest path with negative weights, detect negative cycles	O(VE)	O(V)

Key takeaways

You now understand that graph problems are often just 'Matrix or Adjacency List' puzzles in disguise.

You've implemented BFS for shortest paths, DFS for cycle detection, Kahn's Algorithm for topological sorting, and Union-Find for dynamic connectivity using io.thecodeforge standards.

The secret to graph interviews is choosing the right data structure (Queue vs. Stack vs. Priority Queue) for the traversal pattern.

Pattern recognition within 60 seconds separates hires from no-hires

shortest path => BFS, cycles/paths => DFS, dependencies => Topological Sort, connectivity => Union-Find.

Symptom

Undirected cycle detection using a visited array alone gives false positives (detects cycle from the edge back to parent).

Fix

For undirected, pass a parent parameter to avoid counting the immediate back edge. For directed, use a recursion stack in addition to visited.

INTERVIEW PREP · PRACTICE MODE

Interview Questions on This Topic

Q01SENIOR

Given a reference of a node in a connected undirected graph, return a de...

Q02SENIOR

You are given a list of intervals where each interval indicates a meetin...

Q03SENIOR

There are n cities and a list of edges representing roads between them. ...

Q01 of 03SENIOR

Given a reference of a node in a connected undirected graph, return a deep copy (clone) of the graph. Explain your approach and write the code.

ANSWER

Use DFS with a HashMap to map original node -> clone node. Start from the given node. If the node is already in the map, return the clone. Otherwise, create a clone, store it, and recursively clone all neighbors. This ensures we don't cycle infinitely. Time O(V+E), space O(V) for the map and recursion stack.

FAQ · 5 QUESTIONS

Frequently Asked Questions

When should I use BFS over DFS in a graph interview?

What is the best way to represent a graph: Adjacency Matrix or Adjacency List?

Can Dijkstra's algorithm handle negative weights?

What is the difference between topological sort using Kahn's algorithm and using DFS?

Is Union-Find suitable for path queries between two nodes?

Naren Founder & Principal Engineer

20+ years shipping production code across the stack, with years spent interviewing engineers. Written from production experience, not tutorials.

✓ Verified

production tested

May 23, 2026

last updated

1,554

articles · all by Naren

🔥

That's Coding Patterns. Mark it forged?

8 min read · try the examples if you haven't