
FCFS Scheduling — Convoy Effect Cripples Payment API

📍 Part of: Scheduling → Topic 1 of 4
Response time for payment API jumped from <200ms to >15s due to convoy effect in FCFS scheduling.
🧑‍💻 Beginner-friendly — no prior DSA experience needed
In this tutorial, you'll learn
  • FCFS: sort by arrival, execute in order — non-preemptive.
  • Waiting Time = Turnaround Time - Burst Time. Turnaround Time = Completion - Arrival.
  • Convoy effect: short processes wait behind long ones → high average waiting time.
✦ Plain-English analogy ✦ Real code with output ✦ Interview questions
Quick Answer
  • Processes execute in arrival order — pure FIFO queue.
  • No preemption: once running, a process keeps the CPU until it blocks or exits.
  • Convoy effect: a single long job holds the CPU while short jobs wait idle, inflating average waiting time.
  • Average waiting time grows with burst-time variance — not robust for interactive loads.
  • Production trap: using FCFS for mixed workloads (CPU + I/O) causes I/O devices to starve, tanking throughput.
🚨 START HERE

Quick FCFS Debug Commands

Commands to detect convoy effect and measure scheduling delays in real time
🟡

High average waiting time, short tasks delayed

Immediate Action: List processes sorted by arrival time and record their cumulative waiting times
Commands
ps -eo pid,comm,etime,pri,nice --sort=start_time
for p in $(pgrep -f batch); do cat /proc/$p/sched | head -20; done
Fix Now: Renice the long batch job to a lower priority: renice +10 -p <pid>
🟠

Thread pool queue depth growing while CPU idle

Immediate Action: Check thread pool work queue size and oldest task age
Commands
jstack <pid> | grep -A5 'pool-' | head -30
java -jar jmxterm.jar -l localhost:9010 -i commands.txt
Fix Now: Restart with a custom thread pool that uses a priority queue, or separate pools for different job types
Production Incident

Convoy Effect Brings Down an E-Commerce Checkout Service

A CPU-bound batch job arrived at the queue and held the CPU for 15 seconds while all payment processing requests queued behind it. Response time for checkout spiked from 200ms to 15 seconds. The engineering team initially blamed the database, but the root cause was FCFS scheduling on a single-core system running legacy batch processing and interactive services on the same thread pool.
Symptom: Response time for payment API calls rose from <200ms to >15 seconds during the first minute of every hour. The database showed no contention and queries were fast.
Assumption: The team assumed the database was the bottleneck because queries appeared slow under load. They added read replicas and increased connection pool sizes — no improvement.
Root cause: A nightly batch job (CPU-bound, burst 15s) arrived at the same thread pool used for payment requests. FCFS scheduling queued all incoming payment requests behind the batch job, causing the convoy effect. The thread pool had only one worker thread.
Fix: Separated batch processing onto a dedicated thread pool or worker with lower priority. Applied priority scheduling (batch low, interactive high) or switched to a preemptive scheduler such as Round Robin with a short time quantum.
Key Lesson
  • Never mix CPU-bound batch jobs and interactive I/O-bound requests on the same FCFS queue.
  • Measure queue depth under load: it reveals convoy effect faster than response times alone.
  • FCFS belongs in batch systems with homogeneous job sizes, not mixed workloads.
Production Debug Guide

How to identify convoy effect and confirm FCFS behaviour in production

  • Symptom: Wide gap in completion times between short and long jobs despite low CPU utilization. Check: Compare process arrival times and durations; plot waiting time vs burst time. A strong positive correlation indicates convoy effect under FCFS.
  • Symptom: I/O-bound processes have high waiting time but the disk is idle. Check: Verify the scheduler policy (ps -eo pid,comm,policy on Linux). SCHED_FIFO runs equal-priority tasks in strict FCFS order; under the default SCHED_OTHER the kernel preempts, so a userspace FIFO queue is the more likely culprit.
  • Symptom: Response time spikes correlate with the start of a single long-running process. Check: Trace process start times via /proc/[pid]/stat and compare against spike timestamps. If the long process started seconds before the spike, an FCFS convoy is likely.
  • Symptom: Thread pool metrics show an ever-increasing queue depth after a specific job submission. Check: Inspect the thread pool implementation. If it uses a FIFO work queue, any long task will block subsequent short tasks; switch to a work-stealing or priority queue.

FCFS is the algorithm you implement when you have a queue and zero information about job durations. It is fair in the sense that arrival order is respected — no process waits longer than all processes that arrived before it. But 'fair' does not mean 'efficient', and the convoy effect makes FCFS genuinely harmful for interactive workloads.

Understanding FCFS matters not for implementing it (it is trivially a FIFO queue) but for understanding what it optimises and what it sacrifices. Every scheduling algorithm trades off fairness, average waiting time, response time, and throughput differently. FCFS optimises arrival-order fairness at the cost of average waiting time. This trade-off analysis is the interview skill being tested.

Algorithm and Metrics

Key metrics
  • Completion Time (CT): When the process finishes
  • Turnaround Time (TAT): CT - Arrival Time
  • Waiting Time (WT): TAT - Burst Time
  • Response Time: time from arrival to first execution (equals WT for non-preemptive FCFS)

FCFS is trivially a FIFO queue. You sort processes by arrival time, then execute each to completion. The metrics tell the real story: average waiting time can skyrocket when a long job arrives early.

fcfs.py · PYTHON
def fcfs(processes: list[dict]) -> list[dict]:
    """
    processes: list of {pid, arrival, burst}
    Returns processes with ct, tat, wt added.
    """
    # Sort by arrival time
    procs = sorted(processes, key=lambda p: p['arrival'])
    current_time = 0
    results = []
    for p in procs:
        start = max(current_time, p['arrival'])
        ct  = start + p['burst']
        tat = ct - p['arrival']
        wt  = tat - p['burst']
        current_time = ct
        results.append({**p, 'start': start, 'ct': ct, 'tat': tat, 'wt': wt})
    return results

processes = [
    {'pid':'P1','arrival':0,'burst':24},
    {'pid':'P2','arrival':1,'burst':3},
    {'pid':'P3','arrival':2,'burst':3},
]
results = fcfs(processes)
print(f"{'PID':<4} {'AT':>3} {'BT':>3} {'ST':>3} {'CT':>3} {'TAT':>4} {'WT':>4}")
for r in results:
    print(f"{r['pid']:<4} {r['arrival']:>3} {r['burst']:>3} {r['start']:>3} {r['ct']:>3} {r['tat']:>4} {r['wt']:>4}")
avg_tat = sum(r['tat'] for r in results) / len(results)
avg_wt  = sum(r['wt']  for r in results) / len(results)
print(f"\nAverage TAT: {avg_tat:.2f}")
print(f"Average WT:  {avg_wt:.2f}")
▶ Output
PID   AT  BT  ST  CT  TAT   WT
P1     0  24   0  24   24    0
P2     1   3  24  27   26   23
P3     2   3  27  30   28   25

Average TAT: 26.00
Average WT: 16.00
📊 Production Insight
These numbers hide a trap: the average WT of 16ms comes entirely from the two short jobs. P1 never waits. In a real system, if those short jobs are interactive, users feel the full 23-25ms convoy delay; scale the bursts up to seconds and you get the payment-API incident above.
If you're building a metric dashboard, always track the distribution of waiting times, not just the average. The 99th percentile reveals the convoy victims.
🎯 Key Takeaway
FCFS arrival order is fair but waiting time depends entirely on the burst of earlier processes.
Average waiting time can be misleading — monitor percentiles.
If you see one long process inflating the mean, you're looking at a convoy.
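Computing that distribution needs only the standard library. Here is a sketch with a hypothetical sample (97 fast requests plus the three waits from the example above):

```python
from statistics import quantiles

# Hypothetical sample: mostly fast requests plus a few convoy victims
waits_ms = [1] * 97 + [0, 23, 25]

mean = sum(waits_ms) / len(waits_ms)
p99 = quantiles(waits_ms, n=100)[98]   # 99th percentile cut point

print(f"mean={mean:.2f}ms  p99={p99:.1f}ms")   # mean=1.45ms  p99=25.0ms
```

The mean looks perfectly healthy while the p99 exposes the convoy victims — exactly the gap a dashboard built on averages would miss.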

The Convoy Effect

In the example above, P2 and P3 each take only 3ms but wait 23ms and 25ms because P1 (24ms) arrived first. This is the convoy effect — short processes form a convoy behind a long one.

If we could run P2, P3, P1 instead (shortest-job-first order):
  • CPU idles until P2 arrives at t=1; P2: WT=0, P3: WT=2, P1: WT=7
  • Average WT = 3.00ms vs 16ms for FCFS
(Strictly, non-preemptive SJF would still pick P1 at t=0, since it is the only process that has arrived; this reordering assumes the long job can be held back.)

This is why FCFS has poor average waiting time for mixed workloads.
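The comparison is easy to check with a small simulator. Below is a minimal sketch of non-preemptive SJF (not code from this article): at each step it runs the shortest job among those already arrived. Because strict SJF would still pick P1 first with arrivals 0, 1, 2, the sketch runs all three jobs arriving together to isolate the ordering effect:

```python
def sjf(processes: list[dict]) -> list[dict]:
    """Non-preemptive SJF: repeatedly run the shortest job that has arrived."""
    pending = sorted(processes, key=lambda p: p['arrival'])
    t, results = 0, []
    while pending:
        ready = [p for p in pending if p['arrival'] <= t]
        if not ready:                     # CPU idle until the next arrival
            t = pending[0]['arrival']
            continue
        p = min(ready, key=lambda q: q['burst'])
        pending.remove(p)
        ct = t + p['burst']
        results.append({**p, 'start': t, 'ct': ct,
                        'tat': ct - p['arrival'],
                        'wt': ct - p['arrival'] - p['burst']})
        t = ct
    return results

# All jobs arriving together: SJF runs the short P2 and P3 before the long P1
jobs = [{'pid': 'P1', 'arrival': 0, 'burst': 24},
        {'pid': 'P2', 'arrival': 0, 'burst': 3},
        {'pid': 'P3', 'arrival': 0, 'burst': 3}]
res = sjf(jobs)
avg_wt = sum(r['wt'] for r in res) / len(res)
print(f"Average WT under SJF: {avg_wt:.2f}")   # 3.00 vs 17.00 under FCFS
```

With identical arrivals, FCFS on the same jobs gives waits of 0, 24, 27 (average 17ms); SJF cuts the average to 3ms by refusing to let the convoy form.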

⚠ Convoy Effect
FCFS is particularly bad when a CPU-bound process arrives before many I/O-bound processes. The CPU-bound process holds the CPU while I/O-bound processes (which could be doing I/O) wait idle.
📊 Production Insight
Convoy effect is not just a textbook problem. In any production system mixing compute-heavy and light tasks, FCFS can cause I/O devices to starve. The I/O-bound processes could issue disk reads, but they sit blocked on CPU while the disk sits idle. Throughput drops because the system isn't overlapping CPU and I/O operations.
Debugging tip: If CPU utilization is low but I/O-bound processes have high waiting times, check the scheduler policy.
🎯 Key Takeaway
Convoy effect wastes I/O bandwidth, not just CPU time.
The fix is preemption — don't let one process monopolize the CPU.
If you cannot preempt, at least isolate batch jobs to separate queues.

Gantt Chart Representation

A Gantt chart visually shows which process runs at each time interval. For FCFS, the chart is simply the processes in arrival order, each spanning its burst duration. This representation makes the convoy effect obvious: long bars push short bars to the right.

gantt.py · PYTHON
def print_gantt(results: list[dict]) -> None:
    print('Gantt Chart:')
    bar = '|'
    times = '0'
    for r in results:
        bar += f" {r['pid']:^5}|"
        times += f"{r['ct']:>7}"
    print(bar)
    print(times)

print_gantt(results)
▶ Output
Gantt Chart:
|  P1  |  P2  |  P3  |
0     24     27     30
📊 Production Insight
In a real system, you can generate a Gantt-like timeline from /proc/[pid]/schedstat. Plotting cumulative CPU time per process reveals if a single long task dominates early time slices.
If you see a flat line for short tasks while a long task runs, that's visual proof of convoy.
🎯 Key Takeaway
Gantt charts expose scheduling unfairness at a glance.
Automate their generation from performance monitoring data.
Long contiguous bars = convoy; many small bars = preemptive fairness.

When to Use FCFS in Practice

FCFS is appropriate only when
  • All jobs have roughly equal burst times (e.g., batch file processing with similar file sizes).
  • Order must be strictly preserved (e.g., transactions that must commit in arrival sequence).
  • No prior information about job durations exists and you cannot risk priority inversion.
FCFS is not appropriate for
  • Interactive systems where response time matters.
  • Mixed workloads with heterogeneous burst times.
  • Real-time systems where deadlines must be met.

In most general-purpose operating systems, FCFS is used only as a fallback scheduler for equal-priority processes under SCHED_OTHER. Linux's Completely Fair Scheduler (CFS) is fundamentally different — it's a weighted fair queuing scheduler that preempts to maintain fairness.

Mental Model
FCFS in the OS Ecosystem
FCFS is the 'null model' of scheduling — it's what happens when you don't make any decisions.
  • No decision: just a queue. Predictable in behaviour, unpredictable in performance.
  • Linux's CFS targets millisecond-scale scheduling granularity (a few milliseconds of target latency by default) — the opposite of FCFS's run-to-completion.
  • FCFS survives as a special case for cooperative multitasking or dispatcher systems where preemption is impossible.
📊 Production Insight
You'll rarely configure a modern OS scheduler to pure FCFS because the kernel already does preemptive time-sharing. But FCFS emerges accidentally when a thread pool uses a single worker thread and a FIFO work queue. That's where the real convoy problems live.
Rule: Any single-threaded FIFO queue in a concurrent system is a potential FCFS convoy trap.
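That trap is easy to reproduce. A sketch using Python's concurrent.futures (task names and durations are illustrative): one worker plus a FIFO queue means a 200ms "batch" task delays every short task submitted after it.

```python
import time
from concurrent.futures import ThreadPoolExecutor

def job(name: str, work_s: float, submitted: float):
    waited = time.perf_counter() - submitted   # time spent queued, not working
    time.sleep(work_s)                         # simulate the actual work
    return name, waited

# One worker + a FIFO work queue = FCFS in disguise
with ThreadPoolExecutor(max_workers=1) as pool:
    futures = [pool.submit(job, 'batch', 0.2, time.perf_counter())]
    futures += [pool.submit(job, f'api-{i}', 0.001, time.perf_counter())
                for i in range(3)]
    results = dict(f.result() for f in futures)

for name, waited in results.items():
    print(f"{name:<6} queued for {waited * 1000:6.1f} ms")
```

The batch task starts almost immediately, but every api task sits queued for at least the batch task's full 200ms burst — a miniature of the payment-API incident.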
🎯 Key Takeaway
FCFS is the default when you opt out of scheduling decisions.
Watch out for accidental FCFS in thread pools and dispatchers.
Real-time and interactive systems must preempt.

FCFS vs. SJF vs. Round Robin

Algorithm            | Average Waiting Time            | Starvation Risk                 | Preemptive
FCFS                 | High with variance              | None (everyone runs eventually) | No
SJF (non-preemptive) | Optimal for given arrival order | Yes, long processes             | No
Round Robin          | Moderate, depends on quantum    | Low                             | Yes

FCFS's fairness guarantee is arrival order: no process is ever overtaken by a later arrival, so each process's wait is bounded by the work already queued ahead of it. But average waiting time balloons when burst times vary.

SJF exploits knowledge of burst times to minimise average waiting time, but can starve long processes. Round Robin trades off average waiting time for response time — good for interactive workloads.
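The Round Robin trade-off can be made concrete with a small simulator (a sketch, not code from this article; the 4ms quantum is an illustrative choice) run on the same three processes:

```python
from collections import deque

def rr(processes: list[dict], quantum: int) -> dict:
    """Round Robin: each process runs at most `quantum` ms before requeueing."""
    procs = sorted(processes, key=lambda p: p['arrival'])
    remaining = {p['pid']: p['burst'] for p in procs}
    first_run, done, ready = {}, {}, deque()
    t, i = 0, 0
    while len(done) < len(procs):
        while i < len(procs) and procs[i]['arrival'] <= t:  # admit new arrivals
            ready.append(procs[i])
            i += 1
        if not ready:                                       # idle until next arrival
            t = procs[i]['arrival']
            continue
        p = ready.popleft()
        first_run.setdefault(p['pid'], t)
        run = min(quantum, remaining[p['pid']])
        t += run
        remaining[p['pid']] -= run
        while i < len(procs) and procs[i]['arrival'] <= t:  # arrivals during the slice
            ready.append(procs[i])
            i += 1
        if remaining[p['pid']] == 0:
            done[p['pid']] = {'ct': t,
                              'tat': t - p['arrival'],
                              'wt': t - p['arrival'] - p['burst'],
                              'resp': first_run[p['pid']] - p['arrival']}
        else:
            ready.append(p)                                 # back of the queue
    return done

jobs = [{'pid': 'P1', 'arrival': 0, 'burst': 24},
        {'pid': 'P2', 'arrival': 1, 'burst': 3},
        {'pid': 'P3', 'arrival': 2, 'burst': 3}]
res = rr(jobs, quantum=4)
avg_wt = sum(r['wt'] for r in res.values()) / len(res)
print(f"Average WT under RR(q=4): {avg_wt:.2f}")   # 4.67 vs 16.00 under FCFS
```

Preemption breaks the convoy: P2's response time falls from 23ms under FCFS to 3ms, at the cost of a few extra context switches for P1.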

In production, you rarely pick one in isolation. Hybrid schedulers like CFS combine elements: target latency (like RR) with fairness (like FCFS) and dynamic priority adjustments.

📊 Production Insight
In real systems, burst times are rarely known. SJF is often estimated from historical data (e.g., exponential averaging), which introduces its own complexity. FCFS avoids this estimation entirely but pays the convoy cost.
If you're building a job scheduler from scratch, start with FCFS for simplicity, then add preemption when you measure convoy effect in production.
🎯 Key Takeaway
No single algorithm dominates — trade-offs are inevitable.
FCFS is the baseline; anything more sophisticated must justify its complexity.
Measure your workload's burst time distribution before choosing.
Choosing a Scheduling Policy
If: All jobs are CPU-bound and similar length
Use: FCFS is fine — simplest, no overhead.
If: Interactive response time is critical
Use: Round Robin or CFS — preemption ensures fair response times.
If: You know job burst times beforehand
Use: SJF minimises average waiting time, but watch for starvation — implement aging.
🗂 FCFS vs SJF vs Round Robin
Algorithm            | Preemptive?        | Avg. Wait Time               | Starvation Risk   | Best Use Case
FCFS                 | No                 | High with burst variance     | None              | Batch systems with homogeneous jobs
SJF (non-preemptive) | No                 | Optimal (theoretical)        | High (long tasks) | When burst times known and short jobs dominate
Round Robin          | Yes (time quantum) | Moderate, depends on quantum | Low               | Interactive workloads, time-sharing

🎯 Key Takeaways

  • FCFS: sort by arrival, execute in order — non-preemptive.
  • Waiting Time = Turnaround Time - Burst Time. Turnaround Time = Completion - Arrival.
  • Convoy effect: short processes wait behind long ones → high average waiting time.
  • Simple to implement (FIFO queue) but rarely optimal.
  • Best used when all processes have similar burst times or process order must be preserved.

⚠ Common Mistakes to Avoid

    Assuming FCFS is 'fair' for all metrics
    Symptom

    Short interactive processes experience high latency, users complain, but the scheduler report shows 'no process waited more than average'.

    Fix

    Track wait time percentiles (p99, p999) rather than just average. Educate the team that FCFS fairness is only arrival-order fairness, not performance fairness.

    Using FCFS for I/O-heavy workloads because 'it's simple'
    Symptom

    CPU utilization is low (< 20%) but throughput is terrible. I/O devices are idle while processes sit in the run queue.

    Fix

    Switch to a preemptive scheduler (e.g., RR with quantum 10-50ms) to allow I/O-bound processes to issue requests and overlap I/O with CPU.

    Implementing a thread pool with a single FIFO work queue and claiming it's 'FCFS scheduling'
    Symptom

    A long task in the queue blocks all subsequent short tasks. No parallelism despite multiple cores.

    Fix

    Use multiple worker threads, or a work-stealing queue (e.g., ForkJoinPool in Java), or a priority queue that prefers short tasks.

Interview Questions on This Topic

  • Q: Explain the convoy effect in FCFS scheduling and show it with an example. (Mid-level)
    A: The convoy effect occurs when a long CPU-bound process arrives before several short I/O-bound processes. The long process holds the CPU, forcing the short processes to wait. This increases average waiting time and wastes I/O bandwidth because the I/O devices remain idle while short processes could have issued requests. Example: P1 (burst 24), P2 (burst 3), P3 (burst 3) arrive at times 0, 1, 2. Under FCFS, P1 runs first, P2 waits 23ms, P3 waits 25ms, and average WT = 16ms. If the short jobs could run first, average WT drops to 3ms, so the convoy costs 13ms of average waiting time.
  • Q: Is FCFS preemptive or non-preemptive? What are the implications? (Junior)
    A: FCFS is non-preemptive. Once a process starts executing, it holds the CPU until it voluntarily yields (e.g., I/O wait, termination) or blocks. This means no context switch overhead during a process's burst, but it also means a single long process can delay all others arbitrarily. Non-preemptive scheduling is simpler but not responsive to urgent or interactive tasks.
  • Q: When would you choose FCFS over other scheduling algorithms in a production system? (Senior)
    A: When all processes have roughly equal burst times (so the convoy effect is minimal); when order preservation is a hard requirement (e.g., transaction logs); when you have no prior knowledge of job lengths and want to guarantee no starvation; or when overhead must be close to zero (FCFS is just a queue). In practice, modern OS schedulers like CFS do not use pure FCFS, but dedicated batch processing systems (e.g., job schedulers like AutoSys with single-executor queues) may effectively run FCFS.
  • Q: Calculate average waiting time for FCFS with processes P1(AT=0, BT=8), P2(AT=1, BT=4), P3(AT=2, BT=2). (Mid-level)
    A: Order by arrival: P1, P2, P3. P1: start=0, CT=8, WT=0. P2: start=8, CT=12, WT=(12-1)-4=7. P3: start=12, CT=14, WT=(14-2)-2=10. Average WT=(0+7+10)/3≈5.67ms. Compare with non-preemptive SJF: P1 still runs first (it is the only process at t=0); then P3 (shorter burst): start=8, WT=(10-2)-2=6; then P2: start=10, WT=(14-1)-4=9. Average WT=(0+6+9)/3=5.0ms. The difference is small because burst times are similar; if P1's burst were much larger, the gap would be stark.
  • Q: Does FCFS cause starvation? Explain. (Junior)
    A: No, FCFS does not cause starvation because every process will eventually reach the head of the queue and execute. There is no priority mechanism that could perpetually bypass a process. However, a process with a very long burst can cause other processes to wait an extremely long time (bounded but high). This is sometimes confused with starvation, but starvation implies indefinite postponement, which cannot happen under FCFS.

Frequently Asked Questions

Does FCFS cause starvation?

No — every process eventually gets the CPU since there is no priority mechanism to bypass the queue. However, waiting times can be very high (short processes stuck behind long ones), which is sometimes confused with starvation.

Can FCFS be preemptive?

No — by definition, FCFS is non-preemptive. If you add preemption, it becomes Round Robin (with a time quantum) or priority-based scheduling.

Is FCFS the same as FIFO scheduling?

Yes, for processes with the same arrival order. FIFO refers to the queue discipline (First In, First Out); FCFS is the scheduling policy built on that queue.

How does FCFS handle processes arriving at the exact same time?

If two processes have identical arrival times, tie-breaking is typically by process ID. The scheduler can choose any order, but FCFS assumes a stable ordering — the exact choice does not affect the algorithm's definition.
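One way to make that tie-break explicit in code (the composite sort key is a convention, not part of FCFS's definition):

```python
processes = [{'pid': 'P2', 'arrival': 0, 'burst': 3},
             {'pid': 'P1', 'arrival': 0, 'burst': 24}]

# Sort by arrival, then PID: identical arrivals get a stable, deterministic order
queue = sorted(processes, key=lambda p: (p['arrival'], p['pid']))
print([p['pid'] for p in queue])   # ['P1', 'P2']
```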

Why is FCFS still taught if it's rarely used in modern OS?

Because it is the simplest scheduling model, serving as a baseline for evaluating all other algorithms. Understanding FCFS builds intuition for trade-offs around waiting time, fairness, and the convoy effect.

Naren, Founder & Author of TheCodeForge
Next → SJF Scheduling — Shortest Job First