Queuing Theory Guide | Simulation Hub

Fundamental Parameters

To understand a queue, we need to know how fast people arrive (λ), how fast they are served (μ), and how many people can do the work (c).

$λ$ (Lambda) Arrival Rate: The average number of customers arriving per unit of time (e.g., 10 per hour).
$μ$ (Mu) Service Rate: The average number of customers a server can handle per unit of time.
$c$ (Servers) Server Count: The number of parallel service channels available in the system.
$ρ$ (Rho) Utilization Factor: The fraction of time a server is busy. For stability in infinite systems, $ρ < 1$ . $ρ = λ c \cdot μ$
$K$ (Capacity) System Capacity: The maximum number of customers allowed in the system (queue + service).

Kendall's Notation

This is a shorthand code used to describe any queue quickly, like a recipe name for a specific type of problem.

Standardized shorthand for describing queuing models in the format: A / B / c / K / m / Z

Symbol	Meaning	Common Values
A	Arrival Distribution	M, D, G
B	Service Distribution	M, D, G
c	Number of Servers	1, 2, 3, ... (Parallel channels)
K	System Capacity	∞ (Infinite), or finite integer (e.g., 10)
m	Source Population	∞ (Infinite), or finite pool sizes

Distribution Basics

Poisson Arrival

Random Arrivals

When arrivals are independent and occur randomly at a constant average rate. The number of arrivals in a time interval follows a Poisson Distribution.

Exponential Service

Markovian Property

The service time distribution is "memoryless," meaning the time remaining to complete a task does not depend on how much time has already elapsed.

Deterministic

Constant Time

Service or Arrivals occur at perfectly fixed intervals. Variance is zero ( $σ 2 = 0$ ). Common in automated machinery.

Queue Discipline & Priority

FIFO

Standard Queuing

First-In-First-Out: Customers are served in the exact order they arrive. This is the most common and "fair" discipline for general service systems.

Non-Preemptive (NP)

Queue Jumping

Priority over line: High-priority customers move to the front of the queue but do not interrupt someone currently being served.

Preemptive (PR)

Immediate Service

Service Interruption: High-priority customers can stop the current service session to be handled immediately. Common in emergency medical triage.

Performance Metrics

Average Queue Length (L_q)

Expected number of customers waiting in the queue.

L q = \sum (n-c) P n

Average System Length (L_s)

Expected number of customers in the entire system.

L s = L q + λ μ

Average Waiting Time (W_q)

Expected time a customer spends waiting in the queue.

W q = L q λ

Average Turnaround Time (W_s)

Total time spent in the system (Wait + Service).

W s = W q + 1 μ

Core Performance Formulas

M/M/1

Single Server Markovian

L q = ρ 2 1 - ρ

Real-World Use Case Bank teller with a single window, small coffee shops, or a single repair technician.

M/M/c

Multi-Server Markovian

P 0 = [ \sum (λ/μ) n n! + (λ/μ) c c!(1-ρ)] -1

Real-World Use Case Call centers with multiple agents, supermarket checkout lines, or hospital emergency rooms.

M/M/1/K

Finite Capacity Single Server

P 0 = 1 - ρ 1 - ρ K+1 P K = ρ K P 0

Real-World Use Case Single repair shop with limited parking space for cars waiting for service.

M/M/c/K

Finite Multi-Server Markovian

P f = P K = (λ/μ) K c!(c K-c) P 0

Real-World Use Case Parking lots with fixed spaces, buffer zones in assembly lines, or server clusters.

M/D/1

Deterministic Service

L q = ρ 2 2(1 - ρ)

Real-World Use Case Automated car wash, robotic assembly station, or periodic scheduled maintenance.

G/G/c

General Multi-Server (Approx)

W q \approx P(L\geqc) cμ(1-ρ) \cdot C a 2 + C s 2 2

Real-World Use Case Manufacturing systems with non-standard arrival and service distributions.

M/G/1

General Service

L q = λ 2 σ 2 + ρ 2 2(1 - ρ)

Real-World Use Case Post offices where different tasks (mailing, passport, etc.) take highly variable times.

Operational Efficiency & Costs

The Stability Rule

For any system with infinite capacity, the Stability Condition must be met: $λ < cμ$ (Arrival rate must be less than total service capacity). If this is violated, the queue grows to infinity!

Wait Cost vs. Service Cost

Management must balance two opposing costs:

Service Cost: Cost of providing service (paying servers, electricity). Increases as c increases.
Waiting Cost: Cost of customer dissatisfaction or lost productivity. Decreases as c increases.

Total Cost = (c × C_server) + (L_s × C_wait)

System Optimization

The "Sweet Spot" is the number of servers where the Total Cost is at its minimum value. This simulator helps you find that exact point by modeling different configurations.

System Failure Analysis (P_f)

Blocking Probability (P_f)

In finite capacity systems (K), arriving customers are blocked (dropped) if the system is full. This is often referred to as "System Failure" in certain engineering contexts.

Definition P_f is the probability that an arriving customer enters a system that already contains K customers.
Effective Rate λ_eff = λ(1 - P_f) is the actual rate at which customers enter and are served.

Operational Laws

Little's Law is a simple rule that says the number of people in a shop depends on how fast they enter and how long they stay.

Little's Law

The long-term average number of customers in a stationary system L is equal to the long-term average effective arrival rate λ multiplied by the average time a customer spends in the system W.

L = λW

Department Simulation

Queuing Theory Essentials

Fundamental Parameters

Kendall's Notation

Distribution Basics

Random Arrivals

Markovian Property

Constant Time

Queue Discipline & Priority

Standard Queuing

Queue Jumping

Immediate Service

Performance Metrics

Average Queue Length (L_q)

Average System Length (L_s)

Average Waiting Time (W_q)

Average Turnaround Time (W_s)

Core Performance Formulas

Single Server Markovian

Multi-Server Markovian

Finite Capacity Single Server

Finite Multi-Server Markovian

Deterministic Service

General Multi-Server (Approx)

General Service

Operational Efficiency & Costs

The Stability Rule

Wait Cost vs. Service Cost

System Optimization

System Failure Analysis (P_f)

Blocking Probability (P_f)

Operational Laws

Little's Law

Fundamental Parameters

Kendall's Notation

Distribution Basics

Random Arrivals

Markovian Property

Constant Time

Queue Discipline & Priority

Standard Queuing

Queue Jumping

Immediate Service

Performance Metrics

Average Queue Length (Lq)

Average System Length (Ls)

Average Waiting Time (Wq)

Average Turnaround Time (Ws)

Core Performance Formulas

Single Server Markovian

Multi-Server Markovian

Finite Capacity Single Server

Finite Multi-Server Markovian

Deterministic Service

General Multi-Server (Approx)

General Service

Operational Efficiency & Costs

The Stability Rule

Wait Cost vs. Service Cost

System Optimization

System Failure Analysis (Pf)

Blocking Probability (Pf)

Operational Laws

Little's Law

Average Queue Length (L_q)

Average System Length (L_s)

Average Waiting Time (W_q)

Average Turnaround Time (W_s)

System Failure Analysis (P_f)

Blocking Probability (P_f)