Queuing Theory Essentials
Fundamental concepts, definitions, and mathematical models for system optimization.
Fundamental Parameters
- λ (Lambda) Arrival Rate: The average number of customers arriving per unit of time (e.g., 10 per hour).
- μ (Mu) Service Rate: The average number of customers a server can handle per unit of time.
- c (Servers) Server Count: The number of parallel service channels available in the system.
-
ρ (Rho)
Utilization Factor: The fraction of time a server is busy. For stability in infinite systems, ρ < 1.
- K (Capacity) System Capacity: The maximum number of customers allowed in the system (queue + service).
Kendall's Notation
Standardized shorthand for describing queuing models in the format: A / B / c / K / m / Z
| Symbol | Meaning | Common Values |
|---|---|---|
| A | Arrival Distribution | M, D, G |
| B | Service Distribution | M, D, G |
| c | Number of Servers | 1, 2, 3, ... (Parallel channels) |
| K | System Capacity | ∞ (Infinite), or finite integer (e.g., 10) |
| m | Source Population | ∞ (Infinite), or finite pool sizes |
Distribution Basics
Random Arrivals
When arrivals are independent and occur randomly at a constant average rate. The number of arrivals in a time interval follows a Poisson Distribution.
Markovian Property
The service time distribution is "memoryless," meaning the time remaining to complete a task does not depend on how much time has already elapsed.
Constant Time
Service or Arrivals occur at perfectly fixed intervals. Variance is zero (σ2 = 0). Common in automated machinery.
Queue Discipline & Priority
Standard Queuing
First-In-First-Out: Customers are served in the exact order they arrive. This is the most common and "fair" discipline for general service systems.
Queue Jumping
Priority over line: High-priority customers move to the front of the queue but do not interrupt someone currently being served.
Immediate Service
Service Interruption: High-priority customers can stop the current service session to be handled immediately. Common in emergency medical triage.
Performance Metrics
Average Queue Length (Lq)
Expected number of customers waiting in the queue.
Average System Length (Ls)
Expected number of customers in the entire system.
Average Waiting Time (Wq)
Expected time a customer spends waiting in the queue.
Average Turnaround Time (Ws)
Total time spent in the system (Wait + Service).
Core Performance Formulas
Single Server Markovian
Multi-Server Markovian
Finite Capacity Single Server
Finite Multi-Server Markovian
Deterministic Service
General Multi-Server (Approx)
General Service
Operational Efficiency & Costs
The Stability Rule
For any system with infinite capacity, the Stability Condition must be met: λ < cμ (Arrival rate must be less than total service capacity). If this is violated, the queue grows to infinity!
Wait Cost vs. Service Cost
Management must balance two opposing costs:
- Service Cost: Cost of providing service (paying servers, electricity). Increases as c increases.
- Waiting Cost: Cost of customer dissatisfaction or lost productivity. Decreases as c increases.
Total Cost = (c × Cserver) + (Ls × Cwait)
System Optimization
The "Sweet Spot" is the number of servers where the Total Cost is at its minimum value. This simulator helps you find that exact point by modeling different configurations.
System Failure Analysis (Pf)
Blocking Probability (Pf)
In finite capacity systems (K), arriving customers are blocked (dropped) if the system is full. This is often referred to as "System Failure" in certain engineering contexts.
- Definition Pf is the probability that an arriving customer enters a system that already contains K customers.
- Effective Rate λeff = λ(1 - Pf) is the actual rate at which customers enter and are served.
Operational Laws
Little's Law
The long-term average number of customers in a stationary system L is equal to the long-term average effective arrival rate λ multiplied by the average time a customer spends in the system W.