Chapter 11
Chapter 11
Chapter 11
Served Customers
Queueing System
Queue
C S
Customers CCCCCCC C S Service
C S facility
C S
Served Customers
• The table shows his queueing system in action over a typical morning.
• The time between consecutive arrivals to a queueing system are called the
interarrival times.
• The expected number of arrivals per unit time is referred to as the mean
arrival rate.
• Most queueing models assume that the form of the probability distribution of
interarrival times is an exponential distribution.
Number of
Customers 3
in the
System
0 20 40 60 80 100
Time (in minutes)
0 Mean Time
• For most queueing systems, the servers have no control over when customers
will arrive. Customers generally arrive randomly.
• The only probability distribution with this property of random arrivals is the
exponential distribution.
• The fact that the probability of an arrival in the next minute is completely
uninfluenced by when the last arrival occurred is called the lack-of-memory
property.
• The number of customers in the queue (or queue size) is the number of
customers waiting for service to begin.
• The number of customers in the system is the number in the queue plus the
number currently being served.
• The queue capacity is the maximum number of customers that can be held in
the queue.
• An infinite queue is one in which, for all practical purposes, an unlimited
number of customers can be held there.
• When the capacity is small enough that it needs to be taken into account, then
the queue is called a finite queue.
• The queue discipline refers to the order in which members of the queue are
selected to begin service.
– The most common is first-come, first-served (FCFS).
– Other possibilities include random selection, some priority procedure, or even last-
come, first-served.
• When a customer enters service, the elapsed time from the beginning to the
end of the service is referred to as the service time.
• Basic queueing models assume that the service time has a particular
probability distribution.
• The symbol used for the mean of the service time distribution is
• Exponential Distribution
– The most popular choice.
– Much easier to analyze than any other.
– Although it provides a good fit for interarrival times, this is much less true for
service times.
– Provides a better fit when the service provided is random than if it involves a fixed
set of tasks.
– Standard deviation: = Mean
To identify which probability distribution is being assumed for service times (and
for interarrival times), a queueing model conventionally is labeled as follows:
• Managers who oversee queueing systems are mainly concerned with two
measures of performance:
– How many customers typically are waiting in the queueing system?
– How long do these customers typically have to wait?
• When customers are internal to the organization, the first measure tends to be
more important.
– Having such customers wait causes lost productivity.
• Statistics that are helpful to answer these types of questions are available for
some queueing systems:
– Pn = Steady-state probability of having exactly n customers in the system.
– P(W ≤ t) = Probability the time spent in the system will be no more than t.
– P(Wq ≤ t) = Probability the wait time will be no more than t.
• Current policy: Each tech rep’s territory is assigned enough machines so that
the tech rep will be active repairing machines (or traveling to the site) 75% of
the time.
– A repair call averages 2 hours, so this corresponds to 3 repair calls per day.
– Machines average 50 workdays between repairs, so assign 150 machines per rep.
• Proposed New Service Standard: The average waiting time before a tech rep
begins the trip to the customer site should not exceed two hours.
• The queue: The machines waiting for repair to begin at their sites.
• Service time: The total time the tech rep is tied up with a machine, either
traveling to the machine site or repairing the machine. (Thus, a machine is
viewed as leaving the queue and entering service when the tech rep begins the
trip to the machine site.)
• Assumptions
1. Interarrival times have an exponential distribution with a mean of 1/ .
2. Service times have an exponential distribution with a mean of 1/ .
3. The queueing system has one server.
• The expected number of customers in the system is
L = 1 – = –
• The expected waiting time in the system is
W = (1 / )L = 1 / ( – )
• The expected waiting time in the queue is
Wq = W – 1/ = / [( – )]
• The expected number of customers in the queue is
Lq = Wq = 2 / [( – )] = 2 / (1 – )
B C D E G H
3 Data Results
4 3 (mean arrival rate) L= 3
5 4 (mean service rate) Lq = 2.25
6 s= 1 (# servers)
7 W= 1
8 Pr(W > t) = 0.368 Wq = 0.75
9 when t = 1
10 0.75
11 Prob(W q > t) = 0.276
12 when t = 1 n Pn
13 0 0.25
14 1 0.1875
15 2 0.1406
16 3 0.1055
17 4 0.0791
18 5 0.0593
19 6 0.0445
20 7 0.0334
21 8 0.0250
22 9 0.0188
23 10 0.0141
• The proposed new service standard is that the average waiting time before
service begins be two hours (i.e., Wq ≤ 1/4 day).
• John Phixitt’s suggested approach is to lower the tech rep’s utilization factor
sufficiently to meet the new service requirement.
B C D E G H
3 Data Results
4 2 (mean arrival rate) L= 1
5 4 (mean service rate) Lq = 0.5
6 s= 1 (# servers)
7 W= 0.5
8 Pr(W > t) = 0.135 Wq = 0.25
9 when t = 1
10 0.5
11 Prob(Wq > t) = 0.068
12 when t = 1 n Pn
13 0 0.5
14 1 0.25
15 2 0.1250
16 3 0.0625
17 4 0.0313
18 5 0.0156
19 6 0.0078
20 7 0.0039
21 8 0.0020
22 9 0.0010
23 10 0.0005
Erlang, with shape parameter k 1/ (1/k) (1/) M/Ek/1 (k+1)/(2k) [2 / (1 – )]
• The proposed new service standard is that the average waiting time before
service begins be two hours (i.e., Wq ≤ 1/4 day).
• The Vice President for Engineering has suggested providing tech reps with
new state-of-the-art equipment that would reduce the time required for the
longer repairs.
• After gathering more information, they estimate the new equipment would
have the following effect on the service-time distribution:
– Decrease the mean from 1/4 day to 1/5 day.
– Decrease the standard deviation from 1/4 day to 1/10 day.
B C D E F G
3 Data Results
4 3 (mean arrival rate) L= 1.163
5 0.2 (expected service time) Lq = 0.563
6 0.1 (standard deviation)
7 s= 1 (# servers) W= 0.388
8 Wq = 0.188
9
10 0.6
11
12 P0 = 0.4
• With multiple servers, the formula for the utilization factor becomes
= / s
but still represents that average fraction of time that individual servers are
busy.
10 s = 25
s = 20
s = 15
s = 10
s=7
s=5
s=4
s=3
0.5 s=2
0.2 s=1
0.1
0 0.1 0.3 0.5 0.7 0.9 1.0
Utilization factor
s
• The proposed new service standard is that the average waiting time before
service begins be two hours (i.e., Wq ≤ 1/4 day).
• The Chief Financial Officer has suggested combining the current one-person
tech rep territories into larger territories that would be served jointly by
multiple tech reps.
• The Chief Financial Officer has suggested combining the current one-person
tech rep territories into larger territories that would be served jointly by
multiple tech reps.
Number of Number of
Tech Reps Machines s Wq
s = 25
10
s = 20
s = 15
s = 10
s=7
s=5
1.0 s=4
s=3
s=2
s=1
0.1
0 0.1 0.3 0.5 0.7 0.9 1.0
Utilization factor
s
• General Assumptions:
– There are two or more categories of customers. Each category is assigned to a
priority class. Customers in priority class 1 are given priority over customers in
priority class 2. Priority class 2 has priority over priority class 3, etc.
– After deferring to higher priority customers, the customers within each priority
class are served on a first-come-fist-served basis.
• Additional Assumptions
1. Preemptive priorities are used as previously described.
2. For priority class i (i = 1, 2, … , n), the interarrival times of the customers in that
class have an exponential distribution with a mean of 1/i.
3. All service times have an exponential distribution with a mean of 1/, regardless of
the priority class involved.
4. The queueing system has a single server.
= (1 + 2 + … + n) /
• Additional Assumptions
1. Nonpreemptive priorities are used as previously described.
2. For priority class i (i = 1, 2, … , n), the interarrival times of the customers in that
class have an exponential distribution with a mean of 1/i.
3. All service times have an exponential distribution with a mean of 1/, regardless of
the priority class involved.
4. The queueing system can have any number of servers.
= (1 + 2 + … + n) / s
• The proposed new service standard is that the average waiting time before
service begins be two hours (i.e., Wq ≤ 1/4 day).
• The mean arrival rates for the two classes of copiers are
– 1 = 1 customer (printer-copier) per workday (now)
– 2 = 2 customers (other machines) per workday (now)
B C D E F G
3 Data
4 n= 2 (# of priority classes)
5 4 (mean service rate)
6 s= 1 (# servers)
7
8 Results
9 i L Lq W Wq
10 Priority Class 1 1 0.5 0.25 0.5 0.25
11 Priority Class 2 2 2.5 2 1.25 1
12 Priority Class 3 1 #DIV/0! #DIV/0! #DIV/0! #DIV/0!
13 Priority Class 4 1 #DIV/0! #DIV/0! #DIV/0! #DIV/0!
14 Priority Class 5 1 1.75 1.5 1.75 1.5
15
16 3
17 0.75
B C D E F G
3 Data
4 n= 2 (# of priority classes)
5 4 (mean service rate)
6 s= 1 (# servers)
7
8 Results
9 i L Lq W Wq
10 Priority Class 1 1.5 0.825 0.45 0.55 0.3
11 Priority Class 2 1.5 2.175 1.8 1.45 1.2
12 Priority Class 3 1 #DIV/0! #DIV/0! #DIV/0! #DIV/0!
13 Priority Class 4 1 #DIV/0! #DIV/0! #DIV/0! #DIV/0!
14 Priority Class 5 1 1.75 1.5 1.75 1.5
15
16 3
17 0.75
1 Later 1.5 1.5 4 0.75 0.3 workday (2.4 hrs.) 1.2 workday (9.6 hrs.)
2 Now 2 4 4 0.75 0.107 workday (0.86 hr.) 0.439 workday (3.43 hrs.)
2 Later 3 3 4 0.75 0.129 workday (1.03 hrs.) 0.514 workday (4.11 hrs.)
3 Now 3 6 4 0.75 0.063 workday (0.50 hr.) 0.252 workday (2.02 hrs.)
3 Later 4.5 4.5 4 0.75 0.076 workday (0.61 hr.) 0.303 workday (2.42 hrs.)
Decision: Adopt fourth proposal (except for sparsely populated areas where
second proposal should be adopted).
2. Decreasing the variability of service times (without any change in the mean)
improves the performance of a queueing system substantially.
B C D E G H
3 Data Results
4 0.5 (mean arrival rate) L= 1
5 1 (mean service rate) Lq = 0.5
A B C D E
9 Data Table Demonstrating the Effect of
10 on Lq and L for M/M/1
Increasing
11 100
• When each server costs the same (Cs = cost of server per unit time),
SC = Cs s
• When the waiting cost is proportional to the amount of waiting (Cw = waiting
cost per unit time for each customer),
WC = Cw L
• The Acme Machine Shop has a tool crib for storing tool required by shop
mechanics.
• The estimates of the mean arrival rate and the mean service rate (per server)
are
= 120 customers per hour
= 80 customers per hour
• The total cost to the company of each tool crib clerk is $20/hour, so Cs = $20.
B C D E F G
3 Data Results
4 120 (mean arrival rate) L= 1.736842105
5 80 (mean service rate) Lq = 0.236842105
6 s= 3 (# servers)
7 W= 0.014473684
8 Pr(W > t) = 0.02581732 Wq = 0.001973684
9 when t = 0.05
10 0.5
11 Prob(Wq > t) = 0.00058707
12 when t = 0.05 n Pn
13 0 0.210526316
14 Economic Analysis: 1 0.315789474
15 Cs = $20.00 (cost / server / unit time) 2 0.236842105
16 Cw = $48.00 (waiting cost / unit time) 3 0.118421053
17 4 0.059210526
18 Cost of Service $60.00 5 0.029605263
19 Cost of Waiting $83.37 6 0.014802632
20 Total Cost $143.37 7 0.007401316
H I J K L M N
1 Data Table for Expected Total Cost of Alternatives
2
3 Cost of Cost of Total
4 s r L Service Waiting Cost
5 0.50 1.74 $60.00 $83.37 $143.37
6 1 1.50 #N/A $20.00 #N/A #N/A
7 2 0.75 3.43 $40.00 $164.57 $204.57
8 3 0.50 1.74 $60.00 $83.37 $143.37
9 4 0.38 1.54 $80.00 $74.15 $154.15
10 5 0.30 1.51 $100.00 $72.41 $172.41
$250
Cost of
$200
Service
Cost ($/hour)
$150 Cost of
$100 Waiting
Total Cost
$50
$0
0 1 2 3 4 5
Number of Servers (s)
• Service Facility
– Fast-food restaurants
– Post office
– Grocery store
– Bank
• Disneyland
• Highway traffic
• Manufacturing
• Product orders
• Number of servers
• Queue disciplince
– FCFS?
– Priority system?
• Population size
– Infinite or finite?
• Single Server
...
Customers Service
Center
• Multiple Servers
...
... ...
Customers ...
Customers Service
Service
Centers
Centers
0 1 2 3 4 5 6 7 8 9 10 11 12 13
Customers per time unit
Relative
Frequency (%)
Service Time
• Or any distribution
– Only single-server model is easily solved.
...
..
. Customers
Service
Center
...
...
...
A Taxonomy
— / — / — (and an optional fourth element / —)
Arrival Service Number of Maximum
Distribution Distribution Servers in Queue
where
M = Exponential (Markovian)
D = deterministic (constant)
G = general distribution
• Parameters:
• Performance Measures
ABC Car Wash is an automated car wash. Each customer deposits four quarters in
a coin slot, drives the car into the auto-washer, and waits while the car is
automatically washed. Cars arrive at an average rate of 20 cars per hour (Poisson).
The service time is exactly 2 minutes.
A B C D E F G H
1 Template for the M/G/1 Queueing Model
2
3 Data Results
4 20 (mean arrival rate) L= 1.333
5 0.03333333 (expected service time) Lq = 0.667
6 0 (standard deviation) minutes
7 s= 1 (# servers) W= 0.067 4
8 Wq = 0.033 2
9
10 0.666666667
11
12 P0 = 0.333333333
A grocery store has three registers open. Customers arrive to check out at an
average of 1 per minute (Poisson). The service time averages 2 minutes
(exponential).
A B C D E G H
1 Template for the M/M/s Queueing Model
2
3 Data Results
4 1 (mean arrival rate) L= 2.888888889
5 0.5 (mean service rate) Lq = 0.888888889
6 s= 3 (# servers)
7 W= 2.888888889
8 Pr(W > t) = 0.74131525 Wq = 0.888888889
9 when t = 1
10 0.666666667
11 Prob(Wq > t) = 0.26956918
12 when t = 1 n Pn
13 0 0.111111111
14 1 0.222222222
15 2 0.222222222
16 3 0.148148148
17 4 0.098765432
18 5 0.065843621
19 6 0.043895748
20 7 0.029263832
21 8 0.019509221
22 9 0.013006147
23 10 0.008670765
A call center that handles the tech support for a software manufacturer currently
has 10 telephone lines, with three people fielding the calls. Customers call at an
average rate of 40 per hour (Poisson). A customer can be served in an average of
four minutes (exponential).
A B C D E F G
1 Template for M/M/s Finite Queue Model
2
3 Data Results
4 40 (mean arrival rate) L = 4.5577194
5 15 (mean service rate) Lq = 2.0413889
6 s= 3 (# servers)
7 K= 10 (max customers) W= 0.1208
8 Wq = 0.0540838
9
10 0.8888889
11
12 n Pn
13 0 0.0406825
14 1 0.1084866
15 2 0.1446488
16 3 0.1285767
17 4 0.1142904
18 5 0.1015915
19 6 0.0903035
20 7 0.0802698
21 8 0.071351
22 9 0.0634231
23 10 0.0563761
Consider a small-town hospital emergency room (ER) that has just one doctor on
duty. When patients arrive, they are classified as either critical or non-critical.
When the doctor is finished treating a patient, she takes the next critical patient. If
there are no critical patients, then she takes the next non-critical patient. The ER
doctor spends an average of 10 minutes (exponential) treating each patient before
they are either released or admitted to the hospital. An average of 1 critical patient
and 3 non-critical patients arrive each hour (Poisson).
A B C D E F G H
1 Template for M/M/s Nonpreemptive Priorities Queueing Model
2
3 Data
4 1 n= 2 (# of priority classes)
5 0 6 (mean service rate)
6 0 s= 1 (# servers)
7 0
8 0 Results
9 0 i L Lq W Wq Wq (minutes)
10 0 Priority Class 1 1 0.3 0.133333333 0.3 0.133333333 8
11 0 Priority Class 2 3 1.7 1.2 0.566666667 0.4 24
12 0 Priority Class 3 1 2.166666667 2 2.166666667 2
13 0 Priority Class 4 1 #DIV/0! #DIV/0! #DIV/0! #DIV/0!
14 0 Priority Class 5 1 #DIV/0! #DIV/0! #DIV/0! #DIV/0!
15 0
16 0 4
17 0 0.666666667
Reconsider the same small-town hospital emergency room (ER). Now suppose
they change their policy so that if a critical patient arrives while a non-critical
patient is being treated, the doctor stops treating the non-critical patient, and
immediately starts treating the critical patient. Only when there are no critical
patients to be treated does the doctor start treating non-critical patients.
A B C D E F G H
1 Template for M/M/1 Preemptive Priorities Queueing Model
2
3 Data
4 n= 2 (# of priority classes)
5 6 (mean service rate)
6 s= 1 (# servers)
7
8 Results
9 i L Lq W Wq Wq (minutes)
10 Priority Class 1 1 0.2 0.033333333 0.2 0.033333333 2
11 Priority Class 2 3 1.8 1.3 0.6 0.433333333 26
12 Priority Class 3 1 3 2.833333333 3 2.833333333
13 Priority Class 4 1 #DIV/0! #DIV/0! #DIV/0! #DIV/0!
14 Priority Class 5 1 #DIV/0! #DIV/0! #DIV/0! #DIV/0!
15 4
16 0.666666667
• We can use the results from queueing models to make the following types of
decisions:
Cost
Total Cost
Cost of Service
Capacity
Cost of customers
waiting
Optimum
Service Capacity
• The MIS department of a high tech company handles employee requests for
assistance when computer questions arise. Employees requiring assistance
phone the MIS department with their questions (but may have to wait on hold
if all of the tech support staff are busy).
• The MIS department receives an average of 40 requests for assistance per hour
(Poisson).
Question: What is the optimal size of the MIS tech support staff?
I J K L M N
1 Data Table for Example #1
2
3 s Lq Wq (min) Cost of Service Cost of Waiting Total Cost
4 0.174 0.261 $60.00 $54.35 $114.35
5 3 0.889 1.333 $45.00 $72.22 $117.22
6 4 0.174 0.261 $60.00 $54.35 $114.35
7 5 0.040 0.060 $75.00 $51.00 $126.00
8 6 0.009 0.014 $90.00 $50.23 $140.23
• Customers arrive during the lunch hour at a rate of 98 customers per hour
(Poisson distribution).
Question #1: If management would not like the average customer to wait
longer than five minutes in line, how many registers should they open?
A B C D E G H I
1 Template for the M/M/s Queueing Model
2
3 Data Results
4 98 (mean arrival rate) L= 7.359291808
5 20 (mean service rate) Lq = 2.459291808
6 s= 6 (# servers) minutes
7 W= 0.075094814 4.51
8 Pr(W > t) = 0.34895764 minutes Wq = 0.025094814 1.51
9 when t = 0.08333333 5
10 0.816666667
11 Prob(W q > t) = 0.08826736 minutes
12 when t = 0.08333333 5 n Pn
K L M
1 Data Table for Example #2
2
3 s Wq (min) Pr(Wq > 5 min)
4 1.5057 0.08827
5 5 28.510 0.80443
6 6 1.506 0.08827
7 7 0.430 0.00908
8 8 0.148 0.00087
9 9 0.053 0.00008
• They are remodeling the parking area and drive-through lane. They would like
the drive-through lane to hold all of the customers at least 95% of the time.
Question: How many cars must the drive-through lane be able to hold?
A B C D E G H I
1 Template for the M/M/s Queueing Model
2
3 Data Results
4 40 (mean arrival rate) L= 2
5 60 (mean service rate) Lq = 1.333333333
6 s= 1 (# servers)
7 W= 0.05
8 Pr(W > t) = 2.0612E-09 Wq = 0.033333333
9 when t = 1
10 0.666666667
11 Prob(W q > t) = 1.3741E-09
12 when t = 1 n Pn cumulative
13 0 0.333333333 0.3333
14 1 0.222222222 0.5556
15 2 0.148148148 0.7037
16 3 0.098765432 0.8025
17 4 0.065843621 0.8683
18 5 0.043895748 0.9122
19 6 0.029263832 0.9415
20 7 0.019509221 0.9610
21 8 0.013006147 0.9740
22 9 0.008670765 0.9827
23 10 0.00578051 0.9884
• Current System: For most of the day (all but their lunch hour), they have
three registers open. Each cashier takes the customer’s order, collects the
money, and then gets the burgers and pours the drinks. This takes an average
of 3 minutes per customer (exponential distribution).
• Proposed System: They are considering having just one cash register. While
one person takes the order and collects the money, another will pour the
drinks, and another will get the burgers (like Wendys). The three together
think they can serve a customer in an average of 1 minute.
• The bank handles two kinds of customers: merchant customers and regular
customers. Each arrive at an average rate of 20 customers per hour (for a total
arrival rate of 40 customers per hour).