13 Binomial - Poissons Distribution
13 Binomial - Poissons Distribution
13 Binomial - Poissons Distribution
Introduction
We discussed the basic concepts of probability and described how the rules of addition and multiplication
were used to compute probabilities. Now we expand the study of probability to include the concept of a
probability distribution.
Random Variable
In any experiment of chance, the outcomes occur randomly. A random variable is a value determined by
the outcome of an experiment.
A random variable may have two forms: discrete or continuous. A discrete random variable may
assume only distinct values and is usually the result of counting.
Discrete random variable: A variable that can assume only certain clearly separated
values.
For example, the number of highway deaths in Arkansas on Memorial Day weekend may be 1, 2, 3,…
Another example is the number of students earning a grade of B in your statistics class. In both instances
the number of occurrences result from counting. Note that there can be 12 deaths or 15 B=s but there
cannot be 12.63 deaths or 15.27 B grades.
If we measure something, such as the diameter of a tree, the length of a field, or the time it takes to run
the Boston Marathon, the variable is called a continuous random variable.
Continuous random variable: A variable that can assume one of an infinitely large
number of values within certain limitations.
In brief, if the problem involves counting something, the resulting distribution is usually a discrete
probability distribution. If the distribution is the result of a measurement, then it is usually a continuous
probability distribution.
What is the difference between a random variable and a probability distribution? A probability
distribution lists all the possible outcomes as well as their corresponding probabilities. A random variable
lists only the outcomes. We will examine the continuous random variable and the continuous probability
distribution in the next chapter.
The Mean
The mean µ , or expected value E(x), of a probability distribution is its long-run average. It is computed
by the following formula:
Mean of a Probability Distribution µ = Σ x P ( x ) [5 − 1]
This formula directs you to multiply each outcome (x) by its probability P(x); and then add the products.
1. Subtract the mean (µ ) from each outcome ( x ) and square these differences.
1. Each outcome is classified into one of two mutually exclusive categories. An outcome is
classified as either a “success” or a “failure.” For example, 40 percent of the students at a
particular university are enrolled in the College of Business. For a selected student there are
only two possible outcomesCthe student is enrolled in the College of Business (designated a
success) or he/she is not enrolled in the College of Business (designated a failure).
2. The binomial distribution is the result of counting the number of successes in a fixed sample
size. If we select 5 students, 0, 1, 2, 3, 4, or 5 could be enrolled in the College of Business.
This rules out the possibility of 3.45 of the students being enrolled in the College of
Business. That is, there cannot be fractional counts.
3. The probability of a success remains the same from trial to trial. In the example regarding the
College of Business, the probability of a success remains at 40 percent for all five students
selected.
4. Each sampled item is independent. This means that if the first student selected is enrolled in the
College of Business, it has no effect on whether the second or the fourth one selected will be
in the College of Business.
For example, if a class consisted of 20 students, 12 males and 8 females, what is the probability of
selecting two people to serve on a committee who are both female? If Ms. Smith was selected on the first
trial, she cannot be selected again because she is already on the committee. Thus the outcome of the
second trial depends on the outcome of the first trial. The probability of a female on the first selection is
8/20, and if a female is selected first there are 7 females out of the 19 remaining students. Hence the
probability of selecting two females for the committee is 0.147, found by (8/20)(7/19).
This assumes that the population is finite, that is, the number in the population is known.
This probability may also be calculated using the hypergeometric distribution, which is described by the
formula:
( S C x ) ( N − S Cn − x )
Hypergeometric Distribution P ( x) = [5 − 6]
( N Cn )
Where:
N is the size of the population.
S is the number of successes in the population.
n is the size of the sample or the number of trials.
x is the number of successes in the sample.
C is the symbol for a combination.
P ( 2) =
( 8 C2 ) ( 20−8 C2− 2 )
( 20 C2 )
8! 12! 8 ⋅ 7 1
2!6! 0!12! 2 1 28
= = = = 0.147
20! 20 ⋅ 19 190
2!18! 2
Hence, the probability of selecting two students to serve on a committee and having that committee
consist of two females is 0.147. This is the same probability we computed earlier.
Poisson probability distribution: Has the same four characteristics as the binomial, but
in addition the probability of success (π ) is small, and n, the number of trials, is
relatively large.
µ x e− µ
Poisson Distribution P ( x) = [5 − 7]
x!
Where:
P(x) is the probability for a specified value of x.
x is the number of occurrences (successes).
µ is the arithmetic mean number of occurrences (successes) in a particular interval.
e is the mathematical constant 2.71828. (base of the Naperian logarithmic system)
Note that the mean number of successes, µ, can be determined by nπ , where n is the total number of
trials and π is the probability of success.
As an example where the Poisson distribution is applicable, suppose electric utility bills are based on the
actual reading of the electric meter. In 1 out of 100 cases the meter is incorrectly read (π = 0.01). Suppose
the number of errors that appear in the processing of 500 customer bills approximates the Poisson
distribution (n = 500). In this case the mean number of incorrect bills is 5, found by µ = nπ = 500
(0.01).
Finding the probability of exactly two errors appearing in 500 customer bills is rather tedious. Instead we
merely refer to the Poisson distribution in Poissons table. Locate by µ = (5.0) at the top of a set of
columns. Then find the x of 2 in the left column and read across to the column headed by 5.0. The
probability of exactly 2 billing errors is 0.0842.