Chapter 3
Chapter 3
Chapter 3
3. Introduction
In the previous chapters, you have learnt how to construct a frequency polygon for a given
frequency distribution. It seemed that there was no way of telling in advance how the polygon
would look like and how the mean and the standard deviation would be. As a result, it may be
necessary to further study the behavior of the frequency polygon so as to study the general
behavior of the distribution in general and make some conclusions, which are useful for
decision-making.
This section focuses on the definitions of random variable and probability distribution. Then, you
will deal with the two most common discrete probability distributions and finally with one types
of continuous probability distribution.
3.1.Random Variable
In block 6, we defined the concept of ‘experiment’ and its associated outcome. A random
variable provides a means of assigning numerical values to experimental outcomes. The
definition of a random variable is as follows:
Definition:
Notation: Random variables are usually denoted by capital letters like X, Y, Z, etc.
Let the random variable Y denotes the outcome ‘A number greater than 2 occurs’. Then
the random variable can assume the values 3, 4, 5 or 6.
Examples 3: Consider the experiment of rolling two fair dice once simultaneously.
If the random variable T indicates the outcome `the sum of the numbers on the two dice
is greater than 10,’ then T can take the pairs (5, 6), (6, 5) or (6, 6) since in each of these
cases the sum of the numbers is greater than 10.
As stated above, a random variable provides a means of associating a numerical value with each
possible experimental outcome.
Depending upon the numerical values it can assume, a random variable can be classified into two
major divisions.
A) Discrete Random Variable: is a random variable that may assume either a finite number
of values or an infinite sequence (e.g. 1, 2, 3…) of values. In general, a discrete random
variable takes whole number values, which can be counted or enumerated.
Example: The number of students who are enrolled for a diploma program in Unity
University College, the number of defective batteries observed in assessing its quality,
the number of customers who visit a shop during one day of operation are all examples of
discrete random variables.
B) Continuous Random Variable: is a random variable which may take on all values in a
certain interval or collection of intervals. A Continuous random variable, as the name
implies, assumes all possible values between any two values.
Example: Weight, time, temperature, etc are example of continuous random variable.
The probability distribution for a random variable describes how the probabilities are distributed
over the values of the random variable. For a discrete random variable X, the probability
function is denoted by P(X). The probability function provides the probability for each value of
the random variable.
A probability distribution may in general be defined as follows:
Definition:
Example 1: Construct a probability distribution for the number of heads in tossing two fair coins
simultaneously once.
Probability, P(X) ¼ ½ ¼
The probability distribution shows that the probability that the random variable can assume the
value 0 is ¼, the value 1 is ½ and the value 2 is ¼. Note that the sum of these probabilities is 1.
Example 2: The number of mistakes a typist made in ten days of assessment is shown in the
following table.
No of mistakes 2 3 4 5
No of days 1 4 3 2
Solution:
a) In constructing the probability distribution, our random variable assumes a value for the
number of mistakes the typist committed. Let the variable X denotes this random
variable. Then, we assign a probability for each of the number of days with respect to the
total number of days.
The probability distribution is shown below:
No of mistakes, X 2 3 4 5
Y axis
0.4
P(X)
Probability 0.3
0.2
0.1
1 2 3 4 x axis
Number of mistakes
3.2.Types Of Probability Distribution
There are three types of discrete probability distribution; the binomial, Poisson and hyper
geometric probability distributions. But in this section, we will discuss the two of the (binomial
and Poisson). In the construction of the probability distribution for a discrete random variable,
the following two conditions must be satisfied.
Properties (Required Conditions) for a Discrete Probability Distribution
The sum of the probabilities of all the events in the sample space must equal 1.
i.e. P(x) =1
The probability of each event in the sample space must be between or equal to 0 and 1.
i.e. 0 P(x) 1
For instance, in the above example, these two conditions are satisfied since
P(X) = P(2) + P(3) + P(4) + P(5) = 0.1+ 0.4 + 0.3 + 0.2 = 1 and
each of these probabilities is greater than or equal to 0 and less than or equal to 1.
For some discrete random variables, the probability distribution can be given as a formula that
yields (x) for every possible value of x.
Solution:
The outcome x assumes the values 0, 2 and 3
Out come, x 0 2 3
Probability, (x) 0/5 2/5 3/5
The expected value, or mean, of a random variable is a measure of the central location for the
random variable. It is denoted by E(x) or . The mathematical expression for the expected value
of a discrete random variable x is as follows:
Expected value of a discrete random variable:
E(x)= = x1 P(x1) + x2 . P(x2) +………..+ xn P(Xn) Or,
n
E (x) = xi . P(xi)
i 1
where x1, x2,-------,xn are the outcomes and P(x1), P(x2)…P(xn) are the
corresponding probabilities.
The above formula shows that in order to compute the expected value of a discrete random
variable, we must multiply each value of the random variable by the corresponding probability
P(x) and then add the resulting products.
ii. Variance
While the expected value provides the mean value for the random variable, we often need a
measure of dispersion, or variability, for the random variable just as we need variance in block 5
to summarize the dispersion in a data set. The mathematical expression for the variance of a
discrete random variable is as follows:
2
and the standard deviation is ó ó
Example 1: If three fair coins are tossed, find the expected number of heads that will occur and
obtain the variance
Solution:
Begin by constructing the probability distribution for the number of heads in tossing the three
coins.
The probability distribution is constructed below:
No of heads, x 0 1 2 3
Probability, P(x) 1/8 3/8 3/8 1/8
Then,
4
E(x)= i 1
xi.P(xi) = xi P(x1) + x2 . P(x2) + x3 . P(x3) + x4 . P(x4)
4
2 =
i 1
[(xi-)2·P(xi)]
= (x1 - )2 · P(x1) + (x2 - )2 · P(x2) + (x3- )2 · P(x3) + (x4- )2 ·P(x4)
= (0 - 1.5)2 · 1/8 + (1-1.5)2 · 3/8 + (2 - 1.5)2 · 3/8 + (3 - 1.5)2 · 1/8
2 = 0.5
Example 2: One thousand tickets are sold at $1 each for a color television valued at $350. What
is the expected value if a person purchases one ticket?
Solution:
Hence,
E(x) = $349 · 1/1000 + (-$1) · 999/1000 = -$0.65
Or,
E(x) = overall gain - $1 = $350 · 1/1000 - $1 = $0.65
i.e. The average loss is $0.65 for each of the 1000 ticket holders.
A. The Binomial Probability Distribution
The Binomial Probability Distribution is a discrete probability distribution that has many
applications. It is associated with a multi-step experiment that we call the Binomial experiment,
which is a probability experiment satisfying the following four requirements.
Definition:
A probability distribution showing the outcomes of a Binomial experiment along with the
corresponding probabilities is termed as a Binomial Probability Distribution.
P x
n!
. p x .qn x
n x ! x!
Note: q = 1 - p and 0 x n
Example 1: Consider the experiment of tossing a coin three times. Show that it is a binomial
experiment and find the probability of getting exactly two heads.
Solution:
Now, to find the probability of getting two heads, let p denotes the probability of getting a head
on a single toss.
P x
n!
. p x . q n x
n x ! x!
2
3! 1 1 32 3! 1 1
P(2) . . . .
3 2!2! 2 2 1!2! 4 2
3
= = 0.375
8
Example 2: A new drug is effective 60% of the time. What is the probability that in a random
sample of 4 patients, it will be effective on two of them?
Solution:
This is a Binomial experiment as the points of the experiment are satisfied. Define ‘effective’ as
‘success’ and ‘non effective’ as ‘failure’. Then,
p = 0.6, q = 1 - 0.6 = 0.4, n = 4, x=2
Required p (2) = ?
4 2 !2!
Hence, the drug will be effective on two of a random sample of 4 patients with a probability of
0.3456 (or 34.56%).
A discrete probability distribution that is useful when n is large and p is small and when the
independent variables occur over a period of time is called the Poisson probability distribution.
i) The probability of an occurrence is the same for any two intervals of equal length.
ii) The occurrence or non-occurrence in any interval is independent of the occurrence or
non-occurrence in any other interval.
The Poisson probability function
e . x
P x ;
x!
Where P(x, λ) is the probability of x occurrences in an interval of time, volume, area
etc for a variable, λ denotes the mean number of occurrences and e 2.7183
Example1: Past police records indicate a mean of five accidents per month while investigating
the safety of a dangerous intersection. The number of accidents is distributed according to the
probability in any month of
a) Exactly 3 accidents.
b) Fewer than 2 accidents.
P3
5 . 2.7183
3 5
125 0.00674
3! 6
= 0.1404
b) Fewer than 2 accidents comprise 0 and 1 accident during any month.
P0 P1
5 . 2.7183
0 5
5 2.7183
1 5
0! 1!
0.0674 + 0.3370
0.4044
Remark:- Although the above probability was determined by evaluating the probability
function, it is often easier to refer to the table for the Poisson probability distribution. These
table provides probabilities for specific values of x and . We have included the table at the
end of this block.
For convenience, in example 1a, = 5 and x = 3. In the first column of the table choose
x = 3 and correspond it with =5, the intersection of these two numbers gives you the
required probability, which is 0.1404.
e x 2.7183 . 0.4
0.4 3
P x , 0.00715
x! 3!
Thus, there is less than a 1% probability that a give page contains less than 3 errors.
So far, we have been concerned with discrete probability distributions. In this section, we shall
turn to cases in which the variable can take on any value within a given range and in which the
probability distribution is continuous.
A common continuous probability distribution is the normal probability distribution. Several
mathematicians were instrumental in its development; among them is the eighteen-century
mathematician and astronomer Karl Gauss. In honor of his work, the normal probability
distribution is often called the Gaussian distribution.
There are two basic reasons why the normal distribution occupies such a prominent place in
statistics. First, it has some properties that make it applicable to a great many situations in which
it is necessary to make inferences by taking samples. Second, the normal distribution comes
close to fitting the actual observed frequency distributions of many phenomena, including human
characteristics (weights, heights and IQS)
Many contentious variables such as height and weight have distributions that are bell-shaped and
are called approximately normally distributed variables, deriving the most important probability
distribution used to describe a continuous random variable called the normal probability
distribution.
The normal probability distribution is a continuous, symmetric, bell-shaped
distribution of a variable.
The form or shape of the normal probability distribution is shown below.
The shape and position of the normal distribution curve depends on two parameters, the mean
and the standard deviation. Each normally distributed variable will have its own normal
distribution curve.
Properties of the normal probability distribution
34.13% 34.13%
13.59% 13.59%
2.28% 2.28%
About 68%
About 95%
About 99.7%
The mathematical equation of the normal probability distribution is defined by the probability
density function.
x 2
f x
1 2 2
e
2
Where = mean
3.14159
e 2.7183
= Standard deviation
The standard normal probability distribution
A random variable that has a normal distribution with a mean of 0 and a standard deviation of 1
is said to have a standard normal probability distribution. Recall that the standard score (z-
score) of a value is the number of standard deviations that value is from the mean. All normally
distributed variables can be transformed into the standard normal distributed variable by using
the formula for the standard score:
z= value – mean
Standard deviation
X
Or, z=
As with other continuous random variables, probability calculations with any normal probability
distribution are made by computing areas under the graph of the probability density function.
Thus, to find the probability that a normal random variable lies within any specific interval, we
must, compute the area under the normal curve over that interval.
For the standard normal probability distribution, areas under the normal curve have been
computed and are available in tables that can be used in computing probabilities. The normal
probability distribution table is available at the end of this block.
For the solution of problems using the normal distribution, the following steps are used.
1. Draw a picture
2. Transform the given value to z-value
3. Shade the area desired
4. Read the area from the standard normal distribution table.
Example 1: Find the area under the normal curve between z=0 and z=2.34
Solution:
The standard normal curve
Representation is shown: From
the table the intersection 0 2.34
of z = 2.3 with 0.04 gives 0.4904 or
49.04% which is the required area.
Example 2 : Find the area under the normal distribution curve between z = -1.93 and z = 2.35
Area = 0.4732 + 0.4906 = 0.9638 or 96.38%. Note that it is equivalent to say that the
probability of the z-value lying between z = -1.93 and z = 2.35 is 96.38%. This can also be
written as:
P(-1.93 < z < 2.35) = 0.9638
Example 3: Find the probability that the z-value of a normally distributed variable lies to the
left of 1.65
Solution
The probability that the z-value
lies to the left of 1.65 is equivalent to
finding the area under the standard
normal curve, which is to the left of 1.65
Hence, total area = area to the left of 0 0 1.65
plus area between 0 and 1.65 = P(z < 1.65)
= 0.5000 + 0.4505 = 0.9505 or 95.05%
Which is required probability.
The area under the normal curve is used to solve practical application problems such as finding
probabilities or percentages of values. In order to solve such problems you need only transform
the values of the variable into the z values and read the standard normal distribution table.
Example 1: The scores for an IQ test are normally distributed with a mean of 100 and a
standard deviation of 15. Find the percentage of IQ scores that will fall below 112.
Solution
Step 1: Draw a figure and represent the area
Step 2: Find the z-value
Corresponding to an IQ
Score 112.
Z = x - = 112 – 100 = 0.8 100 112
115 0 0.8
Step3: From the table,
P(z < 0.8) = P(z < 0) + P(0 < z < 0.8) = 0.5000 + 0.2881 = 0.7881
Hence, 78.81% of the IQ scores fall below 112.
Example2: The monthly salaries of 2000 workers are normally distributed with a mean of birr
550 and of workers whose monthly salaries are
a) Between birr 600 and 700
b) Less than birr 700.
700 550
Z 1.875 550 600 700
80
0 0.625 1.875
Example 3 A college desires to accept only the top 10% of all graduating seniors on the basis
of the results o a national placement test. The test has a mean of 500 and a standard deviation of
100. Find the cut-off score for the exam.
Solution:
The area is shown.
We solve the problem back ward.
We need to determine the point on 500 x
the axis that cuts the upper 10% of the area. 0 z
Let it be denoted by x
From the table, the z – value that corresponds to the area 0.4000 is approximately 1.28.
x 500
Then, 1.28 x 628
100
Hence the score 628 should be used as a cut –off score. Any student scoring below 628 should
not be admitted.
Exercise
A standardized test has a mean of 50 and a standard deviation of 10. The scores are normally
distributed. If the test is administered to 800 students, approximately how many will score
between 48 and 62?