Chapter 16 - The Normal Distribution
Chapter 16 - The Normal Distribution
Chapter 16 - The Normal Distribution
a.k.a….Normal Curve
• A specific normal curve is characterised by it’s mean(𝝁) and standard
deviation(𝝈).
Normal curve with same mean but different standard deviation values
• A specific normal curve is characterised by it’s mean(𝝁) and standard
deviation(𝝈).
Normal curve with same standard deviation but different mean values
Main features of a normal distribution or curve:
1. It is bell shaped.
2. It is symmetric, hence mean and median are equal.
3. It is asymptotic to the horizontal axis, meaning the curve never
touches the horizontal axis as it moves out-word.
VERTICAL
H O R I Z O N TA L
4. Approximately 68% of a normal distribution lies within 1 standard
deviation of the mean; above 95% lies within 2 standard deviation of
the means and about 99.7% lies within 3 standard deviation of the
mean.
5. The centre and variation will depend on the values of 𝝁 and 𝝈.
6. The total area under any normal curve is 1, therefore as noted in
point 4;
• The area under the curve within 1 standard deviation if the mean is
0.68 approx.
• The area within 2 standard deviations of the mean is 0.95 approx.
• The area within 3 standard deviations of the mean is 0.997 approx.
Example: weight of watermelon (Kg)
10 Kg
8 Kg
6 Kg
2 Kg 4 Kg
Class intervals (Kg) Frequency (f)
0 – under 2 50
2 – under 4 200
4 – under 6 500
6 – under 8 120
8 – under 10 20
σ 𝑓 = 890
8-10 Kg
6-8 Kg
4-6 Kg
2-4 Kg
0-2 Kg
Class intervals (Kg) Frequency (f)
0 – under 0.5 5
0.5 – under 1 5
1 – under 1.5 10
1.5 – under 2 30
2 – under 2.5 25
Intervals decreased 2.5 – under 3 25
3 – under 3.5 50
3.5 – under 4 100
4 – under 4.5 110
4.5 – under 5 120
5 – under 5.5 130
5.5 – under 6 140
6 – under 6.5 …..
Etc…… σ 𝑓 =3000
…so this is a
normal distribution
of the watermelons
on the school
bus…..
The areas under the normal curve
• The proportions of observations that take on certain values are
represented by areas under the distribution curve
• The proportion of observations that take on a value between a and b
is the area under the curve between two vertical lines erected at a
and b
• We could calculate these areas and thus obtain values for the
proportions
• Examples 16.1 (page 453)
Watermelons had a mean(μ) = 4 and a standard deviation(σ) = 1
1 2 3 4 5 6 7
Shade the relevant regions μ = 4 & standard deviation σ = 1
Shade the region that represents the proportion of weight for each
watermelon between 2 and 6 Kgs.
1 2 3 4 5 6 7
Shade the region that represents the proportion of weight for each
watermelon more than 5Kgs.
1 2 3 4 5 6 7
Shade the region that represents the proportion of weight for each
watermelon less than 1Kg.
1 2 3 4 5 6 7
Shade the region that represents the proportion of weight for each
watermelon between 5 and 7 Kgs.
1 2 3 4 5 6 7
Standard Scores (z–scores)
Standard Scores (z–scores)
• It is rather complex to obtain specific measurements for items that
vary in their respective limitations. For instance, the weight of
watermelons and guava vary greatly hence different normal curves
would have to be created for each items (fruits) meaning their
respective mean and standard deviation would have to be calculated.
• This tedious process can be reduced or managed by standardising the
weight to cover all fruits. Therefore all measurements taken must be
converted into standard scores or z-scores.
• The z-score of a measurement is defined as the number of standard
deviations the measurement is away from the mean
• If the measurement is above the mean, the corresponding z-score is
positive; but if the measurement is below the mean, the
corresponding z-score is negative
observed value mean
Standard score z
standard deviation
The larger a positive z-value, the further the given x is above the mean, while
large negative z-values correspond to extreme values below the mean
Formula:
𝑥 −𝜇
∴z=
𝜎
(b) In order to find the area to the left of z = 0, the symmetry of the normal
curve can be used. In this case, the area between z = –2.10 and z = 0 is the
same as the area between z = 0 and z = + 2.10. From Table 6: The area is
0.4821
.00 .01 .02 .03
Solution:
(c) The area between z = 0.60 and z = 1.80 may be found by subtracting the area
between z = 0 and z = 0.60 from the area between z = 0 and z = 1.80.
From Table 6:
The area between z = 0 and z = 1.80 is 0.4641
The area between z = 0 and z = 0.60 is 0.2257
Therefore, the required area is 0.4641 – 0.2257 = 0.2384
Expanded version of the previous slide….just to get the mental image….
• The z-scores express the given problem in ‘standard form’ so that the
standard normal curve can be used
• To convert a raw score of x (from a distribution with mean μ and standard deviation σ) to
a z-score, subtract the mean from x and divide by the standard deviation
• To convert a z-score to a raw score x, multiply the z-score by the standard deviation and
add this product to the mean
• In equation form:
x Ζ
Conversion to raw score
• To find 𝑥:
𝑥−μ
Z=
σ
𝑥−μ
Zxσ=
σ xσ
Re-arrange z σ + μ = 𝑥 − μ +μ
𝑥=zσ+μ
∴ 𝑥 = 𝜇 + 𝑧𝜎
Computation of proportions
A proportion within a given interval
2. Find the area (from Table 6) for each z-value (If the z-value is negative,
ignore the sign)
3. lf the end points of the interval lie on opposite sides of the mean, add the
two areas found in Step 2. If the two end points lie on the same side of
the mean, subtract the smaller area from the larger one
The proportion greater or less than a given value
• We read z-score tables and found areas under the normal curve
• We found the z-score, given the area under the normal curve
• We computed proportions