Measure of Variation
Measure of Variation
Measure of Variation
The various measures of central tendency gives one single value that represents the
entire data, but that value alone can’t adequately describe a set of observations,
unless all the observations are the same. It is necessary to describe the variability
or dispersion of the observations.
Dispersion
The word dispersion has a technical meaning in statistics. The average measures
the center of the data, and it is one aspect of observation. Another feature of the
observation is how the observations are spread about the center. The observations
may be close to the center or they may be spread away from the center. If the
observations are close to the center (usually the arithmetic mean or median), we
say that dispersion, scatter or variation is small. If the observations are spread away
from the center, we say dispersion is large.
Case:
Suppose we have three groups of students who have obtained the following marks
on a test. The arithmetic means of the three groups are also given below:
In groups A and B the arithmetic means are equal, i.e. ̅̅̅ =̅̅̅̅ =50. But in group A
the observations are concentrated around the center. All students in group A have
almost the same level of performance. We say that there is consistency in the
observations in group A. In group B the mean is 50 but the observations are not
close to the center. One observation is as small as 30 and one observation is as
large as 70. Thus there is greater dispersion in group B. In group C the mean is 60
but the spread of the observations with respect to the center 60 is the same as the
spread of the observations in group B with respect to their own center, which is 50.
Thus in groups B and C the means are different but their dispersion is the same. In
groups A and C the means are different and their dispersions are also different.
Dispersion is an important feature of observation and it is measured with the help
of the measures of dispersion, scatter or variation. The word variability is also used
for the idea of dispersion.
1. The degree to which numerical data tend to spread about an average value is
called the dispersion or variation of the data.
2. Dispersion or variation may be defined as a statistic signifying the extent of the
scatteredness of items around a measure of central tendency.
3. Dispersion or variation is the measurement of the size of the scatter of items in
a series about the average.
Measures of Dispersion
For the study of dispersion, we need some measures which show whether the
dispersion is small or large. There are two types of measure of dispersion, which
are:
1. The Range
2. The Quartile Deviation
3. The Mean Deviation
4. The Standard Deviation and Variance
Range
Range, as the word suggests, represents the difference between the largest and the
smallest value of data. This helps us determine the range over which the data is
spread.
L = Largest Value
S = Smallest Value
Range = L – S
Example
There are ten students in the class, and they recently gave a test out of 100 marks.
There are two scenarios here.
First: 50, 53, 50, 51, 48, 93, 90, 92, 91, 90
Second: 71, 72, 70, 75, 73, 74, 75, 70, 74, 72
The range in the first scenario is represented by the difference between the largest
value, 93 and the smallest value, 48. The range therefore is,
Range in First set = 93 – 48 = 45
Whereas in the second scenario, the range is represented by the difference between
the highest value, 75 and the smallest value, 70.
Range in the second set = 75 – 70 = 5
The difference in the value of range between the two scenarios enables us to
estimate the range over which the values are spread. The larger the range, the
larger apart the values are spread.
Coefficient of Range
In order to compare the variability of two or more distributions, which are given in
two different units of measurement, we need a relative a relative measure which is
independent of the units of measurement, which is known as coefficient of range.
coefficient of range =
Example
Let us take two sets of observations. Set A contains the marks of five students in
mathematics out of 25 marks and group B contains marks of the same students in
English out of 100 marks.
Ungrouped Data
Example:
The following are the wages of 8 workers in a factory. Find the range and
coefficient of range. Wages are in dollars: 1400, 1450, 1520, 1380, 1485, 1495,
1575, 1440.
Grouped Data
i) Discrete Frequency Distribution
Example:
The following distribution gives the number of houses and the number of persons
per house.
Number of
Persons 1 2 3 4 5 6 7 8 9 10
Number of
Houses 26 113 120 95 60 42 21 14 55 44
Solution:
Here the largest value =L= 10and the smallest value =S=1
Range =L-S=10–1=9
Coefficient of Range =L-S/L+S=10–1/10+1=9/11=0.818
Solution:
Method 1:
Here L= the upper class boundary of the highest class =74.5
and S= the lower class boundary of the lowest class =59.5
Range =L-S=74.5–59.5=15
Coefficient of Range =L-S/L+S =74.5–59.5/74.5+59.5=15/134=0.1119
Method 2:
Here L= the mid value of the highest class =73
and S=the mid value of the lowest class =61
Range =L-S=73–61=12
Coefficient of Range =L-S/L+S =73–61/73+61=12/134=0.0895
Note:
It does not enjoy any prominent place in statistical theory, but it has its application
and utility in quality control methods which are used to maintain the quality of
products produced in factories. The quality of products is to be kept within a
certain range of values.
Range is based on two extreme observations. It gives no weight to the central
values of the data. It is a poor measure of dispersion and does not give a good
picture of the overall spread of the observations with respect to the center of the
observations. Let us consider three groups of data which have the same range:
Represents the difference between upper quartile (Q3) and lower quartile (Q1).
IQR = Q3 - Q1
Example
25, 55, 5, 45, 15, 35 are marks obtained by students. Find IQR.
N= 6
th th
Q1 = Size of ( ) item = Size of ( ) item
= Size of ( ) th item = Size of th
item
st nd st
= Size of item + 0.75[Size of item- Size of item]
= 5 + 0.75[15 – 5]
= 12.5
Q3 = = Size of ( ) th item = Size of ( ) th item
= Size of ( ) th item = Size of th
item
th th th
= Size of item + 0.25[Size of item- Size of item]
= 45 + 0.25[55 –45]
=47.5
Example
Let's calculate IQR for the following discrete data:
Frequency 2 5 1 6 4 2
Q1= 36
Q3 = 105
IQR = Q3 – Q1 = 105 – 36 = 69
Example
Find IQR
Weekly Income Number of Families
2000-4000 20
4000-6000 40
6000-8000 50
8000-10000 32
10000-12000 16
12000-14000 2
Q1= 5000
Q3 = 8625
IQR = Q3 – Q1 = 8625 – 5000 = 3625
Quartile Deviation
Quartile deviation is based on the lower quartile Q1 and the upper quartile Q3. The
difference Q3 –Q1 is called the inter quartile range. The difference Q3 –Q1 divided
by 2 is called semi-inter-quartile range or the quartile deviation. Thus
Q.D= Q3 – Q1 / 2
The quartile deviation is a slightly better measure of absolute dispersion than the
range. It indicates the average amount by which the quartile differs from the
median. This is absolute measure of quartile deviation.
A small quartile deviation implies small variation among the central 50% items of
the distribution.
A high quartile deviation implies high variation among the central 50% items of
the distribution.
Example
The wheat production (in Kg) of 20 acres is given as: 1120, 1240, 1320, 1040,
1080, 1200, 1440, 1360, 1680, 1730, 1785, 1342, 1960, 1880, 1755, 1720, 1600,
1470, 1750, and 1885. Find the quartile deviation and coefficient of quartile
deviation.
Solution:
After arranging the observations in ascending order, we get
1040, 1080, 1120, 1200, 1240, 1320, 1342, 1360, 1440, 1470, 1600, 1680, 1720,
1730, 1750, 1755, 1785, 1880, 1885, 1960.
Q1 =1260
Q3 = 1753. 75
Q.D= Q3 – Q1 / 2
= 1753.75 – 1260/2
= 246.875
Q1= 5000
Q3 = 8625
IQR = Q3 – Q1 = 8625 – 5000 = 3625
QD = Q3 – Q1/2 = 3625/ 2 = 1812.5
Coefficient of Quartile Deviation = Q3–Q1/Q3+Q1 = 3625/ 13625 = 0.2661
Question
The marks obtained by 9 students in a test are
25, 20, 15, 45, 18, 7, 10, 38, 12
Find the value of Q1, Q3, IQR, QD.
Solution
Arrange in increasing order
7, 10, 12, 15, 18, 20, 25, 38, 45
N=9
Q1 = Size of ( ) th item = Size of ( ) th
item
= Size of ( ) th item = Size of th
item
nd rd nd
= Size of item + 0.5[Size of item- Size of item]
= 10 + 0.5[12 – 10] = 10+0.5*2 = 10 + 1 = 11
Q3 = Size of ( ) th item = Size of ( ) th item
= Size of ( ) th item = Size of th
item
th th th
= Size of item + 0.5[Size of item- Size of item]
= 25 + 0.5[38 – 25] = 25+0.5*13 = 25+6.5 = 31.5
F
10-20 12
20-30 19
30-40 5
40-50 10
50-60 9
60-70 6
70-80 6
Solution
F Cf
10-20 12 12
20-30 19 31
30-40 5 36
40-50 10 46
50-60 9 55
60-70 6 61
70-80 6 67
N=67
th
Q1 = Size of ( ) item
th th
= Size of ( ) item = Size of item
th
Q3 = Size of ( ) item
th
= Size of ( ) item
th
=Size of item
Q3 = +( )*h= +( ) * 10 = 54.72
Decile Range
Decile Deviation
Example
Calculate Decile Range, Decile Deviation and Coefficient of Decile Deviation
from the following data
85, 96, 76, 108, 85, 80, 100, 85, 70, 95
Solution
Arranging the data in ascending order
70, 76, 80, 85, 85, 85, 95, 96, 100, 108
N=10
th th
= Size of {9( item= Size of {9( item
th
= Size of {9( item = Size of {9.9 th item
= Size of {9 th item+0.9[Size of {10 th item - Size of {9 th
item]
= 100+0.9[108 – 100] = 100+0.9*8= 100 + 7.2= 107.2
th th
= Size of ( item= Size of ( item
= Size of ( th item = Size of {1.1 th item
= Size of {1 st item +0.1[Size of {2 nd item - Size of {1 st
item]
= 70 + 0.1[76-70] = 70 +0.1*6 = 70 + 0.6 = 70.6
Percentile Range
Percentile Deviation
Solution
Arranging the data in ascending order
70, 76, 80, 85, 85, 85, 95, 96, 100, 108
N=10
th th
= Size of {90( item= Size of {90( item
th
= Size of {90( item = Size of {9.9 th item
= Size of {9 th item+0.9[Size of {10 th item - Size of {9 th item]
= 100+0.9[108 – 100] = 100+0.9*8= 100 + 7.2= 107.2
th th
= Size of {10( item= Size of {10( item
= Size of ( th item = Size of {1.1 th item
= Size of {1 st item +0.1[Size of {2 nd item - Size of {1 st
item]
= 70 + 0.1[76-70] = 70 +0.1*6 = 70 + 0.6 = 70.6
MEAN DEVIATION
It is the Arithmetic Mean of the absolute deviation of all items of the distribution
from a measure of central tendency.
(i) Ungrouped Data
Thus for an ungrouped data of N observations in which the suitable average is ̅ ,
the mean deviation M.D is given by the relation:
∑ ̅
M.D. =
∑ ̅ ̅ ̅ ̅ ̅
=
∑
M.D. =
∑
M.D. =
Example:
Calculate the mean deviation from the (1) arithmetic mean (2) median (3) mode in
respect to the marks obtained by nine students given below and show that the mean
deviation from the median is the minimum.
Solution:
Mean=∑X/n=80/9=8.89
th th th
Median = Size of ( ) item= Size of ( ) item= Size of ( ) item
th
= Size of item= 9
∑ ̅
M.D from mean M.D. = = 21.11/9=2.35
∑
M.D from median M.D. = =21/9=2.33
∑
M.D from mode M.D. = =23/9=2.56
From the above calculations, it is clear that the mean deviation from the median
has the least value.