Lesson2 1450
Lesson2 1450
Lesson2 1450
• the mode 𝑀𝑜
• the median 𝑀𝑑
• the mean 𝑋 (called X-bar)
Mode ( 𝑀𝑜 )
• The mode is the observed value that occurs with the greatest frequency
• It is found by simple inspection of the frequency distribution (it is easy to see
on a frequency polygon as the highest point on the curve)
• Unimodal distribution : the distribution has one mode
• If two scores both occur with the greatest frequency, the distribution is
bimodal
• If more than two scores occur with the greatest frequency, the distribution is
multimodal
• Some data sets do not have a mode because each value occurs only once
(uniform distribution).
• The mode is insensitive by extreme scores (outliers) in a distribution
Median (𝑴𝒅 )
• The median is the number that divides the frequency distribution in
half when all the scores are listed in order (from lowest to highest)
• The median is the score in the distribution that marks the 50th
percentile (C50). That is, 50% of the scores in the distribution fall
above the median and 50% fall below it
• When a distribution has an odd number (5,7,9,etc) of elements, the
median is the middle one
• When a distribution has an even number (6,8,10,etc) of elements,
the median lies halfway between the two middle scores (i.e., it is the
average or mean of the two middle scores)
Median (𝑴𝒅 ) (cont’n)
• For an odd number of scores: n = (n+1)/2 ; n : number of scores
6, 9, 15, 17, 24 : 𝑀𝑑 is 15
In a unimodal symmetrical
distribution (like the
normal distribution), all
three measures of central
tendency are identical
𝑀𝑜 = 𝑀𝑑 = 𝑋
Skewed distributions
• The mode and median are insensitive (or they are robust) to extreme scores (outliers) in
a distribution, while the mean is very sensitive (it is not robust) to extreme scores
• As a result, the mean in a skewed distribution is pulled in the direction of the tail
• The mean is used for numerical data and for symmetric (not skewed,
normal) distributions
• The median is used for ordinal data or for numerical data if the
distribution is skewed
• The mode is used primarily for bimodal distribution
Nominal Ordinal Interval Ratio
Mode + + + +
Median + + +
Mean + +
Location of the mean
s= 𝑠 2
𝑠 2 : the variance
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
Coefficient of variation = × 100
𝑀𝑒𝑎𝑛
𝑠
Coefficient of variation = × 100
𝑋
Coefficient of variation
< 10 % : Low
10-35 % : Medium
> 35 % : High
Example. Compare variability in systolic blood pressure
and shock index for a sample of 200 patients.
Systolic blood pressure Shock index
𝑋 = 138 𝑋 = 0.69
s = 26 s = 0.2
CV =
26
× 100 = 18.8% 0.2
138 CV = × 100 = 29.0%
0.69
In this case, a comparison of the standard deviations makes no sense because shock index and systolic BP are
measured on much different scales. Coefficient of variation (CV) measures a relative variation – variation
relative to the size of the mean.
𝟐 𝟐
Xi Frequency, f 𝑿𝒊 × 𝒇 𝑿𝒊 − 𝑿 𝑿𝒊 − 𝑿 𝑿𝒊 − 𝑿 ×𝒇
24 2 48 -5.9 34.7 69.5
26 4 104 -3.9 15.2 60.7
30 3 90 0.1 0.0 0.0
31 5 155 1.1 1.2 6.1
33 2 66 3.1 9.6 19.3
35 3 105 5.1 26.1 78.2
SUM 19 568 233.79
Mean 29.9
233.79
𝑋 = 29.9 𝑠2 = = 13.0 s = 13 = 3.6
19−1
𝑠 3.6
Coefficient of variation = × 100 = × 100 = 12.0
𝑋 29.9
3. Normal distribution. Empirical rule
Three main characteristics
of normal distribution
𝑋 = 70 beats / min
𝑠 = 10
The normal distribution of a resting 99.7% of students has a resting heart rate
between 40 and 100 beats/min
heart rate in a hypothetical sample
4. Five summary statistics and boxplot
Five summary statistics
Range = 93-68 = 25