Numerical Desciptive Statistics
Numerical Desciptive Statistics
Numerical Desciptive Statistics
Ritesh Pandey
Francis Galton
1822-1911
Numerical Descriptive Statistics
Measures of Central Tendency Measures of Dispersion Measures of Linear Relationship between variables Measures of shape
Karl Pearson
1857-1936
Ronald Fisher
1890-1962
Table of Contents
Measures of Dispersion
Measures of shape
Table of Contents
Measures of Dispersion
Measures of shape
Pn
i=1 xi
x=
n
Pn
w i xi
x w = Pi=1
n
i=1 wi
Pn
fi x i
x = Pi=1
n
i=1 fi
Pn
fi m i
x = Pi=1
n
i=1 fi
where mi is the midpoint of each interval.
Pn
i=1 di
x =A+
n
where di = (xi A) is the deviation of each value from any A.
n
" #1/n
Y
xg = xi
i=1
n
x h = Pn 1
i=1 xi
= f0 f
and
+ = f0 f+
Quartiles
Deciles
These divide the value range into 10 equal halves. They are
obviously nine in number. The kth decile can be calculated by
k
the usual formula.The kth decile class is given by (n + 1) 10
kn
10
(cf )
Pk = l0 + w0 .
f0
for i = 1,2, . . . , 9.
Percentiles
These divide the value range into 100 equal halves. They are
obviously ninty nine in number. The kth percentile can be
calculated by the usual formula. The kth percentile class is
k
given by (n + 1) 100
kn
100
(cf )
Pk = l0 + w0 .
f0
for i = 1,2, . . . , 99.
Table of Contents
Measures of Dispersion
Measures of shape
Range
Interquartile Range
Interquartile Range = Q3 Q1
Q3 Q1
Semi Interquartile Range =
2
P
|xi x|
MAD =
n
Here, x can be any measure of central tendency.
The Variance
Pn
2 i=1 (xi x)2
s =
n1
Pn
2 i=1 fi (mi x)2
s =
n1
s= s2
s
cv =
x
Table of Contents
Measures of Dispersion
Measures of shape
Pn
i=1 (xi x)(yi y )
sxy = cov (x, y ) =
n1
sxy
rxy = corr (x, y ) =
sx sy
with 1 rxy 1.
Table of Contents
Measures of Dispersion
Measures of shape
The Skewness
Pn
x)3 /(n 1) i=1 (xi
Skewness =
s3
Positive skew: right tail longer; mean > median
Negative skew: left tail longer; mean < median
The Kurtosis
Pn
i=1 (xi x)4 /(n 1)
Kurtosis =
s4
Negative: Lepto- (slender) -kurtic
Zero: Meso -kurtic
Positive: Platy- (broad) -kurtic
Chebysheffs Theorem
Theorem (Chebysheff)
Within k standard deviations around the mean, will be found a
proportion equal to at least 1 k12 of the total number of
observations.
2s will contain at least 75% data points.
3s will contain at least 89% data points.