Skewness
Skewness
Skewness
In a perfect normal distribution, the tails on either side of the curve are exact mirror
images of each other.
When a distribution is skewed to the left, the tail on the curve’s left-hand side is
longer than the tail on the right-hand side, and the mean is less than the mode. This
situation is also called negative skewness.
When a distribution is skewed to the right, the tail on the curve’s right-hand side is
longer than the tail on the left-hand side, and the mean is greater than the mode. This
situation is also called positive skewness.
Image 17
2] Pearson Second Coefficient of Skewness (Median skewness)
Image 18
Interpretations
Sample problem: Use Pearson’s Coefficient #1 and #2 to find the skewness for data
with the following characteristics:
Mean = 50.
Median = 56.
Mode = 60.
Note: Pearson’s first coefficient of skewness uses the mode. Therefore, if frequency of
values is very low then it will not give a stable measure of central tendency. For
example, the mode in both these sets of data is 9:
1, 2, 3, 4, 4, 5, 6, 7, 8, 9.
In the first set of data, the mode only appears twice. So it is not a good idea to use
Pearson’s First Coefficient of Skewness. But in the second set,
1, 2, 3, 4, 4, 4, 4, 4, 4, 4, 4, 5, 6, 7, 8, 9, 10, 12, 12, 13.
The exact interpretation of the measure of Kurtosis used to be disputed but is now
settled. It's about the existence of outliers. Kurtosis is a measure of whether the data
are heavy-tailed (profusion of outliers) or light-tailed (lack of outliers) relative to a
normal distribution.
Kurtosis [Image 19] (Image
courtesy: https://mvpprograms.com/help/mvpstats/distributions/SkewnessKurtosis)
Mesokurtic
Leptokurtic
Platykurtic
Distribution is the distribution that has kurtosis lesser than a Mesokurtic distribution.
Tails of such distributions thinner. If a curve of a distribution is less peaked than a
Mesokurtic curve, it is referred to as a Platykurtic curve.
Right-skewed
Left-skewed
The histogram with right-skewed data shows wait times. Most of the wait times
are relatively short, and only a few wait times are long. The histogram with left-
skewed data shows failure time data. A few items fail immediately, and many
more items fail later.
he output has two columns. The left column names the statistic and the right column gives
the value of the statistic. For example, the mean of this data is 1.26 (since your data set may
be different, you may get a different value.)
The skewness measure is greater than 0 when the distribution is skewed.
The kurtosis measure is 0 for a normal distribution. Positive values imply a leptokurtic
distribution, while negative values imply a platykurtic distribution.
Standard deviation
Standard deviation is the measurement of the average distance between each quantity
and mean. That is, how data is spread out from the mean. A low standard deviation
indicates that the data points tend to be close to the mean of the data set, while a high
standard deviation indicates that the data points are spread out over a wider range of
values.
There are situations when we have to choose between sample or population Standard
Deviation.
When we are asked to find SD of some part of a population, a segment of population;
then we use sample Standard Deviation.
Image 6