Chapter 4 MMW Part 3

Download as pdf or txt
Download as pdf or txt
You are on page 1of 64

MATHEMATICS IN

THE MODERN WORLD


Prepared by: RONNEL P. DAVIS
LEARNING OBJECTIVES:

i. Identify measures of relative position;


ii. Solve measures of relative position of grouped
and ungrouped data;
Chapter 4
DATA MANAGEMENT
1.2. Measures of Relative
Position (Grouped and
Ungrouped Data)
MEASURES OF RELATIVE POSITION:

❑ Statistical tools used to understand the position of a data


point in relation to the rest of the data in a dataset.

❑ provide insights into how one observation compares to


others by identifying its location within the distribution.

❑ These measures are essential for comparing different


datasets or individual values within the same dataset.
Percentile
(Grouped and Ungrouped Data)
PERCENTILE

❑ A percentile indicates the percentage of data that falls below


a specific value.
❑ Values of the variable that divide a set of ranked data into
100 equal subsets; each set of data has 99 percentiles.
Percentile
(Ungrouped Data)
Formula:
𝒌(𝑵 + 𝟏)
𝑷𝒌 =
𝟏𝟎𝟎
where: P is the percentile rank of a data
point;
𝒌 is the number of data points
less than the given value;
N is the total number of data
points in the dataset.
Problem: Finding the 70th Percentile in a Data Set

Suppose we have the following data set representing scores:


56, 62, 67, 71, 73, 75, 79, 83, 88, 90, 95, 98

Step 1: Arrange the data in ascending order.


The data is already sorted:

56, 62, 67, 71, 73, 75, 79, 83, 88, 90, 95, 98
Step 2: Use the formula for the percentile position and
substitute the values.
70(12 + 1) (70)(13)
𝑃70 = = = 9.1
100 100

Therefore, the 70th percentile rank is 9.1


Step 3: Interpret the result.

Since the 70th percentile is at the 9.1th position, it falls


between the 9th and 10th data points in the sorted list. The
9th and 10th data points are 88 and 90.

70th Percentile = 88+0.1×(90−88) = 88+0.1×2 = 88+0.2=


88.2

So, the 70th percentile is 88.2.


The 70th percentile means that 70% of the
scores are below 88.2, and 30% of the scores are
above 88.2 in the data set.
Percentile
(Grouped Data)
For discrete frequency distribution, the formula for k-th
percentile is:
𝒌𝑵 𝒕𝒉
𝑷𝒌 = such that k = 1, 2, …, 99
𝟏𝟎𝟎

Where N is the total number of observations or frequency.


Finding the kth percentile of grouped data:
Formula:
𝒌𝑵
−𝑭
𝑷𝒌 = 𝑳 + 𝟏𝟎𝟎 𝐡 𝒔𝒖𝒄𝒉 𝒕𝒉𝒂𝒕 𝒌 = 𝟏, 𝟐, 𝟑
𝒇

For example: Find the kth percentile of the grouped data in the
next slide.
The following table gives a frequency distribution of weight (in
pounds) of 57 children at a day care center.
Let X denote the weight of children at a day care center.

Here the classes are inclusive. To make them exclusive type


subtract 0.5 from the lower limit and add 0.5 to the upper
limit of each class.
𝒌𝑵 𝒕𝒉
𝑷𝒌 = value
𝟏𝟎𝟎
𝟑𝟎(𝟓𝟕) 𝒕𝒉
𝑷𝟑𝟎 = value
𝟏𝟎𝟎
𝒕𝒉
𝑷𝟑𝟎 = 𝟏𝟕. 𝟏 value

The cumulative frequency just greater than or equal to 17.1 is


24. The corresponding class 19.5 – 30.5 is the 30th
percentile class.
Thus;

L = 19.5 Lower boundary;


N = 57 Total frequency;
F=5 Cumulative class before the quartile class;
f = 19 Frequency of the 30th percentile class;
h = 10 Width of the class interval.
Substitute the values in the formula:
𝒌𝑵
−𝑭
𝑷𝒌 = 𝑳 + 𝟏𝟎𝟎 ×𝒉
𝒇
𝟏𝟕. 𝟏 − 𝟓
𝑷𝟑𝟎 = 𝟏𝟗. 𝟓 + × 𝟏𝟎
𝟏𝟗
𝟏𝟐. 𝟏
𝑷𝟑𝟎 = 𝟏𝟗. 𝟓 + × 𝟏𝟎
𝟏𝟗
𝑷𝟑𝟎 = 𝟏𝟗. 𝟓 + 𝟔. 𝟑𝟔𝟖𝟒
𝑷𝟑𝟎 = 𝟐𝟓. 𝟖𝟔𝟖
Quartile
(Ungrouped Data)
QUARTILES

❑ Quartiles divide a dataset into four equal parts,


with each part representing 25% of the dataset.
❑ They are used to understand the spread and
distribution of the data and are particularly useful
for skewed distributions.
QUARTILES

❑ First Quartile (Q1): The 25th percentile (the point


below which 25% of the data lie).
❑ Second Quartile (Q2): The 50th percentile, also
known as the median.
❑ Third Quartile (Q3): The 75th percentile (the point
below which 75% of the data lie).
Note!

1. The first quartile and 25th percentile are the


same, that is 𝑸𝟏 = 𝑷𝟐𝟓 , and 𝑸𝟑 = 𝑷𝟕𝟓 .
2. The median, the second quartile, and the 50th
percentile are the same. Therefore, when asked to
find 𝑸𝟐 𝒂𝒏𝒅 𝑷𝟓𝟎 , use the formula of finding the
median.
Formula:
The quartile rank can be computed similarly to
percentiles. To find the kth quartile, for example, you
use:

𝒌(𝑵+𝟏)
𝑸𝒌 = such that k = 1,2,3
𝟒
Where 𝑁 is the total number of data points.
Let’s solve for the quartiles step by step for the data
set:
3, 7, 8, 10, 15, 16, 20, 25, 28

Step 1: Arrange the data in ascending order.

The data is already sorted:


3, 7, 8, 10, 15, 16, 20, 25, 28.
Step 2: Identify the QuartilesQ2 (Median):

The median is the value that divides the dataset


into two equal parts. Since there are 9 data points,
the middle number (5th value) is the median.

Q2 (Median)=15
Q1 (First Quartile):Q1 is the median of the lower half
of the data (the numbers to the left of the median).

The lower half of the data is: 3, 7, 8, 10

To find Q1, we take the median of this lower half.


Since there are 4 data points, the median will be the
average of the 2nd and 3rd values.
𝟕 + 𝟖 𝟏𝟓
𝑸𝟏 = = = 𝟕. 𝟓
𝟐 𝟐

Therefore, Quartile 1 or Q1= 7.5


Q3 (Third Quartile):Q3 is the median of the upper half
of the data (the numbers to the right of the median).

The upper half of the data is: 16, 20, 25, 28

To find Q3, we take the median of this upper half.


Similar to Q1, we will take the average of the 2nd and
3rd values.
𝟐𝟎 + 𝟐𝟓 𝟒𝟓
𝑸𝟑 = = = 𝟐𝟐. 𝟓
𝟐 𝟐

Therefore, Quartile 3 or Q3= 22.5


Quartile
(Grouped Data)
Finding the kth quartile of grouped data:
Formula:
𝒌𝒏
−𝑭
𝑸𝒌 = 𝑳 + 𝟒 𝐡 𝒔𝒖𝒄𝒉 𝒕𝒉𝒂𝒕 𝒌 = 𝟏, 𝟐, 𝟑
𝒇

For example: Find the 1st quartile of the grouped data


in the next slide.
The following table gives the amount of time (in
minutes) spent on the internet each evening by a
group of 56 students. Compute five number summary
for the following frequency distribution.
Let X denote the amount of time (in minutes) spent on
the internet.

Here the classes are inclusive. To make them exclusive,


subtract 0.5 from the lower limit and add 0.5 to the
upper limit of each class.
The formula for kth quartile is
𝒕𝒉
𝒌(𝑵)
𝑸𝒌 =
𝟒
Where: n is the total number of observations.
𝟏(𝟓𝟔) 𝒕𝒉
𝑸𝟏 = = 𝟏𝟒𝒕𝒉 value
𝟒
The cumulative frequency just greater than or equal to is
15. The corresponding class 12.5 – 15.5 is the 1st quartile
class.
Thus;

L = 12.5 Lower boundary;


N = 56 Total frequency;
F=3 Cumulative class before the quartile class;
f = 12 Frequency of the 1st quartile class;
H=3 Width of the class interval.
Substitute the values in the formula:
𝒌𝑵
−𝑭
𝑸𝒌 = 𝑳 + 𝟒 ×𝒉
𝒇
𝟏𝟒 − 𝟑
𝑸𝟏 = 𝟏𝟐. 𝟓 + ×𝟑
𝟏𝟐
𝟏𝟏
𝑸𝟏 = 𝟏𝟐. 𝟓 + ×𝟑
𝟏𝟐
𝑸𝟏 = 𝟏𝟓. 𝟐𝟓
Decile
(Ungrouped Data)
DECILES

❑ Deciles divide a dataset into 10 equal parts.


❑ Each decile represents 10% of the sorted
data.
❑ These are typically used when more granular
analysis is required compared to quartiles.
DECILES

❑ D1 is the 10th percentile


❑ D5 is the 50th percentile (or median)
❑ D9 is the 90th percentile.
Example:

Let’s calculate a specific decile (for instance, the 4th


decile) for the following data set:

10, 15, 20, 22, 25, 30, 35, 40, 45, 50


DECILES Where:
➢ 𝐷𝑘 is the 𝑘-th decile
Formula: ➢ 𝑘 is the decile number (in
this case, 𝑘=4)
𝑘(𝑁+1) 𝑡ℎ
𝐷𝑘 = ➢ 𝑁 is the number of data
10
points in the dataset.
Step 1: Arrange the data in ascending order.

The data is already sorted:


10, 15, 20, 22, 25, 30, 35, 40, 45, 50
Step 2: Calculate the Position of D4

Substitute 𝑘=4 and 𝑁=10 into the formula:

𝟒(𝟏𝟎 + 𝟏) 𝟒𝟒
𝑫𝟒 = = = 𝟒. 𝟒
𝟏𝟎 𝟏𝟎
Step 3: Interpolate to Find the Exact Value of D4

To find the exact decile value, we use linear


interpolation between the 4th and 5th data points.

The 4th and 5th data points are 22 and 25. The decile
position is 4.4, so we are 0.4 of the way between these
two data points.
𝐷4=22 + 0.4 × (25−22) = 22+0.4×3 = 22+1.2 = 23.2
Decile
(Ungrouped Data)
The following table gives the amount of time (in
minutes) spent on the internet each evening by a group
of 56 students.
Let X denote the amount of time (in minutes) spent on
the internet.

Here the classes are inclusive. To make them exclusive


type subtract 0.5 from the lower limit and add 0.5 to
the upper limit of each class.
a. The maximum time spent on the internet by lower 20 % of
the students is second decile 𝑫𝟐 .
𝒕𝒉
𝒌𝑵
𝑫𝒌 = 𝒔𝒖𝒄𝒉 𝒕𝒉𝒂𝒕 𝒌 = 𝟏, 𝟐, … , 𝟗
𝟏𝟎

where N is the total number of observations or frequency


Solution:
𝒌𝑵 𝒕𝒉
𝑫𝒌 = value
𝟏𝟎
𝟐(𝟓𝟔) 𝒕𝒉
𝑫𝟐 = value
𝟏𝟎
𝒕𝒉
𝑫𝟐 = 𝟏𝟏. 𝟐 value

The cumulative frequency just greater than or equal to


11.2 is 15. The corresponding class 12.5 – 15.5 is the 2nd
decile class.
Thus;

L = 12.5 Lower boundary;


N = 56 Total frequency;
F=3 Cumulative class before the quartile class;
f = 12 Frequency of the 2nd decile class;
H=3 Width of the class interval.
Substitute the values in the formula:
𝒌𝑵
−𝑭
𝑫𝒌 = 𝑳 + 𝟏𝟎 ×𝒉
𝒇
𝟏𝟏. 𝟐 − 𝟑
𝑫𝟐 = 𝟏𝟐. 𝟓 + ×𝟑
𝟏𝟐
𝟖. 𝟑
𝑫𝟐 = 𝟏𝟐. 𝟓 + ×𝟑
𝟏𝟐
𝑫𝟐 = 𝟏𝟒. 𝟓𝟓
Z-scores
Z-SCORES (Standardized Scores)

❑ The z-score is a measure that indicates how many standard


deviations a data point is from the mean of the dataset.
❑ This measure is crucial when you need to compare
observations from different datasets or distributions with
different means and standard deviations.
❑ The z-score helps in normalizing the data.
Z-SCORES (Standardized Scores)

Formula:
𝑿−𝝁
𝒁=
𝝈
Where:
𝑋 is the data point
𝜇 is the mean of the data
𝜎 is the standard deviation.
Z-SCORES (Standardized Scores)

Example:

Suppose a student scored 85 on a test where the mean was


70, and the standard deviation was 10. The z-score would be:

𝟖𝟓 − 𝟕𝟎
𝒁= = 𝟏. 𝟓
𝟏𝟎
THANK YOU FOR
LISTENING

You might also like