Lesson4 Methods of Organizing Data
Lesson4 Methods of Organizing Data
Lesson4 Methods of Organizing Data
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
2
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
3
0 0 1 2 2 2 3 3 4 4
0 0 1 2 2 3 3 3 4 4
0 1 1 2 2 3 3 3 4 4
0 1 1 2 2 3 3 3 4 5
0 1 1 2 2 3 3 3 4 5
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
4
Example of Single Value Grouping
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
5
Dot Plot
A dot plot is a method of summarizing data to
illustrate the major features of the distribution
of the data in a convenient form where each
observation is represented by a dot.
• A horizontal axis shows the range of data values,
then each data value is represented by a dot placed
above the axis.
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
6
Dot Plot
Given: 8, 4, 2, 12, 8, 2, 4, 12, 6, 8, 6, 8,
10, 10, 12, 16
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
7
Definition of Terms Used in a Frequency
Distribution Table
Class interval contains the numbers defining a class.
Class frequency is the number of observations falling under a
class interval.
Class limits are the end numbers of a class interval.
* The lower class limit (LCL) is the lower end of the class
interval and the upper class limit (UCL) is the upper
end of the class interval.
* The number of digits of the class limits should be the
same as the number of digits of the raw data.
Open class interval is a class interval with either no lower class
limit or upper class limit.
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
8
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
9
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
10
Steps in Constructing a Frequency
Distribution Table
1. Determine an adequate number of classes (K).
* The number of classes should not be too many or not
too few.
* Usually, the number of classes is between 5 and 20.
* The class intervals should be non-overlapping.
2. Determine the range (R). Range = Maximum – Minimum
(National 1st District 120,663 (Cagayan Batanes 2,535 (CALABARZON) Batangas 440,603
Eastern
Visayas) Antique 208,169 Visayas) Samar 202,680 Mindanao) Camiguin 41,017
Lanao Del
Capiz 328,635 Leyte 680,536 Norte 424,819
Northern Misamis
Guimaras 37,838 Samar 240,228 Occidental 260,764
Southern Misamis
Iloilo 690,639 Samar 116,738 Oriental 404,002
Negros Western
Occidental 1,312,961 Samar 348,054 Region 11 1,222,367
Davao del
Region 7 2,017,162 Region 9 1,254,884 (Davao Norte 637,298
Zamboanga
(Central Bohol 590,926 (Zamboanga del Norte 433,091 Region) Daval del Sur 412,442
Zamboanga Davao
Visayas) Cebu 973,490 Peninsula) del Sur 821,793 Oriental 172,627
(SOCCSKSAR (Autonomous
GEN) North Cotabato 509,463 (Caraga) Agusan del Norte 259,475 Region Basilan 123,825
Surigao del
South Cotabato 469,874 Norte 232,065 Maguindanao 534,628
1 Districts of NCR cover the following: 1st District – Manila; end District –
Mandaluyong, Marikina, Pasig, Quezon City and San Juan; 3rd District -
Valenzuela, Kaloocan City, Malabon and Navotas; and 4th District – Las Pinas,
Makati, Muntinlupa, Paranaque, Pasay City, Pateros, and Taguig.
2 Zamboanga Sibugay was part of Zamboanga del Sur in 2000. Thus, 2000
estimates of Zamboanga del Sur includes Zamboanga Sibugay
3 Isabela City was part of Basilan in 2000. Thus, 2000 estimates of Basilan still
includes Isabela City.
4 Davao del Norte estimates for 2000 include Compostela Valley.
Source: National Statistical Coordination Board
15
TABLE 4. Sorted Data (Array) of Magnitude of Poor
Population for the 82 provinces of the Philippines: 2000
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
TABLE 5. Frequency Distribution Table on Magnitude of Poor
Population for the 82 Provinces of the Philippines: 2000
TABLE 5a TABLE 5b
CLASS LIMITS CLASS LIMITS
LCL UCL f LCL UCL f
2,500 152,499 24 2,500 202,499 31
152,500 302,499 24 202,500 402,499 26
302,500 452,499 18 402,500 602,499 16
452,500 602,499 7 602,500 802,499 5
602,500 752,499 4 802,500 1,002,499 3
752,500 902,499 3 1,002,500 1,202,499 0
902,500 1,052,499 1 1,202,500 1,402,499 1
1,052,500 1,202,499 0 82
1,202,500 1,352,499 1
82
17
TABLE 5c
CLASS LIMITS
LCL UCL f
2,500 192,499 30
192,500 382,499 26
382,500 572,499 16
572,500 762,499 5
762,500 952,499 3
952,500 1,142,499 1
1,142,500 1,332,499 1
82
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
18
Example: This illustrates the use of appropriate column
labels in a frequency distribution table.
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
20
Relative frequency
* divide the class frequency of a class interval to the number of
observations
* the sum of the relative frequency column is one
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
TABLE 8. Frequency Distribution Table 21
with Relative Frequency and Relative
Frequency Percentage
Relative
Class Limits Relative Frequency
LCL UCL f Frequency Percentage
2,500 - 192,499 30 0.366 36.6
192,500 - 382,499 26 0.317 31.7
382,500 - 572,499 16 0.195 19.5
572,500 - 762,499 5 0.061 6.1
762,500 - 952,499 3 0.037 3.7
952,500 -1,142,499 1 0.012 1.2
1,142,500 -1,332,499 1 0.012 1.2
82 1.000 100.0
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
TABLE 9. Frequency Distribution Table 22
with Less than Cumulative Frequency and Greater than
Cumulative Frequency Distributions
Greater than
Less than Cumulative
Class Limits cumulative Frequency
LCL UCL f Frequency
2,500 - 192,499 30 30 82
192,500 - 382,499 26 56 52
382,500 - 572,499 16 72 26
572,500 - 762,499 5 77 10
762,500 - 952,499 3 80 5
952,500 -1,142,499 1 81 2
1,142,500 -1,332,499 1 82 1
82
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
23
Graphical Representation of the
Frequency Distribution
Frequency Histogram
- use the class frequency on the vertical axis and
the class boundaries on the horizontal axis
Frequency Polygon
- use the class frequency on the vertical axis and
the class mark on the horizontal axis
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
24
Frequency Histogram
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
25
35
30
Number of Provinces
25
20
15
10
5
0
0- 2 2- 4 4- 6 6- 8 8 - 10 10-11 11-13
Magnitude of Poor in Hundred Thousands
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
26
Frequency Polygon
• graphical representation of the frequency distribution table that
shows the shape of the data set
• place the frequencies on the vertical axis and the class marks
on the horizontal axis
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
27
Illustration of a Frequency Polygon
30
25
20
15
10
5
0
0.975 2.875 4.775 6.675 8.575 10.47512.375
magnitude of poor people in hundred thousands
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
28
Use of Microsoft Excel
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
29
Creation of a Frequency Distribution Table
1. Enter the data set in one column or one row.
2. List down in a column all upper class limits of the variable.
3. Click the right cell beside the first cell of the column
4. To tally the frequencies, click the function wizard (fx), choose
statistical function, click frequency, and highlight the data to
input cells for the data array and highlight the column of upper
class limits for the bins array.
5. Click OK. Highlight the whole column beside column of all
possible values. Place the cursor at the formula bar. Press
Ctrl-Shift-Enter.
6. Label the column as the name of the variable, label the column
to its right frequency, and provide the sum for the frequency
column by highlighting the blank cell after the last value of the
frequency column and clicking the button in the menu bar.
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
30
Constructing a Frequency Histogram
1. Click the chart wizard. Choose the bar graph, highlight the
frequency column including the label but excluding the sum
seeing to it that you choose the option series.
2. Click series, and for category of X series, highlight the column
with the class boundaries excluding the label.
3. Click next, and improve the graph by typing the titles and
labels for the axes. Remove the legend.
4. Click next. You have a choice where to put your graph – in the
same sheet as the data or in a different sheet. Choose your
preference and click finish.
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
31
Constructing a Frequency Histogram Using
Data Analysis
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011
32
3. Category (x) axis: class marks column with the blank cells at
both ends
Statistical Research and Training Center Training Course on Basic Statistical Analysis Using MS Excel 2007
March 28 – April 1, 2011