0% found this document useful (0 votes)
13 views29 pages

DSA-Midterm

The document discusses the importance of data visualization through graphs and charts for effective data-driven decision-making. It outlines various types of frequency distributions, including ungrouped, grouped, relative, and cumulative, and explains how to create frequency tables and graphs such as pie charts, bar charts, and histograms. The learning outcome emphasizes the ability to explain the significance of visualizations and to create them for communicating insights.

Uploaded by

mwashiiie
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views29 pages

DSA-Midterm

The document discusses the importance of data visualization through graphs and charts for effective data-driven decision-making. It outlines various types of frequency distributions, including ungrouped, grouped, relative, and cumulative, and explains how to create frequency tables and graphs such as pie charts, bar charts, and histograms. The learning outcome emphasizes the ability to explain the significance of visualizations and to create them for communicating insights.

Uploaded by

mwashiiie
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 29

Data exploration and

Visualization
- Dave Miracle Cabanilla
Outline
● The Power of Graphs and Charts

● Data-Driven Decision-Making

● Creating Charts and Graphs to visualize data distributions and trends

i. Bar Charts, Line Charts, Heatmaps

ii. Histogram, Box Plots, Scatter Plots


Learning Outcome
● explain the importance of using graphs and charts; and

● create data visualizations to effectively communicate insights.


Frequency Distribution
A frequency distribution describes the number of observations for each possible
value of a variable. Frequency distributions are depicted using graphs and
frequency tables.
What is a frequency distribution?
The frequency of a value is the number of times it occurs in a dataset. A frequency
distribution is the pattern of frequencies of a variable. It’s the number of times each
possible value of a variable occurs in a dataset.
Types of frequency distributions
Ungrouped frequency distributions: The number of observations of each value of
a variable.
● You can use this type of frequency distribution for categorical variables.
Grouped frequency distributions: The number of observations of each class
interval of a variable. Class intervals are ordered groupings of a variable’s values.
● You can use this type of frequency distribution for quantitative variables.
Types of frequency distributions cont.
Relative frequency distributions: The proportion of observations of each value or
class interval of a variable.
● You can use this type of frequency distribution for any type of variable when
you’re more interested in comparing frequencies than the actual number of
observations.
Cumulative frequency distributions: The sum of the frequencies less than or equal
to each value or class interval of a variable.
● You can use this type of frequency distribution for ordinal or quantitative
variables when you want to understand how often observations fall below
certain values.
How to make a frequency table
Frequency distributions are often displayed using frequency tables. A frequency
table is an effective way to summarize or organize a dataset. It’s usually composed
of two columns:

● The values or class intervals


● Their frequencies

The method for making a frequency table differs between the four types of
frequency distributions.
How to make an ungrouped frequency table
1. Create a table with two columns and as many rows as there are values of the variable. Label the first
column using the variable name and label the second column “Frequency.” Enter the values in the first
column.
● For ordinal variables, the values should be ordered from smallest to largest in the table rows.
● For nominal variables, the values can be in any order in the table. You may wish to order them
alphabetically or in some other logical order.

2. Count the frequencies. The frequencies are the number of times each value occurs. Enter the
frequencies in the second column of the table beside their corresponding values.
● Especially if your dataset is large, it may help to count the frequencies by tallying. Add a third
column called “Tally.” As you read the observations, make a tick mark in the appropriate row of
the tally column for each observation. Count the tally marks to determine the frequency.
Example: Making an ungrouped frequency table
How to make a grouped frequency table
How to make a grouped frequency table cont.
Example: Grouped frequency distribution
Example: Grouped frequency distribution cont.
How to make a relative frequency table
How to make a relative frequency table cont.
How to make a cumulative frequency table
How to make a cumulative frequency table cont.
How to graph a frequency distribution
Pie charts, bar charts, and histograms are all ways of graphing frequency
distributions. The best choice depends on the type of variable and what you’re
trying to communicate.
Pie chart
● A pie chart is a graph that shows the relative frequency distribution of a
nominal variable.
● A pie chart is a circle that’s divided into one slice for each value. The size of
the slices shows their relative frequency.
● This type of graph can be a good choice when you want to emphasize that
one variable is especially frequent or infrequent, or you want to present the
overall composition of a variable.
● A disadvantage of pie charts is that it’s difficult to see small differences
between frequencies. As a result, it’s also not a good option if you want to
compare the frequencies of different values.
Pie chart
Bar chart
● A bar chart is a graph that shows the frequency or relative frequency
distribution of a categorical variable (nominal or ordinal).
● The y-axis of the bars shows the frequencies or relative frequencies, and the
x-axis shows the values. Each value is represented by a bar, and the length or
height of the bar shows the frequency of the value.
● A bar chart is a good choice when you want to compare the frequencies of
different values. It’s much easier to compare the heights of bars than the
angles of pie chart slices.
Bar chart
Histogram
● A histogram is a graph that shows the frequency or relative frequency
distribution of a quantitative variable. It looks similar to a bar chart.
● The continuous variable is grouped into interval classes, just like a grouped
frequency table. The y-axis of the bars shows the frequencies or relative
frequencies, and the x-axis shows the interval classes. Each interval class is
represented by a bar, and the height of the bar shows the frequency or relative
frequency of the interval class.
Histogram
Histogram

You might also like