MCA_S3_Data Visualisation_U2
MCA_S3_Data Visualisation_U2
Unit-02
Introduction to Charts and Plots
Semester-03
Master of Computer Applications 2
UNIT
Names of Sub-Units
Charts/Plots Used For Visualisation, Types Of Charts/Plots Used For Data Visualisation, Line Chart,
Area Chart, Bar Chart, Scatter Chart, Pie Chart, Surface Chart, Bubble Chart, Doughnut Chart,
Histogram, Box Plot, Hexbin Plots, Violin Plot, Heat Maps, Gantt Charts, Word Clouds (Text
Visualisation), Effectiveness Of Visualisation Across Data Types, Quantitative Data, Qualitative
(Categorical) Data
Overview
This unit begins by discussing the concept of charts or plots. Next, the unit explains the different types
of charts or plots used in data visualisation. Towards the end, the unit covers the effectiveness of
visualisation across data types.
Learning Objectives
3
Learning Outcomes
https://sites.tufts.edu/gis/files/2016/02/Introduction_to_Data_Visualisation.pdf
2.1 INTRODUCTION
A chart is a graphical representation for all the data visualisation, in which “the data is represented by
the indicators or symbols, such as bars in the bar chart, lines in the line chart or slices in a pie chart.
A chart can, thus show all of the tabular numeric data as well as functions or quality structures and
provide varied information.
A data chart is a type of diagram or graph that then organises & represents a set of al numerical or
qualitative data. Charts are maps that are embellished with all the additional info for a specific purpose,
such as a nautical chart or an aeronautical chart and are often distributed across numerous map sheets.
Other domain-specific constructs that are commonly referred to as charts include the chord chart,
which is used in music notation and the record chart, which is used to track album popularity.
Charts are often used to ease the understanding of large quantities of data & the relationships between
parts of the data. Charts can usually be read more quickly than the actual raw data. They are used in a
wide variety of fields and can also be created by hand or by the computer using a charting application.
Certain types of charts are then more useful for presenting a given data set than others.
4
2.3 TYPES OF CHARTS/PLOTS USED FOR DATA VISUALISATION
Data can be presented in various visual forms, which include simple line diagrams, bar graphs, tables,
matrices, etc. Some techniques used for a visual presentation of data are as follows:
Line chart Histogram
Area chart Box plot
Bar chart Hexbin plot
Scatter chart Violin plot
Pie chart Heat maps
Surface chart chart
Bubble chart Word clouds (text visualisation)
Doughnut chart
50 50
40
40
30 30
30
20
20
10
5
2.3.2 Area Chart
In an area chart, areas are used to represent values. It is similar to a line chart in that it displays a series
as a set of points connected by a line. However, the difference is that in an area chart, the area below
the line is filled with the colour of the line. Area charts help to draw attention to the total value across
a given data. For example, data showing the turnover of a business over time can be plotted in an area
chart to focus on the total turnover of the business.
Figure 2 shows the area chart:
Sales
Expenses
800
600
400
200
Year
6
Figure 3 shows the bar chart:
Records
4k
0
<18 18-29 40-49 60-69 70-79
9000
8000
7000
6000
Data
5000
4000
3000
2000
1000
7
The fundamental purpose of a scatter plot chart is to analyse and display relations between two different
numeric variables. A scatter chart can also help you spot additional patterns in your data. This chart
also used to identify the correlation relationship.
Work
8
$40,000
$20,000
$0
70
60
50
40 Series 1
Series 2
30 Series 3
Series 4
20
20 30 40 50
9
Figure 8: Doughnut Chart
2.3.9 Histogram
Histogram chart is used to show data in the form of frequency within a distribution. Each column in
the histogram chart is known as Bin. However, the continuously flowing data can be represented using
Histogram. It makes it easy to analyse the data defined within various data ranges.
Figure 9 shows the histogram chart:
20
Frequency
20
15 20 25 35 40
10
highlighted mean and outliers.
Figure 10 shows the box plot:
800
700
600
400
300
200
20
-2
-2
11
2.3.12 Violin Plot
A violin plot is a cross between a box plot and a statistical plot that shows data peaks. It is used to show
how numerical data is distributed. Unlike a box plot, which simply shows summary data, violin plots show
the intensity of each variable as well. A violin plot uses density curves to represent numeric data
distributions for one or more groups. Figure 12 shows the violin plot:
0.5
0.0
-0.5
Town A
Town B
Town C
Town D
Town E
Town F
Town G
Town H
Town I
Town K
12
Figure 13: Heat Map
Typically, all of the rows belong to one category, while all of the columns belong to another. The
subcategories are separated into different rows and columns, that are all matched together in a matrix.
Task One
Task Two
Task Three
Task Four
Task Five
Task Six
Task Seven
Task Eight
Task Nine
Task Ten
Task Eleven
13
Figure 15: Word Cloud
It is crucial to understand the different categories of data types before choosing the optimal data
visualisation for a given situation. Data are broadly divided into two types, which are as follows:
Quantitative
Qualitative (also known as categorical)
14
The examples of ordinal data are:
Website traffic
Water temperature
Wind speed
People’s perceptions are revealed through qualitative data. This information aids market researchers
in gaining a better understanding of their clients’ preferences, allowing them to tailor their ideas and
plans accordingly.
You can apply the different types of data visualisation techniques to both quantitative and qualitative
data.
A chart is a graphical representation for all the data visualisation, in which “the data is to represented
by the indicators or symbols.
A data chart is a type of diagram or graph that then organises & represents a set of al numerical or
qualitative data.
Data can be presented in various visual forms, which include simple line diagrams, bar graphs,
tables, matrices, etc.
Line charts are used to plot continuous data in the form of lines.
In an area chart, areas are used to represent values. It is similar to a line chart in that it displays a
series as a set of points connected by a line.
15
A bar chart is a visual presentation of category data.
Scatter charts are used to show the relationship between the numeric values in two data series.
A pie chart is used to show relative proportions or contributions to a whole, which is contributed by
each value in a single data series.
A doughnut chart is used to show the relationship of parts to a whole.
A boxplot, often known as a box and whisker plot, is a visual representation of a data set’s spread
and centres.
A violin plot is a cross between a box plot and a statistical plot that shows data peaks.
The Gantt chart illustrates the project schedule by summarising the tasks to be performed, their
start and end dates, sequence or order of occurrence and overall duration.
Data are broadly divided into two types, which are: quantitative and qualitative (also known as
categorical).
2.6 GLOSSARY
Chart: A graphical representation for all the data visualisation, in which “the data is to represented
by the indicators or symbols
Scatter chart: It is used to show the relationship between the numeric values in two data series
Surface chart: It shows a three-dimensional surface that connects a set of data points
Histogram chart: It is used to show data in the form of frequency within a distribution
Heat map: A sort of data visualisation in which the unique values in a matrix are represented by
colour variations
16
slices in a pie chart. A chart can thus show all of the tabular numeric data as well as functions or
quality structures, and provide varied information. Refer to Section Introduction
2. A bar chart is a visual presentation of category data. The data is represented using a bar chart,
which has a number of bars, every representing a different category. Each bar’s height corresponds
to a specific aggregate (for example the sum of the values in the category it represents). Refer to
Section Types of Charts/Plots Used for Data Visualisation
3. The Gantt chart illustrates the project schedule by summarising the tasks to be performed, their start
and end dates, sequence or order of occurrence and overall duration. It also indicates important
milestones in the project. The Gantt chart, developed by Henry Gantt in 1917, is a type of bar chart
that illustrates the entire project schedule. Refer to Section Types of Charts/Plots Used for Data
Visualisation
4. Data that cannot be quantified or tallied in the numeric form is referred to as qualitative or
categorical data. These kinds of data are organised by category rather than by number. Categorical
Data is the name given to it because of this. Audio, images, symbols and text are all examples of data.
A person’s gender, whether male, female or other, is qualitative data. Refer to Section Effectiveness
of Visualisation Across Data Types
5. The size of each word in a word cloud is related to the frequency with which it appears in a given
piece of text. The words are then grouped in a cloud. The text can be organised in any manner,
including diagonal lines, columns and within a shape. Refer to Section Types of Charts/Plots Used
for Data Visualisation
Discuss with your friends about the different types of charts and plots used for data visualisation
and their purposes.
17
18