Unit 1
Unit 1
Unit 1
com
om 3
1
Introduction Data visualization
The ways we structure and visualize information are changing rapidly and getting more complex with each
passing day. Thanks to the rise of social media, the ubiquity of mobile devices, and service digitaliza-
tion, data is available on any human activity that utilizes technology. The generated information is
hugely valuable and makes it possible to analyze trends and patterns, and to use big data to draw connections
between events. Thus, data visualization can be an effective mechanism for presenting the end user with
understandable information in real time.
Every company has data, be it to communicate with clients and senior managers or to help manage
the organization itself. It is only through research and interpretation that this data can acquire meaning and
be transformed into knowledge.
What is data visualization?
Data visualization is the process of acquiring, interpreting and comparing data in order to clearly communicate complex ideas, thereby facilitating the identification and analysis
of meaningful patterns.
Data visualization can be essential to strategic communication: it helps us interpret available data; detect patterns, trends, and anomalies; make
decisions; and analyze inherent processes.
All told, it can have a powerful impact on the business world.
The data
visualization process
Visualization process1
important in in our long-term memory. same information in a visual, the results are much
clearer (see the graph below).
reports and Consequently, for reports and statements, a visual
rep- resentation that uses images is a much more The graph takes what the numbers cannot communi-
statements? effective way to communicate information than text cate on their own and conveys it in a visible,
or a table; it also takes up much less space. memorable way. This is the real strength of data
visualization.
We live in the era of visual information, and visual
content plays an important role in every moment of This means that data visuals are more
our lives. A study by SH!FT Disruptive Learning attractive, simpler to take in, and easier to
100
The rationale behind the
power of visuals:
80
• The human mind can see an image for just 13
mil- liseconds and store the information, 75 62
56 58
provided that it is associated with a concept. 60
Our eyes can take in 36,000 visual messages 45
per hour. 40
36
• 40% of nerve fibers are connected to the
20
retina.
Month
Jan Feb Mar Apr May Jun
Data types, 2 kinds of data
relationships, and Before we talk about visuals themselves, we must first understand the
visualization different kinds of data that can be visualized and how they relate to one
another.
In order to communicate effectively, it is important • Discrete: Data that consists of whole numbers (0, 1, • Ordinal: Meaning it follows an order or sequence.
to understand different kinds of data and to 2, 3...). For example, the number of children in a That might be the alphabet or the months of the
establish visual relationships through the proper family. year.
use of graphics. Enrique Rodríguez (2012), a data • Continuous: Data that can take any value within • Categorical: Meaning it follows no fixed order.
analyst at DataNauta, once explained in an interview an interval. For example, people’s height For example, varieties of products sold.
that... (between 60 - 70 inches) or weight (between 90
and 110 pounds).
10
11 formats 1. Bar chart
Bar charts
There are two types of visualizations: static and interactive. aredepends
Their use one of the
on most popular
the search ways
and of
analysis dimension
They level. Static
are very visualsand
versatile, canthey
onlyare
analyze data in one dime
typically
visual- izing data because they present a data set
used to compare discrete categories, to analyze
As with any other form of communication, familiar- ity with in athe codeunderstood
quickly and resources thatthat
format are enables
available to us is essential if we’re going to use them successfully our goal. In this p
is listed in order of popularity in the “Visualization Universe” project by Google News Lab and Adioma, changes over time, or to compare parts of a
as of the publication of this report. viewers to identify highs and lows at a glance.
whole.
The three variations on the bar chart are:
Used for chronological data, and Used to visualize Used to visualize categories
it should be in left-to-right categories. that collectively add up to
format. 100%.
6,000
5,500
5,000
4,000
3,500
3,000
Entertainment Feb
2,500
2,000
1,500
Mar
Heatlh
1,000
500
0
Jan Feb Mar Apr May 0% 20% 40% 60% 80% 100% 0% 20% 40% 60% 80% 100%
11
1. Histograms
400K
Histograms represent a variable in the form of
bars, where the surface of each bar is 350K
>120
60-80
100K
2. Pie charts
0. 5.000
2
0
0
0.2 0.4 0.6 0.8 1.0 1.2
4. Heat
maps
size and color of the bubbles represent a dimension represented. They are equivalent to graphics with
that, along with the data, is very useful for visually parallel coordinates on polar coordinates. Typically,
stressing specific values. The two variations on the they are used to represent the behavior of a metric
bubble chart are: over the course of a set time cycle, such as the hours
of the day, months of the year, or days of the week.
Line chart
• Bubble map: used to visualize
three-dimensional values for
geographic regions.
Rad
ar
char
t
14
8. Waterfall charts
400K
These help us understand the cumulative
350K
effect of positive and negative values on
variables in a sequential fashion. 300K
250K
200K
150K
100K
50K
Start A B
C D E F G H I J K End
L
Fall Rise
9. Tree maps
1.0
1.0
0.8
0.8
0.6
0.6
0.4
0.4
0.2
0.2
0
0 1 2 3 4 5 6 0
0 1 2 3 4 5 6
A B C
Stacked area
100% stacked area
16