Data Visualization - Gender Statistics Example
Data Visualization - Gender Statistics Example
TRAINING
GENDER
STATISTICS
DATA VISUALIZATION
STRENGTHENING
Contents
Data visualization tip sheet
Module 1. Introduction to data visualization
Module 2. Data visualization principles and concepts
Module 3. Excel data and charts adjustments
Module 4. Visualization tools comparison
Module 5. Excel chart transformations
Module 6. DATAWRAPPER maps
Module 7. DATAWRAPPER range plots
Module 8. Annotations
WORLD BANK SGS DATA VISUALIZATION TRAINING | DATA VISUALIZATION TIP SHEET
Clustered/grouped bar or column • Mostly for breaking/grouping one variable into different • Should use the same color for all bars (unless
chart levels of disaggregation (subgroups) specifically disaggregated for example by sex).
• Show comparison/ranking • Make sure multiple levels of disaggregation or group
are shown in the axis to avoid labeling each category
• Bar used when variable/category labels are too long to name individually.
show horizontally in a column chart • The gap between the clusters/groups of bars should
be half the size of the width of the bar itself.
Stacked bar or column chart • Show the composition usually out of 100% (part- to-whole • Should have no more than 8 categories.
relationship of a variable’s categories) • Legends should be stretched across the top of the
• Can be simple (one variable) or chart or to the right, and the order should match the
clustered/grouped (different levels of order in the chart.
disaggregation or subgroups) • Should use categorical color palette.
• The “Other” category should not be the largest
section of the stacked chart. Try breaking the
“other” section into smaller sections.
WORLD BANK SGS DATA VISUALIZATION TRAINING | DATA VISUALIZATION TIP SHEET
Line chart • Show the trend in variables usually over time • No more than 4 to 6 lines in a chart.
(but can also be age ranges, or hours in a day). • When too many lines, break into “small
• Show multiple variables with multiple lines (if multiples” or “grid of charts” highlighting the lines
they are on the same scale). individually or highlighting the main line in one color
• Show the same variable for multiple observations and all other lines behind in light grey.
with multiple lines. • Option to shade area between male and female line
to better show gaps.
• When possible, directly label series. If lines are too
close together, use a legend.
• Avoid individual data labels. Individual data labels
may be acceptable if there are few data points and
data labels aren’t overlapping.
Area chart • Show the trend in composition (part-to-whole • Should use categorical color palette.
relationship of categories) over time. It is a combination • The “Other” category should not be the largest
of the line chart and stacked column chart. section of the stacked chart. Try breaking the “other”
section into smaller sections.
• X-axis should have an ordinal variable (time, age,
hours, temperature, etc.)
Pie chart or donut chart • Show the composition out of 100% (part-to- whole • Avoid pie charts whenever possible and
relationship of categories) typically for 3 or 4 but preferably use stacked charts instead.
no more than 6 categories. • If it is necessary to use a pie chart, there should be
no more than 6 slices.
• The “Other” slice should not be the largest
slice of the pie chart.
Range plot • Show comparison/ranking • Order the range plot by value for ease of
• Show gaps and absolute values comprehension.
• Gridlines should always be removed to avoid the
length of the gap blending into the gridline.
WORLD BANK SGS DATA VISUALIZATION TRAINING | DATA VISUALIZATION TIP SHEET
Heatmap • Show comparison with multiple levels or • For continuous data, use sequential color palette or
disaggregation with many subgroups or categories if showing the gender gap, use a diverging color
• Most commonly shown over time (hours, years) or across palette.
age groups • X-axis should have an ordinal variable (time, age,
• Pack a lot of information in one chart hours, temperature, etc.)
Scatterplot • Show relationship between two variables • Is typically reserved for visualizing data with many
• Could show the relationship between male and female observations (i.e., microdata).
values of the same variable or between two years of same • Add a 45-degree line to show equal values between
variable. the two variables. For example, gender parity when
• Show distribution, outliers plotting male and female values on the axes.
• Mostly for microdata but could plot disaggregation • Using a third variable as dot (bubble) size creates a
category with many subgroups (i.e., geographic regions, bubble chart.
occupations)
Dot plot • Show distribution of the whole set of • Is typically reserved for visualizing data with many
observations observations (i.e., microdata).
• Easily identify outliers • Can be shown vertically or horizontally.
• Mostly for microdata but could plot disaggregation • Can be colored by one of the disaggregation
category with many subgroups (i.e., geographic regions, categories. For example, plotting all households
occupations) from microdata and coloring by urban/rural.
Map • Show geographic data much more visually than through the • Should use sequential colors unless showing a gap
aforementioned chart types. which requires a diverging color scale.
• Can be shown at whichever level data are available, • Must use the same color scale and range/steps for
national, administrative regions, provinces, districts, male and female for proper side by side comparison.
cities, etc. • For a gap, use text in the legend instead of numbers
so as not to confuse the reader with negative
numbers. Legend should show parity in the middle
and female values higher than male values at one
end and vice versa for the opposite end of the scale.
WORLD BANK SGS DATA VISUALIZATION TRAINING | DATA VISUALIZATION TIP SHEET
Overall Does the visual highlight a significant gender-relevant finding or conclusion? If not, can the data structure be adjusted to highlight the
gender-relevant insight or is the data not gender-relevant?
Graphs will catch a
Total or aggregate statistics can often be removed from tables and graphs to facilitate comparisons between women and men. If the totals
viewer’s attention so only
are large, choose a chart type that highlights overall and gender-disaggregated statistics.
visualize the data that needs
attention. Too many graphics of
unimportant information dilute
the power of visualization. Is the type of visual appropriate for the data?
For example, change over time is displayed as a line graph, area chart, slope graph, or dot plot.
Do the individual visual elements work together to reinforce the overarching takeaway message? Are any of the visual elements
duplicating information?
Choices about graph type, text, arrangement, color, and lines should reinforce the same takeaway message without
duplication.
Color Is the color scheme intentional? Is there a consistent color for male and female?
Colors should be derived from an intentional choice, not the default color schemes. Use your organization’s colors.
Keep culture-laden color
Avoid colors and figures that reinforce gender stereotypes such as pink for women/girls and blue for men/boys.
connotations in mind.
Always use one color to represent women and another color to represent men. These colors should not be two different shades of the same color hue
Use sites like Color (i.e., light blue and dark blue). They should be two distinct shades of color (i.e., blue and orange).
Brewer to find color
schemes suitable for
Does the color highlight key patterns?
reprinting in black- and-
Action colors should guide the viewer to key parts of the display. Less important, supporting, or comparison data should be
white and for
a muted color, like gray.
colorblindness.
Is the color legible when printed in black and white or for people with colorblindness?
When printed or photocopied in black and white, the viewer should still be able to see patterns in the data. Avoid red-green and yellow-blue combinations when
those colors touch one another. Avoid using red to mean bad and green to mean good in the same chart.
WORLD BANK SGS DATA VISUALIZATION TRAINING | DATA VISUALIZATION TIP SHEET
Color (continued) For accessibility, don't rely solely on color to convey the idea. As much as possible, the presentation of the data should still be understandable if the color is
removed or there should be enough additional text that identifies the key ideas in the table for the visually impaired audience who can read the report with
a screen reader.
Does the graph have one horizontal and one vertical axis?
Viewers can best interpret one x-axis and one y-axis. Don’t add a second y-axis. Try a connected scatter plot or two graphs, side by side, instead. (A
secondary axis used to hack new graph types is ok, so long as viewers aren’t being asked to interpret a second y-axis.)
Arrangement Data should be displayed in an order that makes logical sense to the viewer. Data may be ordered by frequency counts (e.g., from greatest to
least for nominal categories), by groupings or bins (e.g., histograms), by time period (e.g., line
(continued) charts), alphabetically, etc. Use an order that supports interpretation of the data.
As a general rule, the chart should be ordered by value (for sex-disaggregated data typically by female value). Female values should always appear
first in a graph compared to male values (above or to the left of the values for men) – this is the case for bars or columns, for the order of tables, and
also for legends.
Are axis intervals equidistant? If not, is there a symbol that signifies the jump?
The spaces between axis intervals should be the same unit, even if every axis interval isn’t labeled.
Irregular data collection periods can be noted with markers on a line graph, for example. Some charts should not start at 0 or we will not see a
significant pattern, but there should at least be a symbol to make it clear that it does not start at 0).
Titles are in a larger size than subtitles or annotations, which are larger than labels, which are larger than axis labels, which are larger than source information. The smallest text -
Text axis labels - are at least 9 point font size on paper, at least 20 on
(continued) screen.
Does the direction of the text respect cultural and linguistic rules?
Make sure that graphs and tables are aligned according to the rules for right-to-left or top-down languages. Make sure that the correct grammar, punctuation, and
capitalization follow the rules of the corresponding language. When using English, make sure that the graphics and text are consistent with only one type of English (American
or British English).
Adapted from "Data Visualization Checklist" by Stephanie Evergreen & Ann K. Emery
WORLD BANK SGS DATA VISUALIZATION TRAINING | DATA VISUALIZATION TIP SHEET
Border
Anatomy of a visual 3D
Label
Chart Title
89
REGION G 44
67
Data Label
56
REGION F 42
49
51
REGION E 48
50
48
REGION D 41
44
49
REGION C 46
48
45
REGION B 42
43
51
REGION A 53
52
Axis
0 10 20 30 40 50 60 70 80 90 Gridlines
Internet Access
Axis Title
Axis Label
Legend
STRENGTHENING
GENDER
STATISTICS
DATA VISUALIZATION
TRAINING MODULE 1:
INTRODUCTION TO
DATA VISUALIZATION
Contents
1. What is data visualization?
2. Why is data visualization important?
3. Advantages/disadvantages of data visualization.
4. How are data visualizations disseminated?
5. For whom are data visualizations produced?
6. Practice exercise
1. Introduction to
data visualization
What is data visualization?
➢ Data visualization is the graphical representation of information and data.
➢ Visual elements:
• Include charts, diagrams, data visualization attributes (shape, position, color, pattern, etc.),
images/icons, annotations (lines, shaded areas, arrows, etc.)
• Provide an accessible way to see and understand trends, outliers, and patterns in data.
Pre-primary Pre-primary
education Primary education Secondary education enrollment Y1 to
Y2 Improvement
Y1 Y2 Y1 Y2 Y1 Y2
City A 11.1 18 66.4 77 19.9 32.2 6.9
City B 7.5 15.2 62.8 73.8 14.4 23.1 7.7
City C 23.6 26.6 77.7 86.8 35.2 59.2 3
City D 10.9 17.5 66.2 76.8 19.7 31.4 6.6
City E 17.7 25.8 81.9 92.7 42.1 64.2 8.1
City F 28.2 26.4 77.1 84.5 33.2 55.4 -1.8
City G 7.2 14.7 62.6 73.5 13.9 22.8 7.5
City H 18.5 36.1 71.8 85 27.5 61 17.6
City I 16.7 32.5 64.6 76.5 24.8 54.9 15.84
City J 9.7 17.9 62.5 75.6 18.3 29.8 8.2
City K 12.8 18 70.6 78.5 21.6 34.6 5.2
GROUP DISCUSSION
➢ What insights does the chart provide?
➢ Can you easily tell which level of education has the highest enrollment?
➢ Can you easily compare enrollment for different cities? From year 1 to year 2?
➢ Is it easier to answer these questions compared to the table?
Simplified presentation
1) Too much information is 2) Non intuitive for the
being shared in one chart audience to read through
Pre-primary
education Primary education Secondary education Improvement
Y1 Y2 Y1 Y2 Y1 Y2
City A 11.1 18 66.4 77 19.9 32.2 6.9
City B 7.5 15.2 62.8 73.8 14.4 23.1 7.7
City C 23.6 26.6 77.7 86.8 35.2 59.2 3
City D 10.9 17.5 66.2 76.8 19.7 31.4 6.6
City E 17.7 25.8 81.9 92.7 42.1 64.2 8.1
City F 28.2 26.4 77.1 84.5 33.2 55.4 -1.8
City G 7.2 14.7 62.6 73.5 13.9 22.8 7.5
City H 18.5 36.1 71.8 85 27.5 61 17.6
City I 16.7 32.5 64.6 76.5 24.8 54.9 15.84
City J 9.7 17.9 62.5 75.6 18.3 29.8 8.2
City K 12.8 18 70.6 78.5 21.6 34.6 5.2
-5 -1.8
City H City I City J City E City B City G City A City D City K City C City F
Provides a better sense of how pre- Conveys the exact information we are
primary enrollment has changed. trying to communicate with no extra
Top 3 easily identifiable. details. No need to search top cities.
Why is data visualization important?
The importance is simple – data visualization helps people:
➢ Whether simple or complex, the right visualization can bring everyone on the same page, regardless
of their level of expertise.
➢ While traditional education typically draws a distinct line between creative storytelling and technical
analysis, the modern professional world also values those who can cross between the two:
Data visualization sits right in the middle of analysis and visual storytelling.
ADVANTAGES DISADVANTAGES
• Easily identify trends that could be • Cannot always discern exact values
missed in tables. from the visual or they are not
provided.
• Appropriate use of visual elements
(colors/patterns) makes it easier to • Core messages can get lost.
find patterns and relationships.
• Wrong design or visual practices can
• Possibility to explore data in lead to inaccurate representations or
interactive formats through biased information.
dashboards.
Advantages/disadvantages of data visualization
➢ Can you quickly identify which city has the largest population?
Population Disease
City A 47.1 28.2
City B 102.0 61.2
City C 63.8 38.3
City D 96.0 57.6
City E 98.0 58.8
City F 50.2 30.1
City G 3.0 1.8
Population Disease
City A 47.1 28.2
City B 102.0 61.2
City C 63.8 38.3
City D 96.0 57.6
City E 98.0 58.8
City F 50.2 30.1
City G 3.0 1.8
Population Disease
City A 47.1 28.2
City B 102.0 61.2
City C 63.8 38.3
City D 96.0 57.6
City E 98.0 58.8
City F 50.2 30.1
City G 3.0 1.8
Population Disease
City A 47.1 28.2
City B 102.0 61.2
City C 63.8 38.3
City D 96.0 57.6
City E 98.0 58.8
City F 50.2 30.1
City G 3.0 1.8
Population Disease
City A 47.1 28.2
City B 102.0 61.2
City C 63.8 38.3
City D 96.0 57.6
City E 98.0 58.8
City F 50.2 30.1
City G 3.0 1.8
No Disease Disease
Population Disease
City A 47.1 28.2
City B 102.0 61.2
City C 63.8 38.3
City D 96.0 57.6
City E 98.0 58.8
City F 50.2 30.1
City G 3.0 1.8
No Disease Disease
Infographic
• An eye-catching collection of images, icons, charts with minimal text
Infographic
• An eye-catching collection of images, icons, charts with minimal text
Infographic
• An eye-catching collection of images, icons, charts with minimal text
Infographic
• An eye-catching collection of images, icons, charts with minimal text
Infographic
• An eye-catching collection of images, icons, charts with minimal text
Infographic
• An eye-catching collection of images, icons, charts with minimal text
Right data
Right format (topic, priority
(meets users’ issues,
needs) statistically
sound, quality)
Right users
(reaches intended users)
Mapping data outputs to audiences
Data analysts,
researchers, academia
•Complete granular raw data,
Development practitioners, questionnaires, codebooks, etc.
gender specialists in •Dedicated/comprehensive
government, NGOs databases of microdata and
Media, policymakers •Visuals of both summary and processed gender statistics
disaggregated data •Advanced visualizations often
•Summary tables, charts, trends,
visualizations, short stories •Analysis of results including using statistical concepts
1. What are the key or most useful insights from the data?
• Which disaggregations do you wish to communicate in the visualization?
What We Do
2. Who is/are the audience(s) you are communicating to?
3. Through which type of data output would you disseminate the data visualization?
• If you have multiple audiences, will there be more than one data output? Which types?
4. What type of potential actions can be taken based on the insights in the data visualization?
DATA VISUALIZATION
TRAINING MODULE 2:
DATA VISUALIZATION
PRINCIPLES & CONCEPTS
Contents
1. Data visualization overall principles
2. Data visualization steps
3. Data visualization tips and concepts by chart type
1. Overall data
visualization
principles
Data visualization improves communication
GUIDING PRINCIPLES BASIC RECOMMENDATIONS
1. Understand the information/data and 1. Highlight 1-2 key messages per visual.
prioritize the information for sharing.
2. Explore graph and chart design choices beyond
2. Choose the correct software/platform and the default options.
chart type for effective visuals designs keeping
in mind the audience, data, output type. 3. Recognize the right use of visual characteristics
especially color as it is incredibly powerful to
3. Use effective data visualization concepts and depict meaning.
characteristics (color, shape, size, pattern, etc.)
to maximize the impact of the data. 4. Reduce the clutter and keep only essential
elements (no duplicative visual characteristics)
4. Communicate data meaning clearly, quickly
and ethically. 5. Never mislead the audience/manipulate visual.
5. Always integrate text elements with the 6. Use annotations, minimize jargon, acronyms,
graphs and images to tell a story (consider and technical terms, and choose a font that is
infographics). easy to read.
BENEFITS
Decision-makers more quickly absorb gender-related insights from surveys
which in turn facilitates evidence-based policy-making
1. Understand and show the data
➢ To create a great visualization, you must understand the key insights and how to let the data shine
through the visual.
➢ Auto-generated charts will not automatically or adequately highlight the important aspects of the data.
➢ You must explore the data first to understand which insights to relay to audience.
• Do not include too many variables or too much information. You do not have to relay all the data.
Other data can go in a table in annex.
• Include helpful annotations.
2. Select the right type of graph
➢ Match the graph type to the audience’s level of data/statistics comprehension.
• Heatmap, tree map, histogram, Dot plot Histogram Heat map Treemap
distribution chart, dot plot, complex
Medium
expertise tables
F: Orange
M: Blue
F: Pink
M: Green
GROUP DISCUSSION
➢ Discussion – which colors Color or race
are used for male and
female in this report?
➢ Are they consistent? Total
Men
➢ Is there anything you
Women
would change about the
colors of these visuals? White Black or brown
Total
➢ Do not add values in a stacked bar or column or in a pie chart that do not total 100% For example,
stacking female, male, and total together.
➢ Don not use inconsistent scales.
8. Highlighting the gender-relevant insights
➢ NSOs' comparative advantage regarding statistics → highlight detailed levels of disaggregation that
international organizations don’t calculate or disseminate.
1. Use multiple disaggregations in the same charts whenever possible.
➢ Aesthetically more pleasing than just 2 bars for male and female and provide more gender-
relevant insights that tell a more nuanced story.
2. Use subnational region maps.
➢ International databases cannot showcase this subnational level as it’s not internationally
comparable.
8. Highlighting the gender-relevant insights
➢ Aim to highlight the gender gap rather than the distribution of female and male values across a different
category (age, sector, type of employment, etc.).
Showcasing the distribution of female/male employment across sectors Showcasing the gender gap
Key insight: Most women and men work in the agricultural sector Key insight: Gender gap in employment is biggest in the industrial sector
2. Data visualization
steps
Steps for creating a data visualization
Explore/try
visualization
options Three-fourths of creating a visualization is add
Optional: cleaning
Design, format, Publish and/or
Upload data and finalizeand formatting the data in the proper additional
download way to:
visualization visualization annotations
as needed
Adjust data match the required data structure, inputs,
structure
and features of the intended visualization type
25%
Formatting &
75% annotating
Three-fourths of creating a visualization is cleaning
the visual
Properly and formatting the data in the proper way to:
cleaning,
match the required data structure, inputs,
transposing and features of the intended visualization type
& preparing
the data highlight the right message and insights
Difference/gap between the types of unpaid work for a given gender. Difference/gap between men’s and women’s time spent on a given
type of unpaid work.
3. Data visualization
tips and concepts by
chart type
What We Do
COMPARISON – MAPS
Maps
➢ Maps will provide nuance to the national statistics by easily identifying the
discrepancies within the country regionally.
• These regional data are often not collected by international
organizations but are most crucial to policymakers in order to pinpoint
policies and interventions that work for each region.
➢ Use the same color scale and ranges when using side by side maps to
compare female and male values.
➢ To avoid side by side maps, map the gender gap rather than female and
male values.
• Make sure to use a diverging scale with the gender gap data.
• Rename the legend for the direction of the gap whether the female
value is relatively higher or lower than the male value or vice versa.
• Negative numbers will confuse the reader. Where possible don’t show
the negative numbers.
GROUP DISCUSISON
➢ Potential actions:
• Financial literacy classes in
school.
• Support to women owned
business.
• Simplify the loan approval
process.
What We Do
➢ Discussion – Is it easy to see the main insights? Easy to see the gender gap? Is there anything you
would change about the colors of these visuals?
What We Do
PRACTICE EXERCISES
Exercise – suggest changes to the set of visuals
➢ See Sheet named "Exercise 2" in the Excel file "Training Dataset Day 1" for an example of a set of visuals
that are part of the same report.
➢ Write a list of suggested changes that you would make to this set of visuals to ensure that they are
consistent and following best practices in data visualization principles.
➢ For best practices in data visualization refer to the "Data Visualization Tip Sheet" handout or Word file.
➢ There should be consistency across the whole report in:
• colors for the same categories;
• font type and size;
• title structure and punctuation;
• phrasing of categories.
DATA VISUALIZATION
TRAINING MODULE 3:
EXCEL CHARTS AND DATA
ADJUSTMENTS
Contents
1. Recap of data visualization steps and breakdown
2. Using data visualization principles to adjust data and chart
elements in Excel
- Grouping or ordering data, chart elements,
and legend for the right data presentation
- Decluttering the chart
- Appropriate sorting and use of colors
Specific actions for adjusting chart elements
Explore/try
visualization
options Three-fourths of creating a visualization is add
Optional: cleaning
Design, format, Publish and/or
Upload data and finalizeand formatting the data in the proper additional
download way to:
visualization visualization annotations
as needed
Adjust data match the required data structure, inputs,
structure
and features of the intended visualization type
25%
Formatting &
75% annotating
Three-fourths of creating a visualization is cleaning
the visual
Properly and formatting the data in the proper way to:
cleaning,
match the required data structure, inputs,
transposing and features of the intended visualization type
& preparing
the data highlight the right message and insights
1.
4.
5.
Incorrect data structure
➢ This visual is grouping values by type of work (row) and stacking female and male values (columns).
➢ The data are not structured properly for this stacked bar chart to show what part/percentage of the
whole day these unpaid work activities comprise for women and men.
1.
Adjustment option 1: transpose data in tool
➢ First, always try transposing the data in the visualization software.
➢ You should see that the visual has changed. The visual is grouped by gender now. This does not
change the data table in the Excel spreadsheet. It only changes the chart orientation.
Chart Title
Female
Male
0 5 10 15 20 25
Alternative visuals
30
20
Emphasis on
gender gap within
10
a given type of
0
Female Male
unpaid work.
Intended visual Unpaid Domestic Work Unpaid Care Work
Female
Unpaid Care Work Emphasis on
Male
gender gap within
Unpaid Domestic Work
0 10 20 30 a given type of
0 5 10 15 20 unpaid work.
Unpaid Domestic Work Unpaid Care Work
Female Male
Emphasis on gap or difference between the
types of unpaid work for a given gender. 20
Emphasis on gap or
10
difference between
0 the types of unpaid
Female Male
work for a given
Unpaid Domestic Work Unpaid Care Work gender.
Adjustment option 2: change visual type
How do you choose which Alternative visuals
- Correct message
alternative visual to use? 20
- Data labels and
10 color key needed
• Is the visual still emphasizing or 0 - Make chart area
highlighting the right Female Male taller and bars
Unpaid Domestic Work Unpaid Care Work slightly thinner
message/insight?
• Do you have to adjust the chart
Unpaid Care Work
area or the bar or column - Correct message
- Minimal aesthetic
height/width? Unpaid Domestic Work
changes or
• Do you have to add data labels, 0 5 10 15 20 ordering needed
Female Male
a color key, or gridlines?
• Do you have to sort, regroup, or 20 - Must transpose
re-order the bars or columns? 10 to highlight the
correct insight
0
Female Male
- Data labels or
Unpaid Domestic Work Unpaid Care Work gridlines needed
- Make chart area
taller and bars
slightly thinner
TASK 1
➢ Go to Sheet named "Task 1" in the Excel file "Training Dataset Day 1".
➢ Use either of the options demonstrated (transposing data or changing chart type) to adjust the chart in
the sheet so that it highlights the gender gap.
What
REORDERING We ELEMENTS
CHART Do TO
ENSURE THAT VISUALLY THE FEMALE VALUES
ARE BEFORE THE MALE VALUES
Adjusting the order/display of genders
➢ Female values should always be first in the chart either above the male values or to the left of the male
values.
➢ In this chart, the male values show up first, above the female values.
• The data table must be reordered to make the change show up in the chart.
Chart Title
Male
Visually, "Male" is
ordered above or Female
before "Female".
0 5 10 15 20 25
Chart Title
Male
Male
Female
0 5 10 15 20 25
Chart Title
Female
Male
Female
0 5 10 15 20 25
Chart Title
Female
Male
0 5 10 15 20 25
Capital City
Sole ownership 21.0 79.0
Joint ownership
33.4 66.6
Both sole and joint ownership 21.4 78.6
Urban
1.
4.
5.
Default chart view of multiple disaggregations
➢ The visual does not register the location grouping in this data structure and thinks it is part of the series.
➢ The data must be restructured so that the Excel visualization tool can cluster the column charts by
location.
Chart Title
100.0
90.0
80.0
70.0
60.0
50.0
40.0
30.0
20.0
10.0
0.0
Sole ownership
Joint
Bothownership
sole and joint ownership Sole ownership
Joint
Bothownership
sole and joint ownership Sole ownership
Joint
Bothownership
sole and joint ownership
Women Men
Inserting a new column
➢ To restructure the data, first insert a new column to the left of the "Ownership" column (currently column
A) by right clicking on the top of Column A and selecting “Insert".
➢ There should now be a blank first column.
Cutting and pasting data into blank column
➢ Cut the location categories and paste them into the first column in the first row of data for that section.
• You must cut each category and paste it individually. Repeat for each category that must be moved.
▪ Shortcut for the cut function is CTRL+X (holding CTRL and X at the same time after the intended
cell has been highlighted).
• In this example, the name of the location should be in the same row as the
“Sole ownership” data.
Opening the select data source window
➢ Right click on the chart and click “Select Data”.
➢ The “Select Data Source” box will pop up.
Selecting the new range of data in data source
➢ The previous data range will be denoted with a dashed green border.
➢ Select the full range of data including column A and hit the "Enter" button on the keyboard or click the
"OK" button in the popup box.
Adjusting grouping of multiple disaggregations
➢ Your visualization should now be grouped by the additional “location” disaggregation but there is a little
too much spacing in between the groups.
Chart Title
100.0
90.0
80.0
70.0
60.0
50.0
40.0
30.0
20.0
10.0
0.0
Sole Joint Both sole Sole Joint Both sole Sole Joint Both sole
ownership ownership and joint ownership ownership and joint ownership ownership and joint
ownership ownership ownership
Capital City Urban Rural
Women Men
Removing extra space between groupings
➢ Highlight the empty rows between the groupings by clicking on the grey number associated with the row.
➢ The rows can be highlighted and deleted separately or at the same time. To highlight and delete at
the same time, highlight the first row as indicated above, then hold down the CTRL button and
highlight the second empty row you plan to delete (while still holding the CTRL button) before
moving to the next step.
Removing extra space between groupings
➢ Once the rows you want are highlighted, right click the row and select “Delete”.
• If you accidentally highlight the wrong row, before you delete the row, click anywhere in the Excel
sheet to unhighlight, then start again with highlighting the right row as per the previous slide’s
instructions.
➢ If you are deleting each row separately, highlight the next row and repeat the process.
Adjusting grouping of multiple disaggregations
➢ Your visualization should now be grouped by the additional “location” disaggregation with no additional
space in between the groups.
Chart Title
100.0
90.0
80.0
70.0
60.0
50.0
40.0
30.0
20.0
10.0
0.0
Sole ownership Joint ownership Both sole and Sole ownership Joint ownership Both sole and Sole ownership Joint ownership Both sole and
joint ownership joint ownership joint ownership
Capital City Urban Rural
Women Men
TASK 3
➢ Go to Sheet named "Task 3" in the Excel file "Training Dataset Day 1".
➢ Adjust the chart in the sheet to have the multiple disaggregations neatly grouped within the chart rather
than individually listing all of the different combinations.
GROUP DISCUSSION
➢ Below is a visual that you have received in a report and must apply data visualization principles to fix it.
➢ Discussion – What are the immediate issues with this visual? What changes would you make?
0.64
0.60 0.61 0.62 0.61
0.53
1.
4.
5.
Decluttering visuals
➢ The visualization should now be 2-Dimensional.
0.64
0.61 0.62 0.61
0.60
0.53
0.50
0.40
0.30
0.20
0.10
0.00
Female to Male Employment Ratio
0.30
0.20
0.10
0.00
Female to Male Employment Ratio
Household with children Household without children Couple without children Female to Male Employment Ratio
Couple with children Extended Family Overall
What We Do
DECLUTTERING:
REMOVING GRIDLINES
Removing gridlines
➢ You can remove gridlines in multiple ways. You can click on the gridlines in the chart (the blue circles
confirm that the gridlines are selected and not another chart element) and then hit the “Delete” key.
➢ Or you can click on the chart and then the “Chart Elements” plus symbol and uncheck the box next to
"Gridlines".
What We Do
DECLUTTERING:
REMOVING AXIS TITLE AND LABELS
Removing the axis title and axis labels
➢ There are now duplications of elements. The y-axis is duplicating the data labels while the x-axis is
duplicating the chart title. They can be removed in the same way as the gridlines either by clicking on the
chart element and hitting the “Delete” key.
➢ Or you can click on the chart and then the “Chart Elements” plus symbol and uncheck the box next to
"Axes".
FEMALE TO MALE EMPLOYMENT RATIO
0.70 0.64 0.62 0.61 0.61 0.60
0.60 0.53
0.50
0.40
0.30
0.20
0.10
0.00
Female to Male Employment Ratio
4.
1.
TASK 4
➢ Go to Sheet named "Task 4" in the Excel file "Training Dataset Day 1".
➢ Adjust the chart in the sheet to remove any unnecessary elements (3D, gridlines, axes, etc.).
➢ Adjust the chart in the sheet to sort the bars in order of value.
Chart Title
66.8
REGION G 44.2
89.3
48.7
REGION F 41.6
55.7
49.5
REGION E 48.1
50.9
44.2
REGION D 40.7
47.6
47.6
REGION C 46.2
48.9
43.4
REGION B 41.6
45.1
51.6
REGION A 52.6
50.5
0 10 20 30 40 50 60 70 80 90
0.64
0.62 0.61 0.61 0.60
0.53
0.64 0.64
0.62 0.61 0.61 0.60 0.62 0.61 0.61 0.60
0.53 0.53
BEFORE AFTER
➢ Time for exercise: 20 minutes Labor force participation by gender, age, and location
90%
78.4% 78.5% 77.4%
80%
70% 66.5% 66.9%
61.0% 61.2%
60% 53.9% 54.6% 54.5% 55.4%
50%
40% 33.5%
30%
20%
10%
0%
15+ 15-24 15+ 15-24 15+ 15-24
Total Rural Urban
Female Male
What We Do
ANNEX
Video for widening the plot area
➢ Click the plot area and drag the right side of the area towards the right to make the plot area wider.
• To drag, you must hover over the circular point for the mouse to turn into an arrow (see video).
• You will see a light blue outline if done correctly while you’re dragging the plot area.
• If you happen to move the plot instead of widen it, you can still adjust it – it might take more steps.
WIDEN PLOT
MOVE PLOT
(CORRECT)
What We Do
TRANSFORMING LABELS
Moving the data labels to the base of the bars
➢ After removing the legend, instead of adding the category labels in another tool, another option is to
transform the data labels into the category labels.
➢ Click on the chart and then on the “Chart Elements” plus symbol. Then click “Data Labels” and select
“Inside Base”. You should see the data labels move to the bottom. Repeat and select “More Options”.
➢ In the “Format Data Labels” panel on the right side, click the bar chart symbol and then “Label Options”.
1. 3.
2.
4.
Transforming data labels into category labels
➢ Check the box next to “Series Name” and uncheck the boxes next to “Value” and “Show Leader Lines”
➢ You should now see the name rather than the data label. Repeat for each of the data labels.
3.
1.
2.
4.
Transforming data labels into category labels
➢ Check the box next to “Series Name” and uncheck the boxes next to “Value” and “Show Leader Lines”.
➢ Then click the “Text Options” tab and change the text color to black.
3.
3.
1.
2.
4.
4.
Adjusting the size of the plot area
➢ You should now see black category labels instead of a white data labels.
➢ Click the inner plot area with the bars and drag upwards to make the area shorter. Now there is space to
drag down each of the category labels.
Moving the labels below the plot area
➢ Click each category name separately and drag it down into the white space below the purple columns.
➢ Adjust the sizing of the text by moving the 4 corners of the text box.
➢ Repeat for each category until they are all below the corresponding purple column.
60%
50%
40%
30%
20%
10%
0% Couple
Household Extended Household Couple
Extended Household Couple Overall with
Household Couple with with Family without without
Family Overall
with children children without without children children children children
children children
STRENGTHENING
GENDER
STATISTICS
DATA VISUALIZATION
TRAINING MODULE 4:
VISUALIZATION TOOL
COMPARISON
Data visualization tools comparison
Feature Excel Datawrapper Flourish
• want to quickly create a standard static or interactive chart or advanced tables (with
sparklines or data bars);
• prefer to easily highlight one or a few main data points, lines, or bars in the chart;
• want to quickly create a subnational geographic map, range plot, bullet bar;
• need to work from a mobile-phone or tablet.
• need to create a chart type not available in other tools or a more advanced chart type;
• need to see sample data or visuals to understand how to create the visual;
• want to easily create a data story;
• prefer to group data horizontally instead of vertically;
• want to visualize data in “small multiples” or a “grid of charts”;
• prefer to add a highlight line or sort the legend options in a particular order.
Infographics & social media: Canva and Visme
➢ Canva, Visme, Infogram, Venngage, Piktochart etc. produce a
collection of imagery, text, and data visualizations to give an eye-
catching, creative overview of a topic. All have similar functions.
Dashboards: Tableau and PowerBI
➢ Tableau and PowerBI provide an interface with interactive visualizations and business intelligence “BI”
capabilities allowing users to create their own dashboards or reports mostly to track/monitor progress.
➢ Dashboards use a combination of interactive visualization types and tables that are sortable, filterable.
➢ Dashboards are exploratory (not explanatory) visualizations – they allow users to explore various
insights rather than highlighting one or w few key insights or narratives/stories.
ANNEX
Comparison of SGS
indicators in each data
visualization tool
Managerial positions
Time use
0 5 10 15 20 25
% of day
Unpaid Domestic Work Unpaid Care Work
Employment by sector
Unemployment
Asset Ownership
Ratio of female to male labor force participation
Asset ownership
Gender Pay Gap
DATA VISUALIZATION
TRAINING MODULE 5:
EXCEL CHART
TRANSFORMATIONS
Contents
Explore/try
visualization
options Three-fourths of creating a visualization is add
Optional: cleaning
Design, format, Publish and/or
Upload data and finalizeand formatting the data in the proper additional
download way to:
visualization visualization annotations
as needed
Adjust data match the required data structure, inputs,
structure
and features of the intended visualization type
25%
Formatting &
75% annotating
Three-fourths of creating a visualization is cleaning
the visual
Properly and formatting the data in the proper way to:
cleaning,
match the required data structure, inputs,
transposing and features of the intended visualization type
& preparing
the data highlight the right message and insights
Example 1: Example 1:
Grouping and ordering data, chart elements, Transforming pie chart to a stacked column chart.
and legend for the right data presentation.
Example 2:
Example 2: Transforming grouped column chart with multiple
Simplifying (removing unnecessary chart disaggregations into a bullet bar.
elements), intentional and appropriate sorting,
and using the right data visualization attributes Example 3:
to highlight insights (color, shape, pattern, etc.) Transforming grouped bar chart to line chart.
PRACTICE EXCERCISES
1. Recreate a visual from trainer’s screen using data from Training Dataset file.
2. Create one’s own visual with data of choice from Training Dataset file.
NOTE: It is recommended to complete PART 1 and the accompanying Practice Exercise from
Excel Module 3 prior to moving on to PART 2 (this module).
What We Do
TRANSFORMING A PIE CHART
INTO A STACKED CHART
GROUP DISCUSSION
➢ Discussion – Can you understand the main insight? What other ways could this visual be shown that
are more efficient? What would you change in this visual?
22% 23%
23% 24%
57% 43%
12%
12% 9%
16%
18%
16% 9%
9% 18% 16% 9%
Reasons for not seeking employment Reasons for not seeking employment
50 50
45
40 40
35
30 30
25
20
20
10 15
10
0 5
Female Male 0
Full time Too young/ Disabled/ Ill My spouse Occupied with Other
Full time student Too young/ too old student too old wouldn't allow home duties
that
Disabled/ Ill My spouse wouldn't allow that
Occupied with home duties Other Female Male
GROUP DISCUSSION
➢ The 100% stacked bar chart compared to the pie chart…
• shows the values in length which are easier to compare than the pie slices;
• shows the percentage/proportion of the whole;
• is easier to identify gender gap comparison with female visually above male comparing lengths;
• is created in one chart not two separate charts.
Reasons for not seeking employment Reasons for not seeking employment
Female Male
Female
23%
22% 23% 24%
57% 43%
12%
12% 9% Male
16% 18%
16% 9%
9% 18% 16% 9%
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
2. 3.
4.
1.
Changing multiple pie charts into stacked chart
➢ Alternatively, click one of the original pie charts then click the “Chart Design” tab in the toolbar menu at
the top. Then click the “Change chart type” icon and select stacked bar chart.
2. 3.
3.
5.
4.
1.
Adjusting data selection for a chart
➢ The chart is currently only picking up the "Female" column so it just looks like one bar of data.
➢ To adjust the data range for the chart, right click the chart and select “Select Data”.
➢ The “Select Data Source” box will pop up highlighting the current data range in dashed green lines.
4.
1.
Readjusting data for stacked bar chart
➢ Highlight the full range of data to include both the female and male columns. Then click “OK” or hit the
"Enter" key.
➢ Now there should be both female and male data in the chart.
Female
Other 22%
Disabled/Ill 18%
0% 50% 100%
Male Female
Widening the plot area
➢ Click the plot area and drag the right side of the area towards the right to make the plot area wider.
• To drag, you must hover over the circular point for the mouse to turn into an arrow (see video).
• You will see a light blue outline if done correctly while you are dragging the plot area.
• If you happen to move the plot instead of widen it, you can still adjust it – it might take more steps.
1. 2.
TASK 6
➢ Go to Sheet named "Task 6" in the Excel file "Training Dataset Day 2".
➢ Adjust the chart in the sheet by changing one of the two pie charts to a stacked bar chart.
➢ Make sure to readjust the data structure to ensure there are two bar colors on the chart, not one.
Reformatting the stacked bar chart
➢ The stacked bar chart rows are by “Reason for not seeking employment” when they should be by “Gender”.
➢ To reformat, click the chart, then click the “Chart Design” tab in the Excel ribbon (top toolbar menu).
➢ Then click “Switch Row/Column" and you will see the chart transform.
2.
3.
Female
1.
Female
2. Male
4.
Male
1.
2.
3.
Choosing appropriate colors
➢ Choose categorical colors (not sequential or diverging).
➢ Choose colors that will work with the rest of the report. Do not reuse colors that have been assigned for
other categories “female”, “male”, “total”, “urban”, “rural”, etc.
➢ For the bar segments that you want to draw attention to like “Occupied with home duties” or “My spouse
wouldn’t allow that” choose darker or brighter colors. Choose lighter colors for those bar segments that
don’t seem to have much of a gender gap like “Too young/too old”, and “Disabled/ill”.
Diverging
Categorical Sequential
Selecting hex codes and color palettes
➢ Hex codes can be found at htmlcolorcodes.com and https://www.computerhope.com/htmcolor.htm
➢ Color palettes can be found at colorhunt.co or colormind.io
➢ You can also find these by typing searching in an internet search engine like Google Chrome.
➢ The hex code will appear upon hovering over the color or written right underneath the color.
Choosing appropriate colors
➢ Intentionally choosing colors transforms a visual from looking “outdated” to “polished”.
Reasons for not seeking employment by gender Reasons for not seeking employment by gender
Female Female
Male Male
0% 20% 40% 60% 80% 100% 0% 20% 40% 60% 80% 100%
Full time student Too young/Too old Full time student Too young/Too old
Disabled/Ill My spouse wouldn't allow that Disabled/Ill My spouse wouldn't allow that
Occupied with home duties Other Occupied with home duties Other
Alternative versions
➢ Choose colors that will work with the rest of the report. Do not reuse colors that have been assigned for
other categories “female”, “male”, “total”, “urban”, “rural”, etc.
Reasons for not seeking employment by gender Reasons for not seeking employment by gender
Female Female
Male Male
0% 20% 40% 60% 80% 100% 0% 20% 40% 60% 80% 100%
Full time student Too young/Too old Full time student Too young/Too old
Disabled/Ill My spouse wouldn't allow that Disabled/Ill My spouse wouldn't allow that
Occupied with home duties Other Occupied with home duties Other
Reasons for not seeking employment by gender Reasons for not seeking employment by gender
Female Female
Male Male
0% 20% 40% 60% 80% 100% 0% 20% 40% 60% 80% 100%
Full time student Too young/Too old Full time student Too young/Too old
Disabled/Ill My spouse wouldn't allow that Disabled/Ill My spouse wouldn't allow that
Occupied with home duties Other Occupied with home duties Other
TASK 8
➢ Go to the same chart you adjusted in Task 6 and 7 in the Sheet named "Task 6" in the Excel file "Training
Dataset Day 2".
➢ Choose a categorical palette for the stacked bar chart out of the following options.
➢ Use the eyedropper tool to find the hex codes on the color palettes below, then assign a color to each
category.
TRANSFORMATION
BEFORE AFTER
Reasons for not seeking employment Reasons for not seeking employment by gender
Female Male
Female
22% 23%
23% 24%
57% 43%
12%
12% 9%
16%
18% Male
16% 9%
9% 18% 16% 9%
0% 20% 40% 60% 80% 100%
Female Male
GROUP DISCUSSION
➢ Discussion – Can you understand the main insights better? Which one shows the gender gap better?
➢ Is there anything you would still change in the visual on the right?
Proportion in tertiary education by gender and field of study Proportion in tertiary education by gender and field of study
70 63.43 62.00
57.8 58.2 60 60 35
60 54.79 Science, Technology , Engineering and Mathematics… 32.28
50.3 49.0846
50 Agriculture, Forestry, Fisheries and Veterinary 46
37.5
36.26 49.08
40 32.2835
27.00 25.19 Engineering, Manufacturing and Construction 60
30 25.19
20 Business, Administration and Law 50.3
54.79
10
Information and Communication Technologies 60
0 27.00
Social Sciences, Journalism and Information 58.2
62.00
Arts and Humanities 57.8
63.43
Natural Sciences, Mathematics and Statistics 37.5
36.26
0 10 20 30 40 50 60 70
Male Female
Female Male
GROUP DISCUSSION
➢ The bullet bar compared to the clustered column chart:
• is easier to identify the gender gap at a glance with the skinny bar extending past the thicker;
• is cleaner and more consolidated (allows for many disaggregations to fit in one chart);
• allows the category labels to be horizontal for ease of reading;
• brings variety to the visualization types within a report, while still remaining close to the
traditional bar/column chart.
Proportion in tertiary education by gender and field of study (%) Proportion in tertiary education by gender and field of study (%)
70 63.43 62.00 60 60
57.8 58.2 54.79 Arts and Humanities
60 50.3 49.0846
50 Social Sciences, Journalism and Information
36.2637.5
40 32.2835
27.00 25.19 Business, Administration and Law
30
20 Agriculture, Forestry, Fisheries and Veterinary
10
Natural Sciences, Mathematics and Statistics
0
Science, Technology , Engineering and Mathematics
0 10 20 30 40 50 60 70
Female Male
Female Male
Changing chart type and removing data labels
➢ Click the original chart and then click the “Insert” tab in the Excel ribbon (top toolbar menu).
➢ Under the "Charts" section, click the bar chart looking icon and select the clustered bar chart.
➢ Then click on the chart and the “Chart Elements” plus symbol and uncheck the box next to “Data labels”.
2.
3.
6.
5.
4. 7.
1.
Overlapping bars to create bullet bars
➢ Overlapping bars will have either the female values on top or the male values or vice versa.
• Whichever bar is on top is thinner and the bar behind it is thicker in order to properly see the gap.
• If the bar behind is thinner, the data may end up being completely hidden by the thicker bar on top.
➢ Click on the bar series that you would like to show on top. The “Format Data Series” panel will show up.
➢ Click on the bar chart icon in order to see the series options.
1.
2.
Overlapping bars to create bullet bars
➢ After selecting the bars to go on top, under the “Series Options – Plot Series On” switch primary axis to
secondary axis. You will notice the bars are now overlapped and there is another axis on top of the chart.
➢ Under “Gap Width”, move the slider to the right. You will see the orange bars getting thinner. The
percentage in the box next to the slider will increase – you can also enter a percentage in that box (for
example, 430%).
1.
1.
2.
Overlapping bars to create bullet bars
➢ Select the other colored bar series and go directly to the “Gap Width” section within the “Format Data
Series” panel. Do not touch anything in the “Plot Series On” section.
➢ Under “Gap Width”, this time move the slider to the left. You’ll see the blue bars getting thicker. The
percentage in the box next to the slider will decrease – you can also enter a percentage in that box (for
example, 50%).
2.
1.
TASK 9
➢ Go to Sheet named "Task 9" in the Excel file "Training Dataset Day 2".
➢ Choose which bar series will go on top.
➢ Adjust the bars so that the the bar series you want on top is overlapping the other bar series.
➢ Adjust the top bar series so they are skinnier than the other bar series.
➢ Adjust the bottom bar series so they are thicker.
Sorting the bars by value
➢ Click in the column for the variable which you wish to sort. In this case we will sort by female values.
➢ Then under the “Home” tab in the Excel ribbon (top toolbar menu), click on the “Sort & Filter” button in
the “Editing” section. Then click “Sort A to Z” (or “Sort Z to A” for the other direction).
2.
3.
4.
1.
Finalizing the chart
➢ Click the chart title and add an appropriate chart title.
➢ Remove the secondary axis at the top of the chart by clicking the axis and hitting the delete button.
➢ Adjust the colors as per preference or consistency with the report.
0 10 20 30 40 50 60 70
Female Male
TRANSFORMATION
BEFORE AFTER
Proportion in tertiary education by gender and field of study Proportion in tertiary education by gender and field of study (%)
70 63.43 62.00 60 60
57.8 58.2 54.79 Arts and Humanities
60 50.3 49.0846
50 Social Sciences, Journalism and Information
37.5
36.26
40 32.2835
27.00 25.19 Business, Administration and Law
30
20 Agriculture, Forestry, Fisheries and Veterinary
10
0 Natural Sciences, Mathematics and Statistics
0 10 20 30 40 50 60 70
Female Male Female Male
What We Do
TRANSFORMING A GROUPED COLUMN CHART
INTO A LINE CHART
GROUP DISCUSSION
➢ Discussion –What other ways could this visual be shown that are more efficient? What do you like
about this visualization? What would you change in this visual based on data visualization principles?
Prevalence of stunting (% of children under 5) by gender and severity
41
37
34 34 35 35
34
30 30 31
29
27
17 18
16 15 15 16
13 13 14 13 14
11
1991 2004 2006 2011 2014 2018 1991 2004 2006 2011 2014 2018
DHS-I DHS-II DHS-III DHS-IV DHS-V DHS-VI DHS-I DHS-II DHS-III DHS-IV DHS-V DHS-VI
Stunting Severe stunting
Girls Boys
GROUP DISCUSSION
➢ Discussion – Can you understand the time series insights better? Is there anything you would still
change in the visual on the right?
0
1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017
Changing chart type to line chart
➢ Click the original chart and then click the “Insert” tab in the Excel ribbon (top toolbar menu).
➢ Under the "Charts" section click the line chart looking icon and select the line chart with the dots.
2.
3.
4.
1.
Adjusting the frequency of years on the x-axis
➢ The current chart is misleading with inconsistent frequency of years along the x-axis. To adjust this, click
the x-axis of the chart with the dates. You should see the “Format Axis” panel on the right side.
➢ In the “Format Axis” panel, click the bar chart icon and open the “Axis Options” section.
➢ Under “Axis Options”, change the setting from “Automatically select based on data” to “Date axis”.
You should now see that the x-axis has equally spaced years from 1991 to 2017.
Chart Title
2. 45
40
35
30
25
3. 20
15
1. 10
5
0
1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017
Chart Title
2. 45
1. 40
35
30
25
20
15
10
5
0
1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017
Chart Title
45
40
35
2. 30
1. 3. 25
20
15
10
5
0
4. 1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017
2.
3. 5.
1.
4.
6.
7.
Moving the legend and adding a title
➢ Click the chart title and add an appropriate chart title.
➢ To move the legend to the right side, click on the chart and then on the “Chart Elements” plus symbol.
➢ To the right of “Legend”, click on the arrow. Then select the position of the legend – in this case “Right”.
2.
Prevalence of stunting (% of children under 5) by
1. gender and severity
45
40
35
3. 4. 30
Stunting Female
25
Stunting Male
20
15 Severe Stunting Female
1991
1995
1999
2003
2007
2011
2015
1993
1997
2001
2005
2009
2013
2017
Widening the chart
➢ The chart seems a bit squished making the years on the x-axis unreadable at a 90 degree angle.
➢ Pull the left side of the chart at the circle towards the left until you see the years are horizontal again.
40
35
1. 30
Stunting Female
25
Stunting Male
20
Severe Stunting Female
15 Severe Stunting Male
10
0
1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015 2017
TRANSFORMATION
BEFORE AFTER
1990 1992 1994 1996 1998 2000 2002 2004 2006 2008 2010 2012 2014 2016 2018
TASK 10
➢ Go to Sheet named "Task 10" in the Excel file "Training Dataset Day 2".
➢ Change the line type for one of the location disaggregation lines from solid to dashed (either for
national or for urban).
➢ Resize the marks (dots) along the line to be larger (size 7 for example).
➢ Change the color of the other disaggregation so that there are only two colors in the chart – one for
female and the other for male.
What We Do
PRACTICE EXERCISE:
Create a visual of your own with data from the Training Dataset file
Exercise – create your own visual
➢ Using any data from any sheet in the "Training Dataset Day 1" or "Training Dataset Day 2" Excel files,
create your own visual with more than four data points.
➢ The gender factbook’s colors are
• Female: #006389; RGB (0, 99, 137)
• Male: #00B0AB; RGB (0,176, 171)
➢ Use the "Data Visualization Tip Sheet" handout/Word file to review your visual prior to finalizing.
➢ A few major questions to keep in mind (more details in the "Data Visualization Tip Sheet" handout):
• Is the visual showing the gender-relevant insight or does data need to be transposed or
restructured?
• Does it look clean or cluttered? Are there unnecessary gridlines, labels, axes duplicate encodings?
• Does the visual have proper sorting, grouping, legend order, etc.?
WIDEN PLOT
MOVE PLOT
(CORRECT)
What We Do
ADDITIONAL LINE CHART STEPS - LABELS
Adding data labels
➢ To add data labels, click on the chart and then on the “Chart Elements” plus symbol.
➢ Check the box next to “Data Labels”.
2.
1. 3.
Changing position of data labels
➢ The data labels are currently overlapping. The position of the data labels should be changed so that the
line above has data labels above the line and the line below has data labels below the line.
➢ Click on the line that needs the data label change and then on the “Chart Elements” plus symbol. Click the
arrow to the right of “Data Labels” and select “Below”. Repeat this for the other blue line.
2.
3.
1.
4.
Removing gridlines and y-axis labels
➢ Since there are data labels, there is no longer a need for the y-axis labels or gridlines.
➢ To remove the gridlines, click on the chart and then on the “Chart Elements” plus symbol and uncheck
“Gridlines”.
➢ To remove the y-axis labels, click on the y-axis labels and hit the "Delete" button on the keyboard.
2.
1.
3.
STRENGTHENING
GENDER
STATISTICS
DATA VISUALIZATION
TRAINING MODULE 6:
DATAWRAPPER MAPS
Contents
1. Recap of data visualization steps and breakdown
2. Setting up the Datawrapper tool account login
3. Getting started with visualizing maps in Datawrapper
4. Designing, formatting, & annotating the data visualization
5. Downloading the visualization
Specific actions/functions
1. Selecting the chart type (slides 18-21)
2. Uploading data, reuploading data (slides 22-37, 40-42, 82-85)
3. Cleaning and adjusting data structure (slides 28-39)
- in Datawrapper (slides 28 – 33)
- in Excel (slides 34 – 39)
4. Choosing a color scale (slides 45 – 50)
5. Changing the color gradient (slides 51 – 56)
6. Changing the color legend (slides 58-63)
7. Adding legend labels (slides 64-66)
8. Adding data labels (slides 68-71, 87-89)
9. Merging values from two columns into a third column in Excel
using =CONCATENATE() function (slides 72 – 80)
10. Finalizing annotations: title, subtitle, source (slide 89)
11. Downloading the visual (slides 91-95)
1. Recap of data
visualization steps
and breakdown
Recap: steps for creating a data visualization
Explore/try
visualization
options Three-fourths of creating a visualization is add
Optional: cleaning
Design, format, Publish and/or
Upload data and finalizeand formatting the data in the proper additional
download way to:
visualization visualization annotations
as needed
Adjust data match the required data structure, inputs,
structure
and features of the intended visualization type
25%
Formatting &
75% annotating
Three-fourths of creating a visualization is cleaning
the visual
Properly and formatting the data in the proper way to:
cleaning,
match the required data structure, inputs,
transposing and features of the intended visualization type
& preparing
the data highlight the right message and insights
Difference/gap between the types of unpaid work for a given gender. Difference/gap between men’s and women’s time spent on a given
type of unpaid work.
2. Setting up the
Datawrapper tool
account login
Set up Datawrapper account
➢ Type https://app.datawrapper.de/ in your internet browser.
➢ Click "Create a new account" in the bottom of the grey box.
➢ Sign up with your email or with another account like Google, Microsoft, Github, or Twitter.
Set up Datawrapper account
➢ Enter your email address and create a password at least 8 characters long.
Set up Datawrapper account
➢ Once you have created an account, it will automatically log you in. You should see this screen.
➢ In the right hand corner you will see a message alerting you that your email address needs to be
confirmed.
➢ Check your inbox for the Datawrapper email.
Confirm Datawrapper account
➢ Once you find the email from Datatwrapper in your inbox, open it. It should look like this.
➢ Click the "Confirm your account" link in the email.
➢ Then log in to your Datawrapper account again.
Datawrapper account successfully confirmed
➢ After logging in you’ll see a green box in the top right that confirms you have successfully activated
your email address for Datawrapper.
3. Getting started
with visualizing maps
in Datawrapper
Log in to Datawrapper
➢ Type https://app.datawrapper.de/ in your internet browser.
➢ Click "Sign in with Email".
Log in to Datawrapper
➢ Enter your email address and password.
➢ Click the blue "Login" button.
Select a chart category
➢ Click the "Create New" button or "Nouveau" button.
➢ Then select the chart type: Chart, Map, or Table. For this first visual we will select "Map".
Select a map type
➢ There are three map types: Chloropleth map, Symbol map, and Locator map.
➢ Select the "Chloropleth map".
Select a geographic region
➢ In the search bar, type a geographic region (i.e. world, region name, country name).
➢ For our visual, we will type in Cameroon to find the Cameroonian maps.
Select the geographic map level for Cameroon
➢ There are two geographic map levels: Departments and Provinces.
➢ Choose the level for which you have data. We will select "Provinces". You will see a preview of the map.
➢ Then click the "Proceed" button.
What We Do
UPLOADING DATA
Upload data from Excel or CSV file
➢ Upload an Excel or CSV file by clicking the "Upload file" button. The box to select the file will pop up.
➢ Select the file from the folder on your computer and click "Open".
Upload data from Excel or CSV file
➢ A successful upload will show a green check mark next to the "Upload file" button.
➢ If there is more than one sheet in the Excel file, there will be a dropdown to select which sheet to use.
Match the data with the map
➢ Next to the "Upload" tab, click the "Match" tab.
➢ Make sure that the matching key is set to the right category (i.e. Name vs. Postal code for the province).
➢ Select and/or verify the column for the "Name" as well as the column for the "Values".
Check the data
➢ Next to the Match tab, click the Check tab. You’ll notice there is a yellow warning symbol and a yellow
warning box that shows two geographic areas do not match with data.
➢ In red, 4 region names do not match: Centre (Sands Yaounde), Yaounde, Littoral (Sans Douala), Douala.
➢ The region names need to be adjusted in the data table to match.
Required map data structure
➢ The current data structure separates data for cities Douala and Yaounde from their provinces.
➢ The map only takes data for provinces so the data structure must be adjusted in Datawrapper or Excel.
1.
2.
Remove unnecessary rows in Excel
➢ You should then see that the row has disappeared.
• In case you deleted the wrong row, you can “undo” the action by clicking on the “Undo” button in
the toolbar menu at the top.
➢ The final dataset after removing both rows for Douala and Yaounde is on the right side.
Save the data in Excel
➢ Save the Excel file so that the changes are not lost.
• It can be saved with the save symbol or by clicking File in the toolbar menu on the top.
• If you do not want to overwrite the Excel file, click Save As instead of Save and save the file under a
different name in your preferred location. For example, Cameroon Map Data Updated. Click "Save".
Reupload data to Datawrapper
➢ Option 1: Reupload Excel/CSV file by clicking the "Upload file" button, selecting a file, and clicking "Open".
➢ Option 2: Copy the data from the Excel file and paste it in the blank box and click the arrow on the right.
Option 1
Option 2
Successful upload of the correct data structure
➢ With the Copy/Paste method, there will be a popup notifying that data was pasted with option to revert.
➢ To verify that all data are correctly uploaded or pasted, review both the "Match" and "Check" tabs again
and adjust accordingly if needed.
Successful upload of the correct data structure
➢ The "Check" tab shows a green check mark and a green box that says "All map regions were matched to a
row in your dataset"
➢ Click the "Proceed" button.
4. Designing and
formatting the visual
Default map visual
➢ In the "Visualize" tab, you will already see a preview of your map with the default elements.
➢ On the left are buttons to customize the colors, legend, and layout.
What We Do
Continuous: assigns each value Steps: assigns all values within a specified
a different color on the gradient range (bucket/step) the same color
Default step color scale: linear
➢ Under “Colors” within the “Steps” section, the default number of steps is 5 and can be changed up to 20.
➢ The default steps are “linear” meaning each range is equidistant between the minimum and maximum.
Sequential
Diverging
Selecting a different color palette
➢ Under "Colors", within the “Select palette” section, click the color dropdown list.
➢ Select a different color gradient. Remember some are sequential and others are diverging.
Sequential
Diverging
Customizing the color palette
➢ If you don’t like the color palette options or must match the branding for a report, they can be changed.
➢ Under Color, within the “Select palette” section, click the wrench button.
➢ A color gradient with several color stops will pop up. Click the reverse button to draw attention to low
values rather than high values.
Customizing the color palette
➢ Change the color gradient by sliding one or more of the color stops left or right.
➢ Here by moving the color stops towards the green (left) section, the whole color gradient shifts towards
the blue hues than the green ones.
Customizing the color palette
➢ Change the color gradient by clicking one or more color stops and selecting new colors or entering the
hex codes of the desired color.
➢ On the left, 4 color stops from the original green/blue color palette were changed so that they are all
shades of blue (no green).
➢ On the right, all the color stops were changed to different shades of red.
Adjusting
the color
HEX code
TASK
➢ In the "Refine" tab, choose a color gradient and change the color scale.
➢ The version you submit should have different colors and scale than what is used in the demonstration
walk-through.
What We Do
Option 1
Option 2
Successful upload of the revised dataset
➢ With the Copy/Paste method, there will be a popup notifying that data was pasted with option to revert.
➢ To verify that all data are correctly uploaded or pasted, review both the Match and Check tabs again and
adjust accordingly if needed.
Successful upload of the revised dataset
➢ The Check tab shows a green check mark and a green box that says « All map regions were matched to a
row in your dataset »
➢ Click the proceed button.
What We Do
FINALIZING ANNOTATIONS
(TITLE, SOURCE, NEW DATA LABELS)
Return to the annotation tab of visualization
➢ Go to the “Visualize tab” and then switch from the “Refine” tab to the “Annotate” tab.
➢ Under “Map Labels” within the “Select column” section, you will see the “Birth Registration” column is
still selected.
1.
2.
Select new column for data labels
➢ Under “Map Labels” within the “Select column” section, select the name of the column with the
combined region and data labels you just created.
➢ In the example case, this column is named “Data labels”. The new labels should appear on the map.
Adding the title, description, and source
➢ Under the “Title” section, add a title in the blank box provided.
➢ Under the “Description” section, add any additional details or subtitles in the box provided.
➢ Under the “Data source” section”, add the name and year of survey in the box provided.
5. Downloading the visual
Download the visual
➢ Go to the "Publish & Embed" section after finalizing your visual.
➢ Click the PNG image to view the download options. 1.
2.
Download the visual
➢ Under “Include”, choose whether to download “Just chart” or the chart with “Full header and footer”.
➢ Under “Background”, choose whether you would like the background to be included (white) or
transparent (no background).
➢ Then click “Download image”.
Download option examples: full header/footer
➢ “Transparent” background is a good option for adding the visual to a social media card or infographic that
may have a colored background.
Download option examples: just chart
➢ “Just chart” is the best option if you want to add your own annotations or if the title and source are being
added directly in the report and not in each chart.
A note on publishing
➢ The visual does NOT need to be published in order to download a static image file.
➢ If you wish to create an interactive online version with a URL, you will first need to click Publish Now.
➢ When you publish a visual, the data will be visible to those who have the URL. Data cannot be
downloaded, but the data values can be seen on the data labels and the tooltip if either are enabled.
What We Do
PRACTICE EXERCISE 5:
Create a map in Datawrapper with data from the Training Dataset file
Exercise – create a map in Datawrapper
➢ Using the data in Sheet "Exercise 5 DW Map" from "Training Dataset Day 2" Excel file, create a map in
Datawrapper.
➢ Tips to remember:
• The Gender Parity Index indicator has values above and below 1 with 1 indicating equality (gender
parity).
• Diverging colors scales are used for diverging number scales.
• Captions, annotations, notes, etc. should be used if the data must be interpreted in a certain way
by the reader that may not be obvious.
DATA VISUALIZATION
TRAINING MODULE 7:
DATAWRAPPER
RANGE PLOTS
Contents
1. Recap of data visualization steps and breakdown
2. Setting up the Datawrapper tool account login
3. Getting started with visualizing maps in Datawrapper
4. Designing, formatting, & annotating the data visualization
5. Downloading the visualization
Specific actions/functions
1. Selecting the chart type (slides 18-19)
2. Exploring a sample dataset (slides 20-22)
3. Uploading data (slides 23-25)
4. Changing the chart type (slides 27 – 34)
5. Sorting data values (slides 35-37)
6. Finalizing annotations: title, subtitle, source (slides 38-39)
7. Downloading the visual (slide 40-43)
1. Recap of data
visualization steps
and breakdown
Recap: steps for creating a data visualization
Explore/try
visualization
options Three-fourths of creating a visualization is add
Optional: cleaning
Design, format, Publish and/or
Upload data and finalizeand formatting the data in the proper additional
download way to:
visualization visualization annotations
as needed
Adjust data match the required data structure, inputs,
structure
and features of the intended visualization type
25%
Formatting &
75% annotating
Three-fourths of creating a visualization is cleaning
the visual
Properly and formatting the data in the proper way to:
cleaning,
match the required data structure, inputs,
transposing and features of the intended visualization type
& preparing
the data highlight the right message and insights
Difference/gap between the types of unpaid work for a given gender. Difference/gap between men’s and women’s time spent on a given
type of unpaid work.
2. Setting up the
Datawrapper tool
account login
Set up Datawrapper account
➢ Type https://app.datawrapper.de/ in your internet browser.
➢ Click "Create a new account" in the bottom of the grey box.
➢ Sign up with your email or with another account like Google, Microsoft, Github, or Twitter.
Set up Datawrapper account
➢ Enter your email address and create a password at least 8 characters long.
Set up Datawrapper account
➢ Once you have created an account, it will automatically log you in. You should see this screen.
➢ In the right hand corner you will see a message alerting you that your email address needs to be
confirmed.
➢ Check your inbox for the Datawrapper email.
Confirm Datawrapper account
➢ Once you find the email from Datatwrapper in your inbox, open it. It should look like this.
➢ Click the "Confirm your account" link in the email.
➢ Then log in to your Datawrapper account again.
Datawrapper account successfully confirmed
➢ After logging in you’ll see a green box in the top right that confirms you have successfully activated
your email address for Datawrapper.
3. Getting started
with visualizing maps
in Datawrapper
Log in to Datawrapper
➢ Type https://app.datawrapper.de/ in your internet browser.
➢ Click "Sign in with Email".
Log in to Datawrapper
➢ Enter your email address and password.
➢ Click the blue "Login" button.
Select a chart category
➢ Click the "Create New" button.
➢ Then select the chart type "Chart".
See an example of the data structure for a range plot
➢ There is an example dataset in Datawrapper for range plots.
➢ Click "Select a sample dataset" in the dropdown menu and select the dataset "Gender Pay Gap"
under Range Plot.
See an example of the data structure for a range plot
➢ You will see that there are salaries for men and women by level of education for the United States
in this sample dataset.
See an example of the data structure for a range plot
➢ Go to the “Visualize” section to see what the sample dataset looks like.
See an example of the data structure for a range plot
➢ Return to the “Check & Describe” section to see how the dataset should look when you upload it
for the gender pay gap data.
➢ Based on this you can now create your own dataset in this format and go back to the “Upload
Data” section and paste the values into the white box or upload an Excel or CSV.
What We Do
UPLOADING DATA
Uploading data
➢ Option 1: Copy the data from the Excel file and paste it into the empty box. Click the “Proceed”
button in the bottom right.
Uploading data
➢ Option 2: Upload your Excel/CSV file by clicking on the "XLS/CSV Upload" button then "Upload a
file", selecting a file, and then clicking "Open".
4. Designing and
formatting the visual
What We Do
% change
Changing the visualization elements
➢ Under "Labels," click "Label First Range."
➢ You can now see that the category labels for “Female” and “Male” are visible.
What We Do
➢ Using the data from the "Exercise 6 DW Range Plot" worksheet in the "Training Dataset Day 2" file,
create a range plot in Datawrapper.
➢ Tips to remember:
➢ Select the data labels.
➢ Make sure the category labels are visible.
➢ Make sure the chart is sorted correctly.
➢ Change the colors to the appropriate female and male colors. Don't use stereotypical colors.
DATA VISUALIZATION
TRAINING MODULE 8:
ANNOTATIONS
Contents
1. Recap of data visualization steps and breakdown
2. Using annotations to highlight key information
a. Adding text boxes
b. Incorporating icons
c. Adding lines or shaded shapes to highlight information
3. Exercise to annotate visuals created in Day 1 and Day 2.
1. Recap of data
visualization steps
and breakdown
Recap: steps for creating a data visualization
Explore/try
visualization
options Three-fourths of creating a visualization is add
Optional: cleaning
Design, format, Publish and/or
Upload data and finalizeand formatting the data in the proper additional
download way to:
visualization visualization annotations
as needed
Adjust data match the required data structure, inputs,
structure
and features of the intended visualization type
25%
Formatting &
75% annotating
Three-fourths of creating a visualization is cleaning
the visual
Properly and formatting the data in the proper way to:
cleaning,
match the required data structure, inputs,
transposing and features of the intended visualization type
& preparing
the data highlight the right message and insights
INCORPORATING ICONS
Using icons for highlighting statistics
➢ Icons can be used to create pictograms using the same repeated icon and coloring/shading to show a
proportion like 2 in 5 women representing 40% or a ratio "For every one man there are 5 women who…".
20%
Using icons for highlighting statistics
➢ Infographic-like visuals with just images and numbers can be developed for incorporation in a report, an
executive summary, a social media card, etc.
Using icons for highlighting statistics
➢ Icons can enhance simple visuals like bar charts or pie charts that have very few data points (two or three
data points).
Using icons for highlighting statistics
➢ Icons can enhance simple visuals like bar charts or pie charts that have very few data points (two or three
data points).
Using icons for highlighting statistics
➢ Icons can enhance simple visuals like bar charts or pie charts that have very few data points (two or three
data points).
Using icons for legends
➢ Icons can be added to color legends instead of just having colored or patterned boxes for the legend.
45
40
35
30
25
20
15
10
5
0
1991 2004 2006 2011 2014 2018 1991 2004 2006 2011 2014 2018
DHS-I DHS-II DHS-III DHS-IV DHS-V DHS-VI DHS-I DHS-II DHS-III DHS-IV DHS-V DHS-VI
Stunting Severe stunting
Girls Boys
Inserting an icon
➢ To add an icon to a chart, go to the “Insert” tab in Excel or PowerPoint, click on “Icons” within the
“Illustration” section. A box will pop up of icons to choose from and you can enter a key word or choose a
topic to search for an appropriate icon. Once you have found an icon, click the “Insert” button.
1.
2.
Searching for and choosing an icon
➢ On the left I have selected the “Landscape” topic button. On the right I typed in “Rural” in the search bar
• Sometimes you have to use synonyms to find more options perhaps “Farm”, “Land”, “Agriculture”,
etc. Or instead of typing “Urban” typing “Building”, “City”, “Road”, “Street”, etc.
➢ There are usually two versions of the same icon – one outlined and one filled.
• The choice should depend on whether the icon is visible enough with just the outline, or it needs to
be more prominent.
Searching for and choosing an icon
➢ When creating a pictogram, typically both outlined and filled icons are selected to demonstrate the
proportion out of the whole.
• For example, 2 filled and 3 outlined versions of the same icon should signify 2 out of 5 or 40%.
• Alternatively, you could use only filled icons and use color to represent the statistic: one color for
the proportion and the other for the remaining icons that make up the whole.
• To change the color of an icon, click on the icon and the “Format Shape” panel should pop up on the
right. Click on the paint bucket icon in the "Color" section and choose a color from the dropdown.
Ensuring the icon is visible
➢ Make sure the icon is in front of the bar and not hiding behind the bar. This can be done in several ways:
• In the "Home" tab, go to the "Drawing" section and click "Arrange" and choose one of the options to
move the icon or graphic forward or backward.
• Click on the icon and go to the "Shape Format" tab and go to "Bring Forward" and click on "Bring
Forward". If you don't see the icon on the screen, try clicking on the chart, go to the "Shape Format"
tab and go to "Send Backward" and click "Send to Back“.
1.
1.
OR
2.
2.
TASK
➢ Go to Sheet named "Demo 10 icons" in the Excel file "Training Dataset Day 2" and copy/paste the chart
to Powerpoint.
• Create a color legend using the female and male icons. Female is hex code #C4831E and RGB (196,
131, 30) and male is hex code #440E5F and RGB(68, 14, 95).
• Add an icon in front of the bars signifying rural as well as a source.
25
13
TASK
➢ Create a pictogram in PowerPoint using the female icon to represent the statistic "4 in 5 women have a
national identification card".
➢ Add the text 4 in 5 women have a national identification card with the numbers larger than the text.
➢ Copy the finished product into the Sheet named "Demo 10 icons" in the Excel file "Training Dataset Day 2".
What We Do
ADDING LINES OR SHAPES
FOR HIGHLIGHTING INSIGHTS
Adding arrows for easier interpretation
➢ Arrows when combined with text can provide more context about how the reader should interpret the
scale or units of the chart.
➢ Arrows can also be used with a data callout for percentage point difference to allow readers to easily
quantify the gaps that they are seeing visually.
Adding lines for easier interpretation
➢ Solid or dotted lines can be added as reference or baseline values. They are often added with text that
allows the reader to better interpret data values above and below the lines.
➢ Lines for gender parity are always added to signify equal male and female values in order to help the
readers understand how far from gender equality each observation is.
Adding lines and shading to highlight insights
➢ Dotted lines, arrows, and shaded areas can help draw the readers’ eyes to gaps, outliers, ranges of years,
values, etc., and other insights more easily.
19.8
11.3
8.4
3.2
Rural Urban
Female Male
TASK
➢ Add a shaded rectangle or outline of a rectangle to the range plot from the demo in Module 7.
➢ Copy the finished product into the Sheet named "Demo 11 Lines Shapes" in the Excel file
"Training Dataset Day 2".
TASK
➢ Go to Sheet named "Demo 9 Text" using the chart created in Module 5.
➢ Transform the "Overall" data point in the purple column chart to a line. Add a notation that it is the
overall value.
PRACTICE EXERCISES:
Exercise – annotate demo data visualizations
➢ Take any of the visuals we have created in the demonstrations or tasks and polish the visualization
• Add annotations (labels, arrows, shading, etc.), a title (potentially a subtitle), and source.
➢ Tips to remember:
• Annotations should use an accent color that is complementary to the visual, so it doesn’t introduce
too many colors new colors into the visual.
• Annotations and highlights should contrast well with the original visual.
• Annotations and highlights should help the reader, not distract and make the visual cluttered.