2/ Organizing and Visualizing Variables: Dcova

Data visualization organizes and represents data in a way that allows trends and relationships to be easily perceived, in order to better understand information and drive business decision making. Various methods are described for organizing numerical and categorical data into frequency distributions, contingency tables, scatter plots, histograms and other visual formats to analyze and summarize the data. Care must be taken to present data in a clear way that does not obscure trends or create false impressions.

Uploaded by

Thong Phan

2/ Organizing and Visualizing Variables

Data visualisation means organizing and representing data so that trends and relationships
are easier to perceive and otherwise complex information can be communicated effectively.
The aim is to understand the information better and, ultimately, to drive better business
decision making.

DCOVA

  To apply statistics properly, you should follow a framework that minimizes possible
errors. DCOVA names the five steps of that framework:
  Define the data you want to study in order to solve a problem or meet an objective (e.g.
study sales data in order to solve the problem of advertising expenditure).
  Collect the data from appropriate sources.
  Organise the data collected by developing tables.
  Visualise the data collected by developing figures/charts.
  Analyse the data collected to reach conclusions and present results.

A summary table tallies the frequencies or percentages of items in a set of categories so that
you can see differences between categories.
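
As a minimal sketch of a summary table, the snippet below tallies frequencies and percentages with Python's standard library. The `responses` list and its category names are invented for illustration and do not come from the notes.

```python
from collections import Counter

# Hypothetical categorical responses (assumption: made-up purchase channels).
responses = ["online", "in store", "online", "phone",
             "online", "in store", "online", "phone"]

def summary_table(values):
    """Return (category, frequency, percentage) rows, most frequent first."""
    counts = Counter(values)
    total = sum(counts.values())
    return [(category, freq, 100 * freq / total)
            for category, freq in counts.most_common()]

rows = summary_table(responses)
for category, freq, pct in rows:
    print(f"{category:<10}{freq:>5}{pct:>8.1f}%")
```

Listing the categories from most to least frequent makes the differences between them easier to see at a glance.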
A Contingency Table Helps Organize Two or More Categorical Variables
  Used to study patterns that may exist between the responses of two or more
categorical variables.
  Cross tabulates or tallies jointly the responses of the categorical variables.
  For two variables the tallies for one variable are located in the rows and the tallies
for the second variable are located in the columns.
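
The cross tabulation described above can be sketched in a few lines. The `observations` pairs (a fund type and a risk level per observation) are hypothetical values chosen only to show the mechanics.

```python
from collections import Counter

# Hypothetical paired responses: (fund type, risk level) for each observation.
observations = [
    ("growth", "low"), ("growth", "high"), ("value", "low"),
    ("growth", "high"), ("value", "low"), ("value", "high"),
]

def contingency_table(pairs):
    """Cross-tabulate two categorical variables into {row: {column: count}}."""
    joint = Counter(pairs)  # joint tally of each (row, column) combination
    row_labels = sorted({r for r, _ in pairs})
    col_labels = sorted({c for _, c in pairs})
    return {r: {c: joint[(r, c)] for c in col_labels} for r in row_labels}

table = contingency_table(observations)
for row_label, cells in table.items():
    print(row_label, cells)
```

The first variable ends up in the rows and the second in the columns, matching the layout the notes describe for two variables.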

An ordered array is a sequence of data, in rank order, from the smallest value to the largest
value. It shows the range (minimum value to maximum value) and may help identify outliers
(unusual observations).
The frequency distribution is a summary table in which the data are arranged into
numerically ordered classes.
  You must pay attention to selecting an appropriate number of class groupings for the
table, determining a suitable width for each class grouping, and establishing the
boundaries of each class grouping so that classes do not overlap.
  The number of classes depends on the number of values in the data. With a larger
number of values, there are typically more classes. In general, a frequency distribution
should have at least 5 but no more than 15 classes.
  To determine the width of a class interval, divide the range (highest value minus
lowest value) of the data by the number of class groupings desired.

Organizing Numerical Data: Frequency Distribution Example

  Sort raw data in ascending order:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58.
  Find range: 58 - 12 = 46.
  Select number of classes: 5 (usually between 5 and 15).
  Compute class interval (width): 10 (46/5 then round up).
  Determine class boundaries (limits):
o  Class 1: 10 but less than 20.
o  Class 2: 20 but less than 30.
o  Class 3: 30 but less than 40.
o  Class 4: 40 but less than 50.
o  Class 5: 50 but less than 60.
  Compute class midpoints: 15, 25, 35, 45, 55.
  Count the observations in each class (frequency).
  Calculate the percentage, cumulative frequency, and cumulative percentage for each class.
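
The steps of this example can be reproduced with a short script. This is a sketch under two assumptions that the notes only imply: the computed width is rounded up to the next multiple of 10, and the first class starts at 10. The function and variable names are illustrative.

```python
import math

data = [12, 13, 17, 21, 24, 24, 26, 27, 27, 30,
        32, 35, 37, 38, 41, 43, 44, 46, 53, 58]

def frequency_distribution(values, num_classes, width, start):
    """Tally values into classes [lower, upper) and return one dict per class."""
    total = len(values)
    rows, cumulative = [], 0
    for i in range(num_classes):
        lower = start + i * width
        upper = lower + width
        freq = sum(1 for v in values if lower <= v < upper)
        cumulative += freq
        rows.append({
            "class": (lower, upper),
            "midpoint": (lower + upper) / 2,
            "frequency": freq,
            "percentage": 100 * freq / total,
            "cumulative_frequency": cumulative,
            "cumulative_percentage": 100 * cumulative / total,
        })
    return rows

value_range = max(data) - min(data)       # 58 - 12 = 46
raw_width = value_range / 5               # 46 / 5 = 9.2
width = math.ceil(raw_width / 10) * 10    # assumption: round up to a multiple of 10
table = frequency_distribution(data, num_classes=5, width=width, start=10)

for row in table:
    lower, upper = row["class"]
    print(f"{lower} but less than {upper}: {row['frequency']} "
          f"({row['percentage']:.0f}%, cumulative {row['cumulative_percentage']:.0f}%)")
```

Using half-open classes (`lower <= v < upper`) mirrors the "10 but less than 20" wording, so no value can fall into two classes.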

Frequency Distribution

A frequency distribution condenses the raw data into a more useful form and allows a quick
visual interpretation of the data. It also reveals the major characteristics of the data set,
including where the data are concentrated or clustered.

The bar chart visualizes a categorical variable as a series of bars. The length of each bar
represents either the frequency or percentage of values for each category. Each bar is
separated by a space called a gap.

The pie chart is a circle broken up into slices that represent categories. The size of each slice
of the pie varies according to the percentage in each category.

The doughnut chart is the outer part of a circle broken up into pieces that represent
categories. The size of each piece of the doughnut varies according to the percentage in each
category.

The Pareto Chart

  Used to portray categorical data (nominal scale).
  A vertical bar chart, where categories are shown in descending order of frequency.
  A cumulative polygon is shown in the same graph.
  Used to separate the “vital few” from the “trivial many.”
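
The numbers behind a Pareto chart can be sketched as follows: categories sorted by descending frequency, plus the running cumulative percentage that the cumulative polygon would trace. The `defects` counts are hypothetical.

```python
def pareto_table(counts):
    """Sort categories by descending frequency and add cumulative percentages."""
    total = sum(counts.values())
    ordered = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    rows, running = [], 0
    for category, freq in ordered:
        running += freq
        rows.append((category, freq, 100 * running / total))
    return rows

# Hypothetical defect counts (assumption: invented for illustration).
defects = {"scratch": 10, "dent": 50, "misprint": 5, "wrong colour": 30, "other": 5}

rows = pareto_table(defects)
for category, freq, cumulative_pct in rows:
    print(f"{category:<14}{freq:>4}{cumulative_pct:>8.1f}%")
```

In this made-up data the first two categories already account for 80% of the defects, which is the kind of "vital few" a Pareto chart is meant to expose.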

The side-by-side bar chart visualizes the data from a contingency table.

Stem-and-Leaf Display

A simple way to see how the data are distributed and where concentrations of data exist.

METHOD: Separate the sorted data series into leading digits (the stems) and the trailing
digits (the leaves).

A stem-and-leaf display organizes data into groups (called stems) so that the values within
each group (the leaves) branch out to the right on each row.
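
A minimal sketch of the method, assuming non-negative two-digit integers so that the tens digit is the stem and the units digit is the leaf. It reuses the 20 values from the frequency distribution example.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group values by tens digit (stem), keeping units digits (leaves) in order."""
    groups = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)  # e.g. 27 -> stem 2, leaf 7
        groups[stem].append(leaf)
    return dict(groups)

data = [12, 13, 17, 21, 24, 24, 26, 27, 27, 30,
        32, 35, 37, 38, 41, 43, 44, 46, 53, 58]

display = stem_and_leaf(data)
for stem, leaves in display.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")
```

Each printed row is one stem with its leaves branching out to the right, so longer rows mark where the data are concentrated.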

The Histogram
  A vertical bar chart of the data in a frequency distribution is called a histogram.
  In a histogram there are no gaps between adjacent bars.
  The class boundaries (or class midpoints) are shown on the horizontal axis.
  The vertical axis is either frequency, relative frequency, or percentage.
  The height of each bar represents the frequency, relative frequency, or percentage.

The Polygon
  A percentage polygon is formed by having the midpoint of each class represent
the data in that class and then connecting the sequence of midpoints at their
respective class percentages.
  The cumulative percentage polygon, or ogive, displays the variable of interest
along the X axis, and the cumulative percentages along the Y axis.
  Polygons are useful when there are two or more groups to compare.
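
The points that each polygon connects can be computed as sketched below. The boundaries are those of the frequency distribution example, and the frequencies (3, 6, 5, 4, 2) are tallied from its 20 values. Plotting the cumulative percentage against each class boundary is an assumption about the convention in use.

```python
def polygon_points(boundaries, frequencies):
    """Return (percentage polygon points, cumulative percentage polygon points)."""
    total = sum(frequencies)
    midpoints = [(lo + hi) / 2 for lo, hi in zip(boundaries, boundaries[1:])]
    # Percentage polygon: each class is represented by its midpoint.
    percentage = [(m, 100 * f / total) for m, f in zip(midpoints, frequencies)]
    # Ogive: percentage of values below each class boundary, starting at 0%.
    cumulative, running = [(boundaries[0], 0.0)], 0
    for upper, f in zip(boundaries[1:], frequencies):
        running += f
        cumulative.append((upper, 100 * running / total))
    return percentage, cumulative

boundaries = [10, 20, 30, 40, 50, 60]
frequencies = [3, 6, 5, 4, 2]

percentage, cumulative = polygon_points(boundaries, frequencies)
print(percentage)
print(cumulative)
```

Because both polygons use percentages rather than raw counts, groups of different sizes can be drawn on the same axes and compared directly.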

Variables: The Scatter Plot

  Scatter plots are used for numerical data consisting of paired observations taken from two
numerical variables.
  One variable is measured on the vertical axis and the other variable is measured on the
horizontal axis.
  Scatter plots are used to examine possible relationships between two numerical
variables.

Variables: The Time Series Plot

  A time-series plot is used to study patterns in the values of a numerical variable over time.
  In a time-series plot, the numerical variable is measured on the vertical axis and the time
period is measured on the horizontal axis.

Visualize Many Variables

A pivot table:

  Summarizes variables as a multidimensional summary table.
  Allows interactive changing of the level of summarization and formatting of the
variables.
  Allows you to interactively “slice” your data to summarize subsets of data that
meet specified criteria.
  Can be used to discover possible patterns and relationships in multidimensional
data that simpler tables and charts would fail to make apparent.
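
Pivot tables are normally built interactively in spreadsheet software, but the underlying idea can be sketched in plain Python. The `sales` records, their field names, and the `pivot` helper are all hypothetical. The optional `where` argument stands in for "slicing" the data to a subset that meets a criterion.

```python
from collections import defaultdict

# Hypothetical sales records (assumption: fields and values invented for illustration).
sales = [
    {"region": "north", "product": "A", "year": 2023, "amount": 100},
    {"region": "north", "product": "B", "year": 2023, "amount": 150},
    {"region": "south", "product": "A", "year": 2023, "amount": 200},
    {"region": "south", "product": "A", "year": 2024, "amount": 250},
    {"region": "north", "product": "A", "year": 2024, "amount": 120},
]

def pivot(records, row_key, col_key, value_key, where=None):
    """Sum value_key by (row_key, col_key), keeping only records that pass `where`."""
    table = defaultdict(lambda: defaultdict(int))
    for rec in records:
        if where is None or where(rec):
            table[rec[row_key]][rec[col_key]] += rec[value_key]
    return {row: dict(cols) for row, cols in table.items()}

# Summarize all records, then slice to a single year.
by_region_product = pivot(sales, "region", "product", "amount")
only_2024 = pivot(sales, "region", "product", "amount",
                  where=lambda rec: rec["year"] == 2024)

print(by_region_product)
print(only_2024)
```

Changing `row_key` and `col_key` changes the level of summarization, which loosely mirrors rearranging fields in an interactive pivot table.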
When organizing and visualizing data, you need to be mindful of:

o  The limits of others' ability to perceive and comprehend.
o  Presentation issues that can undercut the usefulness of the methods from this
chapter.

It is easy to create summaries that:

o  Obscure the data, or
o  Create false impressions.
