0% found this document useful (0 votes)

2 views10 pages

Exploratory Data Analysis and Visualization

This document covers exploratory data analysis and visualization techniques using R, including importing various data formats, examining data frames, identifying missing values and outliers, and generating descriptive statistics. It also discusses creating frequency and proportion tables for qualitative variables, as well as visualizing data through bar charts, pie charts, histograms, and boxplots. The document includes class exercises to apply these concepts using specific datasets.

Uploaded by

Rezaul Karim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views10 pages

Exploratory Data Analysis and Visualization

Uploaded by

Rezaul Karim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Chapter: Exploratory Data Analysis and Visualization

Topics to be covered:
1. Importing Data in Various Formats to R Environment
2. Examining a Data Frame
3. Examining for Missing Values in a Data Frame
4. Examining Extreme Values/ Outliers
5. Use of Table and Proportion Table for Qualitative Variables
6. Use of Two-Way Table and Proportion Table for Qualitative Variables
7. Basic Descriptive Statistics for Quantitative Variables
8. Basic Data Visualization: Qualitative Variables
9. Basic Data Visualization: Quantitative Variables

# Importing Data in Various Formats to R Environment

It is important to know how to import data in various formats in R environment. In this section,
we will learn:
Importing excel data
Importing CSV data
Importing data in other Formats

myData <- readxl :: read_excel (“file_name.xlsx”, header = T, sheet = “sheet_name”)

External package readxl is required; data will be imported as a tibble (another flexible and
faster implementation of data frame)

Class Exercise-1: Import a sheet named Major from an excel workbook called Collected_Data_1
and label it as myMajor and examine the structure of myMajor.

Class Exercise-2: Import a single sheet excel worksheet called Gig and label it as myGigData and
examine the structure of myGigData.

myData <- read.csv (“file_name.csv”, header = T)

No external package is required; Data will be imported as a data frame.

Class Exercise-3: Import the Gig.csv data file and label it as myData and examine its structure.

1|Page
# Examining a Data frame

A data frame can be systematically examined in R by evaluating the following aspects:

 Structure of the Data Frame (str function)

 Dimension of the Data Frame (dim function)
 Top Six Rows (head function)
 Bottom Six Rows (tail function)
 Top 4 and Bottom 4 Rows (headTail function: package: psych)
 Statistical Summary of Data Frame (summary function)
 Names of the variables or columns (names or colnames function)
 Names of the rows or observations (rownames function)

Class Exercise-4: Evaluate myData data frame created by importing the Gig.csv file in terms of
dimension, top six rows, bottom six rows, top 4 & bottom 4 rows, and the statistical summary
of the data frame. Print the variables or column names of myData. Print the first 3 row names
of myData.

# Examining for Missing Value in a Data Frame

The presence or absence of missing value(s) can be examined for the entire data frame by using
the following codes in R:
is.na(dataframe) #will return a logical object
sum(is.na(data_frame)) # will return the number of rows in which have missing values
rows_with_na <- myData [!complete.cases(data_frame), ]

We can determine the variable-wise or column-wise missing values by using the following code
in R with pipe (|>) function:
colsums(is.na(dataframe$vector1)) |>
t() |>
t()

Class Exercise-5: Determine the number of rows containing missing values and print the
number of rows containing the missing values and label it rows_with_na from the data frame
myData. Show the variable or column-wise list of missing values. In which columns or variables,
don’t we have any missing values?

The presence or absence of missing value(s) can be examined for a particular variable in the
data frame by using the following functions:
is.na(data_frame$vector_1) # will return a logical object
which(is.na(data_frame$vector_1)) # will return the number of rows with NA in vector_1

2|Page
sum(is.na(data_frame$vector_1)) # will return the number of missing values in vector_1
na_in_vector_1 <- myData [is.na(data_frame$vector_1), ]

Class Exercise-6: Determine the number of missing values in industry variable in myData and
identify the rows containing missing value in the industry variable.

# Examining Extreme Values/ Outliers

There are several ways to detect extreme values or outliers. But the most frequently used
method to detect outliers is the boxplot (also known as box-and-whisker plot) method
developed by John Tukey in 1970:

Here the boxplot can be constructed by using the following code in R:

boxplot(dataframe$vector, horizontal = TRUE) # plot the boxplot horizontally
The number of small circles indicates the number of distinct outliers we have in the vector.
All the outliers can be identified by using the following code in R:
boxplot(dataframe$vector, horizontal = TRUE, plot = FALSE)$out
The total number of outliers can be identified by using the following code in R:
length(boxplot(dataframe$vector, horizontal = TRUE, plot = FALSE)$out)
All the outliers can be tabulated in terms of their frequency by using the following code in R:
table(boxplot(dataframe$vector, horizontal = TRUE, plot = FALSE)$out)

Class Exercise-7: Import the file stored in onlineshop.csv and label it as onlineshop. Evaluate
the structure of the data frame onlineshop. Use Tukey’s method to identify the number of
distinct outliers in the AGE variable of the data frame. Identify all the outliers of the variable
AGE. How many outliers do we have here? Tabulate the outliers. What is the least frequently
occurring outlier in the AGE variable.

3|Page
We can also create boxplots across the categories by using the following code in R:
boxplot(dataframe$vector1, dataframe$vector2) # vector1 is numeric and vector2 is categorical.

Class Exercise-8: Create a series of boxplots of hourly wage across the categories of industry
from the myData data frame created from importing Gig.csv file. In which industry or
industries, we don’t have any missing value?

# Use of Table and Proportion Table for the Qualitative Variable

To a get a frequency table for a qualitative variable in a dataframe in R, we may run the
following code in R:

table(dataframe$vector)

And to the get the proportion table for a qualitative variable in a dataframe in R, we may run
the following code in R:

proportion(table(dataframe$vector))

Class Exercise-9: Load Gig.csv data into the R environment and store it as myData and create a
subset of myData discarding the missing value and store it as myDataComplete. Create a
frequency table of the qualitative variable industry and also create proportion table of the
same variable.
Solution: Here,
myData <- read.csv("Gig.csv")
myDataComplete <- na.omit(myData)
table(myDataComplete$Industry)
proportions(table(myDataComplete$Industry))

# Use of Two-Way Table and Proportion Table for Multiple Qualitative Variables

To create a table of multiple qualitative variables in R environment, we may use table function
and add two variables separated by “,”, the first variable added will be arranged in the row and
the second variable added will be arranged in the column.

table(dataframe$vector1, dataframe$vector2)

To get the proportion table for multiple qualitative variables, we run

proportion (table(dataframe$vector1, dataframe$vector2), 1) # % calculated across rows

4|Page
proportion (table(dataframe$vector1, dataframe$vector2), 2) # % calculated across columns

Class Exercise-10: Create a two-way frequency table for the variables industry and job in the
myDataComplete data frame. Create a proportion table of the same two variables across the
rows.

Solution: Here,
table(myDataComplete$Industry, myDataComplete$Job)
proportions(table(myDataComplete$Industry, myDataComplete$Job), 1)

# Basic Descriptive Statistics for Quantitative Variables

There are many ways of calculating descriptive statistics for quantitative variables in R. The
following functions, in the base R, have already introduced to calculate various descriptive
statistics:
min()
max()
mean()
var()
sd()
median()
summary()

The summary function in R for a quantitative variable will return a summary of min, 1st
quartile, median or 2nd quartile, 3rd quartile, max, and mean. The syntax used for running
summary function on a quantitative variable vector1 in a data frame is given below:

summary(dataframe$vector1)

Classwork-11: Determine the summary statistics for the variable hourly wage in the dataframe
myDataComplete.

Solution: Here
summary(myDataComplete$HourlyWage)
Min. 1st Qu. Median Mean 3rd Qu. Max.
24.28 34.55 41.82 40.15 46.02 51.00

But one of the most comprehensive way of calculating descriptive statistics in R environment is
using the describe() function from psych package. To install and make the psych package active,
we run the following code

install.packages(“psych”) # If not already installed

5|Page
library (psych) # to make the psych package active in this session
To run the describe function from psych package, we run
describe(dataframe$vector1) # where vector1 is a numerical variable
We can calculate the group-wise statistics by using the describeBy function from psych:
describeBy(dataframe$vector1, dataframe$vector2) # where vector2 is the grouping variable.
If we want to calculate any specific function, across the group, we can use the tapply function in
base R:
Tapply(dataframe$vector1, vector2, function_name)

Class Exercise-12: Calculate the descriptive statistics of the variable hourly wage in the
dataframe myDataComplete. Calculate the descriptive statistics across the industry variable.
Calculate the mean values of HourlyWage across the industry variable
Solution: Here,
describe(myDataComplete$HourlyWage)
describeBy(myDataComplete$HourlyWage, myDataComplete$Industry)
tapply(myDataComplete$HourlyWage, myDataComplete$Industry, mean)

Skewness and Kurtosis can be calculated from the e1071 package in the following way:
skewness(dataframe$vector1) # where vector1 is a numeric vector
kurtosis(dataframe$vector1) # where vector1 is a numeric vector
Skewness and Kurtosis can also be calculated from psych package from the skew() and
kurtorsi() functions.

Class Exercise-13: Calculate the skewness and kurtosis from the HourlyWage variable in the
myDataComplete dataframe and interpret.

# Basic Data Visualization: Qualitative Variables

For qualitative variables, the most commonly used methods to visualize categorical or
qualitative variables are: Bar Chart and Pie Chart.

Bar Chart: We can construct a bar chart in R environment by using the barplot function by
setting certain parameters

barplot (table(dataframe$vector1), # where vector1 is qualitative variable

main = “The Title of the plot”,
xlab = “The label for x-axis”,
ylab = “the label for y-axis”)

Class Work-14: Construct a bar chart from the variable industry in the myDataComplete data
frame by setting the title as Industry Distribution of Workers and labeling x and y axis as you
may deem appropriate without changing the color parameter in the R environment.

6|Page
Solution: Here,
barplot(table(myDataComplete$Industry),
main = "Industry Distribution of Workers",
xlab = "Industry",
ylab = "Numbers of Employees")

Pie Chart: We can create a pie chart in R environment by using the pie function by setting
certain parameters:

pie(table(pie(table(myDataComplete$Industry),
main = "The Title of the Plot"))

Class Work-15: Construct a bar chart from the variable industry in the myDataComplete data
frame by setting the title as Industry Distribution of Workers without changing the color
parameter in the R environment.

Solution: Here,
pie(table(myDataComplete$Industry),
main = "Industry Distribution of Workers")

# Basic Data Visualization: Quantitative Variables

The most commonly used methods to visualize quantitative variables are:

 Histogram and
 Boxplot

Histogram: A basic histogram for a quantitative variable can be constructed by using the hist()
function by setting certain parameters:

hist(vector1, # vector1 is a numerical (quantitative) vector)

main = "The Title of the Plot",
xlab = " The label for x-axis ",
ylab = " The label for y-axis")

Class Work-16: Construct a histogram for the hourly wage in the myDataComplete dataframe
by setting title as The Distribution of Hourly Wage and labeling x and y axis as you may deem
appropriate without changing the color parameter in the R environment.

Boxplot: A basic boxplot for a quantitative variable can be constructed by using the boxplot
function by setting certain parameters:

boxplot(vector1, # vector1 is a numerical (quantitative) vector)

7|Page
main = " The Title of the Plot ",
horizontal = T) # by default horizontal is set to FALSE

Class Work-17: Construct a boxplot for the hourly wage in the myDataComplete data frame by
setting title as The Distribution of Hourly Wage and labeling x and y axis as you may deem
appropriate without changing the color parameter in the R environment.

8|Page
Exercises: Exploratory Data Analysis and Visualization

Exercise-1: Import the onlineshop.csv file and label it as onlineshop in the R environment.
Evaluate onlineshop in terms of dimension, top six rows, bottom six rows, top 4 & bottom 4
rows, and the statistical summary of the data frame. Print the variables or column names of
onlineshop. Print the first 3 row names of onlineshop.

Exericse-2: Consider the onlineshop data in Exercise-1. Determine the number of rows
containing missing values and print the number of rows containing missing values and label it
rows_with_na from the data frame onlineshop. Show the variable or column-wise list of
missing values. In which columns or variables, don’t we have any missing values?

Exericise-3: Consider the onlineshop data in Exercise-1. Determine the number of missing
values in TYPE variable in onlineshop and identify the rows containing missing value in the TYPE
variable.

Exericise-4: Consider the onlineshop data in Exercise-1. Create a subset of onlineshop data
frame by discarding all the missing values and label the new data frame as
onlineshopComplete. How many observations do we have in this new data frame? How many
of them are numeric? How many of them are character?
Exercise-5: Consider the onlineshopComplete data frame in created in Exercise-4. Convert the
variable TYPE into a factor labeling 1 = Manufacturing, and 2 = Service. Create a series of
boxplot across the TYPE of industry. How many distinct extreme values do we have in each
type?

Exericse-6: Consider the data frame onlineshopCompleted in Exercise-5. Create a frequency

table of the qualitative variable PAYMENT_METHOD and create proportion table of the same
variable. What percentage of customers are using PayPal?

Exercise-7: Consider the data frame onlineshopCompleted in Exercise-5. Create a two-way

frequency table for the variables GENDER and PAYMENT_METHOD in the
onlineshopCompleted data frame. Create a proportion table of the same two variables across
the rows. What percentage of PayPal users are male? Did you need to create another
proportion table? Explain.

Exercise-8: Consider the data frame onlineshopCompleted in Exercise-5. Determine the

summary statistics for the variable CREDIT_SCORE in the dataframe onlineshopCompleted.
What is the average mean credit score? Compare the median and mean credit score, which one
is higher. Calculate the mean value of the variable CREDIT_SCORE across the variable GENDER.
Which gender has higher mean credit score and lower skewness?

9|Page
Exercise-9: Consider the data frame onlineshopCompleted in Exercise-5. Construct a bar chart
from the variable PAYMENT_METHOD in the onlineshopCompleted data frame by setting the
title as Payment Method Distribution of the Users and labeling x and y axis as you may deem
appropriate by changing the color parameter to be “yellowgreen” in the R environment.

Exercise-10: Consider the data frame onlineshopCompleted in Exercise-5. Construct a

histogram for the AGE in the onlineshopCompleted data frame by setting title as The
Distribution of User Age and labeling x and y axis as you may deem appropriate without
changing the color parameter in the R environment. Create another histogram by setting the
breaks = 30. Which of these histograms is more informative? Explain

10 | P a g e

Phantom LUTs
No ratings yet
Phantom LUTs
10 pages
Lab 5
0% (1)
Lab 5
5 pages
Ma 3
No ratings yet
Ma 3
32 pages
Materi 4
No ratings yet
Materi 4
30 pages
Unit 2
No ratings yet
Unit 2
29 pages
4 Overview of R Part 2
No ratings yet
4 Overview of R Part 2
63 pages
Module 5-6
No ratings yet
Module 5-6
12 pages
Introduction To R For Business Analytics
No ratings yet
Introduction To R For Business Analytics
7 pages
Analysis Using Statistical: Introduction & Data Exploration
No ratings yet
Analysis Using Statistical: Introduction & Data Exploration
23 pages
Apunts BLOC 1 Estadística
No ratings yet
Apunts BLOC 1 Estadística
15 pages
Data Analyses R Manual NYTS
No ratings yet
Data Analyses R Manual NYTS
24 pages
R For Data Exploration
No ratings yet
R For Data Exploration
52 pages
Unit 2
No ratings yet
Unit 2
76 pages
MDPN460 Lecture05
No ratings yet
MDPN460 Lecture05
32 pages
Business Analytics - L2
No ratings yet
Business Analytics - L2
41 pages
Coursera Notes
No ratings yet
Coursera Notes
4 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
R1 Uptovisualisation
No ratings yet
R1 Uptovisualisation
122 pages
STA1007S Lab 3: Plots (II) and Sub-Setting: "Sample"
No ratings yet
STA1007S Lab 3: Plots (II) and Sub-Setting: "Sample"
10 pages
DWDM - Lab Manual1
No ratings yet
DWDM - Lab Manual1
40 pages
Getting Started With R
No ratings yet
Getting Started With R
155 pages
Unit 1 R Reading-Writing Files
No ratings yet
Unit 1 R Reading-Writing Files
8 pages
L3 Notes-1
No ratings yet
L3 Notes-1
8 pages
R Study Material I
No ratings yet
R Study Material I
8 pages
Daur Unit 2
No ratings yet
Daur Unit 2
28 pages
Module 1: Unit - 1.1: Introduction To Analytics or R Programming
No ratings yet
Module 1: Unit - 1.1: Introduction To Analytics or R Programming
26 pages
Lab0 R Tutorial EHS
No ratings yet
Lab0 R Tutorial EHS
9 pages
Data Preparation: Handling Missing Values and Outliers
No ratings yet
Data Preparation: Handling Missing Values and Outliers
28 pages
Dar Lecture 7
No ratings yet
Dar Lecture 7
24 pages
Lecture 1
No ratings yet
Lecture 1
167 pages
Lesson 7 - The Data Frame
No ratings yet
Lesson 7 - The Data Frame
7 pages
Data analytic R
No ratings yet
Data analytic R
28 pages
Business Analytics Unit 4
No ratings yet
Business Analytics Unit 4
24 pages
CS ELEC 4 Midterm Module
No ratings yet
CS ELEC 4 Midterm Module
59 pages
Unit 1 Factor
No ratings yet
Unit 1 Factor
9 pages
R Assignment
No ratings yet
R Assignment
9 pages
Stats Lab1
No ratings yet
Stats Lab1
11 pages
Advance R Prog.-1
No ratings yet
Advance R Prog.-1
24 pages
Big Data - Lab 3
No ratings yet
Big Data - Lab 3
25 pages
CH 3
No ratings yet
CH 3
33 pages
(R) Internal-2 Q & A
No ratings yet
(R) Internal-2 Q & A
65 pages
R Complete
No ratings yet
R Complete
24 pages
R Lab Manual
No ratings yet
R Lab Manual
31 pages
Advanced Statistics
No ratings yet
Advanced Statistics
259 pages
Practical 1 - Data Frame Manipulation - 072502
No ratings yet
Practical 1 - Data Frame Manipulation - 072502
16 pages
DS Lab
No ratings yet
DS Lab
31 pages
UL2
No ratings yet
UL2
2 pages
Unit 4
No ratings yet
Unit 4
27 pages
R-Programming Lab Mannual
No ratings yet
R-Programming Lab Mannual
33 pages
Week3 2020
No ratings yet
Week3 2020
20 pages
Capital Gains
No ratings yet
Capital Gains
8 pages
MultivariateRGGobi PDF
No ratings yet
MultivariateRGGobi PDF
60 pages
Advanced R Programming Tidyverse Packages Notes
No ratings yet
Advanced R Programming Tidyverse Packages Notes
12 pages
R Imp Funtions
No ratings yet
R Imp Funtions
10 pages
People Analytics With R Part 3
No ratings yet
People Analytics With R Part 3
11 pages
R
No ratings yet
R
15 pages
Data Cleansing
No ratings yet
Data Cleansing
18 pages
Muthayammal College of Arts and Science Rasipuram: Assignment No - 1
No ratings yet
Muthayammal College of Arts and Science Rasipuram: Assignment No - 1
10 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
More on C# in Front Office
From Everand
More on C# in Front Office
Xing Zhou
No ratings yet
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
The Kemetic Tree of Life
No ratings yet
The Kemetic Tree of Life
1 page
Boundary Layer Notes PDF
No ratings yet
Boundary Layer Notes PDF
10 pages
Elephants Under Human Care The Behaviour, Ecology, and Welfare of Elephants in Captivity 1st Edition - Ebook PDFinstant Download
100% (3)
Elephants Under Human Care The Behaviour, Ecology, and Welfare of Elephants in Captivity 1st Edition - Ebook PDFinstant Download
54 pages
B7 - Control Relays - EN
No ratings yet
B7 - Control Relays - EN
28 pages
MODULE-12
No ratings yet
MODULE-12
8 pages
ATM Card Transaction Process and Security Mechanism
No ratings yet
ATM Card Transaction Process and Security Mechanism
19 pages
BlueSky 6.0 Release Notes
No ratings yet
BlueSky 6.0 Release Notes
3 pages
Chapter 4 5 Isometric and Orthographic Sketching
No ratings yet
Chapter 4 5 Isometric and Orthographic Sketching
23 pages
1200 CP For Pipelines Corrosion Prevention and Metallurgy Manual
No ratings yet
1200 CP For Pipelines Corrosion Prevention and Metallurgy Manual
1 page
ARHQ JLD REV2 Text
No ratings yet
ARHQ JLD REV2 Text
35 pages
QUIZ 3 Entrepreneurial Mind
No ratings yet
QUIZ 3 Entrepreneurial Mind
2 pages
SAEP-16 - 0305 - Project Execution Guide For Process Automation Systems
0% (1)
SAEP-16 - 0305 - Project Execution Guide For Process Automation Systems
18 pages
Sodra Blue Z - Product Information (5) - 250427 - 131822
No ratings yet
Sodra Blue Z - Product Information (5) - 250427 - 131822
2 pages
Op-Ed Therattil
No ratings yet
Op-Ed Therattil
2 pages
Tema 20
No ratings yet
Tema 20
7 pages
A Study to Assess the Effectiveness of Video Assisted Teaching Programme on Knowledge about Safety Measures Regarding Handling of Chemotherapy Drugs among Undergraduate Nursing Student from Selected Colleges of Chandrapur
No ratings yet
A Study to Assess the Effectiveness of Video Assisted Teaching Programme on Knowledge about Safety Measures Regarding Handling of Chemotherapy Drugs among Undergraduate Nursing Student from Selected Colleges of Chandrapur
3 pages
Law of Sucession Assignment
No ratings yet
Law of Sucession Assignment
7 pages
Production of Pulp From Banana Tree
No ratings yet
Production of Pulp From Banana Tree
38 pages
Zomato Food Order: Summary and Receipt: Item Quantity Unit Price Total Price
No ratings yet
Zomato Food Order: Summary and Receipt: Item Quantity Unit Price Total Price
1 page
List of Successful Start-Ups in West Bengal
No ratings yet
List of Successful Start-Ups in West Bengal
6 pages
Interview Assessment Form - Lateral - V 3.2
No ratings yet
Interview Assessment Form - Lateral - V 3.2
29 pages
Social Entrepreneurship Literature Review Johnson
No ratings yet
Social Entrepreneurship Literature Review Johnson
11 pages
General Awareness in Steam Turbine Manufacturing: An Industrial Training Presentation On
No ratings yet
General Awareness in Steam Turbine Manufacturing: An Industrial Training Presentation On
29 pages
Lesson 6 Chain Rule and Higher Order Derivatives
No ratings yet
Lesson 6 Chain Rule and Higher Order Derivatives
13 pages
AI Supplementary
No ratings yet
AI Supplementary
25 pages
Annex E: Corrosion Inhibitors
No ratings yet
Annex E: Corrosion Inhibitors
9 pages
Single Phase Induction Motor
No ratings yet
Single Phase Induction Motor
13 pages
AUTOSAR FO RS Main
No ratings yet
AUTOSAR FO RS Main
41 pages
Common Issues in Machine Learning
No ratings yet
Common Issues in Machine Learning
6 pages

Exploratory Data Analysis and Visualization

Uploaded by

Exploratory Data Analysis and Visualization

Uploaded by

Chapter: Exploratory Data Analysis and Visualization

# Importing Data in Various Formats to R Environment

myData <- readxl :: read_excel (“file_name.xlsx”, header = T, sheet = “sheet_name”)

myData <- read.csv (“file_name.csv”, header = T)

No external package is required; Data will be imported as a data frame.

A data frame can be systematically examined in R by evaluating the following aspects:

 Structure of the Data Frame (str function)

# Examining for Missing Value in a Data Frame

# Examining Extreme Values/ Outliers

Here the boxplot can be constructed by using the following code in R:

# Use of Table and Proportion Table for the Qualitative Variable

To get the proportion table for multiple qualitative variables, we run

proportion (table(dataframe$vector1, dataframe$vector2), 1) # % calculated across rows

# Basic Descriptive Statistics for Quantitative Variables

install.packages(“psych”) # If not already installed

# Basic Data Visualization: Qualitative Variables

barplot (table(dataframe$vector1), # where vector1 is qualitative variable

# Basic Data Visualization: Quantitative Variables

The most commonly used methods to visualize quantitative variables are:

hist(vector1, # vector1 is a numerical (quantitative) vector)

boxplot(vector1, # vector1 is a numerical (quantitative) vector)

Exericse-6: Consider the data frame onlineshopCompleted in Exercise-5. Create a frequency

Exercise-7: Consider the data frame onlineshopCompleted in Exercise-5. Create a two-way

Exercise-8: Consider the data frame onlineshopCompleted in Exercise-5. Determine the

Exercise-10: Consider the data frame onlineshopCompleted in Exercise-5. Construct a

You might also like