0% found this document useful (0 votes)

37 views30 pages

DSF Gourav-2

Uploaded by

abhishek9582822

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views30 pages

DSF Gourav-2

Uploaded by

abhishek9582822

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 30

Department of CSE-DS Engineering

DPG Institute of Technology and

Management Gurugram 122004,Haryana

DATA SCIENCE LAB

FILE (LC-DS-341G)
V SEMESTER
CSE-DS ENGINEERING

Submitted TO: Submitted BY:

Dr. Poonam Sharma Shivam Kumar
Assoc. Professor Roll. No:
CSE Department B.Tech(CSE-DS)-5th sem
INDEX

S.No. Program Date Sign.

1. Downloading, installing.and setting path for R.

2. Give an idea of R Data Types.

3. R as a Calculator: Perform some arithmetic

operations in R.

4. Perform some Logical Operations in R.

5. Write a R script to Demonstrate Loops.

6. Write a R script to change the structure of a Data

frame.
7. Write a R script to Demonstrate aggregate
function in R.

8. Write a r script to handle missing values in r.

9. Write a r script to handle outliers.

PROGRAM-1
AIM :- Downloading, installing.and setting path for R.
INTRODUCTION :- R Studio is an integrated development environment(IDE) for R. IDE is
a GUI, where you can write your quotes, see the results and also see the variables that are
generated during the course of programming.
R is a language and environment for statistical computing and graphics. It is a GNU
project which is similar to the S language and environment which was developed at Bell
Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues.
• R Studio is available as both Open source and Commercial software.
• R Studio is also available as both Desktop and Server versions.
• R Studio is also available for various platforms such as Windows, Linux, and macOS
Why use R Studio?
• It is a powerful IDE, specifically used for the R language.
• Provides literate programming tools, which basically allow the use of R scripts,
outputs, text, and images into reports, Word documents, and even an HTML file.
• The use of Shiny (open-source R package) allows us to create interactive content in
reports and presentations.
Advantages of R Programming
• Open Source. R is an open-source programming language. ...
• Exemplary Support for Data Wrangling. R provides exemplary
support for data wrangling. ...
• The Array of Packages. ...
• Quality Plotting and Graphing. ...
• Highly Compatible. ...
• Platform Independent. ...
• Eye-Catching Reports. ...
• Machine Learning Operations.

Installing R Studio on Window:-

To Install R Studio on windows we will follow the following steps.
Step 1: First, you need to set up an R environment in your local machine. You can download
the same from internet.
Step 2: After downloading R for the Windows platform, install it by double-clicking it.
Step 3: Download R Studio from their official page. Note: It is free of cost (under AGPL
licensing).

Step 4: After downloading, you will get a file named “RStudio-1.x.xxxx.exe” in your
Downloads folder.
Step 5: Double-click the installer, and install the software.
Step 6: Test the R Studio installation

• Search for RStudio in the Window search bar on Taskbar.

• Start the application.

• Insert the following code in the console.

Input : print('Hello
world!') Output : [1] "Hello world!"

Step 7: Your installation is successful.

Result:- R studio has been successfully installed on your system.
PROGRAM-2
AIM:-Give an idea of R Data Types.
R supports a variety of data types, which can be broadly categorized as follows:
1. Basic Data Types:
 Numeric: Represents real numbers (both integers and decimals). By default, R stores
numbers as double-precision floating-point numbers.
x <- 3.14 # numeric
y <- 5 # numeric (treated as double)
 Integer: Represents whole numbers. Use the L suffix to explicitly declare
integers. x <- 5L # integer
 Complex: Represents complex numbers with real and imaginary
parts. z <- 2 + 3i # complex number
 Logical: Boolean values, TRUE or
FALSE. x <- TRUE
y <- FALSE
 Character: Represents text or string
data. name <- "John Doe"
2. Data Structures:
 Vector: A one-dimensional array that holds elements of the same type. Vectors can
be numeric, logical, character, or integer.
v <- c(1, 2, 3, 4) # numeric vector
names <- c("Alice", "Bob") # character vector
 Matrix: A two-dimensional array that holds elements of the same type. All elements
in a matrix must be of the same data type.
m <- matrix(1:9, nrow = 3, ncol = 3)
 Array: A multi-dimensional generalization of a matrix that can store elements of the
same type. Arrays can have more than two dimensions.
a <- array(1:8, dim = c(2, 2, 2)) # 3D array
 List: A generic vector that can hold elements of different types (numeric, character,
lists, etc.).
lst <- list(1, "apple", TRUE)
 Data Frame: A table-like structure, where each column can be of a different data
type (similar to a spreadsheet or SQL table).
df <- data.frame(Name = c("John", "Alice"), Age = c(23, 25))
3. Factor:
Represents categorical data and stores it as integers with a corresponding set of levels.
Factors are useful for handling categorical data.
factor_data <- factor(c("Male", "Female", "Female", "Male"))
4. NULL:
Represents an empty or undefined value.
x <- NULL
5. NA:
Represents missing values or undefined
data. x <- c(1, 2, NA, 4)
6. NaN (Not a Number):
Represents undefined mathematical operations like dividing by zero.
x <- 0/0 # NaN
PROGRAM -3
AIM:-R as a Calculator: Perform some arithmetic operations in R.
Creating a simple calculator in R is straightforward. You can implement basic operations like
addition, subtraction, multiplication, and division using a function that takes user input for
the numbers and the operation. Here's how you can build a simple calculator:
Simple Calculator in R
# Define the calculator function
calculator <- function() {

# Display menu options

cat("Simple R Calculator\n")
cat("1. Addition (+)\n")
cat("2. Subtraction (-)\n")
cat("3. Multiplication (*)\n")
cat("4. Division (/)\n")

# Get user input for numbers and operation

num1 <- as.numeric(readline(prompt = "Enter the first number: "))
num2 <- as.numeric(readline(prompt = "Enter the second number: "))
operator <- readline(prompt = "Choose an operation (+, -, *, /): ")

# Perform the operation based on user input

result <- switch(operator,
"+" = num1 + num2,
"-" = num1 - num2,
"*" = num1 * num2,
"/" = if(num2 != 0) num1 / num2 else "Error: Division by
zero", "Invalid operator")

# Display the result

cat("Result: ", result, "\n")
}

# Call the calculator function to run it

calculator()
Explanation:
1. User Input: The readline() function is used to take user input for the two numbers
and the operator. The as.numeric() function converts the input to numeric data type.
2. Switch Statement: The switch() function is used to handle the different operations
based on the user's choice. It selects the correct operation (addition, subtraction,
multiplication, or division) based on the operator entered by the user.
3. Division Handling: To avoid division by zero, a condition is used to check if the
second number is zero when performing division.
4. Result Display: The cat() function is used to display the result.
Example Output:
markdown
Simple R Calculator
1. Addition (+)
2. Subtraction (-)
3. Multiplication (*)
4. Division (/)
Enter the first number: 10
Enter the second number: 5
Choose an operation (+, -, *, /): +
Result: 15
PROGRAM.-4
AIM:- Perform some Logical Operations in R.
Logical operations in R are used to compare values and return Boolean results (TRUE or
FALSE). These operations can be applied to vectors, matrices, or individual values. Here's an
overview of logical operators and how they work in R:
1. Basic Logical Operators:
 AND (& for element-wise and && for short-circuit):
o &: Element-wise logical AND. It checks each element pair.
o &&: Only checks the first element of each vector and performs the logical
AND operation.
x <- c(TRUE, FALSE, TRUE)
y <- c(TRUE, TRUE, FALSE)

# Element-wise AND
x & y # Returns: TRUE FALSE FALSE

# Short-circuit AND (only compares the first elements)

x && y # Returns: TRUE
 OR (| for element-wise and || for short-circuit):
o |: Element-wise logical OR.
o ||: Only checks the first element of each vector and performs the logical OR
operation.
x <- c(TRUE, FALSE, TRUE)
y <- c(FALSE, TRUE, FALSE)

# Element-wise OR
x | y # Returns: TRUE TRUE TRUE

# Short-circuit OR (only compares the first elements)

x || y # Returns: TRUE
 NOT (!):
o Negates a logical value (TRUE becomes FALSE, and vice versa).
x <- TRUE
!x # Returns: FALSE
2. Comparison Operators:
These operators return logical values based on comparisons between two values or vectors.
 Equal to (==):
5 == 5 # Returns: TRUE
5 == 6 # Returns: FALSE
 Not equal to (!=):
5 != 6 # Returns: TRUE
5 != 5 # Returns: FALSE
 Greater than (>):
7 > 3 # Returns: TRUE
 Less than (<):
2 < 8 # Returns: TRUE
 Greater than or equal to
(>=): 5 >= 5 # Returns: TRUE
 Less than or equal to
(<=): 4 <= 6 # Returns: TRUE
3. Logical Functions:
 any(): Returns TRUE if at least one of the elements is
TRUE. x <- c(FALSE, TRUE, FALSE)
any(x) # Returns: TRUE
 all(): Returns TRUE only if all elements are
TRUE. x <- c(TRUE, TRUE, FALSE)
all(x) # Returns: FALSE
 xor(): Returns TRUE when exactly one of the two operands is TRUE, but not
both. xor(TRUE, FALSE) # Returns: TRUE
xor(TRUE, TRUE) # Returns: FALSE
4. Combining Logical Operations:
Logical operators can be combined to form more complex expressions.
x <- 5
y <- 10

(x < 6) & (y > 5) # Returns: TRUE (both conditions are TRUE)

!(x > 6) | (y == 10) # Returns: TRUE (because one condition is TRUE)

5. Logical Operations with Vectors:
When logical operations are applied to vectors, they are evaluated element-wise.
a <- c(1, 2, 3)
b <- c(3, 2, 1)

a > b # Returns: FALSE FALSE TRUE

These logical operations help in data manipulation, filtering, and decision-making in R.
PROGRAM.-5
AIM:-Write an R Script to demonstrate loops.
In R, loops are used to iterate over a sequence of elements or execute code repeatedly. The
most common types of loops in R are for, while, and repeat loops. Here’s an overview of
each with examples.
1. For Loop
The for loop in R iterates over a sequence, executing the code block for each element.
Syntax:
for (variable in sequence) {
# Code to execute
}
Example:
# Print numbers 1 to 5
for (i in 1:5) {
print(i)
}
Example with Vector:
# Loop through a vector
vec <- c("apple", "banana", "cherry")
for (fruit in vec) {
print(fruit)
}
2. While Loop
A while loop keeps executing the block of code as long as the condition is TRUE.
Syntax:
while (condition) {
# Code to execute
}
Example:
# Print numbers from 1 to 5
i <- 1
while (i <= 5) {
print(i)
i <- i + 1
}
3. Repeat Loop
A repeat loop is an infinite loop unless a condition is met and the loop is broken using break.
Syntax:
repeat {
# Code to execute
if (condition) {
break
}
}
Example:
# Print numbers from 1 to 5 using repeat loop
i <- 1
repeat {
print(i)
i <- i + 1
if (i > 5) {
break
}
}
4. Loop Control Statements
 break: Used to exit a loop early.
 next: Skips the current iteration and moves to the next iteration.
Example of break:
# Stop the loop when i is equal to 3
for (i in 1:5) {
if (i == 3) {
break
}
print(i)
}
Example of next:
# Skip printing the number 3
for (i in 1:5) {
if (i == 3) {
next
}
print(i)
}
5. Nested Loops
Loops can be nested within other loops.
Example:
# Nested for loop to print a 3x3 matrix
for (i in 1:3) {
for (j in 1:3) {
print(paste("i:", i, "j:", j))
}
}
PROGRAM.-6
AIM:-Write an R script to change the structure of a Data Frame.
In R, data frames are one of the most commonly used data structures for handling tabular
data. A data frame is essentially a table where each column can contain different types of data
(e.g., numeric, character, or logical). Data frames are widely used for data manipulation and
analysis in R, especially in the context of datasets.
1. Creating a Data Frame
You can create a data frame using the data.frame() function by specifying vectors of equal
length as columns.
Example:
# Creating vectors for each column
names <- c("John", "Alice", "Bob")
ages <- c(25, 30, 28)
scores <- c(88.5, 92.0, 79.5)

# Creating a data frame

df <- data.frame(Name = names, Age = ages, Score = scores)

# Display the data frame

print(df)
Output:
Name Age Score
1 John 25 88.5
2 Alice 30 92.0
3 Bob 28 79.5
2. Exploring and Accessing Data in a Data Frame
Once you have a data frame, you can access and explore the data in various ways.
Accessing Columns:
You can access individual columns of a data frame using the $ operator or by indexing.
# Accessing the "Name" column
df$Name
# Using indexing
df[, "Name"] # Same as df$Name
df[["Name"]] # Same as df$Name
Accessing Rows:
You can use row indexing to access specific rows of a data
frame. # Access the first row
df[1, ]

# Access multiple rows

df[1:2, ]
Accessing Specific Elements:
To access specific elements, you can use row and column
indexing. # Access the element in the first row and second column
df[1, 2] # Output: 25 (John's age)

# Access the element in the second row and "Score" column

df[2, "Score"] # Output: 92.0
Viewing the Structure:
To view the structure of a data frame (including data types of each column), use the str()
function.
str(df)
Example Output:
ruby
'data.frame': 3 obs. of 3 variables:
$ Name : chr "John" "Alice" "Bob"
$ Age : num 25 30 28
$ Score: num 88.5 92 79.5
3. Adding and Removing Columns
You can easily add or remove columns in a data frame.
Adding a New Column:
# Adding a new column for gender
df$Gender <- c("Male", "Female", "Male")
print(df)
Output:
Name Age Score Gender
1 John 25 88.5 Male
2 Alice 30 92.0 Female
3 Bob 28 79.5 Male
Removing a Column:
To remove a column, you can use the NULL
assignment. # Removing the "Gender" column
df$Gender <- NULL
4. Adding and Removing Rows
Adding a Row:
To add a row, you can use rbind() to combine an existing data frame with a new row.
# Adding a new row
new_row <- data.frame(Name = "Emma", Age = 22, Score = 85.0)
df <- rbind(df, new_row)
print(df)
Output:
Name Age Score
1 John 25 88.5
2 Alice 30 92.0
3 Bob 28 79.5
4 Emma 22 85.0
Removing a Row:
To remove a row, you can use negative
indexing. # Removing the first row
df <- df[-1, ]
print(df)
5. Subsetting Data Frames
You can extract subsets of data based on conditions.
Example:
# Subset rows where Age is greater than 25
subset_df <- subset(df, Age > 25)
print(subset_df)
Output:
Name Age Score
2 Alice 30 92.0
3 Bob 28 79.5
6. Handling Missing Data
Missing values in data frames are represented as NA. You can detect, remove, or replace
missing data as needed.
Detecting Missing Values:
# Check for missing values
is.na(df)
Removing Rows with Missing Values:
# Remove rows with any missing values
df_clean <- na.omit(df)
Replacing Missing Values:
# Replace missing values in the "Score" column with the mean score
df$Score[is.na(df$Score)] <- mean(df$Score, na.rm = TRUE)
7. Sorting Data Frames
You can sort a data frame by one or more columns using the order() function.
Sorting by One Column:
# Sorting by the "Age" column
df_sorted <- df[order(df$Age), ]
print(df_sorted)
Sorting by Multiple Columns:
# Sorting by "Age" and then by "Score"
df_sorted <- df[order(df$Age, df$Score), ]
print(df_sorted)
8. Merging Data Frames
You can merge two data frames using the merge() function, which performs a join operation.
Example:
# Create another data frame with additional information
df2 <- data.frame(Name = c("John", "Alice", "Emma"),
Country = c("USA", "UK", "Canada"))

# Merge the two data frames by the "Name"

column df_merged <- merge(df, df2, by = "Name")
print(df_merged)
Output:
Name Age Score Country
1 Alice 30 92.0 UK
2 Emma 22 85.0 Canada
3 John 25 88.5 USA
9. Summary Statistics of Data Frames
You can calculate summary statistics for numerical columns using the summary() function.
Example:
# Summarize the data frame
summary(df)
Example Output:
mathematica
Name Age Score
John :1 Min. :22.00 Min. :79.50
Alice:1 1st Qu.:24.25 1st Qu.:84.63
Bob :1 Median :26.50 Median :88.50
Emma :1 Mean :26.25 Mean :86.25
3rd Qu.:28.50 3rd Qu.:90.13
Max. :30.00 Max. :92.00
PROGRAM.-7
AIM:- Write an R script to demonstrate Aggregate Functions in R
Aggregate() function is used to get the summary statistics of the data by group. The statistics
include mean, min, sum. max etc.
Syntax:
aggregate(dataframe$aggregate_column, list(dataframe$group_column), FUN)
where
 dataframe is the input dataframe.
 aggregate_column is the column to be aggregated in the dataframe.
 group_column is the column to be grouped with FUN.
 FUN represents sum/mean/min/ max.
Example 1: R program to create with 4 columns and group with subjects and get the
aggregates like minimum, sum, and maximum.
 R

# create a dataframe with 4 columns

data = data.frame(subjects=c("java", "python", "java",
"java", "php", "php"),
id=c(1, 2, 3, 4, 5, 6),
names=c("manoj", "sai", "mounika",
"durga", "deepika", "roshan"),
marks=c(89, 89, 76, 89, 90, 67))

# display
print(data)

# aggregate sum of marks with subjects

print(aggregate(data$marks, list(data$subjects), FUN=sum))

# aggregate minimum of marks with subjects

print(aggregate(data$marks, list(data$subjects), FUN=min))

# aggregate maximum of marks with subjects

print(aggregate(data$marks, list(data$subjects),
FUN=max))
Output:

Example 2: R program to create with 4 columns and group with subjects and get the average
(mean).
 R

# create a dataframe with 4 columns

# aggregate average of marks with subjects

print(aggregate(data$marks, list(data$subjects),
FUN=mean))
Output:
PROGRAM.-8
AIM:- Write a R script to handle missing values in R.
Handling missing values is crucial in data preprocessing before performing any analysis in R.
Here's an R script that demonstrates various methods to handle missing values (NAs) in a
dataset:
Sample R Script: Handling Missing Values
# Sample data frame with missing values
(NA) data <- data.frame(
Name = c("John", "Alice", "Sam", NA, "Kate"),
Age = c(28, NA, 34, 25, NA),
Salary = c(50000, 60000, NA, 45000, 52000),
stringsAsFactors = FALSE
)

# Display the original data

print("Original Data:")
print(data)

# 1. Identify Missing Values

print("Identifying Missing Values (TRUE if
missing):") is.na(data) # Returns TRUE for missing
values

# 2. Count Missing Values in Each Column

print("Count of Missing Values in Each
Column:")
colSums(is.na(data)) # Sum of TRUE values (i.e., NAs) for each column

# 3. Removing Rows with Missing Values (Complete

Cases) print("Data with Rows Containing NAs Removed:")
data_clean <- na.omit(data) # Removes rows where any NA is present
print(data_clean)
# 4. Replace Missing Values with a Specific Value (e.g., Mean, Median, etc.)
# Replace missing Age with the mean of non-missing values
mean_age <- mean(data$Age, na.rm = TRUE) # Calculate mean, excluding NAs
data$Age[is.na(data$Age)] <- mean_age # Replace NA with mean
print("Data After Replacing Missing Age with Mean:")
print(data)

# Replace missing Salary with a specific value (e.g.,

median) median_salary <- median(data$Salary, na.rm =
TRUE) data$Salary[is.na(data$Salary)] <- median_salary
print("Data After Replacing Missing Salary with Median:")
print(data)

# 5. Fill Missing Values Using Linear Interpolation (For Numeric Data)

# Using the zoo package for interpolation
# install.packages("zoo") # Uncomment to install the package if needed
library(zoo)
data$Age <- na.approx(data$Age) # Perform linear interpolation for Age
print("Data After Linear Interpolation for Age:")
print(data)

# 6. Filter Rows with Missing Values in Specific

Columns print("Rows where 'Age' is not missing:")
data_age_present <- data[!is.na(data$Age), ] # Keep rows where 'Age' is not missing
print(data_age_present)

# 7. Imputation using Mean/Median (Multiple Columns)

impute_mean <- function(x) {
x[is.na(x)] <- mean(x, na.rm = TRUE)
return(x)
}

# Apply imputation to all numeric columns

data_imputed <- data
data_imputed$Age <- impute_mean(data$Age)
data_imputed$Salary <- impute_mean(data$Salary)
print("Data After Imputation of Missing Values:")
print(data_imputed)
Explanation of the Script:
1. Identify Missing Values:
o is.na() is used to check for missing values in the dataset.
2. Count Missing Values:
o colSums(is.na(data)) gives the count of missing values for each column.
3. Removing Rows with Missing Values:
o na.omit() removes rows containing any NA values.
4. Replacing Missing Values:
o You can replace missing values with specific values (e.g., mean or median)
using mean() and median() functions.
5. Linear Interpolation:
o With the help of the zoo package's na.approx(), missing values in numeric data
can be interpolated linearly.
6. Filtering Rows Based on Missing Values in Specific Columns:
o Rows with missing values in specific columns can be filtered out using logical
conditions.
7. Imputation for Multiple Columns:
o A custom function impute_mean() replaces NA values with the mean of the
column. It can be applied to multiple columns.
PROGRAM.-9
AIM:- Write an R script to handle outliers.
Outliers are data points that significantly differ from other observations in a dataset. Handling
outliers is an essential step in data preprocessing. Here's an R script that demonstrates how to
detect and handle outliers using several common techniques:
Sample R Script: Handling Outliers
# Sample data with potential outliers
set.seed(123)
data <- data.frame(
ID = 1:20,
Age = c(25, 28, 22, 27, 35, 30, 24, 29, 100, 26, 23, 27, 28, 31, 26, 25, 200, 29, 28, 26), # Age
has outliers
Salary = c(30000, 32000, 29000, 33000, 1000000, 31000, 29500, 30500, 29500, 31500,
30000, 32000, 28000, 33000, 31000, 100000, 32000, 30000, 31000, 30000) # Salary has
outliers
)

# Display original data

print("Original Data:")
print(data)

# 1. Identifying Outliers Using the IQR (Interquartile Range)

Method # For numeric columns (Age and Salary)

outliers_iqr <- function(x) {

Q1 <- quantile(x, 0.25) # First quartile (25th percentile)
Q3 <- quantile(x, 0.75) # Third quartile (75th percentile)
IQR <- Q3 - Q1 # Interquartile Range

lower_bound <- Q1 - 1.5 * IQR # Lower bound for outliers

upper_bound <- Q3 + 1.5 * IQR # Upper bound for outliers
return(x < lower_bound | x > upper_bound) # TRUE if outlier
}

# Apply the IQR method to detect outliers in the 'Age' column

data$outlier_age <- outliers_iqr(data$Age)
print("Identified Outliers in Age (TRUE indicates an outlier):")
print(data$outlier_age)

# Apply the IQR method to detect outliers in the 'Salary' column

data$outlier_salary <- outliers_iqr(data$Salary)
print("Identified Outliers in Salary (TRUE indicates an outlier):")
print(data$outlier_salary)

# 2. Visualizing Outliers Using

Boxplots # Boxplot for Age
boxplot(data$Age, main = "Boxplot of Age", ylab = "Age", col = "lightblue")

# Boxplot for Salary

boxplot(data$Salary, main = "Boxplot of Salary", ylab = "Salary", col = "lightgreen")
# 3. Handling Outliers: Removal
# Remove rows with outliers in Age or Salary
data_clean <- data[!data$outlier_age & !data$outlier_salary, ]
print("Data After Removing Outliers:")
print(data_clean)

# 4. Handling Outliers: Capping (Winsorization)

# Capping replaces extreme values with lower/upper bounds

cap_outliers <- function(x) {

Q1 <- quantile(x, 0.25)
Q3 <- quantile(x, 0.75)
IQR <- Q3 - Q1

lower_bound <- Q1 - 1.5 * IQR

upper_bound <- Q3 + 1.5 * IQR

x[x < lower_bound] <- lower_bound # Cap lower outliers

x[x > upper_bound] <- upper_bound # Cap upper outliers

return(x)
}

# Apply capping to the Age and Salary columns

data$Age_capped <- cap_outliers(data$Age)
data$Salary_capped <- cap_outliers(data$Salary)
print("Data After Capping Outliers:")
print(data)

# 5. Handling Outliers: Replacing with Mean/Median

# Replace outliers with the median of the column

replace_with_median <- function(x) {

Q1 <- quantile(x, 0.25)
Q3 <- quantile(x, 0.75)
IQR <- Q3 - Q1

lower_bound <- Q1 - 1.5 * IQR

upper_bound <- Q3 + 1.5 * IQR

x[x < lower_bound | x > upper_bound] <- median(x, na.rm = TRUE) # Replace outliers
with median
return(x)
}

# Replace outliers in Age and Salary with median

data$Age_replaced <- replace_with_median(data$Age)
data$Salary_replaced <- replace_with_median(data$Salary)
print("Data After Replacing Outliers with Median:")
print(data)

Management Information System Case Studies
57% (7)
Management Information System Case Studies
3 pages
Unit 1 Notes R Programming
No ratings yet
Unit 1 Notes R Programming
7 pages
Qshell - Iseries
No ratings yet
Qshell - Iseries
226 pages
Satyam Jha r File
No ratings yet
Satyam Jha r File
41 pages
R Programming
No ratings yet
R Programming
114 pages
Introduction To Rlogistic
No ratings yet
Introduction To Rlogistic
135 pages
CH 4 Data Analytics With R and Weak Machine Learning
No ratings yet
CH 4 Data Analytics With R and Weak Machine Learning
82 pages
Krish Bhatia BAS assignment
No ratings yet
Krish Bhatia BAS assignment
63 pages
r File Finall
No ratings yet
r File Finall
75 pages
World Class Manufacturing
No ratings yet
World Class Manufacturing
78 pages
1. R Programming
No ratings yet
1. R Programming
22 pages
20ITPL702 - DataScienceWithMachineLearning
No ratings yet
20ITPL702 - DataScienceWithMachineLearning
69 pages
R Course ISLR Basics 2023
No ratings yet
R Course ISLR Basics 2023
77 pages
Statistics With R Programming For Bigdata (Autosaved)
No ratings yet
Statistics With R Programming For Bigdata (Autosaved)
41 pages
P2 - Basics of R Programming
No ratings yet
P2 - Basics of R Programming
47 pages
PushpendraLabFile
No ratings yet
PushpendraLabFile
51 pages
R Lab
No ratings yet
R Lab
114 pages
2 Undefined
No ratings yet
2 Undefined
86 pages
Performance Analysis of Webrtc-Based Video Confer-Encing: B.A. Jansen
No ratings yet
Performance Analysis of Webrtc-Based Video Confer-Encing: B.A. Jansen
90 pages
Unit 4 - Big Data Technologies
No ratings yet
Unit 4 - Big Data Technologies
48 pages
Da Session 4
No ratings yet
Da Session 4
75 pages
Statistics With R Unit 1
No ratings yet
Statistics With R Unit 1
25 pages
Introduction To R
No ratings yet
Introduction To R
34 pages
Data Analysis Using R - 2
No ratings yet
Data Analysis Using R - 2
23 pages
Ata-80 Aircraft Engine Starting Systems: Electrical Starter Motors
No ratings yet
Ata-80 Aircraft Engine Starting Systems: Electrical Starter Motors
13 pages
Live Class - 2 - 24.08.24
No ratings yet
Live Class - 2 - 24.08.24
19 pages
R programmimg Lab FIle
No ratings yet
R programmimg Lab FIle
35 pages
RBigData NTL
No ratings yet
RBigData NTL
24 pages
Question Paper 1 Answers (R) by Siddu
No ratings yet
Question Paper 1 Answers (R) by Siddu
17 pages
R prog lab manual theory.docx
No ratings yet
R prog lab manual theory.docx
16 pages
Document (1)
No ratings yet
Document (1)
32 pages
Ba Assignment Sem 6 (22504025) Dhruvi Pathania
No ratings yet
Ba Assignment Sem 6 (22504025) Dhruvi Pathania
28 pages
R Module 2
No ratings yet
R Module 2
30 pages
Case Study TCS
0% (1)
Case Study TCS
19 pages
Wed Breakout DB DLA
No ratings yet
Wed Breakout DB DLA
36 pages
R Project
0% (1)
R Project
25 pages
BRM PRACTICAL FILE H--
No ratings yet
BRM PRACTICAL FILE H--
37 pages
WINSEM2021-22 MAT2001 ELA VL2021220501462 Reference Material I 04-01-2022 1. Introduction of R Language - I
No ratings yet
WINSEM2021-22 MAT2001 ELA VL2021220501462 Reference Material I 04-01-2022 1. Introduction of R Language - I
15 pages
Tutorial 1
No ratings yet
Tutorial 1
29 pages
Unit III R Programming Fundamentals
No ratings yet
Unit III R Programming Fundamentals
33 pages
R PPT
No ratings yet
R PPT
63 pages
Data Analysis Using R and Vectors
No ratings yet
Data Analysis Using R and Vectors
35 pages
R program questions 1-24 (21)
No ratings yet
R program questions 1-24 (21)
56 pages
Basic-coding-syntax-and-structure-in-R---version-2
No ratings yet
Basic-coding-syntax-and-structure-in-R---version-2
19 pages
R-Programming Notes
100% (1)
R-Programming Notes
33 pages
Getting Started in R
No ratings yet
Getting Started in R
39 pages
R Lanaguage
No ratings yet
R Lanaguage
25 pages
Rintro
No ratings yet
Rintro
14 pages
1. About R Language
No ratings yet
1. About R Language
15 pages
datatypes variables operators in R
No ratings yet
datatypes variables operators in R
22 pages
SMuR Assignment
No ratings yet
SMuR Assignment
8 pages
Homo Deus A Brief History of Tomorrow
No ratings yet
Homo Deus A Brief History of Tomorrow
19 pages
R Studio
No ratings yet
R Studio
41 pages
R Course Notes
No ratings yet
R Course Notes
10 pages
Introduction to Analytics and R file
No ratings yet
Introduction to Analytics and R file
29 pages
Introduction to r Chap 2
No ratings yet
Introduction to r Chap 2
30 pages
Experiment 1 UART and RS232C Standard: Objectives
No ratings yet
Experiment 1 UART and RS232C Standard: Objectives
8 pages
LAB MANUAL
No ratings yet
LAB MANUAL
46 pages
Cyber security Fundamentals for Understanding Threats and Mitigation Strategies
No ratings yet
Cyber security Fundamentals for Understanding Threats and Mitigation Strategies
26 pages
Big-Data Unit-4
No ratings yet
Big-Data Unit-4
110 pages
Data Science Using R - Lab Manual-Complete Ver 2.0 - Nov 2024
No ratings yet
Data Science Using R - Lab Manual-Complete Ver 2.0 - Nov 2024
36 pages
Chapter 2 SFT
No ratings yet
Chapter 2 SFT
26 pages
Introduction To R Installation: Data Types Value Examples
No ratings yet
Introduction To R Installation: Data Types Value Examples
9 pages
data anlytics using r notes
No ratings yet
data anlytics using r notes
14 pages
Plug in EV Handbook For Public Charging Station Hosts
No ratings yet
Plug in EV Handbook For Public Charging Station Hosts
20 pages
Lecture Notes - 17ec741 - Module - Audio & Video Compression - Raja GV
No ratings yet
Lecture Notes - 17ec741 - Module - Audio & Video Compression - Raja GV
51 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
Unit I R Data Structures
No ratings yet
Unit I R Data Structures
30 pages
Digital Marketing PPT - 2021 - Conselling
No ratings yet
Digital Marketing PPT - 2021 - Conselling
32 pages
DFN 40183: Open Source Server Administrator.: Final Assessment (Practical Test)
No ratings yet
DFN 40183: Open Source Server Administrator.: Final Assessment (Practical Test)
12 pages
Notes
No ratings yet
Notes
5 pages
STATS LAB Basics of R PDF
No ratings yet
STATS LAB Basics of R PDF
77 pages
unit2-Cassandra
No ratings yet
unit2-Cassandra
15 pages
A Formal Security Analysis of The Signal Messaging Protocol
No ratings yet
A Formal Security Analysis of The Signal Messaging Protocol
30 pages
Introduction To R: 1 Getting Started
No ratings yet
Introduction To R: 1 Getting Started
14 pages
Nokia n8-00 Rm-596 Service Schematics v2.0
No ratings yet
Nokia n8-00 Rm-596 Service Schematics v2.0
11 pages
NSDTS Portal Template Product Data Submitted V1.0
No ratings yet
NSDTS Portal Template Product Data Submitted V1.0
8 pages
Mobile Computing Syllabus
No ratings yet
Mobile Computing Syllabus
2 pages
Building A New PACS Ecosystem: Technology Trends
No ratings yet
Building A New PACS Ecosystem: Technology Trends
6 pages
Bank Management System Report PDF
78% (18)
Bank Management System Report PDF
142 pages
Session 4 Assessment-MCQ-Basic Level - Attempt Review
No ratings yet
Session 4 Assessment-MCQ-Basic Level - Attempt Review
5 pages
CylancePROTECT Install Guide
No ratings yet
CylancePROTECT Install Guide
5 pages
1 - Heart Disease Prediction Using Machine Learning
81% (26)
1 - Heart Disease Prediction Using Machine Learning
59 pages
Social Media Marketing Project Report
83% (12)
Social Media Marketing Project Report
78 pages
Online Booking System Project Report
67% (6)
Online Booking System Project Report
67 pages
Student Management System Project Report
88% (24)
Student Management System Project Report
66 pages
HW - 7 1
No ratings yet
HW - 7 1
4 pages
Bca Project On Courier Management System
83% (53)
Bca Project On Courier Management System
245 pages
Cilindros SSI
No ratings yet
Cilindros SSI
6 pages
A Project Report On Bank Management System
77% (232)
A Project Report On Bank Management System
27 pages
Project Report On Swiggy
88% (17)
Project Report On Swiggy
74 pages
Performance Appraisal HR Project
84% (292)
Performance Appraisal HR Project
87 pages
KVFinder Manual
No ratings yet
KVFinder Manual
4 pages
Project Report On Supply Chain Management
82% (17)
Project Report On Supply Chain Management
50 pages
Advanced
No ratings yet
Advanced
3 pages
4th SEM Final Project MBA HR Department
100% (4)
4th SEM Final Project MBA HR Department
91 pages
MBA Project Report On HR
71% (14)
MBA Project Report On HR
66 pages
Performance Appraisal Mba Project
85% (13)
Performance Appraisal Mba Project
53 pages
Online Voting System
100% (4)
Online Voting System
59 pages
Project Report BBA Human Resource Planning in Dainika Bhaskar
91% (11)
Project Report BBA Human Resource Planning in Dainika Bhaskar
62 pages
Project Report
0% (1)
Project Report
82 pages
Final Project Report On Digital Marketing
70% (181)
Final Project Report On Digital Marketing
88 pages
Summer Internship Report For AKTU
0% (1)
Summer Internship Report For AKTU
85 pages
Ai Chatbot Using Python Report
100% (8)
Ai Chatbot Using Python Report
30 pages
Project Report Marketing Strategy Flipkart
95% (21)
Project Report Marketing Strategy Flipkart
83 pages
Practice Test Quick Start Guide
No ratings yet
Practice Test Quick Start Guide
2 pages
MRP Project Report of Mba
74% (62)
MRP Project Report of Mba
89 pages
PROJECT REPORT ON "E - Commerce"
79% (34)
PROJECT REPORT ON "E - Commerce"
58 pages
Mba Project Marketing
100% (7)
Mba Project Marketing
60 pages
A PROJECT REPORT ON STUDY OF WATER SUPPLY SYSTEM PROBLEMS SOLUTIONS IN MUMBAI CITY Submitted To The Division of Civil Engineering
No ratings yet
A PROJECT REPORT ON STUDY OF WATER SUPPLY SYSTEM PROBLEMS SOLUTIONS IN MUMBAI CITY Submitted To The Division of Civil Engineering
1 page
Project Report On Digital Marketing
88% (8)
Project Report On Digital Marketing
54 pages
Spotify Premium Apk Download
No ratings yet
Spotify Premium Apk Download
2 pages
Cara Update Firmware Kenwood ddx5032 PDF
No ratings yet
Cara Update Firmware Kenwood ddx5032 PDF
1 page
A STUDY ON RECRUITMENT AND SELECTION PROCESS Wipro
100% (1)
A STUDY ON RECRUITMENT AND SELECTION PROCESS Wipro
83 pages
FINAL PROJECT For MBA
100% (2)
FINAL PROJECT For MBA
105 pages
Mca Final Year Project
100% (2)
Mca Final Year Project
76 pages
Final Project Report On Digital Marketing
87% (45)
Final Project Report On Digital Marketing
89 pages
MCA Final Major Project Report
No ratings yet
MCA Final Major Project Report
60 pages
Internship Report
No ratings yet
Internship Report
32 pages
R Fast Track Guide - 86 Key Points Every Programmer from Other Languages Should Master
From Everand
R Fast Track Guide - 86 Key Points Every Programmer from Other Languages Should Master
Ginno
No ratings yet