Simple Linear Regression


Sipna College of Engineering & Technology, Amravati.

Department of Computer Science and Engineering


Branch: Computer Science and Engineering        Class: III Year
Subject: Data Science and Statistics Lab        Sem: V

Student Manual
Practical No 2
Aim: Implementation of Simple Linear Regression using Python code
Software Required: RStudio / Anaconda
Theory: Simple linear regression is an approach for predicting a response using a single
feature. It is assumed that the two variables are linearly related, so we try to find a linear
function that predicts the response value (y) as accurately as possible as a function of the
feature, or independent variable (x). Consider a dataset with a response value y for every
feature value x:

x:  0   1   2   3   4   5   6   7   8   9
y:  1   3   2   5   7   8   8   9  10  12

For generality, we define:

x as the feature vector, i.e. x = [x_1, x_2, ..., x_n],
y as the response vector, i.e. y = [y_1, y_2, ..., y_n]

for n observations (in the above example, n = 10).
A scatter plot of the above dataset looks like this:

[Figure: scatter plot of the dataset]
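Since the figure itself is not reproduced here, a minimal sketch to regenerate the scatter plot (using the dataset above) might look like:

import numpy as np
import matplotlib.pyplot as plt

# the sample dataset from this practical
x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])

plt.scatter(x, y, color="m", marker="o", s=30)  # raw observations only
plt.xlabel('x')
plt.ylabel('y')
plt.show()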


Now, the task is to find the line that best fits the above scatter plot, so
that we can predict the response for any new feature value (i.e. a value of x
not present in the dataset).
This line is called a regression line.
The equation of the regression line is:

h(x_i) = b_0 + b_1*x_i
Here,
• h(x_i) represents the predicted response value for the ith observation.
• b_0 and b_1 are the regression coefficients and represent the y-intercept
and the slope of the regression line, respectively.
To create our model, we must "learn" or estimate the values of the regression
coefficients b_0 and b_1. Once we have estimated these coefficients, we can
use the model to predict responses. Here, we use the principle of least squares.
Now consider:

y_i = b_0 + b_1*x_i + e_i, i.e. e_i = y_i - h(x_i)

Here, e_i is the residual error in the ith observation.

So, our aim is to minimize the total residual error.
We define the squared error or cost function J as:

J(b_0, b_1) = (1/(2n)) * Σ e_i^2   (sum over i = 1, ..., n)

and our task is to find the values of b_0 and b_1 for which J(b_0, b_1) is
minimum. Setting the partial derivatives of J with respect to b_0 and b_1 to
zero and solving the resulting normal equations gives:

b_1 = SS_xy / SS_xx
b_0 = m_y - b_1*m_x
where m_x and m_y are the means of the x and y vectors, SS_xy is the sum of
cross-deviations of y and x:

SS_xy = Σ (x_i - m_x)(y_i - m_y) = Σ x_i*y_i - n*m_x*m_y

and SS_xx is the sum of squared deviations of x:

SS_xx = Σ (x_i - m_x)^2 = Σ x_i^2 - n*m_x^2
Code:
import numpy as np
import matplotlib.pyplot as plt

def estimate_coef(x, y):
    # number of observations/points
    n = np.size(x)
    # means of the x and y vectors
    m_x = np.mean(x)
    m_y = np.mean(y)
    # calculating cross-deviation and deviation about x
    SS_xy = np.sum(y*x) - n*m_y*m_x
    SS_xx = np.sum(x*x) - n*m_x*m_x
    # calculating regression coefficients
    b_1 = SS_xy / SS_xx
    b_0 = m_y - b_1*m_x
    return (b_0, b_1)

def plot_regression_line(x, y, b):
    # plotting the actual points as a scatter plot
    plt.scatter(x, y, color="m", marker="o", s=30)
    # predicted response vector
    y_pred = b[0] + b[1]*x
    # plotting the regression line
    plt.plot(x, y_pred, color="g")
    # putting labels
    plt.xlabel('x')
    plt.ylabel('y')
    # function to show plot
    plt.show()

def main():
    # observations / data
    x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
    y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])
    # estimating coefficients
    b = estimate_coef(x, y)
    print("Estimated coefficients:\nb_0 = {}\nb_1 = {}".format(b[0], b[1]))
    # plotting the regression line
    plot_regression_line(x, y, b)

if __name__ == "__main__":
    main()
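To run the practical, save the code to a file (say, pr02_linreg.py; the
filename is just an example) and run "python pr02_linreg.py" from a terminal
or the Anaconda prompt; the script prints the estimated coefficients and then
opens the plot window.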

Output:
Estimated coefficients:
b_0 = 1.2363636363636363
b_1 = 1.1696969696969697
The graph obtained looks like this:

[Figure: scatter plot of the data with the fitted regression line]
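As an optional cross-check (not part of the original manual), the same
least-squares fit can be obtained with NumPy's built-in np.polyfit; the value
x_new below is just an illustrative new feature value:

import numpy as np

x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])

# np.polyfit returns coefficients highest degree first: [slope, intercept]
b_1, b_0 = np.polyfit(x, y, 1)
print("polyfit cross-check: b_0 = {}, b_1 = {}".format(b_0, b_1))

# predicting the response for a new feature value (x_new is hypothetical)
x_new = 11
print("predicted response at x = {}: {}".format(x_new, b_0 + b_1*x_new))

The coefficients printed here should agree with the values estimated above.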
