A tale of two variables
Maarten Van den Broeck
Content Developer at DataCamp
Swedish motor insurance data
Each row represents one geographic region in Sweden. There are 63 rows.

n_claims  total_payment_sek
     108              392.5
      19               46.2
      13               15.7
     124              422.2
      40              119.4
     ...                ...
Descriptive statistics
import pandas as pd
print(swedish_motor_insurance.mean())
n_claims 22.904762
total_payment_sek 98.187302
dtype: float64
print(swedish_motor_insurance['n_claims'].corr(swedish_motor_insurance['total_payment_sek']))
0.9128782350234068
What is regression?
Statistical models to explore the relationship between a response variable and some explanatory variables.
Given values of the explanatory variables, you can predict the values of the response variable.

n_claims  total_payment_sek
     108              392.5
      19               46.2
      13               15.7
     124              422.2
      40              119.4
     200                ???
Jargon
Response variable (a.k.a. dependent variable)
The variable that you want to predict.
Explanatory variables (a.k.a. independent variables)
The variables that explain how the response variable will change.
Linear regression and logistic regression
Linear regression
The response variable is numeric.
Logistic regression
The response variable is logical.
Simple linear/logistic regression
There is only one explanatory variable.
Visualizing pairs of variables
import matplotlib.pyplot as plt
import seaborn as sns
sns.scatterplot(x="n_claims",
y="total_payment_sek",
data=swedish_motor_insurance)
plt.show()
Adding a linear trend line
sns.regplot(x="n_claims",
y="total_payment_sek",
data=swedish_motor_insurance,
ci=None)
Course flow
Chapter 1
Visualizing and fitting linear regression models.
Chapter 2
Making predictions from linear regression models and understanding model coefficients.
Chapter 3
Assessing the quality of the linear regression model.
Chapter 4
Same again, but with logistic regression models.
Python packages for regression
statsmodels
Optimized for insight (focus in this course)
scikit-learn
Optimized for prediction (focus in other DataCamp courses)
Let's practice!
Fitting a linear regression
Straight lines are defined by two things
Intercept
The y value at the point when x is zero.
Slope
The amount the y value increases if you increase x by one.
Equation
y = intercept + slope ∗ x
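As a minimal sketch of this equation (the function and values here are illustrative, not part of statsmodels):

def predict_y(x, intercept, slope):
    # y = intercept + slope * x
    return intercept + slope * x

print(predict_y(x=10, intercept=2, slope=3))  # 32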
Estimating the intercept
(three figure-only slides)
Estimating the slope
(four figure-only slides)
Running a model
from statsmodels.formula.api import ols
mdl_payment_vs_claims = ols("total_payment_sek ~ n_claims",
data=swedish_motor_insurance)
mdl_payment_vs_claims = mdl_payment_vs_claims.fit()
print(mdl_payment_vs_claims.params)
Intercept 19.994486
n_claims 3.413824
dtype: float64
Interpreting the model coefficients
Intercept 19.994486
n_claims 3.413824
dtype: float64
Equation
total_payment_sek = 19.99 + 3.41 ∗ n_claims
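As a sanity check, this equation reproduces the model's predictions. A minimal sketch, assuming mdl_payment_vs_claims from the previous slide (150 claims is an illustrative value):

intercept, slope = mdl_payment_vs_claims.params
# By-hand prediction for a region with 150 claims...
print(intercept + slope * 150)  # ~532.1
# ...matches the model's own prediction for the same value.
print(mdl_payment_vs_claims.predict(pd.DataFrame({"n_claims": [150]})))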
Let's practice!
Categorical explanatory variables
Fish dataset
Each row represents one fish. There are 128 rows in the dataset.
There are 4 species of fish:
Common Bream
European Perch
Northern Pike
Common Roach

species  mass_g
Bream     242.0
Perch       5.9
Pike      200.0
Roach      40.0
...         ...
Visualizing 1 numeric and 1 categorical variable
import matplotlib.pyplot as plt
import seaborn as sns
sns.displot(data=fish,
x="mass_g",
col="species",
col_wrap=2,
bins=9)
plt.show()
Summary statistics: mean mass by species
summary_stats = fish.groupby("species")["mass_g"].mean()
print(summary_stats)
species
Bream 617.828571
Perch 382.239286
Pike 718.705882
Roach 152.050000
Name: mass_g, dtype: float64
Linear regression
from statsmodels.formula.api import ols
mdl_mass_vs_species = ols("mass_g ~ species", data=fish).fit()
print(mdl_mass_vs_species.params)
Intercept 617.828571
species[T.Perch] -235.589286
species[T.Pike] 100.877311
species[T.Roach] -465.778571
Model with or without an intercept
Model with intercept (from the previous slide):

mdl_mass_vs_species = ols(
    "mass_g ~ species", data=fish).fit()
print(mdl_mass_vs_species.params)

Intercept           617.828571
species[T.Perch]   -235.589286
species[T.Pike]     100.877311
species[T.Roach]   -465.778571

The coefficients are relative to the intercept: 617.83 − 235.59 = 382.24!

Model without an intercept:

mdl_mass_vs_species = ols(
    "mass_g ~ species + 0", data=fish).fit()
print(mdl_mass_vs_species.params)

species[Bream]    617.828571
species[Perch]    382.239286
species[Pike]     718.705882
species[Roach]    152.050000

In the case of a single categorical variable, the coefficients are the group means.
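A minimal check that the two parameterizations agree, assuming fish is loaded as above:

from statsmodels.formula.api import ols
mdl_with_intercept = ols("mass_g ~ species", data=fish).fit()
params = mdl_with_intercept.params
# The intercept (the Bream mean) plus the relative Perch coefficient...
print(params["Intercept"] + params["species[T.Perch]"])  # ~382.24
# ...recovers the Perch group mean from the summary statistics slide.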
Let's practice!
Making predictions
The fish dataset: bream
bream = fish[fish["species"] == "Bream"]
print(bream.head())
species mass_g length_cm
0 Bream 242.0 23.2
1 Bream 290.0 24.0
2 Bream 340.0 23.9
3 Bream 363.0 26.3
4 Bream 430.0 26.5
Plotting mass vs. length
sns.regplot(x="length_cm",
y="mass_g",
data=bream,
ci=None)
plt.show()
Running the model
mdl_mass_vs_length = ols("mass_g ~ length_cm", data=bream).fit()
print(mdl_mass_vs_length.params)
Intercept -1035.347565
length_cm 54.549981
dtype: float64
Data on explanatory values to predict
If I set the explanatory variables to these values,
what value would the response variable have?
import numpy as np
explanatory_data = pd.DataFrame({"length_cm": np.arange(20, 41)})
length_cm
0 20
1 21
2 22
3 23
4 24
5 25
...
Call predict()
print(mdl_mass_vs_length.predict(explanatory_data))
0 55.652054
1 110.202035
2 164.752015
3 219.301996
4 273.851977
...
16 928.451749
17 983.001730
18 1037.551710
19 1092.101691
20 1146.651672
Length: 21, dtype: float64
Predicting inside a DataFrame
explanatory_data = pd.DataFrame(
    {"length_cm": np.arange(20, 41)}
)
prediction_data = explanatory_data.assign(
    mass_g=mdl_mass_vs_length.predict(explanatory_data)
)
print(prediction_data)

    length_cm       mass_g
0          20    55.652054
1          21   110.202035
2          22   164.752015
3          23   219.301996
4          24   273.851977
..        ...          ...
16         36   928.451749
17         37   983.001730
18         38  1037.551710
19         39  1092.101691
20         40  1146.651672
Showing predictions
import matplotlib.pyplot as plt
import seaborn as sns
fig = plt.figure()
sns.regplot(x="length_cm",
y="mass_g",
ci=None,
data=bream)
sns.scatterplot(x="length_cm",
y="mass_g",
data=prediction_data,
color="red",
marker="s")
plt.show()
Extrapolating
Extrapolating means making predictions
outside the range of observed data.
little_bream = pd.DataFrame({"length_cm": [10]})
pred_little_bream = little_bream.assign(
mass_g=mdl_mass_vs_length.predict(little_bream))
print(pred_little_bream)
length_cm mass_g
0 10 -489.847756
Let's practice!
Working with model objects
.params attribute
from statsmodels.formula.api import ols
mdl_mass_vs_length = ols("mass_g ~ length_cm", data = bream).fit()
print(mdl_mass_vs_length.params)
Intercept -1035.347565
length_cm 54.549981
dtype: float64
.fittedvalues attribute
Fitted values: predictions on the original dataset.

print(mdl_mass_vs_length.fittedvalues)

or equivalently

explanatory_data = bream["length_cm"]
print(mdl_mass_vs_length.predict(explanatory_data))

0      230.211993
1      273.851977
2      268.396979
3      399.316934
4      410.226930
          ...
30     873.901768
31     873.901768
32     939.361745
33    1004.821722
34    1037.551710
Length: 35, dtype: float64
.resid attribute
Residuals: actual response values minus predicted response values.

print(mdl_mass_vs_length.resid)

or equivalently

print(bream["mass_g"] - mdl_mass_vs_length.fittedvalues)

0    11.788007
1    16.148023
2    71.603021
3   -36.316934
4    19.773070
        ...
dtype: float64
.summary()
mdl_mass_vs_length.summary()
OLS Regression Results
==============================================================================
Dep. Variable: mass_g R-squared: 0.878
Model: OLS Adj. R-squared: 0.874
Method: Least Squares F-statistic: 237.6
Date: Thu, 29 Oct 2020 Prob (F-statistic): 1.22e-16
Time: 13:23:21 Log-Likelihood: -199.35
No. Observations: 35 AIC: 402.7
Df Residuals: 33 BIC: 405.8
Df Model: 1
Covariance Type: nonrobust
==============================================================================
coef std err t P>|t| [0.025 0.975]
------------------------------------------------------------------------------
Intercept -1035.3476 107.973 -9.589 0.000 -1255.020 -815.676
length_cm 54.5500 3.539 15.415 0.000 47.350 61.750
==============================================================================
Omnibus: 7.314 Durbin-Watson: 1.478
Prob(Omnibus): 0.026 Jarque-Bera (JB): 10.857
Skew: -0.252 Prob(JB): 0.00439
Kurtosis: 5.682 Cond. No. 263.
Let's practice!
Regression to the mean
The concept
Response value = fitted value + residual
("The stuff you explained" + "the stuff you couldn't explain")
Residuals exist due to problems in the model and fundamental randomness.
Extreme cases are often due to randomness.
Regression to the mean means extreme cases don't persist over time.
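This decomposition can be verified numerically. A minimal sketch, assuming the bream model from the earlier videos:

import numpy as np
# Fitted values plus residuals reconstruct the observed responses exactly.
reconstructed = mdl_mass_vs_length.fittedvalues + mdl_mass_vs_length.resid
print(np.allclose(reconstructed, bream["mass_g"]))  # True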
Pearson's father son dataset
1078 father/son pairs.
Do tall fathers have tall sons?

father_height_cm  son_height_cm
           165.2          151.8
           160.7          160.6
           165.0          160.9
           167.0          159.5
           155.3          163.3
             ...            ...

Adapted from https://www.rdocumentation.org/packages/UsingR/topics/father.son
Scatter plot
fig = plt.figure()
sns.scatterplot(x="father_height_cm",
y="son_height_cm",
data=father_son)
plt.axline(xy1=(150, 150),
slope=1,
linewidth=2,
color="green")
plt.axis("equal")
plt.show()
Adding a regression line
fig = plt.figure()
sns.regplot(x="father_height_cm",
y="son_height_cm",
data=father_son,
ci = None,
line_kws={"color": "black"})
plt.axline(xy1 = (150, 150),
slope=1,
linewidth=2,
color="green")
plt.axis("equal")
plt.show()
Running a regression
mdl_son_vs_father = ols("son_height_cm ~ father_height_cm",
data = father_son).fit()
print(mdl_son_vs_father.params)
Intercept 86.071975
father_height_cm 0.514093
dtype: float64
Making predictions
really_tall_father = pd.DataFrame(
    {"father_height_cm": [190]})
mdl_son_vs_father.predict(really_tall_father)

183.7

really_short_father = pd.DataFrame(
    {"father_height_cm": [150]})
mdl_son_vs_father.predict(really_short_father)

163.2
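Both predictions follow directly from the coefficients; a by-hand check:

intercept, slope = mdl_son_vs_father.params
print(intercept + slope * 190)  # ~183.7; the son is predicted shorter than the father
print(intercept + slope * 150)  # ~163.2; the son is predicted taller than the father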
Let's practice!
Transforming variables
Perch dataset
perch = fish[fish["species"] == "Perch"].copy()  # copy so new columns can be added safely
print(perch.head())
species mass_g length_cm
55 Perch 5.9 7.5
56 Perch 32.0 12.5
57 Perch 40.0 13.8
58 Perch 51.5 15.0
59 Perch 70.0 15.7
It's not a linear relationship
sns.regplot(x="length_cm",
y="mass_g",
data=perch,
ci=None)
plt.show()
Bream vs. perch (figure-only slide)
Plotting mass vs. length cubed
perch["length_cm_cubed"] = perch["length_cm"] ** 3
sns.regplot(x="length_cm_cubed",
y="mass_g",
data=perch,
ci=None)
plt.show()
Modeling mass vs. length cubed
perch["length_cm_cubed"] = perch["length_cm"] ** 3
mdl_perch = ols("mass_g ~ length_cm_cubed", data=perch).fit()
mdl_perch.params
Intercept -0.117478
length_cm_cubed 0.016796
dtype: float64
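As a quick check, plugging a 20 cm perch (length cubed is 8000) into the fitted equation reproduces, up to rounding, the prediction shown on the next slide:

intercept, slope = mdl_perch.params
print(intercept + slope * 20 ** 3)  # ~134.2 g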
Predicting mass vs. length cubed
explanatory_data = pd.DataFrame({"length_cm_cubed": np.arange(10, 41, 5) ** 3,
"length_cm": np.arange(10, 41, 5)})
prediction_data = explanatory_data.assign(
mass_g=mdl_perch.predict(explanatory_data))
print(prediction_data)
length_cm_cubed length_cm mass_g
0 1000 10 16.678135
1 3375 15 56.567717
2 8000 20 134.247429
3 15625 25 262.313982
4 27000 30 453.364084
5 42875 35 719.994447
6 64000 40 1074.801781
Plotting mass vs. length cubed
On the transformed scale:

fig = plt.figure()
sns.regplot(x="length_cm_cubed", y="mass_g",
            data=perch, ci=None)
sns.scatterplot(data=prediction_data,
                x="length_cm_cubed", y="mass_g",
                color="red", marker="s")

On the original scale:

fig = plt.figure()
sns.regplot(x="length_cm", y="mass_g",
            data=perch, ci=None)
sns.scatterplot(data=prediction_data,
                x="length_cm", y="mass_g",
                color="red", marker="s")
Facebook advertising dataset
How advertising works:
1. Pay Facebook to show ads.
2. People see the ads ("impressions").
3. Some people who see the ad click it.

936 rows. Each row represents 1 advert.

spent_usd  n_impressions  n_clicks
     1.43           7350         1
     1.82          17861         2
     1.25           4259         1
     1.29           4133         1
     4.77          15615         3
      ...            ...       ...
Plot is cramped
sns.regplot(x="spent_usd",
y="n_impressions",
data=ad_conversion,
ci=None)
Square root vs square root
ad_conversion["sqrt_spent_usd"] = np.sqrt(
ad_conversion["spent_usd"])
ad_conversion["sqrt_n_impressions"] = np.sqrt(
ad_conversion["n_impressions"])
sns.regplot(x="sqrt_spent_usd",
y="sqrt_n_impressions",
data=ad_conversion,
ci=None)
Modeling and predicting
mdl_ad = ols("sqrt_n_impressions ~ sqrt_spent_usd", data=ad_conversion).fit()
explanatory_data = pd.DataFrame({"sqrt_spent_usd": np.sqrt(np.arange(0, 601, 100)),
"spent_usd": np.arange(0, 601, 100)})
prediction_data = explanatory_data.assign(sqrt_n_impressions=mdl_ad.predict(explanatory_data),
n_impressions=mdl_ad.predict(explanatory_data) ** 2)
print(prediction_data)
sqrt_spent_usd spent_usd sqrt_n_impressions n_impressions
0 0.000000 0 15.319713 2.346936e+02
1 10.000000 100 597.736582 3.572890e+05
2 14.142136 200 838.981547 7.038900e+05
3 17.320508 300 1024.095320 1.048771e+06
4 20.000000 400 1180.153450 1.392762e+06
5 22.360680 500 1317.643422 1.736184e+06
6 24.494897 600 1441.943858 2.079202e+06
Let's practice!
Quantifying model fit
Bream and perch models (figure-only slide: Bream vs. Perch)
Coefficient of determination
Sometimes called "r-squared" or "R-squared".
The proportion of the variance in the response variable that is predictable from the explanatory variable.
1 means a perfect fit.
0 means the worst possible fit.
.summary()
Look at the value titled "R-squared"
mdl_bream = ols("mass_g ~ length_cm", data=bream).fit()
print(mdl_bream.summary())
# Some lines of output omitted
OLS Regression Results
Dep. Variable: mass_g R-squared: 0.878
Model: OLS Adj. R-squared: 0.874
Method: Least Squares F-statistic: 237.6
.rsquared attribute
print(mdl_bream.rsquared)
0.8780627095147174
It's just correlation squared
coeff_determination = bream["length_cm"].corr(bream["mass_g"]) ** 2
print(coeff_determination)
0.8780627095147173
Residual standard error (RSE)
A "typical" di erence between a prediction and an observed response
It has the same unit as the response variable.
MSE = RSE²
.mse_resid attribute
mse = mdl_bream.mse_resid
print('mse: ', mse)
mse: 5498.555084973521
rse = np.sqrt(mse)
print("rse: ", rse)
rse: 74.15224261594197
Calculating RSE: residuals squared
residuals_sq = mdl_bream.resid ** 2
print("residuals sq: \n", residuals_sq)

residuals sq:
0     138.957118
1     260.758635
2    5126.992578
3    1318.919660
4     390.974309
         ...
30   2125.047026
31   6576.923291
32    206.259713
33    889.335096
34   7665.302003
Length: 35, dtype: float64
Calculating RSE: sum of residuals squared
residuals_sq = mdl_bream.resid ** 2
resid_sum_of_sq = sum(residuals_sq)
print("resid sum of sq:", resid_sum_of_sq)

resid sum of sq: 181452.31780412616
Calculating RSE: degrees of freedom
residuals_sq = mdl_bream.resid ** 2
resid_sum_of_sq = sum(residuals_sq)
deg_freedom = len(bream.index) - 2
print("deg freedom:", deg_freedom)

deg freedom: 33

Degrees of freedom equals the number of observations minus the number of model coefficients.
Calculating RSE: square root of ratio
residuals_sq = mdl_bream.resid ** 2
resid_sum_of_sq = sum(residuals_sq)
deg_freedom = len(bream.index) - 2
rse = np.sqrt(resid_sum_of_sq / deg_freedom)
print("rse :", rse)

rse : 74.15224261594197
Interpreting RSE
mdl_bream has an RSE of 74.
The difference between predicted bream masses and observed bream masses is typically about 74 g.
Root-mean-square error (RMSE)
RSE (denominator: degrees of freedom):

residuals_sq = mdl_bream.resid ** 2
resid_sum_of_sq = sum(residuals_sq)
deg_freedom = len(bream.index) - 2
rse = np.sqrt(resid_sum_of_sq / deg_freedom)
print("rse :", rse)

rse : 74.15224261594197

RMSE (denominator: number of observations):

residuals_sq = mdl_bream.resid ** 2
resid_sum_of_sq = sum(residuals_sq)
n_obs = len(bream.index)
rmse = np.sqrt(resid_sum_of_sq / n_obs)
print("rmse :", rmse)

rmse : 72.00244396727619
Let's practice!
Visualizing model fit
Residual properties of a good fit
Residuals are normally distributed
The mean of the residuals is zero
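Both properties can be checked numerically. A minimal sketch, assuming mdl_bream from the previous video:

import numpy as np
# For an OLS fit with an intercept, the residuals average to (numerically) zero.
print(np.isclose(mdl_bream.resid.mean(), 0))  # True
# Normality is usually judged visually, e.g. with the Q-Q plot shown below.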
Bream and perch again
Bream: the "good" model

mdl_bream = ols("mass_g ~ length_cm", data=bream).fit()

Perch: the "bad" model

mdl_perch = ols("mass_g ~ length_cm", data=perch).fit()
Residuals vs. fitted (figure-only slide: Bream vs. Perch)
Q-Q plot (figure-only slide: Bream vs. Perch)
Scale-location plot (figure-only slide: Bream vs. Perch)
residplot()
sns.residplot(x="length_cm", y="mass_g", data=bream, lowess=True)
plt.xlabel("Fitted values")
plt.ylabel("Residuals")
qqplot()
from statsmodels.api import qqplot
qqplot(data=mdl_bream.resid, fit=True, line="45")
Scale-location plot
model_norm_residuals_bream = mdl_bream.get_influence().resid_studentized_internal
model_norm_residuals_abs_sqrt_bream = np.sqrt(np.abs(model_norm_residuals_bream))
sns.regplot(x=mdl_bream.fittedvalues, y=model_norm_residuals_abs_sqrt_bream, ci=None, lowess=True)
plt.xlabel("Fitted values")
plt.ylabel("Sqrt of abs val of stdized residuals")
Let's practice!
Outliers, leverage, and influence
Roach dataset
roach = fish[fish["species"] == "Roach"].copy()  # copy so new columns can be added safely
print(roach.head())
species mass_g length_cm
35 Roach 40.0 12.9
36 Roach 69.0 16.5
37 Roach 78.0 17.5
38 Roach 87.0 18.2
39 Roach 120.0 18.6
Which points are outliers?
sns.regplot(x="length_cm",
y="mass_g",
data=roach,
ci=None)
plt.show()
Extreme explanatory values
roach["extreme_l"] = ((roach["length_cm"] < 15) |
(roach["length_cm"] > 26))
fig = plt.figure()
sns.regplot(x="length_cm",
y="mass_g",
data=roach,
ci=None)
sns.scatterplot(x="length_cm",
y="mass_g",
hue="extreme_l",
data=roach)
Response values away from the regression line
roach["extreme_m"] = roach["mass_g"] < 1
fig = plt.figure()
sns.regplot(x="length_cm",
y="mass_g",
data=roach,
ci=None)
sns.scatterplot(x="length_cm",
y="mass_g",
hue="extreme_l",
style="extreme_m",
data=roach)
Leverage and influence
Leverage is a measure of how extreme the explanatory variable values are.
Influence measures how much the model would change if you left the observation out of the dataset when modeling.
.get_influence() and .summary_frame()
mdl_roach = ols("mass_g ~ length_cm", data=roach).fit()
summary_roach = mdl_roach.get_influence().summary_frame()
roach["leverage"] = summary_roach["hat_diag"]
print(roach.head())
species mass_g length_cm leverage
35 Roach 40.0 12.9 0.313729
36 Roach 69.0 16.5 0.125538
37 Roach 78.0 17.5 0.093487
38 Roach 87.0 18.2 0.076283
39 Roach 120.0 18.6 0.068387
Cook's distance
Cook's distance is the most common measure of influence.
roach["cooks_dist"] = summary_roach["cooks_d"]
print(roach.head())
species mass_g length_cm leverage cooks_dist
35 Roach 40.0 12.9 0.313729 1.074015
36 Roach 69.0 16.5 0.125538 0.010429
37 Roach 78.0 17.5 0.093487 0.000020
38 Roach 87.0 18.2 0.076283 0.001980
39 Roach 120.0 18.6 0.068387 0.006610
Most influential roaches
print(roach.sort_values("cooks_dist", ascending = False))
species mass_g length_cm leverage cooks_dist
35 Roach 40.0 12.9 0.313729 1.074015 # really short roach
54 Roach 390.0 29.5 0.394740 0.365782 # really long roach
40 Roach 0.0 19.0 0.061897 0.311852 # roach with zero mass
52 Roach 290.0 24.0 0.099488 0.150064
51 Roach 180.0 23.6 0.088391 0.061209
.. ... ... ... ... ...
43 Roach 150.0 20.4 0.050264 0.000257
44 Roach 145.0 20.5 0.050092 0.000256
42 Roach 120.0 19.4 0.056815 0.000199
47 Roach 160.0 21.1 0.050910 0.000137
37 Roach 78.0 17.5 0.093487 0.000020
Removing the most influential roach
roach_not_short = roach[roach["length_cm"] != 12.9]
sns.regplot(x="length_cm",
y="mass_g",
data=roach,
ci=None,
line_kws={"color": "green"})
sns.regplot(x="length_cm",
y="mass_g",
data=roach_not_short,
ci=None,
line_kws={"color": "red"})
Let's practice!
Why you need logistic regression
Bank churn dataset
has_churned  time_since_first_purchase  time_since_last_purchase
          0                  0.3993247                -0.5158691
          1                 -0.4297957                 0.6780654
          0                  3.7383122                 0.4082544
          0                  0.6032289                -0.6990435
        ...                        ...                       ...

has_churned is the response; time_since_first_purchase measures the length of the relationship; time_since_last_purchase measures the recency of activity.

Adapted from https://www.rdocumentation.org/packages/bayesQR/topics/Churn
Churn vs. recency: a linear model
mdl_churn_vs_recency_lm = ols("has_churned ~ time_since_last_purchase",
data=churn).fit()
print(mdl_churn_vs_recency_lm.params)
Intercept 0.490780
time_since_last_purchase 0.063783
dtype: float64
intercept, slope = mdl_churn_vs_recency_lm.params
Visualizing the linear model
sns.scatterplot(x="time_since_last_purchase",
y="has_churned",
data=churn)
plt.axline(xy1=(0, intercept),
slope=slope)
plt.show()
Zooming out
sns.scatterplot(x="time_since_last_purchase",
y="has_churned",
data=churn)
plt.axline(xy1=(0,intercept),
slope=slope)
plt.xlim(-10, 10)
plt.ylim(-0.2, 1.2)
plt.show()
What is logistic regression?
Another type of generalized linear model.
Used when the response variable is logical.
The responses follow a logistic (S-shaped) curve.
Logistic regression using logit()
from statsmodels.formula.api import logit
mdl_churn_vs_recency_logit = logit("has_churned ~ time_since_last_purchase",
data=churn).fit()
print(mdl_churn_vs_recency_logit.params)
Intercept -0.035019
time_since_last_purchase 0.269215
dtype: float64
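Under the hood, the predicted probability is the logistic (inverse logit) transform of the linear combination of these parameters. A minimal sketch, using an illustrative recency of 1:

import numpy as np
intercept, slope = mdl_churn_vs_recency_logit.params
log_odds = intercept + slope * 1
print(1 / (1 + np.exp(-log_odds)))  # ~0.56, matching .predict() at the same value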
Visualizing the logistic model
sns.regplot(x="time_since_last_purchase",
y="has_churned",
data=churn,
ci=None,
logistic=True)
plt.axline(xy1=(0,intercept),
slope=slope,
color="black")
plt.show()
Zooming out (figure-only slide)
Let's practice!
Predictions and odds ratios
The regplot() predictions
sns.regplot(x="time_since_last_purchase",
y="has_churned",
data=churn,
ci=None,
logistic=True)
plt.show()
Making predictions
mdl_recency = logit("has_churned ~ time_since_last_purchase",
data = churn).fit()
explanatory_data = pd.DataFrame(
{"time_since_last_purchase": np.arange(-1, 6.25, 0.25)})
prediction_data = explanatory_data.assign(
has_churned = mdl_recency.predict(explanatory_data))
Adding point predictions
sns.regplot(x="time_since_last_purchase",
y="has_churned",
data=churn,
ci=None,
logistic=True)
sns.scatterplot(x="time_since_last_purchase",
y="has_churned",
data=prediction_data,
color="red")
plt.show()
Getting the most likely outcome
prediction_data = explanatory_data.assign(
has_churned = mdl_recency.predict(explanatory_data))
prediction_data["most_likely_outcome"] = np.round(prediction_data["has_churned"])
Visualizing most likely outcome
sns.regplot(x="time_since_last_purchase",
y="has_churned",
data=churn,
ci=None,
logistic=True)
sns.scatterplot(x="time_since_last_purchase",
y="most_likely_outcome",
data=prediction_data,
color="red")
plt.show()
Odds ratios
Odds ratio is the probability of something happening divided by the probability that it doesn't.

odds_ratio = probability / (1 − probability)

For example, a probability of 0.25 gives:

odds_ratio = 0.25 / (1 − 0.25) = 1/3
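A minimal numeric sketch of the formula:

probability = 0.25
odds_ratio = probability / (1 - probability)
print(odds_ratio)  # 0.333..., i.e. 1/3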
Calculating odds ratio
prediction_data["odds_ratio"] = prediction_data["has_churned"] /
(1 - prediction_data["has_churned"])
Visualizing odds ratio
sns.lineplot(x="time_since_last_purchase",
y="odds_ratio",
data=prediction_data)
plt.axhline(y=1,
linestyle="dotted")
plt.show()
Visualizing log odds ratio
sns.lineplot(x="time_since_last_purchase",
y="odds_ratio",
data=prediction_data)
plt.axhline(y=1,
linestyle="dotted")
plt.yscale("log")
plt.show()
Calculating log odds ratio
prediction_data["log_odds_ratio"] = np.log(prediction_data["odds_ratio"])
All predictions together
time_since_last_purchase  has_churned  most_likely_outcome  odds_ratio  log_odds_ratio
                       0        0.491                    0       0.966          -0.035
                       2        0.623                    1       1.654           0.503
                       4        0.739                    1       2.834           1.042
                       6        0.829                    1       4.856           1.580
                     ...          ...                  ...         ...             ...
Comparing scales
Scale                Are values easy to interpret?  Are changes easy to interpret?  Is it precise?
Probability          ✔                              ✘                               ✔
Most likely outcome  ✔✔                             ✔                               ✘
Odds ratio           ✔                              ✘                               ✔
Log odds ratio       ✘                              ✔                               ✔
Let's practice!
Quantifying logistic regression fit
The four outcomes
predicted false predicted true
actual false correct false positive
actual true false negative correct
Confusion matrix: counts of outcomes
actual_response = churn["has_churned"]
predicted_response = np.round(mdl_recency.predict())
outcomes = pd.DataFrame({"actual_response": actual_response,
"predicted_response": predicted_response})
print(outcomes.value_counts(sort=False))
actual_response predicted_response
0 0.0 141
1.0 59
1 0.0 111
1.0 89
Visualizing the confusion matrix
conf_matrix = mdl_recency.pred_table()
print(conf_matrix)
[[141. 59.]
[111. 89.]]
true negative false positive
false negative true positive
from statsmodels.graphics.mosaicplot import mosaic
mosaic(conf_matrix)
Accuracy
Accuracy is the proportion of correct predictions.

accuracy = (TN + TP) / (TN + FN + FP + TP)

conf_matrix:
[[141., 59.],
 [111., 89.]]

TN = conf_matrix[0, 0]
TP = conf_matrix[1, 1]
FN = conf_matrix[1, 0]
FP = conf_matrix[0, 1]

acc = (TN + TP) / (TN + TP + FN + FP)
print(acc)

0.575
Sensitivity
Sensitivity is the proportion of actual positives that are correctly predicted.

sensitivity = TP / (FN + TP)

conf_matrix:
[[141., 59.],
 [111., 89.]]

TN = conf_matrix[0, 0]
TP = conf_matrix[1, 1]
FN = conf_matrix[1, 0]
FP = conf_matrix[0, 1]

sens = TP / (FN + TP)
print(sens)

0.445
Specificity
Specificity is the proportion of actual negatives that are correctly predicted.

specificity = TN / (TN + FP)

conf_matrix:
[[141., 59.],
 [111., 89.]]

TN = conf_matrix[0, 0]
TP = conf_matrix[1, 1]
FN = conf_matrix[1, 0]
FP = conf_matrix[0, 1]

spec = TN / (TN + FP)
print(spec)

0.705
Let's practice!
Congratulations
You learned things
Chapter 1
Fit a simple linear regression
Interpret coefficients

Chapter 2
Make predictions
Regression to the mean
Transforming variables

Chapter 3
Quantifying model fit
Outliers, leverage, and influence

Chapter 4
Fit a simple logistic regression
Make predictions
Get performance from the confusion matrix
Multiple explanatory variables
Intermediate Regression with statsmodels in Python
Unlocking advanced skills
Generalized Linear Models in Python
Introduction to Predictive Analytics in Python
Linear Classifiers in Python
Happy learning!