DISC 212 Session 13
DISC 212 Session 13
DISC 212 Session 13
Science
Tuesday 24th October 2023
Recap
• Sensitivity Analysis
– Interpreting allowable increase and allowable
decrease columns of the objective function
coefficients
– Interpreting the shadow price of constraints
– Interpreting reduced costs of objective function
coefficients
Way Forward
• 2 class skipped last week
– Online make-up (over the weekend insha’Allah)
• Will not cover more network modeling
problems
• Exam 2 will include predictive models which
we will cover today
– Also sensitivity analysis and basic network models
and shortest path
Types of Mathematical Models
Regression
• Regression Analysis is used to estimate a function f( )
that describes the relationship between a
continuous dependent variable and one or more
independent variables.
Y = f(X1, X2, X3,…, Xn) + e
Note:
• f( ) describes systematic variation in the relationship.
e represents the unsystematic variation (or random error) in the
relationship.
Advertising – Sales Example
• Consider the relationship between advertising
(X1) and sales (Y) for a company.
• There probably is a relationship...
...as advertising increases, sales should increase.
• But how would we measure and quantify this
relationship? Obs
Advertising
(in $1000s)
Actual Sales
(in $1000s)
1 30 184.4
2 40 279.1
3 40 244.0
4 50 314.2
5 60 382.2
6 70 450.2
7 70 423.6
8 70 410.2
9 80 500.4
10 90 505.3
Scatter Plot
600.0
$1000s
Actual Sales in
500.0
400.0
300.0
200.0
100.0
0.0
0 20 40 60 80 100
Advertising in $1000s
Nature of Statistical Relationship
Y
Regression
Curve
X
Simple Linear Regression Model
The scatter plot shows a linear relation between advertising and
sales.
So the following regression model is suggested by the data,
Yi 0 1X1i e i
This refers to the true relationship between the entire population of
advertising and sales values.
Note: The TREND( ) function is dynamically updated whenever any inputs to the
function change. However, it does not provide the statistical information provided by
the regression tool. It is best two use these two different approaches to doing
regression in conjunction with one another.
R2 Statistic
• The R2 statistic indicates how well an estimated
regression function fits the data.
• 0 < R2 < 1
• It measures the proportion of the total variation
in Y around its mean that is accounted for by
the estimated regression equation.
• To understand this better, consider the following
graph...
Understanding R2
Yi (actual value)
Y
* ^
Yi - Y i
Yi - Y ^ (estimated value)
Yi
^ -Y
Y i
Y
^
Y = b0 + b1X
X
Understanding R2
n n n
(Y Y) (Y Y ) (Y Y)
i 1
i
2
i 1
i i
2
i 1
i
2
or,
TSS = ESS + RSS
RSS ESS
R2 1
TSS TSS
Predicting the Dependent Value
Suppose we want to estimate the average levels of
sales expected if $65,000 is spent on advertising.
36.342 5.550X
Yi 1 i
(Y Y )
i 1
i i
2
Se
n k 1
2S
Y
where: h e
Y h b0 b1X1
h
Y i b0 b1X1 b2 X 2 bk X k
i i i
*
* **
*
* * * *
* *
* * * *
* *
* *
*
* *
*
X2 X1
Real Estate Example
• A real estate appraiser wants to develop a model
to help predict the fair market values of residential
properties.
• Three independent variables will be used to
estimate the selling price of a house:
– Total square footage
– Number of bedrooms
– Size of garage
Selecting the Model
• We want to identify the simplest model that
adequately accounts for the systematic variation
in the Y variable.
• Arbitrarily using all the independent variables
may result in overfitting.
• A sample reflects characteristics:
– representative of the population
– specific to the sample
• We want to avoid fitting sample specific
characteristics -- or overfitting the data.
Models with One IV
• With simplicity in mind, suppose we fit three simple
linear regression functions:
Y i b0 b1X1
i
Y b b X
i 0 2 2i
Y b b X
i 0 3 3i
Key regression results are:
Variables Adjusted Parameter
in the Model R2 R2 Se Estimates
X1 0.870 0.855 10.299 b0=9.503, b1=56.394
X2 0.759 0.731 14.030 b0=78.290, b2=28.382
X3 0.793 0.770 12.982 b0=16.250, b3=27.607