Problem Set 03 - Solutions
Problem Set 03 - Solutions
Department of Economics
Spring 2022-2023
Q1) Explain the meaning of SSR and ESS. You may use graphs for illustration purposes.
The explained sum of squares (ESS), alternatively known as the model sum of squares or sum of
squares due to regression, is a quantity used in describing how well a model, often a regression,
represents the data being modelled. SSR is the sum of squares of residuals, it is a measure of the
discrepancy between the data and an estimation model.
1
Q2) State with a reason whether the following statements are true, false, or uncertain.
i) The t test of significance requires that the sampling distributions of estimators 𝛽̂0 and
𝛽̂1 follow the normal distribution.
True. The 𝑡 test is based on variables with a normal distribution. Since the estimators of 𝛽1 and
𝛽0 are linear combinations of the error 𝑢𝑖 , which is assumed to be normally distributed under
CLRM, these estimators are also distributed.
False. All we can say is that at hand does not permit us to reject the null hypothesis.
iii) In the two-variable regression function, if the slope coefficient 𝛽1 is zero, the intercept
𝛽0 is estimated by the sample mean 𝑦̅.
True. Remember
𝛽̂0 = 𝑦̅ − 𝛽̂1 𝑥̅
iv) Even though the disturbance term in the CLRM is not normally distributed, the OLS
estimators are still unbiased.
True. So long as 𝐸(𝑢𝑖 ) = 0, the OLS estimators are unbiased. No probabilistic assumptions are
required to establish unbiasedness.
2
Q3) We estimate a simple linear regression with the following results:
Then,
𝜎̂ 2 = 2.04678
2
∑𝑢̂𝑖 2
𝜎̂ =
𝑇−𝑘
∑𝑢̂𝑖 2
𝜎̂ 2 = = 2.04672 → ∑𝑢̂𝑖 2 = 100.28928
51 − 2
It is given that the estimated variance of 𝛽1 is 0.00098.
1
𝑠𝑒(𝛽̂1 ) = √𝑣𝑎𝑟(𝛽̂1 ) = √𝜎̂ 2 ∗
∑(𝑥𝑖 − 𝑥̅ )2
∑(𝑥𝑖 − 𝑥̅ )2
1
0.00098 = 𝜎̂ 2 ∗
∑(𝑥𝑖 − 𝑥̅ )2
1
0.00098 = 2.04672 ∗
∑(𝑥𝑖 − 𝑥̅ )2
2.04672
∑(𝑥𝑖 − 𝑥̅ )2 = = 2088.4898
0.00098
3
Suppose we define variables as:
iv) If 𝛽1 = 0.18, comment on the results. What happens to mean income when
percentage of graduates increase by 1 percentage point? Does it make sense?
1% point increase in the percent of university graduates will lead to an increase of 180 Turkish
liras (0.18 ∗ 1000 𝑇𝐿) in mean income. It makes sense since increase in educational attainment
leads to higher wages.
Suppose that
𝑥̅ = 69.139 𝑦̅ = 15.187
𝛽̂0 = 𝑦̅ − 𝛽̂1 𝑥̅
∑(𝑥𝑖 )2
We know the value for ∑(𝑥𝑖 − 𝑥̅ )2 , so we can find ∑(𝑥𝑖 )2 by employing that:
4
Q4) Table contains the YKS scores and the GPA for eight college students. GPA is based on a four-
point scale.
i) Estimate the relationship between GPA and YKS using OLS; that is, obtain the
intercept and slope estimates in the equation.
5
Find the following calculation for each individual and corresponding summation:
Student (𝑖) GPA (𝑦𝑖 − 𝑦̅) (𝑦𝑖 − 𝑦̅)2 YKS (𝑥𝑖 − 𝑥̅ ) (𝑥𝑖 − 𝑥̅ )2 (𝑥𝑖 − 𝑥̅ )(𝑦𝑖 − 𝑦̅)
(𝑦𝑖 ) (𝑥𝑖 )
1 2.8 -0.5 0.25 350 -60 3,600 30
2 3.3 0.0 0 360 -50 2,500 0
3 3.5 0.2 0.04 450 40 1,600 8
4 3.7 0.4 0.16 180 70 4,900 28
5 3.8 0.5 0.25 190 80 6,400 40
6 2.9 -0.4 0.16 330 -80 6,400 32
7 2.5 -0.8 0.64 300 -110 12,100 88
8 3.9 0.6 0.36 520 110 12,100 66
SUM 1.86 49,600 292
̂ 𝑖 = 0.84 + 0.006𝑌𝐾𝑆𝑖
𝐺𝑃𝐴
6
ii) Compute the fitted values and residuals for each observation and verify that the
residuals sum to zero.
iii) How much of the variation in GPA for these 8 students is explained by YKS? Explain.
𝑆𝑆𝑅 0.1416
𝑅2 = 1 − =1− = 0.924
𝑇𝑆𝑆 1.86
7
Q5) Consider the money demand 𝑀𝐷 equation y using OLS method. The model is as follows:
𝑀𝑡𝐷 = 𝛽0 + 𝛽1 𝑌𝑡 + 𝑢𝑡
The data covers the period from 1970 and 2009. The following information is obtained from the
data:
Money demand (𝑀𝐷 ) corresponds to 𝑦 and GDP (𝑌) corresponds to 𝑥 in the normal
equation.
i) Estimate the coefficients of the model, the measure of the goodness of fit and the
standard errors of the coefficients.
We know ̅̅̅̅
𝑀𝑑 and 𝑌̅ given the values and sample size (𝑛 = 40)
̅̅̅̅ 120
𝑀𝑑 = =3
40
200
𝑌̅ = =5
40
𝛽̂0 = 𝑦̅ − 𝛽̂1 𝑥̅
∑(𝑥𝑡 − 𝑥̅ )(𝑦𝑡 − 𝑦̅) ∑𝑥𝑡 𝑦𝑡 − 𝑛𝑥̅ 𝑦̅
𝛽̂1 = =
∑(𝑥𝑡 − 𝑥̅ )2 ∑𝑥𝑡2 − 𝑛𝑥̅ 2
1200 − (40 ∗ 5 ∗ 3)
𝛽̂1 = = 0.75
1800 − (40 ∗ 52 )
8
Rewrite 𝑅 2 with our variables:
1800 − 40(5)2
𝑅 2 = (0.75)2 ∗ = 0.90
860 − 40(3)2
𝑆𝑆𝑅 ∑𝑢̂𝑡2
𝑅2 = 1 − =1−
𝑇𝑆𝑆 ∑(𝑦𝑡 − 𝑦̅)2
∑𝑢̂𝑡2
0.90 = 1 −
860 − 40(3)2
Thus,
∑ 𝑢̂𝑡2 = 50
∑𝑢̂𝑡2
𝜎̂ 2 =
𝑛−𝑘
50 50
𝜎̂ 2 = = = 1.316
40 − 2 38
𝜎̂ 2 ∑𝑥𝑡2 𝜎̂ 2 ∑𝑌𝑡2
𝑠𝑒(𝛽̂0 ) = √𝑣𝑎𝑟(𝛽̂0 ) = √ ∗ = √ ∗
𝑛 ∑(𝑥𝑡 − 𝑥̅ )2 𝑛 ∑𝑌𝑡2 − 𝑛𝑌̅ 2
1.316 1800
𝑠𝑒(𝛽̂0 ) = √ ∗ = √0.074 = 0.272
40 800
9
For the slope term:
1 1
𝑠𝑒(𝛽̂1 ) = √𝑣𝑎𝑟(𝛽̂1 ) = √𝜎̂ 2 ∗ = √𝜎̂ 2 ∗
∑(𝑥𝑡 − 𝑥̅ )2 ∑𝑌𝑡 − 𝑛𝑌̅ 2
2
1
𝑠𝑒(𝛽̂1 ) = √1.316 ∗ = √0.0016 = 0.04
800
ii) Construct 95 percent confidence interval for both slope and constant terms.
Firstly, recall 𝛽̂1 and find 𝛽̂0 :
𝛽̂1 = 0.75
0.025
𝑡38 = 2.021
𝛼/2 𝛼/2
𝛽̂0 − 𝑡𝑑𝑓 𝑠𝑒(𝛽̂0 ) ≤ 𝛽̂0 ≤ 𝛽̂0 + 𝑡𝑑𝑓 𝑠𝑒(𝛽̂0 )
𝛼/2 𝛼/2
𝛽̂1 − 𝑡𝑑𝑓 𝑠𝑒(𝛽̂1 ) ≤ 𝛽̂1 ≤ 𝛽̂1 + 𝑡𝑑𝑓 𝑠𝑒(𝛽̂1 )
𝐻0 : 𝛽1 = 0.5
𝐻1 : 𝛽1 ≠ 0.5
That is, the true slope is 0.5 under the null hypothesis but less or greater than 0.5.
under the alternative hypothesis.
10
Recall 𝑡 − distribution and calculate:
Since
𝐻0 : 𝛽1 = 0
𝐻1 : 𝛽1 ≠ 0
𝛽̂1 − 𝛽1 0.75 − 0
𝑡𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 = = = 18.75
𝑠𝑒(𝛽̂1 ) 0.04
0.025
𝑡𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 = 𝑡38 = 2.021
Since
11
Q6) We estimate a simple linear regression with the following results:
𝑌𝑡 = 𝛽0 + 𝛽1 𝑋𝑡 + 𝑢𝑡
𝐻0 : 𝛽1 = 0
𝐻1 : 𝛽1 ≠ 0
Calculate 𝑡 −value:
𝛽̂1 − 𝛽1 0.310 − 0
𝑡𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 = = = 3.780
𝑠𝑒(𝛽̂1 ) 0.082
0.005
𝑡𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 = 𝑡22 = 2.819
Decision criteria:
𝐻0 : 𝛽1 = 0
𝐻1 : 𝛽1 > 0
Calculate 𝑡 −value:
𝛽̂1 − 𝛽1 0.310 − 0
𝑡𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 = = = 3.780
𝑠𝑒(𝛽̂1 ) 0.082
0.01
𝑡𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 = 𝑡22 = 2.508
Decision criteria:
12
iii) Test the following hypothesis with significance level: 5%. Show the rejection region in
a graph.
𝐻0 : 𝛽1 = 0
𝐻1 : 𝛽1 < 0
Calculate 𝑡 −value:
𝛽̂1 − 𝛽1 0.310 − 0
𝑡𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 = = = 3.780
𝑠𝑒(𝛽̂1 ) 0.082
0.05
𝑡𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 = 𝑡22 = −1.717
Decision criteria:
13
iv) Test the following hypothesis with significance level: 5%. Show the rejection region in
a graph.
𝐻0 : 𝛽1 = 0.5
𝐻1 : 𝛽1 ≠ 0.5
Calculate 𝑡 −value:
Decision criteria:
𝛼/2 𝛼/2
𝛽̂1 − 𝑡𝑑𝑓 𝑠𝑒(𝛽̂1 ) ≤ 𝛽1 ≤ 𝛽̂1 + 𝑡𝑑𝑓 𝑠𝑒(𝛽̂1 )
0.079 ≤ 𝛽1 ≤ 0.541
14
Q7) Consider the following regression output:
𝑦𝑖 = 0.2033 + 0.6560 𝑥𝑖
(0.0976) (0.1961)
and
where 𝑦𝑖 is labour force participation rate (LFPR) of women in 1972 and 𝑥𝑖 is LFPR of women in
1968. The regression results were obtained from a sample of 19 cities in the United States. Test
the hypothesis:
𝐻0 : 𝛽1 = 1
𝐻1 : 𝛽1 > 1
Which test do you use? What are the underlying assumptions of the test(s) you use?
Calculate 𝑡 −value:
𝛽̂1 − 𝛽1 0.650 − 1
𝑡𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 = = = −1.78
𝑠𝑒(𝛽̂1 ) 0.1961
0.05
𝑡𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 = 𝑡17 = 1.7396
Decision criteria:
15
Q8) An economics student constructed the following models for her assignment:
𝑀𝑡𝑑 = 𝛽0 + 𝛽1 𝑌𝑡 (1)
𝑙𝑜𝑔(𝑀𝑡𝑑 ) = 𝛽0 + 𝛽1 𝑌𝑡 (4)
where 𝑀𝑑 and 𝑌 are money demand and aggregate income level (in dollar terms). How do we
interpret 𝛽1 coefficients in (1) − (4)?
Model 1: When the aggregate income level increases by one dollar, the money demand increases
by one dollar.
Model 2: When the aggregate income level increases by one percent, the money demand
increases by one dollar.
Model 3: When the aggregate income level increases by one percent, the money demand
increases by one percent.
Model 4: When the aggregate income level increases by one dollar, the money demand increases
by one percent.
16