STAT 497 - Old Exams
STAT 497 - Old Exams
2009
METU-SPRING 2009
STAT 497
MIDTERM EXAM I
QUESTIONS
1. Eliminate parameter redundancy from the following series and determine if they are
stationary and/or invertible.
a) X t 0.8 X t 1 0.15 X t 2 at 0.3at 1
b) X t X t 1 0.5 X t 2 at at 1
c) Xt = −0.7Xt−2 + 1.5Xt−1 +at 0.2at-2
Yt 0.8Yt 1 at 0.5at 1
where a ~ WN 0, 2 .
t a
a) Find the values of p and q.
b) Write the equation of the process using the backward shift operator B.
2
4. Let Yt 15 0.5t Yt 1 at 0.6at 1 where at ~ WN 0, a .
a) Is Yt level stationary and/or trend stationary and/or difference stationary? Explain?
c) After eliminating the trend, which ARMA process fits well to this series?
5. Following plots belongs to several time series data. Each row gives the time series plot, sample
autocorrelation function and sample partial autocorrelation function of one series. Describe the series
1
and explain whether series is stationary or not. If it is stationary, what is the order of ARMA model
for that series? If it is not stationary, explain whether the series is trend stationary and/or difference
stationary by just looking at the plots.
E
6. For three different time series data, the KPSS test is applied and following results are obtained.
Interpret these outputs, assuming that =0.05.
2
a) > kpss.test(x)
b) > kpss.test(y)
7. The hourly chemical viscosity readings are given and before fitting any model to the data, we want
to identify a proper model for this series. The time series plot is given in Table 1.
b) The sACF and sPACF plots are given in Table 2. Do you think that the series is stationary?
Explain.
3
c) Explain the following results.
Minimum Information Criterion
Lags MA 0 MA 1 MA 2 MA 3 MA 4 MA 5
d) We decided to take a regular difference and, the time series plot, sACF/sPACF plots and
inverse autocorrelations are given in Table 3. Interpret the plots and explain whether we did a
good job by differencing or not?
4
Inverse Autocorrelations
Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1
1 -0.91429 | ******************| . |
2 0.88358 | . |****************** |
3 -0.79922 | ****************| . |
4 0.76953 | . |*************** |
5 -0.68733 | **************| . |
6 0.65809 | . |************* |
7 -0.58482 | ************| . |
8 0.55704 | . |*********** |
9 -0.49020 | **********| . |
10 0.46514 | . |********* |
Table 3. The time series plot, sACF/sPACF plots and inverse autocorrelations of the
differenced series.
1. Consider the 100 yearly unit sales of X brand pencil. Based on the ACFs and PACFs of the
original series and its differencing, a reasonable model for the series is chosen as an AR(2)
after differencing. Therefore, we can assume an AR(3) model for the original series and test
for a unit root. Without using assumed initial values, we fit the ADF equation with t = 4,
5, ...,100. The OLS regression equation becomes
Yˆt 15.95 0.75 Yt 1 0.06 Yt 1 0.58 Yt 2
3.10 0.05 0.08 0.08
The corresponding standard errors for the estimators are given in the parenthesis. Calculate
the test statistics to test for a unit root and decide whether to reject the null hypothesis of
existence of a unit root using the DF critical values table.
5
2. For the multiplicative seasonal ARIMA model of order 0,1,1 0,1,2 6 ,
a) Write the model in operator and difference equation form.
b) Can we directly calculate the autocovariances for this model? Why?
c) After taking the regular and seasonal differences obtain the autocovariance function by
using the autocovariance generating function.
3. Consider the model 1 0.6 B 0.8B 2 Yt 10 1 0.2 B at where at ’s are WN with mean
0 and a2 1 .
a) Given Y58 15, Y59 13, Y60 10 and a59 0 , obtain a60 and then first three-step
ahead minimum MSE forecasts for Y61 , Y62 and Y63 , Yˆ60 , 1, 2,3.
b) Find the weights j , j 1, 2,3 . State the recursive formula for j when j 3 .
c) Calculate the forecast error variances for e60 , 1, 2,3 and construct 95% prediction
intervals for Y61 , Y62 and Y63 .
d) Suppose that the observation at t = 61 turns out to be Y61 16 . Find the updated
forecast for Y62 .
4. Consider the following test results
a) For a series with nonzero mean, interpret the test results and discuss what should be done
next based on the test results.
Augmented Dickey-Fuller Unit Root Tests
b) For a series with mean 48.13, interpret the test results and discuss what should be done
next based on the test results.
6
5. The time series plot of quarterly US beer production between 1975 and 1997 is given in the
following figure
b) The sample ACF and PACF of the seasonally differenced series are given below. Interpret the
results and suggest an approprite SARIMA model for this series.
1.0 1.0
0.8 0.8
0.6 0.6
0.4 0.4
0.2 0.2
0.0 0.0
5 10 15 20 5 10 15 20
-0.2 -0.2
-0.4 -0.4
-0.6 -0.6
-0.8 -0.8
-1.0 -1.0
7
Dr. Ceylan Yozgatlıgil 4.6.2009
METU-SPRING 2009
STAT 497
FINAL EXAM
QUESTIONS
Wt 1 2 3 4 5 6 7 8 9 10 11 12 ˆ0 W
1 B 4 Yt .26 .06 −.14 −.38 -.12 .03 −.10 −.12 .03 −.04 .10 .05 25.8 1.8
1 B 1 B 4 Y−.39
t .04 .03 -.31 .10 .18 .01 −.11 .09 −.05 .06 −.10 38.2 0.2
Suggest an appropriate seasonal model (or models) for the series, to the extent that this is possible
based on the limited information, and calculate preliminary estimates for the parameters of the model,
including a2 , using a method of moments technique.
c) Find the weights j , j 1, 2, 3 and 4 in the random shock form of the model.
d) Calculate the forecast error variances for e100 , 1,2,3,4 and construct 95%
prediction intervals for Y101 , Y102 , Y103 and Y104 .
8
3. What type of outliers can be seen in time series analysis? Explain each of them briefly.
4. U.S. monthly unemployment rate for ages 16 and over from January 1948 to April 2009 is analyzed.
a) Interpret the following time series plot.
VALUE
11
10
JAN45 JAN50 JAN55 JAN60 JAN65 JAN70 JAN75 JAN80 JAN85 JAN90 JAN95 JAN00 JAN05 JAN10
date
c) After obtaining a stationary series, following sACF and sPACF plots are observed.
Identify an appropriate ARIMA model using these plots.
Autocorrelations
Partial Autocorrelations
9
Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1
1 0.11323 | .|** |
2 0.28963 | .|****** |
3 0.18024 | .|**** |
4 0.07092 | .|* |
5 0.08855 | .|** |
6 -0.02670 | *|. |
7 -0.07436 | *|. |
8 -0.02251 | .|. |
9 -0.01007 | .|. |
10 -0.13536 | ***|. |
11 0.02758 | .|* |
12 -0.12111 | **|. |
13 -0.01923 | .|. |
14 0.01524 | .|. |
15 0.04384 | .|* |
d) After deciding on two models, following estimation results are obtained for the identified
models. Identify the insignificant parameters in these models, if there are any. Which
estimation result is better? Discuss the reasons.
Maximum Likelihood Estimation I
Standard Approx
Parameter Estimate Error t Value Pr > |t| Lag
Standard Approx
Parameter Estimate Error t Value Pr > |t| Lag
10
e) You have decided on a best model fit but now you want to check the diagnostics. Several
diagnostic analyses are performed and following results are obtained. Explain each
analysis and discuss whether assumptions are violated or not. If violated, discuss what
should be done to correct this.
Autocorrelation Check of Residuals
To Chi- Pr >
Lag Square DF ChiSq --------------------Autocorrelations--------------------
Residual: Actual-Forecast
2
-1
-2
JAN45 JAN50 JAN55 JAN60 JAN65 JAN70 JAN75 JAN80 JAN85 JAN90 JAN95 JAN00 JAN05 JAN10
date
45
40
35
30
P
e
r 25
c
e
n 20
t
15
10
-1.5 -1.3 -1.1 -0.9 -0.7 -0.5 -0.3 -0.1 0.1 0.3 0.5 0.7 0.9 1.1
Residual: Actual-Forecast
11
Goodness-of-Fit Tests for Normal Distribution
1.5
R
e
s
i 1.0
d
u
a
0.5
l
:
A 0
c
t
u
a -0.5
l
-
F
-1.0
o
r
e
c -1.5
a
s
t
-2.0
-4 -3 -2 -1 0 1 2 3 4
Normal Quantiles
Alternative LM Pr > LM
Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1
1 0.33758 | .|******* |
2 -0.06627 | *|. |
3 0.02459 | .|. |
4 -0.01110 | .|. |
5 0.07115 | .|* |
6 0.04897 | .|* |
7 -0.04531 | *|. |
8 0.05434 | .|* |
9 0.04447 | .|* |
10 -0.00978 | .|. |
Heteroscedasticity Test
Equation Test Statistic DF Pr > ChiSq Variables
12
f) The following forecasts and 95% confidence limits are obtained at the end of the model
fit. Interpret the plot and discuss the reasonability of the forecasts.
VALUE
12
11
10
JAN72 JAN74 JAN76 JAN78 JAN80 JAN82 JAN84 JAN86 JAN88 JAN90 JAN92 JAN94 JAN96 JAN98 JAN00 JAN02 JAN04 JAN06 JAN08 JAN10
date
1. (20 pts.) Consider the following processes where it is assumed that {at}’s are White Noise
with zero mean and a2 variance.
a. Yt 0.4Yt 1 at 0.6at 2
b. Yt 0.8Yt 1 0.3Yt 2 at
c. Yt 0.2Yt 1 at 1.2at 1 0.8at 2
d. Yt 1.5Yt 1 0.56Yt 2 at 1.3at 1 0.4at 2
i) For each of the above process, identify the process as ARMA(p,q).
ii) Express each of the above process in backshift operator form.
iii) For each of the above process, determine if they are stationary and/or invertible (p.s.
please eliminate any parameter redundancy, if exists).
13
2. (15 pts.) Consider the process
where at ~ WN 0, a2 . .
3. (15 pts.) To be able to explain the variations in quarterly profit of U.S. manufacturing
corporations, the sales for U.S. manufacturing corporations is used. Besides of sales, linear
time trend and sesonal dummies (S1, S2, S3 and S4) are used to explain the seasonal variation
in the series. Time series plot of the profit series is given below:
14
Profit = -26510+0.2784*Sales-2755*Trend-491.4*S2-515.7*S3-1650*S4
(5605) (0.04) (457) (483) (377) (492)
Interpret the coefficients of these equations and explain the difference of them in terms of
explaining the variation of profit series.
4. (10 pts.) For one time series, the KPSS test is applied and following results are obtained.
Interpret these outputs, assuming that =0.05.
5. (10 pts.) Monthly values of the The Southern Oscillation Index (SOI) during 1950-1995. This
series consists of 540 observations on the SOI, computed as the ``difference of the departure from
the long-term monthly mean sea level pressures'' at Tahiti in the South Pacific and Darwin in
Northern Australia. The index is one measure of the so-called "El Nino-Southern Oscillation" -- an
event of critical importance and interest in climatological studies in recent decades. The fact that
most of the observations in the last part of the series take negative values is related to a recent
warming in the tropical Pacific, and this pattern has been of key interest in recent studies. A key
question of interest is to determine just how unusual this event is, and if it can reasonably be
explained by standard "stationary" time series models, or requires models that include drifts/trends
that may be related to global climatic change.
15
KPSS Test for Level Stationarity
KPSS Level = 0.6675, Truncation lag parameter = 5, p-value = 0.01650
KPSS Test for Trend Stationarity
KPSS Trend = 0.1412, Truncation lag parameter = 5, p-value = 0.05897
Phillips-Perron Unit Root Test
Dickey-Fuller Z(alpha) = -230.5145, Truncation lag parameter = 6,
p-value = 0.01
Warning message: In pp.test(soi): p-value smaller than printed p-value
16
B
Match the presented time series plots A to D with the generating stochastic processes given by i) to
vi). Explain the reasons of your identification.
7. (10 pts.) Following time series plot and ACF/PACF plots belongs to Ice Cream and Frozen
Desserts PPI from January 2000 to December 2009
a) Interpret these plots. What are the necessary steps that we need to consider next?
17
b) We decided to take a regular and seasonal difference to work with a statioanry series. The
time series plots and ACF/PACF plots of the differenced series given below. Interpret these
plots. Identify an appropriate SARIMA(p,d,q)(P,D,Q)s model for this series.
1. A time series of n = 100 observations gave the following sample ACF and sample PACF,
Lag k 1 2 3 4 5 6 7 8
rk 0.22 −0.68 −0.47 −0.34 0.56 −0.01 −0.51 −0.26
kk 0.22 −0.76 −0.09 0.05 0.09 0.04 −0.11 −0.11
with the sample mean Z 10.04 and sample variance ˆ0 2.77 . An AR(2) model,
1 B B Z
1 2
2
t at , appears to be appropriate for the data.
a) Obtain the Yule-Walker estimates ˆ1 and ˆ2 . Also, obtain estimates for and a2 .
b) Using ˆ1 and ˆ2 as the true values of 1 and 2 , calculate k , k 1, 2,3, 4.
2. Consider the following ARIMA model
18
Y47=19 Y48=22 Y49=17 Y50=21
h) Find the weights j , j 1, 2,3 in the random shock form of the model.
i) Calculate the forecast error variances for e50 , 1,2 ,3 and construct 95% prediction
intervals for Y51 ,Y52 and Y53 .
3. Consider the ARCH(1) model nt t et , where et are i.i.d. random variable with mean 0 and
variance 1, and t2 0 1nt21 . Show that the unconditional variance of nt is
0
Var nt .
1 1
4. Consider the two-dimensional vector AR(1) process
Time series plot of Lydia Pinkham’s Advertising and Sales Data from 1907-1960
19
4000
3000
Sales
2000
Advertisement
1000
b) What is the purpose of the Granger-Causality test? Explain. Interpret the below output.
Are the results logical? Explain.
Granger-Causality Wald Test
1 1 16.91 <.0001
2 1 8.73 0.0031
6. Consider the number of visitors (in thousands) to a Colorado ski resort data between the first
quarter of 1981 to last quarter of 1986.
a) Interpret the below time series plot.
20
b) Considering the below sample ACF, sample PACF plots and ADF test results what
should we do next? Why? Explain.
R output
21
Coefficients:
ar1 sar1
-0.7166 -0.6931
s.e. 0.1526 0.1645
Which model is fitted to the data? Are all parameters in the model significant? Write the model.
d) The diagnostic analysis on the residuals are given below. Explain each graph and test result. Do
you think that the model that you found in part c) is a plausible model for the data? If not, what do
you suggest?
To Chi- Pr >
Lag Square DF ChiSq -------------Autocorrelations------------
22
Jarque Bera Test
data: resid(ski.fit)
X-squared = 10.897, df = 2, p-value = 0.004303
The Sample ACF, Sample PACF plots and “Autocorrelation Check for White Noise” of
Squared Residuals
Autocorrelations
Partial Autocorrelations
Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1
1 0.34013 | . |******* . |
2 -0.26591 | . *****| . |
23
3 -0.12504 | . ***| . |
4 -0.08301 | . **| . |
5 0.13486 | . |*** . |
6 -0.15290 | . ***| . |
7 -0.00885 | . | . |
8 -0.10794 | . **| . |
9 -0.06764 | . *| . |
10 -0.18666 | . ****| . |
11 0.17740 | . |**** . |
12 -0.12110 | . **| . |
13 -0.11265 | . **| . |
14 -0.05091 | . *| . |
15 0.05536 | . |* . |
16 -0.14527 | . ***| . |
To Chi- Pr >
Lag Square DF ChiSq -------------Autocorrelations------------
Name:__________________________________________________________________
Signature: ______________
QUESTIONS
1. (20 pts.) Consider the following processes where it is assumed that {at}’s are White Noise
with zero mean and a2 variance.
24
a. Yt 0.2Yt 1 at 0.1at 1
b. Yt 0.1Yt 1 0.12Yt 2 at .9at 1 0.2at 2
c. Yt 0.2Yt 1 0.8Yt 2 at 0.3at 1 0.4at 2
iv) For each of the above process, identify the process as ARMA(p,q).
v) Express each of the above process in backshift operator form.
vi) For each of the above process, determine if they are stationary and/or invertible (p.s.
please eliminate any parameter redundancy, if exists).
i .i .d
2. (15 pts.) Let Xt be a MA(1) process: X t at at 1 where at ~ N 0,1 . Let
Yt X t X t 1 ,t 1,2 ,
a) Define the name of the process for Yt . Write the form of the process.
b) Derive the autocorrelation function of Yt .
Yt Yt 1 0.6Yt 2 1 at 0.5at 1
where at ~ WN 0, a2 . .
d. Is the process stationary?
e. Write the model in Random Shock Form and Inverted Form. If you cannot write them in
desired forms, explain the reason and take necessary actions to write the process in these
forms.
f. Find the mean and the autocovariance function of the model. If you cannot find them,
explain the reason and take necessary actions to find the mean and autocovarince function
of the process.
4. (10 pts.) Below output is obtained after applying a first order difference to the series. What
can you conclude about the necessity of differencing and the order of the process?
Autocorrelations
Inverse Autocorrelations
Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1
25
1 0.69470 | . |************** |
2 0.59781 | . |************ |
3 0.63658 | . |************* |
4 0.57689 | . |************ |
5 0.44585 | . |********* |
6 0.52649 | . |*********** |
7 0.49690 | . |********** |
8 0.40663 | . |******** |
9 0.40134 | . |******** |
10 0.35074 | . |******* |
11 0.26079 | . |***** |
12 0.27159 | . |***** |
13 0.27067 | . |***** |
14 0.17777 | . |**** |
15 0.21052 | . |**** |
Partial Autocorrelations
Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1
1 -0.36487 | *******| . |
2 -0.11063 | . **| . |
3 -0.21015 | ****| . |
4 -0.27139 | *****| . |
5 0.03307 | . |* . |
6 0.03035 | . |* . |
7 -0.21807 | ****| . |
8 -0.12548 | .***| . |
9 -0.07506 | . **| . |
10 -0.09722 | . **| . |
11 -0.07667 | . **| . |
12 -0.00303 | . | . |
13 -0.15226 | .***| . |
14 0.06496 | . |* . |
15 -0.00928 | . | . |
26
5. (20 pts.) Consider the following stochastic processes:
v) Yt 0.1Yt 1 0.9Yt 2 at
vi) Yt at 0.5at 1 0.8at 2
vii) Yt 0.6Yt 1 0.4Yt 2 at 0.5at 1 0.8at 2 0.3at 3
viii) Yt 50 0.5t 0.3t 2 at
Match the presented time series plots A to D with the generating stochastic processes given by i) to
vi). Explain the reasons of your identification.
6. (15 pts.) Consider the analysis of quarterly U.S. GNP from 1947(1) to 2002(3), n = 223
observations. The data are Real U.S. Gross National Product in billions of chained 1996
27
dollars and they have been seasonally adjusted. The data were obtained from the Federal
Reserve Bank of St. Louis (http://research.stlouisfed.org/).
d) By considering the above ACF plot and below output, what can you say about the stationarity
of the series? What are the necessary steps that we need to consider next?
e) After having a stationary process, the time series plots and ACF/PACF plots of the
differenced series given below. Interpret these plots. Identify an appropriate ARIMA(p,d,q)
model for this series.
EACF TABLES
28
AR/MA (gnp) AR/MA (diff(gnp)
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
0 xxxxxxxxxxx 0 xxxoooooooo
1 xxooooooooo 1 xoooooooooo
2 xoooooooooo 2 ooooooooooo
3 xxooooooooo 3 xoxoooooooo
4 xxooooooooo 4 oxxoooooooo
5 oxooooooooo 5 xxooooooooo
6 oxxoooooooo 6 xxoxoooxooo
7 xoxooooxooo 7 xooxooxoooo
8 xoxoooxxooo 8 xoxxxoxoooo
9 xxxoooxoooo 9 xoxoxxxoooo
10 x o x x x x x x o o o 10 x o x o o o x x o o o
Lags MA 0 MA 1 MA 2 MA 3 MA 4 MA 5
QUESTIONS
Quarters I II III IV
Series 15 10 15 30
29
Residuals 1 2 2 3
3. Suppose that annual sales (in millions of dollars) of the Acne Corporation follow the AR(2)
model
Yt 10 0.1Yt 1 0.6Yt 2 at
with a2 1.
a) If sales for 2006, 2007, and 2008 were $5 million, $8 million, and $9 million, respectively,
forecast for 2009 and 2010.
b) Show that 1 0.1 for this model.
c) Calculate 95% prediction limits for your forecasts in part a) for 2009 and 2010.
d) If sales in 2009 turn out to be $10 million, update your forecast for 2010.
5. Levels of carbon dioxide (CO2) are monitored at several sites around the world to investigate
atmospheric changes. One of the sites is at Alert, Northwest Territories, Canada, near the
Arctic Circle. The time series plot of monthly CO2 levels from January 1994 through
December 2004 is given below.
b) The Least Square fit for the monthly time trend with seasonal dummies is given below.
Interpret the regression output.
Call:
lm(formula = co2 ~ 0 + trend + Q, na.action = NULL)
Residuals:
Min 1Q Median 3Q Max
-1.73874 -0.59689 -0.06947 0.54086 2.15539
30
Coefficients:
Estimate Std. Error t value Pr(>|t|)
trend 1.8321 0.0221 82.90 <2e-16 ***
Q1 -3290.5412 44.1790 -74.48 <2e-16 ***
Q2 -3289.8730 44.1809 -74.46 <2e-16 ***
Q3 -3289.5775 44.1827 -74.45 <2e-16 ***
Q4 -3289.3102 44.1846 -74.44 <2e-16 ***
Q5 -3289.0138 44.1864 -74.44 <2e-16 ***
Q6 -3291.2173 44.1883 -74.48 <2e-16 ***
Q7 -3297.8264 44.1901 -74.63 <2e-16 ***
Q8 -3303.9827 44.1919 -74.76 <2e-16 ***
Q9 -3303.3617 44.1938 -74.75 <2e-16 ***
Q10 -3298.8017 44.1956 -74.64 <2e-16 ***
Q11 -3294.4689 44.1975 -74.54 <2e-16 ***
Q12 -3291.8779 44.1993 -74.48 <2e-16 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
6. Consider the monthly U.S. electricity generation (in millions of kilowatt hours) of all types:
coal, natural gas, nuclear, petroleum, and wind, 01/1973 - 12/2005.
a) The time series plot, ACF and PACF plots are given below. Please interpret the plots.
31
b) ADF and HEGY test results can be seen in the below output. Please interpret the outputs
and explain what should be the next step.
## ADF test with constant, no trend and seasonal dummies.
> ADF.test(wts=electricity, itsd=c(1,0,c(1:11)),regvar=0, selectlags=list(mode="bic",
Pmax=12))
--------- ------ - ------ ----
Augmented Dickey & Fuller test
--------- ------ - ------ ----
ADF statistic:
ADF statistic:
---- ----
HEGY test
---- ----
HEGY statistics:
Stat. p-value
tpi_1 -1.494 0.10
tpi_2 -3.856 0.01
Fpi_3:4 33.858 0.10
Fpi_5:6 28.442 0.10
Fpi_7:8 3.747 0.01
Fpi_9:10 12.883 0.10
Fpi_11:12 2.303 0.01
Fpi_2:12 20.113 NA
Fpi_1:12 18.453 NA
Lag orders: 3 12
Number of available observations: 372
32
After a seasonal difference, we have
What is your preliminary guess about the order of the seasonal multiplicative model? Please write the
model that you suggest?
33
Dr. Ceylan Yozgatlıgil 10.6.2011
METU-SPRING 2011
STAT 497
FINAL EXAM
QUESTIONS
1. A non-seasonal time series of n = 100 observations gave the following sample ACF and
sample PACF for the original data,
Lag k 1 2 3 4 5 6 7 8 9 10
rk 0.73 0.11 −0.50 −0.79 −0.63 −0.15 0.36 0.64 0.55 0.17
kk 0.73 −0.91 0.09 0.03 0.08 −0.05 −0.03 0.12 −0.14 −0.05
with the sample mean Y 0.07 and sample variance ˆ 0 5.29 and for the first differenced data
we have
Lag k 1 2 3 4 5 6 7 8 9 10
rk 0.65 −0.01 −0.61 −0.84 −0.59 −0.06 0.44 0.68 0.54 0.11
kk 0.65 −0.75 −0.42 −0.33 −0.15 −0.16 −0.24 0.00 −0.09 −0.13
c) Find the weights j , j 1, 2,3 in the random shock form of the model.
d) Calculate the forecast error variances for e200 , 1,2 ,3 and construct 95% prediction
intervals for Y201 ,Y202 and Y203 . .
34
3. Consider the monthly seasonally adjusted U.S. Disposable Personal Income Per Capita and M2
money stock data from 1959:1 to 2011:1.
a) What features of the data are evident in the time series plots and related statistics that should
be captured by a model? Is there other evidence that would be useful in formulating a
modeling strategy?
c) We perform a Johansen cointegration test to these series and received the following results.
Explain what this output tells you.
Cointegration Rank Test Using Trace
5%
H0: H1: Critical Drift Drift in
Rank=r Rank>r Eigenvalue Trace Value in ECM Process
35
d) We decided to fit a VECM(4) to the series and obtain the following output on the residuals.
Interpret each output and if the assumptions are not satisfied, suggest a method to fix it.
income +- .. .. .. .. .+ ..
m2 -+ .. .. .. .- .. ..
5 4 31.48 <.0001
6 8 35.03 <.0001
Standard
Variable R-Square Deviation F Value Pr > F
36
4. U.S. quarterly Real Gross Domestic Product data are obtained from U.S. Department of
Commerce. Data start from first quarter of 1947 to first quarter of 2011.
a) What features of the data are evident in the time series plots and related statistics that should
be captured by a model? Is there other evidence that would be useful in formulating a
modeling strategy?
b) Interpret the below outputs and discuss what should be the next step.
HEGY.test (gdp, c(1,0,c(0)), regvar=0, selectlags=list(mode="signf", Pmax=NULL))
HEGY statistics:
Stat. p-value
tpi_1 -1.527 0.10
tpi_2 -8.078 0.01
Fpi_3:4 180.727 0.01
Fpi_2:4 734.935 NA
Fpi_1:4 558.200 NA
----
HEGY statistics:
Stat. p-value
tpi_1 -3.500 0.01
tpi_2 -6.759 0.01
Fpi_3:4 82.517 0.01
Fpi_2:4 85.489 NA
Fpi_1:4 66.266 NA
c) After a regular order of difference, we obtain the below time series plot, sample ACF and
sample PACF plots. Interpret the plots and suggest a tentative model.
37
d) You can see six different fitted models for the series below. Interpret the results and choose one
model as the best one for this time series. Explain the reasoning.
> fit1=arima(gdp,order=c(2,1,0),seasonal=list(order=c(2,0,0), period=4))
> fit1
Call:
arima(x = gdp, order = c(2, 1, 0), seasonal = list(order = c(2, 0, 0), period = 4))
Coefficients:
ar1 ar2 sar1 sar2
0.4842 0.3162 -0.0198 -0.1198
s.e. 0.0602 0.0677 0.0727 0.0679
Call:
arima(x = gdp, order = c(2, 1, 2), seasonal = list(order = c(2, 0, 0), period = 4))
Coefficients:
ar1 ar2 ma1 ma2 sar1 sar2
0.0605 0.7901 0.3714 -0.2949 -0.1256 -0.1659
s.e. 0.0955 0.1053 0.1310 0.1304 0.0780 0.0695
Call:
arima(x = gdp, order = c(2, 1, 2), seasonal = list(order = c(2, 0, 1), period = 4))
Coefficients:
ar1 ar2 ma1 ma2 sar1 sar2 sma1
0.0255 0.9043 0.3589 -0.4158 0.2824 -0.1659 -0.4933
38
s.e. 0.0518 0.0620 0.1061 0.1203 0.1340 0.0712 0.1318
Call:
arima(x = diff(gdp), order = c(2, 1, 3), seasonal = list(order = c(2, 0, 1),
period = 4))
Coefficients:
ar1 ar2 ma1 ma2 ma3 sar1 sar2 sma1
-0.3477 0.4668 -0.2431 -0.6768 0.0012 0.3553 -0.1937 -0.4686
s.e. 0.1543 0.1392 0.1687 0.1302 0.1335 0.1462 0.0701 0.1445
Call:
arima(x = gdp, order = c(2, 1, 2), seasonal = list(order = c(2, 0, 2), period = 4))
Coefficients:
ar1 ar2 ma1 ma2 sar1 sar2 sma1 sma2
0.0419 0.8538 0.3509 -0.3371 1.0561 -0.6825 -1.2874 0.7230
s.e. 0.0577 0.0663 0.1030 0.1070 0.1447 0.0990 0.1501 0.1153
Call:
arima(x = gdp, order = c(3, 1, 2), seasonal = list(order = c(2, 0, 2), period = 4))
Coefficients:
ar1 ar2 ar3 ma1 ma2 sar1 sar2 sma1 sma2
0.6857 0.7802 -0.4686 -0.2835 -0.6324 1.0094 -0.7214 -1.1548 0.6706
s.e. 0.1338 0.1304 0.0701 0.1374 0.1104 0.1388 0.1263 0.1455 0.1457
39
> shapiro.test(window(rstandard(fit)))
data: window(rstandard(fit))
W = 0.975, p-value = 0.0001719
40
`
> Box.test(resid(fit),lag=15,type = c("Ljung-Box"))
Box-Ljung test
data: resid(fit)
X-squared = 10.0169, df = 15, p-value = 0.8187
Squared residuals are obtained and ACF and PACF plots are obtained as follows. Why are looking at
the squared residuals? What these graphs tell us?
41
f) We obtained the following forecasts and prediction limits. What can you say about these forecasts
and prediction limits? Are they plausible?
X 1t 1 0 X 1,t 1 e1t
X 0.6
0 X 2,t 1 e2t
2t
6. Consider a quarterly data for U.S. from 1970:1 to 2008:4 (n=156) with variables
42
yt: log of real GDP (in billion of 2000 dollars)
a) What is the problem when nonstationary series are used in regression analysis?
b) Why do we use augmented Dickey-Fuller test?
c) We perform a Granger-causality test. Analyze the results of Granger-causality Wald test at 5%
significance level.
Chi-square df p-value
m y 18.222 6 0.006
y m 2.205 6 0.900
1. A time series of n = 100 observations gave the following sample ACF and sample PACF
for the original data,
Lag k 1 2 3 4 5 6 7 8 9 10
rk 0.73 0.11 −0.50 −0.79 −0.63 −0.15 0.36 0.64 0.55 0.17
kk 0.73 −0.91 0.09 0.03 0.08 −0.05 −0.03 0.12 −0.14 −0.05
with the sample mean Y 0.07 and sample variance ˆ 0 5.29 and for the first differenced data
we have
Lag k 1 2 3 4 5 6 7 8 9 10
rk 0.65 −0.01 −0.61 −0.84 −0.59 −0.06 0.44 0.68 0.54 0.11
kk 0.65 −0.75 −0.42 −0.33 −0.15 −0.16 −0.24 0.00 −0.09 −0.13
43
2. Consider the following ARIMA model
d) Find the weights j , j 1, 2,3 in the random shock form of the model.
e) Calculate the forecast error variances for e50 , 1,2 ,3 and construct 95%
prediction intervals for Y51 ,Y52 and Y53 .
3. Consider the two-dimensional vector AR(1) process
44
4. Lydia Pinkham’s Vegetable Compound was first brewed and prepared by the lady of the
same name as a home remedy of her friends. The compound, an herbal extract in alcoholic
solution, was considered to be effective against “women’s weakness”. The first
commercial sale of the medicine was in 1873, when sales reached $500,000. A part of the
profits was reinvested immediately into high intensity advertising campaigns. Records of
a court litigation involving a family argument produced a complete listing of the firm’s
annual advertising expenditures and sales over the period 1908-1935. Palda (1964) was
later able to obtain the data from 1907-1960.
Time series plot of Lydia Pinkham’s Advertising and Sales Data from 1907-1960
4000
3000
Sales
2000
Advertisement
1000
ii. What is the purpose of the Granger-Causality test? Explain. Interpret the below output.
Are the results logical? Explain.
Granger-Causality Wald Test
1 1 16.91 <.0001
2 1 8.73 0.0031
45
5. Consider the number of visitors (in thousands) to a Colorado ski resort data between the
first quarter of 1981 to last quarter of 1986.
i. Interpret the below time series plot.
ii. Considering the below sample ACF, sample PACF plots and ADF test results what
should we do next? Why? Explain.
46
c) After regular and seasonal differencing, we fit a certain model to the data. The results are given in
the below table:
R output
Coefficients:
ar1 sar1
-0.7166 -0.6931
s.e. 0.1526 0.1645
Which model is fitted to the data? Are all parameters in the model significant? Write the model.
d) The diagnostic analysis on the residuals are given below. Explain each graph and test result. Do
you think that the model that you found in part c) is a plausible model for the data? If not, what do
you suggest?
To Chi- Pr >
Lag Square DF ChiSq -------------Autocorrelations------------
47
Jarque Bera Test
data: resid(ski.fit)
X-squared = 10.897, df = 2, p-value = 0.004303
The Sample ACF, Sample PACF plots and “Autocorrelation Check for White Noise” of
Squared Residuals
Autocorrelations
48
Partial Autocorrelations
Lag Correlation -1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1
1 0.34013 | . |******* . |
2 -0.26591 | . *****| . |
3 -0.12504 | . ***| . |
4 -0.08301 | . **| . |
5 0.13486 | . |*** . |
6 -0.15290 | . ***| . |
7 -0.00885 | . | . |
8 -0.10794 | . **| . |
9 -0.06764 | . *| . |
10 -0.18666 | . ****| . |
11 0.17740 | . |**** . |
12 -0.12110 | . **| . |
13 -0.11265 | . **| . |
14 -0.05091 | . *| . |
15 0.05536 | . |* . |
16 -0.14527 | . ***| . |
To Chi- Pr >
Lag Square DF ChiSq -------------Autocorrelations------------
6. State whether the following statements are TRUE (T) or FALSE (F)? And if your answer
is FALSE, write the TRUE statement below it.
( ) If the seasonal difference (i.e., the season-to-season change) of a time series looks like
stationary noise, this suggests that the process is seasonal random walk.
( ) The idea behind cointegration is to find a linear combination between two I(d)-variables that
yields a variable with a lower order of integration.
49
( ) White's (1980) test is a test of the null hypothesis of no heteroskedasticity against
heteroskedasticity of unknown, general form.
( ) The correlograms of the squared residuals can be used to check autoregressive conditional
heteroskedasticity. If there is ARCH in the residuals, the autocorrelations and partial autocorrelations
should be zero at all lags and the Ljung-Box Q-statistics should not be significant.
( ) The ARCH model does not provide any new insight for understanding the source of variations of
a financial time series. They only provide a mechanical way to describe the behavior of the
conditional variance. It gives no indication about what cause such behavior to occur.
( ) If we decide a spurious regression between any two variable, it means that there is cointegration
between these variables.
( ) Vector error correction model (VECM) is adding error correction feature to a multi-factor model
such as a vector autoregression model.
1. From a series of length 100, we have computed the sample autocorrelations as r1 = 0.8, r2 = 0.5,
r3 = 0.4, Y = 2, and a sample variance of 5. If we assume that an AR(2) model with a constant
term is appropriate,
a) Obtain the Yule-Walker estimates of 1, 2, θ0, and 2, where 0 is the constant term and 2 is the
error variance?
b) Using the estimates of the parameters that you found in part a), obtain the two-step ahead forecasts
with the 95% prediction intervals assuming that Y99=5 and Y100=3.
50
X1,t 0.4 0.6 X1,t 1 a1,t 0.8 0.3a1,t 1
b)
X 2,t 0.2 0.3 X 2,t 1 a2,t 0.4 0.15 a2,t 1
3. Quarterly oil prices of Turkey data are obtained from TURKSTAT from 1996 to 2011.
d) Interpret the following outputs separately and discuss about the next step to follow.
> HEGY.test(wts =oil, itsd = c(1, 0,c(0)), regvar = 0)
HEGY statistics:
Stat. p-value
tpi_1 0.304 0.10
tpi_2 -5.407 0.01
Fpi_3:4 7.807 0.01
Fpi_2:4 28.932 NA
Fpi_1:4 21.747 NA
Stat. p-value
tpi_1 0.001 0.10
tpi_2 -3.320 0.01
Fpi_3:4 1.561 0.10
Fpi_2:4 7.740 NA
Fpi_1:4 6.343 NA
51
Stat. p-value
tpi_1 -8.288 0.010
tpi_2 -7.219 0.011
Fpi_3:4 34.417 0.010
Fpi_2:4 36.234 NA
Fpi_1:4 30.594 NA
c) You can see five different fitted models for the series below. Interpret the results and choose one model
as the best one for this time series. Explain the reasoning.
Coefficients:
ar1 ar2 ma1 sma1 sma2
52
0.8374 -0.3823 -0.5056 -0.7709 -0.0389
s.e. 0.3674 0.1589 0.3766 0.1758 0.1633
d) After deciding on the best model, we checked the diagnostics of the model. Please interpret each
output one-by-one. If assumptions are not satisfied, suggest a method to solve the problem.
> shapiro.test(window(rstandard(fit),start=c(1996,1)))
Shapiro-Wilk normality test
53
W = 0.9374, p-value = 0.002623
> jarque.bera.test(rstandard(fit))
Jarque Bera Test
Box-Ljung test
data: rstandard(fit)
X-squared = 9.9355, df = 15, p-value = 0.8238
Box-Pierce test
data: rstandard(fit)
X-squared = 8.4557, df = 15, p-value = 0.9042
Squared residuals are obtained and ACF and PACF plots are obtained as follows. Why are looking at
the squared residuals? What these graphs tell us?
>rr=rstandard(fit)^2
54
d) We obtained the following forecasts and prediction limits. What can you say about these forecasts
and prediction limits? Are they plausible?
4. Consider the quarterly Turkish private consumption expenditure and current price gross domestic
product (GDP) series from 1990 to 2011.
a) Interpret the below plot. What could be the reason of the co-movement between these two series?
Explain.
55
b) What is the purpose of the Granger causality test? Explain the logic behind the test.
c) Analyze and interpret the results of Granger causality test applied to the consumption and GDP series
of Turkey. Do you think that the results are logical? Discuss.
> granger.test(cg, 3)
F-statistic p-value
GDP -> CONSUMPTION 2.331032 0.080678470
CONSUMPTION -> GDP 5.249620 0.002368946
QUESTIONS
Yt 6 0.8Yt 2 at 0.027at 3
where it is assumed that {at}’s are White Noise with zero mean and a2 variance.
c) (5 pts.) Write the stationarity and the invertibility condition of the process.
d) (5 pts.) Write the process in random shock form and find 1, 2, 3 and 4.
e) (5 pts.) Write the process in inverted form and find 1, 2, 3 and 4.
56
10. Consider the following processes
a) (5 pts.) Yt 0.8Yt 1 0.9Yt 2 0.7Yt 3 at
b) (5 pts.) Yt 0.4Yt 12 0.6Yt 24 at 0.5at 2
c) (5 pts) Yt 2 t Yt 1 at at 1
where at ~ WN 0, 2 .
Are the processes stationary? If not, obtain stationary processes applying an appropriate
technique.
11. (15 pts.) Examining and forecasting the global carbon emissions is very important to get
precautions before it is too late. Changing climate due to global warming and raise in carbon
emission affect all living creatures on earth. Source carbon data is published online
(November 21, 2010) in Nature Geoscience and released via GlobalCarbonProject.org. We
examine yearly global carbon emissions from 1959 to 2009.
a) What features of the data are evident in the time series plots and related statistics that should
be captured by a model? Is there other evidence that would be useful in formulating a
modeling strategy?
d) In the light of following test results, what are the necessary steps that we need to consider
next?
> pp.test(co2)
data: co2
Dickey-Fuller Z(alpha) = -10.6185, Truncation lag parameter = 3, p-value = 0.4778
> kpss.test(co2,c("Level"))
57
data: co2
KPSS Level = 2.5488, Truncation lag parameter = 1, p-value = 0.01
> kpss.test(co2,c("Trend"))
data: co2
KPSS Trend = 0.279, Truncation lag parameter = 1, p-value = 0.01
> kpss.test(diff(co2),c("Level"))
data: diff(co2)
KPSS Level = 0.0947, Truncation lag parameter = 1, p-value = 0.1
d) After having a stationary process, the time series plots and ACF/PACF plots of the series given
below. Interpret these plots. Identify an appropriate ARIMA(p,d,q) model for this series.
b) (15 pts.) Turkish monthly consumer prices index (CPI) is obtained from January 1995 to
April 2013.
58
b) Interpret the below outputs and discuss what should be the next step. Explain each output
separately.
HEGY statistics:
Stat. p-value
tpi_1 -0.273 0.100
tpi_2 -4.226 0.010
Fpi_3:4 8.162 0.010
Fpi_5:6 4.491 0.012
Fpi_7:8 3.203 0.043
Fpi_9:10 14.468 0.010
Fpi_11:12 1.144 0.100
Fpi_2:12 7.660 NA
Fpi_1:12 7.029 NA
HEGY statistics:
Stat. p-value
tpi_1 0.343 0.100
tpi_2 -3.004 0.010
Fpi_3:4 4.526 0.011
Fpi_5:6 2.787 0.068
Fpi_7:8 0.832 0.100
Fpi_9:10 6.168 0.010
Fpi_11:12 4.085 0.019
Fpi_2:12 3.129 NA
Fpi_1:12 2.906 NA
59
>HEGY.test(diff(cpi,12), itsd=c(0,0,0),regvar=0, selectlags=list(mode="aic",
Pmax=12))
HEGY statistics:
Stat. p-value
tpi_1 -0.413 0.10
tpi_2 -6.206 0.01
Fpi_3:4 17.810 0.01
Fpi_5:6 16.373 0.01
Fpi_7:8 24.747 0.01
Fpi_9:10 35.998 0.01
Fpi_11:12 21.145 0.01
Fpi_2:12 40.590 NA
Fpi_1:12 37.223 NA
> HEGY.test(diff(diff(cpi,12)), itsd=c(0,0,0),regvar=0,
selectlags=list(mode="aic", Pmax=12))
HEGY statistics:
Stat. p-value
tpi_1 -3.446 0.01
tpi_2 -10.117 0.01
Fpi_3:4 49.689 0.01
Fpi_5:6 35.239 0.01
Fpi_7:8 37.765 0.01
Fpi_9:10 43.328 0.01
Fpi_11:12 32.261 0.01
Fpi_2:12 30.757 NA
Fpi_1:12 28.998 NA
c) As a result of the operation that you suggest in part b), the series Zt is obtained and the time
series plot, correlogram and sample PACF plot are given below.
i) Interpret the time series plot and discuss the existence of any strange behavior.
ii) Which ARIMA(p,d,q) or SARIMA(p,d,q)(P,D,Q)s process(es) is(are) suitable for this
series? Why?
60
c) (15 pts.) Suggest appropriate processes for the below plots and explain the reasons.
i)
ii)
61
iii)
3. From a series of length 200, the sample mean of the process is obtained as 25. The sample
autocorrelations are calculated as r1 = 0.76, r2 = 0.65, r3 = 0.14, and a sample variance of 4. If we
assume that an AR(2) model with a constant term is appropriate,
c) Obtain the Yule-Walker estimates of the constant term, parameters 1, 2 and the error variance
2?
d) Using the estimates of the parameters that you found in part a), obtain the two-step ahead forecasts
with the 95% prediction intervals assuming that Y199=27 and Y200=24.
4. Consider the process 1 0.7 B 1 0.3B 4 Yt 12 1 0.4 B at where at ’s are WN with
mean 0 and a2 1 .
62
e) If the forecast origin 90, given is
Y85 9 ,Y86 20, Y87 8,Y88 15, Y89 10, a 90 0.5 and a89 0.3 , find the first four-
step ahead minimum MSE forecasts for Y91 ,Y92 , Y93 and Y94 , Ŷ90 , 1,2 ,3,4 .
f) Calculate the forecast error variances for e90 , 1,2 ,3,4 and construct 95%
prediction intervals for Y91 ,Y92 , Y93 and Y94 .
g) Suppose that the observation at t = 91 turns out to be Y91 16 . Find the updated
forecast for Y92 .
5. Consider two-dimensional vector AR(1) processes
63
3. Monthly median sales for new houses sold data are obtained from U.S. Department of Commerce:
Census Bureau between January 1963 and April 2013. The last four months’ data which will be used to
check the forecast accuracy were not used in the analysis.
f) Interpret the following outputs separately and discuss about the next step to follow.
> HEGY.test(wts =sales, itsd= c(1, 0,c(0)), regvar = 0)
HEGY statistics:
Stat. p-value
tpi_1 0.873 0.10
tpi_2 -5.535 0.01
Fpi_3:4 71.608 0.01
Fpi_5:6 52.256 0.01
Fpi_7:8 66.231 0.01
Fpi_9:10 65.960 0.01
Fpi_11:12 34.894 0.01
Fpi_2:12 71.147 NA
Fpi_1:12 65.527 NA
HEGY statistics:
Stat. p-value
tpi_1 -2.542 0.01
tpi_2 -5.501 0.01
Fpi_3:4 52.080 0.01
Fpi_5:6 37.911 0.01
Fpi_7:8 37.033 0.01
Fpi_9:10 55.274 0.01
Fpi_11:12 25.054 0.01
Fpi_2:12 54.296 NA
Fpi_1:12 50.623 NA
64
> HEGY.test(wts =diff(diff(sales,12)), itsd = c(0, 0,c(0)), regvar = 0)
HEGY statistics:
Stat. p-value
tpi_1 -6.698 0.01
tpi_2 -6.404 0.01
Fpi_3:4 83.780 0.01
Fpi_5:6 61.907 0.01
Fpi_7:8 55.682 0.01
Fpi_9:10 82.737 0.01
Fpi_11:12 81.847 0.01
Fpi_2:12 50.644 NA
Fpi_1:12 49.703 NA
65
c) You can see six different fitted models for the series below. Interpret the results and choose one model
as the best one for this time series. Explain the reasoning.
Coefficients:
ar1 ar2 ar3 ma1 sar1 sar2
-0.2212 -0.0190 -0.0658 -0.4019 -0.0471 0.1448
s.e. 0.1539 0.1015 0.0536 0.1509 0.0434 0.0435
Coefficients:
ar1 ma1 sma1
-0.1243 -0.5415 -0.9691
s.e. 0.0631 0.0521 0.0271
Coefficients:
ar1 ma1 ma2 sar1 sar2 sma1
-0.8483 0.2476 -0.4076 -0.0498 0.1402 -0.0118
s.e. 0.1043 0.1221 0.0791 0.3072 0.0467 0.3070
Coefficients:
ar1 ar2 ar3 ar4 ar5 ma1 sar1 sar2
-1.1404 -0.6582 -0.3967 -0.2722 -0.1706 0.4854 0.6797 0.1488
s.e. 0.1915 0.1340 0.0861 0.0683 0.0411 0.1932 0.1358 0.0558
sar3 sma1 sma2
-0.1688 -1.7374 0.7629
s.e. 0.0533 0.1297 0.1236
Coefficients:
66
ar1 ma1 sar1 sar2
-0.1800 -0.4477 -0.0479 0.1430
s.e. 0.0696 0.0648 0.0434 0.0435
d) Below are the diagnostics of fit6. Please interpret each output one-by-one. If assumptions are not
satisfied, suggest a method to solve the problem.
> jarque.bera.test(rstandard(fit))
data: rstandard(fit)
X-squared = 402.8255, df = 2, p-value < 2.2e-16
Box-Ljung test
data: rstandard(fit)
X-squared = 33.7756, df = 15, p-value = 0.003661
Squared residuals are obtained and then ACF and PACF plots are drawn as follows. Why are we
looking at the squared residuals? What do these graphs tell us?
>rr=rstandard(fit)^2
67
d) We obtained the following forecasts and prediction limits for fit6 and fit5, respectively. What can
you say about these forecasts and prediction limits? Are they plausible?
68
e) The original median new house sale prices, forecasts of fit5 and fit6 are given below. Use one of the
forecast accuracy measures and decide which fit is better than the other using this measure.
FORECASTS
Original Fit6 Fit5
Jan-13 251500 252853.9 254471.3
Feb-13 266300 250047.2 256708
Mar-13 250700 249942.7 253329
Apr-13 271600 250736.1 257136.6
5. Annual Carbon Dioxide Emissions (million tons carbon dioxide), coal consumption (million tons
oil equivalent) and primary energy consumption (million tons oil equivalent) of Turkey are
obtained by BP Statistical Review of World Energy June 2012 from 1965 to 2011.
69
400
300
200
100
Year
Note: Upper line is the graph of emission, middle one is for the general energy consumption and
the bottom one is for the coal consumption.
b) Explain the following test results. What is the next step to follow? Explain.
Cointegration Rank Test Using Trace
5%
H0: H1: Critical Drift Drift in
Rank=r Rank>r Eigenvalue Trace Value in ECM Process
70
c) Analyze and interpret the results of Granger causality test applied to the general energy consumption,
coal consumption and carbon dioxide emission series of Turkey. Do you think that the results are logical?
Discuss.
Granger-Causality Wald Test
1 1 0.15 0.7019
2 1 1.26 0.2610
3 1 5.36 0.0207
71