Econometric Model With Qualitative Variables - 2
Econometric Model With Qualitative Variables - 2
Econometric Model With Qualitative Variables - 2
How to quantify qualitative variables to quantitative variables ? Why do we need to do this ? Econometric model needs quantitative variables to estimate its parameters
What are the differences among these variables: Dummy? Indicator? Binary? Dichotomy? Categorical
Other Usages:
How to model Unstable Regression? - Jumping Regression - Shifting Regression
Technically speaking, do we have problems with our model if: - Independent variable (s) is (are) a dummy (ies) - Dependent variables is a dummy
Illustration:
We would like to analyze whether there are differences between graduate and undergraduate students in weekly entertainment spending. Y: weekly spending for entertainment per student PS: graduate or undergraduate PS = 1 ; graduate student PS = 0 ; undergraduate student Model: Y = + PS + u From the model, an average spending: Graduate student: E (Y PS = 1) = + Undergraduate student: E (Y PS = 0) =
For example, by using data from a survey, the estimated model is the following: Y = 9,4 + 16 PS t (53,22) (6,245) R2 = 96,54% The model indicates that 0 dan 0 (statistically signifiant) Interpretation: average spending for graduate students: 9,4 + 16 = 25,4, average spending for under graduate students: 9,4 (There is a difference between spending of the two groups) The next question is whether graduate students more able or more consumptive in entertainment spending than undergraduate students
A model that can relate X and G to Y: Y = 1 + 2 G + X + u From the model, it can be seen that: Average salary of female professor = 1 + X Average salary of male professor = 1 + 2 + X
Since we define dummy variable differently, will we have different result substantively? Model with new definition:
Y = 1 + 2 S + X + u
Remark
In defining dummy variable, which category is representing by one or zero does not matter as long as the estimated model is interpreted consistently.
Y = 1 + 2 D 2 + 3 D 3 + X + u
When we estimate this model with OLS, what will happened ?
Can we represent these types of variables with a Variable that has different values like: 1, 2, and 3 based on the number of categories? Should we define differently? Try define as follows: D2 = 1 ; if the highest level of education is high school 0 ; others D3 = 1 ; if the highest level of education is university 0 ; others
See the following model: Y = 1 + 2 D2 + 3 D3 + X + u life insurance expenses per year income per year 1 ; high school degree 0 ; others D3 = 1 ; college degree (S1) 0 ; others Average spending based on education: less than high school : 1 + X (base category) high school : 1 + 2 + X university/college (S1): 1 + 3 + X Notes: Reference group is less than high school. Why? How do we choose a base category? Y = X = D2 =
Salary = f (experience, sex, what faculty) Y = 1 + 2 D2 + 3 D3 + X + u Y = salary / year X = years of teaching D2 = 1 ; male professor 0 ; female professor D3 = 1 ; professor in Faculty of Economics 0 ; others Average salary of a female professor outside FE: 1 + X Average salary of a male professor outside FE: 1 + 2 + X Average salary of a female professor inside FE: 1 + 3 + X Average salary of a male professor inside FE: 1 + 2 + 3 + X
Comparing 2 regressions
Saving (Y) = 1 + 2 Income (X) + u The above model indicates that saving and income do not behave differently across sample and time. However, in reality, there is a possibility that the model behaves differently before and after a certain event. Let say, behavior of saving is different between prior and post an economic crisis. How to accommodate this changing in saving behavior? The following model can be used in accommodating a change. Period I, before crisis: Yi = 1 + 2 Xi + ui ; i = 1,2, , n Period II, after crisis: Yi = 1 + 2 Xi + i ; i = n+1, n+2, , N
Possibilities in comparing those two models: Case 1: 1 = 1 and Case 2: 1 1 and Case 3: 1 = 1 and Case 4: 1 1 and 2 = 2 2 = 2 2 2 2 2
Case 1 : both models are the same, no shift Case 4 : both models are different and there is a shift
Comparing 2 regression with dummy variables Yi = 1 + 2 Di + 1 Xi + 2 Di Xi + ui Di = 1 ; observation from period 1 0 ; observation from period 2 Based on this representation, average saving period: I : Yi = (1 + 2) + (1 + 2) Xi II : Yi = 1 + 1 Xi (Y) in
Rules:
1. Commission is proportional with sales 2. Bonus is given for an agent that over a target, X*.
Y: Bonus X: size of sales achieved by an agent X* : sales target Define a dummy, D = 1 ; if X > X* 0 ; if X X*
The commission can be modeled as follows: Commission = 1 + 1 X ; for X < X* Commission = 1 + 1 X + 2(X-X*) ; for X > X*