Logit Probit and Tobit Models For Catego PDF
Logit Probit and Tobit Models For Catego PDF
Logit Probit and Tobit Models For Catego PDF
By Rajulton Fernando
Presented at
PLCS/RDC Statistics and Data Series at Western
March 23,
23 2011
Introduction
• In social science research
research, categorical data are often
collected through surveys.
– Categorical
g Î Nominal and Ordinal variables
– They take only a few values that do NOT have a metric.
• A)) Binary
y Case
• Many dependent variables of interest take only two
values (a dichotomous variable), denoting an event or
non-event and coded as 1 and 0 respectively. Some
examples:
– The labor force status of a person.
– Voting behavior of a person (in favor of a new policy).
– Whether a person got married or divorced.
– Whether a person involved in criminal behaviour, etc.
Introduction
• With such variables
variables, we can build models that
describe the response probabilities, say P(yi = 1), of
the dependent
p variable yi.
– For a sample of N independently and identically distributed
observations i = 1, ... ,N and a (K+1)-dimensional vector x′i
off explanatory
l t variables,
i bl theth probability
b bilit that
th t y takes
t k value
l
1 is modeled as
P ( yi = 1| xi ) = F ( xi′ β ) = F ( zi )
where β is a (K + 1)-dimensional column vector of
parameters.
• The transformation function F is crucial. It maps the
linear combination into [0,1] and satisfies in general
F(−∞) = 0,
0 F(+∞) = 1,
1 and δF(z)/δz > 0 [that is is, it is a
cumulative distribution function].
The Logit and Probit Models
• When the transformation function F is the logistic
function, the response probabilities are given by
e xi β
′
P ( y i = 1 | xi ) =
1 + e xi β
′
P ( y i = 1 | x i ) = Φ ( x i′ β ) = ∫ Φ ( s ) ds
2 = ∫ 2π
e ds
−∞ −∞
Descriptive Statistics
1 6.491 8 .592
Model -2 Log
Likelihood Chi-Square df Sig.
Parameter Estimates
[married=1] 0a . . 0 . . .
Tobit regression cannot be done in SPSS. Use Stata. Here are the Stata commands.
First, fit simple OLS Regression of the variable lwf (just to check)
------------------------------------------------------------------------------
lwf | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
age | .0363624 .003862 9.42 0.000 .0287885 .0439362
married | .3188214 .0690834 4.62 0.000 .1833381 .4543046
children | .3305009 .0213143 15.51 0.000 .2887004 .3723015
education | .0843345 .0102295 8.24 0.000 .0642729 .1043961
_cons | -1.077738 .1703218 -6.33 0.000 -1.411765 -.7437105
------------------------------------------------------------------------------
------------------------------------------------------------------------------
lwf | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
age | .052157 .0057457 9.08 0.000 .0408888 .0634252
married | .4841801 .1035188 4.68 0.000 .2811639 .6871964
children | .4860021 .0317054 15.33 0.000 .4238229 .5481812
education | .1149492 .0150913 7.62 0.000 .0853529 .1445454
_cons | -2.807696 .2632565 -10.67 0.000 -3.323982 -2.291409
-------------+----------------------------------------------------------------
/sigma | 1.872811 .040014 1.794337 1.951285
------------------------------------------------------------------------------
Obs. summary: 657 left-censored observations at lwf<=0
1343 uncensored observations
0 right-censored observations