Ecometric
Reviews
EDITOR
DALE J. POIRIER
Department of Econo mics
University of Toronto
150 St. George Street
Toronto, Canado M5S lAI
ASS@IATE EDITORS
.
ALBERTO HOLLY
U"luersit€de Lausanne
Ecole des Hautes Eudes Commerciales
1015
l^ousanner;f::tr
CHENG HSIAO
of Toronto
Instirute for Policy Analysis
University
150 St. George Street
Toronto, Ontario, MsS
IAI
ROGER KOENKER
Department of Economics
University of lllinois
Champaign, Illinois
U.S.A.
6
I 80
I
TAKAMITSU SAWA
Kyoto Universiry
Institute of Economic Research
fukyo-Ku, Kyoto 606,
fapan
Canada
PETER SCHMIDT
Michigan Stat e University
Department of Eonomics
Marshall Hall
East Lansing, Mich$an 48824
U.S,A.
ECONOMETRrc REVIEWS
Volunc
l,Nubcr I, l9ts
CONTENTS
!.
reaarrr of -Bgnlnctricr and-1':P'Tl:.11*'^;;-:'
von zur Muehlen
K. Conwsy,
{i v. i.$loern/, R.
$i;;-, o. thc Forndrtionr of Econometrics - Arc Thete
$
_
5t
Doraltd
iii
or rbc Foundrttonr of Economctricr
-
Aro Thcre
on thc Ponndrtlonr of Econonctricr
-
Arc Thetc
'lmpfopcr" Prlorr, rnd Finltc
69
75
Addittvity
on thc Poundrtl,on of Econometricr
-
El
Are Therc
93
jtt, t. Smlhy
:fcply lo Conrncntr on thc Foundrtirons of Econometricr
IttfrtAny?
l'"'.IrI
A.
Y.
B,
Are
l0l
Swomy, R. K. Conway, ond P. von zur Muehlen
j.:
||tb
-
.
rod Llmltrtlonr of Prncl
t2t
Drtr . .
i-ttbo
r Gomot
Dttr . .
t7s
Limitrtiorrc of Prncl
Dttr . .
t79
of Prnel
Drtr . .
183
on Eenefitr rnd Limitrtions of Penel
i" O. lrtuoton and N. M. Kiefer
I
I'
;i0mncrt ol lcrefitr ud
\gtrt*rn
! Srrot on Dcocfitr rnd Limitrtionr
7 tr. Sobn
'il.pty
to Gomrncnts on Bencfitc rnd Limitrtiong of Prncl
Detr
.
i.C, Iltbo
I
IARCEL DEKKER,INC. New York end Bqcc!
,,
chrrgc
187
ECoNOMETRTC REVTEWS,
4(I), !-6L
(1985)
TITE FOUNDATIONS OF ECONCI'IETRICS--ARX THERE ANY?
P.A.V.B.
Roger
Swany
Special Studles
K.
Conway
Bureau of Economic Analysis
Department of Conmerce
Washington, DC 20030
Federal Reserve Board
Washington, DC 20551
P. von zur
Muehlen
Special StudLes
Federal Reserve Board
I'lashington, DC 20551
of eeonornetrLes; the h,to-ttalued
Logie; the problen of induetion; inetnunentalien; Loeal eoheneneel
ila.7y-uaLued l,ogie; eoidential interpretation.
Keg llords arld Phr.ases: foundations
ASSTRACT
of a 1ogically conslstent economic theory
strictly adheres to Aristotlers axioms of logic are factually
true if its sufficient conditlons are all factually true. Alternatively, if a conclusion of such a theory is false, then at least
one of its assumptlons is false. Unfortunately, the factual truth
of sufficient conditlons carnot be established because the probleu
of induction is impossible to solve. It is alpo true that the
falsity of a conclueion cannot be established in the presence of
uncertainty. Whlle the philosophy of lnstrumentalisn applied to
The concluslons
which
Copyright @ 1985 by Marccl Dekkcr, Inc.
07 47
4938 | 8s I 0401 -000 I $3.50/0
SWAMY, CONWAY, AND VON ZUR MUEHLEN
sufficient and logically consistent explanations may provide useful solutlons to immediate practical problems' the principles
of simpliclty, parsinony and profligacy--al1 of them requiring
conditional deductive arguments--are useless as crlEerla for
rnodel choice.
Furthermore, Arlstotlers axioms can give rise to difficulties.
For example, as G6delrs lnconpleteness Eheorem shows, if our
supposed axioos for a Eheory do not permit any contradictions,
then the theory ls incomplete in the sense that Ehere are proposiEions that are undecLdable relatlve to those axioms' If we adhere
gq Popperts view that sclentiflc actlvity is inpossible wlthout
the adoption of Aristotlerg axion of noncontradiction, then we
nay achleve 1oca1 coherence (i.e., consistency within a glven
model) by adopting the axiom and conblning a prior distribuEion
satisfying certain conditions with the lnformation on a sufficient
and logically consistent model via the Bayes theorem. Intultlonlsts are correct ln denying the unlversal validity of lndirect
proofs and thus precluding the use of Aristotlers axLon of the
excluded-rniddle and we have denonstrated that roany-valued loglc
applies whenever this axlorn is not used. Probabllity theory ls a
version of many-val"ued loglc and one can provide strong or weak
(but inconcl_uslve) evidence for one statistical hypothesis against
another using thls theory. The actual predlctive perforrnance is
a good crlterion for Judging dlfferent approaches to lnference'
CONTENTS
1.
Introductlon
z.
The
Merit of Two-Valued or Deductive Logic - Sone First Principles
2.7 Nature of loglc
2.2 Aristotlets axloms of loglc
2,3 Loglcally valid arguments
2.4 Modus ponens or nodus tollens
2.5 Exanples of the vlolations of Aristotlers axioms
FOUNDATIONS OF ECONOMETRICS
3. Sufflcient and Logically Conslstent Models
3.1 The necessity of logical consLstency and an exanple
3.2 Alternatlve theories
4. Inductlvlsm and Approxlmations
4.L The problen of induction
4.2 The retreat Eo approximations
4.3 Attempts to solve the probl-ern of lnduction
5.
Popperrs Falsificatlonlsm and Related Problems
5.1 Asynnetry
5.2 Loglcal anbigulty regarding refutatlons
5.3 The problem caused by uncertalnty
6. ObJective Knowledge, Instrumentalism, Simplicity and Parsimony
or ProfLigacy
6.1 Bolandrs (1982, p. 179) view
6.2 Instrumentallsm
6.3 Slnpllclty
6.4 Principle of parslmony vs. pri.nclple of profllgacy
7. Restrlctive Nature of Arlstotlers
Axiorns and methods of
Relaxlng Then
7.t G6delts chali.enge to the axiom of noncontradlctlon
7.2 Local vs. globa1 coherence
7.3 Locally coherent inference procedures
8.
Intultionistsr crittclsm of the axion of
7.4
Brouwerrs and
7.5
Many vaLued
7.6
excluded-mlddle
Birnbaumrs (1977)
the excluded-nlddle
or ftzzy logic-relaxlng the
axiom
of
the
evidential lnterpretation vs. the
llkellhood prlnclple or Bayes procedures
Conclusions
Appendlx: InterpreLations of Probability
1. Importance of probabLllty lnterpretations
2. Alternative meanings of the word "probabt1lty"
3. Dl-fferences and sinl-larltles of results based on dlfferent
vlews
4. CondLtlong for the relevance of frequency lnterpretation
5. Conditlons for the relevance of subJectlve lnterpretatlons
6. On frequentlst vs. posterl-or probablllty
References
SWAMY, CONWAY, AND VON ZUR MUEHLEN
1. Introduction
Eltle asks such a questlon is that few
econometrlc textbooks clarlfy or even conslder the foundatl-ons of
econometrics.l l{ence, our purpose in thls paper 1s to artlculate
these foundatlons. By 'foundations" we mean the econometrtclansr
vlew of the relationship between r'he economic theorles or asaumptions upon which they base thelr models and the statistical methods
Ehey use to reach concluslons about the nature of the real world'
I{hat is usually discussed under the rubric of economic methodology
is nore concerned with the nonstochastlc cases than wlth lhe
stochastlc cases lBlaug (1980) and Boland (1982)]' In contrast'
our vLew ls that a proper study of econometric foundatlons should
be concerned strictly nlth the foundatlons a8 manlfested ln the
theorLes.
nature of stochastic ry!3!9
In this paper' we argue that Ehe accepted views of the approprtate rnethodologles for enpirlcal lnvestigation of neoclasslcal
econonlc theorl.es are lnadequate to clarlfy the foundatione of
The reason why our
cannot understand econometric
nethodology \tithout first understandlng economic theory; thus,
in Sections 2 chrough 4 we dlscuss the foundatl'ons of economics,
which are anchored on two dlfferent but not necessarlly mutually
economeErlcs.
We presume one
exclusive nethodological topics. The flrst toplc is the
Aristotellan logtcal system conelsting of only true or false
staEements and the other ls the "problem of induction,'both of
whlch are Ehe subJect of critical examination by philosophers of
science. Some examples of vlolatlons of Arlstotlers axLous of
logic will also be glven ln the6e sections to show thaL econometric models wLth all givens incorporated ln thelr "unlverse of
discourse" do not fa11 under the category of Arlstotellan systems.
A consLstent set of distrlbutional and behavloral assumptl-ons
should be added to nake such nodels logical'Ly valld. The loglcal
conslstency and valldicy of a nodel is a necessary conditlon for
-T-.ffi .-c"ptron is Zellner's
ln econonlcs.
(1971
,
Ch.
1) renarks on inference
FOUNDATIONS
OF
ECONOMETRICS
iis truth. Related questions about Popperts falsificatlonism
and
Friednanrs views based on Lnstrumentallsm wii-1 be examined in
Sections 5 and 6, respectively. It will be shown that these
philosophles applted to Arlstotelian systems have the sort of
logical Justificatlon which phllosophers generally insist on, but
ihey do not have a slmllar Justlfication when applied to econometric models. Section 6 also contains a discusslon about the non-
logical principles of slnpllcity,
parsl-mony and
profligacy slnce
they are frequently advocated as approprlate criteria for economet-
ric
models.
The restrictive nature of Arlstotlets axioms will be discussed
in Section 7. Specifically, it will be shown in this section that
not a1l- of Aristotlers axioms can be satl.sfied ln all econometric
applications and failure to satLsfy all of Aristotlets axLoms
renders the argument in favor of true conclusions inadequate.
Fortunately, in the present century there have been several
investlgatlons into an alternative logical system known as "nanyvalued 1ogic." The axioms of nany-valued logic, though weaker
than those of Aristotle, can be shown to form a valid foundation
for econometrics if a system of rules of inference can be established which will insure that what is provable is exactly what is
valid; however, it should be noEed that there is no unique way of
extending the notion of validity to the many-valued case. Fina11y,
in Sectlon 8 we offer our sunnary and conclusions.
2.
The
Merit of Tvo-Valued or Deductive LoglcSome First Principles
2.1 Nature of loglc
Loglc deals Itith the correctness of arguments. In logic, an
argument is a group of indlvidual statements standing ln relatLon
to each other. One of theo ls the conclusl-on and the relnalning
statements are assunptLons or premises. The latter present the
evidence for the former. Obvlously, the truth of any statenents
SWAMY, CONWAY, AND VON ZUR MUEHLEN
not guarantee that a conclusLon ls true' Such statements
must also have gome bearlng upon that concluslon' It ls this
is
connection beEween premises and conclusions wlth which loglc
or
conconcerned and not wlth the truth or falsity of premises
clusions by themselves lsee Salmon (1973, pP' 3-4)]' Thus'
1ogic, as a dlscipline, is preoccupied ltlth establlehing the
valiclity of arguments. Truth or falsity is a property of
lndividual statements and not of arguments' 2
does
2.2 Aristotlets axl-ons of logic
Aristotle poslted three rules that are only ry939a cooditions for the adnissibility of statements into logical arguments'
"the axion
These "rulegr" "axioms" or "canons" of logic are: (1)
ofidentity.':differentstatementscannotusedifferentdefinitionsofthesamewordsl(2)'.theaxlomoftheexc]-uded.niddle'.:
statements that are always neither true nor false nor both are
prohibited;and(3)'.theaxiomofnoncontradiction..:stateDents
cannot be allowed to be both true and false'
Ite mean "factual truth"' In addition' there is
-z'J_i-Gall
anothei type of truth known as "logicai' truth" (or truth lnfollows
truth
whose
sEatements
are
truths
Logical
clrcurnstances).
frorn the deflnltlons of the words that occur ln them' Such
statements are called analytlc Etatements or tautolologies [see
A
r.rccawley (198r, p. 7o)ffi.-130)1.
1,o"..?91i!9!v
statenent is one whose "1ogica1 falsity" follows fron the Treanings
are stateoenEs
i?EET.as rhat occur in lt. By contrast, there
whose truth or falsity is not determined solely by rhe meanings
ofthewordstheycontaln.Theseareknownas..syntheticstate.
ments." Synthetic statenents are not logical truths or falsehoods;
they are iactual statenentsr,each of whlch is either factually
ana1ytic statement is necessarily crue'
trrrl o, f;G;ffi;-an
it wl1l be true regardless of what cl-rcumstances hold' Its truth
does not exclude any inaglnable poesibillties' Loglcal truth is
always much stronger than factual truth because no empLrical
obeeivation could ever refute the former, but it night refute the
latter. We sha11 show Ln thl-s sectlon that no synthettc conclueLons can be valtdly deduced frorn analytic premises alone lsee
Salnon (1973, p. 132)1. Further, we wLll u8e the convention of
letiing .'true" or "faise" be "factually true" or "factually fa1se"
throughout the PaPer.
FOUNDATIONS OF ECONOMETRICS
2.3 Iogicblly valid arguments
In Arl-stotellan 1ogic, an argument which does not violate
any of Arlstotlers axioms is loglcal whenever lt is a sufficient
argument in favor of its conclusions in the following sense: If
an argument is logicai-, then whenever 4 of its assumptlons are
true, all of its conclusions will be true as we11.
Having established the logical sufflciency of a formal argument, it can be used as part of a larger enplrical argument that
agrees wlth any particular conclusion of the formal argument. In
this sense, logical validity is a necessary (but not sufficient)
condltion for the truth of the concluslon of an enpirl-cal argument.
2.4
or modus tollens
The mere sufficiency of the assumptions of a logicaL argument
shows that whenever all the assumptLons are true then every
conclusion which logica11y follows from their conJunctLon must be
true or, equivalenti.y, whenever any concluslon turns out to be
false then the conJunction of all of Lts assumptions cannot be
true. Using a loglcal argument in favor of the truth of any of
Lts conclusions by arguing fron the truth of its assunptione Ls
said to be using the argument in modus ponens. Alternatlvely'
usLng a logical argument against the truth of an assumptlon by
arguing from the falslty of a concluslon is cal1ed nodus to11ens.
Put differently, any use of modus ponens is ca11ed "affirming
is called "denying
the antecedent" and any use of glglglfg
the consequent." It ls a logicaL fallacy to "affirn the coneequent"
(reverse modus ponene) or to "deny the antecedent" (rylg
tollene). A further discussion of these ldeas is given ln SaLmon
Modus ponens
(1973), Boland (1979), and Blaug (1980).
A loglcal argument Ln modus ponens or modus tollens cannot
be developed if Aristotlers axloms of logic are lgnored. For
example, if we actually arrive at an erroneous conclusion, at
least one of our assumptions ls guaranteed to be false, provlded
any of our statements which Ls not true l-s false, i.e., all of
SWAI'IY, CONWAY, AND VON ZUR MUEHLEN
satisfy Arl-stotlers axioms of the excluded-nlddle
d
c trrrdiction. In this sense, Aristotellan logle can be
mltbi e tw*alued logic in which statex0ents are assigned only
c orf the tsso values: "true" or "fa1se".
rDur s6.GsrrEEs
2-t Era+les of the violations of Aristotlers axiorns
Unfortunately, Aristotlers logical system is of linited use
in econorretrics because econonetric practices in general cannot
strictly adhere to Arlstotlers axloms. The following lllustrations
of the violations of these axioras form a representative sample of
size four drawn from the econorneEric literature.3
Exanple 1. Approxiuate demand systems. Theil (1975) has
shown that logarithndc differentials can be used to investigate
the general features of conplete sets of demand functions.
Goldberger (1967) has pointed out a robustness property of this
approach by arguing that a "differential formrlation quite possibly
provides an adequale approxination to utility-maxirnizing behavior
over a range of conceivably true utllity functLons; this without
being exactly appropriate for any particular one" (italtcs added).
In the same vein, Barnett (1983) has derived a demand sysEem from
a new Laurent series approximation to Ehe reclprocal of an indirect
utllity function on the contention that the Laurent series expansion provLdes a better-behaved remaioder term than the Taylor
series used by Thei1. C1-ear1y, both theories vlolate ArLstotlers
axiom of the excluded=niddle: the approxirnately true demand
systems of Theil and Barnett are neither absolutely true nor
absolutely fa1se.
Exanple 2. Causality tests. Typically, econometric modelbutldlng posits a set of autonomous conjectures as to basic
behavioral relationships, including an indicatlon of the relevant
varlables and a partitioning of those variables lnto endogenous
-T--Th-t purpose of presenting these four examples is not to accuse
econometrLcians of violating Arlstotlets axioms but to point out
that many-valued logic (to be discussed later in Section 7) is
needed Eo Justlfy econometrl-c practice.
FOUNDATIONS OF ECONOMETRICS
and exogenous sets. This structural modellng approach has been
recently chall.enged by an alcernative nethodology utillzing vector
autoregressive (VAR) nodeling Ln conjunction with "exogenelty"
tesEs based on Grangerrs definitions of causality Isee Sins
(1980)1. I{owever, the implied statistlcal test results for
"exogeneity" can hold only wlth probabtlity less than one even
in sufficiently large sanples Isee Sargent (1979) and Conway,
Swamy, Yanagida, and von zur Muehlen (1984)1. Thls alternative
nethodology by naking the exogeneity assunption dependent on the
outcoxle of an inconslstent statistlcal test permtts assumptions
that are neither absolutely false nor absolutely true and therefore, vlolates the axlom of the excluded-niddle Isee Boland
(1982, p. l24)1.
Example 3. Ratlonal expectations. In 1961, Muth (1961)
posited an equilibrium nodel in which agents forrned expectations
rationally. AB stated by hin, "expectations of firrns (or, more
generally, the subJective probabllity distribution of outcomes)
tend Eo be distributed, for Ehe same information set, about the
predictlon of the theory (or the robJectLver probability di6tri--
bution of outcomes)" [Muth (1961, p. 316)]. Lucas (1976, p. 27),
among others, adopted thLs concept by posltlng that expectations
of individuals are "ratLonal" if a subjective distributLon on
whlch decisions are based is equal to a true (objective) probability distribution of Ehe event(s) under conslderation. Such
a Juxtaposltlon is lnapproprlate if, as SItaEy, Barth, and Tinsley
(1982) have shown, "subJective probability" is based on the
approach of subJectlvisEic BayesLans and "obJective probability"
ls based on long-term frequency considerations, because these
probabillty concepts are not conpatible \rithln the same definition of "probabi1lty.'4 Consequently, rational- expectatlons
arguments that are based on these two competing definltlone
vlolate Arlstotletg
axiom
of identity.
ffiore
conprehensive discussion of subJective probability
and frequentist probabiltty ts gl-ven in the Appendix t.o the paper.
SWAMY, CONWAY, AND VON ZUR MUEHLEN
4. Specificatlon of rational expectations models.
can be seen from Sargent (1976) and Wal1ls (1980) that ratlonal
expectations models include Ehe following sets of equatLons:
(i) A set of structural equations specifled as of some time,
t, where sone rlght-hand side variables represent anticipatlons
formed by lndividuals about unobservable values of speciflc
variables in period t or Lacer.
(1i) A sec of corresponding reduced forxn equations.
(lil) A set of equations statlng that Ehe unobserved
antlcipatlons or expectations of indlvlduals are equal to the
condltional neans of the daLa-generatlng processes described by
the structural equatlons Ln set (i) given the data aval1ab1e when
Example
It
the expectatLons were formed.
(1v) A set of time series models representing the processes
that generated the exogenous varlables.
The argument presented in Swany, Barth, and Tlnsley (1982'
pp. 139-141) and Conway, Swamy, Yanaglda, and von zur Muehlen
(1984) shows that the conJunction of the sets (f)-(iv) of equations can violate Aristotlers axlom of noncontradl-ction because
the identlfylng restrictions inposed on the structural equations
may conlradict the necessary and sufficient conditlons under
which the corresponding reduced form equatloos and the specified
tLme serles models exist. For addltlonal examples of econometric
pracEice possibly involvlng concradictlons, see Swany (1980),
Swamy and Mehta (1983a), and Swamy, von zur Huehlen, Tinsley,
and Farr (I983).
3. Sufflclent
3.1
and
Logically CoosLstent l{odels
of loglcal consietency and an eraTle
It follows from the scheoe of Aristotellan loglc presented
earlLer ln Sectlon 2 that the theoretlcal knorledge on shich a
uodelts speciflcation l-s based uuet b€ loglcally consistent if lt
The necesslty
FOUNDATIONS
OF
ECONOMETRICS
11
is to provlde a "true" explanation of anything. Although the
logical consistency of a given expl-anatlon does not necessarily
i.nply lts truth, it is a necessary prior condition to any explanation of the reil world. Boland (L982, p. 24) expresses the same
view by argulng eloquently that "the only objective and nonarbitrary test to be applled to theories or models ls that of
1ogica1, consistency and validity. Even if we caonot prove a
theory or model is true, at the very mlni-mum to be true Lt must
be 1oglca11y consistent." If the assunptions underlying a nodel
contradict each other and/or violate Aristotlers axlom of identity,
as ln Exarnples 4 and 3, then our knowledge leading to these
assunptions is not 1ogica1Ly consistent and cannot provide the
true distribuElons of economic variables. For this reason, we
should specify a nodel without violatlng Arlstotlers axioms of
identity and noncontradlctLon as far as possible.
Exanple 5. A suffl-clent and logically consistent mode1.
We now consider an exarnple of a sufficient and 1ogical1y consistent
mode1 from orthodox statistics that conforng wlth Aristotlers
axiomg. In what fo11ows, we utllize an g1g!9.5g!9,9. calculus of
probablltty baeed on Kolmogorovre axlons [Rao (1973, 80-87)1.5
Suppose that x1 ls a k-dimensional. random vector having the
decomposltlon
--m=ta;Aard
interpretations of probabtllty violate the
axion of the excluded-middle lsee Appendix to the paper]. Consequently, to avold this vlolation, we have chosen here to work
wlth an unlnterpreted calculus of probabillty dedtrced fron
Kolmogorovrs axions of probability. Howeverr we should point out
that not every statl-stician would llke to work with Kolnogorovts
axlons wl-thout a modificatlon. For example, L. J. Savage and
de Finetti (L974) replaced Kolmogorovre countable addttlvity
conditlon by a finite additivity conditi-on. Jeffreys (1961) and
Cox (1951) regarded the theory of probabillty as a generalization
of ordinary Boolean loglc. Jeffreys was also the first to propose
an axlomatlc foundation for "inproper" distrlbutions (1.e.,
distributlone whoee deneities lntegraEe to lnflnlty inetead of
to unity), whereLn certalnty is represented by lnflnlty lnstead
of by unity; for a nasterly exposltion of Jeffreyst axioms of
probablllty, see Zellner (1971, pp. 42-53; 1983, pp. 74-8f).
FOOTNOTE
5
CONTINUED ON NEXT PAGE
SWAMY, CONWAY, AND VON ZUR MUEHLEN
t2
t=x;+vr
(t=L,z,... ,
T)
(r)
xl represents anticipatlons of lndividuals formed about
the values of xg in perlod t-1 (as in Wallist model), and vg l-s
the diecrepancy of individualsr antl-clpatlons from lhe actual
observable xg. Further, 1et
where
xt* = efg
(2)
is a k x r matrix of less Ehan fu11 rank, every colunn of
whlch containe at l-east two nonzero elements and no row of whlch
ls nu1-1, and f1 is a r x 1 random vector conslsting of the
psychologlcal factors that determlne lndividual anticlpatlone'
Suppose also that the elernents of the vector (f!, vl)t are
nutuallylndependent.LetYgbeascalarrandomvarlabledeflned
where A
by
Yt=nrfg*€6gr
(3)
lr is a 1 x r fixed vector and €6g is a random varlable
independent of fr. WithouE Loss of generallty, 1et E fg = 0,
Evg = 0, and EGgg = 6. Then the following theoren due to Kagan'
Linnlk and Rao (KLR) (1973, p. 320) gives the conditl-one under
which the conditional rrnean E(Yglxg) extsts and is linear'
Theoren (KLR): Let k ) 2 and let the regression of € 9g on
vs be llnear: E(€pglvg) = 6tvs. Then E(Ygl*t) = x{8 if and only
if the followlng conditlons are satisfl-ed:
where
FbbTIi6'IE-5Tn-mvunl
Ilowever, the probabillty calculus based on Boolean algebra or
a
finite addttlvtty conditlon is less general than that based on
Kolmogorovrs axloms. For example, the theory of conditional
alistributlons for flnitely addltlve probabillties 1s relatively
new and stlll incornplete [see l{eath and Sudderth (1978)]' we
sha11 say more about finlte addltivlty Later in thls paper'
To avold the vlolation of the axlom of identity, we use only
Kolr,oogorovrs axioms here.
FOUNDATIONS OF
ECONOMETRICS
13
(f) If the Jth element of I - A'B ls not zero, Ehen the Jth
elenent of f1 is normal.
(fi) If the JEh element of I - 6 is not zero, then the Jth
element of v1 ls noroal-.
(iii) If the Jth elernents of fg and vg ar€ normal, then the
vector B and the variances of the elernents of fg and vt are reLated
according to an identity glven in KLR (1973, p. 327,10.5.7).
This theoren has been extended by KLR to the cases where A
has fu11 colurnn rank, r=1, or k=1. The last case is especlalLy
interesting because when k=1, the randorn variables tx, vg r and €6a
need not be normal Ln order for the regression of Yg on xg to be
lLnear. Further, this regressl.on is not linear for all A and X.
Thuer,unllke Sargentrs or lJaLl-lsr rational expectations nodelthe expectations nodel
n(v.lx.) = xiB
(4)
in (3) and (l) respectively, follows
1oglca1l-y fron the conJunctl-on of the conditions of KtRrs Theorem.
ModeL (4) is true tf the condl-tlons of KLRrs Theoren are true; l-n
this sense, lt is sufficient and 1oglca1Ly conslstent. Notice
that the specification of nodel (4) does not violate any of
Aristotlets axions.6 KLR's Theoren ls an example of an argunent
ln modus ponens. In fact, all the characterization theorens
- - Mathernatfcfans, working with detentrinistic rnodels, give
sufflcient, necessary, or necessary and sufficient conditions for
their results. They do not violate any of Aristotlers axions.
In the same rtay, Kolmogorov deduces the entire probabiltty
calculus from a conelstent set of axions without vlolating
Arlstotlers axioms. Econometricians can, in principle, follow
the sane derivatl-on. They can take a consistent set of behavioral
assumptions nade by economists and conbLne then wlth Kofuoogorovrs
axlonrs of probability tn a consistent manner to obtain a full set
of consistent behavioral and stochastic assumptions. From these
assumptions they can deduce a 1ogical1y consistent, sufficLent
model, as ln Example 5. In this case, stochasticism does not
necessarily vLolate the axlom of the excluded-nidd1e.
where Yg and xg are aa defined
SWA},IY, CONWAY, AND VON ZUR MUEHLEN
L4
proved by KLR yield several
different
arguments
in
modus Ponens,
thereby renderlng the srochastic rnodels in their theorems
sufficient and loglca11y consistent.
3.2 Alternative theories
At any Point in time there nay not be only
one theory
of
the
economy. There can be several conpeting theorles favoring sone
glven propositions or speciflc predictlons. Kuhnrs view oo how
gcience works Ls very well known among nethodologists Isee Boland
(1982, p. l6f) and Blaug (1980, pp. 27-33)1. According to hls
vLew, scLentific progress is not sfunp1y the cunulative reeult of
normal sclentific actlvity. The scientlfic revolutions and the
controversLes that arl,se when one paradigm Ls deemed to be a blt
stale and a new and more promlsing paradigm takes ite place'
clearly do not flt lnto an lncrementallst view of the world' In
the case of a revolution in scientific paradigm ln this sense' we
slmply have one more theory to add to the existlng (economic)
theories. If each of these Eheorles is sufficient and 1ogical1y
conslstent, then its concluslons are true whenever all of lts
assumptions are true. If two theorles contradict each other,
then both of thern cannot be true.
4. Inductivlsn
and Approxinations
of induction
far has not yet addressed the neans by
which one knows the truth of the assumptions of a sufficlent and
logically coneistent explanatlon. Unfortunately, Arlstotellan
loglc ls of 1lttle help ln deEerroining the truth of an assumptlon'
It can only help by "passlng" along known truthe from assumptLons
Thls llnitatlon of tradltlonal
to conclusions as 1. glq-ry.
logic leads to a consideration of the so-ca11ed probleo of
inductlon: the problern of findlng a form of loglcal argument l-n
4.1
The problem
The discussion so
FOUNDATIONS OF ECONOMETRICS
which (a) the conclusion ls a general statement and (b) the
assumptions lnclude gla slngular statements of partLcuLars
[Boland (7979, p. 506)]. Inductl-visn is the methodologlcal
doctrlne that asserts that any Justlfication of oners knowledge
must be 1ogica1ly based glg!1 on experiential evidence con6isting
of partlcular or sl.ngular obgervational statements IBoland (1982,
p. 14)1.7 Given inductivi-sm, any straightforward solution to
the problern of induction requlres an l-nducEive 1ogic. That
ls to say, there must be a forn of logic whlch permits correct
(inducti.ve) arguments consisting of ooly singular statements
(..g., a finlte number of observatlons) to yield true conclusions
that are general statements (..g., "the conditions of KLR!s
Theorem are absolutely true"). Unfortunately, there is no solution to the problern of induction: no matter how maly "facts" one
has, one cannot ever prove the absol-ute truth of the conditLons
of KLR's Theoren. Popper (1972, pp. 23'29) considered induction
to be a "mythr" and, indeed, said so in no uncertaLn terms.
The personal (or subjective) probability theory also supports
the conclusion that there ls no solutl-on to the problem of lnductlon. In the vlew of a personall-stic Bayesian like Savage (f98f)'
our opinions today are the (rational) consequence of what they were
yesterday and of what we have seen since yesterday. Similarly'
yesterdayrs opinions can be traced to the day before' but the
theory of personal probability does not pretend to say what
system of opinions ne ought to have been born wlth. Thus, there
are no oblective grounds for any specific belief beyond recent
experience and there are none for believing a unlversal proposltion other than one that is tautological", gLven what has been
observed. Knowledge of a unlversal ls acceptance with a hlgh
personal probabtlity of a unlversal with flnlte domaLn or of many
such, vaguely speclfied. But Justification of opinione ln the
-7--ET;E'-how philosophers define the problern of inductlon.
Unless othernlse quallfied, this le the sense in whlch we use the
terrn "problem of lnductlon" throughout the paper.
I6
SWAMY, CONWAY, AND VON ZUR MUEHLEN
that one ltho rejects them is guilty of an error
to a loglcal fallacy is not aval lab1e.
sense
retreat to approxinations
Since there is no solution to the problem of induction'
tenptlng to argue that if the condltions of the real world
4.2
ls
conparable
The
it
sufflciently well the conditions of KLRrs Theorern
(whatever this means), model (4) w111 be approxlmately true.
This tenptatlon 6hou1d be reslsted Lf we are lnterested in applylng Aristotellan two-valued logic because there ls no valid
approxinate modus poneng.8 Modus ponens is not valid for arguments consistlng of statements Ehat are only "approxirnately
correct." The reagon is that auch staEements vl-olate Ariatotlers
axiom of the excluded-mtddle and hence are not adnisslble into
Aristotellan type loglcal arguments [see Boland (1981, p. 85)].
The valldity of cerEain results under certain departures fron
certaLn assumptLons proved by Rao and Mitra (L97L' pp. 155-167)'
Box and Tiao (1964) and FLsher (1961) does not establlsh the
validity of an approxinate modus ponens, but, rather, stnply
demonstrates the fact that a result can be true even lf tts
sufflclent condltlon is fa1se.
approxLmate
4.3 Attenpts to solve the Problem of induction
Jeffreys (1961, p. 43) has constructed an axiomatlc system
for a Bayesian probabllity theory to formallze Lnductive loglc ln
such a way that lt lncludes deductlve logic as a special llnltlng
case [see Zellner (L97I, p. 4; f983' p. 74)1. An alternative
statemetrt of Jeffreys'argument given by Good (1962, p.488) ls
that an inducttve inference can approach certalnty or, aymbolica11y, that P(pnlpl,..., pn-l)+I, where Plr P2r... are the results
of experinents, all of which were "successful." Good (1962'
pp. 488-489) offers the criticisn that "it seems somewhat
8 An approprlate nethodology for handLing lnexact concepts such
as approxLmatlons is provided by the nany-valued logic franework
discussed in Section 7.
FOUNDATIONS OF ECONOMETRICS
inconsistent to deny [as Jeffreys does] the validity of a limttlng definition for physical probabilities, and then to use [again
as Jeffreys doesl a lirnittng argument for the justificatlon of
lnduction." Good then goes on Eo argue that ln practice (after a
very 1ong, but of course, finlte sequence of successes) repeated
veriflcations of the consequencee of a hypothesls would not make
it practically certain that the next consequence of lt would be
verifled if the new consequence were kooltn to be of an entlrely
different character frorn Ehe previous ones.
To ilLustrace his inductive 1ogLc, Jeffreys (1961, p. i28)
considers the hypothesis lhat all animals with feathers have
beaks and clairns that the probabllity that this hypothesls ls
true will approach I as Ehe number of successes, wlthout fal-lures,
increases. Comrnenting on Jeffreysr l-l1ustratlon, Good (1962,
p. 489) polnts out that "[t]he argument and the conclusion are
both undermlned by the fact that the dlstribution of essentl-al1y
distlnct species of feathered anirnals rnight be very skewIed]...,
or even norse, so tha! there would always renaln species that had
not been sarnpled until about half of the entire population of
feathered aninals had been examined." Accordingly, Good (L962,
p. 489) says, "a general 1aw which does not itself refer to
probablllties will not usually tend to become certain, however
often lt ls verified. If the 1aw could be qualified by saying
that lt l-s to be lnterpreted as applying only to experiments and
observations of a sinllar nature to those already rnade, then it
night well tend to certainty, but lt l-s not easy to make this
klnd of qualification precise." Berk (1970) has attempted to
nake this quallflcation preclse by describtng the conditions
under which a sequence of poeterior distrlbutLons converges
weakly Eo a degenerate dlstrlbutlon. Therefore, under Berkrs
(1970) conditlons, a 1aw could be properly qualified so that it
tends to certalnty. llowever, the problen of establishing the
truth of Berkrs conditlons ls identlcal to the problem of induction. Thus, Goodrs deecrlptlon of the problem with Jeffreyst
77
SWAMY, CONWAY, AND VON ZUR MUEHLEN
i.nductive infereace appears to be consistent r'lEh Popperrs or
Boland's (1982) description of the problem with the philosophersr
type lnduction. Boland (1979) discusses several unsuccessful
attempts to solve the phii-osophersr problem of induction'
Therefore, we conclude that the problem of inductlon as stated
by Boland is not solvable and the attenpts to solve it are not
successful.
5.
5.1
Popperrs
Falsiflcationism and Related
Problems
Asynnetry
is a fundamental asymmetry between establishing the
truth of a statement and refuting it. No unlversal statement can
be 1ogica1ly derived from or conclusively established by any
finite number of singular statenents, but any unlversal statement
can be 1ogica11y contradicted or refuted with the aid of deductive
logic by only one singular statement. Popper exploits this
fundamental asynmetry in formulating his demarcatlon cricerlon
tlhich states: sclence is that body of synthetic proposltl-ons
about the real world that can, at least in prlnclple, be falsified
by enpirical observatlons because, in effect, Ehey rule out
cerLain events from occurrlng [B1aug (1980, p. 12)]. This
demarcatLon crlterlon then leads to the following program:
Whenever lt is shown t.hat one of the predictlons of a theory ls
fa1se, then, by modus tollens' lte can conclude that at least one
of its assunpcions must be fa1se, provided all of Arl-slotlers
axloms are satisfied. Predlctions have an overrlding importance
for Popper in refuting explanatory theories. Accordingly'
sclentl-sts seek to explain their observatlons, and they derlve
logical predlctlons that are lnherent in thelr explanatlons ln
order to refute thelr theories; all "true" theories are merely
provlsionally true, having so far defied falsiflcatioo; or'
reetating Popperts pol-nt' all the naterial tsruth we possess is
packed into lhose theorles that have not yet been falsified
There
FOUNDATIONS OP
ECONOMETRICS
19
lflaug (1980, p. 17)]. In econometrLcs, the question also arises:
falsify economic theories? If a nodel violates
Aristotlets axion of identity or noncontradictlon, then certainly,
we can reJebt it outright on the grounds of loglcal lnvalidity,
as we proposed in the preceding sections. If a model violates
Aristotlers axiom of the excluded-middle. then rte cannot uae
We can use many-valued logic, !o be
modus tollens to refute it.
dlscussed in Section 7 below, to analyze euch a nodel, but we cannot refute Lt. The difficult problem remalns: IIow do hre refute
a sufficient and 1ogica1Ly consistenE model of the forn (4)?
llow does one
5.2 Loglcal ambiguity regarding refutatlons
Equat.lon (4) nay represent the behavior of some unlt of the
economy. If lt does,lt will only partly do so because, at best,
economic theory suggests only lthlch variables are deflnltely
reLevant and whlch are possibly relevant. Theory night, in addition, outline aome broad features of the approprlate functional
form. In general, economl"c theories do not u8ually aasert a
specific functlonal form nor do they suggest a specific distrlbutlon for the pertlnent variables. Indeed, as a practical matter,
the epread of available observatlons and thelr number wl11 reetrLct
onets abllity to discrlml-nate anong general-Lzed. functional
forns. Therefore, in order to bul1d a eufficlent and logically
conslsten! model of the form (4), we mus! add extraneous assumptions of whlch the conditiono of KLRrs Theorem are examples. One
problen wlth eufficlent and Logically coflslstent rnodels is their
inherent logical anbigulty regardlng refutatlone. In other
words, if nodel (4) were refuted, one would not know whether the
source of the fallure rtas the set of extraneous assumptlone or
the seE of original behavioral assumptions underlylng (4) [Boland
(L982, p. 120)1.
5.3
The problen caused by uncertainty
Any
eufficient
and
loglcally cons{stent
econornetric nodel of
20
SWAMY, CONI.JAY, AND VON ZUR MUEHLEN
the foru (4) represents a sltuation of uncertainty' In the case
ofcertainty'therewouldbeaone-to-onecorrespondencebetween
actions and consequences. In the presence of uncertalnty, model
(4) nerely generates a probabiltty dlstrlbutlon of coosequences
resulting fron a change in the exogenous variables, l-'€', 3o
exogenous change in xg in (4) generates a (Predictlve) dlstrlbutlon of Y1. Uncertainty, then' means that we [ust dlscriminate
and choose among sufficient and logtcally conslstent models by
comparingthepredictivedistributionsofavariableofinterest
inplled by those nodels. But then the basis of conparlson and
falslflcatlon becomes an issue. First, to dlstlnguish model (4)
from another nodel Lt must be identifiable. That is, if the
variabLe Y6 has different distributioas for dlfferent values of
p,thennodel(4)isidentlfiableandl-nformativestatemenlsabout
E can be nade uslng an observed value of Ya'9 Seuinal work on
(1971'
comparlng predletlve dlstrlbutions can be found ln Zellner
pp. 306-317). ThLs work has been taken further by Gelsel (1975)
and others. Second, as shown by DeGroot (1970, pp' 86-115)' the
speciflcation of a cornplete orderlng of a class of predictive
distrlbutlons requlres an approprl-ate utllity functlon' Apart
fron the obvl-ous practlcal difftculcies aasoclated with the determinarLon of an approprlate uttltty functlon, such an orderlng rnay
not concluslvely falelfy a nodel lGeisel (1975)]'
that identiftablllty ln this sense ls lmportant
T/f-Etlfeve
Fron.the
ft"t i.ti the non-Bayesian and Eayeslan vl-ewpolnts'
quantif lcations
neanlngful
stanipoint,
op.."iio""ff
subJectivlst,
"t by (coherent) lndividuals about
of uncertainty are Lhose nade
observables. fte."for", .tt op"t"tionali'y lnterpretabfe !11Uffiuatirrtydistribut1onoveranyunobservab1e(11kea
is possltle lf there la a one-to-one correspondence
;;;;;";;.)
betweentheunobservableandthecondltlonaldistrlbutionofsooe
bets
observables given the unobservable because in thie case the
bets
the
lnto
translated
unlquely
be
can
observables
the
about
there ls
about the ,rrrobse..r"ii".- rtt the unldlntlfiabiltty case' (subinterpretable
no such correspondence and an oPeratlonally
j""ii".l-ltotalrrrty
distrlbutlon over the parameters ilAy not be
possible.
FOUNDATIONS
OF
ECONOMETRICS
2T
falsify a model
arises if a sequence of posterior distributions for the nodel
converges weakly to a distribution which ls degenerate at a
correct point in the paraneter space whenever the nodel is true.
Sufficlent conditlons for this convergence are glven in Berk
(f970), as sre have already polnted out. In an earlier paper,
Berk (1966) has also shown that the posterior distrlbutl-on tends
to be confined to a special set in the paraneLer space when the
nodel is incorrecc. Verlficatlon of the truth of Berkrs conditions is unlikely siace it would seem to require a solution
to the problem of lnduction.
The situatl-on lrhere one can conclusively
6. Objective
Knowledge,
Instrunentalisn, Sinplicity
or Profligacy
and
Parsinony
6.1 Bolandrs (1982, p. 179) view
A role of theoretical knowledge is one of providing sufficient and 1ogica11y consistent explanations of the real wor1d.
Boland (L982, p. I79) proposes that this role be separated from
one havl-ng to do with truth (i.e., whether statements derived
from theoretical knowledge are true or false). This separation
is Justified because the truth of someonefs knowledge is not
always necessary for successful action, as we show below ln this
section. By separatlng the role of knowledge fron its truth
status, Boland is not suggestlng that theories or knowledge
cannot be true. Rather he is asserting LhaL a theory can be true
even Lhough its truth status ls usually unknown to us. The truth
status of anyoners theoretlcal knowledge is necessarj.ly conJectural, but by irs loglcal nature Lt must be capable of at
least belng staEed in language or in "other repeatable forms" to
the extent that it concerns the real world [Popper (L972,
pp. 106ff) l.
22
SWAMY, CONWAY, AND VON ZUR MUEHLEN
6.2 Instrumentaliso
There is a vLew that, as long as a 1ogically va1ld rnodel
does its lntended Job, there ls no apparent need to argue ln lte
favor, or on behaLf of any of its constituent parts. For poLicyorlented economLsts' the stated purpose ls the generat{on of
"Erue" or comparatl.vely successful predictlons. In this case a
nodelrs predlctlve success ls certainly a sufflcient argument ln
lts favor. This view of the gfg of models is ca11ed "instrumentall-sm. " Instrumentalism holds that nodels do not have to be
considered true statements about the nature of the worLd, but,
instead, nay be considered convenlent ways of (logtcal1y)
generating what have lurned out to be true (or successful)
predictions or conclusions lBoland (1982, p. 114)]. A prlori
truth of the assumptLons ls not required if it is already known
that the concLuslons are "true" or acceptable by sone conventionalist crl-terlon. It is ln this manner that instrumentalism'
such as the view presented by Friednan (1953), responds to the
failure to solve the probLem of lnduction. For the followers of
Lnstrunentall"t, glg po* wt11 now necessarlly be eeen as
irrelevant because they begin thelr analysls wlth a search--not
for the true assurnptlons--but for "true" or useful conclusions.
t{odus tollens ls likewlse irrelevant because lte use can only
begtn with false concluslons IBoland (1982, pp. 144-145)1.
Priedman concludes Ehat testing of assumptions ls
Lrrelevant for true conclusions since nodua tollens cannot be
used in reverse whenever tests reject assumptions, and statistlcal
testing vlolates Arlstotlets axlom of the excluded-niddle in any
eca11 saqle case. This leads Friedman to discuss the possibllity
that a false assumption nl-ght be applled as part of an explanatl-on
of sore observed phenomenon. llere he introduces hls fanous
version of the "as lf" theory of explanatLon: "[I]f we are trylng
to explain the effect of the assumed behavlor of sone
lndividua1s..., eo long as the effect Le in fact observed and lt
rould be the effect lf they were Ln fact to behave as ite assume'
FOUNDATIONS OF ECONOMETRICS
se can use our behavioral assumption even when the assumption is
fa1se." IBoland, (1979, p. 513)]. Thus, under lnstrumentalism,
the use of 1ogically consistent but lncorrect assumptions ls
Justtfiable.
While FrLedrnanrs instrumentalisrn does not necessarily violate
any loglcal princlples, Lt can, nevertheless, give rise to the
sane difficulties as Popperra falslficationism when applied to
stochastic nodels of the type (4). For example, in the presence
of uncertalnty, the meanlng of the term "true or useful prediction"
ls not c1ear. Also, past auccess ln prediction ni11 not guarantee
future success. Therefore, DeGrootrs (1970, pp. 86-115) conplete
orderlng of predictive dlstributions can change over tine IGelsel
(1974)1. Furthermore, there exists no slngle econometric model
which predlcts all varlables better than any other nodel for all
tirne periods. Flnally, ae Boland (L982, p. l-96) points out,
LnsErumentalisn is approprlate only for lnrnedlate practlcal
problems and not for long-terrn phllosophical questLons and, as
Kyburg (1983, p. 27) argues, a "strongly instrunentalist vl-ew
of science is perfectly adequate to the design of experlmental
apparatus as well as to the creation of engLneering wonders."
But to understand the world, we need a nodel that repreaents it
adequately. There is no guarantee that Lnstrumentallsm leads to
such a model. "Repeated successes (or failed refutations) of
instrumentalisn... [are] 1ogtca11y equivalenE to repeated aucceaaful predictLone or true conclusions. We sti11 cannot conclude
1oglcaL1y that the assumptLone, l-.e., the basee of lnstrumentalisn
Itself, are true. They could very well be false, and in the
future someone nay be able to find a refutatlon" [Boland (L979,
p.
522)1.
6.3 Sinpl-lcity
Expllclt acknowledgement of modeling fallures can be indefinltely postponed by such patchwork devlces as dumy variables,
ratchet variables, Judgenental conetant-term adJustments, and
SWAMY, CONWAY, AND VON ZUR MUEHLEN
in applied econometrlcs.
Followlng Lakatos [Blaug (1980, P. 36)], thls practice ls cal1ed
"degenerating," since it involves the endless addition of 35|@
adjustrnents thaE nerely accommodate whatever new facts become
avallable. Ilistoricallyr thls type of resPonse Eo fallure has
given lrnpetus to a yearning for sl-nplicity in nodeling' Unfortunately, as Blaug (1980, 9. 25) polnts out' attenpEs to deflne
precisei,y what ls meant by a simple theory have so far falled
largely because the very notion (of the sinplielty of a theory)
is itself highly conditloned by the hlstorical perspective of
scientists. This view is shared by Good (I980b' P' 403) who'
after some heroic but unguccessful attenpts by Nagel IGood (1980b'
p. 402)l and hlmself [Good (l98ob, pp. 422'423)]' adnits that it
ls difflcult to specify sharply whether one 1aw Ls more general
than another. As ln physics and other highly developed sciences,
where it has become largely accepted that sords unaccompanl-ed by
operational meanlng are suspect and 1tke1y to prove meanlngless
IBridgman (1928)], we reject unquallfled termlnoLogy llke
"simpliclty" and "cornplexity. " Indeed, from the polnt of view
of l-nstrumentalisn, there is no need to imPose such undefinable
crlteria, since the only relevant criterion ought to be
dogmatic
priors whlch are
commonly used
efficaclousness: use whatever works, provided the meana are
derived from a 1ogtca11y valld nodel and are not influenced
patctrerork devlces designed to flt speclfic needs.
by
vs. principle of profllgacy
Occasionally, elther the "prlnciple of parsLmony' or the
"prlnclple of profligacy" is advocated as a crlterlon for model
choice. An example of the forrner vlew is given by Box (1976,
p. 792) who argues, "Just as the abiltty to devlse sinple but
evocative nodels ls the signature of the great eclentlst 8o
overelaboraEion and overParameterization is often the mark of
nedlocrlty." An advocate of the prlnciple of profligacy 18 Slns
(1980, p. 15). Where pure lnductlvisn requiree a final (absolute)
6.4 Prl-nciple of
parsimony
FOUNDATIONS OF
ECONOMETRICS
25
inductive proof for any true model, these tlto so-called prlnci-p1-es
requ1reon1y@deductl-veargumentstoshowthatthe
chosen model ls alternatively "slmple but evocatlve" or "extravagantly paraneterlzed." Ilowever, thls solution poses a new
problem. Since knowledge is Lmperfect, condltlonal deductive
argumenta necessarlly contain assumptLons. Therefore, the choice
of a rnodel is always open to question. In partlcular, one can,
crlterla used to deflne "parsimony" or
"profligacy." So thls posslbility then presents the new problen
of an lnflnLte regress ln whlch the next step lnvolves the choice
of a rnetacrlterLon to be used for defining what is evocaLlve
stnpllcity or paranetTic extravagance. A flnal unintended consequence of an appllcatlon of either of these "prlnciples" is,
lndeed, a circularlty whereby some operatlve crlteria are held to
be appropriate because Lhey are sufficient to Justtfy our choice.
The "prlnclple of parslmony" rnay nislead the lnvestlgator, sLnce
the true theory might actually be different frorn the one he
considers on the basls of his definition of the prlnclple of
parslnony. On the other hand, a profusely parameterized model
rnay not be falslflable or identiflable ln the econometric sense.
Therefore, without further qualifications, the nonlogical
princlples of parsimony and profligacy are vacuous, and, Lndeed,
have the smack of nunerology rather than of sclence.
as we1l, questlon the
7. Restrlctive Nature of Aristotlers
Axioms and Methods of
Relaxlng Then
7.L Giidelrs challenge to the axion of noncontradiction
In thls section we begin to address the following questions:
(1) can we satisfy Aristotlets axlome all the tine? and (2) what
are the conaequences of relaxing some of hls axloms?
The restriction imposed by the axiom of noncontradlctl.on can
best be expressed ln terms of a conpleteness concept. [The
following discossion ls borrowed from McCawley (1981, p. 76) and
26
swAMY, CoNWAY, AND VON ZUR MUEHLEN
pp. 4-5, 81 and 85)1. A systen is "deductively
complete" lf it inposes the most strlngent possible restrictlon
on states of affalrs: that there is only one sEate of affairs in
which the rulee of the system lead only Eo true conclusLons when
applied to true assuoptions, as l"n modus Ponens. A system ls
deductlvely complete if for every proposltlon p of the system'
either p or not-p is a theorern of the systen. The question of
"deductive cornpleteness" arl-ses ln connection with any system
that includes axlons for a particular subJecE matter. one exanple
concerns a formal system that ls intended as an axiomatizatlon of
the arithnetic of posltlve lntegers. In a most celebrated resul,t
Lyndon (1966,
regarding deductive completeness, G6delrs incompleteneas theorem
shows that any axiomatizatl-on of the arithneclc of positive
integers is elther deductively incomplete or inconsistent; that
ie, if our supposed axioms for arithnetic are consistent (i.e',
they dontt al1ow us to deduce any contradlctions), then there are
propositions of arithnetlc that are "undecidable" relative to
those axloms: proposiElons A such that neither A nor not-A can
be proven from those axioms.
ciiDEL's TNcoMPLETENESS TEEoREM: If the theory T ls valid'
it l-s not complete.
Thus, there ls a conflict betlteen deductlve completenese and
logical consistency. We have to glve up elther the goal of
achieving deductlve completeness or the goal of achieving logical
consistency. As we have already argued, lf at the very minlmum
to be true a theory or model must be 1oglca11y conslstent' Ehen
lre cannot give up logtcal consLstency. This means that we have
to give up deductlve cornpleteness. Once we give up deductlve
completeness, there are propositlons of econometrics that are
undecidable rel.ative to any assumed econonetric axloms tha! do
not permlt ue to deduce any contradictlons.
The usefulness of GiJdelrs theoren resldes ln lte abllity to
indicat.e a llnitatlon of an econometrlc approach, where one
hypothesls is tested agalnst another by conbining the trto rival
FOUNDATIONS OF
ECONOMETRICS
27
in a general nodel such that each of the hypotheses
can be derlved as speclal cases of the conbined nodel [see Pesaran
(1982)1. In other words, tn applied research Ln econometrics
there is a natural llmlt to the process of buildtng a general
nodel by enbedding a number of otherwise nonnested models tf the
general nodel is required to be logicall-y conslstent.
hypotheses
7.2 Local vs.
g1oba1 coherence
G6delre lncompletenesa theorem noEwithstanding, the discussion in Sections 2-6 shows that there 1s much to be said for
conducting the study of econometrics even within the weakest
possible netatheory that wl11 serve. In support of thls klnd of
proposltion, Barnard (I976, p. 121) states that "...a useful
distlnction can be drawn between r1oca1 coherencer and rglobal
coherencer. ...[T]he total process of naking eclentiflc
inferences... involves, among other thlngs, the cholce of a
rcde1, and rrnodel critlclsn. I The procees of nodel critlcLsn may
se11 lead to a change of model-, and in that case, we rnay well
cotrtradlct propositlons prevlously taken (tentatively) as true.
Such a change of model could lead to rLncoherencerr... But so
long as we contlnue to use the same mode1, our reasonl-ng should
of course, satisfy the rcoherencyt requirement. Thus, we should
be 1oca11y coherent (withtn the sane model) but good senae may
rel1 require Ehat we should be prepared Eo be rincoherentr when
e change of rnodel ls ca11ed for." Good (1983, p. f6), using the
t€n "type 2 princlple of ratlonality" to refer to Barnardrs term
'local coherencer" states that the procesa of enlargLng bodies of
bcliefs and of checking them for consl"stency can never be conpleted
crn l-n prlnciple, by vlrtue of Gijdelrs theoren. Ilence, the type
2 principle of ratlonaltty is a logical necessi.ty.
Denpster (1968, pp. 244-245) draws a distinction between
cmaistency ln the weak sense and consl-stency in the strong
oltrs€. L.J. Savage and other Bayesians have deecribed conslstent
bcbavior ln the senee of consistency wlth plausibl,e sets of
SWA},IY, CONWAY, AND VON ZUR MUEHLEN
28
is consistency ln the weak
s€ns€r Accordi.ng to Dempster' conslstency ln the atrong sense
referg to consistency rtith some unique true laws of thought and
behavior, rthl-1e consistency ln the weak sense Ls a mLnor achlevenent and provides no grounds for chooeLng among approaches'
Tverskyrs (1974, p. f58) view is sinllar. Ile doee not belleve
that (locaL) coherence, or lnternaL conslstency, of a given set
of probability Judgnents is the only criterion for their adequacy'
In his view "Judgnenta muat be conpatlble wlth the entire web of
bellefs held by the tndividual, and not only consiEtent among
themselves. Conpattbility anong beliefg is the essence of rational
that
Judgnent...l0 One of the lnpllcatlons of Gijdel.rs theorem is
Dempsterrs criEerlon of consistency ln the strong eense and
Tverskyte crl-terlon of the compatibllity of a set of probablllty
nay be
Judgrnents with the Judgets total systern of bellefs
lnpossible to satlsfY.
axLonts which
ln
Dempsterts teros
7.3 Locally coherent Lnference procedures
Local coherence can be achleved by applying de Finettire
(Ig74) eubJectlve Bayesian method to a sufficlent and 1ogical1y
consistent model deftned earlier in sectlon 3. A declgion-maker
who employs such a model ls seen to be locally coherent lf antl
only lf his probabillties are conputed in accordance nith some
finirely additlve prior Isee l{eath and sudderth (1978)]. Unfortunately, there are technical difficulties lnvolved in erryloying
flnitely addltlve distrlbutlons which are not countably addltive.
For example, such prlors nay fall to yleld poeterlors; and even
lf poeEerlors exlst for a finitely additlve prlor, there may be
polnt about savagets axioms that Llndley (1974) brlngs
out ls that they refer to a large wor1d, l-n Savagers language'
Llndley warns that we should not take our worlde too sna11. When
thinklng about one probabllity that ls needed, Lindley says that
-e
we should introduce othersr assess these as well, and then see if
the Judgnents cohere. Els guess ls that typicaLly-they-w111 not'
and ievlsion to make then so will be ca11ed for. One of the lnplicatl-ons of Giidelts theoreo is that coherence cannot be achleved
ln sufficlently large worlds.
FOUNDATIONS OF ECONOMETRICS
no available al,gorLthn for coraputing thern [see Ileath and Sudderth
(L978, p. 336)1. To avoid such difficultLes, Lane and Sudderth
(1983) describe a fairl-y general inferential setting ln whlch all
locally coherent Lnferences can be obtained as posteriors from
proper, countably additive priors. Speciflcally, they prove
that if the (sufficlent and 1ogical1y consistent) sanpling nodel
is a contlnuous function of a parameter and if either the parameter apace or the observatl-on space is compact, then a coherent
Lnference whlch ls a continuous function of the obeervation nu6t
be the posterior of a proper, counEably additlve prior. These
conditions are not always satlsfied. Whenever the conditions
of lleath and Sudderthrs (1978, p. 335) Theorem 1 or Lane and
Sudderthrs (1983, p. 118) Corollary 3.1 are true and the sanpling nodel is grounded in a euffLcient and logtcally consistent
erplanation, Bayesian lnferences are 1,ocally coherent and do not
violate Arl-stotlets axlorn of noncontradiction for a gLven nodel.
An alternatl-ve nethod euggested by Jeffreys (1961) is based
on a calculus of credlbilities (= unique ratlonal degrees of
bellef or inteneLties of convictLon = logical probabilities). In
order to apply thls calculus to statlstics an assumptlon concerning inltial probabllittes of scientific lans or nu11 hypotheses
ls necessary in each appltcation [see Good (L962, p. 488)]. Once
initial probabtlities have been speclfl-ed, ernpirical evidence
lodifles such probabilittes via Bayee theoren, provided these
lnitlal (or prtor) probabilities possess posterl-or probabilltles.
Although it can be said that 1ogical1y inconslstent theories or
hypotheses have zero prlor probabtllty of being true, one philosophical difftculty here concerns the unprovable exlstence of
logical prlor probabilities for all (1ogica1ly conelstent)
scientlfic lawe or null hypotheses, whlch Jeffreys and his
followers stnply assutne. We say "unprovable" because the available evidence is confllctlng. For example, Tversky (1974) reports
sone results of a research proJect showing that lndividual subJects Judge the probability of a hypothesls by the degree to which
SWAMY, CONWAY, AND VON ZUR MUEHLEN
tt represents the evidence' Irith llttle or no regard for lts
prior probabtlicy, whereas Salmon (1973, p. f13) states that
scientistsr 3udgments about the reasonablenesa or plauslbility of
hypotheses constltute assessmenta of prior probabilltiee. DeGroot
(I974, p. 1053) asserts that "the person's rlnltialr probabilltlee
for X will no! be formed until he realLzes that X affects hls
llfe, and Ehese probabillties will then be personal ones based
on hls individual curuulative observatlonal experLence. There are
no correct initlal probabilities"; for a Justification of thls
assertation, see savage (1981, pp. 519-520). Another difflculty
is that ne may noE be abl-e to recognize inltial probabilltiea even
lf they exl-st. Agaln' as Salmon (L973, pp. 114-116) observes,
the assessment of prior probabilitlee Ls often a difflcult and
subEle matter. Jeffreysr own attempt to overcome thls dlfftculty
by means of some rules or l-nvariance theory (whlch says that the
prlor should be invariant under a rnodel transformation tha! does
not change the parameter space and the structure of the rnodel)
proved not entirely satisfactory even to hineelf and forced him
to use ad hoc adJustments when deciding wheEher or not to use the
theory lsee Good (1962, p. 488)1. More irnportantly' Good (1980a'
pp. 24-25) interpreEs Jeffreyst use of rules for deEerninlng
prLor probabllities as an application of "Type 2 rationallty"'
It ls lnterestlng, then, that de Finettlre and Jeffreysr approaches
make use of the notion of "locally coherent inferences."
It should be noted that Jeffreyst lnvarlance principle wll-1
not ahtays lead to unique prlor probablltties. Although the
invarlance prlnciple determlnes a uniquely defined prior for any
1l-near mode1, it is known that addltional princlples of inference
may be needed to flnd unlquely defined prl-ors for other models
whlch do not have the rlch algebralc atructure of linear models
[eee Vtllegas (f977b)]. one princlple that may be used for this
purpose is a compattbility prlnclple defined in Villegas (L977b,
p. 652) or a certain group structure defined in Vlllegas (1981'
p. 775, Proposition 2). It is aLeo true that nany of the prlors
FOUNDATIONS OF ECONOMETRICS
Jeffreys (1961) uaes are not entLrely based on the invarlance
principle and the lnner structure of the nodel belng analyzed for
they may also be based on external Judgments lVillegas (1977a'
pp. 454)1. Posterior probabllltles nay have some desirable
interpretatlon lf these external Judgmenta are not imposed
lVillegas L977 a, l98l) I.
The invarl-ant prlor dlstrlbutione that produce the invarlant
procedures are, of course' typlcally lmproper. Renyi (1970)
provides an axiomatic system of probabillty that accomodates
inproper prlors. Howeverr not every irnproper prior can lead to
coherent inferences. This, J.n our oplnion, is a loglcal obJection
to the use of lnvariant or improper prlors. If tt is nathematically convenLent and reasonable to uae Lmproper, countably additLve prlors ln practlcal problems, then, in vlew of Eeath and
Sudderthrs (1978, pp. 336-337) Corollary I and Theorem 2, l-t ls
necesaary to tho\t that the chosen improper prlors have posterLors
shich could also be obtalned from properr flnitely additive
priors. If an inproper prLor does not lead to a posterior whLch
could have been obtained from a proper, flnltely additive prl-or,
then such a prlor results Ln lncoherence and should not be ueed.
As alternatives to lnproper prLors, Box and Ttao (1973) and
Zellner (1971) consider prlors whlch we call seeningly inproper.
That i6, by supposlng that to a eufflclent approxlmatlon the
prior follows the form of a partlcular lnproper prlor only over
the range of appreciable llkelthood and that it Euitably fa1ls to
zero outslde that range' Box and Tiao (1973' P. 21) and also
Zellner (1971) ensure that the prlors actually used are proper.
The posterior inferences for local-ly uniform, Proper prlors llke
those for l-mproper prlors are not coherent unless they could also
be obtalned from proper, finltely additlve priors, as lleath and
Sudderthrs (1978) and Stonere (1976) regults lndlcate. See aleo
Stonefs (L976, pp. L24'125) response to Barnard. Thus, ln the
final analyels, the cholce of a prlor should be dependent not
ooly on the nature of the argunents used to derlve it but also on
the coherence of the posterLor lnference lt leads to.
31
32
SWAMY, CONWAY, AND VON ZUR MUEHLEN
flnltely addltive probabillty
for a glven sarnpllng model
p lf and only lf p and q are conslstent. A rlgorous proof of
this result is glven by Lane and Sudderth (1983' pp. 115-116) who
have shown that, roughly speaking, p and q are conslstent if they
are the condltional dlstrlbutiona corresponding to sone Joint
distribution. An analogous result is Kol,rnogorovrs consLstency
theorem for countably additive dlstributlons stated in Rao (1973'
p. 108). It should be noted that coherence nay provlde a necessary
condltlon for reasonable lnference but ls not eufficlent. Lane
and Sudderth (1983, p. lf9) ilLustrate this polnt uslng an example
where the lnferrer wlth a prior ls capable of drawing coherent
lnferences, though he ls'free to be stupid in some clrcumstances."
Thls ls analogous to the result that loglcal conslsEency ls a
necessary condition for an argument to be true' as pointed out Ln
Section 3. l.
Hill- (f984) ls of rhe opi-nlon that lt is too restrlctive to
always restrLct oneself to proper prlor dlstributlons and that
lnproper or flnltely additlve prior dlstrlbutlons may provide
satlsfactory approxlnations ln obtainlng our posterlor distrlbutlons. Ile also presents an Lnteresting discusslon of an example
showlng that lf a Bayeslao uses the uniform prlor dlstrlbutlon
for the parameter 0 ln this example, then hls pogterl-or probabllity for an interval, given any data, ls at Least .95, while
given any 0, the frequentlst probabtlity for the interval is very
tiny. This ls a real paradox and ls known as the phenonenon of
non-conglonerability. The facE that the uniform lnproper prlor
dlstribution can be glven a finitely addltive lnterpretation
explalns why a non-congl,onerabllity occurs in the above example.
This means that nith de FinetLits subJective theory which ln
prlnclple does not rule out any flnltely addltive prlor distrlbution, we must Learn to 1lve wlth non-conglonerabil"ity orr ln
particular, we must accept that we can have Pr(a physical theoryl
data) = 0 for all poselble data, while Pr(a physlcal theory) ) 0.
Furthermore, if we only consider
measures, an lnference q is coherent
FOUNDATIONS OF ECONOMETRICS
A rigorous proof of this result appears tn IIil1 (1981) who ends
his paper with the followlng conclusion: "For those of us who
wlsh to retaLn the subJectlve Bayeslan model for learning and
decLsLon-maklng there appear to be three main paths open. Flrst,
we can restrict Ehe model io finltletlc appllcations and/or to
bounded loss functl-ons and proper prior dlstrlbutlons ln Ehe
lnfinite case; second, we can persist lrith conventional improper
prlor distrlbutions ln the infinlte case, ignoring inadnisstbiHty
(and even exlended inadnisslbility) problems; thlrd, we can
develop the ftnitely addltive theory, learnlng to live wlth nonconglonerabllity. The flrst path ls qulte restrictive and nay
be unreallstlc even as an approxirnatlon. The second path, at
least ln sorne applicatlons, wl11 lead to unnecesgary losses.
Are Lhere any real obJectlone to the thlrd path?"
7.4
Brouwerrs and
IntuitionLstsr critlcisn of the axion of the
excluded-niddle
It ls clear from Section 2.2 that, for any proposltlon p
which satisfies both the excluded-mtddle and the noncontradLctlon
axions, the falslty of the falsity of p impliee the truth of p.
That is, slnce an adnlssible statement canoot be both true and
false and since lL can only be true or false, then lt is either
true or false. If it is not false, then slnce it cannot be falee
too, lt must be true. This ls the basls of all of the lndlrect
proofs that are used in nathematlcs. ll Brouwer and the lntuitlonlsts would not accept this proof. To see why, conslder the
followlng argument which ls borrowed liberally fron Lyndon (1966,
pp. 34-35) and Wl1der (L952, pp. 243-244). Inrultiontsm ls a
well-developed phllosophical posltion which proposes that
nathenatical reasonlng, especlally concerning inflnite sete, be
restrlcted to certain conetructLve and intuLtively tnmedlate
--Ti- Ar-Cuments in which we prove p by supposlng not-p and showlng that a contradtctlon fol-lows are generally referred to as
argunents by leductlo ad absurdun [see Mc0aw]_ey (1981, pp. 27-29)1.
33
SWAMY, CONWAY, AND VON ZUR
34
I,TUEHLEN
principle. Ae a very rough exanple, consider the assertion that
there exists a natural nurnber n which can be found and shown to
have the property P. The fornal negation of the aesertton can
be underetood constructlvely only as statlng that there ls a
means of showing, for each n' that n does not have the property
p. Ivere G a flnite eet of numbers, one could eay it was
"intultively clear" that e{ther G contalned a number rtlth Ehe
property p or no elenent of G had the property p (the nurnber of
Ilowever,
elemente in G is unlmPortant as long as it ls finite).
if G were infinite, lt could be entlrely conceivable that nelther
the assertl-on nor its negation Ls true in thls congtructl-ve
sense. 0n thls basls, intultionlsts deny lndirect proofs and
thus wl-sh to preclude the uee of excl'uded-mtddle eo as to nake
lndirect proofs lmposslble. Heyting subsequently axlomatized the
intultlonlstic proposttional or sentential Loglc. Ilis axlooatlzation differs frorn Arigtotellan sententiaL loglc only in the
onisslon of the axion of the excluded-mtddle. Additional examples
of such an axLomatlc system are those fron which elther frequentlst or Bayeslan statlstical rnodels are derlved Isee Lyndon (1966'
pp. 33-34) l.
7.5
Many-valued
or fuzzy loFic - relaxing the axion of
the
excludedaiddle
Whlle the earlier discusslon ln Sections 2-6 descrlbed a
loglcal syeten obeying Arletotlefs axlon (2) of the excludedmiddle-that for any statement s, "8 or not- a"--' it is
lnstructive to consider the consequences of abandonlng that
axiom so that statenents can be sonethlng other than true or
false. There is a sizeable Llterature ln whlch rules of inference
and princlples of truth value assignnents are formulated and
lnvestlgated for the aet or aets of truth values under consideration. An excellent survey of thls llterature ls given by HcCawley
(1981, Chapter 12).
Once we devl-ate fron Arlstotlete axl-om (2)' the conventLonal
FOUNDATIONS
OF
ECONOMETRICS
35
problenatlcal. Thusr "t.ruth valuer" whlle
firmly established and thus hard to avoid, is mlsleading because
-values" assigned to proposltions need not be TRUTII values
.ELE,
but can be values for other parameters or combinations of paratsters auch aa probabllitles. Thus, the probabllltles assigned
3o sets belonging to the Borel field of sete ln a sample space in
.athematlcal statlstl-cs [Rao (1973, pp. 82-83)] do not always
i41y only sure or impossibl-e events. As Lyndon (1964, p. 33)
ootes, probablllty theory, in its most prinLtLve form, resembles
rany-valued logic in that it attaches to each event a probability
taken as a number tn [0, l]. Thus, probabllities can represent
degrees of certalnty applytng to unknown factual inforrnatlon.
lThere is an lnportant distinctLon between probabllitles as
ilegrees of certainty about unestablished facts and probabilities
as indices of eurprLse about facts which have become known. I
Statisticians treat all nlsslng values of observable varlables
lncludlng the unobserved future values of variabl-es as random
variables. Therefore, the probablltty that a prediction lnterval
rill cover the future value of a varl-able cannot be equal go 0 or
I and thus deserves a different va1ue, lnternediate between 0 and
I lsee Thel1 (1971, pp. 134-f35)]. Thle probabllity does not
hcome 0 or I even after the future value ls real-ized. By contrast, frequentiste refuse to regard (after the experlment) the
leyoan-Pearson confidence coefficl-ent as a "probabllity" of
correctly covering the true parameter valuer lnstead interpretlng
it8 oeaning before the experfunent Ls conducted Ln terns of the
.ctua1 frequency of successes in many (not necessarily identlcal)
crperLnents as Justifled by the law of Large numbere [see Kiefer
(1977b, pp. L73-L74)1. Thus, before the experinent i8 conducted,
e confl.dence lnterval ls random and it l-s reasonable for l-te confidence coefflcient to take a value between 0 and 1. Another
craqle of "truth values" that do not repreaent truth or falslty
ranld be a subJectlve Bayeslan assl-gnment of a real number (greater
ihan or equal to 0 and less than or equal to 1) to a propositlon
oooenclature becomea
36
SWAMY, CONWAY, AND VON ZUR MUEHLEN
to expresa an indj.vidualts degree of belief that thls ProposlElon
is true. The statenent that an event occurs wifh probabiUty p
or that a varlabLe takes an lnterval of values wiLh probabllity p
vlolates the axlom of the excluded-middle unless p ls always
elther 0 or 1.
Dempster (1968, p. 244) has offered an interpreEatlon apPlytng Jointly to the pair of lnterlocklng concepts' probability and
betting. It ls that "probabillties are degrees of certainty
applylng to unknown factual informaLlon, while associated betLing
rules relaLe to decislon-making appropriate to those degrees of
certainty. ....The related use of the lerm EL!g!, as in degrees
of belief,... means only Ehat he who adopts any probabillty nusL
There ls a sl-mllar
have a commLtment to it or bellef in lt.
notion of bellef trnpliclt ln factual information, for he who
quotes a fact (as chough true) must feel conrniLted to it (though'
of course, Lt rnay not be true). Thus, the controversial concept
of probablllstl-c knowledge is no dlfferent as regards bellef from
the noncontroverslal, concePL of determlnistic knowleilge...'12
If what we know today about the future Ls not a part of our
deterministlc knowledge' then our statement about the future cannot properly be ca11ed either true or false (at least, not by our
speaking ln the present) and so deserves a different truth value'
intermedlate between truth and falsity. Thus' there is need for
a many-valued logic concerned ltith the assignment of "values"
Lo propositions without regard to whether those values are
appropriately ca11ed truth values, although such "values" wlll
nornally have sone relation to inference. For exanple, if "values"
are degrees of confidence ln propositions, one would be interested
in developing rules of lnference such that each rule of inference
ytelds conclusions whose confidence leve1 is at least that of
assumptions.
--iZ- a; FG-e i. (L974, pp. 7O'72) has used "predlctlon" and
"prevision" to deoote indivldualsf determlnlstlc and probablListic
knowledge, respectively.
FOUNDATIONS OF ECONOMETRICS
no unique way of extending the notion of valldity
the many-valued case, although one may say that a formula is
There
is
to
valid tf and only if lt receives the same value t (truth) under
all adnlssible asslgnnent.s of truth values to the underl-ying
assumptions. Alternatlvel-y, one can a11ow a set of truth values
to be "deslgnated" and take a formula to be valid lf and only lf
1t recelves a designated truth value under all admissible assignments of truth values to the assunptlons. McCawley (1981, p. 368)
discusses the important question of what it means to say that a
systen of rules of lnference and a set of condltlons on truth
The criterion of fit in the two-valued
value assignrnent "fit.'
case (truth and false) consldered, for exarnple, in Sectlon 2 was
that, for any admisslble asslgnnent of Eruth values, the conclusions that can be lnferred fron assumptions that are true (l-n
that asslgnment) are also true. This crlterlon, in conJunction
sith the speclflc rules of inference of Section 2, imposed very
strong constrainLs on how truth values could be assigned to
complex propositlons. The constralnts were so strong largely
because there were only two truth values. I{owever, if truth
values can be any number from 0 to 1 (where 0 corresponds to
'falsityr" 1 to "truthr" and interrnediate val,ues to varlous
greater and lesser degrees of certal.nty), a less stringent
criterion of fit nay be admitted. A system with this range of
truth vaLues is useful in coplng wlth Lnexact concepts enumerated
in UcCawley (1981, pp. 360-394), such as APPROXII{ATION discussed
in Sectlon 4.2. In de Flnettirs (1974) Bayesian theory, ltself
a rurny-valued loglc, the condition of coherency serves the same
functLon as the criteria of flt ln the verslons of fuzzy logic
covered by McCawley (1981).
To produce rules of inference
it fuzzy logLc, the prlnciple
of "semantLc conpleteness" has been lnvoked [McCawley (1981,
p. 369)1. It requires that statements be provable if they are
valid ln the nany-valued sense. Clear1y, any system of rules of
l-nference designed to raake a ftzzy logic semanttcally complete
37
38
SWAMY, CONWAY, AND VON ZUR MUEHLEN
nu6t neceasarily differ from che rules of Sectlon 2. I{ere it must
be enphasized that 7rr frs.zzy 1ogic, concLusl-ons are restrLcted to
the status of belng only g!_fgg.11 as certain as at least one of
the assumptions. This llnltation ls the price we Pay for relaxing
Aristotlete axLom of the excl.uded-midd1e.
As we potnted out l-n footnote 5, the frequency or subJective
l-nterpretation of probablllty violates the axlom of the excludedniddLe. Our Exanple 5 glves a nodel whi.ch, when conblned with
the frequency interpretatl-on of probabllity, rejects the axlom of
the excluded-middle but ls not lnpllclEly constructed on the basis
of the acceptance of that axlom. Surely, this nodel gives conclusions which hold wlth probabillty less than I in srna11 samples,
but doee not mean that we should reject probablllty theory.
These concLusl-ons may hold wlth probability I ln sufflclently
large samples.
7.6 Blrnbaunrs (1977) evldential interpretation vs. the llkellhood princlple or Bayes proceduree
Inductlon based upon a statistical analysis of a model can
easlly yleld false concluslons from true premlses. A11 we can do
is try to conatruct our l-nductive arguments ln a way that lti11
rnl-ninize the chances of obtainlng false concluslons from true
premises. Most statistical nethods applled to research data
for this purpose have been given their systematic nathernatical
Justlflcation ltithln the Neyrnan-Pearson theory. In thls theory
the nu11 and alternatlve statlsttcal hypotheses whlch are speclfied by the values of a parameter and which nay be "reJected" or
"accepted" on the basls of a testlng procedure can be assocl-ated
nlth the reepective "decisions" appearing in the formal rnodel' of
a decision problem. The slnplest nodels of declslon probleros
may be descrlbed in terms of schenata of the followlng forn:
FOUNDATIONS OF ECONOMETRICS
39
Slmple hypotheses:
IIg
Posslble declslons:
d1 [=re
Error probabillties:
It1
ject
116l
c = Pr[d1lltgl
d2[= do not reJect ltgl
B = Pr [d2lttl l
tern "reject" expresses here an lnterpretatlon of the statlstlcal evidence, as glvlng appreclable, but
linlted support to one statietical hypothesis agal-nst another.
As Birnbaurn rightly points out, statLstical evidence is adequately
The declsl-on-llke
represented noc by d1 and d2 but by
synboJ.s
1lke
dl: (reJect 116 in favor of Il1, c,
B)
dit (t"J."t Ht in favor of H6, o,
B).
and
p. 23) uses the term "evidential interpretatlon"
of the decislon concept to refer to such applicatione of models
of decision problerns and uses the term "confldence concept" of
statistlcal evidence to refer to euch Lnterpretations of
statistical evidence. A definttion of Birnbaumrs "confl-dence
concept" is as follons.
The confidence concept: A concept of statistical evidence
ls not plausible unless it finds "strong evidence for H1 agalnst
E6'with snal1 probabiilty when IIg is true, and with rnuch larger
probabillty (l-B) when II1 is true [Birnbaum (Lg77, p. 24)].13
In evidentl-al lnterpretation, the result of a test is not
llteral1y an action, but ls lnetead a relative evidential- evaluatlon of tno or nore hypotheses. For exanple, Birnbaum (1977,
Birnbaurn (1,977,
-i5-Teyoan and Pearson suggest that a test procedure be selected
guch that B ls as sma11 as possible for a gLven value of c. The
probabllity I tends to 0 as the sanple size tends to € if the
test procedure is consLsteIrt. It is not possible to reduce both
s and B to zero. For an intultlve explanatl-on of Birnbaunrs confidence concept, see Salmon (L973, pp. 105-f12).
SWAMY, CONWAY, AND VON ZUR MUEHLEN
(reject H6 ln favor of Il1'
p. 25) interprets the decislon
'li:
.01, .2) as very strong (but inconclusive) evidence for E1 against
llgr and interprets the declslon df: (reJect II9 ln favor of IIl,
.5, .5) as rtorthless evidence. The latter declsion Ls no more
useful than ls the result of a toss of a fair coin, since the
error probabllitles (.5, .5) also represent the experlment of
tossing a fair coin, with one side labeled "reject Hg" and the
other "reJect II1." Consequentl-y, the exact vaLues of c and F go
a long way toward interpretlng statistlcal evidence'
Althoughtyptcalapplicationsofstandardstallsticalnethods
in econometric research report decislons of these types' the
exact evaluation of o and B is exEremely difficult, lf not
imposslble, in several settings. For this reason' the probabilttles c and E play thelr basic roles largely lnpllcitly and
unsystematlcally in guiding applications and interpretatlons of
econonetrlc evidence. The failure in currenE econometrlc practlce
of not reportlng the values of boEh a and B Leaves much to be
desired in achleving the fu11 discLoeure needed to nake a study
convincl-ng to a crLtlcal reader and the palatabiltty needed to
reach a casual reader.
Blrnbaun (1977, pp. 26-27) Itarns that statistlcal evldence
is one among several basee for validating scientiflc conclusions.
The concluslons of a sclentific investigatlon ehould be based not
only on statlstical evidence of eufficiert strength concernlng
the appropriate statistical hypotheses, but also on (1) the
adequacy of the employed (sufflcienE and logica11y consistent)
nodel to represent the empirical situation ln ftnportant respects
and (2) the coupatibillty with other knowledge and evidence of
a conclusion that ls supported by statlstlcal evldence provided
by the current lnvestigation. The nain difficulty is that the
problen of providing avallable statlstical evidence of the form
al or al on the hypothesle of interest when thls evidence le nlxed
up wlth those on nuigance parameters is indeed complex lsee Basu
(L977)1. The reason ls that a nuisance paraneter may eubstantially
FOUNDATIONS OF ECONOMETRICS
accuracy with which one can evaluate o and 0 for the
hypothesis of interest. [For an incl"sive discusslon on this
problern, see Kieferrs (L977a, pp. 824-825) reply to Denpsterrs
affect the
comments.
]
dlfficulty which should also be stressed here ls that
Lhe values of a and B may depend not only on the evidence but also
on the path by which it was reached. Put differently, o and B
may depend iodividually on Lhe pariicular long run [Cornfteld
(1970, pp 4-6)1. In view of rhis dependence, Cornfleld (1970'
p.6) argues Lhat ooe cannot always equate the concept of choosing between hypotheses on the basis of evidence and the concepi
of reJectlng a hypothesl-s at a fixed a with minimum B but one
shoul,l use the Savage and Lindley Bayeslan rule [Cornfield (1970'
pp. l5-16)l whlch is necessary and sufftcient to minilnize a
llnear compound a + lB, no matter what the long run is. The
Another
posit.ive quantity tr can then be interpreted as the weight one
nould be willing to assign to B as compared to ct. The SavageLindley argument depends on the notion that L\to test procedures,
one of which has a srnaller a and the other a snaller B may be
equally acceptable.
Ilowever, a criticlso of the Savage-Lindley argumenE stas
given by Birnbaum (1977 pp. 37-39). CruciaL to this criticism
'
is the distinction beEween the evidentiaL interpretations defined
above and the following behavioral inEerpretations of the confidence concept. If the result of a test ls not a statement about
a hypothesis but a decision to lake some particular course of
action, then Birnbaum says that this result irnplles a behavloral
inLerpretatlon of the confidence concePt. The Savage-Llndley
argument concerns JudgrrnenLs of preference or else lndifference
(equivalence) betrteen alLernatlve decislon functlons (tests) in
problems of two simple hypotheses, with each declsion functlon
represented by a polnt p = (ct, 8) in the unlt square, determined
by lts error probabilitiee c and B. Birnbaum argues that in some
situatlons he would strongly prefer to u8e a Eest characterized
SWAMY, CONWAY, AND VON ZUR MUEHLEN
by (.05, .05) rather than one characLerized by (.1,0),
although
lndiffereot between (.1, 0) and (0, .I), i.e., that
(.05, .05) > (.1,0) - (0, .l). This paLtern of preferences ls
incompatible with the assurtrpcion of the Savage-Lindley argument
that the equivalence of sets in the unlt square deterrnined by error
probabilities (c,B) ie invariant under probabilistic mixing.
Thls assumptlon holds only for "decislons" under the behavioral
interpretations, but not under the evidential lnterpretations
which constitute the Neynen-pg3r'son statistical practice. Thus,
Blrobaum does not belleve that a 50-50 rnixture of (say) (0, .1)
and (.1, 0) error probabilities will have the same evidential
meaning for al-1 practltioners as the (.05, .05) error probabillties,
but this may not be relevanL to the consistency with which they
would bet, lf forced Eo do so, based on such nunbers. They do
not necessarily achleve a utility that is linear in (c,B). The
polnt which Birnbaum ls making here is that while behavl-ora1
lnterpretatLons of "decislons" nay play a very valuable heuristic
role ln the nathematical developments of the Neynan-Pearson
theorles, statlstlcal nethods based on those theories must be
lnterpreted etiLh care when considered for posslble use with
evidential loterpretaEions. This ls particularly true when the
utilltles of individuals are nonlinear.
Such seens to have been Blrnbaumts later view. IIis earlier
viev was that the process of interpretaElon of experinental
evidence could be reduced to that of lnterpretlng the likelihood
function. Birnbaunrs earller argument Lakes as the rnodel of an
experlment E the triplet (n, S, p) where S = {y} is the (discrete)
sanple space, CI = {0} is the parameLer space, and p = p(y,0) is
the probability functlon of |, for each 0. Then he takes the
palr (E,y), when y is observed, Lo coostitute a uodel of statlstical evldence. The statlgttcs h(Y), t(y) are defLned, respeetiveLy, to be (a) g!g!$g., or (b) sufflcienr if (a) p(y,O) =
g(h) p(yltr,O), where g(h) does not depend on 0,0G0, or (b)
p(y,O) = g(t,e) p(ylt), where p(ylr) does nor depend on 0, 0€fi.
Then some of the axioms of statistical evldence consldered are
he would be
FOUNDATIONS OF ECONOMETRICS
CoNDITIoNALITY PRINCIPLE
(Cp): If h(y) is ancillary,
rhen
Ev(E,y) = Ev(85,y) where h = h(y), Ev(E,y) denores rhe evidence
deternined by an outcone y of the experiment E and E5 = (e, 56,
ph) with S5 = {y:h(y) = h} and p5 = p(ylh,o).
LIKELIHOOD PRINCIPLE (Lp): If, for some c)0, p(y,O) =
cp*(y*,0) for all 0 in 0, then Ev(E,y) = Ev(E*,y*).
MATIIEMATICAL EQUMLENCE (ME): If p(y,O) = p(y*,e) f or all
I 0, then Ev(E,y) = Ey(f,,y*).
Wlth these formulations Birnbaum proved that Cp and ME Jotntly
tnply LP [Barnard and Godambe (1982, p. 1035)]. In words, rhe Lp
says that the ltkeLihood function for the data that happen to
occur Ls alone an adequate descriptlon of an experlnent wLthout
any statement of the probablllty that thls or another ltkelihood
functlon would arise under various values of 0. Birnbaun also
considers the extensions of the above argument (which ls malnly
concerned lrlth the case where the posslble observations form a
discrete set) to the continuous case. Ilere it is polnted out
that the WEAK SUFFICIENCY PRINCIPLE [If, for some c)0, p(y,O) =
cp(y*,O) for a1-1 0 ln 0, then Ev(E,y) = Ev(E, y*)l rnay be needed
Eo replace ME in the derivation of LP fron CP. However, some
conpllcations rnay arise from the llmited degree of arbitrarlness
that exists ln the choice of a denslty functlon.
The LP ls very controversLal. Crltics havg pointed out that
in practice specific statistical nodels can never be who11y trusted
so that a statistlc sufflclent on the hypothesls of a gl-ven rnodells not sufficient under the wider hypothesls that that rnodel may
oot actually obtaln. A naJor crlticlsm of the Lp cane fron
Birnbaun hinself. Specifically, he recognized that the Lp is
inconpatible with hLs confLdence concept--under no 0 sha11 there
be high probablltty of outcomes lnterpreted as "strong evidence
aSainst 0"--deflned earlier. Any adequate concept of statistlcal
evldence must meet at least certain rnlnlmal verslons of both the
LP and the confidence concept, but it has become clear to Birnbaurn
that no such concept of statistical evLdence can exlst. This does
SWAMY, CONWAY, AND VON ZUR MUEHLEN
not mean that adequate concepts could not be found in speciflc
cases, chough Birnbaum thought that the domain to whlch such concepEs could be applied woul-d far from cover the rnaJor aPpll-catlons
of statlstlcal- rnethodl for further dlscusslon, see Barnard and
Godarnbe (1982) who give an exampLe where the ltkelthood function
provides nlsleadtng evidence wlth probabtllty l. Thts type of
likelthood functl-ons also arise ln the context of some econonetrLc
nodels ar.a1-yzed by Maddala (1983, p. 300) and others. Furthermore,
"the recording of the llkelihood function ls not an answer, 8lnce
l-n a subsequent decilt begs the question of gqlong@
sion or stage of lnvestigation to be determined frorn the data"'
pointed out by Kiefer (L977a, p. 823).
The problem of generallzing the LP so that the generallzed
version avoids some of the dtfficulties mentloned above is taken
up by Berger and Wolpert (1984). Ilowever, Lane (1984) ln his
dLscussion of Berger and Wo1-pertts (1984) work raises the
important issue of the neaning of 0 ln the (0, S,p)-paradign.
According to Lane, there are at leasE Ehree possible lnterpretations of the elenents of o: (a) e ts the distrtbutton p; (b) fl
l-s an abstract 6et and 0 rnerely indexes the distributlon p; (c) 0
ls a possi-b1e value for some "rea1" physical Parameter' and p l-s
to be regarded as the digtrlbutlon of the random quantity 9
should 0 be Ehe true value of that parameter. InterpretatLon
(c) ralses the difficult phlloaophlcal questlon: when--and in what
sense--do "rea1" physlcai, pararleters exlet. More l-mportantly, as
Lane points out, depending on whether one adopts interpretatLon
(a), (b), or (c), the (generalized) LP is devoid of lnterestlng
consequences, wrong or severely and amblguously restrLcted in ics
domain of appllcabtltty. In their reply to Laners comDents,
Berger and Wolpert (1984) adrnlt that while the (generaltzed) LP
will apply under l-nterpretatlon (c), tt also applles when 0 ie
onJ-y defined by sone aspect of the experiment. The fundanental
dlfflculty is that the information l-n (0, S, p) says nothing
about how the rnodel represents reall-ty, and it ts hard to aee how
FOUNDATIONS
OF
ECONOMETRICS
a prlnclple of lnference can disregard the details of this representatlon. Slnce the (generallzed) LP lgnores alL euch details,
Lane belleves that lt ls inadequate. IIe says that not enough
lnformatLon ls encoded ln (O, S, p) upon whlch to base a general
princlple of lnference. Lane further shows the irrelevance of the
(0, S, p)-paradlgrn by argulng that the real aln of Lnference
is usuaLly not to make statements about the "true" value of an
unobservable parameter 0 on the basls of an observed quantlty y,
hlt to generate a predictlon about the value of some future
observables and the (generalized) LP does not address the question
of how to generate such a predictlon directly. Another reason
rhy Lane thlnks that the (0, S, p)-paradigm is lrrelevant ls that
especially in applications arising in nonexperinental scLencee
like econometrics, the nodel Ls sculptured either fron data
already in hand or perhaps fron a realistic view of what data are
potentlally available. In such cases, there is no rray to separate
nhat (E, y) Eays about 0 from "prlor" information about e. When
the "parameter" dependa upon the experLment for its existence and
oeaning, the (generalized) LP does not apply and the (o, S, p)paradigrn is lrrelevant. For all these reasons we shoul-d not take
the LP very serl-ously unless rle know lte are in a sltuatLon where
interpretation (c) is approPrlate.
The Bayeslan method applies under all lnterPretations of 0.
Sl-nce the appeal of the confidence concept is undeniabl,e, we now
lnvestigate whether a Bayesian nethod can be developed which is
conslstent wlth that concep!. In hls conmentary on Birnbaurnfs
(L977) paper, Llndley (1977) conatructs one example showing that
the descrlptlon of evidence in the forn of al or al is unsatisfactory and another exanple showing that the particular preference
pattern for teets [(.05' .05) > (.1' 0) - (0, .l)] nentioned by
Birnbaum amounts to preferring evidence that uses the toss of a
coin whose reeult ls irrelevanE to 116 or E1 to that whlch does not.
Accordlng to Ltndley (L977), the meanlng Bl-rnbaun attaches to
evLdence is not as satisfactory as that based on the LP and the
SWAMY, CONWAY, AND VON ZUR MUEHLEN
46
of Bayesf theoren.
follows from the Bayesian argument
and an important property of the llkellhood functlon ls that lt
does not depend on which long run rre conslder the result as
ernbedded ln [see Cornfield (1970, pp. f3-15)]. However, since
every posterior distribution, glven by Bayeer theorem, depends on
a likelihood functLon, the former also gives rnleleadl-ng evidence
whenever Che latter does, depending oo the cholce of prior. In
fact, thle statement agrees rrith the conclusion of Lindleyrs
(1977, p. 58) shorE derivation, the purpose of which is to ehow
that the interpretation of a large value of the posterior odde
ratlo in favor of II1 relatlve to 116 as an evidence agalnst IIg
if the prlor odds ratio
agreea wlth Birnbaunts g!!!g_ggg.9.
l-n favor of H1 relative to It6 ls sufftclently snall. Consequently,
the posterlor odds ratl-os based on arbltrary prior odds ratl-os may
give nisleading evidence.
As Kiefer (I977b, pp. L74-L75) notes, the basic problen ls
that "statistlcs ls too couplex to be codlfled ln terme of a
slmple prescriptlon that is a panacea for all Bettlngs, and...
one mu6t look as careful-ly as posslble at a varlety of possible
procedures..." In view of Kyburgrs (1983, pp. 91-93) conpelling
argumenls lsee Appendlx to the paperl, it is difficult to show
that a partlcular prlor distrtbutlon represents Eomeonets degrees
of beltef; but fron the well establlshed results that under sultable regularity condltlone the Bayesian procedurea and adnisslble
procedures coincide, lt appears reaeonable to conclude, aa
Wolfowltz [Kiefer (L977b, pp. 17t-L73)] doee, that the use of a
prlor distrtbutlon reflects, at the very least, a statistlclanrs
ability to plck an adnissible or a preferred 1ocal1y coherent
procedure fron thoee available or hie truth value assignnent.
Coneequently, lt does not seem unreaeonable to us that one ghould
make a tentatlve firet step at selecting a procedure ln accordance
with Klefer'e (1977b, p. 171) euggestion: Flret, chooge a prior
dlatrlbution that reflects eonethlng of what one would regard as
a desirable rlsk functlon (operating characteristlc), deflned in
use
The LP
FOUNDATIONS OF
ECONOMETRICS
of the relatlve importance of the risk values at various
states of nature. Then compare the rlsk function of the resulting
Bayes procedure with those of other candj.dates to make sure that
lt does not have a subadnisstbllity defect. For an appllcation,
terms
see Swany and Tinsley (1980) and Swamy and Mehta (1983b) who have
attempted to follorr this suggestion ln econonetrlc contexts.
8.
of
Conclusions
The theory and practLce of econometrics draws on Dany sources
knowledge lncludlng economic theory, statlstlcal theory, and
data. Often consideratLons describing lnstitutlonal or
psychological aspects of a problem must be entered. The intersection of all these sources forns a gray area which, as we have
descrlbed in the preceding pages, defles the established rlgors of
Aristotl-ers principles of logic. The foundations of econometrics
economic
are weaker than those so1e1y supporEing econonic Eheory under
certainty. Thus, while Aristotelian principles may be applied to
determine the loglcal valldlty of economlc theory under certainty,
ihe truth of its conclusions so constructed cannot be determined
slnply by inspectlon of empirical results for two reasons: first,
the problem of induction (as defined by Boland) is lnpossible to
solve and, second, the uncertainty theorles and approximations
comonly adopted in econometrics violate Arlstotle's axiom of the
excluded-middle. To our eventual dislllusionment we have found
ihat econometricians cannot establlsh the truth or falsehood of
economl-c theorles unless they are 1oglca11y inconslstent, in
which case they are false. There are, oevertheless, real conirlbutlons to be rnade. Econometricians can develop sufficient
and 1ogica11y consistent theorles and lmpose them on thelr enpirlca1 nodels. Care should be taken to ensure that the behavloral
assuuptions of economic theory and the statlstlcal assumptlons
nade do not contradict each other. Many-valued logic, whlch ls
reaker than Aristotellan 1ogic, provldes a basls for the anaLysis
47
SWAMY, CONWAY, AND VON ZUR MUEHLEN
models. Probability theory, in lts nost prlnltive form,
resembles many-valued logic. lle have also noted the key lmportance of interpretatlons of probabllity in providing the modus
operandL for fitting a sufficient and 1ogical1y conslsten! model
to glven data. Rules of lnference that lead to "senantlcally
compLete" many-valued logic in econometrlcs, thereby ensuring that
nhat is provable is exactly what is va11d, ought to fit a set of
condltions regardlng truth value asslgnments. Our maln recomrnendation for achleving such senantic completeness is to fo11ow
Kieferrs advice and choose a prlor distrlbution (a truth value
assignnent) that reflects aomething of what one would regard as a
deslrable risk functlon and then compare the risk function of the
resulting Bayes procedure with those of other candidates to make
sure that it does not have a subadmlssibllity defect.
Sueh a prescription would, in our view, produce 1oca11y
(wlthin the same model) coherent inferences. I{owever, in ltght
of G'ddelrs incompleteness theorem' g1oba1 coherence or consLstency
among all possible cornpeting models is not possible. To facilitate
cholce among competing nodels' we suggest the following maxim:
When a given 1ogically consistent model produces better predictlons
than any other logica11y consistent rnodel, a researcher may favor
Lt over less successful cornpetitors. At the same tlme, the
researcher should recognize thaE because of uncertalnty, the
currently successful performance of a given nodel nay prove as
evanescent as the morning dew. Thls w111 requlre constant vigil-:
one must always be prepared to check the predictlons of a model
against those of its competltors' new and o1d, provlded the model
and iis competitors are logica11y consistent.
of
such
Appendix
Interpretations of Probabllity
1.
and
of probabllity lnterpretations
Indeed, the meaning of a crlterion of fttting a sufflclent
1oglca11y consistent model to glven data and even the con-
ftnportance
FOUNDATIONS OF ECONOMETRICS
sequences of a fitting criterion will depend fundanentally on the
philosophical interpretation which is glven to the concept of
probability. Furthermore, there are interpretaEions of "probability" under whlch lt makes sense to look for a Justificatlon of
a fittlng crlterl-on, while there are other inferpretatlons under
shich a fitting criterion may be regarded as a part of the definttion of "rationalityr" and sti11 other interpretations under
rrhlch a criterlon l-s not always applicable.
2. Alternative rneanings of the word "probabl1lty"
The word "probablllty" has been used in various senses by a
nunber of writers [see Good (1983, Chapter 6)]. To quote Good
(1962, p. 487), "Ic]he most inportant distinction ls that between
physical, material, or lntrlnsic probabilities, chances, or propensities on the one hand, and non-physical or intuitive probabilities on the other... Intuitive probabilities can be subdivided
into credibilities = loglcal probabllities = unlque rational
degrees of be1lef or inEensltles of convlction on the one hand,
and on the other hand subJective or personal degrees of bellef to
nhich some canons of conslstency, honesty, and maturity have been
applled, in which case they are cal1ed subjectlve or personal
probabllities. "
Probabilttles in Ehese senses can be covered as special cases
of two fairl,y standard interpretaEions of probabllity: one is an
empirical-frequency and the other ls a subJectivistlc interpretaiion. BoEh of these are falrly well-known, each of them havlng
been clalmed by advocaEes to hold the key to the problem of
decl-sLon-rnakLng under situations of uncertalnty. Let us consider
ihen in turn.
The Frequency Interpretation: Probability statenents are
statistical hypotheses about the relative frequencles of occurrence
of events, subJect to confirmation and disconfirmation. They are
offen unknown. The arguments of the probablllty functlon (what we
refer to when we speak of the "probability of") are certaln classes
of events.
st^,AMY, CoNWAY, AND VON ZUR MUEHLEN
Interpretatlon: Probability statements
express personal opinions or subJective degrees of belief, and
are open to potentlal modiflcatlon so as to conform to Lhe rules
of the probability ca1cu1us. They can in no event be unknown.
The argumenis of the probablllty function are usually taken to be
statements, but they nay also be taken to be events.14
The SubJectivistlc
3. Differences
and similarities of results based on different vlews
While there are surprislng cases of agreenent among the various points of view, there are also many cases of disagreenent.l5
For example, the subJectivistst approach to statlstl.cal lnference
is essentially Bayesian; but it may, in view of a connection
between the frequentistic criLerion of adnissibility and fhe
subjectivistic coodition of coherency polnted out by Ei11 (1975,
pp. 556-557), Lead to neEhods formally ldent.l-ca1 wlth methods put
forward from a frequentist point of view.15 As pointed out by
T-so-me personalists, taking the betElng background rnore
serlously, had doubts about lncluding ln the dornain of che probability function statements whose truth or falsity cannot be
settled ln a predlctable flnite length of time, see Kyburg (1983,
pp. 23-24).
15 As we shal1 see below, a frequency interpretatlon of probablltty ls valld under the conditlons of a law of large numbers.
But any assertion of the "appropriateness" of the conditlons of a
law of large numbers is itself, lnescapably, an act of personal
Judgnent. On the other hand, ln the subJectlvlst approach, the
nixing meaaure p on [0,1] 1n terms of whlch the law of exchangeable
dichotomous trials can be represented as a mi-xEure of lndependently
and ldentlcally distributed Bernoulll processes, can be lnterpreLed
as a bellef about long-run relative frequency. Thus, subJective
Judgnents as well as the notion of long-run relatlve frequency
enter Lnto both frequentist aod subJectivist approaches.
16 Rigorous statementa of this connectioo are given by Lehrnann
(1983, p. 263) and Heath and Sudderth (1978). Speciflcally,
Lehrnann has presented a simple proof to show that any unique
Bayes estimator ls adnlssible, and lleath and Sudderth prove that
lf a bounded loss functlon ls speclfled, then a decision rule is
extended adnisslble (1.e., not uniformly domlnated) if and only
1f lt ts Bayee for some flnitely addltive prior. [We have already
stated lleath and Sudderthrs (feZAl result which establlshes the
close connectlon between the coherent lnferences and flnitely
additlve prlors. l
FOUNDATIONS
OF
ECONOMETRICS
51
Efron (1978), it is also true that while statistics, by R.A.
Fisherts deflnitlon, is lnterested ln sumnary statements about
large populations of objects, frequentlsts and Bayesians have
produced fundamentally dlfferent answers to the basic queeti.on
concernLng the cholce of sunmary statements most relevant to
drawing inferences from data.
4. Conditions for the relevance of frequency interpretatLon
The frequency lnterpretatlon is most prevalent. It takes
probabllity statements to be ernplrical statements that describe
lhe relative frequencles of occurrence of certain classes of
objecEs or events encountered in the world and does noE require
genuinely repetitive situations because the 1aw of large numbers
In
can also hold for nonidentical (and even dependent) trials.
where
of
law
of
large
numbers
do
any sltuaLlon
the condltlons
a
aoL hold (such as the variable that Eakes only one value), the
frequency interpretation of probabtlity nay be lncorrect. Even
in those situations where the condltions of a 1aw of large numbers
do ho1d, we may not knon exactly the lLmltlng frequencles ln
infinlte reference classes; thus our probabllity statements entall
assertlons about unknown rel-ative frequencies. Thls neans t.hat
probabillcy statements can at best be approxlmatlons to long-run
relative frequencies, and hence ArlsE.ote1lan logic does not apply
under a frequeney interpretation. The many-valued loglc applies
but requires the condltions of a law of large numbers and a
sufficientl-y large sanple to make the frequentist procedures
senantlcally conplete. [For a useful survey of the frequentistic
approaches to infereoce, see Efron (1982). I The c and B in
Elrnbaumrs evidential lnterpretatlon presented ln the preceding
section denotes frequentist probabllity.
5. Condltions for the relevance of subjectlve interpretations
Whlle the frequency interpretatlon posslbly conveys something
obJective about events, the subJectlvlstic lnterpretation is baslc-
52
SWAMY, CONWAY, AND VON ZUR MUEHLEN
ally psychologlcal. The representatlon of degrees of individualst
beliefs by probability statements ls fundamental to the latter
interpretation. Jeffreys and hls followers assume that unique
rational degrees of bellef exlst common to all rational mlnds
given the same factual lnforrnation. Good (1962) ca11s these
unique rational degrees of beltef crediblltries' a term whlch we
have already introduced. By eonErast, L.J. Savage, de Flnetti
and thelr fo110wers believe that different lndlviduals can have
different degrees of belief for the same event even when they are
glven the sane factual inforrnatlon. But these indivldually
varying subJective or personal degrees of belief are, in a sense
abstractl-ons. They are not necessarlly the actual degrees of
belLef of a llving individual but a rnodified version that satlsfies
de Flnetti's conditl-on of coherency. To illustrate' suppose that
an lndlvidual assesses dlrectly the probabllity Pi of each event
A1 in a partltlon of the universe, and then discovers that [iP1 =
P>1. If he were forced co bet simultaneousLy on each Ai occurring'
paying P1 for the chance to rtln a unit if A1 occurs, and winning
nothlng if A1 does not occur' then no matter which A1 actually
does occur, he would pay P unlts and receive one unit, and thus
be a sure 1oser. Using this example very effectively, Ilill (1975,
p. 557) argues that a rnethod of evaluatlng probabilities which
would make a llvlng human being a sure loser if he or she had to
act upon those probablllties (even hypothetically) would seero
suspicious and untrustworthy for any Purposes whatsoever, tncluding casual thought, and lt wou1d seem deslrable to remove such
incoherencles sherever possible. The theorems of total probabillty
and compound probability are only the lmrnediaLe coroLlarl-es of
the condition of coherency; that is, the nunbers representing
degrees of coherent bellef must satisfy a1-1 the axloms of the
probabil-1ty calculus.
While we are persuaded by llilLts (1975, p. 557) demonstration that incoherence is symptomatlc of a "basically unsound"
attitude, there ls no evidence that indlvldualsr degreee of be1lef
FOUNDATIONS
OF
ECONOMETRICS
do in fact satisfy the probablllty calculus, and, to the contrary'
conslderable evldence has been presented by Kyburg (t983' pp.
91-93) to the effect that they do not. That indlvldualsr bellefs
foLlows from the
-y oot conform to probablllty calculus also
study eonducted by Tversky (L974) who descrlbes three heurlstics,
or mental operatlons, that are enployed ln Judgment under uncer-
heuristics are not dlscarded even though they
occaslonaLly lead to errors. Tversky exhlbits the failure of
both laynen (untutored in the laws of probablltty) and experts to
iofer from life-1ong experl-ence fundamental statlstical rules
such as the role of prior probablllty or Lhe effect of earnple
size on sampling varlab111ty.17 This nay lead to a probl-en ln
Bayeslan inference. If we suppose a ful-1 preference ranking
aEong acts there are two posslbllitles. Elther the preference
ranklng ls coherent, or lt ls not. If lt ls coherent, we are al-1
set. If tt is not' then somethtng must be changed; unfortunately'
subJectlvlstlc theory wil-1 not te11 us erhat to change.
As Kyburg (1983, pp. 81-85) polnts out' there may indeed be
a rough lntuitLve connectlon between an lndividualrs degree of
belief in a starement S and the least odds he or she is willing
to offer ln a bet on S. But this connectlon is much too loose
to generate by itself a set of numbers confornlng to the probabillty calculus. The Dutch Book argument cl-ted by Kyburgr glves
excellent reasons for adopting a table of odds or publishlng a
list of preferences which conform to the baslc axloms of probabl1lty, but coherence ls inposed at the coet of destroylng the
lmedlate and intuitlve connectLon between odds and degrees of
belief that the argument orlglnally depend on. Indeed, at thls
point, we may fl-nd ourselves wonderlng lf there ls such a thlng
as "degree of be11ef." Thts skepticism is also Justified by the
fact that L.J. Savage uses two "structure" axloms which are
exlstential ln character, ln conJunction wlth five "rationality'
tainty.
These
reason, Lindley (7974, p. 181) thinks that a thor-I-F;Eis
ough drilling in the prlnclples of maxlrnun expected utlllty ln
the Laissez-faire schoolroons of today would not be anlss.
53
swAMY, CONI^JAY, AND VON ZUR MUEHLEN
54
to prove the exlstence of a unique probabillty distributlon
on states of nature [see Suppes (1974)]. Thus, "sorne of Savagets
axloms do not in any dlrect sense represent rationality that
should be satlsfied by an ldeally ratlonal person but, rather they
represent structural assumptlons about the environment that may or
nay not be satlsfled ln glven appLlcations" [Suppes (l-974, p. ]'62)l'
It certalnly follovs fron the rlgorous derivatlon presented
in DeGroot (1970, pp. 70-82) that a subJectlvlst cannot asslgn a
unique subJective probablllty to each event in an unamblguous
nanner if he ls unable to lmaglne an ideal auxillary experlment in
which a unlformly dlstributed random varlable X can be generated,
and if he is unable to compare the reLattve likellhood of any
event whl-ch was origLnally of lnterest to hin with that of some
other event of the forn {XeI = an lnterval}. Consequently' if the
distributions conatructed ln DeGrootrs roanner are construed as
yieldlng subJective probabillties--i.e., degrees of bellef--this
means that subjective probability asslgnments do not requl-re a
unique degree-of-be1lef functlon.18 Thus, approxlmatlons are
necessary to represent someoners oplnlons by a distribution,
thereby ruling out any apPlicatlon of Aristotellan loglc to this
axioms
sl.tuation.
frequentLst v8. posterior probabllity
Once we have a sufflclent and 1ogica11y conslstent sanpllng
model and a coherent prior, the derlvation of a posterlor distributlon, lf lt exlsts, can be stralghtforward [Ze11ner (1971)]. So
we sklp this derivation and proceed to point out the dlstlnction
between the frequentlst probabillty and posterlor probabillty'
since we have made rnuch of this dlstlnctlon in Example 3 and in
6.
On
T'-Tt ;froffi-be noted that de Flnetti only recommends the use
of finitely addltlve prlors, whereas I{111 (1976' p. 1000) Justifles
the use of countably additlve prlor distrlbution on the basis of
the usefulness of the approxlmations to whlch one ls led by means
of the latter distrlbutlons. However, a countably addltlve distrlbution nay provlde an adequate approxinatlon to more than one
degree-of-bellef function, thus destroylng a unlque connectlon
between prior dlstrlbutlons and degrees of bellef.
FOUNDATIONS OF ECONOMETRICS
55
the beginning of thls Appendix. As lndlcated by Kiefer (L977b,
p. 774), frequentlst probabi.l-ity (e.g., that a confidence lnterval
w111 by chance cover the true parameter value) refers to a chance
experlnent yet to be conducted, whl1e posterior probability (re1ative to even a physical prlor 1aw and given some observed value
of the varlable under conslderation, say Y = 3.17) does not. The
fact that one may bet in a siurilar fashion in the two circumstances
does not alter this dlstlnctlon because the nature of bets based
on confidence coefficlents ls dlfferent fron the nature of bets
based on posterlor probabllities. For a frequentist, it ls the
posterlor probablllty given the yet-to-be-observed Y that is
descrlbed in terms of an experlnent etll-1 to be conductedr gnd
shose 1aw of large nurnbers behavior Justlfles the way he uses
it when Y = 3.17. Of course, subJectlvtsts requlre no such
distlnction.
ACKNOWLEDGEI'{ENTS
in this paper are those of the authors and
reflect the views of the Board of Governors or the staff
of the Federal Reserve System, nor do they reflect the vl-ews of
the Departnent of Commerce. For thelr editorlal assistance and
helpful comments, the authorg are grateful lndeed to James Barth,
L.A. Boland, John Craven, Clark Edwards, Richard llaldacher,
Charles llallahan, Richard Heifner, Dennl-s EenLgan, Anl1 Kashyap,
Judlth Lathan, Hichael LeBlanc, DennLs Llndley, Thonas Lutton,
Jitendar Mann, Darrel Parke, Dale Poirier, Paul Prentice, Lloyd
Vlews expressed
do not
Teigen and Arnold Ze11ner. The authors are especially grateful
also to Michael Weiss for hls rnetlculoue revlew of our paper and
for lntroduclng us to ftzzy set theory arrd f'tzzy loglc. Thanks
are due to Nadlne Loften, ERS, USDA and Sharon Sherbert, Speclal
Studies, FRB, for thelr consclentious preparation of the manuacript.
REFEPJNCES
A., (1976). Comment on "Strong Inconslstency Fron
Uniforn Priors" by Stone. J. Amer. Statist. Assoc., 7L, L2L-L22.
Barnard, G.
SWAUY, CONWAY, AND VON ZUR MUEHLEN
56
Barnard, G. A. & Godambe, V. P.' (1982). Memorial article,
Allan Birnbaon L923-L976. Ann. Statl6t., 10, f033-1039.
Barnett
W.
A., (1983). New lndlces of money supply and the flexldernand system. J. Bus. & Econ. Statlst. , L, 7'23'
ble Laurent
(1977). On the elimination of nulsance parametera'
Statist. Assoc. , 72, 355-366.
Berger, J. O. & Wolpert, R. L., (1984). The Ltkellhood Princlple'
Lecture Notes-Monograph Series, Volume 5. Ilayward, Callfornla:
Instltute of Mathenatlcal Statlstlcs.
Basu, D. ,
J.
Arner.
Berk, R. I{., (1966). Linlting behavior of posterior distributions
when the model ls incorrect. Ann. llath. Statl-st., 37r 5l-58'
Berk, R. II., (1970). Conslstency a Posterlorl. Ann. l'(ath'
Statist., 41, 894-906.
Birnbaum,
A., (1977).
.rhe Neyman-Pearson theory as decislon
theory, and as lnference theory; wlLh a criticisn of the LindleySavagl argumenL for Bayeslan theory. Synthese, 36, L9-49'
Blaug, n., (1980). The Methodology of Economlcs or llow Economlsts
Expl-ain. Cambrldge, U.K.: Canbridge Unlversity Press'
Boland, L. A., (1979). A critique
J. Econ. Liter. XVII' 503-522.
of Friednaa's critics.
'
L. A., (1981). SaEisflcing ln nethodology: A reply.
J. Econ. Llter. ' XIX' 84-86.
Boland, L. A., (1982). The Foundations of Economic t{ethod.
Boland,
London: George Al1en and Unwin.
P., (L976). Sclence and statlstlcs. J. Aner. Statist'
797-799.
77,
,
Box, G. E. P. & Tiao, G. C., (1964). A note on criterion robustness and lnference robustness. Bi.ometrLka, 51' 169-173.
Box, G. E.
Assoc.
Box, G. E. P. and Tiao, G.C., (1973). Bayesian Inference in
Statistlcal Analysls. Readingr Massachusetts: Addlson{'Iesley'
Brldgman, P., (1928). The Logic
Macl-lll1on Conpany.
of
l'{odern
Physics.
New York:
R. K., Swamy, P. A. V. B., Yanaglda, J.F. & von zur
Huehlen, P., (L984). The lnposslbllity of causaLlty testlng.
Agrl. Econ. Res., 35, l-19.
Conway,
FOUNDATIONS OF ECONOMETRICS
CornfLeld, J., (1970). The frequency theory of probabillty,
Bayest theorem, and sequential clinical trlals. Bayesian StatisItcs (D. L. Meyer & R. O. Co11ler, Jr., Eds.) Itasca, I11lnois:
F. E. Peacock Publishers, Inc., f-28.
Cox, R.
T., (1961).
The Algebra
The John Ilopkins Press.
de
of Probable Inference. Baltimore:
Finetti, B., (1974). The Theory of Probability,
York: John l{lley & Sons.
Volume 1,
New
DeGroot, M.
I{., (1970). Optirnal Statlstical Decisions.
New York:
McGraw-Ili11 Book Company.
ll. H., (1974). Review of Sir Earold Jeffreysr Scientific
Inference, 3rd ed., J. Aner. Statist. Assoc., 69, 1052-1053.
DeGrootr
A. P., (1968). A generallzatlon of Bayesian inference
(wlth dlscussLon). J. R. Statlst. Soc., B, 30, 205-247.
Dempster,
Efron, B., (1978). Controversles ln the foundations of statistlcs.
Amer. Math. t-lonth. , 85, 23I-246.
Efron, B., (1982). Maxlrnum llkelthood and decision theory.
statist., 10, 340-356.
Ann.
Fisher, F. M., (1961). 0n the cost of approxiftrte specification
in sirnrltaneous equatLon estimation. EconomeErica, 29, 139-170.
M., (1953). Methodology of posltive economl-cs. Essays
in Positive Economics. Chicago: Chlcago University Press,
Priedman,
3-43.
Geisel, M. S. (1975). Bayesian comparlson of simple nacroecononic
Dodels, Studles in Bayesian Econometrics and Statlsttcs (S. E.
Fienberg & A. Ze11ner, Eds.) Ansterdan: North-Ilolland Publishing
Conpany, 227-256,
A. S., (1967). Functional form and utllity: A revLew
consuner dernand theory. Systems Fornulatlon, t'lethodology, and
Po1lcy Workshop Paper 6703, Unlversity of Wlsconsin, Madlson.
Goldberger,
of
I. J., (1962). Review of llarold Jeffreyst Theory of Probability, Third edition. J. R. Stattst. Soc. , A, 125, 487-489.
Good,
I. J., (1980a). The contributlons of Jeffreys to Bayeslan
StatlstLcs. Bayeslan AnalysLs in EconometrLcs and Statistlcs:
Eseays ln llonor of llarold Jeffreys (A. ZeLlner, Ed.) Amsterdam:
C'ood,
North-Ilo1land Publlshing Company, 2L-34.
SWAMY, CONWAY, AND VON ZUR MUEHLEN
I. J., (1980b). Explicatlvlty: A nathenatical theory of
explanatlon lrith statlstical applicatlons. Bayeslan Analysls
Ln Econonetrics and Statlstl-c6: Essays 1n Honor of llarold
Jeffreys (A. Zellner, Ed. ) Amsterdan: North-llol1and Publlshing
Good,
Company,
Good,
and
P
397
-425.
I. J., (1983). Good Thlnking, the Foundattons of Probablllty
Its Applicatlon. Minneapolis: Unlverslty of Minnesota
ress.
Ileath, D. & Sudderth, W., (1978).
coherence, and extended
On flnitely addltive prlors,
admissibllity. Ann. Statlst., 6,
333-345.
Ili11, B. M., (1975). on coherence, lnadmissibillty and inference
about many paraneters in the theory of least squares. Studles
in Bayeslan EconometrLcs and Statlsttcs (S. E. Flenberg and
A. Zel1ner, Eds.) Ansterdan: North-Ilolland Publlshlng Company,
555-584.
Hi1L, B. M., (1976). Revlew of de Flnettirs Theory of Probabllity,
Volume 2. J. Aner. Statist. Assoc., 71, 999-1000.
8i11,'B. M., (1981). On sone statlstlcal paradoxes and nonconglonerability. Bayesian Statistlcs (J. M. Bernardo, M. I1.
DeGroot, D. V. Lindley & A. F. M. Snlth, Eds.) Valencia, Spain:
Universlty Press, l6l-174.
Hil1, B. M., (1984). Dlscusslon of The Llke1thood Principle by
J.O. Berger and R. L. Wolpert. Lecture Notes+{onograph Serles,
Volume 6, Hayward, Californla: Instltute of Mathematical Statistics, l5l-174.
Jeffreys, II., (1961). Theory of Probablllty. Third edltlon.
London: Oxford UniversJ.ty Press.
A. M., Linnl-k, Y. V., & Rao, C. R., (1973). Characterlzatlon Problens ln Mathenatical. Statlstlcs. New York: John
Kagan,
Wl1ey & Sons.
KLefer, J. (L977a). Condltlonal confidence statenents and confldence estlmators (with discusslon). J. Aner. Statist. Assoc.,
72, 789-827.
Kiefer, J., (1977b). The foundatlons of statlstics - are there
any? Synthese, 36, J-5L-J-76.
E., Jr., (1983). Eplstenology and Inference.
Minneapolls: University of Minnesota Press.
Kyburg, I1.
,
FOUNDATIONS OF ECONOMETRICS
Lane, D. A., (1984). DLscusslon of The Ltkelihood prlnciple by
J. O. Berger and R. L. Wolpert. Lecture Notes{onograph Series,
Vo1. 6, Ilayward, Callfornia: Instltute of Mathenatical Statis-
tlcs,175-181.
tane, D. A. & Sudderth, W. D., (1983). Coherent and contl-nuous
inference. Ann. Statlst., 11, L14-120.
Lehmann,
E. L., (1983). Theory of Point Estinatlon,
John Wl1ey & Sons.
New York:
Lindley, D. V., <L974). Comments on "The neasurement of belief"
by P. Suppes and "Assessing uncertainty" by A. Tversky.
J. R. Statlst. Soc., B, 36, 181-f82.
Llndley, D. V., (1977). The distlnctlon between lnference
declsion. Synthese, 36, 5l-58.
and
E., Jr., (1976). Econometrlc policy evaluation: A
critlque. The Phi1llpa curve and labor -rkets, CarnegieRochester conference serles on publlc
, Volume 1
Lucas, R.
msterdam: North-Ho11and
Publlshing Company, 19-46.
Lyndon, R. C., (1966). Notes on
Van Nos t rand-Relnhold.
Loglc.
New
York: Anerican Book-
laddala, G. S., (1983). Lirnlted-Dependent and Qualitative Varlables in Econometrics, Canbridge, U. K.: Cambrldge Unlverslty
Press.
XcCawley,
J. D., (1981). Everythlng That Ltnguists Have Always
to ask). Chicago:
Wanted to Know About Logic (but were ashamed
The Unlversity of Chicago Prees.
Xuth, J., (1961). Ratlonal expectatlons and the theory of prlce
Eovements. Econometrica, 29, 315-35.
Pesaran, M. H., (1982). On the conprehenslve raethod of testing
uonnested regression nodels. J. Econonetrlcs, 18, 263-274.
Popper,
K., (L972). ObJective Knowledge, An Evolutlonary
London: Oxford UnLverslty Press.
Approach.
Iao, C. R. E Mltra, S. K., (1971). Generallzed Inverse of Matrices
and lts Appllcatlons. New York: John Wl1ey & Sons.
lao, C. R., (1973). Linear Statietlcal Inference
Second edltlon. New York: John Wl1ey & Sons.
and
ItB Applicatlons,
SWAMY, CONWAY, AND VON ZUR MUEHLEN
60
Renyl, A., (1970). FoundatLons of Probabillty. San Francisco:
Ilolden Day, Inc.
C., (1973). Logic,
Prentice-Eall, Inc.
Salmon, W.
Second
edition.
New
Jersey:
T. J., (1976). A classical nacroeconometrlc model for
the United States. J. Pollt. Econ., 84, 207-237.
Sargent,
Sargent,
T. J., (1979). Causallty, exogenei.ty, and natural rate
to C. R. Nelson and B. T. McCallurn. J. Polit.
rnodels: Reply
Econ., 87, 403-409.
L. J., (1981). Inpllcatlons of personal probablllty for
loductl-on. The wrltlngs of Leonard Jlmde Savage - A Memorlal,
Selectlon, Published by the Amerlcan Statistical Associatlon
and the Instltute of Mathematical Statlstlcs, U.S.A.
Savage,
Sins, C. A., (1980). Macroecononlcs and reality, Econometrlca,
48,1-48.
M., (1976). Strong inconsLsEency from uniform prlors (nith
dlscussion). J. Amer. Statist. Assoc. , 71, LI4-L25.
Suppes, P., (1974). The measurenent of bellef (with dlscusslon).
J. R. Statist. Soc., B, 36, 160-19f.
Stone,
P. A. V. B., (1980). A comparlson of estlnators for underslzed sanpLes. J. Econonetrlcs, 14, 161-181.
Swamy,
P. A. V. B. & Tlnsley, P. A., (1980). Linear predlctlon
estinatlon methods for regresslon models wlth stationary
stochastic coefficlents. J. Econometrics, 12, 103-142.
Swamy,
and
P. A. V. 8., Barth, J. R., & Tlnsley, P.A., (1982). The
ratlonal expectations approach to economlc modellng. J. Econ.
Dynam. and Contr., 4, 125-L67.
Swamy,
P. A. V. B. & ilehta, J. S., (1983a). Further results on
Zellnerrs mininun expected Loss and fu11 infornation marinum
llkelihood estlnates for underslzed samples. J. Bus. & Econ.
Swamy,
statlst., L,
Swamy,
154-162.
P. A. V. B. & Mehta, J. S., (1983b). Ridge regression
of the Rotterdam nodel. J. Econometrlce, 22, 365-390.
estl-matlon
P. A. V. B., von zur Muehlen, P., Ti-nsley, P. A., & Farr,
H. T., (1983). 0n loglcal valldlty and econometrlc nodeLllng:
The caee of money suppl-y. Special Studles Paper 180, Federal
Swamy,
Reserve Board, Waehlngton, D.C.
6I
FOUNDATIONS OF ECONOMETRICS
Thell, H., (1971). Princlples of Econometrics,
New York:
John Wlley & Sons.
Thell, 8., (1975). Theory and Measurement of Consumer Demand,
Volume 1. Amsterdam: North-Ilolland Publishlng Conpany.
Tversky, A., (1974). Assessing uncertalnty (wlth dlscussion).
J. R. Statlst. Soc., B, 35, 148-159 and 175-191.
Vi11egas,
Statlst.
C., (1977a). Inner statistical inference. J. Aner'
Aeeoc. , 72, 453-458.
Villegas, C., (1977b). On the repreeentation of
J. Aner. Statist. Assoc., 72, 651-654.
Lgnorance.
Villegas, C., (f981). Inner statlstical lnference II.
Statlst. , 9, 768-776.
WalLJ-s,
Ann.
K. F., (1980). Econometrlc lnpllcatlons of the ratlonal
expectatlons hypothesls. Econometrlcar' 48, 49-74.
I{llder, R. L., (1952). Introduction to the Foundatlon of Mathematicg. New York: John I,llley & Sons.
Zellner A., (1971). An Introductl-on to Bayesian Inference ln
Econometrics. New York: John Wlley & Sons.
A., (f983). Statlstlcal theory and econometrics. I{andbook
of Econometrl-cs, Volume 1 (2. Grlliches & M. D. Intrlllgator'
Eds.). Amsterdam: North-Halland Publi.shlng Conpany'
ZelLner
ECoNOMETRTC REVTEWS,
4(1)
, 63-67 (1985)
COI.,IMENT
fn thls brief connent on the paper by Swanyr Conway and
Muehlen (SCM) I wltl present an al.ternative lnterpretation of the
'
logic of the sltuation faclng econonetric theorists. At the
outsel f wish to note, perhaps for obvious r€ssonsr that I accepl
alnost everything SCM discuss concernj.ng Arlstotlefs axions for
adnlsslbility of statenents into logical argunents. However, 1t
ls not necessary to base one I s appreclatlon of logic on the
suspicious nanalytlcal-synthetlcn distinctlon as SCM do in thelr
footnote 2 (see instead, Qulne 1961). Furthernorer any
conslderatlon of stat,enents about the future does not necessitate
a reJection of the traxion of the excluded-middlen slnce an
adnisslble statenent ls stlll either true or false even though we
nay not yet know 1ts truth status (see Bo1and 1982, Chapter 6).
l{ith these two exceptions notedr I will expLaln why, while I agree
with SCMfs reconnendatlon that the naxlom of noncontradictiontt
should not be abandonedr f dlsagree with thelr reconmendation that
ve should reject Aristotlers axlon of the excluded-mlddle in favor
of some form of nnany-valued logietr.
The najor polnt to be nade in favor of the axion of
noncontradiction ls that any argumenl that vlolates it can be used
to prove anything (see Popper 1965). Startlng with two
contradlcfory assumptions, nthe sun is shining nowr and nthe sun
is not shining nowrr, Popper shows how easy it is to infer that
iCaesar was a traitorn or if one wishesr to infer its denial
rCaesar was not a traitorrt using the sane two (contradlctory)
assurnptions. f doubt anyone wishlng to construct a convinclng
logical argunent ln favor of the truth of a given statenent rrould
63
Copyrtht
@ 1985
by Marcol Dekkcr, Inc.
07 47 -493818s
10401{063$3.s0/0
ever be satisfied nilh an argunent that 1s also capable of
providing support for a denial of that given statenent'
The only theorists that are ever satisfied with bulldlng
argunents that vlolate the axlon of noncontradiction are those
desiring lo bu11d Marxian nodels of soclal dynanlcs (where
contradictlons are necessary elenents in a dialecttcal process).
unless this ls oners purpose for employing econonetrlc techniquesr
there is no need to reJect the axion of noncontradiction'
While lt is dlfflcult to see why one should give up the axion
of noncontradlction simply because G6del wants us to worry about
the loglcal conpleteness of fornal systems' it ls even more
difficult to see why the anti-fornalistts rejection of indirect
proofs nust necessitate a rejection of the axion of the
excluded-niddle .
To avoid any nisunderstandlng
herer let ne illustrate
the
role of the axions of noncontradictlon and the excluded-niddle in
lndirect proofs. Consider the form of an lndirect argunent in
favor of proposlllon P glven the truth of assunptlons A' and At'
An lndirect argunent is (like any arguloent) a conpound statenent'
It says that since all its parls are true, any conJunction formed
by a valid rearrangement of its parts is true' In lhe case of an
indlrect argunent ln favor of P the coniunction is forned by
conjolning the denlal of P (not-P) wlth A.| and Ar' An indirect
proof shows tha! such a conjunction contains a contradictlon. And
since A.| and A, are accepted as truer by the axlon of
noncontradictionr one cannot also accept the truth of not-P.
That 1s, accepting the truth A, and A, reQuires the rejection of
the falsity of P. ff P ls not falser what is it? Wel1r lf you
also accept the axion of lhe excluded-nlddle, then P ls true or
it is false. In thls case then, not-false is the sane as true
since there ls no olher optlon. Thls ihen constltutes a proof of
the truth of P (given A' and Ar) bf showing that not-P is false'
Historicalllfr aoo€ nathemaiicians (e.g. r rintultionistsr)
would always have us reJect indirecl proofs. Nevertheless, Euclid
FOUNDATIONS OF ECONOMETRICS-COMMENT
65
supposedly provided an indlrect proof of his sixth proposition
that nif two angles of a triangle are equal then the sides
opposite are also equaln. I rnentlon this because there are many
propositions that are conmonly accepted on the basis of indirect
proofs. We should also recognize that every indirect proof
lnvolves both the axiom of noncontradlction and axion of the
excluded-niddfe. It 1s true thal if we were to give up eilher
axionr we would have to avoid relylng on any lenna or theoren
vbose proof ls indirect, but avoidlng lhe use of indi.rect proofs
of the axion of the excluded-middle !
Whlle it night be possible to axionatize the intuillonistsl
denial of the axiom of the excluded-niddle based on a denial of
indirect proofs, the real issue with the intuitionists was the use
of the concept of lnfinity ln proofs since the concepL of infinity
always refers to an impossible quantity (see Boland 1985' Chapter
5). It ntght nake nore sense to reject the use of the concept of
lnfinlty and retain the axlom of the excluded-niddle.
By advocaiing sone forrn of nany-val'ued or nfuzzyn loglc' SCM
yish us to replace Aristotlers axion of the excluded-niddle with
another axion whlch woufd aIlow adnlsslble statenents to be
sonething other than true or fa1se. Aristotfefs axiorn of the
excluded-niddle would need to be replaced because it would seen to
contradlct the econonetric model-bullderts use of Bayesr theorem.
Could we not avold such a contradlction by ensuring that whenever
does not require abandonnent
tJe use Bayeslan estimatlon technlques neither we nor the economic
theories that we have modelled have enployed a lenna or theoren
rhich has to be pqoven indirectly? If instead we htere to fol1ow
lhelr advicer we night also need sornething to replace the axj.on of
nonconbradictlon. If statenents can be nnany-valuednr what
conslitutes a contradiction? Presumably' a statement cannot have
two different lruth values. Butr can we say that a staLenent
nhj.ch is, by any neaaurement, .5 true is also .5 false? Is this a
contradiclion? Just rhat does the axion of noncontradicllon mean
rhen ne abandon the axlon of the excluded-nlddle?
BOLAND
by SCM ln Sectlon 6 of their paperr we could
retreat fron the problen of ensuring that the concepts and
assunptions of econouetrie theory satisfy Aristotlers axions of
logic by instead 1i-miting applications of econonetrics to
ninnediate practlcal problemsn. That is, we could slnply take an
instrumentalist-type stance and thereby reject any need to ensure
that our'econonetric assunptlons are adequately based on accepted
logleal principles such as the axion of the excluded-midd1e. But
scM reject such instrumentallsn, quite correctly I thinkr because
they do not wish lo restrict econonetrics to inrnediate practical
As reeognlzed
problens.
But is current econonetric theory capable of dealing with
anything nore than lnnediate practlcal problens? Even if we limit
our extention beyond innediate practical problems to hypothesis
testingr it is not clear that nuch ean be acconplished. Every
econonetric nodel of an econonic theory yields anbiguous results
whenever one ls realty trying to test whether the theory ls lrue
or false (see Boland'1977 and Cross 1982). At bestr the queslion
of testing involves strategies and preferences concerning whether
alpha-type or beta-type error is the least preferred. Neynan and
dld not actually solve a theoretlcal problen. when rre
recognize the question of strategies and error-type preferences'
we are actuall-y offering an instrunentalist strategy to deal with
practical problens (e.g., whether lo vaccinate a population
agalnst a vlrus and thereby risk infecting that population)' No
matter holr nuch we nay disllke lnstrunentali.smr il seens difficult
to avold instrunentallsrn whenever econonetrics is invol"ved.
What more do SCM want econonetrics to do? The founders of
the Econonetric Society dld have greater thlngs in nind. For
exanple, Frj.sch saw econornetrlcs as a nunificatlon of the
theoretlcal-quantltative and the enplrical-quantitative approach
to econonic problensn with a nconstrucLive and rlgorous thinklng
slmilar to that whlch has cone to doninate in the natural
sciencest (1933' p.1). Schunpeter saw the ains of econonetrics to
Pearson
FOUNDATIONS OF ECONOMETRICS-COMMENT
67
be nflrst and last sclentificn while stressing nthe nunerical
aspectn to be able t,o nexpectr from constant endeavor to cope wlth
the difflcultles of nunerical work, a wholesoroe disclpline ...
[that] helps tn building up the econonlc theory of the futurer
(1933, p. 12).
It would seem that SCM have shown that the econonetrics of
the founders wiII forever renain as 111-founded npipe-dreansn.
And so long as econonic theory is built using ordlnary loglc as
represented by Aristotlets axions, to be consistently applled lo
current econonic lheory, econonetrlc lheory nust also be based on
Aristotlers axions. For thls reasonr a logical foundation for
econonetric theory that denles any of Aristotlers axions cannot be
used to nodel ordinary econonic theory. Thus' lhe applicability
of nfuzzyn econonetric theory would seen to be linited to only
bullding econonetric nodels of lfuzzyn econonic theory.
Lawrence A. Boland
Simon Fraser Unlversily
ADDITIONAL REFERENCES
Boland,
L. A., (1977). Testabllity in
Boland,
L. A., (1985).
African J. of Econ., tl5, 93-105.
Methodotogr
Boston: Allen and Unwln.
Econonic Sci.ence. So.
for a New Microecononlcs.
Cross, R., (1982). The Duhen{uine thesls, Lakatos and the
appralsalrof theorles in nacroecononics. Econ. J., 92, 320-\0.
Frlsch, R., (1933). Editorial. Econometrica, 1,
1-4.
K., (1965). What is dlalectic? ConJectures
Refutations. New York: Baslc Books, 31'2-35.
Popper,
and
I{. V. 0., (1961). Tvo dognas of enpiricisn. From a
Loglcal Point of Vlew. New lork: Harper Torchbooks, 20-46.
Quine,
Schunpeter,
J., (1933).
Econonetriea, 1, 5-12.
The couron sense
of econonetrics.
ECONOMETRTC REVTEWS,
4(1), 69-74
(1985)
COl.tl'tENT
I shal,l refer to the pa.per under disctrssion ' and t'o ibs authorship '
by scu. It was nob clear Lo me rrhether it had ansuered lhe question
in iLs bible. since the empllasis was on multivalued logic' Perhaps it is
neither true nor false that bhey ansflered [he question. The comment in
the conclu.sions, "The foundabions of econometrics are ireaker than those
solely $rpporting economic theory under celtainLy" sholrs lhat scril
believe bhat hhe foundaLions do exisb but are not solid. It is
qrestionable rrhebher the foundahions of :ury science are enbirely solid'
not even bhe fotrndabions of rnathemalics or logic. The ambiguiLies o{
language almosL preclude the solidity of foundations. This is rrhy il wa-s
srch a good idea of Turing's to base lhe foundations of malhematics on
a carefully specuied comFrter, br! even so he had to rtake lhe
irdgement thal any Possible calculation corld be carried out on such a
coDF:ter.
is so much in the Paper thal in this discussion I can louch
cr only a fraction of i[. I agree wilh rnany oi the comments' but in
ttds discJssicn I shall concentrate more on points of disagleemenb.
one criterion that scu have used, to eursarer lhe question in lheir
title, is whether Aristotelean tro-valued logic is adequate for lesting
econometric bheories. Buu lhe discussion is really more general since
it deals wibh scientific inference as a whole, and not only with
inference in econometrics. Inference usually requires a theory of
There
probabitiby lthich certainly goes beyond Aristotelean logic.
Aristotle said, if the tlanslalion is Lo be Lrusted, "It, is Lherefore
pfain that it is not necessaty that of an affinnalion and a denial one
stpuld be true and lhe other false. For in lhe casb of bhab lthich
69
Copyrigbt @ 1985 by Marcol Dckker, Inc.
o7 47
4938 l8s lo4o I {069$3.5010
IU
GOOD
exists potenllatly tnrt nob actually, the rule which applies to lhat which
existsactrrallydoesnobholdgoo,d.,'(Thereferenceisgivenafterthe
where
next quobablon. ) But bhi.s is conLradicted in the table of conlents
ib is staled (and I agree somewhaL more with this) that "propositions
il is not
whj.ch refer to fulure [ime musb be either true or false ' brrl
194I'
deternined hrhich must be Lrue and which false"' (l{cKeon' ed"
pp. 38 and /.8; oE Eubchins, €d., 1952' Volume 8' PP' 23 and 29')
ebo Aristotle discussed probability Lo some exLent' Thus Arisbotle'
unlike
Iike other humans, went beyond Arislotelean logic brt PerhaPs'
others, vrithout often contradicting it'
I don't know whether Aristotle ever discussed the vagueness and
a
anbiguily of ordinary language. For example' lhe question rdhether
yes or no
man has a beard cannot alrrays be ansrrered with a clear
The
because beardednes€, like mosb Lhings, is a matber of degree'
man'sfacemaybemerely'|'zzyaflelseveraldayswithotltshaving.
This is a clear and simple exanPle showing Lhe need in princiPle for
must
non-ArisboLelean fuzzy logic. Tha! Uhere are degrees of meaning
havebeenrecognizedcen|uriesago.Istillbelievel,ha|Isaidin
diJfiorlt
Good (f95o' page ln) that "there are senlences for which it is
to decide rrhelher lhey are proPosilions. such 'par[ial ProPositions'
ofbenoccurinthepioneeringworkonnegscienti'ictheories''.
Aristotelean logic could be largely resLored to Lhe question of
beardedrressbydefiningabeardbymeansofarrarbilrarylhresholdon
theweighboffacialhair.Ittotldbenecessaryboshavelhemanto
breigh Lhe hair, so Uhe Uest would be "destruc[ive" (to use a term from
qualitycon|rol).AlsofurbherquestiorrswouldariseslrchasiJheredoes
Lhe face end and the neck or head begin'
If something atst concrete as a beard c'ur be ^tzzy ' the fuzziness
of more abstract concePts i-s all the more Lo be exPecbed' An example
of a very imPortanl fuzzy concept in the PhilosoPhy of science is that
of slnpu,city or comPlexily, and bhe allied concept of parsimony of
in
assumPtions in hyPotheses or bhecries' Is simPucity to be defined
LermsofnumbetofParameters'shot.fdthecomPlexiuyof[henumerical
valuesoftheparametersbeincluded,issirnpticityjust'brevityandU
}.OUNDAT]ONS
OF
ECONOMETRICS-COMMENT
7L
so in hrhat language? But, in my oPinion' the fuzziness of simplicity ot
IErsimony (which
is
Lhe same thing) and comPlexity does not by any
leans make them "useless as crileria for model choice" as staled by
SCH in lheir Abstrac!. on bhe other hand I hhink bhey are absolutely
essenbial for [hat purPose. To take a familiar examPle' given dala
(xt, yt), (x2, Yz), ..., (xn, Yn)' we can find a Polynomial fil y =
f(x) of lhe (n - 1)th degree ttEb exaclly fits the data. I hope scr't
nrld nob alhrays prefer lhis fib, for prediclive [rrrPoses ' over
polynomials of lower degree (which are more parsimonious) ' what
reason is Lhere, other than simplicity or parsimony , tor preferring a
polynomial of lower degree? Another exannPle where parsimony (or
sirpficiby) is of the essence is lhe Preference for lhe BuUiau/Nerrton
rrilrerse square law of gravita[ion (Brrlliau' 1645, p. 23) over the
epicycle bheory of uhe motion of lhe planebs.
scM say in their conctusions' and elsewhere, that Probability
tJEory i.s a version of nany-valued logii, and ib certainly can be so
regarded. But ib also obeys the u.sral larrs of Arisbolelean logic. If a
poposition is now only probable for us, lhen we don't know now whether
t is Urue or false, though we might knoh, later, and it can be true or
rrlce sysn if we never find or.rt which. similarly lhe statement Lhat
*thing i-s approximately true can itself be absolutely true if anvthing
caa be.
scl,t say that 'The statement that an event occurs with Probabiliby
p ...
violaLes bhe axiom
cit rer O
or
1.
" It
of
bhe excluded-middle unless
p is
seems to me !hai. no violabion of the law
alt{ays
of
bhe
cacluded middle follows from this argument, whebher the probability is
ld€rpreted in the long-run frequency sense or in the srbjective
(personal) sense. Ijf uhere
is a violalion j.t is
because Probabilities
right nob take sharp values so bhat uhe sLatement Lhab a Probability is P
!s somewhal unclear. It night mean bhaL lhe probabilily Ues between
p j| €, for some €. It night be true or false thaL the long-run
relative frequency of getting heads, when bossing a coin, Iies bebween
O.45 and O.55 (provided bhat lhe long run is not too long). I{e don'U
r€d double-headed or double-tailed coins in order lo use bhe lan of
72
the excluded middle.
An interesting question is how one nighb measure degrees of
mearring, or degrees of belonging to a set as in the theory of frtz.zy
sets. Take the excrmple of the fuzzy beard. we night define it as
belonging to degree x, to the class of beards if a fraction x of some
specified pop$ation of people would caII i.t a beard if they were forced
to say either Uhab it is or is not a beard. According to Uhis
definition, degrees of belonging depend on nhat poprlation of itrdges is
assumed. I do not know whether bhis proposal ttas been made in bhe
extensive liberature on fuzzy sets. Also I do not know enough of the
liberature to judge \dhether bhe theory of f,Lzzy sebs is useful . ft is
not, by Lhe way, the same as a theory of partially-ordered
probabilities. This I once chec*ed in conversation with zadeh who was
probably the first person to study the theory of, fvz.zy sets intensively.
rt deals with degrees of meaning, not degrees of probabiliby.
on another topic, scM state that the problern of induction is
impossible to solve. lf they had said it has no complete solution' I
would have agreed. Part of the solution is bo drae a distinction
betireen induction from previous experiences to a general law on the
one hand ("universal induction") and only to bhe next case on lhe other
( "predictive inducbion" ) . The la[er form of induction is more secr.rre !
see also Good (1983, pp.2O5-2O7) where an abtempb is made to
quantiJy the matter. complete solutions to Uhe problems of bhe
philosophy of science are hard to come by. In view of Lhe comrnents in
their conclusions, scl,l might partly agree qualitabivel.y wilh ny comments
on induction. They refer bo ny criticisn of Jeffreys's argument for
(predictive) induc[ion. l,ly argument did show that Jeffreys's argumenl
did not enbirely solve the problem of induction. Bub I Uhink ScU
overstate their case when they repeatedly say there is no solution bo
bhe problem of induction. Parbial solutions can b€ valuable.
I an sympathetic to the views of SCU on instrumentalisn, bub
would like to add a cotrple of comments. (i) An instrumentalist theory
can be true or false provided that ib is formulated as a neans for
naking predictions. Iite don't have bo give concrete interpretations to
FOUNDATIONS
OF
73
ECONOMETRICS-COMMENT
uE linear operators or
wave functions
Schroedinger fonntrlabions
of
of the
Eeisenberg and
quantum mechanics'
yet bolh
formulabions
brue. The basic philosophical question is rrhether all bheories
are insbrumentalist. (ii) when discttssing Friedman's instrurentalisn'
sclt say bhat "in bhe presence of uncertainty' the neaning of 'Lrue or
refuI predicbon' is not clear." I agree' yet some clariJication is
pcible. For example, iJ we have bwo predictive theories !11 and E2r
ttEre are various ways Uo decide which is in some sense the better
tlEory. one apProach is by means of the concept of explicativity
(c@d, I97?; Good & ucuichael' 1984), though ib does nob stfficiently
u*e into account bhe question of comPlexity of bheories. The problens
d philosophy are seldom completely solved, bub partial soluLions are
esilable and can be useful . There are some commenbs in scH's
can be
cdrlusions that are consisbent with this
somerrhal mundane sentiment.
ADDIEONAI, REFERENCES
brlliau, Ismail (Bullialdus) (f6/.5). -@
philolaica opus nowm. in quo motus planetarun nouam ac veram
hypothesim demonstrantur. . . (Pariis, SunPtibus s. Piaget).
cited by Jeffreys (1957, P. I32n).
Goodr I. J. (1950). Probabilibv and the weishing of Evidence
(tondon: charles Griffin; New York3 Hafners).
eood, f . J. (f97?). "ExPlicativity: a mathemabical theory of
explanation Ltith stabistical apPlications"' Proc. Rov. soc'
(London) A 354, 3o3-33O. [RePrinted as (198Ob).]
{bd, I. J. (f983). 'The
of a Nerarchical model for
nrlLinomials and contingency tables", in Fclenti4g.-&l9g '
Data Analysis. and Rohrsbness (G. E. P. Bo:<, Tom f,eonard &
Chien-Rr lfu, eds.; Academic Press).
bd, I. J. & UcHichael, Alan (1984). "A Pragmatic modification
of explicabiviby for lhe acceptance of hyPotheses" ' Philosophv
of science 51, L2o-L27.
Elrins, R. ll . (edilor in chiet) (1952). -@
robustness
74
World.VolumeS.Aristotlel.chicago:EncycloPediaBritannica
Inc.
Jefireys, H. (1957). Fdentific hference, 2nd edn' cambridg€l
University Press.
llcKeon, R. (ed,) (1941). The Basic works of Aristotle' New York:
Random llouse.
I. J.
va.
Good
of
stabistics
Polytechnic lIlst.
DePt.
and Stale Univ.
Blac*str.rrg,
Va.
2t O5L
ECoNoMETRTC REVTEWS,
4(1), 75-80
(1985)
col{uElrT oN
"THE FOI'NDATIONS OF ECONOUETRICS _ ARE TIIERE AI[T?'
\ION ZT'R MT'EIII,ET
Swamy, Conway and Muehlen (sCM), have written a thought
provoking paper addressing a number of jmportant philosophical
j'ssues that surround the theory and practice of econometrics.
l.Iany of the issues that they deal with are of general
philosophical interest and have been the subject of an intensive
on-going debate in the philosophy of science literature.l
ttre
problem of induction, the J-funitations of naive falsificationism,
the issues concerning Aristotle's tlro-valued aygtem of Logj-c,
and the alternative interpretations of probability discussed by
SCIII are al]. matters that have pre-occupi-ed philosopherg and
logicians for centuriea and remain largely un-resolved. As
Chalmers puts. it, with characteristi.c clarity,
"Ttrere is just no nethod that enabLeB sci.entific
theories to be proven true or even probabLy true ....
attempts to give a sinple and straightfonrard logical
reconatruction of the "scientifi.e method" encounterg
further difficulties wlren it is realized that there i.a
no method that enables scientific theories to be
conclugively disgroved either.' (p. xiv).
seen from thiE general perspective, one of scM's nain
ooncluaions that the truth or fa.l-eehood of economic theories can
Copyrtbt
@ 1985 by Marcel
Dckkcr, Inc.
07 47
4938 I 8s
1040 I -0075 $3.50/O
76
l
PESARAN
not be estabfished by econonetric methods is hardly surprising,
and follows directl"y from what we already 'know" from the
philosophy of science u.terature. If even in (hard) sciences
Iike physics, there is no way of conclusively Proving the truth
or falsehood of theories then thi"s must be even rnore so in
(soft) sciences uJ<e economics which are subject to additional
ambiguities arising largely from the socio-psychological nature
of hunan behaviour. 1[he methodological Problems surrounding the
use of econotnetric methods for the Purpose of testing economic
theories have long been recognized in the economic literature.
the early debate between Keynes and Tinbergen over the role of
econornetrics in testing of businegs cycle theories is one
prominent example urhich clearly brings out the limitatj.ons of
econometrics as a nethod of testing economic theories.2
Faced with the inevitable concLusion that econometricians
can not conclusively eatab]-ish the truth or falsehood of
economic theories, sclil advance the controversial view that the
Aristotelian system of two-valued fogic is inapplicable to
econometric rnodels, and that a system of many-valued logic is
needed if one is to justify the econonetric Practice. According
to scM the econor€tric practice violates Aristotle's axioms
prfunarily because of the uncertainty invo).ved in the nodeUing
In particular they argue that
of economic behaviour.
Aristotle's axiom of "the excluded-middle" is violated in the
case of econometric nodets as these npde].s can only be specifj-ecl
in a probabilistic manner. They regard any probabilistic
stater€nt as a violation of the axiom of the excluded-middle.
In their own words
"t'he statement that an event occurs with Probability p
or that a variable takes on interval of values rtith
probability p violates the axiom of the e:(cludedmiddle unless p is alnays either o or l' (P.36).
FOUNDATIONS
OF
ECONOMETRICS-COMMENT
point of view. The fact that the truth
or falsehood of a propositj.on i9 not known a priori, does not
necessarily render that proposition indeterminabe (i.e. neither
true nor false). I'he axiom of the excluded-middle states that
every proposition ia ej.ther true or else false, although which
of theae j,s the case may well be indeterminable as regards our
knowledge of it. I'he problen with SCI-t's argument lies in the
fact that they regard the indeterminacy of the truth-value of a
proposition relative to our knowledge aEr evidence of itg
indeterminasy in general. I|lrey state
ftlis is
€rn extreme
"If what xte know today about the future is not a Part
of our deterministic knowledge, then our statenrent
about the future c.rn not properly be called either
true or false (at least not by our speaklng 7n tlra
present) and so deserves a different truth value,
intermediate between truth and falsity" (p.36).
In this quotation the authors justify their viey about the
violation of the a>(iom of excludedqniddle on the grounds that
relative to our knowledge "today", the truth or falsehood of a
statenent about the future is not determinate. But this does
not necessarily imply that the truth-status of the proposition
about the future is intrinsically indeterminate. lrthether a
sealed envelolE contains a red or a bihite colour tid(et may
(relative to our knowledge) be indeterminate, but this does not
Ean that the proposition
"Irhe envelope contains a red coloured ticket"
does not adnit a determi-nate truth-status.
Ille econornetric exanples of the vj-olations of Aristotle's
axions given by sCM are also baaed on a sileilar kind of misunderatanding. In exirmple I the authors regald the existence of
78
PESARAN
two alternative forms of aPproxinations for the denand systems
as evidence of the violation of Aristotle's axiom of excludedmiddle. Ihey do not, however, erq)Iain why this should be so'
If the original ctenand system is taken to be the truth, then
from a strictly logicat viewpoint both approximations are false.
The adequacy of aPProximations is an empirical rather than a
logical queation. t'lre fact that t e may not be able to
conclusj.vely reject one aPProxination in favour of the other in
particular apptj.cationa does not rnean that the truth-statuE of
the approxinations relative to the original exact denand sltstem
is indeterminate.
In exanl[)le 3, the authors argue that the Rational Expectations tlPothesis (REH) violates Aristotle's axiom of the
identity of meaning, because it involves a juxtaEosition of the
concept of the subjective probability distriSution with the
concept of the objective Probability distriSution. lfhey regard
this as evidence that two different notions or meanings are used
What the authors overlook,
for one concept of probability.
hoyrever, is the fact that under the REH the two concepts are by
ctefinition indistinguishable from one another as they will be
one and the same thing. only in situations where learning is
j-ncoru,lete, is there .rny possibility for the subjective
to diverge from the objective
probability distrilution
probability distribution.
But in such a circumstance tlre REH,
in the strong sense advanced ry iluth, will not be applicable in
any c.rse. Ehe issue of whether the nEH is a valiA h1E>otheeis or
not is again an erqPirical mater for wtrich tlrere seerns to be no
conclusive angwer.
In short I am highly scetr'tical of the authors' claifi that a
system of nany-valued logic providee a fir.trler foundation for the
analygis of econonetric models. sernantj,cg al)art, I altl also not
at all sure what relevance all thege philosophical consider-
FOUNDATIONS
OF
ECONOMETRICS-COMMENT
79
ations have for econonEtrj-c practice. The Problemg the aPPlied
econolptrician faces in obtaining a model which is relevant to
the particular question, is derived from a logically consistent
econonic theory and which rePresents the data adequately' are
not lilety to be resolved by a course in nany-valued logic'
(:Ihese problens are discusEed in PeEaran and Smith (1985b))'
rhite I fully endorae the authors' vien that rte muEt alwaya be
prepared to check the Predictions of our nrodel against those of
it8 Cotu)etitors, I am not convinced that reaort to many-valued
logic is going to heIP us rtith this alifficult task. Any nove
away from the logical rigour of the Aristotelian Principlea is
lilely to make the outcorne of confronting competing theories
more rather ttran less inconclusive.
u. Hashem Pesaran
Trinity college
Cambridge
FOCrII|C'TES
lFor a highly readable introductory survey of the philosoPhy of
science literature see Chalnerg (19?8), where relevant referenceg to
original sourcesr can also be found. Many of the lEthodological
issues raiged by SCU are also covered etctensively by Imre LatakoS'
(19?O) ercellent account of the basis of the nethodological claah
between PoPPer and Kuhn.
2por the relevant references and re-examj,natj.on of the KelmeETiribergen debate in the light of recent developrnents in econolptrics
nee Pes€rran and Snith (1985a).
PESARAN
80
ADDTTIONAI, REFERENCES
A.F., (I9?8), What is This Thing Called Science?
University Press.
Chalmerg,
OIEn
f11e
Lakatos, I., (1970), Falsification and the nethodologD, of
scientific rese€rrch prograrmes. Criticism and the Growth of
Knowledge (I. Iakatos and A. Itusgrave, Eds. ), cafibridge
University Press.
Pesaran, M.H. and Snith, R.P., (1985a), Kelmes on econometrics.
Kelmes' Econoroica: Methodological, Igsues (T. IJawson and
!!.t1. Pes€rran, Eds. ), Croom tlelm.
Pesaran, U.H. and $nith, R.P., (1985b), Evaluation of nacroeconometric tnodels. Economic llodelling, 2, L25-L3+,
ECONOMETRTC
REVTEI{S,
CoHERENCE,
4(1), 81-91
(1985)
"II'fROPER" PRTORS, AND FrNrrE
ADDTTTVTTY
claims span important contributions from Aristotle to Zel-l"ner. I shal1 confine myself,
however, to issues of coherence of (finiteLy additive)
probability, "improper" priors, and representation of
Thus, my comment is focused
"ignorance" by such probability.
57,3
on
of their essay.
As Jeffreys showed nearly half a century ago, many farniliar
(textbook) "orthodox" statistical procedures have a Bayesian
nodel under an "improper" prior. (see 52,1 of his Ttreory of
Probability (1961).)
Exampler: Let x 'r, N(0rl), with 0 unknown. Ttren flJeffreys,
(1961, p. 137)] ordinary confidence intervals (and also fiducial
intervals) for Q are Bayes posterior probabiLities, provided the
"prior" for 0 (used in calculation with Bayes' Theorem for
densities) is the uniform, "improper" (Lebesgue) density.
Jeffreys (1961, chapter 3) argued for the existence of
"ignorancet' priors, Eo be used to represent an agentts uncertainty
quantities prior to the reLevant
abou! unknown statistical
just
the general background assurnptions of,
observations and given
e.g., B statistical mode1. His theory of Invariants (1946),
incorporated in the 2nd edition (1948) of the lheory of Probability,
offers a tractable nethod for cal.culating such "ignorance" priors
Not the
and gives reasons on behalf of the program of Invariants.
least of these reasons is the important, pragmatic consideration
mentioned above, to wit: Jeffreyst "ignorancett priors provide a
analyses.
Bayesian model for sorne basic "orthodox" statistical
(1979,
p, 421) I have argued that Jeffreys' theory
Elsewhere
The authors' many interesting
81
Copyright @ 1985 by Marccl Dckkcr, Inc'
07 47
4938
185
1O401{08 I S3.s0/O
SEIDENFELD
of Invariants is not Bayesian after aLL. The problem, in a
nutshell,, is that the theory of Invariants makes the prior a
function of the data through the likelihood. This leads to
Bayesian contradictions when, for instance, the data are composite,
Lack a common sufficient statistic,
yet admit (unique) "ignorance"
priors for separate components of the data. then different
posterior probabilities arise from Ehe same data depending merely
upon which component of the data is used to fix the (invariant)
prior. Thus, I do not find Jeffreys' program acceptable for rnaking
t'objectiver" i.e., observer invariant, a sense of statistical
"ignorance.t' Better, I think, to use sets of probabilities to
capture uncertainty than to rely on some one distinguished distribution to model a lack of knowledge. (See [tevi, (1980, chapter
9)] for a rich discussion of this approach, including its decisiontheoreEic consequences and variations due to Dempster, Good, Kyburg,
and C.A.B. Smith.)
Nonetheless, there are important questions worth asking about
ttimpropert' priors without supposing they serve as canonical
representations for states of (near) ignorance. (Recall their
How
pragmatic vaLue ir: linking Bayesian and orthodox statistics.)
do "improper" densities translate into probability distributions?
And do they stand the Bayesian test of coherence?
Of course, the 'rimproper" uniform (Lebesgue) density from
example, is no probability density: the sure-event carries
infinite measure. But, as Sir Harold points out (1961, p. 119),
There is
"... vre must use o instead of I to denote certainty ...
no difficulty
in this because the number assigned to certainty is
conventional.rr Ttrere is no difficulty so long as we keep our
conventional shifts consistent with one another. Towards this
end, and following Levi's (1980, 95.10-5.11) excel"lent presentation, let us partially define a probabiLity P(.) frorn a o-finite
measure U(.) (arising from an "irnproper" density) as fol-lows:
(1)
P(E) = I if u(E) = @ and u(E) < -.
As Levi (1980, p,129) shows, Lhis transl-ation from an "improper"
COHERENCE
prior to a probability gives a reconslruction of Jeffreys' formal
calculations--where the "improper" density is used in Bayest
theorem for densities. It also makes sense of other of Jeffreys'
analyses, €.g., that the probability is 0 that a < 0< b, for
all -- < a < b < o,, under the "improper" density from example,
[J"ffr"ys, (1961, p. 122)). However, as has been noted by many:
Heath and Sudderth (i978); Hill (1980); Levi (1980), to mention
chree, the resulting probability l(') is merely finitely (not
countably) additive. In the exampler, by translation according
to (t), P(') is purely finitely additive (p.f.a.) as P(i < (r. i
+l)=0fori=0,J1,
Why should mathematical issues of additivit.y of measures
be of interest in a discussion of foundations of statistical
inference? After all, even Kolmogorov (1956, P. 15) thinks
l-sdditivity
is an expedient. So what if we alter the convention
that probabiliry is councably additive in reaction to the other
shift in convention: !o use o-finite but not finite measures to
depict certainty? The question is more than a mathematical nicety
given the serious allegations, €.8., of Dawid, Stone and Zidek
(1973), that "improper" priors engender inconsiscency and Bayesian
incoherence,
Ttre so-called "marginalization paradoxes" and "strong
inconsistencv" attributed to Jeffreys-styled analysis using
"improper" priors are, in my opinion, traceable to two distinct
features of "impropriety." FirsE, inadequate mathematical techniques have been employed to facror joint distributions for
continuous random variables into a product of a "oarginal" and
"conditional" density. Ihe inadequacy is evident through
leparzrmeterization. (I have discussed this in (1982).) Second,
finirely additive probabilities admit failures of what deFinetti
(L972, p. 99) calls "conglornerability" of distributions.
b"
pefinition
let n = {t'': i=I,'..)
.1g14:'"."bi1ijl:
a (denumerable) partition of the sure-event. If P(Elh.) < (>) k
for each i, then P(E) < (>) t.
83
SEIDENFELD
additive probability i'(')
sa!isfies conglomerability in a partition n if and only if P(')
is disintegrabLe in n.) the following is an elemenrary illustration of non-conglornerability of a merely finitel'y additive
probability (reported in lderinetti (1972, p. 205)] and fDubins
(Dubins (1975) shoved that a finitely
(t975, p. e2)l).
ExamoLe^: Let the field be the set of (a11) subsets of
Let P-(a- ) = l/2J if i=1,
x = {a..:
1lc].ji=l ,21 i=L,2,...}.
i=2i hence P"(') is countably additive and lives on
each i, j, and let
the set X,r = {a..: i=l}.
Let P,(a,.)
d lJ = 0 for
1.1
ro(xr) = 0. rinally, let P(') = oP (.) + BPo(') (cr,B > 0, a +B= 1).
Then p(.) is not conglomerable in n
Fix n = {njrnj = {atj,"rj]}.
1 for each j. see fschervish
as P(xr) ="c, ihereas-r(xrln:)
et al, (1981+, p.2I7) for discussion of how neither conglomerability nor non-congLomerabiLity in a given partition is closed
and = 0 if
under convex combinations. ]
Recently, Schervish et al. (1984) have shown that (subject
to mild regularity conditions) every finitely but not countably
additive probability admils failures of (denumerabLe) conglomeraThe least upper bound of the failures is given by the
bility.
coefficient of the p.f.a, component in a decomposition of a finitely
additive probability into a convex combination of a countably
additive probability and a p.f.a. probability--corresponding to
B in exampLer.
Subject to the translation (1), o-finite measures arising
Hence,
from "improper" densities convert to p.f.a. probabilities.
they adrnit arbitrariLy large failures of conglomerability. (ff
one adds clauses to (1) for using limits to translate u(')-measures
into P(.)-measures, as is done by lleath and Sudderth (1978), the
P(') which results is non-atomic as we11.) Here is a sirnple
ilLustration of non-conglomerabiLity with Jeffreys' prior from
example' where the failure exceeds .67.
Example": Let I t N(0,1) and Let p(O) be partially defined
as a p.f.a. probability according to (1) from the uniform "improper"
COHERENCE
density. Let I. = {x,0: i-l < l"-ol. i} and let J. = {x: i-1
Note that P(I:) > 0 and P(I.,) > .67 in
: l"l . i] i=l ,2,...
particul-ar. Moreover, P(r.) = P(r. le) = r(r. lx). (The first
equality is by the confidence interval property of L and a
The
tacit assumption of conglomerability in the 0-partition,
second equality is by Jeffreys' analysis, as in exarnpler,)
Define a denurnerable partition n = {h.: hi = Ii*lg (r, n;.)}.
Then, since P(J.)=0 (according to (1)), P(rlltt.l=o for each
element of n, yet P(Il) > .67. Also, P(h,) > 0 so that, as in
exampler, for non-atomic distributions non-congLomerability is
present r^rithout conditioning upon events of probability 0'
Is a merely finitely additive probability coherent? Does
non-conglomerability entail a "Dutch Book" (sure loss) for
anyone posEing (conditional) odds that fail conglomerability?
liOl At least according to the positions defended by Savage
(1954) and deFinetti (1974) finiteLy additive probability is
coherent. (See lseidenfe].d and Schervish (1983)] for details.)
Savage's postulate system Pl-P7 adrnits all non-atornic finitely
DeFinettits criterion for
additive personal probability.
(against
many cal1ed-off bets),
finitely
avoiding a sure-l-oss
or equivalentLy his criterion of admissibility of "previsions"
against his (proper) squared-error score' establishes coherence
of all finitely additive probabil-ity.
Ttrere are, however, rival accounts of coherence, even for
In their important (1978) paper'
finitely additive probability.
Eeath and Sudderth introduce conditions of coherence that (in
effect) require conglomerability simul-taneously in two specific
the partitioo by the unknown (and unobserveil)
partitions:
parameter 0, and the partition by the random variable(s) to be
observed. (Typically, for textbook statistical problems, these
are partitions with cardinality of the continuum. ) Thus "H-S
coherence" is a stricter slandard than (deFinettits) coherence'
additive prior fails to carry a posterior disYhen a finitely
tribution that is conglomerable in the partition by the observed
85
86
SEIDENFELD
then that prior is "H-S incoherent'" The
authors accept the H-S standard for, in this situation, they say
the prior fails to yield a posterior (p. 28) and should not be
used (p. 31). Of course, such priors have posteriors that are
coherent in the sense of deFinetti or Savage.
I queslion the appropriateness of the added restrictions
imposed by "H-S coherence" beyond what is required by coherence
(in the deFinetti sense). For one, a "H-S coherent" distribution can be "H-S incoherent" conditional on an event defined
This is
so1el-y in terms of the observed random variable(s).
random variable,
illustrated by the following.
Example4: Let (x|xr) be i.i.d N(0,o2), $tith both parameters
"unknown." It is straightforward to shos, that
(z)
(e,o2)'
P(x
'min- < 0 < xmax'I (0,o2)) ='5 for each pair
As Heath and Sudderth show (1978, P' 341) there is a "H-s coherent"
finitely additive prior, corresponding to Jeffreys' "improper"
ignorance density (uniform over 0 and independently uniform over
1n o). The posterior satisfies
(31
(*,,x,)) =.5 for each observation (xl,xt)'
P(x-.'mln- < 0 < x*--max'I L' I
Define the random variable
t = (xr+ x) / (x, -x),
Buehler and Feddersen (t903) show that
(0'o2)' tt'"
,(*,nir, I01*r"*
| {0,o2), l.l '1'5) > '518 for each
(xr,
xr) pairs
But, as \^re can partition the event ltl < 1.5 by those
(x' xr),
in
(in
t), given congl-omerability
satisfying the inequality
it follows frorn (3) that' conditionally,
(5)
P(xmln <o<x max I ltl.1.5)=.s
but given conglomerability in (0,o2), it follows from (4) that,
conditionaLly,
(6)
P(xmln < o < xmax I ltl . 1.5) > .5]8
-(5)
and (6) are contradictory, it cannot be that the
Thus, since
"H-S coherentt' distribution associated with (3) is also a "H-S
coherent" distribution given ltl . f.S. (As shown in [Kadane et
al. (1981)], such conditional "H-S incoherence"is restricted to
COHERENCE
87
events of (unconditional) probabiLity 0. As the observation,
likewise, carries an unconditionaL probabiLity 0, iE can be that
for each possible observation there is some containing event
of probability 0 that, conditionally, establishes "H-S incoherence.")
It is open, I believe, when such condirional "li-S incoherence"
can arise. (The problern is equivalerrt to what Buehler (1959)
called the question of "relevant subsets.")
In light of these findings: that all merely finitely addiEive
probabilities suf fer non-conglomerability in a denumerable partition ;
that a "H-S coherent" distribution can be conditionally "H-S
incoherent"; and that this may be possible for each point in lhe
sample space, why do the authors subscribe to Ehe more resErictive
notion of "H-S coherence"? If coherence is to serve as a norm for
inductive logic, if incoherence is a mark of the irrational
analogous to the objection of deductive inconsistency, then one
nust justify the serious charge thac non-conglomerability is
I susPect
irrational in order that "H-S coherence" be justified,
that the family of pragmatically bettable propositions cannot be
insulared from the anomaly of non-conglomerability without
Better' I
reintroducing Ehe principle of countable additivity.
think, to use deFinettirs standard rhan to make coherence inEo
a procrustean bed.
A short Postscript on ttl,ogic"
The authors are intent on connecting problems in the foundafions of statistics with questions abouE logic and lhe foundations
of mathemalics. As one who teaches courses in the philosophy
of science and in mathematical logic, of course I am pleased
to see economisEs concerned about the "logic" of probability (in
Unfortunately, though the path aPPears evident lo
statistics).
at least three economists, Ehere is one philosopher who does
not follow the trail from Aristotle through G6'de1 to Birnbaum.
For example, what is the economic relevance of Gddel's
celebrated incompleteness Lheorern? Should economists be more
concerned than, say, physicists or even mathematicians about
SEIDENFELD
chis fact regarding formalized theories? Do Ehe authors know of
a single case where Godelean-incompleteness matters in economics?
What axiomatized economic theory and what economic
hypothesis are involved? I am nonpLussed by the repeated appeal
to G6de1's importanr result.
I'trere is a substantial literature dealing with Lhe idea
tha! probability theory gives a generalization of deductive 1ogic.
The theme is hardly surprising since it is natural lo think of
logic algebraically [H"lto" O96D 1 thro,,gh the pair (A,M),
where A is a Boolean algebra and M is a Boolean ideal in A
(generaEed by the "anEi-axioms"). Then a measure algebra over
the quotient algebra A/M (assuming M is a Proper ideal) provides
a quick link where exactly the deductive consequences of the
axioms carry probability 1. For a clear sufinary of the Program
to use probability for generalizing deductive to inductive 1ogic,
see carnap's (1950,9 43B). For a view of probability as multivalued logic, see Reichenbach's (1949, chapter 10)' However,
what I find surprising and rather puzzling is the authors'
assertion that the Aristotelean principle of excluded middle
does noE hold for the "standard" interpretations of probability,
by which they mean the "frequencyt' and "subjective" interpretations (see their fn. 5 and AppenaixS4 and55)' This is a much
stronger clairn than the modesE resulr that (formal) probabilicy
theory carries an interPretation as a non-classical logic'
Let us consider the reasons they offer on behalf of cheir
claim. In the Appendix 94, "orrcet"ing the "frequency" interpretation they write,
Even in Ehose situations where the conditions of a law
of large numbers do hold, lte may not know exactly the
limiting frequencies in infinite reference classesl
thus our probability statements entail assertions about
Itris means that probability
unknown relative friquencies.
sEatements can at best be approximations to long-run
relative frequencies, and hence Aristotelian logic does
noE apply under a frequency interpretation'
In the Appendix 55, concerning the "subjective" interpretation
COHERENCE
rhey write,
Consequently, if the distribuEions constructed in
DeGroot's .manner are construed as yielding subjective
probabilities--i.e,,
degrees of belief--this means that
subjective probability assignments do not require a unique
degree-of-belief function.l8 Thus, approximatione aiE
necessary Lo represent someone's opinions by a distribution,
thereby ruling out any application of Aristotelian logic to
Ehis siEuation.
As best I can make out, these arguments resE on a lacic assump:ion (which I reject) which amounts to the presumption that
approximations are at odds with Ariscotelean 1ogic. (I apologize,
in advance, if this is not what the authors intend.)
Of course, we are well aware of the problem posed by Ehe
i.nevitable vagueness of ordinary language, Just how few hairs
nake a given head bald? (If one more hair makes no difference,
then by mathemaEical induction the head is bald regardless the
Sr-rt I reject the assumption
auober of hairs growing on it!)
that approximations must involve the kind of vagueness that makes
by standards of classical 1ogic. The familiar
rhem unintelligible
terms "palm" and "back of my handt' are infected with vagueness:
lust where is the dividing line? But that does not preclude an
epplicarion of classical logic to statements 1ike, "I clap wiEh
tae palrns ofmy hands." And it is understood by all what. the
traffic cops means when it is reported that I was driving
:pproximately 45 mph in a 25 mph zone. I do not believe this
uaderstanding involves application of some non-standard logic!
Htrat is sorety rnissing from the authors' arguments (above) is
. demonstralion that the sense of "approximation" used in the
'sEandard" interpretations of probability is of the sort Ehat
cannot be captured by classical logic.
This is not to deny that I, too, find serious deficiencies
I think the
ia the frequency interpretation of probability.
*dispositional" theory is more reasonable: chance is a useful
(see hacking. (rgo:)] ana
r€nse of "empirical" probability.
(1980,
chapter 11) ] for discussion of "chance.") J do noE
ILevi
r
SEIDENFELD
90
find classical logic to be a hindrance in the philosophically
difficulty
task of producing defensible explications of "probabNor do I see where the authors have shown us good reason
ility."
Lo suspect classical logic as one of the sources of the conflicts
in the ongoing debates over the foundations of statisEics.
Teddy Seidenfeld
Washington University
ADDITIONAL REFERENCES
Buehler, R,J., (1959). Some Validity Criteria For Statistical
Inferences. Annals Math. Statis! ' , 30, 845-863.
Buehler, R.J. & Feddersen, A.P., (1963). NoEe on conditional
property of Student's t. Annals Math. Statist., 34' 1098-1100'
Carnap, R., (1950). Logical Foundations of Probability.
Chicago: University of Chicago Press.
Dawid, A.P., Stone, M. & Zidek, J.V., (1973). Marginaliza!ion
paradoxes in Bayesian and structural inference' J. Roy.
statisc, Soc. 35, 189-233 (with discussion).
deFinetti, B., (I97D.
New York: Wi1ey.
Probability,
Induction and Statistics'
( lg7, . Finitely additive conditional probabilities,
Ann. Probability, 3' 89-99'
conglomerability and disinlegrations'
Dubins, L. ,
Hacking, I.,
(1965). Logic of Statistical
Inference.
New York:
Cambridge Universi.ty Press.
Halmos, P.R. , 0962).
Algebraic Logic'
New
York: Chelsea'
Jeffreys, H., (Lg4O. An Invariant Form for the Prior Probability
in Estimation Problems. Proc. Roy' Soc., A, 186, 453.
Kadane, J.8., Schervish, M. & Seidenfeld, T., (1981) '
Statistical
Imp1ications of Finitely Additive Probability, In: Bayesian
Inference and Decision Techniques with Applications: Essays
in Honor of Bruno deFinetti, Coel, P.K. & Zellner, A. (eds.),
forthcoming.
Kolmogorov, A.N., (1956). Foundations of the Theory of Prob-
abi1.ity.
New
York: Chelsea.
91
COHERENCE
Levi, r.,
(1980)
The Enterprise of Knowledge. Cambridge, Mass':
MIT Press.
Reichenbach, H., (1949). The Iheory of Probability.
Los Angeles:
Savage, L.J. (1954). Foundations of Statistics.
York: wiley'
University of California Press'
New
Schervish, M., Seidenfeld, T. & Kadane, J.B., (1984). fn9 Extent
of Non-Conglomerability of Finitely Additive Probabilities.
Z. Wahrscheinl ichkei ts theorie verw. ' 66 , 205-226 '
Seidenfeld, T., (1979). Why I am not an objective Bayesian.
Theory and Decision, 11, 413-440,
Seidenfeld, T,, (1982). Paradoxes of Congl-omerability and
Fiducial Inference. In: Logic, Methodology and Philosophy of
Science VI, Cohen, L.J.r Los, J., Pfeiffer, H. & Podewski, K'
(eds.) Amsterdam: North Hol1and, 395-412.
Seidenfeld, T. & Schervish, M., (1983). A Conflict Between Finite
Additivity and Avoiding Dutch Book. Phil' Science, 50, 398-472'
ECONOMETRTC REVTEWS,
4(1), 93-99
(1985)
COMMENT
that logical lnference plays a large
part ln sclence and thaE it ls ioportant to get lt rlghc.
Even lnstrunentallsts, for whom sclence is magic, would
agree Ehat it ls lmportant to find the rlght - 1.e.
successful - spells. The authors under discusston - Swamy,
Conway and von zur Muehlen - pick out a cerEalo dominant
conceptlon of loglc whlch they call "Arlstotellan". Slnce
-Aristotellan 1oglc" norrnally neans a partlcular system
conflned to sylloglsms, I sha1l foLlow the customary usage
of loglcians and philosophers and talk of "classical" logtc.
The authors I thesls 1s that ecooometrtc practlce does not
conform to the prlnciples of classlcal loglc and calls for
a rlval loglc, namely some brand of many-valued loglc. I
sha11 argue that vtrtually everythlng they say about
classlcal loglc is mlstaken and thelr advocacy of
many-valued 1oglc ls mlsconcelved.
The authors see classlcal logtc as characterised by
three principles. The Law of Excluded Middle says that every
statement is elther true or faLse. The Law of Cont.radlctlon
(to glve lt lts customary name) says that a statement and
lts negation cannot both be true. And the Axioo of Identity
(as the authors call lt for want. of an accepted termtnology)
says that a term must oot be used in dlfferent senses ln the
same argument. I shall deal wlth these Ln reverse order,
Everyone agrees
93
Copyrigbt @ 1985 by Marcol Dekker, lnc.
07 47 -4938 | 85/040 l
4093 $3.s0/0
ECONOMETRTC REVTEWS,
4(1), 93-99
(1985)
COMMENT
that logical loference plays a large
part in sclence aod that tt ls important to get it rtght.
Even lnstrumentallsts, for whom sclence 1s maglc, would
agree Ehat it 1s important to find the right - i.e.
successful - spells. The authors under discusslon - Swamy,
Conway and von zur Muehlen - pick out a cerEaln dominant
conceptlon of logtc whtch they call "Arlstotelian". Since
-Aristot.elian 1oglc" norrnally neans a partlcular system
conflned to syllogisms, I sha1l follow the customary usage
of loglclans and philosophers and talk of "class1cal" 1oglc.
The authors I t.hesis 1s that econoneErlc practlce does oot
conform to the prlnclples of classical loglc and calls for
a rlval logic, narnely some brand of many-valued loglc. I
shal1 argue that vlrtua1ly everythlng they say about
classleal loglc ls mlstaken and thelr advocacy of
nany-valued loglc is misconcelved.
The authors see classlcal loglc as characterlsed by
three prlnclples. The Law of Excluded l,tiddle says that every
statement is either true or faLse. The Law of Contradletlon
(to glve 1t lts customary name) says that a statement and
lts negatlon cannot both be true. And the Axiom of Identity
(as the authors call tt for want of an accepted termtnology)
says that a tentr rnrst oot be used in different senses ln the
same argument. I shall cleal wlth these ln reverse order,
Everyone agrees
93
Copyright O 1985 by Marcel Dekker, Inc.
o7 47
4938 I 8s/040 l -0093 $3.50/0
SMILEY
94
followed by
some remarks on nany-valued
loglc.
The Axion of Identlty. It is unfortunate that the one
economlst whom the authors cit.e as violatlng thls rule
actually provides a textbook example of obedlence co tt!
For thelr os,n quotatlon from Muth ln Exanple 3 shows hlm
uslng trro terms, namely "objectlve probabillty" and
"subjective probabi1lty", lnstead of uslng the same term
"probabtlity" ln t\to senses. But ln any case the l-ssue ls a
spurious one. The purpose of Ehe loglclanst rule is to
avoid fallacles due to amblgulty. Ustng a symbol to stand
for dlfferent things ln the same formula ls asking for
trouble. Outslde formal loglc, howeverr the fact that a term
stands for more than one thtng does not necessarlty create
arnblguity. The difference is that a fu11y formaL argument
leaves oo scope for the play of context' whereas in an
lnformal argument the context can make lt perfectty clear
which sense of a word ls ln question at (rhlch momentr so
there need be no genuine conflict wlth Ehe spirit of the
loglclansr house ru1e.
The Law of Contradlctlon. The authors say that cerEain
theorles of ratlooal expectatlons have been found to be
lnconslstent (1.e. self-contradictorY), and thls 1s supposed
t.o vlolate the law of contradictlon (Example 4). In itself
it does oothing of the sort: lt all depends oo what happens
next. If the dlscovery of lnconslstency ls taken as grounds
for abandonlng or anending the theory ln question, then far
from vlolating the law of contradictlon such behaviour will
vlndicate lt. For the 1aw of cootradictlon doesntt say EhaE
there cannot be an lnconslstent theory: i.t says that an
FOUNDATIONS OF ECONOMETRICS-COMMENT
95
inconsistenE theory cannot be true; and this is what leads
usr as seekers after truth, to reject a theory once it ls
known to be lnconslstent. The only way an economlst could
violate the law of contradictlon would be by adoittlng that
his pet theory rtas lnconslsEent but clalming that thls
didnft matter.
The same nlsunderstandlng runs through the sectlon on
G6delts theorem, wlth lts extraordinary heading "Giidelrs
challenge to the axiom of noncontradictlon". Giidel showed
that any sufflclentLy strong theory must be lncooslstent or
incomplete, but he could only be seen as challenglng the law
of contradictlon 1f he had lnvited theorlsts to sacrlfice
consistency in order to achleve completeness, and of course
he atltl no such thing. I agree with the authorsr warnlng
about the danger of creatlng lnconslstency when rival
hypotheses are comblned into a more general nodel, but I
cannot see whab Ehey think thls has got to do wlth Giidel|s
theorem. Nor can I nake out the alleged impllcatlons of
G6delts theorem for Dempst.errs or Tverskyrs crlterla of
consistency, or for the clalm that "coherence cannot be
achleved ln sufflclently large worlds". 0n the contrary the
whole appeal to Giidel rs theorem appears to be a red
herring.
of Excluded Mlddle. This ls sald to be vlolated by
economlstsr reltance oo approxlmations, for example in
demand systems (Example 1) or Ehe lnterpreEation of
probabtllty (pp. 1In.r 51, 54). This Ls not so' If someone
puts forward an approximatlon as an approxiroatlon, e'9. by
assertlng that'il = 22/7 +.002, then what he says ls
The Law
96
SMILEY
straightforerardly true. On the other hand tf he puts it
forward wlthout quallficatlon, asserting baldly thaE T( =
22/7, then what he says 1s stralghtforwardly false. Neither
way 1s there any violatlon of the law of excluded ntddle.
The ldea that there is a status "approxlrnately true"
genulnely located somewhere between true and false 1s a
mlstake, llke that ldea that there ls a person called "the
average plumber" who genuinely has 1.3 chlldren and 1.9
Legs etc. The rnlstake ln each case ls to take at face value
an idiom that onty nakes sense when construed as shorthand
for something else. Thus the clalm that the average plumber
has 1.3 chlldren needs to be seen as shorthand for an
assertion about the ratlo of plumbers to chlldren of
plunbers, and llkewlse the clalm that Tf = 22/7 is
"approxlmately true" needs to be seen as shorthand for an
assertlon about the relative smallness of the difference
between the two numbers. When chis ls done the nystery
disappears along with the phrase that caused lt.
A common mlstake ln dlscussions of the law of excluded
rnlddle ls to confuse a statementrs being true wlth 1ts
being knowo to be true. The authors do thts when they
equate our inablllty Eo tell whether statementg abouE the
future are true or false with thelr actualty not betng
elther true or false. The same mistake lles behind thelr
clain that probable sEatements are nelther true nor false
(Example 2). Thelr authorlty here ls Boland fs "The
Foundatlons of Economlc Method", and what Boland says ls
"If \ile adopt the stochastic-conventlonallst vlew that
identlfies absolute truth wlth a probablltty of I and
absolute falslty with 0 then o.. a stochastlc statement
with a probablllty of 0.6 ls not absolutely true, oor ls lt
I
tI
FOUNDATIONS OF ECONOMETRICS-COMMENT
97
absolutely false". I canrt speak for "stochastic
conventLonalists", but any reputable wrlter on probablllty
w111 say that what corresponds to a statementrs having
probabllity I ls not its being true but lts belng known to
be true (or lts betng certaln, confirrned, etc. See for
exarnple Carnap, "The Logical Foundations of Probabtlity",
p.177). Probabtllty is not a continuum between truth and
falsity; it is a conEinuum in a different dlmenslon of
assessment
r
Less actually turns on the issue of excluded mlddle
than the authors thtnk. They say that the use of logic Eo
establlsh concLusl-ons (rnodus ponens), and to refute theortes
(modus tollens), both depend on the law of excluded otddle
(pp.7, 19). In fact neLther does so, as nay be conflrned by
observlng that both hold for lntultlonist loglc desplce its
rejectlon of excluded niddle. A11 modus ponens requlres 1s
Ehat valld arguments are truth-preservlng, plus the
presence of a set of true premlsses to work from. The lack
of true premlsses, not any supposed failure of excluded
niddle, ts the reason why "approxlmate modus ponens"
doesnrt work (p.16). Modus tollens llkewlse only requlres
that valid arguments are truth-preserving, plus the presence
of an untrue conclusion to work back from. The authors seem
to have been led astray agaln by Boland, who describes
refutation as lf it rtere a two-step process: (a) work back
from the untTuth of the concluslon to the untruth of the
assumptloos taken collectively; (b) argue from the
collective untruth of the assumptlons to the falslty of at
least one of them. The second step does indeed depend on he
law of excluded rnlddle, but lt is a polntless addition: the
flrst step is already an adequate account both of modus
SMILEY
98
tollens and of refutatlon.
logic. I have tried to show that Ehere ls no
reason for econometrlclans to reJect classlcal logic, but
in case anyone 1s unconvl-nced let IE cooclude with three
l,Iany-valued
cautions about rnany-valued logic.
The flrst ls that, contrary to the i.mpresslon given by
the authors (p.2), not all the aLternatives to classical
logic wtll be found under Ehe heading of nany-vaLued logic.
As logiclans use Ehe idea, lt is not enough for a system to
posit more than two truth-values; lt also needs to be truthfunctional, where this rreans that the truth-values ascrlbed
to compound sentences are to depend solely on the truthvalues of their coilponents. Thls ls why probabiltty theory
will never make a mrny-valued 1ogic, for although negation
is Eruth-functlonal conJunctlon 1s not: the probabtLtty of
not-A ls a function of the probablllty of A, but the
probabtllty of A-and-B depends on other things than the
separate probablllt1es of A and B. Stnl1arly the fact that
lntultlonlsts replace the cLasslcal- dichotomy of "true"
and "fal-se" by a trlchotomy of "proved", "refuted" and
"undecided" does not
that lntuitlonist logic ls many'Dean
valued.
The second cautlon ls that an apparently many-valued
logic may t.urn out to be rrerely a sysEem of classlcal logic
in disgulse. Thls emerges very clearly frorn the most common
net.hod of constructlng many-valued loglcs, whereby some of
the truth-values are "deslgnaEed" and an argument ls counted
as valld if it always leads from deslgnated values to
deslgnated values (cf. p.37). For all thls tells usr
"designated" nay turn out to be just another name for
FOUNDATIONS OF ECONOUETRICS-COMMENT
99
-true", with the designated values representlng various subcases of Eruth and the undeslgnated ones subcases of
In other words, what looks l-lke a rival to
falsity.
classical logic rnay slmply be a fine-gralned versloo of it.
There are nonet,heless some genulne rlvals to classical
loglc. Fttzzy logic ls one of them, not because lt is
deslgned to handle fuzzy concepts but because 1t treats
truth as ltself a ftzzy concept. But - and thls ls the last
point - an econometrlclan who follows up the authors t
recommendatLon of ftzzy logtc as a tool for dealing wlth
-inexact concepts such as approxlmation" (p.37) is llable to
be disappolnted. The reason ls that "inexact" can mean at
least three things. The statement " 7[' i" . small number" is
inexact because it ls Eg. - lt lacks clear-cut truthcondltions owing to the ftzzy nature of the coocept "sroall".
The approxlmatlons " Tl = 22/7" aod " It = 22/7 + .002" are
not like this at all. One is lnexact because untrue; the
other 1s lnexact because tt is unspeclfic; but both have
perfectly cl-ear-cut trrrth-coodltions: there ts nothlng vague
or fuzzy about either of them. Even lf fuzzy logtc should be
the right tool for dealing with fuzzlness and vagueness,
wouldnrt lt be fatr to deflne econometrlcs as the part of
economLcs that avoids fuzzy concepts and vague statements?
T. J.
Clare College,
Smlley
Cambrldge
ECoNoMETRIC REVTEWS,
4(1),
101-119 (1985)
REPLY
are honored by professor Dale poirlerrs offer to publlsh
a compressed verslon of our paper along wLth commentarles by such
leadlng researchers 1n phllosophy of sclence and econometrlcs as
Professors L.A. Boland, I.J. Good, M.H. pesaran, T. Seldenfeld,
and r.J. sniley. Thelr comments and those offered earller by our
former colleague, Dr. Edward J. Green, have helped us rethlnk
sone of the materlal we have presented. Even where crltlclsn has
been well founded, falrness diccaces that we present our responses
here rather than ln an altered text. Where, ln our subJective
vlew, conments were illfounded, we wl1l so lndlcate. As our
followlng replles show, our posltlon remalns firm.
We
Analytlc-Synthetic Distlnction
It was good to read professor Bolandrs klnd remarks. Our
dlscusslon of Arlstotlers axloms for adrnlsslblllty of statements
lnto loglcal arguments orres so much to hls very inportant contrrbu:ions to the foundatlons of eeonomlcs. Ife concur ln the oplnlon
that the "analytlcal-synthetlc' dlstinctloo is susplcious. The
offendlng foocnote 2 containing Ehls distlnctlon was added to
paclfy sorne crltlcs nentloned in our authorsr footnote. In any
case' this dlstlnctlon ls not cruclal for our dlscussion, provlded
readers understand the word "true" in the same way as Boland does.
Arlstotellan Loslc
clearly lndlcated wldely known sources for our
c€rnlnology ln referrlng to Arlstotlers axlons, Sniley appears to
tave the curlous lrnpresslon that "The Law of contradlctlon'. ls a
c'lstomary name and that rre have re-chrlstened one of Arlstotlers
Even though we
101
Corrisht
@ 1985
by Marccl Dckkcr, Inc.
07 47
4938
1
85 1 040
1
-0 1 0
1
$3.s0/0
ECoNoMETRTC REVTEWS,
4(t),
101-119 (1985)
REPLY
we are honored by professor Dare poirierrs offer to publlsh
a compressed versLon of our paper along wlth connentarles by such
leadlng researchers ln phllosophy of sclence and econometrlcs as
Professors L.A. Boland, I.J. Good, M.H. pesaran, T. Seldenfeld,
and r.J. sniley. Thelr comments and those offered earlier by our
former colleague, Dr. Edward J. Green, have helped us rethlnk
some of the materlal we have presented. Even where crltlclsm has
been well founded, falrness dictates that \re present our responses
here rather than ln an altered text. where, ln our subJectlve
view, comnents rrere lllfounded, we wl11 so lndlcate. As our
followlng replles show, our posltlon remalns flrm.
Analytlc-Synthetic Dlstlnctlon
It nas good to read professor Bolandrs klnd remarks. Our
dlscusslon of Arlstotlers axloms for adnnissibllity of statements
lnto loglcal arguments owes so much to hls very inportant contributions to the foundatlons of econoralcs. we concur in the oplnlon
Shat the "analytlcal-synthetic" dlstlnction ls susplclous. The
offending foocnote 2 concainlng Ehls distlnctlon was added to
paclfy some crltlcs nentloned in our authorsr footnote. In any
case' this dlstlnctlon ls not cruclal for our dlscusslon, provlded
readers understand the word "true" ln the same way as Boland does.
Arlstotellan Loglc
clearly lndlcated wldely known sources for our
lernlnol0gy in referrlng to Arlstotlers axlons, snlley appears to
have the curlous lmpressLon that "The Law of contradlctlon" is a
cuscomary name and that we have re-chrlstened one of Arlscotlers
Even though we
101
Cog''tixbt
@ 1985
by Marccl Dckkcr, Inc.
0747 4938185104014t 0l $3.s0/0
702
SWAMY, CONWAY, AND VON ZUR MUEHLEN
axloms as "The Axlom of ldentity" "for lack of an accepted termlnology." Smlley and Boland nay settle between themselves the
assertion that we "have probably been 1ed astray by Bo1and." But
that the use of modus ponens and 99dus-!e11eng does not depend on
the axlom of the excluded-mlddle is absolutely false. Bolandrs
Ereatmenc of the lnterrelatlonshlp between modus ponens (or
modus tollens) and Arlstotlets axioms ls nothiog short of excellent. Smlleyrs claim that modus ponens or modus tollens only
requlres that valid arguments are Eruth-preservlng lacks a Program
guaranteeing that valid arguments are truth-preservlng, thereby
asklng readers to act b1lnd1y and mechanlcally.
The Axlom of Identlty
Snlleyts defense of one economlsE, some of whose work we had
found to be 1n vlolation of the axlon of ldentlty, ls based on
the somewhat objectlonable assertlon that "economists do not-thank God!--make use of formallzed argunents." Well!
Whatever economlsEs do or don't do, the axlom of ldenElty
neans that loglcal falslty of the sEatement 11ke "2=4" follows
fron the very definltlons of the terms. In an analogous manner'
loglcal falslty of the ratlonal expectatlons hypothesls (nnn)
as adopted by one "perfect example of obedience to lthe axlorn
of ldentlty] " follows from the very deflnltlons of the terms
Slnce
"subjectlve and objectlve probabllity dlstrlbutions."
subjectlve and ob-'iective Probabllitles are entlrely dlfferent
concepcs, their lnterchangeable use 1n a theory therefore vlolates
The same must be said of Pesaran's statethe axlom of ldentlty.
"under
the REH che two concepts [subjectlve and objecment that
tlve probabllitles] are by definltlon lndisElngulshable from one
another as they w111 be one and the same thlng." Two different
deflnltlons cannot be made equlvalent by deflnltion.
The Axlom of Noncontradlctlon
Although Boland agrees wlEh our reconmendatlon Ehat the
"axlon of noncontradlctlon" should not be abandoned, Sml1ey
FOUNDATIONS OF ECONOMETRICS-REPLY
103
evidently belleves Ehat there cannoE be any vlolatlons of the
axlom unless an economist acknowledges--without conErlElon--that
hls theory ls lnconslstent. Thls 1s just misgulded. There are
no a11bls for vlolatlons of t.hls axiom, acknowledged or not.
Incidentally, scientlfic progress is made by accumulaElng
evldence agalnst loglcally valld theorles' not by uncoverlng
thlnklng mlstakes. Our reasons for not abandonlng the axlom of
nonconcradlcclon are the sane as Bolandfs.
The Axlorn of the Excluded-Mlddle
Bolandrs polnt that "every lndirect proof lnvolves both the
axlom of noncontradictlon and axlom of the excluded-mlddle" 1s
rell taken. In fact, thls corrects an error ln an earller verslon
of the paper. We agree wlth Boland that "avoidlng the use of
indirect proofs does not require abandonnent of che axlom of the
excluded-mlddle!" Any scatemenE co Ehe contrary that nlght be
found ln the paper ls accidental. Our reason for abandoning the
axlom of che excluded-nlddle ls not that lndlrect proofs are
invalld, although they are so 1n the lnflnite case and ln other
cases menEloned above, but only that economeEric work cannot
proceed, and may be lmposslble, 1f we lnslst on satlsfylng the
axLon of the excluded-nlddle. Let us clarify thls polnt. The
real alm of lnference 1s usually to generate a predictlon about
che value of sorne future observables. Suppose that a predlctive
dlstributlon, say p(yl*), derl-ved from a 1oglca1ly conslstent
econometrlc roodel (the truth status of whlch is unknown) ls used
to produce an lnterval predlction, say 10.5 to 30.8, for the value
of the random variable f tn a future perlod, say T*s. Then both
sraremenrs, "pr(10.5!]1a"!30.4;=6.95" and "the probablllty that
the reallzed value of if+sr saY Y14g, lles between 10.5 and 30'8 ls
either 0 or 1r" are correct because the random varlable i1.r" must
be distlngulshed fron the value ylas taken by that random varlable.
The statement that 10.5jiT+s!30.8 dlffers from the statement that
the
L0.59T+s<30.8. I.le have therefore thls lmportant distlnctlon:
j1*"
asslgnnent of a truth value of 0.95 to a statement about
104
SWAMY, CONWAY, AND VON ZUR MUEHLEN
violates the axlon of the excluded-rnlddle' wh1le the asslgnment
of a cruth value of 0 or 1 to a statemeot about yT+s does not.
Clearly, Bolandrs remark that "any conslderatlon of statenents
about the future does noc necessltate a rejectlon of the raxlom
of the excluded-nlddle' slnce an adnlsslble staternent ls stl11
elther Erue or false even chough we may not yet know lts truth
status" must refer to reallzatLons, not future events consldered
as random events. Indeed, when we lncroduce random varlables of
Ehe type iT+"r r. cannot even assign a truth value of 1 or 0 to
the statemenc, 10.5$1+s!30.4, and hence caonot avold violating the
axlom of the excluded-nlddle unless the lnterval 10.5-30.8 covers
the entire range of posslble values for Ir*. or ls outslde the
This dlfference ln Ehe asslgnnent of truth values
range of ]r*".
to the stateDents about I1*" and Yr*", lncldentally, applles
whether \re use Bayes theoren or not and cannot be bhanged by avoldlng the use of lndlrect proofs. Good's remark thac "[1]f a proposltlon is now only probable for us, then we donrE know now whether
lt ls true or false, though we mlght know later, and lt can be
crue or false even lf we never flnd out whlch" l-s lnconsequential
here. Further, Goodfs stacement chat "... lprobabtllcy theory]
also obeys the usual laws of ArlsEotelean logle" ls correct ln the
speclal case when by probablllty theory ls meant an unlnc.erpreted
calculus of probabillty as deduced, e.8., by Kolmogorov. In any
case, we do not know of any exanpLe where "pr(10.5ET+J30.8)=p,
01!n$," does not vlolate the axl.om of the excluded-middle.
Pesaran mlsses Ehe polnt that '10.5$1*1!30.8" and
"10.5(y1.r"(30.8" are two dlfferent statements, and nlthout reallzlng that we are asslgning a truth value p, Olpll, to the former
and noc to the latter, he lncorrectly critlclzes our argument.
Smlley says that a common mlstake ln dlscusslons of Ehe axlom of
the excluded-nlddle ls to confuse a statementrs belng true wlth lts
belng known to be true. Throughouc our dlscusslon of Arlstotellan
loglc we only refer to statements whlch are elEher true or false,
saylng speclflcally that the truth of assunptions cannot be known
SWAMY, CONWAY, AND VON ZUR MUEHLEN
the axlorn of the excluded-mlddle.
Since the concept of lnfinlty ls used extenslvely in econometrlc work, our objective of dlscoverlng the foundatlons as
manlfested in the nature of stochastlc econometrlc theorles cannot
be served by rejecclng the use of thls concept, as suggested by
Boland. Non-Bayesians as well as some Bayeslans use countablyadditlve dlstributlons where the very deflnltlon of countableaddltivity lnvolves the concept of lnflnlty.
Vagueness of Language
Both Good and Seldenfeld refer to vagueness and amblgulty
of ordlnary language to crltlclze some of our arguments. Good
belleves Ehat ArlscoEellan loglc can be used even when the concepts
ate fuzzy, but hls arguments are not clear to us. Hls statement
thaE "Ehe quesElon whether a rnan has a beard ... ls a clear and
sfunp1e exarnple showlng the need ln prlnclple for non-Arlstotelean
fr,zzy ].oglc." seems to contradlet another statement, "Arlstotelean
loglc could be largely restored to the questlon of beardedness..."
We do not deny thaE ordlnary language rnay be vague. Indeed we
took care noC to use vague language ourselves. What we have sald
ls that slnce there 1s no approxlmate modus ponens, modus ponens
or Arlstotelian loglc cannot be used 1f any of our assumptlons
are only approxlmately crue. Of course, Seldenfeld rejects
"a Eacit assumptlon ... whlch amounts to the presumptlon EhaC
approxlmatlons are at odds wlCh Arlscotelean loglcr" and no
apology 1s requlred. He also rejects the assumptlon that
"approxlmatlons must lnvolve the klnd of vagueness chat makes thern
unlncelllglble by standards of classlcal loglc." Thls criticlsm
ls beslde the polnt because we do not use Arlstotellan loglc to
dlstlngulsh "unlnrelllglble"
fron "1ntelliglble" scatenenrs buE
to pass along known truths (or falsltles) fron assumptlons (or
concluslons) to concluslons (or assunptlons). Let us conslder
Seldenfeldrs own example. He says, "... lt is understood by all
what the trafflc cop means when lt ls reported that I was drlvlng
FOUNDATIONS OF ECONOMETRICS-REPLY
107
approxlmately at 45 mph In a 25 mph zone. I do not belleve thls
understandlng lnvolves appllcatlon of some non-scandard loglc!"
A personrs understanding of any staEement depends on how well che
terms appearlng ln that scatement are defined and how nel1 chat
person understands those deflnitlons.
Our concern here 1s dlfferent; what can a trafflc courc conclude from the premlse that
Professor Seldenfeld was drlvlng approxlnately at 45 nph in a
25 nph zone? No judge ln his rlghc mlnd would allow such evldence
ln court because any number less than or greaEer than 25 ls stlll
approxlnaEely 45. The evldence ls legally inconcluslve.
A semantlc abuse llke Smlleyrs phrase "the average plumber has
1.3 chlldrenr" serving as shorthand for an assertlon about the
ratlo of plunbers to chlldren of plumbers would probably not pass
an edLtorrs scrutiny. Slmllar1y, no one who has studled nathemaEl-cs
w111 use Smlleyts phrase, "Il=22/7 ls rapproxlmately truer," as
shorthand for an assertlon about the relatl-ve smallness of the
dlfference between Ehe Ewo numbers. We repeat: the nain focus
ln our paper is on the connectlon between assumptions and concluslons and not on the usefulness of phrases as shorthands for
sentences.
The Problen of InductLon
It has always been our understandlng that outslde of used-car
lots "solutlon" means "complete solutlonr" not partlal solutlon. In
Sectlon 4.1 and foocnote 7, we have clarifled what we rnean by "the
problern of lnducElon." Good agrees wlth the statemenE that thls
problem has no complete solutlon. I{ow do we overstate our case
by repeatlng Chls correct scatement whenever necessary? Indeed,
ln the paper we never say or funp1y that partlal solutlons cannot be
valuable. Concepts such as locally coherent procedures, Blrnbaumrs
confldence concept, and Kleferts suggestlons discussed in Sectlons
7.3 and 7.6 would be useful for gettlng partlal solutlons.
Boland and Good agree that the problem of lnductlon has no
(conplete) solutlon. Therefore ne cannoc verlfy the Eruth of Ehe
SIIAMY, CONWAY, AND VON ZUR MUEHLEN
condltlons of Ehe laws of large nurnbers or the condltlons under
whlch subJectlve probablllty distributlons exlst. Because of thls
dlfflculty we are unable to pass along truchs to frequentlstsf or
subjectivlstst specl.ficatlon of probabillty dlstributlons slnce
these truths are always unknown to ua. It ls prudent to a1low for
the possiblllty that the Arlscotellan prlnciple of the excludedniddle does noc hold for frequency or subjectlve interpretaclons
of probabillty. What, then' ls so "surprlsing and rather puzzlTng"
(Seldenfeld) about thls assertlon? For exarnple, Cram6r
(1946, p. 143), a leadlng frequentlst, has sald that to any event
connecEed wlth a random experlment, we should be able to ascrlbe
a number p such EhaE, ln a long serles of repetltlons of the
experlnent, the frequency of the event would be approxlnately equal
to p. Slnllar1y' H111 (1975, p. 559), a leading subjectlvisE'
has pointed out that the notlon of a "true" denslty ls dlstasteful
to a subjectlvist such as hlmself. There are no approxlmately true
frequentist or subjectlve probabllitles from whlch unarnblguously
true concluslons w111 valldly fo1low r{lth the assurance of modus
ponens. If our presentatlon of Bolandrs demonstratlon that there
1s no approxlmate modus ponens or aPProxlmate rnodus tollens ls
not "a demonstraElon that the sense of rapproximatlonr used ln
the rstandardr lnterpreEatlons of probablllty ls of che sort that
cannot be captured by classlcal loglcr" then what is it?
It ls not classlcal loglc but the lack of a (complete) solutlon to the problem of lnductlon that ls a hlndrance ln che phllosophlcally dlfflculc task of produclng defenslble expllcations
of "probab11ity." It ls also the source of confllcts ln the
If we can
ongolng debates over Ehe foundatlons of statlstlcs.
verlfy the truth of the condltlons of a law of large numbers,
then we can say that the frequency lncerpretaclon of probablllty
ls also true. Alternatlvely, lf lte can verify the truth of the
condiclons under whlch subjectlve probabllltles exist, then we
can pass along that known truth to the subjectlve lnterpretaElon
In elther case' controversles of the type that
of probablllty.
FOUNDATIONS OF ECONO},IETRICS-REPLY
109
appeared ln Kiefer (L977a, pp. 822-827 ) surroundlng the question:
I{hat 1s the correct lnterpretatlon of probablllty? should end.
Slnpllclty or
Parsinony
Returnlng to Goodrs comments, we do not agree that the concepts of slmpllclEy or parsimony and conplexlty, are absolutely
essentLal for model cholce. Conslder, for exarnple, Goodrs own
exanple of flttlng a polynonial to n observatlons of a dependent
and an lndependent varlable. Though Good belleves Ehat a polynonlal
of lower degree ls nore parslmonious than a polynomlal of che
(n-1)th degree Ehat fits the data exactly, the latter polynonlal
may not very often be the better predl-ctor of the dependent varlable outslde the saurple than polynornlals of lower degree. If it
is, then lt 1s useful, but the operatlonal meanlng Good attaches
Eo the word "parsLmony" ls nlsleadlng. If our purpose ls not to
predlct but to provlde a relative evldentlal evaluatlon of models,
then we cannot work wlEh exact flts because an equatlon thac flts
the glven data exactly will not enable us to estlmate the error
varlance, and hence staclstlcal evLdence 1o the form of Birnbaumrs
df and d! f" not possible. Exact fits are analogous to sltuatlons
presented by a slngle observatlon y fron a normal dlstrlbutlon wlth
unknown mean u and unknown sEandard devlatlon o. The 1lkellhood
here 1s inflnlte at the polnt o=0, U=y, suggestlng very strongly
that small values of o are more plauslble than larger values, no
natter what the value of j that 1s observed. What you get fron
thls ls not parsimony, but mlsleading evldence wlth probablllty 1!
The confldence concept of Blrnbaun lllunlnates the reason, ocher
than slnpllclty or parslmony, for preferrlng ldentlflable rnodels
to nonl.dentlflable models whlch nay flt the data exactly. Identlfiabl1lcy is a necessary conditlon for statlstlcal conslstency.
It should also be noted that there ls no necessary connectlon
between estlmatlon efflclency and predlctlve efflclency and Ehat
estlmatlon efflclency applles on1-y to rare sltuatlons. To see
thLs, recall the three posslble lnterpretatlons labeled (a), (b),
SWAMY, CONWAY, AND VON ZUR MUEHLEN
and (c) of the parameters ln the (fl,S,p)-paradigrn presented in
applles under all three
inEerpretatlons (a), (b), and (c), estlmation efficlency applles
only under lnterpretatton (c). Loosely speaking, one can guess
estimatlon efflclency, whenever 1c applies, by conputlng the
observatlon/parameter ratlo. When this ratlo ls not sufficlently
large, parameter estimates wl11 be lmpreclse or Ehe efficiency of
the parameter estlmates w111 be low. Also, lf we follow Blrnbaum
to provlde evldentlal interpretatlon, then thls evidence nay be
worthless ln thls case. However, the predictlons lmp1led by these
lmpreclse paramecer estlmates rnay be successful Isee Lad and Swany
(1985)1. Moreover, lt ls dlffLcult to opt for lnterprecatlon (c)
1n nonexperlrnental sltuatlons llke econometrlcs because there may
not be any model-free physlcal quantlcies standing behind each
model paramecer. If lnterpretati"on (c) does not apply' then
In any case, 1f we reject
estlmatlon efflciency ls irrelevant.
a roodel on the ground that lt has too many parameters' then we
Therefore,
rnay be rejectlng a best predlctor of our varlables.
the relevant conslderatlon for predlctlve purposes ls not the number of parameters 1n a nodel but a nodelrs predlctive efflciency.
Thus, the prlnclple of parsimony based on the number of unknown
parameters can be quite mlsleading.
Sectlon 7.6.
Though predlctlve efficlency
I-tany-Valued Loglc
Goodts openlng remark, "Slnce the enphasls was on nultlvalued
loglc, perhaps it ls nelther true nor false that they answered the
ltltle] questlon," is an amuslng pun. Let us establlsh here that
we dld not propose dropplng any of Arlstotlers axloms merely for
the purpose of understandlng our paper. More to che polntr whlle we
concede the lmportance of language and of avoiding lts amblgultles
(exarnples of anblguities ln language rnay be found even among the
commentaries here), our Lntent was Eo dellneaEe a mlnlmal set of
axloms for a foundatlon of econometrlcs' not physlcs' llngulsElcs
or posslbly psychology. We belleve we achieved our purpose.
FOUNDATIONS OF ECONOMETRICS-REPLY
Contrary to Che lmpresslon glven by Sm11ey, we do not say
thaE all the alCernatives to classlcal loglc w111 be found under
the headlng of "many-valued" loglc. We treat probablllty theory
as a verslon of many-valued logic as suggesEed by Lyndon (L966,
p. 33) under the headlng of "Many-valued 1oglcs." He proposes
that probablllty theory, ln 1ts most prlmltlve form, resembles
many-valued loglc 1n Chat lE atEaches to each formula f as
probabtllty a number p ln the lnterval [0,1]. We are arrrare that
probabllity dlffers from loglc 1n that pr(AUB) does noL depend on
only pr(A) and pr(B) [see Lyndon (1966, p. 33)]. For rhls reason,
a systein of rules of lnference that flEs the probability theory
ls dlfferent fron the system that flts Arlscotelian loglc Isee
McCawley (1981, p. 368)1. Good agrees that probablllty theory
certalnly can be regarded as a verslon of many-valued loglc.
In our dlscusslon of a verslon of many-valued loglc rJe are,
by the way, not dealing wlth degrees of meanlng but wlth degrees
of certalnty. The analogy bet$reen ftzzy set theory and probablllty
theory 1s merely senantlc. We lncluded a discusslon of flzzy
sets 1n Sectlon 7.5 because of 1ts analogles wlth probablllCy
theory. The act of asslgnlng truth values to proposltlons ln
fuzzy sets theory para11e1s the act of asslgnlng probablllties to
sets ln probablllEy theory. We do noE mean to say that the theory
of tuzzy sets ls the same as a theory of (part1a11y-ordered)
probabilitles.
The arguments glven 1n McCawley (1981, Chapter 12)
dlsprove Snlleyts claim that fuzzy log|e cannot handle lnexact
concepts such as approxlmatlon. Our answer to Snl1ey's closlng
questlon 1s no lf by "flzzy concepts" and "vague sEatemenEs" he
neans all those statenents whlch violate the axlorn of the excludedmlddle.
Many-valued loglc can be used wlthout replaclng the axlorn of
noncontradlctlon. The answer to Bolandts questlon, "If st.atements
can be Inany-valuedr' what constitutes a contradlction?" depends on
the type of many-valued log1c we use. Let us conslder probablllty
theory whlch 1s a verslon of many-valued loglc. Glven the trlplet
SI^IAMY, CONWAY, AND VON ZUR MUEHLEN
(S,BrP)=(a sanple space, a Borel field, a probablllty set functlon), a set belonglng Eo B w111 have only one cruth value, vlz.,
a probablllEy. In thls sense, a sEatement cannot have ewo dlfferent
truth values. Boland further asks, "But, can we say thaE a staternent whlch ls, by any measurement,0.5 Erue is also 0.5 false?"
Our answer ls: 1f the probabtllty of a set AeB ls 0.5, then yes,
the probablllty of the complernent of A 1s 0.5. Thls is not a
conEradlctlon. Aga1n, ln Ehe case of probablllty theory, the
axlom of noncontradictlon means coherence ln de Finettlts sense.
Thls answers Bolandrs questlon: "Just what does the axlom of
noncontradlctloo mean when we abandon chis axlom of the excludedmtddle?" Incoherent oplnlons that do not conform to probabillty
calculus are noE free frorn contradlctlons.
Smlley mlsrepresents lnstrumentaliscs by saylng that they
conslder sclence as rnaglc. Whl1e we are not opposed Eo lnstrumenta11sm, we also do not want to relegate econometrlcs to lmmediate
practlcal problems. As Blrnbaurn polnts out, hypothesis testing
can be used for evldentlal lncerpretatlon.
Contrary to what Boland
has said, the questlon of testlng need not necessarlly lnvolve
sErategies and preferences concernlng whether a type I or type II
error 1s the least preferred on1y, but may lnstead concern relatlve evidentlal evaluatlons of trro or nore hypotheses. Idea11y,
we should follow Blrnbaumfs confl.dence concepE: under no rnodel
sha11 there be a hlgh probabllity of outcomes lnterpreted as strong
evldence agalnst thac rnodel. Thls is not to deny the presence of
practlcal dlfflcultles
lnvolved 1n the enplrlcal lmplementatlon
of Blrnbaumts confldence concept desplte lEs undenlable appeal.
Regardlng plpe dreams, \{e are afraid that the econometrics
of che founders w111 remaln 111usory 1f lc 1s based on pesaranrs
prescrlptlon. A requlrement that lts foundatlon should rest on all
of Aristotlers axl-ome includlng the axlom of the excluded-mlddle
nay Just remaln the dream of a mldsunmer nlght. But there ls l1ght
at the end of the tunnel lf Blrnbaunrs confldence concept 1s taken
serlously. More Eo the pol-nt, loglcally conslstent econometrlc
FOUNDATIONS
OF ECONOMETRICS-REPLY
of endogenous
varlables, glven exogenous varlables, rarely, lf ever, produce
degenerate predlctive distrlbuclons of endogenous varlables.
Using nondegenerate predlcclve dlstrlbutlons' we can aL best make
probabllistlc statements about Ehe yet-to-be observed future value
of an endogenous varlable. The probablllties of such statements
wl11 usually be nelthet zero nor one. Alternatlvely, uslng
Blrnbaumrs confidence concept' we may glve an evldentlal evaluatlon of two or more competlng models, as long as they are 1oglcal1y
conslstenc, of course. This procedure nay not glve a strong
evldence agalnst a nodel unless the observaclon/parameter ratlo
is sufficlently high. A Ehlrd approach ls to conpare hypoEheses
concernlng real events by uslng posterlor odds ln favor of one
hypothesls relaElve to another, as ln Zellner (1971). However,
the atteinpt to do thls 1s often thwarted by sample lnformatlon
that ls weak and hence incapable of providlng the econometrlclan
wlth satlsfactory odds ln favor of elther hypothesls. Furthermore,
evaluatlon of posterior odds ls hlghly sensltive to the prlor
distrlbutlon of parameters under che nalntalned hypothesls. In
these cases, one ls not really comparlng hypotheses, but rather
To establlsh
comblnatlons of hypotheses wlth prlor dlstrlbutlons.
of competing hypotheses we may conslder
bounds on plauslblllty
this: As suggested by H111 (1984a), a sample could reveal how
extreme prlor dlstrlbutlons would have to be (wlthin a cercain
class of prlors) l-n order to produce declslve evidence for one
hypothesls versus another. We would then be in a posltlon to say
that such prlor dlstrlbutlons are not ln our "ball-parkr" and
therefore that such daEa are lncapable of glving a clear lndlcatlon
as to whlch rnodel is preferable. Thls program 1s conslsEenE with
Klefer's suggestlon summarized at che end of Sectlon 7.6. Thus' che
app1lcabl1lEy of "fvzzy" econometric cheory based on BLrnbaumrs,
Hl11ts and Klefer's suggesELons would not be lfunlted to bulldlng
economeEric models representlng merely "ftzzy" econonlc theory as
suggested by Boland.
models that specify Ehe condltlonal dlstrlbutlons
swAMY, CONWAY, AND VON ZUR MUEHLEN
Achievlng Coherence
Returnl-ng to some technlcal comments, the accuracy of
Seldenfeldrs clalm that "nany fanlllar (textbook) rorchodoxt
statlsclcal procedures have a Bayeslan model under an rimproperr
prlor" seems s11ghtly exaggerated. Example 1 does not represenc
very many fanlllar (textbook) "orthodox" st.aCistical procedures.
For lnstance, frequentlst admissible procedures that are equally
famlliar are omltced. Thelr Bayeslan descrlptlons do not always
use "inproper" prlors! The "examples" he glves referrlng to
Jeffreysr prlors, flnite addlclvity and non-conglomerablllty, are
perhaps unnecessary 1n vlew of our references to Zellner's (1971),
Vlllegast (1977a, b, 1981) and H111's (1981,1982,1984) work on
those subjects.
We hope that our st.atement, "[1]f an inproper prlor does noc
lead to a posterlor whlch could have been obtalned from a proper,
flnitely addltlve prlor, then such a prlor results 1n incoherence
and should not be used." ls not really as outrageous as
Seidenfeldts crltlcism suggests. Because we hesltated co Eake a
chance on obtalnlng lncoherent lnferences, we felt obllged to err
on the slde of cautlon 1n reconmendlng the use of lnproper prlors.
It ls true that lf Che chosen lmproper prlor has a posterlor whlch
could not be obtalned from a proper, flnit.ely additive prlor, then
the Heath and Sudderth condltlon for coherency cannoE be used to
show that the chosen lmproper prlor 1s lncoherenc. It may be posslble to show, as 1n Hill (1984), that the apparenc 111 consequences
of uslng Ehe chosen lmproper prlor cannot really be nade operational. For thls reason, our cautlon may be excesslve. In any
case, ao lmproper prlor should not be used if 1c could not be shown
by any nethod that the bad consequences of uslng such a prlor
cannot be rnade operatlonal. Uslng a new condltlonal frequency
lnterpretation of statlstlcal Lnferences, V11legas (1977a, 1981)
shows that the posted condltlonal odds based on a posterlor for
a posslbly lmproper prlor can be coherenE.
The followlng are our responses to Seldenfeld's questlons on
FOUNDATIONS OF ECONOMETRICS-REPLY
115
the seventh page of hls conments. Flrst, we conslder lleath and
Sudderthts derivatlon of a necessary and sufflcient eonditlon for
In referring
coherency to be correct. That ls why we accept lt.
co our so cal1ed "lnterestlng clalns" spanning lnporcant contrlbutlons fron A-Z--a11 of which are, lncldentally, referenced with
proofs--Seldenfeld proposes Ewo proofs by assertlon: (1) the
'approprlateness of the added restrlctlons lmposed by 'H-S
coherencet" is to be questloned, and (2) "when ... condltlonal
'H-S lncoherencer can arlse" is an open questlon. If lt ls not
known when "11-S lncoherence" could arlse, how is lt chat one
should reject "H-S coherence?" [,{e understand H111rs (1981, 1982,
1984) argurnent leading to the concluslon that the I{-S requlrement
for coherence 1s too restrl-ctlve co the extent thac le goes beyond
the de Flnettl form of coherence (whlch only requires avoldance of
sure loss wlth a flnlte number of garnbles). Even Ehough the condltlon of coherency 1s fundarnental for de Finettlrs lnductlve 1oglc,
the requirement of conglornerabillty ls not part of that loglc.
As H111 (1981) has suggested, we should learn to undersEand aod
llve wlth non-conglomerablllty. Subjectlvlsts who' 1lke ltlll'
prefer the flnltely addltlve theory, regard Ehe counEably addltlve
sltuatlon as rnerely a speclal case of lnteresE' to be justlfied
or not ln each appllcatlon on lts own merlt. The congloneracl"ve
property should be considered ln the same manner. Whenever we
declde co work wlth a flnltely addlclve theory, the Il-S condltlon for coherency ls useful. We must justlfy the serlous charge
that non-conglonerablllty ls l-rraEional, as Seldenfeld has suggested, only lf we apply the H-S condltion for coherency bllndly
and mechanlcally.
Gidelrs Incornpleteness Theorem
Most practlcloners of the phllosophy of sclence and mathematlcal loglc wl11 appreci-ate attempts to develop the trall from
Arlstotle through Codel to Blrnbaum as long as such efforts pass
the tests that phllosophers of sclence have constructed. Thorough-
l-
16
SI^rAMY
'
CONWAY
'
AND VON ZUR MUEHLEN
ness of coverage \dou1d be one of then. But Seldenfeld appears to
have sone mlsglvlngs concernlng our dlscussion of Godelts lncompleteness theorem, polntlng to a lack of plausible examples ln
economlcs. A confllct bet\teen deductlve conpleteness and loglcal
consistency ln econometrlcs necessarlly arlses 1f the econonetrlc
theorles we are deallng wlth are consistent, axlomaLlzed extenslons
of che theory of naeural numbers, as polnted out by Edward Green
(Unlverslty of PtEtsburgh) who klndly drew our attentlon to an
excellent book by Joseph R. Shoenfleld entltled Mathenatlcal Loglc.
There it 1s scated wlth proof that a conslstent Eheory T ls also
complete lf lt admlts ellnlnatlon of quantlfiers, if lt contalns
a constant, and lf lts every varlable-free formula ls decidable ln
T. In hls comments, Smlley neglected to conslder the posslblllty
that these condltlons are vlolated when rlval hypotheses are combined into a more general model or when one tries to achleve
Dempsterrs or Tversky's type of consi-stency. Likewlse, our
reference to Barnard and Good notwlthstandlng, Srnlley apParently
does not see the relatlonshlp between these conditlons and coherence ln sufficiently large worlds. Let us explaln why we think
that G;de1rs theorem has relevance to a foundatlonal dlscusslon
of ecooometrlcs.
A forrnula f of a theory T 1s decldable ln T lf either f or
not-f ls a theoren of T. The concept of completeness ls related to
dectdablllcy: lf an axlomatlzable theory T is cornplete and consistent, then 1t 1s decidable. For exanple, both predlcate loglc and
arlthnetlc are undecldable though the former ls axLomatlzable and
1s
the latter ls not. As Lyndon (1966, p. 38) polnts out"'lt
the exceptlon rather than che rule that a theory of any genulne
nathematlcal cornplexlty 1s decldable." Llke nost decidabillty
proofs for theorles ln a language with quantlflers, the decldabi11ty proofs for econometric theorles can use the method of
ellmlnatlon of quantl-flers. If econometrlc theorles saclsfy the
lsornorphlsm condltion and the submodel condltlonr then they adnlt
ellnlnatlon of quantLflers [see Shoenfleld, op. cit.r p. 85]. It
FOUNDATIONS OF ECONOMETRICS-REPLY
tr7
ls not posslble to show that every econometric theory satlsfles
these condltlons. One set of undecldabtltty results whose proofs
are lndependent of Godelts proof of the undecidabtllcy of arlthnetlc Ls due to Post and Turlng who escabllsh the undecldablllty of
the questlon of whether an ldeallzed compuElng machlne, glven a
certaln lnput, will come to the end of lts conputation [see Lyndon
(1966, p. 42)1. Thls shows that, contrary to what Good has sald,
Turlngrs idea to base the foundations of nathematics (or econonetrlcs) on a carefully specifled computer was not so good. A
result related to that of Post and Turlng ls the unsolvabllity
of the word problern for senlgroups or groups [see Lyndon (1966,
g.42)l whlch 1s relevant here because lt ls posslble chat some
econometrlc theorles are the lsomorphlsms of the word problem for
semlgroups or groups.
Unacceptable resulEs can be easl1y obEalned lf che undecldabl1lty of formalized econometrlc theorles are lgnored. For
appealing axloms of group rationexample, Ithen two intultively
allty, viz., "external Bayeslanity" and "ltke1lhood prlnclple,"
are used to determlne an oplnlon pool ln a nultl-agent declslon
problem, then a generally unacceptable consensus may arlse whlch
lgnores alt but one of the oplnlons expressed [see Genest (1984)].
One posslble reason for M11Eon Frlednan's perennlal unhapplness
wtth Federal Reserve pollcy 1s perhaps the recurrlng lncompatlblllty of hls opinlons with those of the pools used by the Federal
Reserve. The lnpossibillty of obtalnlng a pollcy concluslon whlch
ls acceptable to all (ratlonal) economists even wlth ldentical
lnformatlon sets provldes one plauslble example of undecldablllty
ln economlcs.
The absence of consensus among economlsts even wlth ldentlcal
lnformatlon sets ls the result of the undecidabillty of certaln
formallzed econometrLc theorles. Thls means that such theorles
can be elther lnconsistent or lncomplete. If we denand conslstency,
then we may not be able to achl.eve completeness. But the axlom of
noncontradlction cannot be satlsfled lf the theorl-es are lnconsls-
SI^]AMY, CONWAY, AND VON ZUR MUEHLEN
tent. Our scatement 1s consistent $71th Nagel and Newmants (1958'
p. 6) observatlon thaL G;de1 "presenEed mathematiclanrs wlth the
asEoundlng and melancholy concluslon that the axlonatlc method has
certaln lnherent llnlcatlons ..." and Hofstadler's (1980, p, 25)
observatlon chat G;delts paper "1n some ways utterly demollshed
"noE only that there were
Hilbertrs program" ... by reveallng...
lrreparable rholesf ln the axlomatlc system proposed by Russell and
Whitehead, but more generally' that no axlonatic systen whatsoever
could produce all number-theoretlcal truths, unless lt were an
lnconslstent system!" If these statements donrt represent a
challange, then what does?! Thus, Godel poses a challenge to
che axlom of noncontradlctlon. This ls not the same as Smlleyrs
clalm that'[Coaef] could only be seen as challenglng the Law of
Contradictlon lf he had urged mathematlclans to sacriflce conslstency for the sake of completeness." The concluslon part of thls
sentence would not even follow from 1ts assumptlon part' Srnileyrs
suggestlon that the whole buslness of Godelts Eheorem ls sonethlng
of a red herrlng ls a red herrlng.
Tn conclusion, to errors and omlsslons catalogued by our klnd
dlscussants we plead not gul1ty. Any remalnlng lmpresslon that
our lnEerpretatlons of classical logic are lncorrect or Ehat our
advocacy of nany-valued logic ls mlsconcelved should, 1n Slr
Wlnston Churchlllrs fellcltous phrase, perhaps be wrltten off
as "gross termlnologlcal lnexactltude." As best as we can judge'
those crltlcisms are not based on a clear, correct, or even fair
understandlng of our paper. Nevertheless' we are wl111ng to
asslgn a truth value greater than zero (but less than one) to the
prospect that some readers w111 not agree wlth all we have to say.
P.A.V.B. Swany
Federal Reserve Board
Roger K. ConwaY
DeparEment of Commerce
P. von zur Muehlen
Federal Reserve Board
FOUNDATIONS OF ECONOMETRICS-REPLY
ADDITIONAL
REFERENCES
Crarn6r, H., (1946). Machernatlcal Methods of Statlsclcs.
Prlnceton Unlverslty Press.
Princeton:
Genest, C., (1984). A confllct
subjectlve dlst.ributlons.
between two axloms for cornblnlng
J.R. Staclst. Soc. B, 46, 403-405.
Hi1l, 8.M., (1982). Conglomerablllty for distributlons that are
only finltely additlve. Technlcal Report /il13, Unlverslty of
Mlchlgan, Ann Arbor.
H111, 8.M., (1984a;. Discusslon of che papers by DeGroot and
Erlksson, Rossl, and Lltterman. Proceedlngs of the Soclal and
Economlc Sectlon, Anerlcan Statlstlcal
Assoclation.
Ilofstadter, D.R., (1980). Codel, Escher, Bach: An ELernal Golden
Brald. New York: Vintage Books.
Lad, F. and Swamy, P.A.V.B., (1985). Enplrical assessments of the
efflclent rLarkets hypothesls: A subjectlvlst analysls of the
variance bounds approach, Special Studles Paper, Federal Reserve
Board, Washlngton, DC.
Nagel, E. and Newman, J.R., (1958). Godelts Proof.
New York Unlverslty Press.
New York:
FOUNDATIONS
OF ECONOMETRICS-REPLY
105
unless the problern of lnduction 1s solvable. Smileyrs dlstlnction
ls therefore not germane. In Sectlon 7.5 we dlscuss sEat,ements
llke, pr(10.5<iT+s<30.8)=p, 0311, and pr(10.5(y1*;!30.8)=0 or 1.
Both these statements, though dlfferent, are true. Therefore,
Snlleyrs asserElon Ehat ne equate people's present 1nabl1ity Co
tell whether fuEure statements are true or false wlth thelr
actually belng nelther true nor false is lncorrect. Furthermore,
he seerns t.o be lmplylng that in our Chlnklng probablllcy ls a
contl-nuum between truth and falslty.
Following Dempster, we
lnterpret probablllties as degrees of certalnty, and thls 1s what
Sniley may have meant.
In a related context, Goodrs renark that "... the statement
thac sonethlng ls approxlnately true can 1tse1f be absolutely
Erue lf anythlng can be." appears to be a semantlc puzzle Lnvolvlng
a contradlctlon. Indeed, Popper (1965), who ls quoted ln Bolandrs
comments, has shown that anythlng can be true if the axiom of noncontradlcElon 1s vlolaced. Popper put great emphasls on the fact
that ln standard logic a contradlctlon leads to loglcal anarchy,
as for example, in Pesaranrs statenent that "[1]f the orlglnal
denand system ls caken to be the truth, then from a strlccly
Loglcal vlewpolnt both approxlrnatlons are false."
Goodfs Justlflcatlon of the sEaEement that no vlolatlon of
the law of the excluded-mlddle follows fron the statenent thac an
event occurs wlch probablllty p, whether Ehe probablllty ls lnterpreted ln the long-run frequency sense or 1n the sub-jectlve
(personal) sense ls couched ln some conslderable quantlty of fuzzy
semantics, thereby undermlnlng itself:
"ntght notr" "somewhat
"mlght
"some
unclearr"
meanr"
er" "mlght be true or falser"
'provlded long run 1s not too 1ong." If che parameter "p" 1n the
statement, "the probablllty of an event 1s pr" ls characterl-zed
ln such semantlc hedges, then lt ls, of courser funposslble to
determlne lf a specifled value of p ls true or false. Therefore
ee are justlfled ln assumlng that the probablllCy of an evenc ls
approxl.nately p. But thls assunptlon lnvolves the vlolatlon of