Asda PDF
Asda PDF
Asda PDF
J.W. MERKS
Degrees of freedom also occur in the
The variance is the fundamental
basic formula for the covariance, but
J.W. Merksis Dresident olMalrix
Consulrants
limited,
measure for random variations in stothey cancel in a simplified formula.
r . Columbia, Canada. nflnmeelin0 lla!Jer 95- However, covariances cannot be
chastic variables and stochastic sys- V a ~ o u ~ eBritish
319. Ori0inal mafluscri~lSe~lember1995; revised manuscri~l tested for statistical significance withterns. Variances are amenable to
r e asproved
v i e w e dPaller out taking degrees of freedom into
mathematical analysis. The proper- A P ~ I I ~ u~i ~ ~I .~ ~ ~ i f l n f l l l h i ~ ~ e e ~and
is
invited
and
must
be
submilled,
in
duplicare,
prior
to May 31,
ties of variances are derived from the
account,
variance of a general function. The
1991.
Associative dependence. Associaarithmetic mean and the weighted avtive dependence exists if the degree
erage are both functionally depenof association between two or more
dent variables of sets of measured values. The central
variables is statistically significant. An example in minlimit theorem defines the relationship between the varieral exploration is associative dependence between
ance of a set of measured values and the variance of the
grades and densities in massive sulfides. A significant
arithmetic mean of the set.
degree of associative dependence permits regression paThe variance of the weighted average, despite its sigrameters to be used to calculate densities from grades
nificance in mining, metallurgy and exploration, is not as
and the variance of density from the variance of grade.
well documented as the central limit theorem. As a reMeasured values in n-dimensional sample spaces
sult, kriging variances and covariances of sets of kriged
may display a significant degree of associative depenestimates (distance weighted averages) could evolve into
dence, which is frequently referred to as spatial depenthe essence of geostatistics.
dence. Ordered sets of on-stream data at mineralprocessing plants almost invariably display a significant
Dependenceand independence
degree of associative dependence.
Functional or mathematical dependence. The arithmetic mean is a function of a set of measured values with
The variance of the arithmetic mean
identical weight factors, and the weighted average is a
The arithmetic mean is a functionally dependent
function of a set of measured values with different
variable of a set of measured values with equal weight
weight factors. The requirement of
factors. It can be formulated as folmathematical or functional indepenlows
dence invalidates the variance and Abstract
covariance of a set of distance
Applied statistics provides % = (l/n)*x, + ... + ( l / n ) * x , + (1)
weighted averages. The concept of scores of powerful tests and tech- ... + (l/n)*x,
degrees of freedom in applied statis- niques that are widely used and
tics is the corollary of the require- abused in science and engineering. where
ment of functional independence in The variance is the basic measure k = arithmetic mean of a set of n meaprobability theory. Degrees of free- for variability, precision and risk.
surements,
dom are awarded when values are The formulas for the variances of xi = ith measured value and
measured but not when they are cal- the arithmetic mean and the n = number of measurements in the
culated from measured values. It is weighted average are derived from
set.
beyond the scope of this paper to the variance of a general function,
The variance of the arithmetic
prove that adding functionally depen- as defined in probability theory. mean can be derived from the fordent values to a set of measured val- The requirement of functional or mula for the variance of a general
ues does not add degrees of freedom mathematical independence in function. Intermediate steps in the
to the set.
probability theory translates into derivation are based on the premises
Because degrees of freedom are the concept of degrees of freedom in that the squared partial derivative for
fundamental in applied statistics, they applied statistics. Ceostatistics is each term in Eq. (1) is (l/n)2and that
occur not only in the formulas for the only variant of statistics in the variance of the arithmetic mean is
variances but also in many statistical which functional dependence and the sum of n terms as follows
tables. For example, the denomina- degrees of freedom are irrelevant.
tors in the formulas for the variances This paper examines whether such vat-(%)= (l/n)2*var(x,) +
of randomly distributed and ordered basic elements of probability and ... + (l/n)2*var(xi)+ ... +
(2)
sets of measured values are degrees statistics can be ignored with impu- ( l / n ) 2 * var(x,)
of freedom (Gy, 1979; Merks 1993). nity.
Introduction
78
When a set of measured values of a variable is obtained by applying the same measurement procedure to
the same sample space, the variance of the set is an unbiased estimate for the variance of each term, and the for-'\,
mula for the variance of the arithmetic mean becomes
TABLE 1
where
x = weighted average of a set of n measurements,
w, = ith weight factor,
x,= ith measured value and
n = number of measurements in the set.
Because the squared partial derivative for the ith
term is wi2,the variance of the weighted average of a set
of n measured values with weight factors wi is
Macro
count
Mass,
Kg
Weight
factor
4
2
15
19
40
16
12
64
72
164
0.098
0.073
0.390
0.439
1.OOO
Table 1 lists a set of macrodiamond counts determined in test samples of different mass.
The macrodiamond counts and sample masses in
Table 1 have one, and only one, mass weighted average,
namely, 0.098*4 + 0.073*2 + 0.390*15 + 0.439*19 = 14.7
or approximately 15 macrodiamonds. The variance of
this mass weighted average is 0.09S2*4 + 0.0732*2 +
0.3902*15+ 0.4392*19 = 6.0, for a standard deviation of
d6.0 = 2.45, a 95% confidence interval of 2*2.45 = k4.9 or
4.9*100114.7 f 33% and a 95% confidence range with a
lower limit of 14.7 - 4.9 =: 10 and an upper limit of 14.7 +
4.9 = 20 macrodiamonds.
The previous example shows how to calculate the
variance of the mass weighted average of a set of
macrodiamonds counts determined in test samples of
different mass. The next example is based on a hypothetical set of uranium data (Clark, 1979) and shows how
to calculate the variance of a distance weighted average
of a set of values measured at different positions within
a two-dimensional sample space.
Table 2 lists the measured values and their positions,
the position for which the weighted average is calculated,
the distances between the measured values, the inverse
distance for each position and its weight factor.
The arithmetic mean of the set of uranium values in
Table 2 is 366 ppm, and the variance of the randomly distributed set is 4,480 ppm2. The central limit theorem
teaches that the variance of the arithmetic mean is 4,4801
5 = 896 ppm2, for a standard deviation of 6896 = 29.9
ppm, a 95% confidence interval of t0.95;4*29.9 =
2.776*29.9 = +83 ppm or 83*1001366 = f23% and a 95%
confidence range with a lower limit of 366 - 83 = 283
ppm and an upper limit of 366 + 83 = 449 ppm. The tabulated t-value of 2.776 at 95% probability reflects that a
set of five randomly distributed measurements has 5 - 1
= 4 degrees of freedom.
79
TABLE 3
Qtl
ccaallnav
iaa7
MINING ENGINEERING
Summary
The variance is the essence of probability theory and
applied statistics. The properties of variances are derived
from the variance of the general function as defined in
probability theory. The formulas for the variances of the
arithmetic mean and the weighted average are derived
from the formula for the variance of a general function.
Arithmetic means and weighted averages are functionally dependent variables of sets of measured values, but
arithmetic means are based on measured values with
identical weight factors and weighted averages on measured values with different weight factors.
The distance weighted average and its variance are
of crucial importance in mineral exploration. Set theory
teaches that two or more measured values of a variable
in an n-dimensional sample space define an infinite set
of positions and, thus, an infinite set of distance weighted
averages. In applied statistics, every distance weighted
average of the infinite set has its own variance.
In geostatistics, the distance weighted average metamorphosed into the kriged estimate (Clark, 1979; David,
1977). a simple genesis that evolved into a bizarre variant
of Byzantine complexity, a synthesis of probability without functional independence, statistics without degrees
of freedom and geomathematics with abandon. This
geostatistical variant of probability theory and applied
statistics is implemented with dogmatic determination to
emulate static stochastic systems of immense intricacy
and variability.
David (1977) alludes to the variance of the arithmetic mean as "the famous central limit theorem" and
mentions that it gives the variance of the mean of "a
group of n independent samples" but omits the variance
of the weighted average, which is, in fact, a homologue of
the central limit theorem. H e predicts correctly that
"statisticians will find many unqualified statements" in
the "few pages of theory" in which he "firmly grounds
the model" and that "the terminology may seem barbarous to statisticians" but appears unaware that the methodology, too, is barbarous.
David refers to "Student's t" but does not mention
that tabulated values of the t-distribution are a function
of degrees of freedom. H e refers to "independent
samples" but does not differentiate between spatial dependence and functional dependence. Not surprisingly,
his chapter on kriging does not refer to the requirement
of functional or mathematical independence. On the
contrary, David intimates that the ability to calculate the
variance and covariance of a set of functionally dependent variables (kriged estimates) is a test for geostatistical perspicacity.
Armstrong and Champigny (1989) caution against
oversmoothing, which seems to imply that the requirement of functional independence can be violated a little,
but they fail to define the perfect degree of smoothing.
The authors observe that "the kriging variance rises up
to a maximum and then drops off.'' and submit that this
extraordinary behavior is described by Brooker (1992)
who praises its robustness.
Rendu (1994) worries about "an endless list of
MINING ENGINEERING
FEBRUARY 1997
81
e c o m l I A ~ V1007
r MINING ENGINEERING
References
Armstrong, M.. and Champigny, N., 1989, "A study on kriging
small blocks," CIM Bulletin,Vol. 82, No. 923, March.
Armstrong, M., 1992, "Freedom of speech?." De Geo,statisticr.,
Julius MCMXCII, Numero 14.
Clark, I.. 1979, Practical Ceostatistics Applied Science Publishers.
Barking. England.
Brooker. PI.. 1986, "A parametric study of robustness of the
kriging variance as a function of the range and the relative nugget effect for a spherical semivariogram." Jourr~ulof Mathematical Geology,
Vol. 18, No. 5 .
David, M., 1977, Geostarislical Ore Reserve Esritrration, Elsevier
Scientific Publishing Company, Amsterdam.
Gy, P., 1979, Sampling of Particulate Materials, Elsevier Scientific
Publishing Company, Amsterdam.
r~.~
GrolMatheron, G., 1963. Prir~ciplesof C e o s ~ u t i s ~Economic
ogy, Vol. 58.
Merks, J.W., 1993,"Abuse of statistics." CIM Bulletit~.Vol..86. No.
966, January.
Philip. G.M., and Watson, D.F.. 1986, "Matheronian geostatistics
Q u o vadis'?," Journal of Malhemarical Geology. Vol. 18. No. 1 .
Rendu. J-M., 1978, "Kriging, logarithmic kriging and conditional
eupectation: Comparison of theory with actual results." 16th
APSCOM Svmposilrnz, Chapter 19.
Rcndu, J-M., 1994. "Mining geostatistics
Forty years passed. What Lies Ahead'?." Mining Engitrc,ering.
Vol. 46, No, 6. June.