Dixon Test
Dixon Test
Dixon Test
139-146 139
tetradecanol) are separated and detected, quantitation is and reversed-phase mechanism. Figure 9 shows that the
hindered by poor solubility and the formation of micelles. p-toluenesulfonic acid and the cefazolin are both detected by
Applications. Since a prerequisite for PAD reactivity is UV a t 254 nm (A), and the two diols, which do not have a
adsorption of the analyte, the presence of other surface-ad- chromophore, are easily detected by PAD (B). In addition,
sorbable substances, as well as electroactive compounds, can the cefazolin has a PAD signal, which has a higher limit of
act as interferences. Therefore, general selectivity is achieved detection than UV, but may be utilized for added selectivity.
via chromatographic separation prior to PAD. This conclusion
does not preclude additional selectivity from control of de- LITERATURE CITED
tection parameters. Kisslnger, P. T. I n Laboratory Techniques in Electroanalytical Chemis-
The assay for alcohols was applied to several matrices to try; Klssinger, P. T., Heineman, W. R., Eds.; Marcel Dekker: New
York, 1984; pp 631-32.
illustrate the analytical utility of the procedure. Separation Adams, R. N. Electrochemistry at SolM Electrodes; Marcel Dekker:
on an ion-exclusion column with direct detection is illustrated New York, 1969.
Gllman, S . I n E/ectroana/ytica/ Chemistry; Bard, A. J., Ed.; Marcel
in Figure 8 for various aliphatic alcohols and polyalcohols in Dekker: New York, 1967; Vol. 2, pp 111-92.
toothpaste (A), liquid cold formula (B), brandy (C), and wine Fleet, B.; Little, C. J. J. Chromtogr. Sci. 1874, 72, 747.
cooler (D). The selectivity for alcohols in acidic media at a Van Rooljen, H. W.; Poppe. H. Anal. Chim. Acta 1881, 730,9.
Hughes, S.; Meschi, P. L.; Johnson, D. C. Anal. Chim. Acta 1881.
Pt electrode contributes to decreased time for sample prep- 732,1.
aration and simplified chromatograms. Johnson, D. C.; Lacourse, W. R. Anal. Chem. 1890. 62,589A.
Nachtmann, F.; Budna. K. W. J. Chromtogr. 1977, 736,279.
The versatility of separations on mixed-mode ion-exchange Jupille, T. J. Chromatogr. Sci. 1978, 77, 160.
columns with selective detection is illustrated in Figure 9 by Beden, B.; Cetin, I.; Kahyaoglu. D.;Takky, D.; Lamy, C. J. Catal.
the simultaneous detection of ionic and neutral species in a 1987, 704, 37.
Ocon, P.; Alonso, C.; Celdran, R.; Gonzalez-Velasco, J. J. Electroanal.
pharmaceutical preparation. This experiment utilizes a UV Chem. 1888, 206, 179.
detector and PAD in series after a PCX-500 column. Under Slingsby, R. W.; Rey, M. J. Li9. Chromatogr. 1880, 73(1), 107.
Lacourse, W. R.; Jackson, W.A.; Johnson, D. C. Anal. Chem. 1888,
acidic conditions, the cephalosporin antibacterial consists of 67, 2486.
a cation (i.e., cefazolin) and neutral and anionic compounds
(i.e., lB-hexanediol, 1,4-cyclohexanediol,and p-toluenesulfonic
acid). The neutral and anionic compounds are separated by RECEIVED for review July 20,1990. Accepted October 8,1990.
the reversed-phase character of the column, while the cationic The financial support of Dionex Corporation is acknowledged
compound is separated by a combination of cation exchange with gratitude.
CrHlcal values at the 95% Confidence level for the two-tailed is interesting to note that these treatises, as well as essentially
0 test, and related tests based upon subrange ratlos, for the all analytical chemistry textbooks published in the U.S.during
statlstkal rejectlon of outlying data have been Interpolated by the past decade (3),have settled on the use of Dixon’s Q test
applying cublc regresslon analysls to the values orlglnally (and variants thereof) ( 4 ) as the primary method for testing
published by Dlxon. Corrections to errors In Dixon’s orlglnal for the rejection of outlying values.
tables are also Included. The resultant values are judged to Each of the recent statistical treatises written for analytical
be accurate to wlthln f0.002 and corroborate the fact that chemists has attempted to include critical values of Q for the
correspondlng crltlcal values published In recent slatlstlcal 95% confidence level, values that were not included in Dixon’s
treatlses for analytical chemlsts are erroneous. I t Is recom-
publications. However, not only do the 95% confidence values
differ in each treatise but all compilations contain significant
mended that the newly generated 95% crltlcal values be
errors. The most legitimate set of 95% values is that presented
adopted by analytical chemlsts as the general standard for
by Miller and Miller (4 In I10) ( l a ) ,which they attribute
the rejection of outller values. to King (5),but no such values are listed in King’s article, and
the values of Miller and Miller differ by amounts varying from
0.002 to 0.007 from the 95% values presented in the current
Analytical chemists depend upon the generation and in- manuscript. Anderson ( I b ) describes the equations corre-
terpretation of precise experimental data. As a result, they sponding to the two-tailed tests for Dixon’s parameters
are especially cognizant of the value of statistics in data designated as rl0 (for 3 5 n 5 lo), rll (for 8 5 n Ilo), and
treatment, and a number of statistical treatises have recently rZ1(for 11 In I13) and purportedly lists critical values for
been published that are specifically written for the professional the 90%,95%, and 99% confidence levels for these sample
analytical chemist ( I ) . Included in each of these publications sizes, but the values actually listed in his table are Dixon’s
is a brief section dealing with tests for the rejection of grossly values for one-tailed tests. Thus,as applied to two-tailed tests,
deviant values (outliers). Although many statistical tests have Anderson’s confidence levels should be labeled 80%, 9070, and
been proposed to deal with this topic [Barnett and Lewis (2) 98% (vide infra). Caulcutt and Boddy (IC), while describing
discuss 47 different equations designed for this purpose], it only the equation for the Q (i.e., rl0)ratio, accurately list both
0003-2700/91/0363-0139$02.50/0 0 1991 American Chemical Society
140 ANALYTICAL CHEMISTRY, VOL. 63, NO. 2 , JANUARY 15, 1991
95% and 99% confidence level critical values for this ratio such values can be attributed to random variation alone within
for five sample sizes, 3 5 n 5 7 ; the values given in their Table some reasonable level of probability. If the probability of
G (p 248) for 8 5 n 5 12 and for 13 In 5 40 are clearly not obtaining such outliers is determined to be very small, based
Q values but are somewhat similar to (though not identical on the overall population variance, one may reasonably con-
with) the critical values for rI1and r21,respectively. However, clude that this hypothesis is incorrect, i.e., these values cannot
the equations /or these latter parameters are not euen be attributed to simple random variation and are likely the
mentioned in this treatise. result of a determinate error. On this basis, suspected values
Faced with this confusing and conflicting array of critical may be rejected in a statistically legitimate fashion, provided
values for the popular Q test and Dixon’s related tests based that the confidence level chosen is a reasonable one.
on various subrange ratios, it seems likely that many erroneous Confidence levels commonly used for tests involving the
statistical inferences are being made daily in analytical rejection of data are 99%, 95% or 90%, the higher levels
chemistry laboratories throughout the world. To counteract representing the more conservative approach. Operating a t
this possibility and to extend the utility of Dixon’s parameters a high confidence level reduces the likelihood of rejecting a
for two-tailed tests, the current paper provides a reliable set legitimate value containing no determinate error (i.e., it re-
of critical values a t the 95% confidence level for the Q pa- duces the so-called a risk or “Type I error”), but it also in-
rameter and for all of the related subrange ratios proposed creases the likelihood of retaining a value that contains a
by Dixon. I t also corrects several errors found in Dixon‘s determinate error (Le., it increases the so-called /3 risk or “Type
original published tables. At the same time, it is intended I1 error”) ( I O ) . The consensus of most statisticians is that the
to alert analytical chemists to the problems that exist in former error (rejecting a “good” value) is considered more
current treatises and, hopefully, may serve as a useful source serious and, therefore, the use of high confidence levels in the
for future textbook and treatise authors dealing with this topic. rejection of data is greatly preferred. For example, when
operating a t the 99% confidence level ( a = 0.01) with a
CRITERIA FOR REJECTION OF OUTLIERS population containing no outliers, only 1% of the good values
Determinate Errors and Outliers. In any set of ana- will be rejected. In contrast, when operating a t the 90%
lytical measurements, one or more values may incorporate a confidence level ( a = 0.10), 10% of the good values will be
determinate (systematic) error. Such errors may involve either rejected and, for small sample sizes, the effect upon the es-
a “locator” error (displacement of the population mean) or timation of the population mean becomes significant relative
a “scalar” error (increase in the population variance) or a to the accuracy desired by most analytical procedures.
combination of the two ( 4 ) . Values that are affected by a [However, Dixon has argued that the use of rejection tests a t
determinate error belong to a different population of mea- the 9 0 7 ~confidence level is warranted in many cases ( I I ) . ]
surements. Thus, if the presence of a determinate error is Since the use of a 99% confidence level ( a = 0.01) allows
known to the individual making the measurements, the values for the rejection of only the most extreme deviations-
so affected should automatically be rejected, regardless of their particularly when applied to small sample sizes-the /3 risk
magnitude. (For a brief discussion on the detection and of retaining a value that does not belong to the general sample
elimination of determinate errors in analytical measurements, population (Le., a “bad” value) is quite large, and most ex-
see ref 3b.) However, in many instances, determinate errors perimentalists are not satisfied when operating a t this level
are incurred that are not detected by the experimentalist. in testing for the rejection of data. Operating a t the 95%
If an undetected determinate error affects only a fraction confidence level ( a = 0.05) provides a reasonable compromise.
of the values in the total sample and if the error is sizable If values not rejected at the 95% confidence level are viewed
relative to the experimental precision of the remaining values, with suspicion, additional measurements are probably war-
the affected values may be significantly larger or smaller than ranted. It should be noted that no specific test or choice of
the values that are not affected by such errors; ;.e., they will confidence level is ideal, however. As Natrella has noted, “the
be observed as outlier values. Recognizing this, experimen- only sure way to avoid publishing any ‘bad’ results is to throw
talists tend to suspect the presence of determinate errors away all results” (12). The converse, of course, is also
whenever outliers are observed in a set of data, and there is true-the only sure way to avoid discarding any good results
a strong motivation to eliminate such outliers since they is to retain them all.
significantly affect the sample mean and, therefore, the final A number of tests, such as those based on the “Student’s
estimate of the population mean. In the absence of inde- t distribution” (13), require independent knowledge of the
pendent knowledge that determinate errors are responsible population standard deviation (cr) and the population mean
for the appearance of outliers, however, the decision to delete ( k ) or the uncontaminated sample standard deviation (s) and
deviant values must be based upon reasonable statistical the uncontaminated sample mean (8) for their application
evidence. Some statisticians look askance at any attempt to (2). In the absence of such independent information, these
discard deviant values by means of statistical criteria. Deming, tests are of limited use since a decision must be made as to
a highly respected authority in the industrial application of whether the suspected values should be included or excluded
statistics, has concluded that “a point is never to be excluded in the calculation of s and X . Neither decision is satisfactory
on statistical grounds alone” (6). However, the majority of as the choice made automatically biases the outcome of the
those who have addressed themselves to the statistical test. [The often-cited “Chauvenet’s criterion” for the rejection
treatment of scientific measurements would appear to agree of a deviant measurement (2, 7) is also based on the sample
with Parratt’s statement that “rejection on the basis of a hunch standard deviation and, therefore, poses the same problems
or of general fear is not a t all satisfactory, and some sort of when applied to small samples; it has been noted that this
objective criterion is better than none” ( 7 ) . In fact, the sta- criterion is set to reject, on average, half an observation of good
tistical treatment of outlier values has received increasing data per sample, regardless of the sample size (8,14).] For
attention in recent decades and the various statistical tests analytical measurements, where cr and g (and uncontaminated
that have been proposed are described and compared in texts values of s and X ) are generally not known independently,
(2,8) and an extensive review (9) dealing specifically with this tests that do not require the use of these quantities are greatly
topic. preferred.
Tests for Rejection of Grossly Deviant Values. Sta-
tistical approaches used for identifying values that are affected APPLICATION OF DIXON’S RANGE TESTS
by determinate errors are based on testing the hypothesis that The Q Test and Related Subrange Ratios. In a classic
ANALYTICAL CHEMISTRY, VOL. 63, NO. 2, JANUARY 15, 1991 141
1950 article ( 4 ) ,Dixon investigated the performance of several The rl0 ratio is commonly designated as "Q" and is generally
statistical tests in terms of their ability to reject bad values considered to be the most convenient, legitimate, statistical
in data sets taken from Gaussian populations. The tests test available for the rejection of deviant values from a small
investigated included both those which require independent sample conforming to a Gaussian distribution. (It is equally
knowledge of u or s and those which do not require such well suited to larger data sets if only one outlier is present.)
information. Of the tests included in the latter group, Dixon The fact that small data sets are common in analytical testing
concluded that tests based on ratios of the range and various procedures, in combination with the simplicity of this test,
subranges were to be preferred as a result of their excellent accounts for the fact that the Q test is included in nearly all
performance and ease of calculation. [Dixon also noted that modern statistical treatises and textbooks designed for use
another test which performs well in screening for outliers is in analytical chemistry (1, 3).
a modified F test, in which the ratio of the standard deviations A few authors (17), following Deming's viewpoint (vide
calculated by including and excluding the suspected deviant supra), object to the rejection of data from any small sample
value is compared to critical values of F; however, this latter based on statistical tests, claiming that the amount of in-
test may be "masked" by a second deviant value.] The range formation available is insufficient to establish the distribution
tests, all of which are closely related, include the following pattern; recommended alternatives include dropping the
(where the values are ordered such that x1 < x 2 < ... < x,-~ highest and lowest values (for a sample containing five or more
< x,): values) or reporting the median. However, this would appear
1. For a single outlier x1 to be overly cautious since most series of repetitive analytical
measurements follow a Gaussian distribution (18) provided
that s is small compared to X (thus, the Q test is not applicable
to analytical measurements when operating close to the de-
2. For outlier x1 avoiding x, tection limits). Dixon has tested the relative merits of the
sample mean and median as an estimator of the population
mean under various conditions (11,15)and has concluded that,
for the most part, the sample mean (after the rejection of
outliers) appears to provide a better approximation than does
3. For outlier x1 avoiding x,, x,-, the median.
In his original calculation of the critical values of the various
r criteria (1.9, Dixon was able to obtain exact solutions only
for the case where n = 3 or 4. Critical values for n = 5 , 7 , 10,
4. For outlier x l avoiding x2 15,20,25, and 30 were calculated by using numerical methods.
All other values were obtained by interpolation and were
generally judged to be accurate within fO.OO1.
An irritating feature of the Q test and Dixon's related su-
brange ratio tests, as they currently exist, is the lack of suitable
5. For outlier x1 avoiding x2 and x, critical values of Q (and rll, r21,etc.) a t the 95% confidence
level (a = 0.05), since this confidence level is frequently utilized
for all other statistical tests. The lack of 95% confidence level
values arises from the fact that Dixon generated critical values
a t several standard probability levels ( a = 0.005, 0.01, 0.02,
6. For outlier x1 avoiding x2 and x,, 0.05, etc.) corresponding to 99.5%, 99%, 98%, 95%, etc.,
confidence levels in terms of a one-tailed test (15). As com-
monly applied by analytical chemists and other experimen-
talists, however, these tests are used as two-tailed tests (i.e.,
(The parenthetical equations are designed for testing x,, one is generally interested in testing outlier values at both the
the highest value rather than the lowest value, xl.) In Dixon's upper and lower ends). The probability that an individual
notation, the first digit in the subscript of each ratio, rij, refers good value may lie outside a specified interval in either tail
to the number of possible suspected outliers on the same end (Le., on either the high or low end) is twice as large as the
of the data as the value being tested, while the second digit probability that it lies outside the chosen interval in only one
indicates the number of possible outliers on the opposite end tail. As a result, the a risk doubles for a two-tailed test (18)
of the data from the suspected value. Thus, the ratio rl0 and the confidence level decreases accordingly. Thus, Dixon's
simply compares the difference between a single suspected 99.5%, 99%, 98%, and 95% confidence levels translate into
outlier (xl or x n ) and its nearest-neighboring value to the 99%, 98%, 96%, and 90% levels, respectively, when consid-
overall range of values in the sample-in other words, it de- ering a two-tailed test, a fact not recognized by some authors
termines the fraction of the total range that is attributable ( I b , 3e,m). Since Dixon did not calculate one-tailed critical
to one suspected outlier. The other ratios are similarly for- values at the 97.5% confidence level, there have been no values
mulated except that they use subranges that are specifically available a t the 95% level when using these tests as two-tailed
designed to avoid the influence of additional outliers either tests. Therefore, about half of the current analytical chemistry
on the opposite end of the data (rll and r12),on the same end textbooks (3a-d,h) listing valid critical values of Q list the
of the data (rm),or both (rZ1and rZ2). Clearly, the latter ratios values for the 99%,96%, and 90% confidence levels, despite
require larger sample sizes to perform satisfactorily. Dixon the fact that the 96% confidence level is somewhat unor-
subsequently generated critical values for all of these ratios thodox, while the remainder (3fg,i-l) provide only 90%
(15)for sample sizes of 3 In 5 30 and recommended (based confidence level values. As noted earlier, the treatises that
on a combination of the relative performance of each ratio have attempted to present critical values for Q at the 95%
and its degree of independence from other outlying values) confidence level (I) have invariably listed flawed values.
that, as a general rule, the various ratios be applied as follows [In tests designed to detect the presence of a single outlier,
(16): for 3 In 5 7, use rlo;for 8 In I10, use r l l ; for 11 I King has argued (5) that the effect of running a two-tailed
n I13, use rZl;for n L 14, use rZ2. test is approximately to double the a risk relative to that for
142 ANALYTICAL CHEMISTRY, VOL. 63, NO. 2, JANUARY 15, 1991
Table I. Critical Values of Dixon's r I o( Q ) Parameter As Applied to a Two-Tailed Test at Various Confidence Levels,
Including the 95% Confidence Level"
confidence level
80 7c 90 7c 95% 96 o/o 98% 99 ?&
N b ( a = 0.20) ( a = 0.10) ( a = 0.05) ( a = 0.04) ( a = 0.02) ( a = 0.01)
Table 11. Critical Values of Dixon's r l lParameter As Applied to a Two-Tailed Test at Various Confidence Levels, Including
the 95% Confidence Level
confidence level
80 % 90 % 95 % 96 Yo 98% 99%
N O ('N = 0.20) ( a = 0.10) ( a = 0.05) ( n = 0.04) (1 cr = 0.02) ( a = 0.01)
a one-tailed test; but, for very small sample sizes, the effect by independent knowledge of the true distribution of the
m a y be slightly less than double depending upon the dis- population. This argument implies that, at worst, assuming
tribution pattern of the data since the tail containing the a doubling of the a risk upon going from a one-tailed test to
perceived deviant is determined by the sample rather than a two-tailed test will result in critical values that are slightly
ANALYTICAL CHEMISTRY, VOL. 63, NO. 2, JANUARY 15, 1991 143
Table 111. Critical Values of Dixon’s rI2Parameter As Applied to a Two-TailedTest at Various Confidence Levels, Including
the 95% Confidence Level
confidence level
80 ‘70 90 % 95 % 96 % 98 ‘70 99 %
NQ (a = 0.20) (a = 0.10) (a = 0.05) (a = 0.04) (a = 0.02) (a = 0.01)
too high, and the resulting decisions that are made will be functions could generally be extended to include the data for
overly conservative. As shown by comparison of Dixon’s the 60% and 40% confidence levels but, except as noted
original one-tailed critical values for 95% confidence (15) with below, these values were not included in fitting the cubic
his two-tailed critical values for the 90% confidence level (16b), regression curves since the data in the lower confidence level
it is clear that Dixon assumed a doubling of the a risk.] region did not significantly affect the calculation of the 95%
Interpolation of Critical Values at the 95% Confidence values.
Level. As noted above, Dixon was able to obtain exact so- Based on the foregoing analysis, the critical values of Q a t
lutions for the various critical values only for the cases where the 95% confidence level were then calculated from the cubic
n = 3 or 4. Although the general form of the equation for n regression curve for each sample size and were found to be
2 5 has been presented (15),the specific expressions for the within fO.OO1 of the values obtained graphically. Cubic
central density function vary with each sample size, and these functions were subsequently fitted to the 99%, 98%, 96%,
expressions have not been published. Therefore, in the current 90%, and 80% confidence level data for each of the other ratio
work, appropriate two-tailed critical values of Q at the 95% functions defined by Dixon. To check the veracity of the
confidence level were initially estimated by plotting the generated equations, the critical values for 96% and 90% were
(two-tailed) critical values for the 99%, 98%, 96%, 90%, and also calculated and, in each case (except as noted below), were
80% confidence levels as generated by Dixon. The graphically found to be within *0.001 of Dixon’s tabular values.
interpolated values for 95% Q were then checked by using In using this approach, it was noted that cubic equations
regression analysis to determine the best fitting empirical could not be fitted to the rz0 critical values for n I19. After
polynomial functions to the values that Dixon published; these examining Dixon’s tabular data carefully, it was discovered
functions were then solved for the appropriate critical values that the 90% confidence level critical values showed a dis-
of Q a t the 95% confidence level. continuity in this region. T o circumvent this problem, the
With the exception of cases in which the original Dixon 60% confidence level values were included in fitting the rz0
tables contained apparent errors (vide infra), it was found that data to cubic equations. In the region 4 5 n I18,the critical
a cubic function provided an optimal fit to the Q values in values of rz0at the 95% confidence level obtained by including
this region. Interestingly, the use of a quadratic function and excluding the 60% critical values were within f0.0002,
generally yielded the same values of 95% Q, to three signif- i.e., undetectable to three significant figures (see Figure 1).
icant figures, as those obtained from a cubic function despite For n I19, the 90% values were then omitted. In this manner,
a notably poorer fit; fourth-power functions also produced the cubic expressions were generated that provided excellent fits
same 95% values to within fO.001 but were less sensitive to to the data and permitted both the 95% and new 90% critical
errors in the tabular data. It was also noted that the cubic values to be computed. Interestingly, in comparing the newly
144 ANALYTICAL CHEMISTRY, VOL. 63, NO. 2, JANUARY 15, 1991
Table IV. Critical Values of Dixon's rzoParameter As Applied to a Two-Tailed Test at Various Confidence Levels, Including
the 95% Confidence Level
confidence level
80 % 90% 95 % 96 C7c 98 % 99 %
h'" ( n = 0.20) ( a = 0.10) ( n = 0.05) ( a = 0.04) (a = 0.02) ( a = 0.01)
Table V. Critical Values of Dixon's r I IParameter As Applied to a Two-Tailed Test at Various Confidence Levels, Including
the 95% Confidence Level
confidence level
80 % 90 % 95 % 96 % 98% 99 %
N" (a = 0.20) (a = 0.10) (a = 0.05) (a = 0.04) (a = 0.02) (a = 0.01)
been carefully cross-checked, and the 95% critical values are (15)were themselves generally accurate to within *0.001). The
judged to be accurate within fO.OO1 relative to the accuracy values shown cover the entire range of sample sizes (3 5 n I
of the values at the other confidence levels (which Dixon stated 30) included in Dixon's original article.
146 Anal. Chem. 1991, 63, 146-15 1
It is suggested that the critical values of Q and the related titative Analysis: Theory and Practice; Harper 8 Row: New York,
r criteria which have been generated for the 95% confidence 1987; pp 78-80. (f) Rubinson, K. A. Chemical Analysis; Little, Brown:
Boston, 1987; pp 162-164. (9) Day, R. A., Jr.; Underwood, A. L.
level should be used routinely by practicing analytical chemists Quantitative Analysis, 5th ed.; Prentice-Hall: Englewood Cliffs, NJ,
in testing for the rejection of outliers since this confidence 1986; pp 29-31. (h) Manahan, S.E. Quantitative Chemical Analysis;
Brooks Cole: Monterey. CA, 1986; pp 74-75. (i)Kennedy, J. H. Ana-
level provides a reasonable compromise between ultracon- iyticai Chemistry: Principles, 2nd ed.; Harcourt, Brace, Jovanovich:
servatism and the overzealous rejection of deviant values. New York, 1990; pp 35-39. (i) Harris, D. C. Quantitative Chemical
These values should also be incorporated into future analytical Analysis; Freeman: San Francisco, 1982; pp 51-52. (k) Ramette, R.
W. Chemical Equilibrium and Analysis ; Addison-Wesley: Reading, MA,
chemistry treatises and textbooks dealing with tests for the 1981; pp 53-54. (I) Christian, G. D. AnalyticalChemistry; Wiley: New
rejection of data to provide a uniform set of critical values York, 1980; pp 78-79. (m) Flaschka, H. A,; Barnard. A. J., Jr.; Stur-
rock, P. E. Quantitative Analytical Chemistry; Willard Grant: Boston,
a t this standard confidence level. 1980; pp 19-20.
As a concluding comment, it should be noted that recent Dixon, W. J. Ann. Math. Stat. 1950, 27. 488-506
studies on the variance of the arithmetic mean after rejection King, E. P. J. Am. Stat. Assoc. 1953, 48, 531-533.
Deming, W. E. Statistical Aaustment of Data; Wiley: New York,
of outliers suggest the superiority of two more recently pro- 1943 (republished by Dover: New York. 1964); p 171.
posed criteria (Huber-type skipped mean and Shapiro-Wilk Parratt, L. G. Probability and Experimental Errors in Science ; Wiley:
New York, 1961; pp 176-178.
rules) for rejection decisions (19),particularly for larger sam- Hawkins, D. M. Identification of Outliers; Chapman and Hall: London,
ples containing multiple outliers. Nonetheless, the simplicity 1980.
of Dixon's range ratio tests argues strongly for their continued Beckman, R . J.; Cook, R . D. Technometrics 1983, 25, 119-149.
(a) Burr, I. W. Applied Statistical Methods; Academic: New York.
use in many analytical applications. 1974; pp 194-195. (b) Hamilton, W. C. Statistics in Physical Science;
Ronald: New York. 1964; pp 45-49.
ACKNOWLEDGMENT Dixon, W. J. I n Contributions to Order Statistics; Sarhan, A. E.,
Greenberg, B. G., Eds.; Wiley: New York, 1962; pp 299-342 (see pp
I express my appreciation to my colleagues, George Schenk 314-317).
and David Coleman, for helpful suggestions regarding this Natrelia, M. G. Experimental Statistics ; National Bureau of Standards
Handbook 91; NBS: Washington, DC, 1963; Chapter 17.
manuscript. "Student". [Gossett, W. S.] Biometrika 1908, 6 , 1.
Taylor, J. R. An Introduction to Error Analysis: The Study of Uncer-
LITERATURE CITED tainties in Physical Measurements; University Science: Mill Valley,
(a) Miller, J. C.; Miller, J. N. Statistics for Analytical Chemistry. 2nd ed.; CA, 1982; pp 142-145.
Wiley: New York. 1988; pp 62-64, 218. (b) Anderson, R. L. Practical Dixon, W. J. Ann. Math. Stat. 1951, 22, 68-78.
Statistics for Analytical Chemlsts; Van Nostrand Reinhoid: New York, (a) Dixon, W. J. Biometrics 1953, 9, 74-89. (b) Dean, R . 8.; Dixon.
1987; pp 31-32. (c) Caulcutt, R.;Boddy, R . Statistics for Analytical W. J. Anal. Chem. 1951, 2 3 , 636-638.
Chemists; Chapman and Hall: London, 1983; pp 66-67, 248. Peters, D. G.; Hayes, J. M.; Hieftje, G. M. Chemical Separations and
Measurements : Theory and Practice of Analytical Chemistry; Saun-
Barnett, V.; Lewis, T. Outliers in Statistical Data, 2nd ed.; Wiiey: New ders: Philadelphia, 1974; p 36.
York. 1984.
(a) Skoog, D. A.; West, D. M.; Holler, F. J. Analytical Chemistry: An Youmans, H. L. Statistics for Chemistry; Charles E. Merrill: Columbus,
Introduction, 5th ed.; Saunders: Philadelphia, 1990; p 56. (b) Skoog, OH, 1973; p 65 ff.
D. A.; West, D. M.; Holler, F. J. Fundamentals of Analytical Chemistry, Hampel, F. R. Technomefrics 1985, 27, 95-107.
5th ed.; Saunders: New York, 1988; pp 13-16. (c) Hargis, L. G.
Analytical Chemistry; Prentice-Hall: Englewocd Cliffs, NJ, 1988; p 56.
(d) Fritz, J. S.:Schenk, G. H. Quantitative Analytical Chemistry, 5th RECEIVED
for review June 15, 1990. Accepted October 16,
ed.: Allyn 8 Bacon: Boston, 1987; pp 45-46. (e) Potts, L. W. Quan- 1990.