Design of Experiments with MINITAB


Linear Regression 289

8.6 CONFIDENCE LIMITS FOR THE REGRESSION LINE


The previous section makes it apparent that the true slope and intercept of a regression
line are not exactly known. This means that the regression line ŷ = b0 + b1x drawn
through the (x, y) data might be the best line to draw based on the limited information
available, but that the true line that represents the population of (x, y) could be shifted
up or down a bit or that the true slope might be a bit shallower or a bit steeper. The
quantities sb0 and sb1 estimate the size of these uncertainties. If we take these two effects
together (the shifting up and down of the regression line due to uncertainty in β0, and
changes to the slope so that the line is steeper or shallower due to uncertainty in β1),
we can determine confidence limits for where we can expect the true line to fall. The
(1 − α)100% confidence interval for the regression line is given by:

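$$ P\left( \hat{y} - t_{\alpha/2}\, s_\varepsilon \sqrt{\frac{1}{n} + \frac{(x - \bar{x})^2}{SS_x}} < \mu_{y(x)} < \hat{y} + t_{\alpha/2}\, s_\varepsilon \sqrt{\frac{1}{n} + \frac{(x - \bar{x})^2}{SS_x}} \right) = 1 - \alpha \tag{8.27} $$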

where tα/2 is taken with dfε = n − 2 degrees of freedom and μy(x) is the parameter
associated with y at the specified x value. Remember that this interval does not reflect the
distribution of individual data points about the regression line; it indicates with
(1 − α)100% confidence where the true regression line might be located. Notice that the
confidence limits for the regression line can be made arbitrarily tight by taking as many
observations as required.
Since the calculations for the confidence interval involve limits of the function y(x),
there are many values along the regression line that must be evaluated to determine
what the bounds look like. This isn’t something that you want to do by hand. MINITAB
has the ability to construct the confidence intervals from the Stat> Regression> Fitted
Line Plot menu. You will have to select Display Confidence Bands in the Options
menu to add the confidence limits to the fitted line plot.
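For readers without MINITAB, the band in Equation 8.27 can be sketched in a few lines of Python. This is a minimal illustration, not MINITAB's implementation: it assumes the five (x, y) observations listed in the data display of Figure 8.11, and it hard-codes the table value t(0.025, 3) = 3.182 in the hypothetical constant `T_CRIT` rather than computing it.

```python
import math

# Five (x, y) observations as listed in the data display of Figure 8.11.
x = [1, 2, 6, 8, 8]
y = [3, 7, 14, 18, 23]
n = len(x)

x_bar = sum(x) / n
y_bar = sum(y) / n
ss_x = sum((xi - x_bar) ** 2 for xi in x)
ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

b1 = ss_xy / ss_x                      # slope
b0 = y_bar - b1 * x_bar                # intercept
ss_e = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
s_eps = math.sqrt(ss_e / (n - 2))      # standard error of the regression

T_CRIT = 3.182  # assumption: table value of t(0.025, 3) for a 95% interval


def confidence_band(x0):
    """Return (lower, upper) 95% confidence limits for the line at x0 (Eq. 8.27)."""
    y_hat = b0 + b1 * x0
    half = T_CRIT * s_eps * math.sqrt(1 / n + (x0 - x_bar) ** 2 / ss_x)
    return y_hat - half, y_hat + half


for x0 in range(1, 9):
    lower, upper = confidence_band(x0)
    print(f"x = {x0}: {lower:6.2f} < mu_y(x) < {upper:6.2f}")
```

Evaluating the function over a grid of x values traces out the two curved bands; they are narrowest at x = x̄ because the (x − x̄)² term vanishes there.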

Example 8.7
Use MINITAB to construct the 95 percent confidence interval for the regression line
from Example 8.2.
Solution: MINITAB provides the capability to construct confidence intervals for
regression models from its Stat> Regression> Fitted Line Plot menu. The 95 percent
confidence interval for the regression line is shown in Figure 8.8.

Figure 8.8 95 percent confidence bands for the regression line. (Regression plot of y
versus x with fitted line Y = 1.18182 + 2.36364X, R-Sq = 93.8%, and 95% CI bands.)

Sometimes after regression for y(x) it is necessary to construct a confidence interval
for x for a specified value of y. To construct this interval, you might be tempted to
perform the regression of x as a function of y and then to use the method of Equation
8.27 to construct the confidence interval for x, but this approach gives the wrong
answer. The correct interval is given by a method called inverse prediction. Given the
results of a linear regression analysis for y = f(x) of the form y = b0 + b1x determined
from n observations, the (1 − α)100% confidence interval for the true value of x that
delivers the specified value of y is given by:

$$ P\left( \bar{x} + \frac{b_1 (y - \bar{y})}{d} - h < \mu_{x(y)} < \bar{x} + \frac{b_1 (y - \bar{y})}{d} + h \right) = 1 - \alpha \tag{8.28} $$

where:

$$ h = \frac{t_{\alpha/2}\, s_\varepsilon}{d} \sqrt{ d \left( 1 + \frac{1}{n} \right) + \frac{(y - \bar{y})^2}{\sum x^2} } \tag{8.29} $$

and:

$$ d = b_1^2 - \left( t_{\alpha/2}\, s_{b_1} \right)^2 \tag{8.30} $$

and the t distribution has dfε = n − 2 degrees of freedom. See Neter et al. (1996) or Sokal
and Rohlf (1995) for more detailed explanations and examples of inverse prediction.
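The inverse prediction formulas can be sketched as follows. This is a hedged illustration rather than a reference implementation: it uses the summary statistics reported in Figure 8.11, it assumes that the sum in the denominator of Equation 8.29 is the sum of squared deviations SSx = ∑(x − x̄)² (the convention used by Sokal and Rohlf), and it hard-codes the table value t(0.025, 3) = 3.182. The function name `inverse_prediction` is hypothetical.

```python
import math

# Summary statistics for the Example 8.1 data as reported in Figure 8.11.
n, x_bar, y_bar = 5, 5.0, 13.0
b0, b1 = 1.18182, 2.36364
s_eps, s_b1 = 2.32249, 0.3501
ss_x = 44.0     # assumption: the sum in Eq. 8.29 is SSx = sum((x - x_bar)^2)
T_CRIT = 3.182  # assumption: table value of t(0.025, 3)


def inverse_prediction(y0):
    """Return (lower, upper) 95% limits for the x that delivers y0 (Eqs. 8.28-8.30)."""
    d = b1 ** 2 - (T_CRIT * s_b1) ** 2                      # Eq. 8.30
    if d <= 0:
        # The slope is not distinguishable from zero, so the formula breaks down.
        raise ValueError("d must be positive for inverse prediction")
    center = x_bar + b1 * (y0 - y_bar) / d                  # center of Eq. 8.28
    h = (T_CRIT * s_eps / d) * math.sqrt(
        d * (1 + 1 / n) + (y0 - y_bar) ** 2 / ss_x)         # Eq. 8.29
    return center - h, center + h


lower, upper = inverse_prediction(10.0)
x_hat = (10.0 - b0) / b1   # point estimate obtained by inverting the fitted line
print(f"{lower:.2f} < x < {upper:.2f}, point estimate {x_hat:.2f}")
```

Note that the interval is centered on x̄ + b1(y − ȳ)/d rather than on the point estimate (y − b0)/b1, so it is generally asymmetric about the value obtained by simply inverting the fitted line.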

8.7 PREDICTION LIMITS FOR THE OBSERVED VALUES


In the previous section, a confidence interval was described for the position of the true
regression line for the population of (x, y) observations. The purpose of this section is

to build on that confidence interval to create a new interval that provides prediction
bounds for individual observations. The width of the prediction interval combines the
uncertainty of the position of the true line as described by the confidence interval with
the scatter of points about the line as measured by the standard error.
For a given value of x, the predicted value of y will be ŷ = b0 + b1x. The fraction of
the population of values that should fall within a certain distance of this prediction
equation is given by:

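$$ P\left( \hat{y} - t_{\alpha/2}\, s_\varepsilon \sqrt{1 + \frac{1}{n} + \frac{(x - \bar{x})^2}{SS_x}} < y(x) < \hat{y} + t_{\alpha/2}\, s_\varepsilon \sqrt{1 + \frac{1}{n} + \frac{(x - \bar{x})^2}{SS_x}} \right) = 1 - \alpha \tag{8.31} $$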

where tα/2 has dfε = n − 2 degrees of freedom. This interval looks very much like the
confidence interval in Equation 8.27 but it has an additional term (1) inside the square
root. This term represents the additional variation of individual points about the
regression line.
Whereas the confidence interval for the regression line can be made arbitrarily tight
by taking many observations, the width of the prediction interval is limited by the stan-
dard error of the regression. Notice that when n is very large in Equation 8.31 the pre-
diction interval can be approximated by:

$$ P\left( \hat{y} - t_{\alpha/2}\, s_\varepsilon < y(x) < \hat{y} + t_{\alpha/2}\, s_\varepsilon \right) < 1 - \alpha \tag{8.32} $$

This approximate interval will always be narrower than the true prediction interval,
but the discrepancy between them is small when the sample size is large. The
simplification provided by the approximate interval makes it useful and common in
practice. It is much easier to make a statement like “95 percent of the observations are
expected to fall within ±tα/2 sε of the best-fit line” than to explain the complex but
relatively insignificant behavior of the exact prediction interval.
MINITAB can plot prediction limits along with the best-fit line from the Stat>
Regression> Fitted Line Plot menu. You will have to select Display Prediction Bands
in the Options menu. Both confidence bands and prediction bands are often shown
along with the regression line and the scatterplot of the data.

Example 8.8
Construct a 95 percent prediction interval for y when x = 4 for the model deter-
mined in Example 8.2.
Solution: The predicted value of y at x = 4 is:

$$ \hat{y}(4) = 1.18 + 2.36(4) = 10.6 $$


The 95 percent prediction interval is given by Equation 8.31 with α = 0.05 where the
t distribution has dfε = n − 2 = 5 − 2 = 3 degrees of freedom. The required t value is
t0.025,3 = 3.18 so the prediction interval for y(4) is:

$$ P\left( \hat{y} - t_{0.025}\, s_\varepsilon \sqrt{1 + \frac{1}{n} + \frac{(x - \bar{x})^2}{SS_x}} < y(x) < \hat{y} + t_{0.025}\, s_\varepsilon \sqrt{1 + \frac{1}{n} + \frac{(x - \bar{x})^2}{SS_x}} \right) = 0.95 $$

$$ P\left( 10.6 - 3.18 \times 2.32 \sqrt{1 + \frac{1}{5} + \frac{(4 - 5)^2}{44}} < y(4) < 10.6 + 3.18 \times 2.32 \sqrt{1 + \frac{1}{5} + \frac{(4 - 5)^2}{44}} \right) = 0.95 $$

$$ P\left( 2.4 < y(4) < 18.8 \right) = 0.95 $$

That is, 95 percent of the observations taken at x = 4 should have values of y that fall
between 2.4 and 18.8.

Example 8.9
Determine the approximate prediction interval corresponding to the situation
described in Example 8.8.
Solution: The approximate 95 percent prediction interval is given by:

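$$ P\left( \hat{y} - t_{\alpha/2}\, s_\varepsilon < y(x) < \hat{y} + t_{\alpha/2}\, s_\varepsilon \right) < 1 - \alpha $$

$$ P\left( 10.6 - 3.18 \times 2.32 < y(4) < 10.6 + 3.18 \times 2.32 \right) < 0.95 $$

$$ P\left( 3.2 < y(4) < 18.0 \right) < 0.95 $$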

This interval is narrower than the exact prediction interval by about 10 percent but it
is so much easier to calculate that it is an appealing compromise. The agreement
between the approximate and exact intervals is much better when the number of obser-
vations is larger than the n = 5 case considered here, so the approximation is usually
safe to use.
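The arithmetic of Examples 8.8 and 8.9 can be checked with a short Python sketch. It uses the rounded values quoted in those examples (ŷ = 10.6, t = 3.18, sε = 2.32, n = 5, x̄ = 5, SSx = 44); the variable names are illustrative only.

```python
import math

# Rounded values quoted in Examples 8.8 and 8.9.
y_hat, t_crit, s_eps = 10.6, 3.18, 2.32
n, x0, x_bar, ss_x = 5, 4, 5, 44

# Half-width of the exact prediction interval (Eq. 8.31).
exact_half = t_crit * s_eps * math.sqrt(1 + 1 / n + (x0 - x_bar) ** 2 / ss_x)
# Half-width of the large-sample approximation (Eq. 8.32).
approx_half = t_crit * s_eps

exact = (y_hat - exact_half, y_hat + exact_half)
approx = (y_hat - approx_half, y_hat + approx_half)
shortfall = 1 - approx_half / exact_half   # fraction by which the approximation is narrower

print(f"exact:  {exact[0]:.1f} to {exact[1]:.1f}")
print(f"approx: {approx[0]:.1f} to {approx[1]:.1f}")
print(f"approximate interval is narrower by {100 * shortfall:.1f} percent")
```

The shortfall depends only on the factor under the square root, so it shrinks toward zero as n grows and as x approaches x̄.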

Example 8.10
Use MINITAB to construct the 95 percent prediction interval for observations from
Example 8.2.
Solution: The graphical prediction interval can be constructed from MINITAB’s
Stat> Regression> Fitted Line Plot menu. The graphical output is shown in Figure
8.9. The values of the prediction interval at x = 4 are 2.4 and 18.8 as calculated in
Example 8.8.
Figure 8.9 95 percent prediction interval for observations. (Regression plot of y versus
x with fitted line Y = 1.18182 + 2.36364X, R-Sq = 93.8%, and 95% PI bands; the values
2.4 and 18.8 are marked on the y axis.)

8.8 CORRELATION

8.8.1 The Coefficient of Determination


A comprehensive statistic is required to measure the fraction of the total variation in the
response y that is explained by the regression model. The total variation in y taken
relative to ȳ is given by SSy = ∑(yi − ȳ)², but SSy is partitioned into two terms: one that
accounts for the amount of variation explained by the straight-line model, given by
SSregression, and another that accounts for the unexplained error variation, given by
SSε = ∑(yi − ŷi)² = ∑εi². The three quantities are related by:

$$ SS_y = SS_{regression} + SS_\varepsilon \tag{8.33} $$

Consequently, the fraction of SSy explained by the model is:

$$ r^2 = \frac{SS_{regression}}{SS_y} = 1 - \frac{SS_\varepsilon}{SS_y} \tag{8.34} $$

where r² is called the coefficient of determination. In practice, r² is more easily
determined from its calculating form:

$$ r^2 = \frac{SS_{xy}^2}{SS_x\, SS_y} \tag{8.35} $$

The coefficient of determination finds numerous applications in regression and
multiple regression problems. Since SSregression is bounded by 0 ≤ SSregression ≤ SSy, there
are corresponding bounds on the coefficient of determination given by 0 ≤ r² ≤ 1. When
r² ≈ 0 the regression model has little value because very little of the variation in y is
attributable to its dependence on x. When r² ≈ 1 the regression model almost completely
explains the variation in the response, that is, x almost perfectly predicts y. We’re
usually hoping for r² = 1, but this rarely happens.
A common mistake made by people who don’t understand r² is to compare it to an
arbitrarily chosen acceptance condition to determine if a model is a good fit to the data.
A low r² value doesn’t necessarily mean that a model is useless, just that the overall
performance of the model is poor because of the large amount of random error in the data
set. Even if you find a low r² value in an analysis, make sure to go back and look at the
regression coefficients and their t values. You may find that, despite the low r² value, one or
more of the regression coefficients is still strong and relatively well known. In the same
manner, a high r² value doesn’t necessarily mean that the model that you’ve fitted to the
data is the right model. That is, even when r² is very large, the fitted model may not
accurately predict the response. It’s the job of lack of fit or goodness of fit tests, which will be
discussed later in this chapter, to determine if a model is a good fit to the data.

Example 8.11
Calculate the values of r² for each of the slopes attempted in Example 8.1 and the
best-fit line. Plot r² versus b1 and show that the best-fit line corresponds to the
maximum value of r².
Solution: The following table was developed from the one in Example 8.1:

b1       0      1      2      2.36   3      4
∑di²     262    102    22     16     34     134
sd²      87.3   34.0   7.33   5.39   11.3   44.7
r²       0      0.611  0.916  0.939  0.870  0.489

The r² values were determined from Equation 8.34 where SSy = 262. The r² values are
plotted against their b1 values in Figure 8.10. The plot clearly shows that the best-fit
line is the one that maximizes r². This shows, again, that the best-fit line given by
linear regression is the one that has the least error.
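The r² row of the table can be reproduced directly from Equation 8.34. The short Python sketch below copies the trial slopes and the ∑di² values from the table (they are taken as given, not recomputed from the raw data) and confirms that the largest r² occurs at the best-fit slope.

```python
# Trial slopes and residual sums of squares copied from the table in Example 8.11.
slopes = [0, 1, 2, 2.36, 3, 4]
sum_d2 = [262, 102, 22, 16, 34, 134]
ss_y = 262

# Eq. 8.34: r^2 = 1 - SS_error / SS_y, applied to each trial line.
r2 = [1 - d2 / ss_y for d2 in sum_d2]

best = max(range(len(slopes)), key=lambda i: r2[i])
for b1, value in zip(slopes, r2):
    print(f"b1 = {b1:<5} r^2 = {value:.3f}")
print(f"largest r^2 at b1 = {slopes[best]}")
```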

Figure 8.10 r² values for different slopes b1. (Plot of r² versus b1 for 0 ≤ b1 ≤ 4 with
the best-fit slope marked at the maximum.)

8.8.2 The Correlation Coefficient

The correlation coefficient r is given by the square root of the coefficient of
determination with an appropriate plus or minus sign. Return for a moment to the calculating
form of r² given in Equation 8.35. Note that the sum in the numerator SSxy is not
actually a sum of squares, hence r can be signed. The sign of r is useful: if r is positive it
means that y increases with respect to x, and if it is negative y decreases with respect to
x. That is, the sign of r is the same as the sign of the slope of the regression line. Since
r is determined from the appropriately signed value of √r² and r² is bounded by
0 ≤ r² ≤ 1, the correlation coefficient is bounded by −1 ≤ r ≤ 1.
Because of the ease of interpreting the coefficient of determination on a zero to 100
percent scale, it is used more frequently as a regression summary statistic than the
correlation coefficient.
Whereas linear regression operates under the assumption that the values of x are
known exactly, that is, without error, correlation analysis does not require the same
assumption and so is more robust than linear regression. In situations where there
are known errors in the x as well as the y, or when there is a need to correlate two
responses, for example, y1 and y2, the appropriate method of analysis is correlation.

8.8.3 Confidence Interval for the Correlation Coefficient


The coefficient of determination r² is a statistic that estimates the degree of correlation
between x and y. A different data set of (x, y) values will give a different value of r². The
quantity that such r² values estimate is the true population coefficient of determination
ρ², which is a parameter. When the distribution of the regression model residuals is
normal with constant variance, the distribution of r is complicated, but the distribution of:

$$ Z = \frac{1}{2} \ln\left( \frac{1 + r}{1 - r} \right) \tag{8.36} $$

is approximately normal with mean:

$$ \mu_Z = \frac{1}{2} \ln\left( \frac{1 + \rho}{1 - \rho} \right) \tag{8.37} $$

and standard deviation:

$$ \sigma_Z = \frac{1}{\sqrt{n - 3}} \tag{8.38} $$

The transformation of r into Z is called Fisher’s Z transformation. A lookup table
relating Z and r is provided in Appendix A.9. This information can be used to construct a
confidence interval for the unknown parameter μZ from the statistic r and the sample
size n. The confidence interval is:

$$ P\left( Z - \frac{z_{\alpha/2}}{\sqrt{n - 3}} < \mu_Z < Z + \frac{z_{\alpha/2}}{\sqrt{n - 3}} \right) = 1 - \alpha \tag{8.39} $$

The inverse of the Z transform then gives a confidence interval for the correlation
coefficient of the form:

$$ P\left( r_L < \rho < r_U \right) = 1 - \alpha \tag{8.40} $$

Fisher’s Z transform is accurate when n ≥ 50 and tolerable when n ≥ 25. For 10 < n
< 25 a small-sample correction to Fisher’s original Z transform should be used. The
corrected transform is given by:

$$ Z' = Z - \frac{3Z + r}{4(n - 1)} \tag{8.41} $$

where Z′ has standard deviation

$$ \sigma_{Z'} = \frac{1}{\sqrt{n - 1}} \tag{8.42} $$

The resulting confidence interval is:

$$ P\left( Z' - \frac{z_{\alpha/2}}{\sqrt{n - 1}} < \mu_{Z'} < Z' + \frac{z_{\alpha/2}}{\sqrt{n - 1}} \right) = 1 - \alpha \tag{8.43} $$

The transformation from Z′ back to r is given by solving Equation 8.41 for r:

$$ r = 4(n - 1)(Z - Z') - 3Z \tag{8.44} $$

Example 8.12
A linear regression analysis based on n = 30 observations had coefficient of
determination r² = 0.828. The regression assumptions of independent, normal, and
homoscedastic residuals were satisfied and the linear model provided a good fit to the data.
Find the 95 percent confidence interval for the true population coefficient of determination.
Solution: The sample size is sufficiently large that Fisher’s Z transform should
provide adequate accuracy for the confidence interval. The correlation coefficient is:

$$ r = \sqrt{0.828} = 0.910 $$

so the value of Fisher’s Z is:

$$ Z = \frac{1}{2} \ln\left( \frac{1 + r}{1 - r} \right) = \frac{1}{2} \ln\left( \frac{1 + 0.910}{1 - 0.910} \right) = 1.528 $$

The 95 percent confidence interval for μZ is then:

$$ P\left( Z - \frac{z_{0.025}}{\sqrt{n - 3}} < \mu_Z < Z + \frac{z_{0.025}}{\sqrt{n - 3}} \right) = 1 - 0.05 $$

$$ P\left( 1.528 - \frac{1.96}{\sqrt{30 - 3}} < \mu_Z < 1.528 + \frac{1.96}{\sqrt{30 - 3}} \right) = 0.95 $$

$$ P\left( 1.151 < \mu_Z < 1.905 \right) = 0.95 $$

From the table for Z(r) in Appendix A.9, the values of r that correspond to Z = 1.151
and 1.905 are r = 0.816 and 0.957, respectively, so our interval for the unknown
population correlation coefficient is:

$$ P\left( 0.816 < \rho < 0.957 \right) = 0.95 $$

Then the confidence interval for the unknown population coefficient of determination is:

$$ P\left( 0.666 < \rho^2 < 0.916 \right) = 0.95 $$

This example makes it very clear that despite the apparently high experimental
coefficient of determination, the relatively small sample size leaves a tremendous
amount of uncertainty about the true value of ρ². Don’t be fooled into a false sense of
confidence by a large value of r² determined from a small data set.
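Fisher's Z transform is the inverse hyperbolic tangent, so the interval in Example 8.12 can be sketched in Python without a lookup table. The function name `fisher_ci` is hypothetical, and z = 1.96 is the usual normal value for 95 percent confidence. Because the inverse transform is computed directly instead of being read from a table, the limits may differ from the example in the third decimal place.

```python
import math


def fisher_ci(r, n, z_crit=1.96):
    """Approximate confidence limits for rho using Fisher's Z (Eqs. 8.36-8.40)."""
    z = math.atanh(r)                  # Eq. 8.36: Z = 0.5 * ln((1 + r) / (1 - r))
    half = z_crit / math.sqrt(n - 3)   # z_crit times sigma_Z from Eq. 8.38
    # tanh is the inverse of the Z transform, mapping the limits back to r.
    return math.tanh(z - half), math.tanh(z + half)


r = math.sqrt(0.828)                   # correlation coefficient from r^2 = 0.828
r_lo, r_hi = fisher_ci(r, n=30)
print(f"{r_lo:.3f} < rho   < {r_hi:.3f}")
print(f"{r_lo ** 2:.3f} < rho^2 < {r_hi ** 2:.3f}")
```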

8.8.4 The Adjusted Correlation Coefficient


In more complex regression problems where many independent variables and possibly
interaction terms enter the model, it’s unfair to measure the model quality with the
coefficient of determination r². As more and more terms are carried in a complex model, the
r² value never decreases, whether or not the added terms are meaningful. This makes it
necessary to penalize r² for the additional complexity of the model. This new coefficient
of determination, called the adjusted coefficient of determination, r²adjusted, is given by:

$$ r^2_{adjusted} = 1 - \frac{df_{total}\, SS_\varepsilon}{df_\varepsilon\, SS_y} \tag{8.45} $$

r²adjusted is always smaller than r² and is the safer of the two statistics to use when
evaluating a complex model.

Example 8.13
Calculate r² for the best fit of the data in Example 8.1 using both the defining and
calculating forms given in Equations 8.34 and 8.35. Also calculate the adjusted
coefficient of determination.
Solution: The sums of squares necessary to determine the correlation coefficients
were already determined in Example 8.2. By the defining form of the coefficient of
determination:

$$ r^2 = 1 - \frac{SS_\varepsilon}{SS_y} = 1 - \frac{16.2}{262} = 0.938 $$

Alternatively, by the calculating form:

$$ r^2 = \frac{SS_{xy}^2}{SS_x\, SS_y} = \frac{104^2}{44 \times 262} = 0.938 $$

The adjusted coefficient of determination is given by Equation 8.45:

$$ r^2_{adjusted} = 1 - \frac{df_{total}\, SS_\varepsilon}{df_\varepsilon\, SS_y} = 1 - \frac{4(16.2)}{3(262)} = 0.918 $$
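The three calculations in Example 8.13 are simple enough to check in a few lines of Python. The sketch below uses the sums of squares and degrees of freedom quoted in the example; the variable names are illustrative.

```python
# Sums of squares and degrees of freedom quoted in Example 8.13.
ss_e, ss_y, ss_x, ss_xy = 16.2, 262, 44, 104
df_total, df_e = 4, 3

r2_defining = 1 - ss_e / ss_y                          # Eq. 8.34
r2_calculating = ss_xy ** 2 / (ss_x * ss_y)            # Eq. 8.35
r2_adjusted = 1 - (df_total * ss_e) / (df_e * ss_y)    # Eq. 8.45

print(f"{r2_defining:.3f} {r2_calculating:.3f} {r2_adjusted:.3f}")
```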

8.9 LINEAR REGRESSION WITH MINITAB


MINITAB provides two basic functions for performing linear regression. The first
method, accessed from the Stat> Regression> Fitted Line Plot menu or with the fitline
function at the command prompt, is the best place to start to evaluate the quality of the
fitted function. The output from Stat> Regression> Fitted Line Plot (or fitline) includes
a scatter plot of the (xi, yi) data with the superimposed fitted line, a full ANOVA table,
and an abbreviated table of regression coefficients. A comprehensive set of graphical
residuals diagnostics can be turned on in the Graphs menu and there are options to fit
quadratic and cubic models.
MINITAB provides a more comprehensive regression analysis from the Stat>
Regression> Regression menu. You must specify the columns for the x and the y values,
either by name or by column number, or you can invoke the regression command
directly from the command line with:
mtb > regress c1 1 c2

where column c1 contains the response y and c2 contains the values of x. The “1”
between c1 and c2 tells MINITAB that there is only one independent variable. This
anticipates multiple linear regression, which involves more than one predictor in the
model. A comprehensive set of graphical residuals diagnostics can be turned on from
the Stat> Regression> Regression> Graphs menu.
MINITAB’s Stat> Regression> Regression output has two parts. The first part is a
table of the regression coefficients and the corresponding standard deviations, t values,
and p values. The second part is the ANOVA table, which summarizes the statistics
required to determine the regression coefficients and the summary statistics like r², r²adj,
and sε. There is a p value reported for the slope of the regression line in the table of
regression coefficients and another p value reported in the ANOVA table for the
ANOVA F test. These two p values are numerically identical and not just by
coincidence. There is a special relationship that exists between the t and F distributions when
the F distribution has one numerator degree of freedom. With the two-tailed t value
written as tα/2, this relationship is:

$$ F_{\alpha,\,1,\,df_\varepsilon} = t^2_{\alpha/2,\,df_\varepsilon} \tag{8.46} $$

Because the ANOVA F value and the t value associated with the slope are
mathematically equivalent, they also share the same p value.

Example 8.14
Analyze the data from Example 8.1 using MINITAB and explain the output line
by line.
Solution: The MINITAB output is shown in Figure 8.11. The agreement between the
calculations done above and MINITAB is excellent. The only differences are small ones
due to round-off error. There are not enough data points to make meaningful residuals
plots so they are not shown.

MTB > Name c3 "RESI1"
MTB > Regress 'y' 1 'x';
SUBC>   Residuals 'RESI1';
SUBC>   Constant;
SUBC>   Brief 2.

Regression Analysis: y versus x

The regression equation is
y = 1.18 + 2.36 x

Predictor     Coef  SE Coef     T      P
Constant     1.182    2.036  0.58  0.602
x           2.3636   0.3501  6.75  0.007

S = 2.32249   R-Sq = 93.8%   R-Sq(adj) = 91.8%

Analysis of Variance

Source          DF      SS      MS      F      P
Regression       1  245.82  245.82  45.57  0.007
Residual Error   3   16.18    5.39
Total            4  262.00

MTB > print c1-c3

Data Display

Row  x   y     RESI1
  1  1   3  -0.54545
  2  2   7   1.09091
  3  6  14  -1.36364
  4  8  18  -2.09091
  5  8  23   2.90909

Figure 8.11 MINITAB output for data from Example 8.1.

MINITAB determines the constant in the linear model to be b0 = 1.182. The constant has
standard deviation sb0 = 2.036. For the hypothesis test of H0: β0 = 0 versus HA: β0 ≠ 0,
the t statistic is tb0 = b0/sb0 = 0.58, which, with dfε = 3 degrees of freedom, is not
statistically significant (p = 0.602). The t and p values indicate that β0 is
indistinguishable from zero. The slope of the fitted line is b1 = 2.3636 and its standard
deviation is sb1 = 0.3501. For the hypothesis test of H0: β1 = 0 versus HA: β1 ≠ 0, the
t statistic is tb1 = b1/sb1 = 2.36/0.35 = 6.75, which is highly significant (p = 0.007).
The degrees of freedom column indicates that there are 5 − 1 = 4 total degrees of freedom,
1 regression degree of freedom, and 4 − 1 = 3 error degrees of freedom. The total amount
of variation in the response is SStotal = 262.0 of which SSregression = 245.82 is explained
by the linear model and the remaining SSε = 262.0 − 245.8 = 16.2 is unexplained or error
variation. The mean squares are given by MS = SS/df and their ratio gives F = 45.57,
which is much greater than the F = 1 value we expect if the linear model is not
meaningful. The p value for this F with dfregression = 1 and dfε = 3 is p = 0.007. The
standard error of the model is given by:

$$ s_\varepsilon = \sqrt{MS_\varepsilon} = \sqrt{5.39} = 2.322 $$

The coefficient of determination is r² = SSregression/SStotal = 245.82/262.0 = 0.938. The
ANOVA F value and the t value for the slope are related by (F = 45.57) = (t² = 6.75²)
and they share the same p value. The data display at the bottom of the figure shows the
x and y values used for the analysis and the model residuals εi are reported in the next
column.
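Most of the numbers in Figure 8.11 can be recomputed from the five data points with nothing more than sums of squares. The Python sketch below is one way to do that; it is not MINITAB's algorithm, and it assumes the standard simple-regression expressions for the coefficient standard errors, sb1 = sε/√SSx and sb0 = sε√(1/n + x̄²/SSx). The p values are omitted because they require t and F distribution functions.

```python
import math

# Data from the data display in Figure 8.11.
x = [1, 2, 6, 8, 8]
y = [3, 7, 14, 18, 23]
n = len(x)

x_bar, y_bar = sum(x) / n, sum(y) / n
ss_x = sum((xi - x_bar) ** 2 for xi in x)
ss_y = sum((yi - y_bar) ** 2 for yi in y)
ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

b1 = ss_xy / ss_x
b0 = y_bar - b1 * x_bar
residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]

ss_reg = b1 * ss_xy                 # variation explained by the line
ss_e = ss_y - ss_reg                # Eq. 8.33 rearranged
ms_e = ss_e / (n - 2)
s_eps = math.sqrt(ms_e)

se_b1 = s_eps / math.sqrt(ss_x)                        # assumed standard form
se_b0 = s_eps * math.sqrt(1 / n + x_bar ** 2 / ss_x)   # assumed standard form
t_b0, t_b1 = b0 / se_b0, b1 / se_b1
f_stat = (ss_reg / 1) / ms_e        # one regression degree of freedom

print(f"Constant {b0:.3f} {se_b0:.3f} {t_b0:.2f}")
print(f"x        {b1:.4f} {se_b1:.4f} {t_b1:.2f}")
print(f"S = {s_eps:.5f}  R-Sq = {100 * ss_reg / ss_y:.1f}%")
print(f"F = {f_stat:.2f}  t^2 = {t_b1 ** 2:.2f}")
```

The last line illustrates the relationship behind Equation 8.46: for a single predictor, the ANOVA F statistic equals the square of the slope's t statistic.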

8.10 TRANSFORMATIONS TO LINEAR FORM


The usefulness of the linear regression analysis is extended tremendously when nonlin-
ear problems can be transformed into linear form. As long as all of the regression
assumptions are satisfied by the transformed variables this approach is valid.
As an example, consider a response y that depends on the single independent
variable x according to:

$$ y = a + b x^2 $$

where a and b are to be determined by regression. After the transformation x′ = x² is
applied, linear regression can be used to fit a model of the form:

$$ y = a + b x' $$

Generally, transformations are applied to x but sometimes, because of the structure
of the expected relationship between y and x, it may be easier to apply the
transformation to y instead of x. Sometimes it may even be necessary to apply transforms to both
x and y variables in the same problem. An infinite number of transformations are
possible. The diversity of possible transforms makes the linear regression method one of
the most powerful engineering and modeling tools available.
A catalog of some common nonlinear problems that can be linearized by variable
transformation is shown in Table 8.1. Some of these functions are plotted in Figure 8.12.
Generally, the model attempted should be based on first principles, but if no such first
principles model is available you can identify a candidate model by matching your
scatter plot with one of the functions from the figure.
MINITAB makes it easy to apply variable transformations to data so that you can still
use linear regression to analyze nonlinear problems. Enter the (x, y) data into two columns
of the MINITAB worksheet just as you normally would. Then use MINITAB’s let com-
mand (or the Calc> Calculator menu) to make the appropriate variable transformation.
Table 8.1 Transformations to linear form.

Function              y′          x′      a′       Linear Form
y = a e^(bx)          ln y                ln a     y′ = a′ + bx
y = a x^b             log y       log x   log a    y′ = a′ + bx′
y = a + b/x                       1/x              y = a + bx′
y = 1/(a + bx)        1/y                          y′ = a + bx
y = a e^(b/x)         ln y        1/x     ln a     y′ = a′ + bx′
y = a x² e^(bx)       ln(y/x²)            ln a     y′ = a′ + bx
n = n_o e^(−φ/kT)     ln n        1/kT    ln n_o   y′ = a′ − φx′
j = A T² e^(−φ/kT)    ln(j/T²)    1/kT    ln A     y′ = a′ − φx′
f(y) = a + b f(x)     f(y)        f(x)             y′ = a + bx′

Figure 8.12 Some common functions that can be linearized. (Plot of y versus x for
0 ≤ x ≤ 10 showing y = e^x, y = x², y = x, y = 1/x, y = sqrt(x), and y = log(x).)

For the example described above, if the x values reside in column c1 and the y values
in c2, then create the column of x² values with the command:
mtb > let c3 = c1*c1

Then perform the regression for y using the x² values in column c3 as the independent
variable:
mtb > regress c2 1 c3
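The same two steps (build a transformed column, then run an ordinary straight-line fit on it) can be sketched outside MINITAB. The Python example below is illustrative only: the helper `fit_line` and the noise-free data generated from y = 2 + 3x² are hypothetical and do not come from the text.

```python
def fit_line(xs, ys):
    """Least-squares intercept and slope for ys = a + b * xs."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    ss_x = sum((xi - x_bar) ** 2 for xi in xs)
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
    b = ss_xy / ss_x
    return y_bar - b * x_bar, b


# Hypothetical noise-free data generated from y = 2 + 3x^2 (not from the text).
x = [0, 1, 2, 3, 4, 5]
y = [2 + 3 * xi ** 2 for xi in x]

x_sq = [xi * xi for xi in x]   # plays the role of: let c3 = c1*c1
a, b = fit_line(x_sq, y)       # plays the role of: regress c2 1 c3
print(f"a = {a:.3f}, b = {b:.3f}")
```

Because the model is linear in x′ = x², the fit recovers a and b directly, with no back-transformation of the coefficients.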

Although count and proportion responses can be transformed using the methods
presented in Section 5.12 and then analyzed using linear regression, there are better
analysis methods available for these kinds of data but they are beyond the scope of this
book. See Neter et al. (1996) or Agresti (2002) for help with the analysis of count and
proportion responses.

Example 8.15
A dependent variable y is thought to have the following dependence on x:

$$ y = a x^b $$

Find an appropriate transformation that linearizes the equation.
Solution: If we take the natural log of both sides of the equation we have:

$$ \ln y = \ln\left( a x^b \right) = \ln a + b \ln x $$

If we make the substitutions y′ = ln y, a′ = ln a, and x′ = ln x, then our original
equation can be written:

$$ y' = a' + b x' $$

which is a linear equation.
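A log-log fit of this kind can be sketched in Python as follows. The data are hypothetical, generated without noise from y = 2x^1.5 purely to show that the transformed fit recovers the parameters; `fit_line` is an illustrative helper, not a library function.

```python
import math


def fit_line(xs, ys):
    """Least-squares intercept and slope for ys = a + b * xs."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    ss_x = sum((xi - x_bar) ** 2 for xi in xs)
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
    slope = ss_xy / ss_x
    return y_bar - slope * x_bar, slope


# Hypothetical noise-free data from y = 2 * x**1.5 (not from the text).
x = [1, 2, 4, 8, 16]
y = [2 * xi ** 1.5 for xi in x]

x_prime = [math.log(xi) for xi in x]   # x' = ln x
y_prime = [math.log(yi) for yi in y]   # y' = ln y
a_prime, b = fit_line(x_prime, y_prime)
a = math.exp(a_prime)                  # undo a' = ln a

print(f"a = {a:.3f}, b = {b:.3f}")
```

The exponent b is read directly from the slope, while the coefficient a has to be recovered by exponentiating the fitted intercept.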

Example 8.16
The deflection angle θ in radians of a solid cylindrical shaft of diameter D and
length L under an applied torque τ is expected to depend on D according to:

$$ \theta = \frac{\pi L \tau D^k}{32 G} $$

where G is the shear modulus of the material. If an experiment is performed to study
the deflection angle as a function of the cylinder diameter for a fixed material, cylinder
length L, and applied torque τ, what transformation should be applied to determine k,
the exponent of D?
Solution: The equation for θ must be rewritten to isolate its dependence on D:

$$ \theta = \left( \frac{\pi L \tau}{32 G} \right) D^k $$

If we take the natural log of both sides:

$$ \ln(\theta) = \ln\left( \frac{\pi L \tau}{32 G} \right) + k \ln(D) $$

and with the substitutions θ′ = ln(θ), D′ = ln(D), and

$$ a' = \ln\left( \frac{\pi L \tau}{32 G} \right) $$

we have the linear equation:

$$ \theta' = a' + k D' $$

If this model is appropriate, then the slope of the line in a plot of θ′ versus D′ will
indicate the value of k.

Example 8.17
A dependent variable y is thought to have the following dependence on x:

$$ y = a e^{bx} $$

where e = 2.7182818 is the base of the natural logarithm. Find an appropriate
transformation that linearizes the equation.
Solution: If we take the natural log of both sides of the equation we have:

$$ \ln y = \ln\left( a e^{bx} \right) = \ln a + bx $$

If we make the substitutions y′ = ln y and a′ = ln a then our original equation can be
written:

$$ y' = a' + bx $$

which is a linear equation.
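In this case only the response is transformed. A minimal Python sketch, using hypothetical noise-free data generated from y = 5e^(−0.3x) and an illustrative helper `fit_line`, shows the semi-log fit:

```python
import math


def fit_line(xs, ys):
    """Least-squares intercept and slope for ys = a + b * xs."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    ss_x = sum((xi - x_bar) ** 2 for xi in xs)
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
    slope = ss_xy / ss_x
    return y_bar - slope * x_bar, slope


# Hypothetical noise-free data from y = 5 * exp(-0.3 x) (not from the text).
x = [0, 1, 2, 3, 4, 5]
y = [5 * math.exp(-0.3 * xi) for xi in x]

y_prime = [math.log(yi) for yi in y]   # y' = ln y; x is left untransformed
a_prime, b = fit_line(x, y_prime)
a = math.exp(a_prime)                  # undo a' = ln a

print(f"a = {a:.3f}, b = {b:.3f}")
```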

Example 8.18
A dependent variable y is thought to have the following dependence on x:

$$ y = a + \frac{b}{x} $$

Find an appropriate transformation that linearizes the equation.
Solution: If we take the reciprocal of the x values then x′ = 1/x so:

$$ y = a + b x' $$

Example 8.19
In life testing studies, the reliability of a device is its probability of survival to time
t. A common model for reliability is the Weibull probability distribution given by:

$$ R(t) = e^{-\left( t/\eta \right)^\beta} $$

where η is called the scale factor and β is called the shape factor. Find a transform that
linearizes Weibull reliability as a function of time.
Solution: The first step is to apply a natural log transform to both sides:

$$ \ln(R) = -\left( \frac{t}{\eta} \right)^\beta $$

If we multiply both sides through by −1:

$$ -\ln(R) = \left( \frac{t}{\eta} \right)^\beta $$

$$ \ln\left( \frac{1}{R} \right) = \left( \frac{t}{\eta} \right)^\beta $$

and if we apply another natural log transform:

$$ \ln\left( \ln\left( \frac{1}{R} \right) \right) = \ln\left( \left( \frac{t}{\eta} \right)^\beta \right) = \beta \ln\left( \frac{t}{\eta} \right) = \beta \ln(t) - \beta \ln(\eta) $$

Finally, if we define R′ = ln(ln(1/R)), t′ = ln(t), and η′ = β ln(η) this equation has
the form:

$$ R' = \beta t' - \eta' $$

which is linear in t′.

In practice we would put n units up for life test and record their failure times. Then,
for the ith failure at time ti, we estimate the reliability with:

$$ \hat{R}_i = 1 - \frac{i}{n + 1} $$

If the failure times come from a Weibull population, then the transformed values of
(ti, R̂i) given by (t′i, R̂′i) as defined above should fall along a straight line.
MINITAB supports Weibull plots of complete failure data from its Graph> Probability
Plot menu with the Weibull option, and plots of censored data from its Stat> Reliability/
Survival> Distribution Analysis (Right Censoring) and Stat> Reliability/Survival>
Distribution Analysis (Arbitrary Censoring) menus.
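The linearized Weibull fit can also be sketched by hand in Python. This is an illustration under stated assumptions, not a substitute for MINITAB's reliability tools: the failure times are hypothetical, placed exactly on the quantiles of a Weibull distribution with β = 2 and η = 100 so that the fit should return those values, and `fit_line` is an illustrative helper.

```python
import math


def fit_line(xs, ys):
    """Least-squares intercept and slope for ys = a + b * xs."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    ss_x = sum((xi - x_bar) ** 2 for xi in xs)
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
    slope = ss_xy / ss_x
    return y_bar - slope * x_bar, slope


beta_true, eta_true = 2.0, 100.0   # hypothetical shape and scale factors
n = 9                              # hypothetical number of units on test

# Reliability estimate for the i-th ordered failure: R_i = 1 - i / (n + 1).
r_hat = [1 - i / (n + 1) for i in range(1, n + 1)]
# Hypothetical failure times placed exactly on the Weibull quantiles.
t = [eta_true * (-math.log(r)) ** (1 / beta_true) for r in r_hat]

t_prime = [math.log(ti) for ti in t]                 # t' = ln(t)
r_prime = [math.log(math.log(1 / r)) for r in r_hat] # R' = ln(ln(1/R))

intercept, slope = fit_line(t_prime, r_prime)
beta_hat = slope                                     # slope estimates beta
eta_hat = math.exp(-intercept / beta_hat)            # intercept = -beta * ln(eta)

print(f"beta = {beta_hat:.3f}, eta = {eta_hat:.2f}")
```

With real failure data the points scatter about the line, and the slope and intercept give estimates of β and η rather than exact values.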

8.11 POLYNOMIAL MODELS


The form of a model attempted for y (x) should always be based on an understanding of
the first-principles relationship between y and x (that is, the first principles of chemistry,
physics, biology, economics, and so on). In many cases a simple linear model is suffi-
cient. In other cases the first-principles relationship might suggest the need to transform
one or perhaps both variables before a linear model can be fitted. However, when the (x, y)
data display some complex nonlinear behavior and there is no known first-principles
explanation for that behavior, it usually becomes necessary to consider a polynomial
model. The general form of a polynomial model is:

$$ \hat{y} = b_0 + b_1 x + b_2 x^2 + \cdots + b_p x^p \tag{8.47} $$

where the polynomial is said to be of order p. The regression coefficients b0, b1, . . . ,
bp are determined using the same algorithm that was used for the simple linear
model; the error sum of squares is simultaneously minimized with respect to the regres-
sion coefficients. The family of equations that must be solved to determine the
regression coefficients is nightmarish, but most of the good statistical software
packages have this capability.
Although high-order polynomial models can fit the (x, y) data very well, they should
be of the lowest order possible that accurately represents the relationship between y and
x. There are no clear guidelines on what order might be necessary, but watch the sig-
nificance (that is, the p values) of the various regression coefficients to confirm that all
of the terms are contributing to the model. Polynomial models must also be hierarchi-
cal, that is, a model of order p must contain all possible lower-order terms.
Because of their complexity, it’s important to summarize the performance of
polynomial models using r²adjusted instead of r². In some cases when there are relatively few
error degrees of freedom after fitting a large polynomial model, the r² value could be
misleadingly large whereas r²adjusted will be much lower but more representative of the
true performance of the model.

Example 8.20
Write out the third-order polynomial model for y (x) and describe how the standard
error of the model is calculated.
Solution: The third-order polynomial model for y (x) has the form:

ŷ = b0 + b1 x + b2 x 2 + b3 x 3

The error sum of squares is given by:

SSε = ∑εi2

where
εi = yi − yˆi = yi − (b0 + b1 xi + b2 xi2 + b3 xi3 )

If there are n (x, y) observations in the data set there will be dftotal = n – 1 total degrees
of freedom where a degree of freedom is lost to calculate (x̄, ȳ). Each of the four regression
coefficients consumes a degree of freedom but only the first three are independent
so dfmodel = 3. By subtraction there will be dfε = n – 4 error degrees of freedom so the
standard error of the model will be:

$$s_\varepsilon = \sqrt{\frac{SS_\varepsilon}{df_\varepsilon}} = \sqrt{\frac{\sum_{i=1}^{n} \varepsilon_i^2}{n-4}}$$

Most statistical software packages and spreadsheets provide functions to perform
polynomial regression. In MINITAB, you must construct a column for each power of x
that you want to include in the model and then instruct MINITAB to include all of those
columns in the model. The Stat> Regression> Regression menu or the regress com-
mand at the command prompt are used to perform the regression calculations.

Example 8.21
Use MINITAB to construct a third-order polynomial model for the following data:

x 8.9 8.7 0.1 5.4 4.3 2.4 3.4 6.8 2.9 5.6 8.4 0.7 3.8 9.5 0.7
y 126 143 58 50 40 38 41 66 47 65 138 49 56 163 45

Solution: The MINITAB commands to fit the third-order polynomial are shown in
Figure 8.13. The y values were loaded into column c1 of the MINITAB worksheet and
the x values were loaded into column c2. The x² and x³ values were calculated in c3
and c4, respectively. The data and the fitted function are plotted in Figure 8.14. Despite
the fact that the model looks like it fits the data well, the regression coefficients are not
statistically significant. Another model should be considered such as a lower-order
polynomial or perhaps a model involving a transformation.

MTB > name c1 'y'
MTB > name c2 'x'
MTB > name c3 'x^2'
MTB > name c4 'x^3'
MTB > let 'x^2'='x'**2
MTB > let 'x^3'='x'**3
MTB > Regress 'y' 3 'x' 'x^2' 'x^3';
SUBC> Constant;
SUBC> Brief 2.

Regression Analysis: y versus x, x^2, x^3

The regression equation is


y = 54.8 - 7.49 x + 0.65 x^2 + 0.143 x^3

Predictor Coef SE Coef T P


Constant 54.783 7.932 6.91 0.000
x -7.486 8.138 -0.92 0.377
x^2 0.651 2.150 0.30 0.768
x^3 0.1431 0.1513 0.95 0.365

S = 9.88422 R-Sq = 95.9% R-Sq(adj) = 94.8%

Analysis of Variance

Source DF SS MS F P
Regression 3 25429.3 8476.4 86.76 0.000
Residual Error 11 1074.7 97.7
Total 14 26504.0

Source DF Seq SS
x 1 18452.9
x^2 1 6889.2
x^3 1 87.3

Figure 8.13 Fitting a third-order polynomial.
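
For readers without MINITAB, the same least squares fit can be sketched in plain Python. The sketch below builds one column per power of x, as the worksheet does, and solves the normal equations by Gaussian elimination. The helper names solve and polyfit are assumptions made for this illustration, and the normal-equations approach is a simple stand-in, not necessarily the algorithm MINITAB uses internally.

```python
import math


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def polyfit(x, y, order):
    """Return [b0, ..., bp] minimizing the error sum of squares (normal equations)."""
    terms = [[xi ** k for k in range(order + 1)] for xi in x]  # columns 1, x, x^2, ...
    xtx = [[sum(t[i] * t[j] for t in terms) for j in range(order + 1)]
           for i in range(order + 1)]
    xty = [sum(t[i] * yi for t, yi in zip(terms, y)) for i in range(order + 1)]
    return solve(xtx, xty)


# Data from Example 8.21.
x = [8.9, 8.7, 0.1, 5.4, 4.3, 2.4, 3.4, 6.8, 2.9, 5.6, 8.4, 0.7, 3.8, 9.5, 0.7]
y = [126, 143, 58, 50, 40, 38, 41, 66, 47, 65, 138, 49, 56, 163, 45]

b = polyfit(x, y, 3)
fitted = [sum(bk * xi ** k for k, bk in enumerate(b)) for xi in x]
ss_error = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
s_error = math.sqrt(ss_error / (len(x) - 4))  # df_error = n - 4 for a cubic

print("coefficients:", [round(bk, 4) for bk in b])
print("SS_error = %.1f, s = %.3f" % (ss_error, s_error))
```

The coefficients, error sum of squares, and standard error should agree with the Coef column, the Residual Error SS, and S in Figure 8.13 to within rounding. Normal equations become numerically fragile for high-order polynomials, which is one more reason to keep the order as low as possible.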

[Figure: scatter plot of y (vertical axis, 50 to 200) versus x (horizontal axis, 0 to 10) with the fitted curve y = 54.8 – 7.49x + 0.65x² + 0.143x³.]

Figure 8.14 Third-order polynomial fitted to example data.



8.12 GOODNESS OF FIT TESTS


Whenever a model is fitted to data it’s necessary to test the resulting goodness of the fit.
The fit is judged to be good when the mean of observed values of the response y taken
at a fixed level of x coincides with the predicted value of y from the model. (See Figure
8.16, page 311, for an example of a linear fit that does not provide a good fit to the data.)
Many people are under the misconception that the goodness of fit is indicated by the
coefficient of determination r², but goodness of fit and correlation are two different
issues. While a model with a high r² explains much of the observed variation in the
response, that model doesn’t necessarily provide a good fit to the data.
There are many methods that can be used to judge the goodness of a linear model’s
fit to data: post-regression graphical diagnostics, the use of a quadratic model, and the
linear lack of fit test. MINITAB supports all of these methods. The first method is a
simple graphical technique that only requires the proper training and practice to inter-
pret correctly. The last two methods are more formal quantitative methods. The purpose
of this section is to present all three of these techniques.

8.12.1 The Quadratic Model As a Test of Linear Goodness of Fit


The quadratic model goodness of fit test for the linear model uses the hypotheses H0:
there is no curvature in the data versus HA: there is curvature in the data. The test is
performed by fitting a quadratic model to the data:

$$y = b_0 + b_1 x + b_2 x^2 \qquad (8.48)$$

where the regression coefficients b0, b1, and b2 are estimates of parameters β0, β1, and
β2, respectively. The decision to accept or reject the null hypothesis regarding curvature
is based on the significance of the b2 regression coefficient. That is, the hypotheses can
be mathematically expressed as H0: β2 = 0 versus HA: β2 ≠ 0 and the test is carried out
using the t test method described earlier, where:

$$t_{b_2} = \frac{b_2}{s_{b_2}} \qquad (8.49)$$

is compared to tα/2 with dfε = n – 3 degrees of freedom. If tb2 is statistically significant
then there is evidence that the linear model does not fit the data. If tb2 is not statistically
significant then the quadratic term can be dropped from the model and the linear model
provides a good fit.
The quadratic model for linear lack of fit is especially useful when an independent
variable x has just three discrete levels in an experiment. This situation is encountered
frequently in designed experiments as we will see in Chapter 11. When there are more
than three levels of x in an experiment, the linear lack of fit test is preferred over the
quadratic model. The quadratic model may still be effective, but there are some situa-
tions in which it will not detect lack of fit that the linear lack of fit test picks up easily.

Example 8.22
Fit the following data with an appropriate model and use scatter plots and residu-
als diagnostic plots to check for lack of fit.

x 3 3 3 5 5 5 7 7 7 9 9 9 11 11 11
y 65 60 62 86 85 89 100 102 98 109 113 112 117 112 118

Solution: The linear regression model is shown in Figure 8.15. From the coefficient
of determination r² = 0.92 and the highly significant regression coefficients everything
looks just great, but the fitted line plot and residuals versus x plot in Figure 8.16 suggest

MTB > Regress 'y' 1 'x';
SUBC> Constant;
SUBC> Brief 2.

Regression Analysis: y versus x

The regression equation is


y = 49.2 + 6.57 x

Predictor Coef SE Coef T P


Constant 49.233 4.054 12.14 0.000
x 6.5667 0.5370 12.23 0.000

S = 5.88261 R-Sq = 92.0% R-Sq(adj) = 91.4%

Analysis of Variance

Source DF SS MS F P
Regression 1 5174.5 5174.5 149.53 0.000
Residual Error 13 449.9 34.6
Total 14 5624.4

Figure 8.15 Linear fit to data from Example 8.22.

[Figure: two panels. Left: fitted line plot of y versus x with the line y = 49.23 + 6.567x (S = 5.88261, R-Sq = 92.0%, R-Sq(adj) = 91.4%). Right: residuals versus x.]

Figure 8.16 Linear fit and residuals diagnostic plot for Example 8.22.

that there might be a problem with curvature. The graphs clearly indicate that the
model overpredicts the response y when x is at its extreme values and the model under-
predicts y when x takes on intermediate values. Clearly, the linear model is insufficient
to express y(x).
Since it appears that there might be significant curvature in y(x), the next step is
to fit a quadratic model to the data. The quadratic model has the form given in
Equation 8.48 where the squared term quantifies the amount of curvature present. The
quadratic model was fitted and is shown in Figure 8.17. From the regression and
ANOVA tables, this new model looks much better than the original linear model. The
r² is much higher, the standard error is smaller, but more importantly, the coefficient
of the quadratic term in the model is highly significant. This indicates that there is sig-
nificant curvature in the data and that the quadratic model really is necessary. The
fitted line plot and residuals versus x plot are shown in Figure 8.18. The improvement
in the quality of the fit is obvious. These observations all suggest that the quadratic
model provides a better fit to the data than the linear model does. The goodness of fit
of the quadratic model could be tested by fitting a cubic model to the data and testing
the regression coefficient of the x³ term for statistical significance; however, the two
plots suggest that this is an unnecessary step.

MTB > name c3 'x^2'
MTB > let 'x^2'='x'*'x'
MTB > Regress 'y' 2 'x' 'x^2';
SUBC> Constant;
SUBC> Brief 2.

Regression Analysis: y versus x, x^2

The regression equation is


y = 18.5 + 17.1 x - 0.750 x^2

Predictor Coef SE Coef T P


Constant 18.483 4.222 4.38 0.001
x 17.067 1.340 12.73 0.000
x^2 -0.75000 0.09440 -7.94 0.000

S = 2.44722 R-Sq = 98.7% R-Sq(adj) = 98.5%

Analysis of Variance

Source DF SS MS F P
Regression 2 5552.5 2776.3 463.57 0.000
Residual Error 12 71.9 6.0
Total 14 5624.4

Source DF Seq SS
x 1 5174.5
x^2 1 378.0

Figure 8.17 Quadratic fit to data from Example 8.22.
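
The t statistic for the quadratic term (Equation 8.49) can be checked without matrix algebra. The Python sketch below first removes the linear trend in x from both y and x², then regresses one set of residuals on the other. This residual-on-residual shortcut is a substitute technique chosen here for brevity, not the method MINITAB uses, and the helper names are assumptions made for this illustration. For a single added term it gives the same b2 and sb2 as the full quadratic fit.

```python
import math


def simple_fit(x, y):
    """Return (intercept, slope) of the least squares line y = b0 + b1*x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


def residuals(x, y):
    """Residuals of y after removing its least squares line in x."""
    b0, b1 = simple_fit(x, y)
    return [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]


# Data from Example 8.22.
x = [3, 3, 3, 5, 5, 5, 7, 7, 7, 9, 9, 9, 11, 11, 11]
y = [65, 60, 62, 86, 85, 89, 100, 102, 98, 109, 113, 112, 117, 112, 118]

e_y = residuals(x, y)                       # what the linear model leaves in y
e_x2 = residuals(x, [xi ** 2 for xi in x])  # the part of x^2 not explained by x

szz = sum(z * z for z in e_x2)
b2 = sum(z * e for z, e in zip(e_x2, e_y)) / szz
ss_error = sum((e - b2 * z) ** 2 for z, e in zip(e_x2, e_y))
df_error = len(x) - 3                       # n - 3 for the quadratic model
s_b2 = math.sqrt(ss_error / df_error / szz)
t_b2 = b2 / s_b2

print("b2 = %.4f, s_b2 = %.4f, t = %.2f" % (b2, s_b2, t_b2))
```

The printed values should match the x^2 row of Figure 8.17 (Coef –0.75000, SE Coef 0.09440, T –7.94) to within rounding.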



[Figure: two panels. Left: fitted line plot of y versus x with the curve y = 18.48 + 17.07x – 0.7500x² (S = 2.44722, R-Sq = 98.7%, R-Sq(adj) = 98.5%). Right: residuals versus x.]

Figure 8.18 Quadratic fit and residuals diagnostic plot for Example 8.22.

8.12.2 The Linear Lack of Fit Test


The linear lack of fit test is a powerful alternative to the quadratic model for testing good-
ness of fit. The rationale for the linear lack of fit test is relatively simple: the linear lack
of fit test contrasts the error estimates from two models for the same data. The first model
is the usual regression model. The second model is a one-way classification model fitted
using ANOVA where the treatments are defined by the x values.* Because the one-way
ANOVA model always has more degrees of freedom than the linear regression model it
must always fit the data better, so its error sum of squares must be smaller. Since the resid-
uals from the ANOVA model can only be due to random error about the treatment means,
their contribution to the total variability is referred to as pure error. In contrast, the linear
regression residuals can get contributions from two sources: a contribution from truly ran-
dom or pure error and a contribution due to biases in the treatment means from the values
predicted by the linear model. If the linear model is valid then the treatment means defined
by the one-way classification based on x will fall on or near the values predicted by the
linear model. If, however, the treatment means differ substantially from the values pre-
dicted by the linear model then there is evidence of linear lack of fit and another model—
something other than the linear model—should be considered.
The lack of fit test calculations are done by constructing and combining the results
of the linear regression and one-way ANOVA models. The allocation of sums of squares
for the linear lack of fit test is shown in Figure 8.19 where SSε(PureError) is the error sum
of squares taken directly from the one-way ANOVA model; that is: SSε(PureError) =
SSε(ANOVA). The sum of squares associated with linear lack of fit is given by the difference
between the error sums of squares of the two models:

$$SS_{\varepsilon(LOF)} = SS_{\varepsilon(Regression)} - SS_{\varepsilon(PureError)} \qquad (8.50)$$

* When repeated observations are not made at identical x values, the observations can still be grouped for the linear
lack of fit test according to comparable x values.

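[Figure: tree diagram. For the regression model, SSTotal splits into SSRegression and SSε(Regression), and SSε(Regression) splits further into SSε(LOF) and SSε(PureError). For the ANOVA model, SSTotal splits into SSANOVA and SSε(PureError).]

Figure 8.19 Relationship between ANOVA and regression sums of squares.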

SSε(LOF) is never negative because the ANOVA model is at least as flexible as the
regression model, so SSε(PureError) can never exceed SSε(Regression).
The degrees of freedom for the linear lack of fit calculations also break down
according to the tree diagram in Figure 8.19. (Just replace each SS in the figure with df.)
The degrees of freedom associated with linear lack of fit are given by the difference
between the error degrees of freedom from the two models:

$$df_{\varepsilon(LOF)} = df_{\varepsilon(Regression)} - df_{\varepsilon(PureError)} \qquad (8.51)$$

The mean square associated with lack of fit is given by:

$$MS_{\varepsilon(LOF)} = \frac{SS_{\varepsilon(LOF)}}{df_{\varepsilon(LOF)}} \qquad (8.52)$$

which is tested for significance against the ANOVA (or pure) error mean square:

$$F_{LOF} = \frac{MS_{\varepsilon(LOF)}}{MS_{\varepsilon(PureError)}} \qquad (8.53)$$

If FLOF is statistically significant then we must accept the hypothesis that the linear
model does not fit the data and another model—perhaps one with curvature—should be
considered. Table 8.2 shows the structure of the new linear regression ANOVA table,
which includes the linear lack of fit calculations.
MINITAB supports lack of fit calculations from the Options> Lack of Fit Tests
menu in Stat> Regression> Regression. If two or more observations are taken at each
level of x, then use the Pure Error option. If the x values are not repeated, then use the
Data Subsetting option.

Example 8.23
Use MINITAB’s pure error lack of fit test option to perform the linear lack of fit test
for the data from Example 8.22. Use the results from the linear regression and ANOVA
analyses to confirm the lack of fit test results.

Table 8.2 ANOVA table layout with lack of fit.

Source            df                SS                MS                F
Regression        dfRegression      SSRegression      MSRegression      FRegression
Residual Error    dfε(Regression)   SSε(Regression)   MSε(Regression)
  Lack of Fit     dfε(LOF)          SSε(LOF)          MSε(LOF)          FLOF
  Pure Error      dfε(PureError)    SSε(PureError)    MSε(PureError)
Total             dfTotal           SSTotal

MTB > Regress 'y' 1 'x';
SUBC> Constant;
SUBC> Pure;
SUBC> Brief 2.

Regression Analysis: y versus x

The regression equation is


y = 49.2 + 6.57 x

Predictor Coef SE Coef T P


Constant 49.233 4.054 12.14 0.000
x 6.5667 0.5370 12.23 0.000

S = 5.88261 R-Sq = 92.0% R-Sq(adj) = 91.4%

Analysis of Variance

Source DF SS MS F P
Regression 1 5174.5 5174.5 149.53 0.000
Residual Error 13 449.9 34.6
Lack of Fit 3 391.2 130.4 22.23 0.000
Pure Error 10 58.7 5.9
Total 14 5624.4

Figure 8.20 MINITAB’s regression output with lack of fit information.

Solution: The MINITAB regression output showing the results of the pure error lack
of fit test is shown in Figure 8.20. The significant lack of fit term (FLOF = 22.23, pLOF =
0.000) indicates that the linear model does not fit the data.
The linear regression and one-way ANOVA analyses of the data are shown in
Figures 8.15, page 310, and 8.21, respectively. The sum of squares associated with lack
of fit is given by the difference between the error sums of squares of the two models as
in Equation 8.50:

$$SS_{\varepsilon(LOF)} = 449.9 - 58.67 = 391.23$$

Similarly, the degrees of freedom to estimate the lack of fit is given by the difference
between the degrees of freedom of the two models as in Equation 8.51:

MTB > Oneway 'y' 'x'.

One-way ANOVA: y versus x

Source DF SS MS F P
x 4 5565.73 1391.43 237.18 0.000
Error 10 58.67 5.87
Total 14 5624.40

S = 2.422 R-Sq = 98.96% R-Sq(adj) = 98.54%

Individual 95% CIs For Mean Based on


Pooled StDev
Level N Mean StDev -+---------+---------+---------+--------
3 3 62.33 2.52 (--*-)
5 3 86.67 2.08 (-*-)
7 3 100.00 2.00 (-*-)
9 3 111.33 2.08 (-*-)
11 3 115.67 3.21 (-*-)
-+---------+---------+---------+--------
60 75 90 105
Pooled StDev = 2.42

Figure 8.21 ANOVA model from Example 8.22.

$$df_{\varepsilon(LOF)} = 13 - 10 = 3$$

The mean square associated with lack of fit is given by Equation 8.52:

$$MS_{\varepsilon(LOF)} = \frac{SS_{\varepsilon(LOF)}}{df_{\varepsilon(LOF)}} = \frac{391.23}{3} = 130.4$$

This mean square is tested for significance by comparing it to MSε(PureError) according to
Equation 8.53:

$$F_{LOF} = \frac{MS_{\varepsilon(LOF)}}{MS_{\varepsilon(PureError)}} = \frac{130.4}{5.87} = 22.2$$

FLOF has dfε(LOF) = 3 numerator degrees of freedom and dfε(PureError) = 10 denominator
degrees of freedom. Its corresponding p value is pLOF = 0.0001, which is highly significant.
The lack of fit sums of squares, degrees of freedom, mean squares, F, and p values
all confirm the results of MINITAB’s lack of fit calculations. Clearly, there is evidence
of lack of fit in the linear model.
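
The lack of fit arithmetic in this example can also be reproduced directly from the raw data. The following Python sketch is an illustration under simple assumptions: observations are grouped by exact x value (the pure error case), and all variable names are chosen for this sketch rather than taken from MINITAB.

```python
from collections import defaultdict

# Data from Example 8.22.
x = [3, 3, 3, 5, 5, 5, 7, 7, 7, 9, 9, 9, 11, 11, 11]
y = [65, 60, 62, 86, 85, 89, 100, 102, 98, 109, 113, 112, 117, 112, 118]
n = len(x)

# Linear regression error sum of squares, SS_error(Regression).
x_bar = sum(x) / n
y_bar = sum(y) / n
sxx = sum((xi - x_bar) ** 2 for xi in x)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar
ss_error_regression = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
df_error_regression = n - 2

# Pure error: scatter of the observations about their own treatment means.
groups = defaultdict(list)
for xi, yi in zip(x, y):
    groups[xi].append(yi)
ss_pure_error = sum(
    sum((yi - sum(g) / len(g)) ** 2 for yi in g) for g in groups.values()
)
df_pure_error = n - len(groups)

# Lack of fit by subtraction (Equations 8.50 to 8.53).
ss_lof = ss_error_regression - ss_pure_error
df_lof = df_error_regression - df_pure_error
f_lof = (ss_lof / df_lof) / (ss_pure_error / df_pure_error)

print("SS_LOF = %.1f, df_LOF = %d, F_LOF = %.2f" % (ss_lof, df_lof, f_lof))
```

The result should reproduce the Lack of Fit row of Figure 8.20 (DF 3, SS 391.2, F 22.23) to within rounding. The p value is not computed here because the Python standard library has no F distribution function.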

The lack of fit test method can be extended to test for lack of fit in any fitted func-
tion provided that the observations can be broken up into enough different groups that
there are sufficient degrees of freedom to perform the test. As a minimum, the number

Table 8.3 Lack of fit calculations for quadratic model.

Source           df    SS        MS        F       p
Regression        2    5552.5    2776.25   463.5   0.000
Error            12      71.9       5.99
  Lack of Fit     2      13.2       6.6      1.12  0.363
  Pure Error     10      58.7       5.87
Total            14    5624.4

of groupings according to the x variable must be at least one greater than the number of
regression coefficients in the fitted model. For example, to test a quadratic model for
lack of fit, the observations must be classified into at least four groups. MINITAB does
not support these calculations but they are relatively easy to perform by comparing the
regression and ANOVA reports.

Example 8.24
Test the quadratic model for the data from Example 8.22 for lack of fit.
Solution: The quadratic and one-way classification models are given in Figures
8.17, page 311, and 8.21, respectively. The degrees of freedom and sums of squares
were taken from these figures and used to construct the ANOVA table showing the lack
of fit calculations for the quadratic model in Table 8.3. The relatively small FLOF value
and corresponding large pLOF value indicate that there is no evidence of lack of fit in the
quadratic model. Apparently the quadratic model is a good fit to the data.
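
The entries in Table 8.3 can be checked with a short Python sketch. It relies on one convenient fact: when the numerator has exactly 2 degrees of freedom, the upper tail area of the F distribution has the closed form P(F > f) = (1 + 2f/df2)^(–df2/2), so no statistics library is needed. The function name is an assumption made for this illustration, and the shortcut does not apply to other numerator degrees of freedom.

```python
def f_upper_tail_2df(f, df_denominator):
    """P(F > f) for an F distribution with exactly 2 numerator degrees of freedom."""
    return (1.0 + 2.0 * f / df_denominator) ** (-df_denominator / 2.0)


# Values taken from Figures 8.17 and 8.21, rounded as in Table 8.3.
ss_error_quadratic = 71.9
df_error_quadratic = 12
ss_pure_error = 58.7
df_pure_error = 10

ss_lof = ss_error_quadratic - ss_pure_error
df_lof = df_error_quadratic - df_pure_error
f_lof = (ss_lof / df_lof) / (ss_pure_error / df_pure_error)
p_lof = f_upper_tail_2df(f_lof, df_pure_error)

print("SS_LOF = %.1f, df_LOF = %d, F_LOF = %.2f, p_LOF = %.3f"
      % (ss_lof, df_lof, f_lof, p_lof))
```

The output should agree with the Lack of Fit row of Table 8.3 (SS 13.2, df 2, F 1.12, p 0.363).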

8.13 ERRORS IN VARIABLES


The previous section dealt with the issue of a regression assumption violation—that the
linear model did not provide a good fit to the data. Another type of regression assump-
tion violation occurs when there is random error in the independent variable x. This
conflicts with the requirement that the x values be known exactly. The consequence of
this assumption violation is that the resulting regression coefficients determined using
methods from this chapter become biased; however, when the standard deviation of the
random error in the x values is known or can be estimated, then the bias in the regres-
sion coefficients can be removed using errors-in-variables regression analysis.
Consider a situation in which a response y is a linear function of an independent
variable x:

$$y_i = a + b x_i + \varepsilon_i \qquad (8.54)$$

where the εi are normally distributed errors with με = 0 and constant variance σε². All
appears to be fine, except suppose that the xi are not directly measurable and can only
be approximated by an observable quantity wi:

$$x_i = w_i + u_i \qquad (8.55)$$

where the ui are normally distributed errors with mean μu = 0 and constant variance σu².
Since the xi are not known, we cannot fit y(x) so we must settle for y(w) obtaining:

$$y = a_w + b_w w \qquad (8.56)$$

where the subscript w indicates that the regression coefficients are calculated from the
(w, y) observations. It can be shown (with difficulty) that the true regression coefficient
b is related to bw by:
$$b = b_w \left( \frac{\sigma_w^2}{\sigma_w^2 - \sigma_u^2} \right) \qquad (8.57)$$

where σw² is the variance of the w observations and σu² must be known or estimated
from repeated x observations or from an independent experiment. Once the corrected
value of b is determined, the corrected value of a is given by:

$$a = \bar{y} - b\bar{w} \qquad (8.58)$$

and the error variance σε² can be recovered from:

$$\sigma_\varepsilon^2 = \sigma_{\varepsilon(y(w))}^2 - b^2 \sigma_u^2 \qquad (8.59)$$

where σε²(y(w)) is the error variance from the linear fit of y(w).
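
The three correction formulas (Equations 8.57 through 8.59) chain together naturally, as the following Python sketch shows. The function name, argument names, and all of the numbers are hypothetical and were chosen only so the arithmetic is easy to follow. The intercept step assumes that Equation 8.58 uses the sample means of y and w.

```python
def correct_for_x_error(b_w, var_w, var_u, w_bar, y_bar, var_error_w):
    """Apply Equations 8.57-8.59 to the coefficients of a fit of y on w.

    b_w          slope from the ordinary fit of y on the observed w
    var_w        variance of the w observations
    var_u        variance of the random error in x (known or estimated)
    w_bar, y_bar sample means of w and y
    var_error_w  error variance from the ordinary fit of y on w
    """
    if var_u >= var_w:
        raise ValueError("var_u must be smaller than var_w for the correction")
    b = b_w * var_w / (var_w - var_u)         # Equation 8.57
    a = y_bar - b * w_bar                     # Equation 8.58
    var_error = var_error_w - b ** 2 * var_u  # Equation 8.59
    return a, b, var_error


# Hypothetical inputs, for illustration only.
a, b, var_error = correct_for_x_error(
    b_w=1.8, var_w=4.0, var_u=0.4, w_bar=5.0, y_bar=12.0, var_error_w=2.0
)
print("a = %.3f, b = %.3f, error variance = %.3f" % (a, b, var_error))
```

With these inputs the corrected slope is larger in magnitude than the uncorrected one, which is the usual direction of the correction: random error in the predictor pulls the fitted slope toward zero.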

8.14 WEIGHTED REGRESSION


The linear regression method assumes that the regression model residuals are
homoscedastic so that all observations in the data set deserve to be weighted equally in
the analysis. When the residuals are heteroscedastic, the observations with greater
inherent noise deserve to be weighted less heavily than those observations where the
noise is smaller. The usual first approach to dealing with heteroscedastic residuals is to
attempt a variable transformation that recovers the homoscedasticity of the residuals,
but when such a transform cannot be found, it becomes necessary to introduce weight-
ing factors for each observation. The new array of observations has the form (xi , yi , wi )
where the wi are the weighting factors. The wi are chosen to be the reciprocals of the
local error variance:
$$w_i = \frac{1}{\sigma_i^2} \qquad (8.60)$$

The result of applying such weights to observations is that the weighted residuals
given by:

$$\varepsilon_i' = \sqrt{w_i}\,\varepsilon_i \qquad (8.61)$$

will be homoscedastic, which satisfies the modified regression assumption. This
approach is equivalent to minimizing SSε′ = Σwiεi² instead of the usual SSε = Σεi² with
respect to the regression coefficients.
In most cases, the values of the error variance to associate with the observations are
unknown and must be determined empirically. When there are many repeated observations
at a limited set of xi values the σi² can be estimated from each set. When there
are not repeated observations but the εi appear to be systematically related to the xi, the
usual approach is to try to find a function of the form εi² = f(xi), then to use the resulting
function to predict error variances for each observation, and finally to determine the
necessary weighting factors from wi = 1/σ̂i².
MINITAB’s Stat> Regression> Regression function allows the column of weights
wi to be specified from its Options menu. See MINITAB’s Help files or Neter et al.
(1996) for help with weighted regression.
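
For a single predictor, the weighted fit has a simple closed form: replace the ordinary means and sums of squares with their weighted counterparts. The Python sketch below illustrates this. The function name and the data are hypothetical, and the standard deviations are assumed to be known so that the weights follow Equation 8.60.

```python
def weighted_line_fit(x, y, w):
    """Return (b0, b1) minimizing sum(w_i * (y_i - b0 - b1*x_i)**2)."""
    sw = sum(w)
    x_bar = sum(wi * xi for wi, xi in zip(w, x)) / sw   # weighted mean of x
    y_bar = sum(wi * yi for wi, yi in zip(w, y)) / sw   # weighted mean of y
    sxx = sum(wi * (xi - x_bar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - x_bar) * (yi - y_bar) for wi, xi, yi in zip(w, x, y))
    b1 = sxy / sxx
    return y_bar - b1 * x_bar, b1


# Hypothetical data in which the noise grows with x.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [5.1, 7.9, 11.3, 13.2, 18.0]
sigma = [0.1, 0.2, 0.4, 0.8, 1.6]      # assumed local standard deviations
w = [1.0 / s ** 2 for s in sigma]      # Equation 8.60

b0_weighted, b1_weighted = weighted_line_fit(x, y, w)
b0_ordinary, b1_ordinary = weighted_line_fit(x, y, [1.0] * len(x))

print("weighted:   y = %.3f + %.3f x" % (b0_weighted, b1_weighted))
print("unweighted: y = %.3f + %.3f x" % (b0_ordinary, b1_ordinary))
```

Setting every weight to the same value reproduces the ordinary least squares line, so the unweighted fit is just a special case of the same function. In the weighted fit the precise observations at small x dominate the result.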

8.15 CODED VARIABLES


In many situations it is necessary to use coded levels of an independent variable instead
of the actual quantitative levels of that variable. This is usually done when only two or
three equally spaced levels of a quantitative variable are required. The codes used are
just like the transforms used to get from measurement units on an x axis to standard normal
z units, or from x to t units, or from sample variance s² to χ² units. The codes
required here are actually much easier to use than any of these transforms. It is gener-
ally not necessary to be concerned with coding when there is just one independent vari-
able in a problem. However, as soon as two or more independent variables are involved,
coding becomes a necessity.
In many experiments only two levels of a quantitative variable will be considered.
Rather than using the values of the quantitative levels in calculations, the two levels are
referenced by the codes –1 for the smaller level and +1 for the larger level. This arrangement
is shown in Figure 8.22; however, we require a more formal relationship between
the two scales. Consider the same situation described in Figure 8.23. The coding makes
use of two quantities from the x or original measurement units axis: the midpoint or zero
level between the –1 and +1 levels, and the step size from the zero level out to the –1
and +1 levels. Let the zero level be denoted x0 and the step size be denoted Δx. Then

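[Figure: parallel axes. The original x axis marks xsmall and xlarge; the coded x′ axis marks –1 and +1 directly beneath them.]

Figure 8.22 Original and coded axes.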



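[Figure: parallel axes. The original axis marks x–, x0, and x+, with a step of Δx on either side of x0; the coded axis marks –1, 0, and +1 beneath them.]

Figure 8.23 Transformation between original and coded values.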

it makes sense to let the –1 and +1 levels of x be denoted x– and x+, respectively. If we
let the coded levels of x be indicated by the symbol x′, then we can easily switch from
a value in original measurement units to its corresponding coded value with:

$$x' = \frac{x - x_0}{\Delta x} \qquad (8.62)$$

And by solving this equation for x we can easily switch from coded units back to mea-
surement units:
$$x = x_0 + x' \Delta x \qquad (8.63)$$

This looks messy now, but the use of coded variables is so common that shortly you will
do it without thinking about it. The x′ notation is also not used universally; it’s just been
introduced here for clarity, and you will not see it used in this book outside of this chap-
ter. It should also be apparent that the transformation equations that have just been
defined are linear equations much like the ones discussed earlier in this chapter. It is
possible and entirely appropriate to redraw Figures 8.22 and 8.23 as x–y plots with x on
the vertical axis and x′ on the horizontal axis, but since the transforms will generally be
used in place of the original measurement values it makes sense to think of the two
scales in parallel, as they are presented in the figures.

Example 8.25
An experiment is performed with two levels of temperature: 25C and 35C. If these
are the –1 and +1 levels of temperature, respectively, then find the coded value that cor-
responds to 28C.
Solution: The zero level of temperature is x0 = 30C and the step size to the –1 and
+1 levels is Δx = 5C, so the transformation equation to coded units is:

$$x' = \frac{x - 30}{5}$$

Then the coded value of x = 28C is:



$$x' = \frac{28 - 30}{5} = -0.4$$

The solution is shown graphically in Figure 8.24.

Example 8.26
Use the definitions in the preceding example to determine the temperature that has
a coded value of x′ = +0.6.
Solution: The equation to transform from coded to actual values is:

$$x = 30 + 5x'$$

so the actual temperature that corresponds to the coded value x′ = +0.6 is:

$$x = 30 + 5(0.6) = 33$$

The solution is shown graphically in Figure 8.25.
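
Equations 8.62 and 8.63 translate directly into a pair of small functions. The Python sketch below is an illustration only: the function names are assumptions, and the zero level and step size are computed from the –1 and +1 levels rather than passed in separately.

```python
def to_coded(x, x_minus, x_plus):
    """Convert x from measurement units to coded units (Equation 8.62)."""
    x0 = (x_plus + x_minus) / 2   # zero level
    dx = (x_plus - x_minus) / 2   # step size from the zero level to +/-1
    return (x - x0) / dx


def from_coded(x_coded, x_minus, x_plus):
    """Convert a coded value back to measurement units (Equation 8.63)."""
    x0 = (x_plus + x_minus) / 2
    dx = (x_plus - x_minus) / 2
    return x0 + x_coded * dx


# Temperature levels from Examples 8.25 and 8.26: 25C is -1 and 35C is +1.
print(to_coded(28, 25, 35))      # coded value of 28C
print(from_coded(0.6, 25, 35))   # temperature for coded value +0.6
```

The two calls reproduce the answers of Examples 8.25 and 8.26, a coded value of –0.4 and a temperature of 33C. The same functions give non-integer coded values for any level that falls between or beyond the two design levels.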

8.16 MULTIPLE REGRESSION


When a response has n quantitative predictors such as y (x1, x2, . . . , xn), the model for
y must be created by multiple regression. In multiple regression each predictive term in

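[Figure: parallel axes. The temperature axis T marks 25, 28, 30, and 35 with a 5C step on either side of 30; the coded axis T′ marks –1, –0.4, 0, and +1 beneath them.]

Figure 8.24 Transformation of T = 28C to coded units.

[Figure: parallel axes. The temperature axis T marks 25, 30, 33, and 35 with a 5C step on either side of 30; the coded axis T′ marks –1, 0, 0.6, and +1 beneath them.]

Figure 8.25 Transformation of T′ = 0.6 coded units back to temperature T units.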



the model has its own regression coefficient. The simplest multiple regression model
contains a linear term for each predictor:

$$y = b_0 + b_1 x_1 + b_2 x_2 + \cdots + b_n x_n \qquad (8.64)$$

This equation has the same basic structure as the polynomial model in Equation 8.47 and,
in fact, the two models are fitted and analyzed in much the same way. Where the worksheet
to fit the polynomial model of order p requires p columns, one for each power of x, the
worksheet to fit the multiple regression model requires n columns to account for each of
the n predictors. The same regression methods are used to analyze both problems.
Frequently, the simple linear model in Equation 8.64 does not fit the data and a
more complex model is required. The terms that must be added to the model to achieve
a good fit might involve interactions, quadratic terms, or terms of even higher order.
Such models have the basic form:

$$y = b_0 + b_1 x_1 + b_2 x_2 + \cdots + b_{12} x_1 x_2 + \cdots + b_{11} x_1^2 + b_{22} x_2^2 + \cdots \qquad (8.65)$$

and must be hierarchical, that is, if a complex term is to be included in the model then
all of the simpler terms that can be derived from it must also be present in the model.
For example, if a model is to contain a term like b123x1x2x3 then the model must also contain
x1, x2, x3, x1x2, x1x3, and x2x3. Complex equations like this can be fitted in the usual way
after a column is created in the worksheet for each term in the model.
The relationship between the levels used in an experiment for the different quantita-
tive predictors plays a role in determining what model can be fitted for the response.
Ideally, the predictors should be completely independent of each other. Then each pre-
dictor can be included in the model and their effects will be quantified independently.
Things become more complicated when predictors are dependent on each other. Suppose
that two predictors are perfectly correlated, that is, that the magnitude of their correla-
tion coefficient is unity. A series of models can be constructed that contain either or both
variables; however, when both variables are included in the model, it is impossible to
determine unique regression coefficients for the predictors. In fact, there are an infinite
number of sets of regression coefficients that deliver identical performance. This prob-
lem limits the predictive use of the model to those cases in which the correlation between
the two predictors is preserved. If the correlation is broken then the model cannot be used
because the independent effects of the predictors have not been determined.
An experiment that has correlated quantitative predictors is said to suffer from a
form of variable confounding called colinearity. Colinearity is a continuous, not binary,
characteristic of an experiment design. Generally, we wish to have complete indepen-
dence between our predictive variables, but sometimes, by design or by chance, some
degree of dependence appears between variables. For example, in a passive experiment
(that is, where the experimental input and output variables are observed but not con-
trolled) certain predictors may be naturally and uncontrollably correlated. Special
analysis methods and interpretations are available for problems that suffer from some

colinearity, but the general nature of DOE is to avoid colinearity so that these methods
are not necessary.
To demonstrate the difficulties caused by colinearity, consider a simple example.
Suppose that a response y depends on two predictors x1 and x2, that an experiment is per-
formed in which all of the observations are taken such that x1 = x2, and that when y is
modeled as a function of x1, an excellent fit of the form y = 10x1 is obtained that meets
all of the requirements of the regression method. The response could also be expressed
as y = 10x2, or combinations of x1 and x2 could be considered, for example: y = 5x1 + 5x2,
y = –5x1 + 15x2, y = 20x1 – 10x2, and an infinite number of others are possible. [If you
want to check that all of these give the same answer, try an example like y (x1, x2) = y (1,
1).] Although these are all excellent models for y, they are all constrained by the condi-
tion x1 = x2. As long as this condition is satisfied then y can be safely predicted from any
of these models, but as soon as the correlation between x1 and x2 is broken then none of
the models can be used. If we are going to go to the trouble of doing an experiment, we
would prefer to do it such that the effects of x1 and x2 could be determined independently.
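
The point is easy to confirm numerically. The Python sketch below evaluates the five candidate models from the paragraph above at one point that satisfies x1 = x2 and at one point that does not. The dictionary of models and the function name are assumptions made for this illustration.

```python
# The candidate models listed above; all were fitted under the condition x1 = x2.
models = {
    "y = 10*x1": lambda x1, x2: 10 * x1,
    "y = 10*x2": lambda x1, x2: 10 * x2,
    "y = 5*x1 + 5*x2": lambda x1, x2: 5 * x1 + 5 * x2,
    "y = -5*x1 + 15*x2": lambda x1, x2: -5 * x1 + 15 * x2,
    "y = 20*x1 - 10*x2": lambda x1, x2: 20 * x1 - 10 * x2,
}


def predict_all(x1, x2):
    """Return each model's prediction at the point (x1, x2)."""
    return {name: model(x1, x2) for name, model in models.items()}


on_constraint = predict_all(1, 1)    # x1 = x2: every model agrees
off_constraint = predict_all(1, 2)   # correlation broken: the models disagree

for name in models:
    print("%-20s y(1, 1) = %3d   y(1, 2) = %3d"
          % (name, on_constraint[name], off_constraint[name]))
```

All five models predict y = 10 at (1, 1), but at (1, 2) their predictions range from 0 to 25. The data give no basis for choosing among them, which is why none of them can be trusted once the condition x1 = x2 no longer holds.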
The specific intent of DOE is to avoid the problems caused by correlated predic-
tors. Designs that have independent predictors are called orthogonal designs. Except for
some special cases, these are the designs that will be considered in the remaining chap-
ters of this book. The orthogonality of a design is often evaluated by constructing a
matrix of the correlation coefficients (r) between all possible pairs of predictors. Designs
that are orthogonal will have r = 0 for all pairs of predictors.
Multiple regression can be used to fit both empirical and first-principles models to
data; however, the values used in the model for the different predictors depend on which
type of model is being fitted. When an empirical model is being fitted and includes an
interaction between two independent variables, the variables must first be coded using the
methods described in Section 8.15. If variables are not coded then incorrect regression
coefficients may be obtained for the main effects and interactions. When a first-principles
model is being fitted then the variables may be expressed in their original measurement
units. Then the regression coefficients are often equal to physical or material constants
suggested by the first-principles model.
Multiple regression can be performed from MINITAB’s Stat> Regression>
Regression menu or with the regress command at the command prompt. Each predictor
must be created in a separate column of the worksheet before the model can be fitted.
This might require you to explicitly create columns for the squares of variables, interac-
tions, and any transformations. Use the let command (or Stat> Calc> Calculator) to create
these columns. The syntax for the regression command from the command prompt is
similar to that for regression on one predictor variable. For example, to regress a response
in c1 as a function of three predictors in columns c2, c3, and c4 use:
mtb > regress c1 3 c2-c4

Example 8.27
Analyze the following 2² experiment with two replicates using multiple linear
regression. Use an empirical model including terms for x1, x2, and their interaction.

Compare the model obtained by fitting the original numerical values of the predictors
with the model obtained by fitting the transformed values.

x1     x2    y
10     40    286, 1
10     50    114, 91
100    40    803, 749
100    50    591, 598

Solution: The data were entered into a MINITAB worksheet with a single response
in each row. Then the x1 and x2 columns were multiplied together using the let command
to create the x12 interaction column. Figure 8.26 shows a matrix plot of the response and
the three predictors created with Graph> Matrix Plot. The first row of plots shows that
y appears to increase with respect to x1 and x12 but does not appear to depend on x2 at
all. The plot of x1 versus x2 shows that they are independent of each other, the plot of x1
versus x12 shows that they are very strongly correlated, and the plot of x2 versus x12 shows
that they are mostly independent of each other.
Figure 8.27 shows the data and multiple regression analysis using the original val-
ues of the predictors. The correlation matrix of the predictors confirms the observations
made from the matrix plot: x1 and x2 are independent (r = 0), x1 and x12 are strongly cor-
related (r = 0.985), and x2 and x12 are weakly correlated (r = 0.134). None of the pre-
dictors in the regression analysis are statistically significant. This result is unexpected
because of the apparently strong correlations observed between the response and x1
and x12 in the matrix plot.

[Matrix plot: rows y, x1, and x2; columns x1, x2, and x12.]

Figure 8.26 Matrix plot of response and uncoded predictors.



MTB > print c1-c4

Data Display

Row y x1 x2 x12
1 286 10 40 400
2 1 10 40 400
3 114 10 50 500
4 91 10 50 500
5 803 100 40 4000
6 749 100 40 4000
7 591 100 50 5000
8 598 100 50 5000

MTB > corr c2-c4;
SUBC> nopvalue.

Correlations: x1, x2, x12

x1 x2
x2 0.000
x12 0.985 0.134

Cell Contents: Pearson correlation

MTB > regress c1 3 c2-c4

Regression Analysis: y versus x1, x2, x12

The regression equation is


y = 175 + 13.3 x1 - 2.5 x2 - 0.156 x12

Predictor Coef SE Coef T P


Constant 174.8 520.3 0.34 0.754
x1 13.272 7.321 1.81 0.144
x2 -2.54 11.49 -0.22 0.836
x12 -0.1561 0.1617 -0.97 0.389

S = 102.907 R-Sq = 94.0% R-Sq(adj) = 89.5%

Analysis of Variance

Source DF SS MS F P
Regression 3 666873 222291 20.99 0.007
Residual Error 4 42359 10590
Total 7 709233

Source DF Seq SS
x1 1 632250
x2 1 24753
x12 1 9870

Figure 8.27 Multiple regression analysis using original values of the predictors.

The regression analysis in Figure 8.27 is flawed because of the strong correlation
between x1 and x12. This correlation is not real; it is an artifact of the use of the original
values of the predictors instead of the coded values. The coded values (cx1, cx2, cx12)
of the predictors were determined by assigning the values –1 and +1 to the low and high
values of the original predictors, respectively. Figure 8.28 shows the matrix plot of the
response and coded predictors and Figure 8.29 shows the corresponding regression
analysis. Figure 8.28 suggests that the response depends only on x1 and that the pre-
dictors are all independent of each other. This is a very different interpretation of the
situation than that provided by Figure 8.26. Figure 8.28 provides the correct interpre-
tation because it faithfully represents the true independence of the predictors. The
regression analysis in Figure 8.29 confirms that the predictors are all independent (r = 0
for all pairs of predictors) and that y only depends on x1. A comparison of the ANOVA
tables and summary statistics from Figures 8.27 and 8.29 shows that they are identical,
but the regression analysis in Figure 8.27 completely misses the dependence of y on x1
because of the collinear predictors.
Figure 8.26 suggests that y depends rather strongly on x12, but any hint of this
dependence is missing in Figure 8.28. The correlation between the response and x12
in Figure 8.26 is caused by the collinearity of x12 and x1. To determine the interaction
between x1 and x2, these predictors were simply multiplied together. Since x2 is relatively
constant compared to x1, the x12 term determined from x12 = x1x2 is essentially
proportional to x1. Coding the original values of the predictors eliminates this
mathematical difficulty so that the true influence of the interaction on the response can be
determined.
This example clearly demonstrates that coded values should be used for the
predictors when multiple regression is used to build empirical models that include
interactions between two or more predictors.
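
The correlations reported in Figures 8.27 and 8.29 can be checked outside MINITAB. The following is a minimal Python sketch, not part of the original analysis; the helper names `pearson` and `code_levels` are hypothetical, and the coding rule (low level to -1, high level to +1) is the one described in the example.

```python
import math

def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    s_ab = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    s_aa = sum((u - mean_a) ** 2 for u in a)
    s_bb = sum((v - mean_b) ** 2 for v in b)
    return s_ab / math.sqrt(s_aa * s_bb)

def code_levels(values):
    """Map the low level to -1 and the high level to +1 (two-level coding)."""
    lo, hi = min(values), max(values)
    return [2 * (v - lo) / (hi - lo) - 1 for v in values]

# Predictor settings from Example 8.27, one entry per run
x1 = [10, 10, 10, 10, 100, 100, 100, 100]
x2 = [40, 40, 50, 50, 40, 40, 50, 50]
x12 = [u * v for u, v in zip(x1, x2)]          # uncoded interaction column

cx1, cx2 = code_levels(x1), code_levels(x2)
cx12 = [u * v for u, v in zip(cx1, cx2)]       # coded interaction column

r_uncoded = (pearson(x1, x2), pearson(x1, x12), pearson(x2, x12))
r_coded = (pearson(cx1, cx2), pearson(cx1, cx12), pearson(cx2, cx12))

print("uncoded:", [round(r, 3) for r in r_uncoded])
print("coded:  ", [round(r, 3) for r in r_coded])
```

The uncoded interaction column is almost a rescaled copy of x1, while the coded columns are mutually orthogonal, which is why only the coded analysis separates the effects cleanly.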

Section 8.10 described methods for transforming nonlinear functions into linear
form so that they could be analyzed using simple linear regression methods. The same
transformation methods may be required to linearize a multiple regression problem,
especially when the model to be fitted is a first-principles model of some specific
form. If the first-principles model cannot be easily linearized with an appropriate
transform, then you will have to settle for an empirical model or get help from your

[Matrix plot: rows y, cx1, and cx2; columns cx1, cx2, and cx12.]

Figure 8.28 Matrix plot of response and coded predictors.



MTB > print c1-c7

Data Display

Row y x1 x2 x12 cx1 cx2 cx12


1 286 10 40 400 -1 -1 1
2 1 10 40 400 -1 -1 1
3 114 10 50 500 -1 1 -1
4 91 10 50 500 -1 1 -1
5 803 100 40 4000 1 -1 -1
6 749 100 40 4000 1 -1 -1
7 591 100 50 5000 1 1 1
8 598 100 50 5000 1 1 1

MTB > corr c5-c7;
SUBC> nopvalue.

Correlations: cx1, cx2, cx12

cx1 cx2
cx2 0.000
cx12 0.000 0.000

Cell Contents: Pearson correlation

MTB > regress c1 3 c5-c7

Regression Analysis: y versus cx1, cx2, cx12

The regression equation is


y = 404 + 281 cx1 - 55.6 cx2 - 35.1 cx12

Predictor Coef SE Coef T P


Constant 404.13 36.38 11.11 0.000
cx1 281.13 36.38 7.73 0.002
cx2 -55.63 36.38 -1.53 0.201
cx12 -35.13 36.38 -0.97 0.389

S = 102.907 R-Sq = 94.0% R-Sq(adj) = 89.5%

Analysis of Variance

Source DF SS MS F P
Regression 3 666873 222291 20.99 0.007
Residual Error 4 42359 10590
Total 7 709233

Source DF Seq SS
cx1 1 632250
cx2 1 24753
cx12 1 9870

Figure 8.29 Multiple regression analysis using coded predictors.

neighborhood statistician. If, however, the first-principles model can be linearized,
then the usual multivariable regression analysis can be used. If the resulting first-principles
model doesn’t fit the data, then the theory behind the model may be invalid
or the data may be corrupt or just inappropriate. If a model for the data is still required,
an empirical model can always be fitted.

A common type of first-principles model that can be easily linearized is any model
that involves only products and ratios of the predictors that are possibly raised to powers.
A simple logarithmic transform will convert such models into the linear form necessary
for analysis using multiple regression. The powers of the predictors don’t need to be
known; they will be reported as the regression coefficients for the different variables.
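
As a sketch of this linearization, assume a hypothetical first-principles model with a constant a and unknown exponents p and q (these symbols are illustrative and do not come from the text). Taking logarithms of both sides gives a form that is linear in the unknowns:

```latex
y = a\,\frac{x_1^{\,p}}{x_2^{\,q}}
\quad\Longrightarrow\quad
\ln y = \ln a + p \ln x_1 - q \ln x_2
```

Regressing ln y on ln x1 and ln x2 would then report estimates of p and -q as the regression coefficients and an estimate of ln a as the regression constant.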

8.17 GENERAL LINEAR MODELS


In Chapter 6 we saw that when an experiment contains two or more qualitative vari-
ables, the response can be analyzed as a function of those variables using multi-way
ANOVA. In the preceding section we saw that when an experiment contains two or
more quantitative variables, the response can be fitted as a function of those variables
using multiple linear regression. When an experiment contains a combination of quali-
tative and quantitative variables, however, the usual ANOVA and multiple regression
techniques will not work. Experiments that contain both qualitative and quantitative
variables are analyzed using a technique called a general linear model.
The trick to general linear models is to replace each qualitative variable that normally
would be analyzed by ANOVA with an array of quantitative variables that can be analyzed
by regression. Let’s reconsider the one-way classification problems that were analyzed using
one-way ANOVA in Chapter 5. The one-way ANOVA model consists of one mean for
each of the k treatment conditions where only the first k – 1 means are independent. Now,
suppose that we create k indicator variables, one for each of the k treatments, where the
first indicator variable takes on the value one for those runs that used the first treatment
and zero for all other runs, the second indicator variable is one for those runs that used the
second treatment and zero for all other runs, and so on. After indicator variables are cre-
ated for all of the treatments, the response can be analyzed as a function of any k – 1 of
them using multiple regression. Only k – 1 indicator variables can be included in the
model because the last one is always dependent on the others just as the kth treatment
mean is dependent on the other k – 1 treatment means in ANOVA. Although this method
of coding the treatments works, it introduces an undesirable bias in the regression model’s
constant. This problem is corrected by introducing a minor but important modification to
the treatment coding scheme that is demonstrated in Example 8.29. A consequence of this
scheme is that the kth treatment’s regression coefficient is often not reported, but it can be
determined from the negative sum of the other k – 1 coefficients. In practice, the software
to construct general linear models hides all of the necessary coding of qualitative variables
so that although it appears that they are being analyzed using ANOVA, they are actually
being analyzed by the equivalent regression methods.
When a general linear model includes an interaction between a qualitative and
quantitative variable, the model must also include the main effects of those variables to
remain hierarchical. The effect of the interaction in the model is that, in addition to the
usual slope term associated with the quantitative variable, there will be adjustments to
the slope for each level of the qualitative variable.

Example 8.28
Write out the general linear model for a quantitative variable x and a qualitative
variable A with three levels A = {1, 2, 3} including the interaction term.
Solution: The general linear model will have the form:

$$
\begin{aligned}
y = {} & b_0 + b_1 x + b_{21}(A = 1) + b_{22}(A = 2) + b_{23}(A = 3) \\
       & + b_{31}\,x\,(A = 1) + b_{32}\,x\,(A = 2) + b_{33}\,x\,(A = 3)
\end{aligned}
$$

where the bi are regression coefficients and terms like (A = 1) are Boolean expressions
that are equal to one when the expression is true, and zero otherwise. The nominal slope
of y versus x is indicated by b1 but the three terms of the form b3i describe corrections
to the slope for each A treatment. In practice, the coefficients b23 and b33 will not be
reported but can be determined from b23 = – (b21 + b22) and b33 = – (b31 + b32).
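
The model in Example 8.28 can be illustrated with a small Python sketch. It assumes the sum-to-zero coding implied by b23 = –(b21 + b22) and b33 = –(b31 + b32), in which the last level of A is coded –1 in every treatment column. The function name `glm_row` and the coefficient values are hypothetical and chosen only for illustration.

```python
def glm_row(x, level, levels=(1, 2, 3)):
    """Return the model row [1, x, A1, A2, x*A1, x*A2] for one run.

    The first k-1 levels get 0/1 indicator columns; the last level is
    coded -1 in all of them (sum-to-zero coding).
    """
    last = levels[-1]
    a = [-1 if level == last else int(level == lv) for lv in levels[:-1]]
    return [1, x] + a + [x * ai for ai in a]

def predict(coefs, x, level):
    """Evaluate the fitted model b0 + b1*x + ... for one (x, A) pair."""
    return sum(b * term for b, term in zip(coefs, glm_row(x, level)))

# Hypothetical reported coefficients: b0, b1, b21, b22, b31, b32
coefs = [10.0, 2.0, 1.0, -3.0, 0.5, 0.25]

# Coefficients for the unreported third level follow from the negative sums
b23 = -(coefs[2] + coefs[3])    # intercept adjustment for A = 3
b33 = -(coefs[4] + coefs[5])    # slope adjustment for A = 3

print(glm_row(2, 3))                       # coded row for x = 2, A = 3
print(predict(coefs, 2, 3))                # prediction from the coded row
print((coefs[0] + b23) + (coefs[1] + b33) * 2)   # same value from b23, b33
```

Both prediction routes agree because the –1 coding of the last level is exactly what makes its coefficients the negative sum of the others.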

Use MINITAB’s Stat> ANOVA> General Linear Model function to analyze exper-
iments with both quantitative and qualitative variables. In the Model window enter the
qualitative and quantitative variables and any other terms that you want to include in the
model such as interactions and quadratic terms. By default, MINITAB assumes that any
terms that appear in the Model window are qualitative variables to be analyzed by
ANOVA, so identify the quantitative variables to be analyzed by regression by entering
them in the Covariates window. MINITAB’s Stat> ANOVA> General Linear Model
function also includes a powerful collection of post-ANOVA comparison tools, excel-
lent residuals diagnostics, and many other advanced capabilities.

Example 8.29
The glass used to construct arc lamps is often doped with materials that attenuate
harmful UV radiation; however, these dopants usually decrease the life of the lamp. An
experiment was performed to determine which of four doping materials would have the
least impact on lamp life. The concentration of each dopant was adjusted to attenuate
the UV by the desired amount and then five randomly selected lamps from each of the
four treatments were operated to end of life. The experimental life data in hours are
shown below. Analyze these data using both one-way ANOVA and regression to demon-
strate the equivalence of the two analysis methods.

Obs A B C D
1 316 309 354 243
2 330 291 364 298
3 311 363 400 322
4 286 341 381 317
5 258 369 330 273

Solution: The data were entered into a MINITAB worksheet and analyzed using
Stat> ANOVA> One-Way. The results of the ANOVA are shown in Figure 8.30. To per-
form the same analysis by regression, an indicator variable was created for each treat-
ment using Calc> Make Indicator Variables. The resulting indicator variables are
shown in Figure 8.31 under the label Indicator Variables. These indicator variables
are not quite suitable for use in the regression analysis because they are biased, that is,
they don’t each have an average value of zero. To resolve this problem, the first three
columns of indicator variables were retained and modified according to the columns
under the label GLM Coding. The modification was to change the zero values for runs
of treatment D (or ID = 4) in columns A, B, and C to –1 values. This corrects the bias
problem, giving each treatment column an average value of zero, and preserves the
independence of the treatments.
The regression analysis of lamp life as a function of the correctly coded treatments
is shown at the bottom of Figure 8.31. This analysis exactly reproduces the results from
the one-way ANOVA, including the values of the standard error and the coefficients of
determination. The regression coefficient for treatment D was not automatically
reported in the MINITAB output but its value was calculated from the negative sum of

Data Display

Row A B C D
1 316 309 354 243
2 330 291 364 298
3 311 363 400 322
4 286 341 381 317
5 258 369 330 273

One-way ANOVA: A, B, C, D

Source DF SS MS F P
Factor 3 17679 5893 6.30 0.005
Error 16 14962 935
Total 19 32641

S = 30.58 R-Sq = 54.16% R-Sq(adj) = 45.57%

Individual 95% CIs For Mean Based on Pooled StDev
Level N Mean StDev -----+---------+---------+---------+----
A 5 300.20 28.45 (--------*-------)
B 5 334.60 33.86 (--------*-------)
C 5 365.80 26.57 (--------*-------)
D 5 290.60 32.84 (-------*-------)
-----+---------+---------+---------+----
280 315 350 385

Pooled StDev = 30.58

Figure 8.30 Arc lamp life analysis by ANOVA.



Data Display
Indicator Variables GLM Coding
------------------- -----------
Row Life ID A B C D A B C
1 316 1 1 0 0 0 1 0 0
2 330 1 1 0 0 0 1 0 0
3 311 1 1 0 0 0 1 0 0
4 286 1 1 0 0 0 1 0 0
5 258 1 1 0 0 0 1 0 0
6 309 2 0 1 0 0 0 1 0
7 291 2 0 1 0 0 0 1 0
8 363 2 0 1 0 0 0 1 0
9 341 2 0 1 0 0 0 1 0
10 369 2 0 1 0 0 0 1 0
11 354 3 0 0 1 0 0 0 1
12 364 3 0 0 1 0 0 0 1
13 400 3 0 0 1 0 0 0 1
14 381 3 0 0 1 0 0 0 1
15 330 3 0 0 1 0 0 0 1
16 243 4 0 0 0 1 -1 -1 -1
17 298 4 0 0 0 1 -1 -1 -1
18 322 4 0 0 0 1 -1 -1 -1
19 317 4 0 0 0 1 -1 -1 -1
20 273 4 0 0 0 1 -1 -1 -1

Regression Analysis: Life versus A, B, C

The regression equation is


Life = 323 - 22.6 A + 11.8 B + 43.0 C

Predictor Coef SE Coef T P


Constant 322.800 6.838 47.21 0.000
A -22.60 11.84 -1.91 0.074
B 11.80 11.84 1.00 0.334
C 43.00 11.84 3.63 0.002
D -32.20 -(-22.60 + 11.80 + 43.00) = -32.20

S = 30.5798 R-Sq = 54.2% R-Sq(adj) = 45.6%

Analysis of Variance

Source DF SS MS F P
Regression 3 17679.2 5893.1 6.30 0.005
Residual Error 16 14962.0 935.1
Total 19 32641.2

Figure 8.31 Arc lamp life analysis by regression.

the other coefficients and manually inserted into the figure. The standard error of the
D coefficient is the same as the others so the corresponding t and p values could also
be determined and added to the figure. For the first treatment, the regression model predicts
that the mean lamp life is $\overline{\text{Life}}_A$ = 322.8 – 22.6 = 300.2, which is in exact agreement with
the treatment mean reported in the ANOVA. There is also perfect agreement between the
two models for the other treatment means, and the regression model constant is exactly
equal to the grand mean of the data set.
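
The agreement between the two analyses can be checked with a short Python sketch. Rather than refitting the model, it assumes that for this balanced design the least-squares coefficients are the treatment means minus the grand mean, and then verifies that assumption by confirming that the residuals are orthogonal to every column of the GLM-coded design (the normal equations). Variable names are hypothetical.

```python
# Lamp life data (hours) from Example 8.29
life = {
    "A": [316, 330, 311, 286, 258],
    "B": [309, 291, 363, 341, 369],
    "C": [354, 364, 400, 381, 330],
    "D": [243, 298, 322, 317, 273],
}

means = {trt: sum(v) / len(v) for trt, v in life.items()}
grand = sum(means.values()) / len(means)    # balanced design: mean of means

# Candidate regression coefficients under GLM (sum-to-zero) coding
coef = {trt: means[trt] - grand for trt in ("A", "B", "C")}
coef_D = -sum(coef.values())                # unreported coefficient for D

# Build the coded design: columns are constant, A, B, C; D runs are coded -1
rows, y = [], []
for trt, values in life.items():
    code = [-1, -1, -1] if trt == "D" else [int(trt == t) for t in "ABC"]
    for v in values:
        rows.append([1] + code)
        y.append(v)

b = [grand, coef["A"], coef["B"], coef["C"]]
resid = [yi - sum(bj * xj for bj, xj in zip(b, r)) for r, yi in zip(rows, y)]

# Normal equations: each design column must be orthogonal to the residuals
normal_eq = [sum(r[j] * e for r, e in zip(rows, resid)) for j in range(4)]
sse = sum(e * e for e in resid)

print("constant:", round(grand, 2))
print("A, B, C, D:", [round(c, 2) for c in (*coef.values(), coef_D)])
print("error SS:", round(sse, 1))
```

The printed constant, coefficients, and error sum of squares can be compared directly with the entries in Figures 8.30 and 8.31.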

Example 8.30
An experiment was performed by Swagelok Company in Solon, Ohio, to compare
the torques required to tighten nuts on tubing fittings for three different lubricants:
LAU, MIS, and SWW. The purpose of the experiment was to determine if one of the
lubricants delivered lower tightening torque than the others where a 10 percent differ-
ence would be considered significant. The tightening operation is destructive to the nut
and fitting so six randomly selected nut and fitting combinations were treated with each
lubricant and torques were measured for each nut/fitting combination at 180, 270, 360,
and 450 degrees as the nuts were tightened through 450 degrees of rotation. The order
of the runs was completely randomized. The experimental data are shown in Figure
8.32. Analyze the torque as a function of lubricant and angle and determine if there are
differences between the lubricants.
Solution: A multi-vari chart of the torque data is shown in Figure 8.33. The chart
shows that there is an approximately linear relationship between torque and angle with
some slight upward curvature. Both LAU and MIS appear to have higher torque than
SWW. Unfortunately, the variation in the observations appears to increase in size with the
torque. Figure 8.34 shows another multi-vari chart after the torque has been transformed
by taking the natural logarithm. The figure shows some weak downward curvature in the
torque versus angle curve; however, the transform appears to have successfully recovered
the homoscedasticity of the observations about their treatment means.

Data Display

Row Unit Angle LAU MIS SWW


1 1 180 72.1 70.6 53.4
2 1 270 103.6 102.1 80.2
3 1 360 129.9 145.7 112.4
4 1 450 173.9 193.3 138.0
5 2 180 77.2 65.2 49.4
6 2 270 122.6 91.9 70.6
7 2 360 162.9 123.7 100.7
8 2 450 210.1 162.9 135.4
9 3 180 61.1 58.9 53.4
10 3 270 88.9 83.5 80.2
11 3 360 130.3 117.9 122.2
12 3 450 157.8 156.3 153.7
13 4 180 75.8 71.4 50.5
14 4 270 116.4 101.4 72.8
15 4 360 153.0 158.5 106.1
16 4 450 198.4 204.2 129.6
17 5 180 67.3 73.6 57.8
18 5 270 105.4 111.6 85.3
19 5 360 154.1 165.1 120.4
20 5 450 222.5 198.4 154.8
21 6 180 70.6 63.3 51.6
22 6 270 107.6 92.6 71.7
23 6 360 144.9 130.7 108.3
24 6 450 197.3 168.7 147.9

Figure 8.32 Tightening torque data.



[Multi-vari chart: Torque (50 to 225) versus Angle (180, 270, 360, 450) for lubricants LAU, MIS, and SWW.]

Figure 8.33 Multi-vari chart of torque versus angle by lubricant.

[Multi-vari chart: ln(Torque) (4.0 to 5.6) versus Angle (180, 270, 360, 450) for lubricants LAU, MIS, and SWW.]

Figure 8.34 Multi-vari chart of ln(Torque) versus angle by lubricant.

Because angle is a quantitative variable and lubricant is a qualitative variable it is
necessary to use the general linear model method of analysis. In addition to these two
variables, there may be an interaction between them, and a quadratic angle term must
be included in the model to account for the slight curvature observed in the multi-vari
chart. There is also a chance that the nut/fitting combinations used within treatments
could be different, so they should also be accounted for in the model.
The torque data from Figure 8.32 were stacked and columns indicating the lubri-
cant type and nut/fitting unit number for each lubricant were created. Part of the work-
sheet and output obtained using MINITAB’s Stat> ANOVA> General Linear Model
function are shown in Figure 8.35. The terms included in the model were Lubricant,
Unit(Lubricant), Angle, Lubricant*Angle, and Angle*Angle. The nested term Unit
(Lubricant) indicates that the nut/fitting units were unique within lubricant treatments.
Unit was declared to be a random variable because the nut/fitting combinations were
random samples assigned to the different treatments. Angle was declared to be a covari-
ate because it is a quantitative predictor.
The diagnostic plots in Figure 8.36 show that the residuals are normally distributed
and homoscedastic with respect to angle, lubricant, the fitted values, and run order, as
required by the analysis method. There is no evidence of lack of fit in the plot of resid-
uals versus angle so the quadratic model is probably appropriate.
The general linear model analysis shows that Lubricant ( p = 0.000), Unit ( p =
0.000), Angle ( p = 0.000), and Angle*Angle ( p = 0.000) are all highly significant. The
only term that is not statistically significant is the Lubricant*Angle interaction ( p =
0.847). The insignificance of this term means that the torque versus angle curves for all
of the lubricants have the same slope. To satisfy Occam we should run the analysis
again with the Lubricant*Angle interaction removed from the model because the
regression coefficients for the surviving lubricant terms will change; however, in this
case the interaction term is so weak that there is little difference between the two
models. The overall performance of the model is excellent, with adjusted coefficient of
determination $r^2_{adj} = 0.9898$.
The table of regression coefficients was simplified for clarity; the coefficients for
the Unit(Lubricant) terms were deleted from the table because they are not of interest
and the table was reformatted slightly. The nonreported coefficients for the Lubricant and
Lubricant*Angle interaction where Lubricant = SWW, and the corresponding t and p
values, were calculated manually and added to the table. Ignoring the insignificant
Lubricant*Angle interaction terms, the model can be written:

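$$
\begin{aligned}
\ln(\text{Torque}) = {} & 3.16 + 0.128\,(\text{Lubricant} = \text{LAU}) + 0.060\,(\text{Lubricant} = \text{MIS}) \\
 & - 0.188\,(\text{Lubricant} = \text{SWW}) + 0.0062\,\text{Angle} - 0.000004\,\text{Angle}^2
\end{aligned}
$$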

where expressions like (Lubricant = LAU) are Boolean expressions. The signs of the lubricant
coefficients indicate that the SWW lubricant delivers the lowest torque, which is consistent
with the multi-vari charts. Because the coefficient of SWW is negative and the other two
are positive, with zero falling between them, SWW is very different from the other two
lubricants, at least statistically if not practically. The predicted relative differences between
torques by lubricant are: $(e^{0.128-(-0.188)} - 1) \times 100\% = 37\%$ for LAU relative to SWW,
$(e^{0.060-(-0.188)} - 1) \times 100\% = 28\%$ for MIS relative to SWW, and
$(e^{0.128-0.060} - 1) \times 100\% = 7\%$ for LAU
relative to MIS. The first two differences are practically significant relative to the goals of
the experiment.
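
Because the model is fitted to ln(Torque), a difference between two lubricant coefficients corresponds to a ratio of torques. The following Python sketch works through that arithmetic using the coefficients from Figure 8.35; the function name `relative_difference` is hypothetical, and the relative difference is taken here as exp(difference) minus one.

```python
import math

# Lubricant coefficients on the ln(Torque) scale, from Figure 8.35
coef = {"LAU": 0.12826, "MIS": 0.06005, "SWW": -0.18831}

def relative_difference(a, b):
    """Percent by which the predicted torque for lubricant a exceeds that for b."""
    return (math.exp(coef[a] - coef[b]) - 1) * 100

for a, b in [("LAU", "SWW"), ("MIS", "SWW"), ("LAU", "MIS")]:
    print(f"{a} relative to {b}: {relative_difference(a, b):.0f}%")
```

Only the first two comparisons exceed the 10 percent threshold stated in the problem.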
Data Display

Row Lubricant Unit Angle Torque ln(Torque)
1 LAU 1 180 72.1 4.27805
2 LAU 1 270 103.6 4.64054
. . . . . .
. . . . . .

71 SWW 6 360 108.3 4.68491


72 SWW 6 450 147.9 4.99654

General Linear Model: ln(Torque) versus Lubricant, Unit

Factor Type Levels Values


Lubricant fixed 3 LAU, MIS, SWW
Unit(Lubricant) random 18 1, 2, 3, 4, 5, 6, 1, 2, 3, 4, 5, 6, 1, 2, 3, 4, 5, 6

Analysis of Variance for ln(Torque), using Adjusted SS for Tests


Source DF Seq SS Adj SS Adj MS F P
Lubricant 2 1.16643 0.12337 0.06169 13.48 0.000 x
Unit(Lubricant) 15 0.49245 0.49245 0.03283 19.39 0.000
Angle 1 10.02376 0.44326 0.44326 261.83 0.000
Lubricant*Angle 2 0.00057 0.00057 0.00028 0.17 0.847
Angle*Angle 1 0.07110 0.07110 0.07110 42.00 0.000
Error 50 0.08465 0.08465 0.00169
Total 71 11.83896

x Not an exact F-test.

S = 0.0411453 R-Sq = 99.29% R-Sq(adj) = 98.98%

Term Coef SE Coef T P


Constant 3.15664 0.05567 56.70 0.000
Lubricant
LAU 0.12826 0.02254 5.69 0.000
MIS 0.06005 0.02254 2.66 0.010
SWW -0.18831 0.02254 -8.35 0.000 -(0.12826+0.06005) = -0.18831
Angle 0.006152 0.000380 16.18 0.000
Angle*Lubricant
LAU -0.000024 0.000068 -0.36 0.722
MIS -0.000015 0.000068 -0.21 0.832
SWW 0.000039 0.000068 0.57 0.571 -(-0.000024-0.000015) = 0.000039
Angle*Angle -0.000004 0.000001 * *

Figure 8.35 General linear model for ln(Torque) (continued below).

Unusual Observations for ln(Torque)


Obs ln(Torque) Fit SE Fit Residual St Resid
1 4.27805 4.20724 0.02395 0.07082 2.12 R
17 4.20916 4.29864 0.02395 -0.08948 -2.67 R
20 5.40493 5.29323 0.02395 0.11169 3.34 R

R denotes an observation with a large standardized residual.

Expected Mean Squares, using Adjusted SS


Expected Mean Square
Source for Each Term
1 Lubricant (6) + 0.3704 (2) + Q[1]
2 Unit(Lubricant) (6) + 4.0000 (2)
3 Angle (6) + Q[3, 4]
4 Lubricant*Angle (6) + Q[4]
5 Angle*Angle (6) + Q[5]
6 Error (6)

Error Terms for Tests, using Adjusted SS


Source Error DF Error MS Synthesis of Error MS
1 Lubricant 31.57 0.00458 0.0926 (2) + 0.9074 (6)
2 Unit(Lubricant) 50.00 0.00169 (6)
3 Angle 50.00 0.00169 (6)
4 Lubricant*Angle 50.00 0.00169 (6)
5 Angle*Angle 50.00 0.00169 (6)

Variance Components, using Adjusted SS
Estimated Standard
Source Value Deviation
Unit(Lubricant) 0.00778 0.0882
Error 0.00169 0.0411
Total 0.00947 0.0973


[Six-panel residuals display: histogram of the residuals; normal probability plot of the residuals; residuals versus Lubricant; residuals versus Angle; residuals versus the fitted values; residuals versus the order of the data.]

Figure 8.36 Residuals diagnostic plots from the general linear model for ln(Torque).

Under other circumstances, the post-ANOVA comparisons of the three lubricants
might be carried out using Tukey’s, Duncan’s, or Hsu’s methods; however, MINITAB
correctly refuses to attempt these comparisons because of the presence of the random
nut/fitting units nested within each lubricant type. The problem is that the apparent dif-
ferences between lubricants could still be due to the way the 18 nut/fitting units were
assigned to the three lubricants and a different randomization or a different set of 18
units could deliver different results. Consequently, the conclusion that there are statis-
tically and practically significant differences between the lubricants is dependent on the
untestable assumption that the random assignment of units to lubricants was fair.

From the variance components analysis at the end of Figure 8.35 the standard devi-
ation of variability in ln (torque) associated with random variability in the nut/fitting
units is given by:

$$s_{units} = \sqrt{0.00778} = 0.0882$$

The corresponding relative variation in the torques is given by $e^{0.0882} - 1 = 0.092$ or
about 9.2 percent. In the same manner, the standard error of the model is:

$$s_\varepsilon = \sqrt{MS_\varepsilon} = \sqrt{0.00169} = 0.0411$$

and the corresponding relative error variation in the torques is given by $e^{0.0411} - 1 = 0.042$
or 4.2 percent. The combined variation due to both random sources has standard
deviation:

$$s_{total} = \sqrt{0.00778 + 0.00169} = 0.0973$$

and the corresponding combined relative error in the torques is $e^{0.0973} - 1 = 0.102$
or about 10.2 percent. This means that about 95 percent of the observed torques
should fall within about ±2 standard deviations or within about $\pm(e^{2 \times 0.0973} - 1) \times 100\% = \pm 21\%$
of the mean torque.
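
The conversion from variance components on the ln(Torque) scale to relative variation in torque can be sketched in Python as follows. The variance values come from Figure 8.35; the helper name `relative` is hypothetical, and the relative variation is taken as exp(s) minus one. Note that the interval is not exactly symmetric on the original torque scale, so the lower limit is printed as well.

```python
import math

# Variance components on the ln(Torque) scale, from Figure 8.35
var_units = 0.00778
var_error = 0.00169

s_units = math.sqrt(var_units)
s_error = math.sqrt(var_error)
s_total = math.sqrt(var_units + var_error)

def relative(s):
    """Relative change in torque for a shift of s on the log scale."""
    return math.exp(s) - 1

print(f"units: s = {s_units:.4f}, relative = {relative(s_units):.3f}")
print(f"error: s = {s_error:.4f}, relative = {relative(s_error):.3f}")
print(f"total: s = {s_total:.4f}, relative = {relative(s_total):.3f}")

# Approximate 95 percent limits (two standard deviations on the log scale)
upper = relative(2 * s_total)
lower = relative(-2 * s_total)
print(f"about 95% of torques within {lower:+.0%} to {upper:+.0%} of the mean")
```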

8.18 SAMPLE-SIZE CALCULATIONS FOR LINEAR REGRESSION

8.18.1 Sample Size to Determine the Slope with Specified Confidence
Suppose that we wish to determine the number of (x, y) observations that are required
to estimate the true slope β1 from a regression line fitted to experimental data to within
some specified range of values. The regression model will be of the form y = b0 + b1x,
which approximates the true relationship y = β0 + β1x. The confidence interval for the
slope parameter β1 will have the form:

P ( b1 − δ < β1 < b1 + δ ) = 1 − α (8.66)

where b1 is the slope determined from the experimental data, δ is the half-width of the
confidence interval for the unknown slope parameter β1, and α is the usual Type 1 error
rate. We know that the distribution of b1 as an estimate of β1 is Student’s t with degrees
of freedom equal to the number of observations minus two. Then we can set:

$$\delta = t_{\alpha/2}\,\sigma_{b_1} \tag{8.67}$$

where σb1 can also be estimated from the experimental data. But σb1 is calculated from:

$$\sigma_{b_1} = \frac{\sigma_\varepsilon}{\sqrt{SS_x}} \tag{8.68}$$

where σε is the standard deviation of the inherent noise in the process and SSx is:

$$SS_x = \sum_i \left(x_i - \bar{x}\right)^2 \tag{8.69}$$

The value of SSx is dependent on the number of observations in the data set and the dis-
tribution of the x values over the range of interest. In general, if n observations are made
at each of k evenly spaced levels of x between some specified bounds of x given by xmin
and xmax, then it can be shown (with difficulty) that:

$$SS_x = \frac{1}{12}\,(k - 1)\,k\,(k + 1)\,n\,(\Delta x)^2 \tag{8.70}$$

where

$$\Delta x = \frac{x_{max} - x_{min}}{k - 1} \tag{8.71}$$

is the spacing between the k evenly spaced levels of x. Equations 8.67, 8.68, and 8.70
can be solved to obtain the following condition for the sample size:
$$n \ge \frac{12}{k\,(k - 1)\,(k + 1)} \left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta\,\Delta x}\right)^2 \tag{8.72}$$

The inequality is necessary because n must be an integer and exact equality is unlikely.
This inequality is also transcendental: both sides depend on n, the number of observations
to be taken at each of the k evenly spaced levels of x. The dependence of the
right-hand side on n is hidden in the degrees of freedom of tα/2. It may be necessary to
attempt several values of n before the minimum value that meets the condition in
Equation 8.72 is found. A first estimate for the sample size can be obtained by approximating
tα/2 with zα/2. The quantities xmin, xmax, k, and δ are chosen by the experimenter
and σε must be estimated from prior data, data from a related process, or with an educated
guess.
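
The closed form for SSx in Equation 8.70 can be checked numerically against the defining sum in Equation 8.69. The following Python sketch does this for two arbitrary designs; the function names and the chosen values of k, n, xmin, and xmax are illustrative assumptions.

```python
import math

def ss_x_direct(k, n, x_min, x_max):
    """SS_x from its definition: n observations at each of k even levels."""
    dx = (x_max - x_min) / (k - 1)
    xs = [x_min + i * dx for i in range(k) for _ in range(n)]
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** 2 for x in xs)

def ss_x_formula(k, n, x_min, x_max):
    """SS_x from the closed form (k - 1) k (k + 1) n (dx)^2 / 12."""
    dx = (x_max - x_min) / (k - 1)
    return (k - 1) * k * (k + 1) * n * dx ** 2 / 12

for k, n in [(2, 5), (5, 3)]:
    direct = ss_x_direct(k, n, 0.0, 2000.0)
    closed = ss_x_formula(k, n, 0.0, 2000.0)
    print(k, n, direct, closed, math.isclose(direct, closed))
```

For a fixed total number of observations, the k = 2 design gives the largest SSx, which is why concentrating runs at the extremes gives the tightest estimate of the slope.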

All Observations at Two Extreme Levels (k = 2)


When all observations are to be concentrated at the two extreme levels of x given by xmin
and xmax , the sample-size condition with k = 2 becomes:

$$n \ge 2\left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta\,\Delta x}\right)^2 \tag{8.73}$$

where Δx = xmax – xmin. This case is very important because we often want to determine
the slope of y(x) with the fewest possible observations and the simplest experiment
design. Obviously, the sample size n decreases as Δ x increases so we want to pick xmin
and xmax to be as far apart as is practically possible.

Example 8.31
An electrochemical lead sensor outputs a small current that is proportional to
the lead concentration in a solution. How many observations must be taken to
determine the sensitivity of the sensor if it must operate for lead concentrations
from zero to 2000 ppm? Preliminary data indicate that the sensitivity is about 0.002
nA/ppm and the standard error is about σε = 0.05 nA. The sample size should be
sufficient so that the 95 percent confidence interval for the slope spans ±2 percent
of the slope.
Solution: If we can assume that the sensor is linear in the range of interest then we
only need to take observations at zero and 2000 ppm, which gives the separation
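between the levels Δx = 2000 ppm. We want the half-width of the resultant confidence
interval for the slope to be δ = 0.02 × 0.002 = 4 × 10⁻⁵ nA/ppm so the initial estimate
(t0.025 ≈ z0.025 = 1.96) for the sample size is:

$$
\begin{aligned}
n &\ge 2\left(\frac{z_{\alpha/2}\,\sigma_\varepsilon}{\Delta x\,\delta}\right)^2 \\
  &\ge 2\left(\frac{1.96 \times 0.05}{2000 \times 4 \times 10^{-5}}\right)^2 \\
  &\ge 3.0
\end{aligned}
$$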

This calculation suggests that we might get by with just n = 3 observations at 0 ppm and
another three observations at 2000 ppm; however, the greater than or equal to condi-
tion of Equation 8.73 is not rigorously satisfied. With 2n = 6 total observations there
will only be 2n – 2 = 4 error degrees of freedom. The corresponding t value is t0.025,4 =
2.78, which is very different from z0.025 = 1.96. If we substitute this t value into the right-
hand side of Equation 8.73 we get:

$$2\left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\Delta x\,\delta}\right)^2 = 2\left(\frac{2.78 \times 0.05}{2000 \times 4 \times 10^{-5}}\right)^2 = 6.04$$

Because n = 3 is not greater than or equal to 6.04 the n = 3 solution is not valid. The
following table indicates values of n and the corresponding values of:

$$2\left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\Delta x\,\delta}\right)^2$$

where the t value has dfε = 2n – 2 degrees of freedom:

n    dfε    t0.025    2(t0.025 σε / (Δx δ))²
3    4      2.776     6.0
4    6      2.447     4.7
5    8      2.306     4.2
6    10     2.228     3.9

The smallest value of n that meets the condition given by Equation 8.73 is n = 5 because
5 ≥ 4.2 is true. This value slightly exceeds the condition defined in Equation 8.73, so it
will deliver a slightly narrower confidence interval for β1 than what was initially specified. The
solution n = 5 indicates that we need to run n = 5 observations at 0 ppm and another
five observations at 2000 ppm to obtain the desired confidence interval for the slope.
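
The trial-and-error search for n can be automated. The following Python sketch iterates the condition of Equation 8.72 (which reduces to Equation 8.73 when k = 2) using only the standard library. The t quantile is obtained here by numerically integrating the t density and bisecting, which is an implementation choice made for this sketch and not the method used in the text; all function names are hypothetical.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))

def t_cdf(x, df, steps=2000):
    """P(T <= x) for x >= 0, using Simpson's rule on the interval [0, x]."""
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 + total * h / 3

def t_upper(tail_area, df):
    """t value with the given upper-tail area, found by bisection."""
    lo, hi = 0.0, 200.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if 1 - t_cdf(mid, df) > tail_area:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def min_n_per_level(sigma, delta, x_min, x_max, k=2, alpha=0.05):
    """Smallest n per level satisfying Equation 8.72; returns (n, t, rhs)."""
    dx = (x_max - x_min) / (k - 1)
    n = 1
    while True:
        df = k * n - 2
        if df >= 1:
            t = t_upper(alpha / 2, df)
            rhs = 12 / (k * (k - 1) * (k + 1)) * (t * sigma / (delta * dx)) ** 2
            if n >= rhs:
                return n, t, rhs
        n += 1

# Example 8.31: sigma = 0.05 nA, delta = 4e-5 nA/ppm, levels at 0 and 2000 ppm
n, t, rhs = min_n_per_level(sigma=0.05, delta=4e-5, x_min=0.0, x_max=2000.0)
print(f"n = {n} per level (t = {t:.3f}, right-hand side = {rhs:.1f})")
```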

Many Uniformly Distributed Observations (k → ∞)


When the values of x within the interval xmin to xmax cannot be controlled but the obser-
vations can be made randomly and uniformly distributed between these bounds, the
value of k becomes very large and Δ x becomes correspondingly small. (Uniformly dis-
tributed doesn’t necessarily mean that the observations are evenly spaced—only that all
values of x within the range of interest are equally likely.) The sample-size condition
given by Equation 8.72 can be manipulated and becomes:

$$
N \ge 12\left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta\,(x_{max} - x_{min})}\right)^2 \tag{8.74}
$$

where N is the total number of observations taken in the interval from xmin to xmax. It is
very important that the distribution of the observations in the interval from xmin to xmax
be checked carefully, especially with respect to the density of points near the ends of the
interval. These points are the largest contributors to information about the slope and
they must be well represented in the sample to obtain the desired confidence interval
width. If too many of the points fall near the middle of the interval then the value of δ
obtained will be larger than intended.

Example 8.32
Find the sample size for Example 8.31 if a total of N observations are to be uni-
formly distributed in the interval from zero to 2000 ppm.
Solution: If N is so large that t0.025 ≈ z0.025 then the sample size is given by:

$$
N \ge 12\left(\frac{1.96 \times 0.05}{4 \times 10^{-5} \times 2000}\right)^2 = 18.0
$$

This gives only dfε = 16 degrees of freedom for the t distribution, so the approximation
of t0.025 by z0.025 is only marginally satisfied. After further iteration with the t
distribution, the minimum sample size is determined to be N = 21.
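
The iteration mentioned in Example 8.32 can be sketched as follows. The function name uniform_bound and the small dictionary of t values are assumptions made for this illustration; the t values are standard two-sided 95 percent table entries that do not appear in the text.

```python
# Sketch of the iteration in Example 8.32 using Equation 8.74 (names are illustrative).
# Standard two-sided 95% t values keyed by degrees of freedom (assumed table values).
T_025 = {16: 2.120, 17: 2.110, 18: 2.101, 19: 2.093, 20: 2.086}


def uniform_bound(t, sigma, delta, x_range):
    """Right-hand side of Equation 8.74: 12 * (t * sigma / (delta * x_range))**2."""
    return 12.0 * (t * sigma / (delta * x_range)) ** 2


sigma = 0.05        # standard error, nA
delta = 4e-5        # half-width of the interval for the slope, nA/ppm
x_range = 2000.0    # x_max - x_min, ppm

# First guess with the normal approximation t ~ z = 1.96.
first_guess = uniform_bound(1.96, sigma, delta, x_range)

# Increase N until N is at least the bound computed with df = N - 2.
n_total = None
for N in range(18, 23):
    bound = uniform_bound(T_025[N - 2], sigma, delta, x_range)
    print(f"N = {N}, df = {N - 2}, bound = {bound:.2f}")
    if N >= bound:
        n_total = N
        break

print("first guess:", round(first_guess, 1))
print("minimum N:", n_total)
```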

8.18.2 Sample Size to Determine the Regression Constant with Specified Confidence
The same approach used to determine sample size for the regression slope can be
applied to estimate the y axis intercept β0. Our goal is to determine the minimum sample
size necessary to determine a confidence interval for the unknown constant β0 in
y = β0 + β1x. The confidence interval will have the form:

P ( b0 − δ < β0 < b0 + δ ) = 1 − α (8.75)

where b0 is the regression constant determined from the experimental data, δ is the
half-width of the confidence interval where:

$$
\delta = t_{\alpha/2}\,\sigma_{b_0} \tag{8.76}
$$

and α is the Type 1 error probability. The standard error of the estimate of the
regression constant is:

$$
\sigma_{b_0} = \sigma_\varepsilon \sqrt{\frac{1}{n} + \frac{\bar{x}^2}{SS_x}} \tag{8.77}
$$

When Equations 8.76, 8.77, and 8.70 are solved for n we obtain:

$$
n \ge \left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta}\right)^2
\left(1 + \frac{12}{k(k-1)(k+1)}\left(\frac{\bar{x}}{\Delta x}\right)^2\right) \tag{8.78}
$$

where n is the number of observations taken at each of k evenly spaced levels from xmin
to xmax and Δx is the spacing between the levels. This expression is transcendental
because the degrees of freedom for tα/2 depend on the sample size. Unfortunately this
expression doesn’t reduce to anything simpler.

Example 8.33
Determine the sample size required to estimate the true value of the regression
constant in Example 8.31 to within ±0.03 nA with 95 percent confidence. Use observations
at k = 3 evenly spaced levels of x.
Solution: We have σε = 0.05 nA, δ = 0.03 nA, and α = 0.05. With k = 3 evenly spaced
levels, we will have to take n observations at 0, 1000, and 2000 ppm lead
concentrations so Δx = 1000 ppm. Since the same number of observations will be taken at
each of these three levels, we will have x̄ = 1000 ppm. These values give the
sample-size condition:

$$
n \ge \left(\frac{t_{0.025}\,(0.05)}{0.03}\right)^2
\left(1 + \frac{12}{2 \times 3 \times 4}\left(\frac{1000}{1000}\right)^2\right)
\ge 4.17\,t_{0.025}^2
$$

where t0.025 has kn – 2 degrees of freedom. If n is going to be large, then as a first guess
t0.025 ≈ z0.025 = 1.96, so:

$$
n \ge 4.17\,(1.96)^2 \ge 16.0
$$

Since the total number of observations will be nk = 48, and t0.025 with dfε = 48 – 2 = 46
degrees of freedom is approximately equal to z0.025 = 1.96, we can accept this solution.
The calculation indicates that it will be necessary to take n = 16 observations at each of
zero, 1000, and 2000 ppm lead concentrations to determine the true value of the regression
constant to within ±0.03 nA with 95 percent confidence.

8.18.3 Sample Size to Determine the Predicted Value of the Response with Specified Confidence
The confidence interval for the true value of the response y determined from the
predicted value ŷ = b0 + b1x was given in Equation 8.26. If the true value of the response y
for some specified value of x must be determined to within some specified amount δ
such that:

$$
P\left(\hat{y} - \delta < \mu_{y(x)} < \hat{y} + \delta\right) = 1 - \alpha \tag{8.79}
$$

then by comparison of the two equations:

$$
\delta = t_{\alpha/2}\,\sigma_\varepsilon \sqrt{\frac{1}{n} + \frac{(x - \bar{x})^2}{SS_x}} \tag{8.80}
$$

If n observations are taken at each of k evenly spaced levels of x between xmin and xmax,
then SSx is given by Equations 8.70 and 8.71. These equations can be solved to determine
a condition for the sample size to deliver a confidence interval for μy(x) of the
desired width:

$$
n \ge \left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta}\right)^2
\left(1 + \frac{12}{k(k-1)(k+1)}\left(\frac{x - \bar{x}}{\Delta x}\right)^2\right) \tag{8.81}
$$

Equation 8.78, which gives the sample size for the confidence interval for the
regression constant, is just a special case of this condition with x = 0.

8.18.4 Sample Size to Detect a Slope Different from Zero


The sample-size calculations above apply to confidence intervals for the slope, the
regression constant, and predicted values. Those calculations assume that you know that
the slope, constant, or predicted value is different from zero and that you want to quan-
tify it within a specified range of values with some degree of confidence. A different
but frequently encountered sample-size problem for regression is to determine the
power necessary to detect a nonzero slope for a given sample size. The hypotheses to
be tested are H0: β1 = 0 versus HA: β1 ≠ 0. There are two equivalent solutions to this
problem available. The first solution is analogous to the relationship between the
sample-size solutions for simple confidence intervals and hypothesis tests for one mean where
tα/2 in the confidence interval solution is simply replaced by tα/2 + tβ in the hypothesis
testing solution. (See Equations 3.39 and 3.42.) The power P for the linear regression
to detect the nonzero slope β1 is given by:

$$
P = 1 - P\left(\frac{\beta_1}{\sigma_{b_1}} - t_{\alpha/2} < t < \frac{\beta_1}{\sigma_{b_1}} + t_{\alpha/2}\right)
  = 1 - P\left(\frac{\beta_1\sqrt{SS_x}}{\sigma_\varepsilon} - t_{\alpha/2} < t < \frac{\beta_1\sqrt{SS_x}}{\sigma_\varepsilon} + t_{\alpha/2}\right) \tag{8.82}
$$

where

$$
\sigma_{b_1} = \sigma_\varepsilon / \sqrt{SS_x} \tag{8.83}
$$

The second sample-size calculation to test H0: β1 = 0 versus HA: β1 ≠ 0 requires
the method of the power of F tests from Chapter 7 where the relevant F test is the
regression F test from the regression ANOVA table. This shouldn’t be a surprise since
the F statistic for the regression is equal to the square of the t statistic for the
regression slope b1.
The expected value of the regression’s ANOVA F statistic is given by:

$$
E(F) = \frac{E(SS_{regr})}{E(MS_\varepsilon)} + 1 = \lambda + 1 \tag{8.84}
$$

where λ is the F distribution noncentrality parameter when β1 ≠ 0. This equation can be
solved for the noncentrality parameter in terms of the distribution of the xs and the
expected value of the mean square error:

$$
\lambda = \frac{E(SS_{regr})}{E(MS_\varepsilon)} = \frac{\beta_1^2\,SS_x}{\sigma_\varepsilon^2} \tag{8.85}
$$

where β1 is the nonzero value of the slope that we wish to detect. The power P = 1 – β
to reject H0: β1 = 0 is given by the condition:

$$
F_\alpha = F_{P,\lambda} \tag{8.86}
$$

where the central and noncentral F distributions both have one numerator and dfε
denominator degrees of freedom. That is, the power is the area under the noncentral F
distribution to the right of the critical value Fα of the central F distribution. This
relationship is used to determine the power for a specified sample size. It is
transcendental in the sample size, so iterations must be used to determine the correct
sample size to achieve a desired value of the power.

Example 8.34
An experiment is to be performed to determine whether the ramp-up time of a
propane-fired heat-treatment furnace depends on the intake/ambient air temperature.
The experiment will use three levels of intake air temperature: 15°C, 20°C, and 25°C. A
single load of carbon steel will be placed in the furnace for the experiment. The
response will be the time required to bring the load to the usual heat-treatment
temperature of 930°C. What is the power to detect an effect of 10 minutes per degree
centigrade if three trials are performed at each level of ambient temperature? Historical
data suggest that the variation in the amount of time required to heat a load of this size
is σε = 30 minutes.
Solution: The slope of the ramp-up time versus ambient temperature relationship
that we are trying to detect is β1 = 10 min/°C. There are three equally represented
levels of ambient temperature, so SSx is given by:

$$
SS_x = \sum (x_i - \bar{x})^2
     = \frac{1}{12}(k-1)(k)(k+1)\,n\,(\Delta x)^2
     = \frac{1}{12}(2)(3)(4)(3)(5)^2
     = 150
$$

The noncentrality parameter of the F distribution when β1 = 10 is:

$$
\lambda = \frac{\beta_1^2\,SS_x}{\sigma_\varepsilon^2} = \frac{10^2\,(150)}{30^2} = 16.7
$$

The regression’s ANOVA F test will have dfregr = 1 numerator degree of freedom and
dfε = 9 – 2 = 7 denominator degrees of freedom. The power for the test is given by:

$$
\left(F_{0.05} = 5.59\right) = F_{P,16.7} = F_{0.94,16.7}
$$

so the power is P = 0.94. This means that the experiment has a 94 percent chance of
delivering a statistically significant (p < 0.05) regression slope if the true slope is
β1 = 10 min/°C. The power will be higher if the slope is larger and lower if the slope is
smaller.
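
The arithmetic behind SSx and the noncentrality parameter in Example 8.34 can be checked with a short Python sketch. The function names are hypothetical; ss_x_direct is included only as a cross-check of the closed-form expression for evenly spaced levels.

```python
# Sketch of the SSx and noncentrality calculations in Example 8.34 (names are illustrative).

def ss_x_even(k, n, dx):
    """SSx for n observations at each of k evenly spaced levels with spacing dx."""
    return (k - 1) * k * (k + 1) * n * dx ** 2 / 12.0


def ss_x_direct(levels, n):
    """SSx computed directly as the sum of squared deviations from the mean level."""
    xbar = sum(levels) / len(levels)
    return n * sum((x - xbar) ** 2 for x in levels)


levels = [15.0, 20.0, 25.0]   # intake air temperatures, degrees C
n = 3                         # trials at each level
beta1 = 10.0                  # slope to detect, minutes per degree C
sigma = 30.0                  # standard error, minutes

ss_x = ss_x_even(k=len(levels), n=n, dx=5.0)
lam = beta1 ** 2 * ss_x / sigma ** 2      # Equation 8.85

print("SSx (closed form):", ss_x)
print("SSx (direct):", ss_x_direct(levels, n))
print("lambda:", round(lam, 1))
```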

Example 8.35
Use the method of Equation 8.82 to confirm the answer to Example 8.34.
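Solution: There are dfε = 7 degrees of freedom for the error so t0.025,7 = 2.365 and:

$$
\begin{aligned}
P &= 1 - P\left(\frac{\beta_1\sqrt{SS_x}}{\sigma_\varepsilon} - t_{\alpha/2} < t < \frac{\beta_1\sqrt{SS_x}}{\sigma_\varepsilon} + t_{\alpha/2}\right) \\
  &= 1 - P\left(\frac{10\sqrt{150}}{30} - 2.365 < t < \frac{10\sqrt{150}}{30} + 2.365\right) \\
  &= 1 - P\left(1.72 < t < 6.447\right) \\
  &= 0.94
\end{aligned}
$$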

The value of the power, P = 0.94, is in agreement with the power found using the method
of the ANOVA F test.
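
The probability in Example 8.35 can be reproduced numerically without statistical software. The sketch below is an illustration only: the helper functions t_pdf and t_prob are hypothetical names, and the area under the central t distribution is obtained by Simpson's rule rather than by any method described in the text.

```python
import math

# Sketch of the power calculation in Example 8.35 (names are illustrative).

def t_pdf(t, df):
    """Probability density of the central Student t distribution."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + t * t / df) ** (-(df + 1) / 2)


def t_prob(a, b, df, steps=2000):
    """P(a < t < b) by composite Simpson's rule; steps must be even."""
    h = (b - a) / steps
    total = t_pdf(a, df) + t_pdf(b, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(a + i * h, df)
    return total * h / 3


beta1 = 10.0      # slope to detect, minutes per degree C
sigma = 30.0      # standard error, minutes
ss_x = 150.0      # from Example 8.34
df = 7            # error degrees of freedom
t_crit = 2.365    # t(0.025, 7)

shift = beta1 * math.sqrt(ss_x) / sigma                 # about 4.08
power = 1 - t_prob(shift - t_crit, shift + t_crit, df)  # Equation 8.82

print("power:", round(power, 3))   # approximately 0.935, which the example reports as 0.94
```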

8.19 DESIGN CONSIDERATIONS FOR LINEAR REGRESSION
The 11-step general procedure for experimentation introduced in Chapter 4 is appro-
priate for situations that will be analyzed by linear regression methods. Following are
some special considerations for these situations:
• Confirm that the independent variable (x) can be determined exactly. If this
condition is not met it will be necessary to use the errors in variables method
of Section 8.13.
• Select minimum and maximum values of x that are as far apart as practically
possible. This will improve the estimates of the regression coefficients and
decrease the number of observations required for the experiment.
• Concentrate a substantial fraction of the observations at or near the minimum
and maximum values of x. These observations improve the estimates of the
regression coefficients more than observations that fall near the middle of the
range of x.
• Use at least three levels of x to permit a lack of fit test. It’s best to evenly space
the levels. If a transformation of x is anticipated to linearize the model, the
transformed x values should be evenly spaced.
• Take at least two replicate readings at each level of x so that a linear lack of fit
test can be performed.
• Do the runs in random order. Do not perform the runs by systematically
increasing or decreasing x.
• If possible, block replicated observations. Include the blocks as a qualitative
variable in the model to reduce the effects of extraneous variation and to help
identify possible causes for it. Use the general linear model method to do
regression on the quantitative variable and ANOVA on the blocking variable.
• Perform a sample-size calculation to determine the necessary number of
observations rather than collecting an arbitrary number. Use historical data, data
from a related process, or a preliminary experiment to estimate the standard error
of the model required for the calculation.
9
Two-Level Factorial
Experiments

9.1 INTRODUCTION
Chapter 6 introduced the general factorial designs where two or more variables could
each have two or more levels and all possible combinations of variables were con-
structed. These designs were designated a × b × c × . . . designs where each number indi-
cated the number of levels of a variable and all the runs were performed in random
order. This chapter introduces a special subset of these factorial designs—those that
have only two levels of each variable. These experiments are designated 2 × 2 × . . . × 2
or 2^k experiments where k is the number of variables. 2^k is also the number of unique
cells or runs in each replicate of the design. When all of the experimental runs are
performed in random order, the 2^k experiments have the ability to characterize all of the
variables under consideration, and, as with the other factorial designs, they can resolve
two-factor and higher-order interactions.
The 2^k experiments are one of the most important and fundamental families of
experiments in DOE. In addition to being some of the most commonly run experiments,
they also provide the foundation for the more complex designs considered in Chapters
10 and 11, so study these experiments very carefully.

9.2 THE 2^1 FACTORIAL EXPERIMENT
The 2^1 factorial experiment is the simplest of the two-level experiments. Some people
would call it trivial, but despite its simplicity the 2^1 factorial experiment still
demonstrates many of the important aspects of the analysis common to all two-level
factorial experiments.
The 2^1 factorial experiment involves only one variable at two levels. The variable
may be qualitative or quantitative; in either case the analysis is the same. This is one of
the important characteristics of two-level factorial experiments. When two or more
variables are being studied, they can be all qualitative, all quantitative, or a mix of the
two types. This flexibility is not preserved when an experiment has one or more qualitative
variables at three or more levels.
The two levels of the variable under study are referenced in terms of the coded
levels that were introduced back in Chapter 8. These levels are designated –1 and +1 or just
– and + and are often called the low and high levels, respectively. The actual physical
levels used for the low and high settings are entirely up to the experimenter. The coded
values provide a universal way of communicating information about variables. The use
of codes also has mathematical benefits that greatly simplify many calculations. These
codes are used so frequently that, with experience, their use becomes second nature.
You will soon find yourself immediately thinking in terms of appropriate low and high
levels for each variable in a 2^k experiment.
In tabular form the 2^1 factorial experiment design may be written:

Run x1
1 −1
2 +1

The subscript 1 on x indicates that x1 is the first independent variable in our experiment
in anticipation of more complex experiments with two or more variables. The other
variables will be indicated by x2, x3, and so on. (MINITAB and some books prefer the
use of A for x1, B for x2, and so on. This choice is certainly valid, but I like the simplic-
ity of the x1 notation. Get used to both choices of notation.)
Replicates are used to increase the total number of observations in the 2^1 experiment.
When the experiment is replicated, the same number of runs should be used for
each level of x1. For example, this experiment might be replicated four times giving four
observations at the x1 = –1 level and four observations at the x1 = +1 level for a total of
eight experimental runs. If the same number of runs is not used at each level of x1, the
experiment becomes unbalanced and its validity could be compromised. This issue of
balance in the number of runs performed at each level of a variable is a key concept
of DOE, and you should always try to preserve this balance if possible. When the numbers
of runs at the levels of a variable are not equal, either by accident or by design, special
considerations must be made in the analysis. These issues will be addressed later in
this chapter.
There are several ways to analyze the response in a 2^1 experiment. Although we
could use ANOVA to test for a difference between the response means at the two levels
of x1, the regression methods of Chapter 8 provide a more concise model that can be
easily expanded for more complex designs. Chapter 8 suggests the use of the model:

y = b0 + b1 x1 (9.1)

where y is the measured response and the regression methods of Chapter 8 are used to
determine the statistics b0 and b1. Here the values of x1 are limited to the coded values
–1 and +1 and, if x1 is quantitative, all of the fractional values in between. Since the
actual values of the measurement variable x1 are not used in Equation 9.1, you are
responsible for switching back and forth between the real and coded levels of x1.
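
Switching between real and coded levels is a simple linear rescaling. The sketch below assumes the usual coding in which the low setting maps to –1 and the high setting maps to +1; the function names and the example settings of 15 and 25 are hypothetical and chosen only for illustration.

```python
# Sketch of converting between real and coded levels (names and settings are illustrative).

def to_coded(x, low, high):
    """Map a real level x to the coded scale where low -> -1 and high -> +1."""
    center = (high + low) / 2.0
    half_range = (high - low) / 2.0
    return (x - center) / half_range


def to_actual(coded, low, high):
    """Map a coded level back to the real measurement scale."""
    center = (high + low) / 2.0
    half_range = (high - low) / 2.0
    return center + coded * half_range


low, high = 15.0, 25.0   # hypothetical low and high settings of x1

print(to_coded(15.0, low, high))    # low setting
print(to_coded(25.0, low, high))    # high setting
print(to_coded(20.0, low, high))    # midpoint of the range
print(to_actual(0.5, low, high))    # a fractional coded level
```

For a quantitative variable the same functions handle fractional coded values, so a fitted model in coded units can be evaluated at any real setting between the low and high levels.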
Some discussion of the b0 coefficient is appropriate because it has special meaning
in the interpretation of all of the 2^k experiments. Recall from Chapter 8 that the point
(x̄, ȳ) must fall on the regression line. Since the 2^1 experiment contains the same number
of runs at the low and high levels of x1, the mean level of x1 in the experiment must be
x̄1 = 0. The corresponding response under this condition must be ȳ, or in terms of the
dot notation introduced earlier ȳ••, where the implied summations are over both levels of
x1 and all replicates. It can be seen by comparing these results that:

$$
\bar{y} = b_0 + b_1\bar{x}_1 = b_0 + b_1(0) = b_0
$$

This is an important observation: the b0 coefficient in the regression analysis of the
2^1 experiment, and of any balanced 2^k experiment for that matter, corresponds to the
grand mean of the response y. In general, the b0 coefficient will represent the grand
mean of the response, and the effects due to x1, x2, . . . can be interpreted as deviations
or perturbations from the grand mean. The b0 term provides a sort of anchor for the
response about which all the other terms exhibit their effects.
The effect of x1 on the response is best seen from a response plot of y versus x1 as
shown in Figure 9.1. Two points are plotted in the figure, one at (x1, y) = (–1, ȳ–•) and
the other at (x1, y) = (+1, ȳ+•), where ȳ–• is the mean of all responses at x1 = –1:

$$
\bar{y}_{-\bullet} = \frac{1}{n}\sum_{i=1}^{n} y_i(x_1 = -1) \tag{9.2}
$$

where n is the number of replicates and ȳ+• is the mean of all responses at x1 = +1:

$$
\bar{y}_{+\bullet} = \frac{1}{n}\sum_{i=1}^{n} y_i(x_1 = +1) \tag{9.3}
$$

From the plot, the slope of the line through these two points is:

$$
b_1 = \frac{\Delta y}{\Delta x_1}
    = \frac{\bar{y}_{+\bullet} - \bar{y}_{-\bullet}}{(+1) - (-1)}
    = \frac{1}{2}\left(\bar{y}_{+\bullet} - \bar{y}_{-\bullet}\right) \tag{9.4}
$$

Figure 9.1 Response y versus x1.

Now that b0 and b1 are both uniquely determined from the data, Equation 9.1 can be
used to describe how y depends on x1.
If there are n replicates of the 2^1 experiment then there will be 2n total runs and
dftotal = 2n – 1 total degrees of freedom, because one degree of freedom is consumed, as
always, by the grand mean. In this experiment, the grand mean corresponds to the b0
coefficient. Since the b1 coefficient also consumes one degree of freedom there will be
dfε = 2n – 2 degrees of freedom for the error estimate, just as in Chapter 8.

Example 9.1
Determine the regression equation for the following data set. Use the methods
described in Section 9.2.
x1 –1 –1 +1 +1
y 47 51 21 17

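Solution: The b0 coefficient from the regression model is:

$$
b_0 = \bar{y}_{\bullet\bullet} = \frac{1}{4}(47 + 51 + 21 + 17) = 34.0
$$

The b1 coefficient is given by:

$$
b_1 = \frac{1}{2}\left(\bar{y}_{+\bullet} - \bar{y}_{-\bullet}\right)
    = \frac{1}{2}\left[\left(\frac{21 + 17}{2}\right) - \left(\frac{47 + 51}{2}\right)\right]
    = -15.00
$$

This means that the regression equation is given by:

$$
y = b_0 + b_1 x_1 = 34 - 15x_1
$$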

The data and the regression line are plotted in Figure 9.2.
Figure 9.2 Plot of example data and regression fit. (Plot of y versus x1 showing the
fitted line y = 34 – 15x1, which passes through the point (0, 34).)

Example 9.2
Use MINITAB’s regress command to confirm the regression equation found in
Example 9.1. Also determine the standard error and r^2 value.
Solution: The MINITAB output is shown in Figure 9.3. The regression equation is
the same as was found in Example 9.1. The model standard error is sε = 2.828 and the
coefficient of determination is r^2 = 0.983.
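
The results of Examples 9.1 and 9.2 can also be checked by hand or with a short script. The Python sketch below is an illustration, not MINITAB output; the variable names are hypothetical and the calculations follow the formulas of Section 9.2 and Chapter 8.

```python
import math

# Sketch reproducing Examples 9.1 and 9.2 (names are illustrative).
x1 = [-1, -1, +1, +1]
y = [47, 51, 21, 17]
n_obs = len(y)

# b0 is the grand mean; b1 is half the difference between the level means.
b0 = sum(y) / n_obs
y_minus = [yi for xi, yi in zip(x1, y) if xi == -1]
y_plus = [yi for xi, yi in zip(x1, y) if xi == +1]
b1 = (sum(y_plus) / len(y_plus) - sum(y_minus) / len(y_minus)) / 2

# Residual and total sums of squares.
fits = [b0 + b1 * xi for xi in x1]
ss_error = sum((yi - fi) ** 2 for yi, fi in zip(y, fits))
ss_total = sum((yi - b0) ** 2 for yi in y)

s_e = math.sqrt(ss_error / (n_obs - 2))   # two coefficients were estimated
r_sq = 1 - ss_error / ss_total

print(f"y = {b0:.1f} + ({b1:.1f}) x1")
print("standard error:", round(s_e, 3))
print("r squared:", round(r_sq, 3))
```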

9.3 THE 2^2 FACTORIAL EXPERIMENT
The 2 × 2 factorial experiment is one of the simplest and yet most profoundly important
experiments. Despite its simplicity, it is one of the most commonly run experiments of all
types. Its analysis embodies all of the key concepts, mathematics, and interpretation issues
of the more complicated designs that are founded on it, so study this design carefully.
The 2 × 2 or 2^2 factorial experiment has two variables, x1 and x2, each at two levels.
As before, the coded levels are designated –1 and +1 or just – and +, and simple
transformations are used to convert back and forth between these coded levels and the real
measurable levels of a variable. The four unique runs of the 2^2 experiment design may
be expressed in tabular form as:

Run x1 x2
1 − −
2 − +
3 + −
4 + +
MTB > print c1-c2

Data Display

Row  x1   y
  1  -1  47
  2  -1  51
  3   1  21
  4   1  17

MTB > regress c2 1 c1

Regression Analysis: y versus x1

The regression equation is
y = 34.0 - 15.0 x1

Predictor     Coef  SE Coef       T      P
Constant    34.000    1.414   24.04  0.002
x1         -15.000    1.414  -10.61  0.009

S = 2.82843   R-Sq = 98.3%   R-Sq(adj) = 97.4%

Analysis of Variance

Source          DF      SS      MS       F      P
Regression       1  900.00  900.00  112.50  0.009
Residual Error   2   16.00    8.00
Total            3  916.00

Figure 9.3 MINITAB output for Example 9.2.

Figure 9.4 2 × 2 factorial design.

or in graphical form as in Figure 9.4. The experiment is performed by selecting a random
order of experimentation for the runs, configuring the system to the required (x1, x2)
levels, and measuring the corresponding responses. If replication is desired, the order of
experimentation can be randomized completely by randomizing over all possible runs,
or randomization can be limited by blocking on replicates.
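
The two randomization schemes can be sketched as follows. This is an illustration only: the variable names, the choice of two replicates, and the fixed seed are assumptions made so that the sketch is repeatable.

```python
import random

# Sketch of run-order randomization for a replicated 2 x 2 design (names are illustrative).
runs = [(-1, -1), (-1, +1), (+1, -1), (+1, +1)]   # (x1, x2) in coded levels
n_replicates = 2
rng = random.Random(1)   # arbitrary fixed seed so the sketch is repeatable

# Complete randomization: shuffle all runs of all replicates together.
complete = runs * n_replicates
rng.shuffle(complete)

# Blocking on replicates: shuffle the four runs separately within each replicate.
blocked = []
for rep in range(1, n_replicates + 1):
    block = runs[:]
    rng.shuffle(block)
    blocked.extend((rep, x1, x2) for x1, x2 in block)

print("completely randomized:", complete)
print("blocked on replicates:", blocked)
```

In the blocked order every replicate contains each of the four runs exactly once, so the replicate number can later be used as a blocking variable in the analysis.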
The analysis of the 2^2 factorial experiment is carried out by evaluating the effects
of variables x1 and x2. The effects can be expressed in the form of a linear regression
model with two variables:

$$
y(x_1, x_2) = b_0 + b_1 x_1 + b_2 x_2 \tag{9.5}
$$

where b0, b1, and b2 are regression coefficients to be determined from the data. The
effect of x1 is determined by grouping the responses according to their x1 levels as shown
in Figure 9.5. The coefficient b1 will then be:

$$
b_1 = \frac{\bar{y}_{+\bullet\bullet} - \bar{y}_{-\bullet\bullet}}{2} \tag{9.6}
$$

where

$$
\bar{y}_{+\bullet\bullet} = \frac{1}{2n}\sum_{i=1}^{n}\sum_{x_2} y(+1, x_2) \tag{9.7}
$$

$$
\bar{y}_{-\bullet\bullet} = \frac{1}{2n}\sum_{i=1}^{n}\sum_{x_2} y(-1, x_2) \tag{9.8}
$$

and n is the number of replicates. The first dot indicates summation over all x2 levels
and the second dot indicates summation over replicates. The choice of –1 and +1 for the
levels of x1 clearly makes for easy calculation and interpretation of the b1 term. The
numerator of Equation 9.6 is the change in y:

$$
\Delta y = \bar{y}_{+\bullet\bullet} - \bar{y}_{-\bullet\bullet} \tag{9.9}
$$

observed over a change of x1 from –1 to +1:

$$
\Delta x_1 = +1 - (-1) = 2 \tag{9.10}
$$

Figure 9.5 Determination of the x1 effect.

This interpretation is shown in the plot of y versus x1 in Figure 9.6.
The coefficient b2 can be determined in a similar manner. This is done by grouping
the responses by their x2 classification as shown in Figure 9.7. The b2 coefficient is
calculated in the same way as was the b1 coefficient:

$$
b_2 = \frac{\bar{y}_{\bullet+\bullet} - \bar{y}_{\bullet-\bullet}}{2} \tag{9.11}
$$

where

$$
\bar{y}_{\bullet+\bullet} = \frac{1}{2n}\sum_{i=1}^{n}\sum_{x_1} y(x_1, +1) \tag{9.12}
$$

$$
\bar{y}_{\bullet-\bullet} = \frac{1}{2n}\sum_{i=1}^{n}\sum_{x_1} y(x_1, -1) \tag{9.13}
$$

Figure 9.6 Response y versus variable x1.

Figure 9.7 Determination of the x2 effect.

It is possible and highly desirable to add one more term to the model given by
Equation 9.5. In fact, this term and other terms like it are one of the major strengths of
the factorial designs. The term to be added is the two-factor interaction term that mea-
sures the strength of the interaction between x1 and x2. We write the interaction as x12
and its regression coefficient as b12. The new model becomes:

y( x1 , x 2 ) = b0 + b1 x1 + b2 x 2 + b12 x12 (9.14)

In terms of the four cells of the 2 × 2 experiment, the interaction term is determined by
pairing the observations as shown in Figure 9.8. The levels of x12 are determined by tak-
ing the product of x1 and x2:

x12 = x1 x 2 (9.15)

The coefficient b12 is determined in a manner similar to the way that b1 and b2 were
determined:

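$$
b_{12} = \frac{1}{2}\left[\frac{\bar{y}_{++\bullet} + \bar{y}_{--\bullet}}{2} - \frac{\bar{y}_{+-\bullet} + \bar{y}_{-+\bullet}}{2}\right] \tag{9.16}
$$

where

$$
\bar{y}_{++\bullet} = \frac{1}{n}\sum_{i=1}^{n} y(+1, +1) \tag{9.17}
$$

and so on for ȳ––•, ȳ+–•, and ȳ–+•. The first bracketed term is the mean response at
x12 = +1 and the second is the mean response at x12 = –1, so b12 is half the difference
between them, just as for b1 and b2.
In practice, the coefficients b0, b1, b2, and b12 are determined using the linear
regression function of a suitable computer program. In MINITAB for example, if c1 contains
the response y and the levels of x1 and x2 are included in columns c2 and c3, then the
interaction levels for x12 can be determined with:

Figure 9.8 Determination of the x12 effect. (The diagonal cells with x12 = +1 are paired
together, as are the diagonal cells with x12 = –1.)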

mtb > let c4= c2*c3

or from the Calc> Calculator menu. This performs the simple operation x12 = x1 × x2
and puts the result in column c4. The regression is carried out with MINITAB’s regress
command:

mtb > regress c1 3 c2 c3 c4

or from the Stat> Regression> Regression menu. The “3” after c1 indicates to MINITAB
that three terms are to be included in the regression model.

Example 9.3
Use the technique presented in Figures 9.5, 9.7, and 9.8 to construct a model for
the following data set.
x1 \ x2      −1        +1
  −1       61, 63    41, 35
  +1       76, 72    68, 64

Solution: The model we need has the form:

y( x1 , x 2 ) = b0 + b1 x1 + b2 x 2 + b12 x12

Since the experiment has the same number of observations at each level of each vari-
able, the grand mean of the data set corresponds to the b0 coefficient so:

$$
b_0 = \bar{y}_{\bullet\bullet\bullet} = \frac{1}{8}(61 + 63 + 76 + 72 + 41 + 35 + 68 + 64) = 60
$$

where the data are yijk and the i subscript indicates the level of x1, j indicates the level
of x2, and k indicates the replicate. We can find the b1 coefficient by taking the data in
rows according to the levels of x1. When x1 = –1 we have a mean response of:

$$
\bar{y}_{-\bullet\bullet} = \frac{1}{4}(61 + 63 + 41 + 35) = 50
$$

When x1 = +1 we have a mean response of:

$$
\bar{y}_{+\bullet\bullet} = \frac{1}{4}(76 + 72 + 68 + 64) = 70
$$

Figure 9.9 Response y versus variable x1. (The fitted line has slope b1 = 10.)

The data and the response means are plotted in Figure 9.9. As required, the y axis
intercept (at x1 = 0) is 60. The slope of the line in the figure, which is the b1 coefficient,
is given by:

$$
b_1 = \frac{\bar{y}_{+\bullet\bullet} - \bar{y}_{-\bullet\bullet}}{\Delta x_1} = \frac{70 - 50}{(+1) - (-1)} = 10
$$

We can find the b2 coefficient in a similar manner. If we take the data by columns
according to the levels of x2 we have:

$$
\bar{y}_{\bullet-\bullet} = \frac{1}{4}(61 + 63 + 76 + 72) = 68
$$

$$
\bar{y}_{\bullet+\bullet} = \frac{1}{4}(41 + 35 + 68 + 64) = 52
$$

The data and the response means are plotted in Figure 9.10. The b2 coefficient is:

$$
b_2 = \frac{\bar{y}_{\bullet+\bullet} - \bar{y}_{\bullet-\bullet}}{\Delta x_2} = \frac{52 - 68}{(+1) - (-1)} = -8
$$

Figure 9.10 Response y versus variable x2. (The fitted line has slope b2 = –8.)

The coefficient for the interaction x12 is found by taking the data from the table
along the diagonals as shown in Figure 9.8. The data on the falling diagonal
correspond to the x12 = –1 level. The mean response along this diagonal is:

$$
\bar{y}(x_{12} = -1) = \frac{1}{4}(76 + 72 + 41 + 35) = 56
$$

The data on the rising diagonal corresponding to the x12 = +1 level give a mean
response of:

$$
\bar{y}(x_{12} = +1) = \frac{1}{4}(61 + 63 + 68 + 64) = 64
$$

The data and the response means are plotted in Figure 9.11. The b12 coefficient is:

$$
b_{12} = \frac{\bar{y}(x_{12} = +1) - \bar{y}(x_{12} = -1)}{\Delta x_{12}} = \frac{64 - 56}{(+1) - (-1)} = +4
$$
If we put all of this information together, the mathematical model for the response is:

y = 60 + 10 x1 − 8 x 2 + 4 x12 (9.18)
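
The coefficients of Example 9.3 can be verified with a short Python sketch. The names are hypothetical. The sketch uses the fact that, for a balanced design with ±1 coding, half the difference between the two level means equals the sum of (coded level × response) divided by the total number of observations.

```python
# Sketch reproducing the coefficients of Example 9.3 (names are illustrative).
data = {
    (-1, -1): [61, 63],
    (-1, +1): [41, 35],
    (+1, -1): [76, 72],
    (+1, +1): [68, 64],
}

# Flatten to one row per observation: (x1, x2, y).
rows = [(x1, x2, y) for (x1, x2), ys in data.items() for y in ys]
n_obs = len(rows)

b0 = sum(y for _, _, y in rows) / n_obs                  # grand mean
b1 = sum(x1 * y for x1, _, y in rows) / n_obs            # x1 effect
b2 = sum(x2 * y for _, x2, y in rows) / n_obs            # x2 effect
b12 = sum(x1 * x2 * y for x1, x2, y in rows) / n_obs     # interaction, x12 = x1 * x2

print(f"y = {b0:g} + {b1:g} x1 + {b2:g} x2 + {b12:g} x12")
```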

Example 9.4
Find the model standard error and the coefficient of determination for Example 9.3.
Solution: We need to determine the residuals to find the model standard error. The
table below shows the responses yijk and the predicted values or fits ŷijk. The differences
between the observed and predicted values are the residuals according to:

$$
\varepsilon_{ijk} = y_{ijk} - \hat{y}_{ijk}
$$

The model standard error is calculated from the square root of the sum of the
squares of the residuals divided by the appropriate degrees of freedom:

$$
s_\varepsilon = \sqrt{\frac{SS_\varepsilon}{df_\varepsilon}}
$$

where SSε = Σεijk^2 and dfε = 8 – 4 = 4 is the number of degrees of freedom available to
estimate the error after the four regression coefficients b0, b1, b2, and b12 have been
calculated from the eight observations. The following table gives the observed values, the
predicted values according to Equation 9.18, the residuals, and the squares of the
residuals:

x1    x2    yijk    ŷijk    εijk    εijk^2
–1    –1     61      62      –1       1
–1    –1     63      62       1       1
–1    +1     41      38       3       9
–1    +1     35      38      –3       9
+1    –1     76      74       2       4
+1    –1     72      74      –2       4
+1    +1     68      66       2       4
+1    +1     64      66      –2       4

The model standard error is:

$$
s_\varepsilon = \sqrt{\frac{1 + 1 + 9 + 9 + 4 + 4 + 4 + 4}{8 - 4}} = \sqrt{\frac{36}{4}} = 3.0
$$

Figure 9.11 Response y versus interaction x12. (The fitted line has slope b12 = 4.)

To determine the coefficient of determination r^2, we need to find the amount of total
variation present in the response. This is given by:

$$
SS_{total} = \sum\left(y_{ijk} - \bar{y}_{\bullet\bullet\bullet}\right)^2
$$

which is just the sum of squares required to calculate the variance (or standard deviation)
of the yijk. Subtracting ȳ••• from each of the yijk, squaring the results, and adding the
squares, we get:

$$
SS_{total} = \sum\left(y_{ijk} - 60\right)^2
           = (61 - 60)^2 + (63 - 60)^2 + (41 - 60)^2 + \cdots
           = 1476
$$

The coefficient of determination is given by:

$$
r^2 = 1 - \frac{SS_\varepsilon}{SS_{total}} = 1 - \frac{36}{1476} = 0.976
$$

The adjusted coefficient of determination is given by:

$$
r^2_{adjusted} = 1 - \frac{df_{total} \times SS_\varepsilon}{df_\varepsilon \times SS_{total}}
               = 1 - \frac{7 \times 36}{4 \times 1476}
               = 0.957
$$
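
The quantities in Example 9.4 can be reproduced from the fitted model. The Python sketch below is illustrative; the variable names are hypothetical and the coefficients are taken from Equation 9.18.

```python
import math

# Sketch reproducing Example 9.4 from the model of Equation 9.18 (names are illustrative).
b0, b1, b2, b12 = 60.0, 10.0, -8.0, 4.0
rows = [   # (x1, x2, y)
    (-1, -1, 61), (-1, -1, 63), (-1, +1, 41), (-1, +1, 35),
    (+1, -1, 76), (+1, -1, 72), (+1, +1, 68), (+1, +1, 64),
]
n_obs = len(rows)
n_coef = 4

fits = [b0 + b1 * x1 + b2 * x2 + b12 * x1 * x2 for x1, x2, _ in rows]
residuals = [y - fit for (_, _, y), fit in zip(rows, fits)]

ss_error = sum(e ** 2 for e in residuals)
ss_total = sum((y - b0) ** 2 for _, _, y in rows)   # b0 is the grand mean
df_error = n_obs - n_coef
df_total = n_obs - 1

s_e = math.sqrt(ss_error / df_error)
r_sq = 1 - ss_error / ss_total
r_sq_adj = 1 - (df_total * ss_error) / (df_error * ss_total)

print("SS error:", ss_error, " SS total:", ss_total)
print("standard error:", s_e)
print("r squared:", round(r_sq, 3), " adjusted:", round(r_sq_adj, 3))
```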

Example 9.5
Find and interpret the regression coefficient t values and the corresponding p values
for Example 9.3.
Solution: We need to determine the regression coefficient standard deviations in
order to find their t values. The standard deviations for regression coefficients were
defined in Chapter 8. For the constant term, the standard deviation is:

$$
s_{b_0} = s_\varepsilon\sqrt{\frac{1}{n} + \frac{\bar{x}^2}{SS_x}}
$$

Since the mean level of each variable is x̄ = 0, this expression simplifies to:

$$
s_{b_0} = \frac{s_\varepsilon}{\sqrt{n}}
$$

where n = 8 is the total number of observations. For the example problem we have:

$$
s_{b_0} = \frac{3.0}{\sqrt{8}} = 1.061
$$

Also from Chapter 8, the standard deviation of the b1 coefficient is given by:

$$
s_{b_1} = \frac{s_\varepsilon}{\sqrt{SS_x}}
$$

where SSx = Σ(xi – x̄)^2. Again, since x̄ = 0 by design and since the x values are either +1
or –1, SSx = n and this expression simplifies to:

$$
s_{b_1} = \frac{s_\varepsilon}{\sqrt{n}}
$$

so again we have:

$$
s_{b_1} = \frac{3}{\sqrt{8}} = 1.061
$$

Similarly, the standard deviations of the b2 and b12 coefficients are also equal to 1.061.
(It is common for several of the standard deviations of the regression coefficients in
simple designed experiments to be equal.) The t values for the regression coefficients
are found by taking the ratio of each coefficient to its standard deviation so we have:

$$
t_{b_0} = \frac{b_0}{s_{b_0}} = \frac{60}{1.061} = 56.6
$$

$$
t_{b_1} = \frac{b_1}{s_{b_1}} = \frac{10}{1.061} = 9.43
$$

$$
t_{b_2} = \frac{b_2}{s_{b_2}} = \frac{-8}{1.061} = -7.54
$$

$$
t_{b_{12}} = \frac{b_{12}}{s_{b_{12}}} = \frac{4}{1.061} = 3.77
$$

These values correspond to the t values used in hypothesis tests of H0: the coefficient is
zero versus HA: the coefficient is different from zero. For example, tb1 = 9.43 indicates
that the coefficient b1 = 10 is 9.43 of its standard deviations greater than zero. This
indicates, without much doubt, that the coefficient is different from zero (that is, that HA
should be accepted). The corresponding p value indicates just how much doubt there is
in this conclusion. A p value measures the tail area under the t distribution that
characterizes the distribution of the experimental regression coefficient beyond the t value.
Since the hypothesis test being used is two-tailed, the p value gets contributions from
both tails. The degrees of freedom for the t distributions used here are equal to dfε. The
t values for the first three regression coefficients are large enough so that their p
values are very near zero. A t table shows that t0.01,4 = 3.75 so the p value for tb12 = 3.77 is
about p = 2(0.01) = 0.02. The exact p values are easiest to find using MINITAB’s cdf
function or from the Calc> Probability Distribution menu. The results of the regression
analysis are summarized in the following table:

Source     b      s      t      p
Constant   60     1.06   57     0.00
x1         10     1.06   9.4    0.00
x2         −8.0   1.06   −7.5   0.00
x12        4.0    1.06   3.8    0.02

dftotal = 7    dfmodel = 3    dfε = 4
sε = 3.0    r² = 0.977    r²adjusted = 0.957
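
The manual calculation above can be checked with a short script. The following is a minimal sketch in Python rather than MINITAB; the function names `t_pdf` and `two_tailed_p` are illustrative only, and the tail area is found by numerically integrating the t density with Simpson's rule instead of calling a statistics library.

```python
import math

def t_pdf(t, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + t * t / df) ** (-(df + 1) / 2)

def two_tailed_p(t, df, steps=20000):
    """Two-tailed p value: 1 minus the area between -|t| and +|t|.

    The area from 0 to |t| is integrated with Simpson's rule (steps must be even).
    """
    a = abs(t)
    h = a / steps
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central = 2 * total * h / 3
    return max(0.0, 1 - central)

# Values taken from the example: s_eps = 3.0, n = 8 runs, 4 error degrees of freedom.
s_eps, n, df_err = 3.0, 8, 4
coefs = {"Constant": 60, "x1": 10, "x2": -8, "x12": 4}

s_b = s_eps / math.sqrt(n)          # the same standard deviation for every coefficient
results = {}
for name, b in coefs.items():
    t = b / s_b
    results[name] = (t, two_tailed_p(t, df_err))
    print(f"{name:9s} b = {b:6.1f}  s = {s_b:.3f}  t = {t:6.2f}  p = {results[name][1]:.3f}")
```

Because every coefficient shares the same standard deviation in this balanced design, the ranking of the t values is the same as the ranking of the coefficient magnitudes.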

Example 9.6
Perform the MINITAB analysis on the data from Example 9.3 and compare it to the
summary table above. Construct the necessary plots to check assumptions.
Solution: The analysis by MINITAB of the example data is shown in Figure 9.12.
MINITAB confirms all of the values that we calculated manually. The diagnostic plots
that must be constructed and the characteristics to check from each of them are: the
normal plot of residuals for normality and outlier check; a plot of residuals versus obser-
vation order for independence and homoscedasticity; plots of residuals versus inde-
pendent variables for homoscedasticity; and a plot of residuals versus predicted values
for homoscedasticity. There is some redundancy in creating all of these plots; however,
there is no harm done by checking them all. The diagnostic plots are shown in Figure
9.13 and they confirm that all necessary conditions are met, although some of the
graphs are a little coarse and hard to interpret because of the relatively small number
of observations.

9.4 THE 2^3 FACTORIAL DESIGN

The 2^3 factorial design has two levels of each of three variables and requires 2 × 2 × 2
= 8 runs. The 2^3 design matrix is shown in Table 9.1. The matrix of runs is generated
by alternating between levels –1 and +1 for runs one to eight of variable x3. Then vari-
able x2 is generated by alternating pairs of –1s and +1s. Finally, variable x1 is generated
by taking four –1s and then four +1s. Since the same number of –1s and +1s appear in
each column, the experiment is balanced. The assignment of the names x1, x2, and x3 to
the three columns is arbitrary. The two- and three-factor interactions were added to the
table by multiplying the appropriate columns of signs.
The experimental runs in Table 9.1 are organized by their logical or standard order
indicated by the column labeled Std. To prevent the effects of study variables from being

MTB > let c4=c2*c3
MTB > corr c2-c4;
SUBC> nopvalue.

Correlations: x1, x2, x12

        x1     x2
x2   0.000
x12  0.000  0.000

Cell Contents: Pearson correlation

MTB > regress c1 3 c2-c4

Regression Analysis: y versus x1, x2, x12

The regression equation is
y = 60.0 + 10.0 x1 - 8.00 x2 + 4.00 x12

Predictor    Coef  SE Coef      T      P
Constant   60.000    1.061  56.57  0.000
x1         10.000    1.061   9.43  0.001
x2         -8.000    1.061  -7.54  0.002
x12         4.000    1.061   3.77  0.020

S = 3   R-Sq = 97.6%   R-Sq(adj) = 95.7%

Analysis of Variance

Source          DF       SS      MS      F      P
Regression       3  1440.00  480.00  53.33  0.001
Residual Error   4    36.00    9.00
Total            7  1476.00

Source  DF  Seq SS
x1       1  800.00
x2       1  512.00
x12      1  128.00

Figure 9.12 MINITAB analysis of data from Example 9.3.

confounded with lurking variables, the runs should be performed in random order, such
as the order shown in the column labeled Run. The first run of the experiment (Run = 1)
must be configured with x1 at its +1 level, x2 at its –1 level, and x3 at its +1 level. Then
the process should be operated to generate a part or whatever the output of the process
happens to be. After the first run has been completed, the process should be reconfigured
for the second run (Run = 2) and so on until all eight runs have been completed.
If the 2^3 design is to be replicated, the runs should be randomized either by: 1) randomizing
completely over all possible runs or 2) randomizing the order of the runs
within each replicate. If the latter method is used, then the replicates can be treated as
blocks to protect against lurking variables that change from block to block. This bene-
fit makes blocking on replicates preferred over randomizing over all possible runs.
Figure 9.13 Diagnostic plots for Example 9.6: histogram of the residuals, normal probability plot of the residuals, residuals versus the order of the data, residuals versus the fitted values, residuals versus x1, and residuals versus x2.

Table 9.1 Matrix of runs for 2^3 design.

Std  Run  x1  x2  x3  x12  x13  x23  x123
 1    8   –   –   –   +    +    +    –
 2    3   –   –   +   +    –    –    +
 3    2   –   +   –   –    +    –    +
 4    6   –   +   +   –    –    +    –
 5    5   +   –   –   –    –    +    +
 6    1   +   –   +   –    +    –    –
 7    4   +   +   –   +    –    –    –
 8    7   +   +   +   +    +    +    +
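
The construction of Table 9.1 can be sketched in a few lines of code. This Python sketch (not MINITAB) generates the runs in standard order, builds the interaction columns by multiplying the main effect columns, and draws a fresh random run order; the variable names are illustrative, and the random order it produces will generally differ from the Run column printed in the table.

```python
import random
from itertools import product

factors = ("x1", "x2", "x3")
design = []
# product() varies the last factor fastest, so x3 alternates every run,
# x2 every two runs, and x1 every four runs, as in Table 9.1.
for std, levels in enumerate(product((-1, 1), repeat=3), start=1):
    row = dict(zip(factors, levels))
    row["x12"] = row["x1"] * row["x2"]
    row["x13"] = row["x1"] * row["x3"]
    row["x23"] = row["x2"] * row["x3"]
    row["x123"] = row["x1"] * row["x2"] * row["x3"]
    row["Std"] = std
    design.append(row)

# Assign a random run order to the eight runs.
run_order = random.sample(range(1, 9), 8)
for row, run in zip(design, run_order):
    row["Run"] = run

# Every column of a balanced design has as many -1s as +1s, so it sums to zero.
columns = factors + ("x12", "x13", "x23", "x123")
balanced = all(sum(row[c] for row in design) == 0 for c in columns)

for row in design:
    print(row["Std"], row["Run"], *(f"{row[c]:+d}" for c in columns))
print("balanced:", balanced)
```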

When a 2^3 experiment is blocked on replicates, the blocks constitute a fourth variable
that is qualitative. The method of analysis for the blocked 2^3 experiment is to fit a
general linear model (GLM), which is available in MINITAB’s Stat> ANOVA>
General Linear Model menu. The use of general linear models is covered in Chapter 7.
MINITAB’s DOE tools can also analyze the blocked experiment. With either method of
analysis, if there are no significant differences between blocks defined by replicates, then
the model can be simplified by ignoring the blocking structure. The little bit of added
complexity that comes with blocking on replicates is well worth the trouble.
The 2^3 experiment design can be visualized as a cube, as in Figure 9.14, with an
experimental run at each of the cube’s corners. The numbers in parentheses next to each
run correspond to the standard order numbers from Table 9.1. Practice until you become
proficient in thinking about 2^k experiments this way. Even when an experiment involves
more than three variables, it is still possible and useful to think about the design in terms
of just three variables at a time.
A regression analysis of the 2^3 factorial experiment can fit the following model:

$$ y = b_0 + b_1 x_1 + b_2 x_2 + b_3 x_3 + b_{12} x_{12} + b_{13} x_{13} + b_{23} x_{23} + b_{123} x_{123} \qquad (9.19) $$

where the bs are all regression coefficients. The b0 term is referred to as the model’s
constant; the coefficients b1, b2, and b3 are called the main effects; the b12, b13, and b23
terms are two-factor interactions; and b123 is a three-factor interaction. In engineering
and manufacturing it is rare to encounter a significant three-factor interaction so this
term is usually omitted from the model. Then the sum of squares and degrees of free-
dom associated with the three-factor interaction term are pooled (that is, combined)
with the error.

Figure 9.14 2^3 design: a cube with axes X1, X2, and X3 running from –1 to 1 and the runs, labeled (1) through (8) in standard order, at its corners.

Since all of the design variables in the 2^3 experiment have an equal number of runs
at their –1 and +1 levels, the average level of each design variable is x̄i = 0 where i = 1,
2, 3 indicates the design variable. This means that the constant b0 in the regression
model must be equal to the grand mean of the response, that is, b0 = ȳ. While it is appropriate
to compare the magnitudes of b1, b2, . . . , b123 to each other, b0 may be several
orders of magnitude different than the other regression coefficients. The information
contained in b0 is different in nature from the information contained in the other regres-
sion coefficients. It’s best to interpret b0 as a reference value or anchor for the response
and the other regression coefficients as perturbations to the response due to the differ-
ent model terms. For example, the expected values of the response with x1 at its ±1 levels
are given by b0 ± b1, the expected values of the response with x2 at its ±1 levels are given
by b0 ± b2, and so on.
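
The role of b0 as an anchor can be illustrated numerically. The sketch below, written in Python, borrows the fitted coefficients reported in Figure 9.12 for the two-variable example (an assumption made only to have concrete numbers; the same reasoning applies to the 2^3 design) and averages the predicted responses over the balanced design.

```python
from itertools import product

# Coefficients reported in Figure 9.12: y = 60 + 10 x1 - 8 x2 + 4 x12
b0, b1, b2, b12 = 60.0, 10.0, -8.0, 4.0

def predict(x1, x2):
    """Predicted response at coded levels x1 and x2."""
    return b0 + b1 * x1 + b2 * x2 + b12 * x1 * x2

cells = {(x1, x2): predict(x1, x2) for x1, x2 in product((-1, 1), repeat=2)}

def mean_where(index, level):
    """Average predicted response over the cells where one variable is held at a level."""
    values = [v for key, v in cells.items() if key[index] == level]
    return sum(values) / len(values)

grand_mean = sum(cells.values()) / len(cells)
print("grand mean:", grand_mean)                              # equals b0
print("x1 = -1, +1:", mean_where(0, -1), mean_where(0, 1))    # b0 - b1, b0 + b1
print("x2 = -1, +1:", mean_where(1, -1), mean_where(1, 1))    # b0 - b2, b0 + b2
```

The interaction term drops out of each average because its column is balanced within every level of the main effects.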
If an experiment uses a single replicate of a 2^3 design then the model in Equation
9.19 will consume all available degrees of freedom. This means that the model given by
Equation 9.19 would exactly fit all eight data points without error. Not having an error
estimate is a serious problem. Without any degrees of freedom left to estimate error there
is no standard error, no regression coefficient standard deviations, no ANOVA F statis-
tic, and so on. There are two ways to resolve this problem. Either the experiment must
be replicated to provide additional degrees of freedom to estimate the error, or one or
more terms must be eliminated from the model and used to form the error estimate.
Since it is rare to find significant three-factor interactions in engineering and manufac-
turing problems, the three-factor interaction term is usually omitted from the model and
becomes the error estimate. This only provides one degree of freedom for the error esti-
mate, but it’s a start. After fitting the model with just one degree of freedom for the error
estimate, it is likely that other model terms will appear to be insignificant. These terms
can also be eliminated from the model and pooled with the error estimate.
As you refine a model, be careful to preserve its hierarchy. This means that in order
to keep an interaction term in the model, the main effects contributing to the interaction
must be retained in the model whether or not they are statistically significant. If a three-
factor or higher-order interaction is to be retained in the model, then all possible lower-
order interactions must also be retained.
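
The hierarchy rule lends itself to a simple mechanical check. The following Python sketch assumes that factors are named with single letters and that an interaction is written as the concatenation of its factors (for example "AC"); the function name `missing_for_hierarchy` is hypothetical and is not a MINITAB feature.

```python
from itertools import combinations

def missing_for_hierarchy(terms):
    """Return the lower-order terms that must be added to make the model hierarchical.

    Each term is a string of single-letter factor names, e.g. "A" or "ABC".
    """
    present = {frozenset(term) for term in terms}
    required = set()
    for term in present:
        # Every nonempty proper subset of an interaction must also be in the model.
        for size in range(1, len(term)):
            required.update(frozenset(c) for c in combinations(sorted(term), size))
    return sorted("".join(sorted(t)) for t in required - present)

print(missing_for_hierarchy(["A", "C", "AC"]))   # nothing is missing
print(missing_for_hierarchy(["A", "AC"]))        # C must be retained
print(missing_for_hierarchy(["ABC"]))            # all main effects and two-factor interactions
```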
When can you stop refining a model? To some degree the p values for the regres-
sion coefficients can be helpful, but this is really a judgment call that can only be
made by the person doing the analysis. One strategy is to keep eliminating terms from
the model until the standard error reaches a minimum value. Eliminating a term from the
model causes its sum of squares to be combined with the error sum of squares and its
single degree of freedom to be added to the error degrees of freedom. If the term is truly
insignificant, the effect of the additional error degree of freedom will outweigh the addi-
tion to the error sum of squares and the standard error of the model will decrease. If the
dropped term is significant, then the addition to the error sum of squares will outweigh
the benefit of the additional error degree of freedom and the standard error of the model
will increase. There are no hard and fast rules about how far to take the refinements to
a model, but remember, when refining a model, always keep Occam’s razor in mind: the
best model is probably the simplest one that explains the data.

Example 9.7
A 2^3 experiment was performed to study the breaking strength of plastic wire ties
used for stabilizing and organizing wiring in electrical enclosures. The experimental
variables were A: Manufacturer, B: Temperature, and C: Age. The matrix of experi-
mental runs and the breaking strength response are shown in Figure 9.15. The figure
also shows the regression analysis using the full model with main effects and two- and
three-factor interactions. Use the regression analysis and the sums of squares associ-
ated with the different model terms to identify a refined model.
Solution: The regression analysis in Figure 9.15 shows the full model with main
effects and two- and three-factor interactions. Since the experiment has only one replicate
of the 2^3 design, there are no degrees of freedom left over for the error estimate.
Occam says that the most complex factor is the least likely to be important so the three-
factor interaction should be the first one omitted from the model. This frees up a single
degree of freedom so that an error estimate can be made and p values for regression
coefficients can be calculated.
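
Because the columns of the design are orthogonal, the coefficients and sums of squares reported in Figure 9.15 can be reproduced without a regression routine. The Python sketch below uses the response and ±1 columns from the figure's data display; each coefficient is the average of the response weighted by the term's column of signs, and each sum of squares is n times the squared coefficient. This shortcut is an illustration that relies on the balance of the design and is not how MINITAB performs the fit.

```python
from itertools import combinations
from math import prod

# Response and coded factor levels from the data display in Figure 9.15.
y = [91, 123, 68, 131, 85, 87, 64, 57]
factors = {
    "A": [1, 1, -1, 1, 1, -1, -1, -1],
    "B": [-1, 1, -1, -1, 1, -1, 1, 1],
    "C": [-1, 1, -1, 1, -1, 1, -1, 1],
}

n = len(y)
b0 = sum(y) / n                      # the constant is the grand mean
coef, ss = {}, {}
for size in (1, 2, 3):
    for names in combinations("ABC", size):
        # The column for an interaction is the product of its factors' columns.
        column = [prod(factors[f][i] for f in names) for i in range(n)]
        term = "".join(names)
        coef[term] = sum(c * yi for c, yi in zip(column, y)) / n
        ss[term] = n * coef[term] ** 2

print("Constant", b0)
for term in coef:
    print(f"{term:4s} coef = {coef[term]:7.2f}  SS = {ss[term]:8.1f}")
print("Total SS =", sum(ss.values()))
```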
Table 9.2 shows the results of a series of regression analyses, starting from the full
model and progressing to the simplest model where all terms have been dropped, where
the weakest term in the model was dropped in each successive step. (This analysis was
run manually; however, the same analysis can be performed automatically using Stat>
Regression> Stepwise. Be careful using automated stepwise regression because it
doesn’t check to make sure that the models fitted are hierarchical.) The regression coef-
ficient p values, standard errors, and coefficients of determination don’t suggest an
obvious stopping point for model simplification. The larger models suggest that the B
variable doesn’t have any effect on the response so all terms involving B should certainly
be dropped. The model with A, C, and AC is appealing except that the AC term is
weak ( p = 0.107). However, when AC is dropped from the model, then the C term is no
longer statistically significant ( p = 0.080). The only model with terms that are all sta-
tistically significant is the model including only the A variable ( p = 0.026); however,
this model may be oversimplified. The two best candidate models to report appear to be
the model with A, C, and AC and the model with only A. Figure 9.16, which shows the
r² and r²adj values from Table 9.2 as a function of dfmodel, confirms that there’s no abrupt
change in the quality of the model as individual model terms are added or removed.
Clearly, more data are required to clarify the importance of C in the model.
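
The summary rows of Table 9.2 follow directly from pooling each dropped term's sum of squares and degree of freedom into the error. The Python sketch below takes the sequential sums of squares from Figure 9.15 and the order in which terms were dropped in the table, then tracks the standard error, r², and r²adj at each step; the variable names are illustrative.

```python
import math

# (term, sum of squares) in the order the terms were dropped in Table 9.2.
dropped = [("ABC", 72.0), ("AB", 50.0), ("BC", 98.0), ("B", 288.0),
           ("AC", 544.5), ("C", 1012.5), ("A", 2964.5)]
ss_total, df_total = 5029.5, 7

ss_err, df_err = 0.0, 0
rows = []
for term, ss in dropped:
    ss_err += ss                 # pool the term's sum of squares with the error
    df_err += 1                  # and its single degree of freedom
    ms_err = ss_err / df_err
    s_err = math.sqrt(ms_err)
    r2 = 1 - ss_err / ss_total
    r2_adj = 1 - ms_err / (ss_total / df_total)
    rows.append((term, df_err, s_err, r2, r2_adj))
    print(f"drop {term:3s}  dfe = {df_err}  se = {s_err:5.1f}  "
          f"r2 = {r2:.3f}  r2adj = {r2_adj:.3f}")

# The "minimum standard error" strategy picks the step with the smallest se.
best = min(rows, key=lambda row: row[2])
print("smallest standard error after dropping", best[0], "with dfe =", best[1])
```

For these data the standard error is smallest with two error degrees of freedom, yet the other criteria discussed above point toward smaller models, which is why the choice remains a judgment call.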

9.5 THE ADDITION OF CENTER CELLS TO 2^k DESIGNS

The power of any 2^k experiment, that is, its ability to detect small differences between
the ±1 states of each of its variables, can be increased by adding experimental runs.
Runs cannot be added to an experiment in an arbitrary manner, however. To preserve the
very important balance (that is, orthogonality) of the 2^k designs, it is necessary to add
runs by adding complete replicates. Since each replicate requires an additional 2^k runs,
replicating a complete experiment design can be expensive. The only other way to add
runs to a 2^k experiment without unbalancing it is to add center cells to the design. Center

MTB > let c5=c2*c3
MTB > let c6=c2*c4
MTB > let c7=c3*c4
MTB > let c8=c2*c3*c4
MTB > print c1-c8

Data Display

Row    y   A   B   C  AB  AC  BC  ABC
  1   91   1  -1  -1  -1  -1   1    1
  2  123   1   1   1   1   1   1    1
  3   68  -1  -1  -1   1   1   1   -1
  4  131   1  -1   1  -1   1  -1   -1
  5   85   1   1  -1   1  -1  -1   -1
  6   87  -1  -1   1   1  -1  -1    1
  7   64  -1   1  -1  -1   1  -1    1
  8   57  -1   1   1  -1  -1   1   -1

MTB > corr c2-c8;
SUBC> nopvalue.

Correlations: A, B, C, AB, AC, BC, ABC

          A      B      C     AB     AC     BC
B     0.000
C     0.000  0.000
AB    0.000  0.000  0.000
AC    0.000  0.000  0.000  0.000
BC    0.000  0.000  0.000  0.000  0.000
ABC   0.000  0.000  0.000  0.000  0.000  0.000

Cell Contents: Pearson correlation

MTB > regress c1 7 c2-c8

Regression Analysis: y versus A, B, C, AB, AC, BC, ABC

The regression equation is
y = 88.3 + 19.3 A - 6.00 B + 11.2 C + 2.50 AB + 8.25 AC - 3.50 BC + 3.00 ABC

Predictor      Coef  SE Coef  T  P
Constant    88.2500        *  *  *
A           19.2500        *  *  *
B          -6.00000        *  *  *
C           11.2500        *  *  *
AB          2.50000        *  *  *
AC          8.25000        *  *  *
BC         -3.50000        *  *  *
ABC         3.00000        *  *  *

S = *

Analysis of Variance

Source          DF        SS       MS  F  P
Regression       7  5029.500  718.500  *  *
Residual Error   0         *        *
Total            7  5029.500

Source  DF    Seq SS
A        1  2964.500
B        1   288.000
C        1  1012.500
AB       1    50.000
AC       1   544.500
BC       1    98.000
ABC      1    72.000

Figure 9.15 Regression analysis of a 2^3 design using the full model.


Table 9.2 Results from fitting a series of reduced models to a 2^3 design.

                          p values for the model with dfmodel =
Term   Coeff    SS        6      5      4      3      2      1
A      19.25   2964.5     0.098  0.020  0.008  0.008  0.013  0.026
B      –6.00    288.0     0.295  0.162  0.142
C      11.25   1012.5     0.166  0.055  0.034  0.048  0.080
AB      2.50     50.0     0.558
AC      8.25    544.5     0.222  0.096  0.072  0.107
BC     –3.50     98.0     0.451  0.333
ABC     3.00     72.0

dfmodel   7       6       5       4       3       2       1       0
dfe       0       1       2       3       4       5       6       7
SSmodel   5029.5  4957.5  4907.5  4809.5  4521.5  3977.0  2964.5  0
SSe       0       72.0    122     220     508     1052.5  2065    5029.5
MSe       *       72.0    61.0    73.3    127     210.5   344.2   718.5
se        *       8.5     7.8     8.6     11.3    14.5    18.6    26.8
r²        1.0     0.986   0.976   0.956   0.899   0.791   0.589   0
r²adj     *       0.900   0.915   0.898   0.823   0.707   0.521   0

Figure 9.16 r² and r²adj versus dfmodel for wire tie strength example.

cells have all of their variables at their zero level, that is, (x1, x2, . . .) = (0, 0, . . .). This
means that to add center cells to an experiment, all of the variables in the experiment
must be quantitative and that a zero level midway between the ±1 state of each variable
must be available. Any number of center cells can be added to an experiment without
unbalancing the design.

It may seem strange that the error estimate for a 2^k design with centers can be substantially
determined using information from the center cells, especially when an experiment
has few or even no error degrees of freedom before the center cells are added.
This practice is justified as long as the homoscedasticity assumption, that the distribu-
tion of error variability is constant throughout the design space, is satisfied.
The increase in the power of an experiment provided by the addition of center cells
is limited. When an experiment has relatively few initial error degrees of freedom,
adding center cells can improve the situation significantly. For example, a 2^3 experiment
with just one replicate will only have one error degree of freedom if the model
includes main effects and two-factor interactions:

$$ df_\varepsilon = df_{total} - df_{model} = (8 - 1) - (3 + 3) = 1 $$

The addition of a few center cell runs can provide the additional error degrees of free-
dom that make the analysis of the experiment easier and confidence in its interpretation
higher. If, however, the 2^3 design has been replicated many times, the additional center
cells won’t increase the number of error degrees of freedom enough to improve the error
estimate significantly.
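
The degrees-of-freedom bookkeeping above is easy to tabulate. The Python sketch below uses a hypothetical helper, `error_df`, and assumes that the model contains only main effects and two-factor interactions unless told otherwise, with no block term and no curvature term (each of which would consume additional degrees of freedom).

```python
def error_df(k, replicates=1, center_runs=0, model_terms=None):
    """Error degrees of freedom for a 2^k design with optional center cells.

    By default the model is assumed to hold the k main effects and the
    k(k - 1)/2 two-factor interactions; blocks and curvature are ignored.
    """
    n_runs = replicates * 2 ** k + center_runs
    if model_terms is None:
        model_terms = k + k * (k - 1) // 2
    return (n_runs - 1) - model_terms

print(error_df(3))                                # one replicate, no centers
print(error_df(3, center_runs=4))                 # four center cells added
print(error_df(3, replicates=4))                  # four replicates, no centers
print(error_df(3, replicates=4, center_runs=4))   # centers add little here
```

Going from one to five error degrees of freedom is a large relative gain, while going from 25 to 29 is not, which is the point made in the text.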
Although the use of center cells can improve the power of some experiments, the
usual reason that center cells are added to 2^k experiments is to allow for a test of curvature
in the response between the ±1 levels of quantitative design variables. Since the
2^k experiments only have two levels of each variable, they provide no opportunity to test
the assumed linearity between the ±1 states, which is essential if the model is to be used
for interpolation. Center cells provide the necessary third state required to test the lin-
earity assumption. Tests for the linearity assumption will be presented in Chapter 11.
For the purposes of this chapter, the discussion of center cells will be limited to their
contribution to the error degrees of freedom; however, if you are using center cells in a
2^k design, you should definitely read ahead into Chapter 11 to learn how to evaluate
your experiment for possible curvature.

9.6 GENERAL PROCEDURE FOR ANALYSIS OF 2^k DESIGNS

Regardless of what software you’re using to analyze 2^k designs, or whether you do the
analysis step by step or with some off-the-shelf comprehensive analysis package, the same
basic steps need to be considered. These steps are:
1. Enter the design matrix of ±1 values into the appropriate columns of the
worksheet. Add columns indicating the standard order, run order, blocks,
and experimental response. Use an appropriate missing value symbol in the
response column to indicate any missing observations. (MINITAB’s missing
value symbol is an asterisk [*].)

2. Create columns for the two-factor and, if necessary, any desired higher-order
interactions by multiplying the appropriate columns of main effects together.
3. Delete any rows/runs that correspond to missing or otherwise seriously com-
promised observations. Construct the correlation (r) matrix of main effects and
interactions. The purpose of constructing and inspecting the correlation matrix
is to check the validity and integrity of the experiment design. If the experi-
ment was designed correctly and there were no missing or extra observations,
the correlation matrix should have r = 0 everywhere except on the diagonal
where the row variable and the column variable are the same where r = 1. This
structure in the correlation matrix is present by design; it is an important char-
acteristic of a well-designed experiment. If there was a mistake creating the
matrix of runs or the interactions, or if there were missing or extra runs that
unbalanced the experiment, at least some of the off-diagonal correlations will
be non-zero. Inspect these cases to determine if any correlations are so high
that they compromise the integrity of the experiment. If so, identify and imple-
ment appropriate corrective actions before attempting to analyze the experiment.
4. Perform and interpret some type of graphical analysis of the response as a
function of the design variables, such as a multi-vari chart for smaller
experiments or main effects and interaction plots for larger ones.
5. Analyze the response as a function of the main effects, two-factor interactions,
any desired higher-order interactions, and blocks. Interpret the p values of the
model terms to determine which terms are significant. Interpret the standard
error of the model se and the adjusted coefficient of determination r²adj.
6. If the model has a large number of terms, create a normal probability plot of
the regression coefficient t values. (The t values are used instead of the
coefficients themselves because sometimes the coefficients can have different
standard errors, so it’s more appropriate to compare their t values instead.)
Points that plot near ti = 0 correspond to insignificant model terms, and
outliers correspond to significant model terms. Add reference lines to the
normal plot corresponding to the threshold t values at ±tα/2,dfε to help distinguish
between insignificant and significant terms.
7. Perform an analysis of the residuals to validate the analysis method:
a. Inspect the histogram of residuals for normality and potential outliers.
b. Inspect the normal probability plot of the residuals for normality and
potential outliers.
c. Inspect the plot of residuals versus run order for homoscedasticity and
independence.
d. Inspect the plot of residuals versus fits for homoscedasticity.
e. Inspect the plot of residuals versus blocks for homoscedasticity.
f. Inspect each plot of the residuals versus the independent variables for
homoscedasticity.
8. If a more quantitative test for outliers is required, compare the deleted Studentized
residuals for the suspect observations to the Bonferroni corrected critical
value tα/(2n),dfε. Observations whose deleted Studentized residuals are larger in
magnitude than this critical value are probably statistical outliers. Attempt to
correlate any unusual observations from when the experiment was being per-
formed to the suspected outliers to find grounds for omitting these observa-
tions from the data set; however, never omit any observations without good cause.
Resolve the effects of outliers on the model before accepting a final model.
9. Identify terms that can be omitted from the model without significantly
compromising its predictive capability and refine the model. Be careful to
preserve the hierarchy of terms in the model. This operation may have to be
performed in a series of incremental steps.
10. Accept a refined model, determine its regression coefficients, standard error,
and adjusted coefficient of determination, and confirm that the assumptions
required of the analysis method are satisfied.
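
The correlation matrix check in step 3 can be sketched outside of MINITAB as well. The Python code below builds the main effect and two-factor interaction columns of a 2^3 design, computes every pairwise Pearson correlation, and then repeats the check after one run is deleted to mimic a missing observation. The helper names are illustrative, and the choice of which run to delete is arbitrary.

```python
from itertools import combinations, product
from math import sqrt

def pearson(u, v):
    """Pearson correlation coefficient between two equal-length columns."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    s_uv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    s_uu = sum((a - mean_u) ** 2 for a in u)
    s_vv = sum((b - mean_v) ** 2 for b in v)
    return s_uv / sqrt(s_uu * s_vv)

def model_columns(runs):
    """Main effect and two-factor interaction columns for a list of (x1, x2, x3) runs."""
    cols = {f"x{i + 1}": [run[i] for run in runs] for i in range(3)}
    for a, b in combinations(("x1", "x2", "x3"), 2):
        cols["x" + a[1] + b[1]] = [p * q for p, q in zip(cols[a], cols[b])]
    return cols

def max_off_diagonal(cols):
    """Largest absolute correlation between two different model terms."""
    return max(abs(pearson(cols[a], cols[b])) for a, b in combinations(cols, 2))

full = list(product((-1, 1), repeat=3))
damaged = full[:-1]          # pretend the (+1, +1, +1) run was lost

print("complete design:", max_off_diagonal(model_columns(full)))
print("one run missing:", round(max_off_diagonal(model_columns(damaged)), 3))
```

With the complete design every off-diagonal correlation is zero; losing a single run introduces small but nonzero correlations among all of the terms.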

9.7 2^k FACTORIAL DESIGNS IN MINITAB

9.7.1 Creating the 2^k Designs in MINITAB

There are several different ways to create 2^k factorial designs in MINITAB:
• Manually enter all of the ±1 values for each column into the worksheet.
• Copy the design from an existing file, such as the appropriate 2^k.mtw MINITAB
worksheet or the 2^k.xls Excel worksheet provided on the CD-ROM included
with this book.
• Use the set command (or the Calc> Make Patterned Data> Simple Set of
Numbers menu) to create the necessary pattern of ±1 values for each column.
• Use MINITAB’s Stat> DOE> Factorial> Create Factorial Design menu to
specify and create the design.
The first three methods are much less important now that MINITAB contains the Stat>
DOE tools; however, these methods are still useful in special cases and are certainly
worth studying to better understand the 2^k factorial designs. Always check your work
carefully if you use any of these methods, because an unidentified mistake at this stage
can ruin an experiment. You will also have to create your own randomization and block-
ing plans for the experimental runs.
The first method above, manually creating the matrix of experimental runs, is
only practical for the smallest designs. For example, the 2^2 and 2^3 designs with just four
and eight runs, respectively, can be quickly typed into a worksheet and then copy/paste
operations can be used to create the desired number of replicates. Any experiments
larger than 2^4 are too complicated and time-consuming to create manually, however, and
there are just too many opportunities to make mistakes.
Opportunities for using the second method above, copying the design from an
existing worksheet, might be more frequent than you think. Relatively few of the 2^k
designs are regularly used, so a small collection of designs can be quite comprehensive.
The handful of designs included on the CD-ROM will be sufficient for most experiments.
And once a new design has been created, it’s easy to copy it from its original worksheet
and paste it into a new one when that design is required for another experiment.
With some practice, the third method above, creating the matrix of experimental
runs with the set command (or the Calc> Make Patterned Data> Simple Set of Numbers
menu), can be a fast and safe way of creating 2^k designs. A 2^k design will require k calls of
the set command to create the necessary pattern of ±1 values for each of the k columns
of the design. The set command can also be used to create a column for the standard
order of the runs and then the sample command (or the Calc> Random Data>
Sample from Columns menu) can be used to determine the random run order.
The fourth method above—creating the design using the Stat> DOE> Factorial>
Create Factorial Design menu—is quick, safe, and easy, and designs created by this
method are ready to analyze from the Stat> DOE> Factorial> Analyze Factorial Design
menu. MINITAB takes care of all of the randomization and blocking, too. Designs cre-
ated by other means can still be analyzed with Stat> DOE> Factorial> Analyze
Factorial Design if they are first defined in MINITAB using Stat> DOE> Factorial>
Define Factorial Design. This step creates the necessary hidden links between the work-
sheet and MINITAB so that the Stat> DOE> Factorial> Analyze Factorial Design func-
tion can locate the necessary information to complete the analysis.

Example 9.8
Use MINITAB’s Calc> Make Patterned Data> Simple Set of Numbers menu or the
set command to create the 2^4 experiment design.
Solution: The first four columns of the MINITAB worksheet were named x1, x2, x3,
and x4. The following table shows how the inputs to the Calc> Make Patterned Data>
Simple Set of Numbers menu were set to create the matrix of experimental runs:

Store patterned data in:    x1   x2   x3   x4
From first value:           –1   –1   –1   –1
To last value:               1    1    1    1
In steps of:                 2    2    2    2
List each value:             8    4    2    1
List the whole sequence:     1    2    4    8

Alternatively, the corresponding set commands could be typed at the command prompt
in the Session window. For example, to create the x1 column in C1, use:

mtb > set c1
data > 1(-1:1/2)8
data > end.

When you use the Calc> Make Patterned Data> Simple Set of Numbers menu with the
command prompt enabled, these set commands will also appear in the Session window.
Figure 9.17 shows these commands in the Session window and the resulting matrix of
experimental runs. The set command was also used to determine the standard order col-
umn, and the sample command (or the Calc> Random Data> Sample from Columns
menu) was used to determine a random order for the runs.
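
The logic behind the patterned-data settings can be mimicked in a few lines of Python. This sketch is not MINITAB code: the `patterned` function is a hypothetical stand-in for the Simple Set of Numbers dialog, with arguments that follow the rows of the table above, and `random.sample` plays the role of the sample command.

```python
import random

def patterned(first, last, step, each, whole):
    """List the values first, first+step, ..., last; repeat each value `each`
    times and then repeat the whole sequence `whole` times."""
    values = []
    v = first
    while v <= last:
        values.append(v)
        v += step
    column = []
    for _ in range(whole):
        for value in values:
            column.extend([value] * each)
    return column

# (each, whole) settings for x1..x4, taken from the table in Example 9.8.
settings = {"x1": (8, 1), "x2": (4, 2), "x3": (2, 4), "x4": (1, 8)}
design = {name: patterned(-1, 1, 2, each, whole)
          for name, (each, whole) in settings.items()}

std_order = list(range(1, 17))
run_order = random.sample(std_order, len(std_order))   # random order for the 16 runs

for row in zip(std_order, run_order, *design.values()):
    print(*row)
```

For each column the product of the two repeat counts is 8, so every column has 16 entries, and the 16 rows cover every combination of levels exactly once.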

Figure 9.17 Creating the 2^4 design with the Calc> Make Patterned Data> Simple Set of Numbers menu or the set command.

Example 9.9
Use MINITAB’s Stat> DOE> Factorial> Create Factorial Design menu to recreate
the 2^4 experiment design.
Solution: The 2^4 design was created using MINITAB’s Stat> DOE> Factorial>
Create Factorial Design menu. In the Create Factorial Design menu, a two-level
factorial design with four factors was selected. Then in the Designs menu the Full
factorial design was chosen. These instructions and the resulting 16-run experiment design
with the runs in random order are shown in Figure 9.18. The CenterPt and Blocks
columns are special columns required by MINITAB’s Stat> DOE> Factorial> Analyze
Factorial Design tool.

Figure 9.18 Stat> DOE> Factorial> Create Factorial Design configuration to create the 2^4 design.

9.7.2 Analyzing the 2^k Factorial Designs with MINITAB

There are at least three different ways to analyze data from a 2^k factorial experiment
using MINITAB. The first and easiest way is to use the experiment design and analysis
tools provided in MINITAB’s Stat> DOE> Factorial> Analyze Factorial Design menu.
The second way is to use the special experiment analysis macros provided with this text.
These are MINITAB local .mac macros that include the standard set of analyses along
with some special analyses that aren’t usually provided but are still easy to implement.
The third way to analyze a 2^k experiment in MINITAB is to run the analysis manually,
step by step. The three methods give largely the same results.
The two primary MINITAB tools that can be used to perform manual analyses of
2^k designs are the Stat> Regression> Regression menu and the Stat> ANOVA> General
Linear Model menu. Both methods give exactly the same results when they are con-
figured correctly. The primary disadvantages of the Stat> Regression> Regression menu
are: 1) you must create columns for all of the desired interactions before running the
analysis and 2) if the experiment was run in blocks and you want to include a term for
blocks in the model, you have to build columns of indicator variables for the blocks
using the method of Section 8.17. The advantages of the Stat> Regression> Regression
menu are that after all of the columns for the interactions and blocks are created: 1) it
is easy to create the correlation matrix to check the integrity of the design/experiment,
2) it is very easy to configure the regression analysis, and 3) the algorithm is a little bit
more robust to problems with some models than other methods.
The primary disadvantages of the Stat> ANOVA> General Linear Model menu
are: 1) you have to explicitly type out the desired model, which can be tedious and
tricky for larger designs, 2) the algorithm can be sensitive to problems with some models,
and 3) there’s no easy way to create the correlation matrix. The primary advantages are:
1) the GLM tool is very flexible and can be used to analyze very complex experiments
and 2) if the experiment is blocked it’s easy to include blocks in the model.

Manual Analysis with Stat> Regression> Regression


To better appreciate the more automated analysis methods, let’s consider the manual
method of analysis first. Even if you always intend to use MINITAB’s integrated DOE
analysis tools in the Stat> DOE menu, it’s useful to consider these steps to better under-
stand what MINITAB is doing. The method described here uses the Stat> Regression>
Regression approach and is the same method captured in the mlrk.mac macros, which
will be described in the next section.
The steps in a manual analysis of a 2^k experiment using MINITAB are:
1. Enter the matrix of experimental runs into a new MINITAB worksheet or
load the desired design from an existing project or worksheet file if the design
already exists. The worksheet should have one column for each variable. Use
short but informative names preceded by a generic prefix for the columns such
as x1:Temp, x2:Press, x3:Time, or A:Temp, B:Press, C:Time.
2. If the experiment was replicated, copy the first replicate into new rows of the
original columns for as many replicates as are required. If the experiment was
blocked, perhaps on replicates, add a column to uniquely distinguish each block.
3. Enter the response into a single column of the MINITAB worksheet and give it
an informative name. Indicate missing values with the “*” missing value symbol.
Add a column to the worksheet indicating the run order of the observations.
4. Use the Calc> Calculator menu or let statements to build all of the two-factor
interactions to be included in the model. Higher-order interactions can also be
generated but usually aren’t. Use the generic variable prefixes to name the
interactions, such as x12, x13, x14, or AB, AC, AD.
5. If the experiment was blocked, use the Calc> Make Indicator Variables menu
or the indicator command to translate the blocking column with b blocks into
b columns of indicator variables.
6. Create the correlation matrix including all main effects, two-factor interactions,
and blocks from the Stat> Basic Stats> Correlation menu or with the corre-
lation command. You may wish to suppress the obnoxious display of p values
for the correlation coefficients by turning off the Display p-values option or by
using the nopvalues subcommand. By design, the correlation matrix should be
diagonal with zero correlation between unlike terms (for example, x1 and x2)
and unity correlation between like terms (for example, x1 and x1). Inspect the
correlation matrix for non-zero values in the off-diagonal fields, which would
indicate errors in an earlier step. Some correlation is expected between the
different blocks but these correlations should be symmetric.
7. If there are more than a few missing observations, copy the worksheet to a new
worksheet, delete the rows that contain missing observations, then recreate
the correlation matrix and inspect it for substantial correlations between
model terms. If there are any substantial correlations, the experiment is
compromised and remedial actions will have to be taken before it’s safe to
complete the analysis.
8. In the first b – 1 of b columns of indicator variables for the blocks, change all
of the zero values in the rows corresponding to the last block to –1s.
9. Use the Stat> Regression> Regression menu or the regress command to analyze
the response as a function of all of the desired main effects, interactions, and
blocks. Include only the first b – 1 columns of modified block indicators in
the model. Open the Graphs menu and turn on the appropriate residuals
diagnostic plots. Set the Storage menu to store the residuals, fits, regression
coefficients, and deleted (Studentized) t residuals in case they’re needed for
further analysis.
10. Inspect the residuals diagnostic plots for normality, homoscedasticity, inde-
pendence, and the presence of outliers. If any of the observations appear to
be outliers, check their deleted (Studentized) t residuals that were stored in
step 9. Observations with deleted Studentized residuals of magnitude greater
than $t_{\alpha/(2n),\,df_\epsilon}$ are probably outliers, where n is the number of observations in the
data set and $df_\epsilon$ is the error degrees of freedom.
11. If the model has lots of terms, many of which might be statistically
insignificant, create a normal probability plot of the regression coefficient
t values to help distinguish the significant coefficients from the
insignificant ones. To create this plot, use MINITAB’s block copy operation
to copy the regression coefficient t values from the Session window into a
new column of the worksheet.* Omit or delete the t value for the model
constant because it’s fundamentally different from the other coefficients
and not of interest here. Then use the Stat> Basic Stats> Normality Test or
Graph> Probability Plot menus, or the normtest or pplot commands to create
the normal plot. The insignificant regression coefficients will appear near
the middle of the normal plot with coefficient t values near zero and
significant coefficients will appear as outliers. Add reference lines at
$\pm t_{\alpha/2,\,df_\epsilon}$ to help distinguish between significant and
insignificant coefficients.

* To use MINITAB’s block copy operation, hold down the ALT key and use the mouse to select the desired rectangular
area in the Session window. Then use the usual copy and paste operations to put a copy of the selected data into
a new column or columns of the worksheet.

12. Refine the model by dropping insignificant terms identified from the regression
coefficient p values and from the normal plot of the regression coefficients.
When there are terms on the borderline of being significant ($p \approx 0.05$), it may
be necessary to refine the model in a series of steps by dropping one term—
the weakest term—at each step so that the p values can be watched carefully.
If a two-factor interaction is to be retained in the model then both of the
corresponding main effects must also be retained, whether or not they are
significant, to preserve the hierarchy of terms in the model.
13. Before a refined model can be accepted, the residuals must be checked for:
normality; homoscedasticity with respect to the regression predictors, run
order, and fitted values; and independence. These checks can be done from
the residuals diagnostic graphs.
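
Steps 4 through 8 above can be sketched outside of MINITAB to see what the worksheet columns should contain. The following is a minimal Python illustration, not part of the book's macros: it assumes a single-replicate $2^3$ design, and the block assignment at the end is hypothetical. It builds the two-factor interaction columns as elementwise products, confirms that the correlation matrix is diagonal, and recodes block indicators so that the rows of the last block become –1 in the first b – 1 indicator columns.

```python
from itertools import combinations, product


def pearson(u, v):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    s_uv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    s_uu = sum((a - mean_u) ** 2 for a in u)
    s_vv = sum((b - mean_v) ** 2 for b in v)
    return s_uv / (s_uu * s_vv) ** 0.5


# Step 1: matrix of runs for a 2^3 design in coded -1/+1 units.
names = ["A", "B", "C"]
runs = list(product([-1, 1], repeat=len(names)))
columns = {name: [run[j] for run in runs] for j, name in enumerate(names)}

# Step 4: each two-factor interaction column is an elementwise product.
for p, q in combinations(names, 2):
    columns[p + q] = [a * b for a, b in zip(columns[p], columns[q])]

# Step 6: off-diagonal correlations should all be zero for an intact design.
terms = list(columns)
off_diagonal = [
    abs(pearson(columns[s], columns[t]))
    for s in terms
    for t in terms
    if s != t
]
print("terms:", terms)
print("largest off-diagonal correlation:", max(off_diagonal))


def coded_block_columns(blocks):
    """Steps 5 and 8: indicator columns for the first b - 1 blocks, with the
    rows belonging to the last block set to -1 in every column."""
    levels = sorted(set(blocks))
    last = levels[-1]
    return {
        f"Block.{lev}": [-1 if b == last else int(b == lev) for b in blocks]
        for lev in levels[:-1]
    }


# Hypothetical block labels for six runs in three blocks.
block_columns = coded_block_columns([1, 1, 2, 2, 3, 3])
print(block_columns)
```

Any non-zero off-diagonal value in a check like this would point to a data entry error in one of the earlier steps, which is the purpose of step 6.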

Analysis with the mlrk.mac Macros


The mlrk.mac macros provided with this text are MINITAB local .mac macros that
capture most of the instructions described in the previous section. These macros have
some extra features that aren’t important for $2^k$ designs but will be required for the
designs in the next two chapters.
The mlrk.mac macros are run from the MINITAB command prompt with calling
statements like:

mtb > %mlr3 "Std" "Run" "Center" "Block" "A" "B" "C" "Y";
subc > terms "AB" "AC" "BC" "AA" "BB" "CC".

When the columns of the worksheet are in the necessary order, the calling statement is
much simpler, like mtb> %mlr3 c1-c8. The “k” in the generic mlrk.mac designa-
tion indicates the number of variables in the design, so mlr3.mac is expecting a three-
variable experiment, mlr4.mac is expecting a four-variable experiment, and so on. It is
up to you to apply the correct mlrk.mac macro to your experimental data. The defini-
tion and order of the input columns to the mlrk.mac macros are identical to the columns
created by MINITAB’s Stat> DOE> Factorial> Create Factorial Design menu, so the
mlrk.mac macros can be used as an alternative to MINITAB’s Stat> DOE> Factorial>
Analyze Factorial Design method. The optional terms subcommand stores the indi-
cated two-factor interactions and quadratic terms in the indicated columns of the work-
sheet. Storing these columns in the worksheet simplifies the steps required to refine a
model after the initial model is fitted with the macro. Descriptions of the data formats
and instructions for the use of the mlrk.mac macros are given in comments at the begin-
ning of each macro that can be viewed in a text editor like Notepad. As an example, the
text of the mlr3.mac macro is shown in Figure 9.19.
If the experiment is saturated, that is, if there aren’t sufficient degrees of freedom
to estimate the error, the mlrk.mac macros generate the error: “Error: Not enough data
in column” in which case you will have to continue the analysis manually with a smaller
model from the Stat> Regression> Regression or the Stat> ANOVA> General Linear
Model menus. However, before the error is encountered, the macro completes many of
its early steps in the analysis so the new model is relatively easy to configure.
The mlrk.mac macros cannot create the normal probability plot of the regression
coefficient t values because MINITAB doesn’t have the ability to store them. After the
macro is run, however, you can copy the t values from the Session window, paste them
back into the worksheet, and then create the normal plot. To simplify this operation, a
special macro called coefftnormplot.mac was written and is included on the CD-ROM.
The inputs to the macro are the column of regression coefficient names, the column of
t values, and the number of error degrees of freedom. An example calling statement for
the macro is:
mtb > %coefftnormplot "Coeff" "T" 18

The mlrk.mac macros also call another custom macro that creates a normal plot of
the deleted Studentized residuals, which are useful for identifying statistical outliers.
Each observation is identified by its run number and reference lines are displayed at
the Bonferroni-corrected critical values given by $\pm t_{\alpha/(2n),\,df_\epsilon}$ to assist in the identification
of outliers.
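
The two kinds of reference lines used on these normal plots can be checked numerically. The sketch below is a standard-library Python illustration, not the book's macro code: the helper names are made up for this example, the t distribution tail area is obtained by Simpson's rule, and the critical value is found by bisection. The values n = 32 and $df_\epsilon$ = 16 are those of the $2^5$ experiment analyzed in Example 9.10.

```python
import math


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))


def t_upper_tail(x, df, steps=2000):
    """P(T > x) for x >= 0, integrating the density on [0, x] by Simpson's rule."""
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3


def t_critical(tail_prob, df):
    """Value of t with the given upper-tail probability, found by bisection."""
    lo, hi = 0.0, 50.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_upper_tail(mid, df) > tail_prob:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


n, dfe, alpha = 32, 16, 0.05
t_coef = t_critical(alpha / 2, dfe)        # reference lines for coefficient t values
t_bonf = t_critical(alpha / (2 * n), dfe)  # Bonferroni-corrected lines for outliers
print(round(t_coef, 3), round(t_bonf, 3))
```

The results should agree with the reference line positions printed on the normal plots of Example 9.10 (±2.12 and ±3.803). The Bonferroni correction divides the tail area by the number of observations because every observation is being tested as a potential outlier.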

Example 9.10
The experimental data from a $2^5$ experiment are shown in Figure 9.20. Use the
mlr5.mac macro to analyze the data and then interpret the results.
Solution: The experimental data were entered into a MINITAB worksheet and then
analyzed using the mlr5.mac macro. The command statement was:

mtb > %mlr5 c1-c10;
subc > terms c11-c25.

The main effects plot in Figure 9.21 suggests that there might be significant effects
associated with B, C, and E. The interactions plot in Figure 9.22 suggests that there
might be significant interactions due to BC, CE, and possibly AC. The Session window
output is shown in Figure 9.23. The correlation matrix, which was edited for space and
clarity, confirms that all of the terms in the model are independent of each other. The
terms AA, BB, . . . , and EE are quadratic terms that mlr5.mac attempts to include in
the model but cannot because the design is not capable of resolving them; consequently,
MINITAB omits them from the regression analysis that follows. The Blocks term was
also omitted from the model because there is only one block in the experiment. The
macro
mlr3 Std Run Ctr Blo A B C Y;
  terms AB AC BC AA BB CC.
#PGMathews, 18 May 2004, V1.0 for Minitab V14
#Copyright (C) 2004 Mathews Malnar and Bailey, Inc.
#See Mathews, Design of Experiments with Minitab, ASQ Press, 2004 for details.
#This macro performs the analysis of a three variable designed experiment with main
#effects, two-factor interactions, quadratic terms, and blocks.
#The expected input data structure is the standard column format created by Minitab 14's
#Stat> DOE> Factorial> Create Factorial Design or ...> Response Surface> Create Response
#Surface Design menus. The macro is suitable for 2^3 and 2^(3-1) with and without centers,
#3^3, BB(3), and CC(3) designs. When two or more terms are confounded, the first term
#encountered will be retained in the model.
#The 'terms' subcommand will output the calculated interactions and quadratic terms
#into the 6 specified columns. (Be careful because those columns will be overwritten.)
#Then subsequent analyses, such as to refine the model, can be performed using Stat> ANOVA>
#General Linear Model.
#Example calling statement:
# mtb> %mlr3 c1-c8;
# subc> terms c9-c14.
mcolumn Std Run Ctr Blo A B C Y
mcolumn AB AC BC AA BB CC
mcolumn ID IDCount Block.1-Block.20 DSR coeff
mconstant NumBlo dfmodel i dfe alphaB tcrit

#Make indicator variable columns for the blocks.
max Blo NumBlo #Number of blocks
indicator Blo Block.1-Block.NumBlo #Make indicator columns for blocks

#Calculate the interaction and quadratic terms and construct the correlation matrix.
let AB=A*B
let AC=A*C
let BC=B*C
let AA=A*A
let BB=B*B
let CC=C*C
if terms=1 #name the interactions and quadratic terms
  name AB "AB" AC "AC" BC "BC" AA "AA" BB "BB" CC "CC"
endif
#If you need to view the p values, remove the ; from the following line and comment out the nopvalues subcommand.
corr A B C AB AC BC AA BB CC Block.1-Block.NumBlo;
  nopvalues.

#Fix the block codes for the last (reference) block. To view the coding convention search Minitab
#Help for "Design matrix used by General Linear Model".
let i=1
while i<NumBlo
  let Block.i=Block.i-Block.NumBlo
  let i=i+1
endwhile

#Create the plots of main effects and interactions.
main Blo A B C;
  response Y.
interact A B C;
  response Y;
  full.

#Run the regression analysis
if NumBlo>1 #then the experiment is blocked
  let NumBlo=NumBlo-1 #Keep the last block out of the model
  let dfmodel=NumBlo+3+3+3
  regress Y dfmodel Block.1-Block.NumBlo A B C AB AC BC AA BB CC;
    gfourpack; #residuals diagnostic plots
    gvars Blo A B C; #residuals vs. blocks and study variables
    tresiduals DSR; #store the deleted Studentized residuals for outlier analysis
    coeff coeff.
else #no blocking
  let dfmodel=3+3+3
  regress Y dfmodel A B C AB AC BC AA BB CC;
    gfourpack;
    gvars A B C;
    tresiduals DSR;
    coeff coeff.
endif

#Create normal plot of deleted Studentized residuals with Bonferroni critical values.
let dfe=count(Y)-count(coeff) #error degrees of freedom, note coeff includes the constant
let alphaB=1-0.05/count(Y)/2 #Bonferroni corrected t value for alpha=0.05
invcdf alphaB tcrit;
  t dfe.
call normplotDSR DSR Run tcrit

endmacro

Figure 9.19 Text of mlr3.mac macro.

Row StdOrder RunOrder CP Blocks A B C D E Y


1 25 1 1 1 1 1 -1 -1 -1 226
2 14 2 1 1 -1 1 1 -1 1 150
3 15 3 1 1 -1 1 1 1 -1 284
4 30 4 1 1 1 1 1 -1 1 190
5 29 5 1 1 1 1 1 -1 -1 287
6 2 6 1 1 -1 -1 -1 -1 1 149
7 23 7 1 1 1 -1 1 1 -1 53
8 28 8 1 1 1 1 -1 1 1 232
9 11 9 1 1 -1 1 -1 1 -1 221
10 24 10 1 1 1 -1 1 1 1 -30
11 20 11 1 1 1 -1 -1 1 1 76
12 31 12 1 1 1 1 1 1 -1 270
13 21 13 1 1 1 -1 1 -1 -1 59
14 22 14 1 1 1 -1 1 -1 1 -32
15 3 15 1 1 -1 -1 -1 1 -1 142
16 17 16 1 1 1 -1 -1 -1 -1 121
17 8 17 1 1 -1 -1 1 1 1 -43
18 32 18 1 1 1 1 1 1 1 200
19 19 19 1 1 1 -1 -1 1 -1 123
20 4 20 1 1 -1 -1 -1 1 1 137
21 5 21 1 1 -1 -1 1 -1 -1 1
22 6 22 1 1 -1 -1 1 -1 1 -51
23 26 23 1 1 1 1 -1 -1 1 187
24 13 24 1 1 -1 1 1 -1 -1 265
25 12 25 1 1 -1 1 -1 1 1 233
26 10 26 1 1 -1 1 -1 -1 1 217
27 18 27 1 1 1 -1 -1 -1 1 71
28 16 28 1 1 -1 1 1 1 1 187
29 27 29 1 1 1 1 -1 1 -1 207
30 7 30 1 1 -1 -1 1 1 -1 40
31 1 31 1 1 -1 -1 -1 -1 -1 179
32 9 32 1 1 -1 1 -1 -1 -1 266

Figure 9.20 Experimental data from a $2^5$ experiment.

[Figure: Main Effects Plot (Data Means) for y. Panels: Blocks, A, B, C, D, and E. Vertical axis: mean of y (about 50 to 250). Horizontal axes: factor levels –1 and 1.]

Figure 9.21 Main effects plot from Example 9.10.

[Figure: Interaction Plot (Data Means) for y. Matrix of panels for all pairs of A, B, C, D, and E, with separate lines for the –1 and 1 levels of each variable. Vertical axes: mean of y (about 0 to 200). Horizontal axes: factor levels –1 and 1.]

Figure 9.22 Interactions plot from Example 9.10.

regression analysis indicates that there are many statistically significant terms,
especially B, C, E, AC, BC, and CE. The standard error of the model is $s_\epsilon = 19.21$ and the
adjusted coefficient of determination is $r^2_{adj} = 0.965$.
The graphical analyses of the residuals created by the macro are shown in Figure
9.24. The residuals diagnostic plots confirm that the residuals are normally distributed
and homoscedastic with respect to the run order, the fitted values, and the design vari-
ables as required by the ANOVA method. All of the deleted Studentized residuals in the
normal plot shown in Figure 9.25 fall inside of the Bonferroni-corrected critical values,
so no observations are statistical outliers. The normal plot of the regression coefficient
t values in Figure 9.26 was created with the coefftnormplot.mac macro and confirms the
conclusions drawn from the regression coefficient p values: there appear to be significant
effects due to B, C, E, BC, CE, and AC. The many insignificant model terms stack
up in a nice line centered at about $t_i = 0$, which makes the outliers (the statistically
significant terms) easy to identify.
Although the regression model has plenty of error degrees of freedom despite its
relatively large size, the model should be simplified by eliminating terms that do not
contribute to it. An obvious choice is to eliminate all terms involving D because none of
these terms are statistically significant. Other terms also can be dropped from the model.
Figure 9.27 shows the regression analysis obtained using Stat> Regression>
Regression including the statistically significant terms from the original model. It was
necessary to retain the statistically insignificant A term in the refined model to preserve
the hierarchy of terms since AC is to appear in the model. The new residuals diagnos-
tic plots (not shown) indicated that the residuals were still normal and homoscedastic
as required by the regression method.
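
The regression coefficients in this example can be reproduced by hand because the design is balanced and orthogonal: each coefficient is just the sum of (coded term value × response) divided by the number of runs. The following Python sketch is an illustration under that assumption, not the mlr5.mac macro. It uses the (StdOrder, Y) pairs from Figure 9.20 and relies on the fact that in that figure the coded levels follow the binary digits of StdOrder – 1 with A varying slowest.

```python
from itertools import combinations

# (StdOrder, Y) pairs copied from Figure 9.20, listed in run order.
data = [
    (25, 226), (14, 150), (15, 284), (30, 190), (29, 287), (2, 149),
    (23, 53), (28, 232), (11, 221), (24, -30), (20, 76), (31, 270),
    (21, 59), (22, -32), (3, 142), (17, 121), (8, -43), (32, 200),
    (19, 123), (4, 137), (5, 1), (6, -51), (26, 187), (13, 265),
    (12, 233), (10, 217), (18, 71), (16, 187), (27, 207), (7, 40),
    (1, 179), (9, 266),
]
factors = "ABCDE"


def levels(std):
    """Coded -1/+1 levels for a StdOrder value (A is the slowest-varying bit)."""
    bits = format(std - 1, "05b")
    return {f: (1 if bit == "1" else -1) for f, bit in zip(factors, bits)}


def term_value(term, lev):
    """Value of a main effect ('B') or interaction ('BC') at one run."""
    value = 1
    for f in term:
        value *= lev[f]
    return value


n = len(data)
terms = list(factors) + [p + q for p, q in combinations(factors, 2)]

# Orthogonal design: coefficient = sum(x * y) / n for every term.
coef = {"Constant": sum(y for _, y in data) / n}
for term in terms:
    coef[term] = sum(term_value(term, levels(std)) * y for std, y in data) / n

# Error sum of squares and standard error of the fitted model.
sse = 0.0
for std, y in data:
    lev = levels(std)
    fit = coef["Constant"] + sum(coef[t] * term_value(t, lev) for t in terms)
    sse += (y - fit) ** 2
dfe = n - (len(terms) + 1)
s = (sse / dfe) ** 0.5

for name, b in coef.items():
    print(f"{name:8s} {b:9.3f}")
print("dfe =", dfe, " s =", round(s, 4), " SE coef =", round(s / n ** 0.5, 3))
```

The printed coefficients, standard error, and error degrees of freedom can be compared line by line with the Session window output in Figure 9.23. This shortcut only works when the correlation matrix is diagonal; with missing or extra runs a full least-squares fit is required.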
Correlations: A, B, C, D, E, AB, AC, AD, AE, BC, BD, BE, CD, CE, DE, AA, BB, CC, DD, EE, Block.1

A B C D E AB AC AD AE BC BD BE CD CE DE AA BB CC DD EE
B 0
C 0 0
D 0 0 0
E 0 0 0 0
AB 0 0 0 0 0
AC 0 0 0 0 0 0
AD 0 0 0 0 0 0 0
AE 0 0 0 0 0 0 0 0
BC 0 0 0 0 0 0 0 0 0
BD 0 0 0 0 0 0 0 0 0 0
BE 0 0 0 0 0 0 0 0 0 0 0
CD 0 0 0 0 0 0 0 0 0 0 0 0
CE 0 0 0 0 0 0 0 0 0 0 0 0 0
DE 0 0 0 0 0 0 0 0 0 0 0 0 0 0
AA * * * * * * * * * * * * * * *
BB * * * * * * * * * * * * * * * *
CC * * * * * * * * * * * * * * * * *
DD * * * * * * * * * * * * * * * * * *
EE * * * * * * * * * * * * * * * * * * *
Block.1 * * * * * * * * * * * * * * * * * * * *

Regression Analysis: Y versus A, B, ...


* AA is (essentially) constant
* AA has been removed from the equation.
* BB is (essentially) constant
* BB has been removed from the equation.
* CC is (essentially) constant
* CC has been removed from the equation.
* DD is (essentially) constant
* DD has been removed from the equation.
* EE is (essentially) constant
* EE has been removed from the equation.

The regression equation is


Y = 144 - 4.28 A + 82.1 B - 29.9 C + 1.47 D - 27.2 E + 2.78 AB + 14.5 AC
- 0.09 AD - 1.03 AE + 32.7 BC + 1.41 BD + 0.34 BE + 4.28 CD - 15.8 CE + 5.47 DE

Predictor Coef SE Coef T P
Constant 144.281 3.396 42.49 0.000
A -4.281 3.396 -1.26 0.225
B 82.094 3.396 24.18 0.000
C -29.906 3.396 -8.81 0.000
D 1.469 3.396 0.43 0.671
E -27.219 3.396 -8.02 0.000
AB 2.781 3.396 0.82 0.425
AC 14.531 3.396 4.28 0.001
AD -0.094 3.396 -0.03 0.978
AE -1.031 3.396 -0.30 0.765
BC 32.656 3.396 9.62 0.000
BD 1.406 3.396 0.41 0.684
BE 0.344 3.396 0.10 0.921
CD 4.281 3.396 1.26 0.225
CE -15.781 3.396 -4.65 0.000
DE 5.469 3.396 1.61 0.127

S = 19.2094 R-Sq = 98.2% R-Sq(adj) = 96.5%

Analysis of Variance

Source DF SS MS F P
Regression 15 319388 21293 57.70 0.000
Residual Error 16 5904 369
Total 31 325292

Source DF Seq SS
A 1 587
B 1 215660
C 1 28620
D 1 69
E 1 23708
AB 1 248
AC 1 6757
AD 1 0
AE 1 34
BC 1 34126
BD 1 63
BE 1 4
CD 1 587
CE 1 7970
DE 1 957

Figure 9.23 Output from mlr5.mac macro for a $2^5$ experiment.


[Figure: residuals diagnostic plots. Panels: Histogram of the Residuals; Normal Probability Plot of the Residuals; Residuals versus the Order of the Data; Residuals versus the Fitted Values; Residuals versus A; Residuals versus B; Residuals versus C; Residuals versus D; Residuals versus E.]

Figure 9.24 Residuals diagnostic plots from a $2^5$ experiment.

[Figure: Normal Probability Plot of Deleted Studentized Residuals (DSR). Horizontal axis: DSR. Vertical axis: Normal Score (z). Reference lines at –3.803, 0, and 3.803 indicate the Bonferroni-corrected +/– t(0.025/n, dfe) critical values. Data labels indicate the run number.]

Figure 9.25 Normal probability plot of deleted Studentized residuals.

[Figure: Normal Probability Plot of Regression Coefficient t Values. Horizontal axis: Regression Coefficient t Value. Vertical axis: Normal Score (z). Points are labeled by model term. Reference lines at –2.12, 0, and 2.12 indicate the +/– t(0.025, dfe) critical values for regression coefficients.]

Figure 9.26 Normal probability plot of regression coefficient t values.

Regression Analysis: Y versus A, B, C, E, AC, BC, CE

The regression equation is


Y = 144 - 4.28 A + 82.1 B - 29.9 C - 27.2 E + 14.5 AC + 32.7 BC - 15.8 CE

Predictor Coef SE Coef T P


Constant 144.281 3.200 45.08 0.000
A -4.281 3.200 -1.34 0.194
B 82.094 3.200 25.65 0.000
C -29.906 3.200 -9.35 0.000
E -27.219 3.200 -8.51 0.000
AC 14.531 3.200 4.54 0.000
BC 32.656 3.200 10.20 0.000
CE -15.781 3.200 -4.93 0.000

S = 18.1033 R-Sq = 97.6% R-Sq(adj) = 96.9%

Analysis of Variance

Source DF SS MS F P
Regression 7 317427 45347 138.37 0.000
Residual Error 24 7866 328
Total 31 325292

Source DF Seq SS
A 1 587
B 1 215660
C 1 28620
E 1 23708
AC 1 6757
BC 1 34126
CE 1 7970

Unusual Observations

Obs A Y Fit SE Fit Residual St Resid


8 1.00 232.00 193.38 9.05 38.63 2.46R
9 -1.00 221.00 253.88 9.05 -32.88 -2.10R
21 -1.00 1.00 32.37 9.05 -31.37 -2.00R

R denotes an observation with a large standardized residual.

Figure 9.27 Refined model for the $2^5$ experiment.

Analysis with MINITAB’s DOE Tools (Stat> DOE> Factorial)


You can use MINITAB’s Stat> DOE> Factorial> Create Factorial Design menu to create
a $2^k$ factorial design that meets your requirements or, if you’ve created your own matrix
of runs for a $2^k$ experiment, you can use Stat> DOE> Factorial> Define Factorial
Design to specify the design to MINITAB. After you’ve created the design and entered
the response column but before you consider any quantitative analysis of the data, it’s
helpful to present the data graphically to get a preview of what variables and model
terms might be most important. Use the Stat> DOE> Factorial> Factorial Plots menu
to create main effects and interaction plots. You will have to specify the response to be
analyzed and the terms to consider in the two Setup menus. Main effects plots with
sloped lines indicate possibly significant main effects. Interaction plots that show
diverging line segments indicate possibly significant two-factor interactions.
After the design has been defined and the required responses entered into the work-
sheet, you can analyze the design with Stat> DOE> Factorial> Analyze Factorial Design.
The functions provided within MINITAB should be very familiar to you by now. Select
the experimental response column in the Responses: window and use the Terms window
to select the terms to be included in the model. Use the arrows to move selected terms
back and forth between the Available Terms: and the Selected Terms: windows or indi-
cate the highest-order terms to be included in the model in the Include terms in the
model up through order: window. If the experiment was run in blocks, then check the
Include blocks in the model box to account for possible block-to-block differences.
In the Graphs menu, select the usual residuals diagnostic plots including plots of
the residuals versus all of the independent variables. You should also consider turning
on the normal and/or Pareto effects plots. These plots are very useful for distinguishing
between significant and insignificant model terms when you need to refine the model.
Significant terms will be outliers on the normal plot of effects and will have long bars
on the Pareto chart. Insignificant terms will fall on an approximately straight line near
zero on the normal plot and have short bars on the Pareto chart. After you’ve determined
which model terms can safely be omitted from the model, return to the Terms window
to remove them and rerun the analysis. MINITAB will issue a warning if you attempt to
analyze a nonhierarchical model.

9.8 EXTRA AND MISSING VALUES


By design, the $2^k$ experiments are balanced; they have the same number of observations
at the +1 and –1 levels of each design variable. This characteristic gives these designs
some very desirable behavior; most importantly, it makes the design variables
completely independent of each other. When there are extra or missing observations in a $2^k$
design, however, the experiment becomes unbalanced, causing many of the model terms
to become correlated with each other. This effect can be observed by comparing the
correlation matrices of an intact $2^k$ design with one that has a few missing or extra
observations. The primary undesirable consequence of an unbalanced design is that the
regression coefficients become biased. This problem is serious enough that some simple
strategies have been developed for managing extra or missing observations in $2^k$
designs. A few practical strategies will be presented here, but more rigorous and even
exact methods are available. See Montgomery (1991) or Hoaglin et al. (1991) for details
on these methods.
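
The loss of independence caused by a missing observation is easy to demonstrate. The sketch below is a small Python illustration using a hypothetical case, not data from the text: it compares the correlation between the A and B columns of an intact single-replicate $2^3$ design with the same correlation after the run at (+1, +1, +1) is lost.

```python
from itertools import product


def pearson(u, v):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    s_uv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    s_uu = sum((a - mean_u) ** 2 for a in u)
    s_vv = sum((b - mean_v) ** 2 for b in v)
    return s_uv / (s_uu * s_vv) ** 0.5


def corr_ab(runs):
    """Correlation between the first two design columns (A and B)."""
    return pearson([r[0] for r in runs], [r[1] for r in runs])


intact = list(product([-1, 1], repeat=3))
# Hypothetical missing observation: the run at A = B = C = +1.
unbalanced = [run for run in intact if run != (1, 1, 1)]

r_intact = corr_ab(intact)
r_unbalanced = corr_ab(unbalanced)
print("intact design:     r(A, B) =", r_intact)
print("one run missing:   r(A, B) =", round(r_unbalanced, 4))
```

With only eight runs a single missing value produces a correlation of –1/6 between A and B. In a design with several replicates the same loss would produce a much smaller correlation, which is why the correlation matrix is a useful guide to whether the imbalance can be ignored.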
When an experiment has many replicates and there are a few extra observations,
the extra values can often be left intact or omitted from the data set at random without
substantially affecting the results of the analysis. The latter approach wastes some
information that might otherwise improve the error estimate but it does recover the bal-
ance of the experiment. If the experiment only has a single replicate plus a few extra
observations, the usual approach is to average the duplicated observations, but it’s
important to recognize that this approach tends to underestimate the size of the standard
error because averages tend to behave better than individuals. When there are wildly
different numbers of replicates in the various unique cells of the experiment design, the
usual approach is to calculate and then fit the cell means. Then the standard error of
the cell mean model must be combined with the within-cell variation to obtain a more
realistic measure of the inherent noise in the system. The disadvantage to this approach
is that it ignores differences in the relative weights that should be applied to the various
cells based on the number of observations that they contain. There are rigorous methods
for weighting the cell means but they are beyond the scope of this book.
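
As a rough illustration of the cell-means approach, the Python sketch below reduces an unbalanced data set to one mean per cell and computes the pooled within-cell standard deviation. The observations are hypothetical, and the sketch deliberately stops short of combining the within-cell variation with the standard error of the cell-means model, since the text does not give a formula for that step.

```python
from statistics import mean, variance

# Hypothetical unbalanced 2^2 data: coded (A, B) cell -> observed responses.
observations = {
    (-1, -1): [12.1, 11.7],
    (1, -1): [15.2],
    (-1, 1): [9.8, 10.4, 10.1],
    (1, 1): [14.0, 13.6],
}

# One value per cell: these means are what would be fitted by the model.
cell_means = {cell: mean(values) for cell, values in observations.items()}

# Pooled within-cell variance; cells with one observation contribute no
# degrees of freedom.
df_within = sum(len(values) - 1 for values in observations.values())
ss_within = sum(
    (len(values) - 1) * variance(values)
    for values in observations.values()
    if len(values) > 1
)
s_within = (ss_within / df_within) ** 0.5

for cell, m in cell_means.items():
    print(cell, round(m, 3), "n =", len(observations[cell]))
print("within-cell df =", df_within, " pooled s =", round(s_within, 3))
```

Note that every cell mean carries the same weight in the subsequent fit even though the cells contain one, two, or three observations, which is the disadvantage described above.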
The strategies for managing a few missing observations are a bit different than those
for extra observations. The first condition that should be checked is to determine if the
missing observations have a common cause or if they are missing at random (MAR). If
there is a common cause for the missing observations then that cause must be investi-
gated. If the missing observations occur at random then there are some relatively simple
remedial actions that can be taken. If the experiment contains several replicates, the dif-
ficulties caused by the unbalanced design are often minor and can be ignored. The cor-
relation matrix is helpful in deciding if this might be the case. If that approach is not
appropriate, the missing values can be replaced by estimated values. These estimates
could be determined from the cell means of the surviving observations or by iteratively
replacing the missing observations with their values as predicted by the model until they
converge. The latter approach effectively minimizes $SS_\epsilon$ with respect to the missing
observations as if they were regression coefficients. These replacement solutions recover
the balance of the experiment design but the error degrees of freedom should be reduced
by the number of missing observations, and all of the statistics that depend on the error
degrees of freedom should be recalculated, including the regression coefficient t and p
values. These calculations are relatively easy to perform by copying the sums of squares
to a spreadsheet, correcting the degrees of freedom column, then recalculating the mean
squares, F statistics, and their p values. The regression coefficient t values can be deter-
mined from the square root of the corresponding F statistics because of the identity:

$$t_{p,\,df_\epsilon} = \sqrt{F_{p,\,1,\,df_\epsilon}} \qquad (9.20)$$
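
The recalculation described above can be sketched in a few lines of Python instead of a spreadsheet. This is an illustration only: the sums of squares for B and for error are taken from the refined model in Figure 9.27, while the reduction of the error degrees of freedom by two is a hypothetical case in which two missing observations had been replaced by estimates (in practice the sums of squares would come from the analysis of the completed data set). Each model term has one degree of freedom, so the magnitude of t is the square root of F; its sign is that of the regression coefficient.

```python
import math


def t_from_sums_of_squares(ss_term, ss_error, df_error):
    """Return (MS error, F, |t|) for a one-degree-of-freedom model term."""
    ms_error = ss_error / df_error
    f_stat = ss_term / ms_error
    return ms_error, f_stat, math.sqrt(f_stat)


ss_b, ss_error = 215660, 7866  # from Figure 9.27

# As reported: 24 error degrees of freedom.
ms_24, f_24, t_24 = t_from_sums_of_squares(ss_b, ss_error, 24)

# Hypothetical correction: two replaced observations, so dfe = 24 - 2.
ms_22, f_22, t_22 = t_from_sums_of_squares(ss_b, ss_error, 22)

print("dfe = 24: MSe =", round(ms_24, 2), " F =", round(f_24, 1), " |t| =", round(t_24, 2))
print("dfe = 22: MSe =", round(ms_22, 2), " F =", round(f_22, 1), " |t| =", round(t_22, 2))
```

The first line reproduces the t value for B printed in Figure 9.27, which confirms the identity in Equation 9.20. Reducing the error degrees of freedom inflates the error mean square and shrinks every F and t statistic, and the p values must then be looked up again with the corrected degrees of freedom.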

9.9 PROPAGATION OF ERROR


An important application of the model derived from a $2^k$ experiment is to predict the
expected variation in the response y due to variation in the independent variables $x_i$.
Obviously, for a given set of $x_i$ values, the corresponding point estimate of y can be
determined from the model; however, random noise in the $x_i$ about their nominal values
will tend to cause variation in y that can also be predicted from the model. This
technique, called propagation of error, is very important in manufacturing applications
where it is important to understand how tolerances on process input variables cause
undesirable variation in the process output variables. Propagation of error calculations
require some calculus operations; however, the calculus required is very simple and you
can probably find someone to help if you don’t understand what’s going on. The
propagation of error method also has important applications with more complex models,
such as those we will see in later chapters.
Theorem 9.1 (Propagation of Error) If a process is configured to some nominal
condition indicated by $x'_i$ but each of the $x_i$ suffers from some random variation about
$x'_i$ characterized by $\sigma_{x_i}$, then the induced variation in $y(x_1, x_2, \ldots, x_k)$ at $(x'_1, x'_2, \ldots, x'_k)$
is given by:

$$\sigma_y^2 = \sigma_\epsilon^2 + \sum_{i=1}^{k} \left( \left.\frac{\partial y}{\partial x_i}\right|_{x'_i} \times \sigma_{x_i} \right)^2 \qquad (9.21)$$

where $\sigma_\epsilon$ is the standard error of the model for y.

Example 9.11
A manufacturer of valves used in marine hydraulic steering controls built and ana-
lyzed an experiment to study hydraulic pressure leak-back rate as a function of critical
valve characteristics. The model that they obtained, after appropriate refinements, was:

$$y = 13 - 0.8x_1 + 2.0x_2 + 1.2x_{12}$$

and had standard error $\sigma_\epsilon = 0.4$. Control charts of $x_1$ and $x_2$ indicated that the normal
manufacturing variation in those variables was $\hat\sigma_{x_1} = 0.2$ and $\hat\sigma_{x_2} = 0.3$. (All quantities
are given in standardized units.) Determine the predicted response and the variation
induced in y by manufacturing variation in $x_1$ and $x_2$ when $(x'_1, x'_2) = (1, -0.5)$.
Solution: The predicted value of the leak-back rate, with the interaction term
$x_{12} = x_1 x_2$, is given by:

$$\hat{y}(1, -0.5) = 13 - 0.8(1) + 2.0(-0.5) + 1.2(1)(-0.5) = 10.6$$

The values of the required partial derivatives at the specified point are:

$$\left.\frac{\partial y}{\partial x_1}\right|_{(1,-0.5)} = \left.(-0.8 + 1.2x_2)\right|_{(1,-0.5)} = -1.4$$

$$\left.\frac{\partial y}{\partial x_2}\right|_{(1,-0.5)} = \left.(2.0 + 1.2x_1)\right|_{(1,-0.5)} = 3.2$$

The expected standard deviation of the leak-back rate due to the propagation of errors
in $x_1$ and $x_2$ is given by Equation 9.21:

$$\begin{aligned}
\hat\sigma_y &= \sqrt{\hat\sigma_\epsilon^2 + \left(\left.\frac{\partial y}{\partial x_1}\right|_{(1,-0.5)} \times \hat\sigma_{x_1}\right)^2 + \left(\left.\frac{\partial y}{\partial x_2}\right|_{(1,-0.5)} \times \hat\sigma_{x_2}\right)^2} \\
&= \sqrt{(0.4)^2 + (-1.4 \times 0.2)^2 + (3.2 \times 0.3)^2} \\
&= \sqrt{0.160 + 0.078 + 0.922} \\
&= \sqrt{1.160} \\
&= 1.08
\end{aligned}$$

These calculations indicate that the predicted value of the leak-back rate is
$\hat{y}(1, -0.5) = 10.6$ and the corresponding standard deviation in the leak-back rate due
to manufacturing variation in $x_1$ and $x_2$ is expected to be $\hat\sigma_y = 1.08$. The variance
magnitudes indicate that variation in $x_2$ is the largest source of induced variation in
the leak-back rate, so any effort to reduce variation should be focused on improved
control of $x_2$.
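
The calculation in Example 9.11 can be checked numerically. The following Python sketch is an illustration of Theorem 9.1, not a general-purpose tool: the function name and step size are choices made for this example, and the partial derivatives are approximated by central differences rather than worked out by calculus.

```python
import math


def model(x):
    """Leak-back rate model from Example 9.11 (standardized units)."""
    x1, x2 = x
    return 13 - 0.8 * x1 + 2.0 * x2 + 1.2 * x1 * x2


def propagate_error(f, x_nominal, sigma_x, sigma_eps, h=1e-4):
    """Induced standard deviation of f at x_nominal and the variance
    contribution of each input, following Theorem 9.1."""
    contributions = []
    for i, sigma_i in enumerate(sigma_x):
        up, down = list(x_nominal), list(x_nominal)
        up[i] += h
        down[i] -= h
        slope = (f(up) - f(down)) / (2 * h)  # central-difference partial derivative
        contributions.append((slope * sigma_i) ** 2)
    sigma_y = math.sqrt(sigma_eps ** 2 + sum(contributions))
    return sigma_y, contributions


x_nominal = (1.0, -0.5)
y_hat = model(x_nominal)
sigma_y, contributions = propagate_error(model, x_nominal, (0.2, 0.3), 0.4)

print("predicted response:", round(y_hat, 2))
print("variance contributions (x1, x2):", [round(c, 3) for c in contributions])
print("induced standard deviation:", round(sigma_y, 2))
```

The variance contributions make it easy to see which input dominates: here the $x_2$ term is more than ten times the $x_1$ term, in agreement with the conclusion of the example.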

9.10 SAMPLE SIZE AND POWER


There are two approaches that can be used to determine the sample size (that is, the
number of replicates) for two-level factorial designs:
• The sample size can be determined so as to deliver a specified power for
the ANOVA F test to detect a specified difference between the ±1 levels of
a variable.
• The sample size can be determined so as to quantify the regression
coefficient associated with a variable to within some specified range with
specified confidence.
When the focus of an experiment is on the identification of significant variables,
such as in a screening experiment, the sample size and power analysis for the ANOVA
F test are appropriate. When the focus of the experiment is on quantifying the regres-
sion coefficient associated with a term that is already known or suspected to be signif-
icant, then the sample size analysis for the confidence interval is appropriate. These
methods have already been presented in Chapters 7 and 8, respectively, but their spe-
cific applications to two-level factorial designs will be reviewed here.

9.10.1 Sample Size and Power to Detect Significant Effects


The sample-size and power calculations for an F test to detect a significant difference
between the ±1 levels of a variable in a $2^k$ factorial experiment were presented in Section
7.5.3. Since all k of the design variables are coded to ±1 levels, the $2^k$ factorial
design offers the same power to detect a difference $\delta$ between the ±1 levels of each
variable. In practice, the choices for the real physical settings of the variables corresponding
to the coded ±1 levels determine the true power for each variable. This means that the
power to detect a difference between the ±1 levels of a variable with two widely separated
levels is relatively high compared to the power obtained when the levels are set
closer together.

Example 9.12
Calculate the power to detect a difference $\delta = 400$ between the ±1 levels of the
variables in a $2^4$ design with six replicates if $\sigma_\varepsilon = 800$. Include main effects and
two-factor interactions in the model and use the method of Section 7.5.3 to find the power.
Solution: The experiment requires a total of $N = 6 \times 2^4 = 96$ runs, so there are
dftotal = 95 total degrees of freedom. The model will contain dfmodel = $\binom{4}{1} + \binom{4}{2} = 4 + 6 = 10$
degrees of freedom so there will be dfe = 95 – 10 = 85 error degrees of freedom. The F
statistic for the effect of any of the four design variables will have one numerator and
85 denominator degrees of freedom so, with $\alpha = 0.05$, the critical value of the accept/reject
bound for F will be $F_{0.05,1,85} = 3.953$. The power is given by the condition $F_\alpha = F_{P,\lambda}$
where the noncentrality parameter $\lambda$ is:

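$$\begin{aligned}
\lambda &= \frac{N}{2a}\left(\frac{\delta}{\sigma_\varepsilon}\right)^2 \\
&= \frac{96}{2(2)}\left(\frac{400}{800}\right)^2 \\
&= 6.0
\end{aligned}$$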

and a = 2 is the number of levels of each variable. From $F_{P,6.0} = 3.953$ we find that the
power is P = 0.678. That is, this experiment will deliver a 67.8 percent chance of detecting
the presence of a 400-unit difference between the –1 and +1 levels of a design variable
if such a difference is present. This power is relatively low and more replicates
should be considered.
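
The bookkeeping and the power in this example can be reproduced without statistical tables. The sketch below is not the method of Section 7.5.3. It assumes that an F statistic with one numerator degree of freedom is the square of a noncentral t statistic, and it integrates numerically over the chi-square distribution of the error variance. The critical value 3.953 is taken from the text, and all function names are hypothetical.

```python
import math
from statistics import NormalDist


def noncentrality(n_runs, levels, delta, sigma):
    # lambda = N / (2a) * (delta / sigma)^2, as in the example
    return n_runs / (2 * levels) * (delta / sigma) ** 2


def chi2_pdf(v, nu):
    # Chi-square density with nu degrees of freedom, evaluated in log space
    log_pdf = ((nu / 2 - 1) * math.log(v) - v / 2
               - (nu / 2) * math.log(2) - math.lgamma(nu / 2))
    return math.exp(log_pdf)


def power_one_df_f_test(f_crit, df_error, lam, steps=20000):
    # With one numerator df, F = T^2 where T = (Z + sqrt(lam)) / sqrt(V / df_error),
    # Z standard normal and V chi-square. Condition on V and integrate (midpoint rule).
    phi = NormalDist().cdf
    t_crit, shift = math.sqrt(f_crit), math.sqrt(lam)
    upper = df_error + 12 * math.sqrt(2 * df_error)  # covers essentially all of V
    h = upper / steps
    total = 0.0
    for i in range(steps):
        v = (i + 0.5) * h
        s = math.sqrt(v / df_error)
        reject = (1 - phi(t_crit * s - shift)) + phi(-t_crit * s - shift)
        total += reject * chi2_pdf(v, df_error)
    return total * h


k, replicates = 4, 6
n_runs = replicates * 2 ** k
df_model = math.comb(k, 1) + math.comb(k, 2)
df_error = (n_runs - 1) - df_model
lam = noncentrality(n_runs, levels=2, delta=400, sigma=800)
power = power_one_df_f_test(f_crit=3.953, df_error=df_error, lam=lam)
print(n_runs, df_model, df_error, lam, round(power, 3))
```

If a statistics library with a noncentral F distribution is available, its survival function evaluated at the critical value should give the same power more directly.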

MINITAB V14 contains a simple power and sample-size calculator for the $2^k$
designs that can be found in its Stat> Power and Sample Size> 2-Level Factorial
Design menu. There are several different types of power and sample-size calculations
that MINITAB can make. All of these problems require you to specify:
• The number of variables k.
• The number of corner points in the design given by $2^k$. (This specification may
seem unnecessary right now but the reason will become clear in Chapter 10.)
• The expected value of the model standard error $\sigma_\varepsilon$.
Then, given any three of the following quantities, MINITAB calculates the fourth
quantity:
• The number of replicates n.
• The size of the smallest effect $\delta$.
• The power P.
• The number of center points.
The addition of center points to the $2^k$ designs will be discussed in detail in Chapter 11.
For now, leave the number of center points set to zero.
There is an important option in the Stat> Power and Sample Size> 2-Level
Factorial Design menu that you will probably have to exercise to get the correct sam-
ple-size answers. MINITAB assumes that the model you intend to build will include all
possible terms: main effects, two-factor interactions, three-factor interactions, and so
on. If you don’t want to include some of these terms in the model, such as three-factor
and higher-order interactions, then enter the Design menu and specify the number of
model terms that you want to omit from the model. You will have to calculate this
number yourself.

Example 9.13
Use MINITAB’s power calculation capability to confirm the answer to Example 9.12.
Solution: In MINITAB’s Stat> Power and Sample Size> 2-Level Factorial Design
menu the number of variables is k = 4, the number of corner points is $2^4 = 16$, the standard
deviation is $\sigma_\varepsilon = 800$, the number of replicates is n = 6, the effect size is $\delta = 400$,
and the number of center points is zero. In the Design menu it is necessary to indicate
that the three- and four-factor interactions are to be omitted from the model. The number
of terms to be omitted is $\binom{4}{3} + \binom{4}{4} = 5$. The MINITAB menus and corresponding output
are shown in Figure 9.28. MINITAB confirms that the power is P = 0.678 within
round-off error.

Example 9.14
Use MINITAB’s power calculation capability to determine the number of replicates
of the $2^4$ design from Example 9.12 necessary to deliver a power of 90 percent or greater
to detect a difference of $\delta = 400$ between the ±1 levels of a design variable.
Solution: The sample-size calculation (not shown) was performed by changing the
Stat> Power and Sample Size> 2-Level Factorial Design menu so that the Power
values: field was 0.90 and the Replicates: field was empty. MINITAB indicates that
n = 11 replicates of the $2^4$ design are required and that the exact power of the experiment
to detect an effect $\delta = 400$ will be P = 0.9094.

Figure 9.28 Power calculation for $2^4$ design.

Whether you have MINITAB or not, there is a simple approximation to the exact
method for determining the number of replicates of a $2^k$ experiment necessary to achieve
a specified power. The method is analogous to the relationship between the sample-size
calculations for the confidence interval and the hypothesis test for the two-sample t test
introduced in Chapter 3. It can be shown that the number of replicates determined in the
exact algorithm is approximately:

$$r \ge \frac{1}{2^{k-2}}\left(\left(t_{\alpha/2} + t_\beta\right)\frac{\sigma_\varepsilon}{\delta}\right)^2 \tag{9.22}$$

where r is the number of replicates, the power is $P = 1 - \beta$, and the t distribution has
degrees of freedom equal to the error degrees of freedom of the regression. Although
this expression is transcendental, if there are plenty of error degrees of freedom in the
model then the t distribution is approximately z. This provides a convenient starting
point for iterations and the correct answer is often obtained on the first iteration.

Example 9.15
Use the alternative method to determine a general formula for the number of replicates
required for a $2^k$ experiment that will deliver 90 percent power to detect a difference
$\delta$ between the ±1 levels of a variable. Use $\alpha = 0.05$ and assume that there will be enough
error degrees of freedom in the model that the t distribution is approximately normal.
Solution: With $t_{0.025} \approx z_{0.025} = 1.96$ and $t_{0.10} \approx z_{0.10} = 1.28$, Equation 9.22 gives:

$$\begin{aligned}
r &\ge \frac{1}{2^{k-2}}\left(1.96 + 1.28\right)^2\left(\frac{\sigma_\varepsilon}{\delta}\right)^2 \\
&\ge \frac{42}{2^k}\left(\frac{\sigma_\varepsilon}{\delta}\right)^2
\end{aligned} \tag{9.23}$$

Example 9.16
Use the alternative method from Example 9.15 to determine the number of replicates
required for the situation described in Example 9.14.
Solution: With k = 4, $\sigma_\varepsilon = 800$, and $\delta = 400$, Equation 9.23 gives:

$$r \ge \frac{42}{2^4}\left(\frac{800}{400}\right)^2 = 10.5$$

so the experiment requires r = 11 replicates. This confirms MINITAB’s exact solution.
There will be plenty of error degrees of freedom to justify the use of the normal distribution
to approximate the t distribution, so no further iterations are required.
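
Equation 9.22 with the normal approximation is easy to script. The following is a minimal sketch that assumes z quantiles stand in for the t quantiles, as in Examples 9.15 and 9.16; the function name and its defaults are illustrative only.

```python
import math
from statistics import NormalDist


def replicates_for_power(k, sigma, delta, alpha=0.05, power=0.90):
    # Normal approximation to Equation 9.22: z quantiles replace t quantiles
    z = NormalDist().inv_cdf
    z_alpha = z(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    z_beta = z(power)           # about 1.28 for 90 percent power
    r_min = ((z_alpha + z_beta) * sigma / delta) ** 2 / 2 ** (k - 2)
    return r_min, math.ceil(r_min)


r_min, r = replicates_for_power(k=4, sigma=800, delta=400)
print(round(r_min, 1), r)
```

The function returns both the unrounded bound and the next integer up, since the number of replicates must be a whole number.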

9.10.2 Sample Size to Quantify Effects


In Section 8.18 an algorithm was presented to find the sample size required to determine
the regression slope parameter to within a specified range of values with specified
confidence. This algorithm, with observations taken at the coded ±1 levels and therefore
separated by Δx = 2, is directly applicable to the sample-size problem for two-level
factorial experiments. This sample-size calculation applies to all of the variables in the
experiment because they all use the same coded ±1 levels.
Under these conditions, to determine the slope parameter $\beta_i$ for the ith of k variables
with confidence $1 - \alpha$ within an interval given by:

$$P\left(b_i - \delta < \beta_i < b_i + \delta\right) = 1 - \alpha \tag{9.24}$$

where $b_i$ is the estimated value of $\beta_i$ determined from the regression analysis, the minimum
required sample size must meet the condition:

$$n \ge \frac{1}{2}\left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta}\right)^2 \tag{9.25}$$

where $t_{\alpha/2}$ has degrees of freedom equal to the error degrees of freedom of the model.
The sample-size problem is not solved yet. Here n is the number of observations that
must be taken at the –1 and +1 levels, so the total number of observations in the experiment
must meet the condition:

$$r\,2^k \ge 2n \tag{9.26}$$

where r is the number of replicates of the $2^k$ design. When these last two conditions are
combined, the number of replicates is given by:

$$r \ge \frac{1}{2^k}\left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta}\right)^2 \tag{9.27}$$

The smallest value of r that meets this condition is the minimum number of replicates
of the experiment required to determine $\beta_i$ with the specified precision. As before, this
condition is transcendental because the degrees of freedom for $t_{\alpha/2}$ depend on the value
of r.

Example 9.17
A $2^3$ experiment is to be performed to quantify the regression slopes associated with
the main effects to within $\delta = 20$ with 95 percent confidence. The standard error of the
model is expected to be $\sigma_\varepsilon = 80$. How many replicates of the $2^3$ design are required?
Solution: The number of replicates must meet the condition given by Equation 9.27.
If the number of replicates is sufficiently large that $t_{0.025} \approx z_{0.025} = 1.96$ then:

$$\begin{aligned}
r &\ge \frac{1}{2^k}\left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta}\right)^2 \\
&\ge \frac{1}{2^3}\left(\frac{1.96 \times 80}{20}\right)^2 \\
&\ge 8
\end{aligned} \tag{9.28}$$

The number of error degrees of freedom for the model with main effects and two-factor
interactions is dfe = dftotal – dfmodel = $(8(2^3) - 1) - 6 = 57$, so the approximation of $t_{0.025}$
with $z_{0.025}$ is justified and the solution r = 8 is valid.
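
The same calculation can be sketched in code. This assumes the normal approximation to $t_{\alpha/2}$ used in the example and a model with main effects and two-factor interactions; the names are hypothetical.

```python
import math
from statistics import NormalDist


def replicates_to_quantify(k, sigma, delta, confidence=0.95):
    # Equation 9.27 with z in place of t (assumes plenty of error degrees of freedom)
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    r_min = (z * sigma / delta) ** 2 / 2 ** k
    return r_min, math.ceil(r_min)


k = 3
r_min, r = replicates_to_quantify(k, sigma=80, delta=20)

# Check that the error degrees of freedom justify using z instead of t
df_total = r * 2 ** k - 1
df_model = math.comb(k, 1) + math.comb(k, 2)
df_error = df_total - df_model
print(round(r_min, 2), r, df_error)
```

The unrounded bound is about 7.7, which rounds up to the eight replicates quoted in the solution.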

9.11 DESIGN CONSIDERATIONS FOR $2^k$ EXPERIMENTS


• Pick the ±1 levels for each variable to be as far apart as is safely and practically
possible.
• Block the experiment by replicates and include blocks in the model to control
for differences between them. (This requires analysis by general linear model,
available either from Stat> DOE> Factorial> Analyze Factorial Design or
Stat> ANOVA> General Linear Model.)
• If all of the design variables in a $2^k$ experiment are quantitative, consider adding
center cells to the design to increase the error degrees of freedom and permit a
linear lack of fit test. See Chapters 8 and 11 for details on how to perform the
lack of fit test.
• If: 1) all design variables in a $2^k$ experiment are quantitative, 2) there are very
few (≤ 10) error degrees of freedom, and 3) it’s too expensive to replicate the
entire experiment, then add center cells to increase the error degrees of freedom
and the power of the experiment. Use the information from the center cells to
do a linear lack of fit test. (See Chapter 11.)
• For $2^k$ designs with five or more variables consider using the fractional factorial
designs of Chapter 10 to reduce the number of runs required by the experiment.
• Use the methods of Chapter 10 to block large $2^k$ experiments.
• Do a sample-size calculation to determine the number of replicates required
for your experiment. Use the appropriate calculation—either one to detect
significant effects or one to quantify the regression coefficients using a
confidence interval.
10 Fractional Factorial Experiments

10.1 INTRODUCTION
One of the advantages of the factorial designs is their ability to use relatively few experimental
runs to estimate many effects. For example, in the $2^5$ design with one replicate,
the 32 runs are used to determine five main effects, 10 two-factor interactions, and
higher-order interactions if necessary. The analysis is done by grouping the 32 runs into
two sets of 16 runs each according to the –1 and +1 levels of each factor of interest. The
regression coefficient for each factor is then calculated from:

$$b_i = \frac{\Delta y}{\Delta x_i} = \frac{\bar{y}_+ - \bar{y}_-}{2} \tag{10.1}$$

where the + and – subscripts indicate the xi factor levels. Different groupings of the
same y values are used to determine each effect.
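
The grouping described by Equation 10.1 can be illustrated in a few lines. This is a sketch only: the response values are generated from a made-up, noise-free model chosen for the illustration, so the recovered coefficients are known in advance.

```python
from itertools import product
from statistics import mean


def coefficient(levels, ys):
    # Equation 10.1: half the difference between the mean responses at +1 and -1
    plus = [y for x, y in zip(levels, ys) if x == +1]
    minus = [y for x, y in zip(levels, ys) if x == -1]
    return (mean(plus) - mean(minus)) / 2


# One replicate of the 2^5 design in standard order (x1 changes slowest)
runs = list(product([-1, 1], repeat=5))

# Hypothetical response: y = 10 + 2*x1 - 3*x2 + 0.5*x1*x3 (illustration only)
ys = [10 + 2 * r[0] - 3 * r[1] + 0.5 * r[0] * r[2] for r in runs]

b1 = coefficient([r[0] for r in runs], ys)
b2 = coefficient([r[1] for r in runs], ys)
b4 = coefficient([r[3] for r in runs], ys)
b13 = coefficient([r[0] * r[2] for r in runs], ys)  # interaction column x13 = x1 * x3
print(b1, b2, b4, b13)
```

Each coefficient uses all 32 responses, regrouped according to a different column of signs.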
Table 10.1 shows how the size of the $2^k$ experiments grows with the number of variables
and how the available information is used. In the table, dfmodel is determined from
the number of main effects plus the number of two-factor interactions, that is:

$$df_{model} = \binom{k}{1} + \binom{k}{2} = \frac{k(k+1)}{2} \tag{10.2}$$

Of course we could consider more complex model terms including three-factor and
higher-order interactions; however, those are rarely significant so it is usually safe to
ignore them. The table shows that for the larger experiments most of the runs are used
to estimate the error. Do we really need so many error degrees of freedom? How many
runs are really required to accurately estimate the bi statistics? If it were possible to cut
back on the size of the experiment, which runs would we cut? These are the questions
to be addressed in this chapter.

Table 10.1 Number of runs in factorial experiments.

 k    $2^k$   dftotal   dfmodel   dferror
 2        4         3         3         0
 3        8         7         6         1
 4       16        15        10         5
 5       32        31        15        16
 6       64        63        21        42
 7      128       127        28        99
 8      256       255        36       219
 9      512       511        45       466
10     1024      1023        55       968
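
The entries of Table 10.1 follow directly from Equation 10.2 and can be regenerated with a short sketch. It assumes a single replicate and a model containing main effects and two-factor interactions only.

```python
from math import comb


def run_budget(k):
    # One replicate of a 2^k design, modeling main effects and two-factor interactions
    runs = 2 ** k
    df_total = runs - 1
    df_model = comb(k, 1) + comb(k, 2)
    return k, runs, df_total, df_model, df_total - df_model


rows = [run_budget(k) for k in range(2, 11)]
for row in rows:
    print("{:>2} {:>5} {:>5} {:>4} {:>4}".format(*row))
```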

10.2 THE $2^{5-1}$ HALF-FRACTIONAL FACTORIAL DESIGN


Table 10.1 shows that the $2^k$ factorial designs get very large as k gets large but the number
of terms modeled does not grow as quickly. In general, the number of runs in a single
replicate will be $2^k$ and the number of terms in the model (main effects plus two-factor
interactions) will only be dfmodel = k(k + 1)/2. For example, the model for a $2^5$ experiment
requiring 32 runs potentially includes $\binom{5}{0} = 1$ constant, $\binom{5}{1} = 5$ main effects, $\binom{5}{2} = 10$
two-factor interactions, $\binom{5}{3} = 10$ three-factor interactions, $\binom{5}{4} = 5$ four-factor interactions,
and $\binom{5}{5} = 1$ five-factor interaction. If we agree that three-, four-, and five-factor interactions
are unlikely then all of these terms can be removed from the model and pooled
with the error estimate. The model including just the constant, main effects, and two-factor
interactions requires only 5 + 10 = 15 degrees of freedom and there are 32 – 1 =
31 total degrees of freedom available. This leaves 31 – 15 = 16 degrees of freedom for
the error estimate, which could be considered excessive! This is before any replication
is considered and Occam is likely to drop some terms from the model and dfe will get
even larger! What a waste! Clearly, we need a scheme to selectively reduce the number
of runs in a large $2^k$ experiment to get the information that we need with a reasonable
amount of work.
If there is indeed an excessive number of error degrees of freedom built into the $2^5$
experiment, then let’s consider some possible strategies to eliminate some of the runs. As
an aggressive but perhaps arbitrary goal let’s try to find a way to eliminate one half of
the original 32 runs. (For the moment ignore the fact that a 16-run experiment with a 15-
term model doesn’t leave any degrees of freedom for error. We’ll deal with this problem
later.) Consider the full $2^5$ experiment design in 32 runs shown in Table 10.2. If one half
of the runs are to be eliminated, just how do we select them? We might consider ran-
domly selecting the runs to be eliminated but that method has some substantial risks as

Table 10.2 $2^5$ design with all two-factor interactions.


Run x1 x2 x3 x4 x5 x12 x13 x14 x15 x23 x24 x25 x34 x35 x45
1 – – – – – + + + + + + + + + +
2 – – – – + + + + – + + – + – –
3 – – – + – + + – + + – + – + –
4 – – – + + + + – – + – – – – +
5 – – + – – + – + + – + + – – +
6 – – + – + + – + – – + – – + –
7 – – + + – + – – + – – + + – –
8 – – + + + + – – – – – – + + +
9 – + – – – – + + + – – – + + +
10 – + – – + – + + – – – + + – –
11 – + – + – – + – + – + – – + –
12 – + – + + – + – – – + + – – +
13 – + + – – – – + + + – – – – +
14 – + + – + – – + – + – + – + –
15 – + + + – – – – + + + – + – –
16 – + + + + – – – – + + + + + +
17 + – – – – – – – – + + + + + +
18 + – – – + – – – + + + – + – –
19 + – – + – – – + – + – + – + –
20 + – – + + – – + + + – – – – +
21 + – + – – – + – – – + + – – +
22 + – + – + – + – + – + – – + –
23 + – + + – – + + – – – + + – –
24 + – + + + – + + + – – – + + +
25 + + – – – + – – – – – – + + +
26 + + – – + + – – + – – + + – –
27 + + – + – + – + – – + – – + –
28 + + – + + + – + + – + + – – +
29 + + + – – + + – – + – – – – +
30 + + + – + + + – + + – + – + –
31 + + + + – + + + – + + – + – –
32 + + + + + + + + + + + + + + +

we will see. Another method might be to eliminate the last sixteen of the thirty-two runs
but then we’d lose the ability to detect the effect of x1. This strategy is definitely unac-
ceptable. A third choice might be to eliminate eight of the sixteen runs with x1 = –1 and
eight of the sixteen runs with x1 = +1. This would preserve the ability to resolve the x1
effect, but then we’re back to the original problem—how do we select the eight runs to
eliminate from each set of sixteen? A logical method for the selection of these runs is
required and hopefully it will have minimal consequences.
Table 10.2 shows all 32 runs of the $2^5$ experiment and the 10 two-factor interactions
that we would like to determine. An advantage of the factorial designs is that all of the

Table 10.3 Correlation matrix for $2^5$ full-factorial design with all two-factor interactions.
x1 x2 x3 x4 x5 x12 x13 x14 x15 x23 x24 x25 x34 x35 x45
x1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0
x2 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0
x3 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0
x4 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0
x5 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0
x12 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0
x13 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0
x14 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0
x15 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0
x23 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0
x24 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0
x25 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
x34 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0
x35 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0
x45 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1

terms we want to model, the five main effects and ten two-factor interactions, are inde-
pendent of each other. This is confirmed by calculating the correlation coefficients
between all possible pairs of terms in the model. This is shown in the correlation matrix
of model terms in Table 10.3. (The values shown in the table are r values, not $r^2$, because
$r^2$ values get too small too fast and too many of them would appear as zeros when they
really aren’t.) All of the off-diagonal correlation coefficients in the matrix are zeros
confirming that all terms are independent. In fact, if the correlation matrix were
expanded to include all of the higher-order interaction terms, they would also be inde-
pendent. This is a desirable characteristic, and whatever fraction of the 32 runs we end
up keeping in our reduced experiment, it should at least preserve the independence of
the main effects and two-factor interactions.
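
The independence claimed for the full factorial design can be confirmed numerically. The sketch below builds the 32 runs in the same order as Table 10.2, forms the 15 model columns, and computes every pairwise correlation; the helper names are illustrative.

```python
from itertools import combinations, product
from math import sqrt


def model_columns(runs, k):
    # Main-effect columns x1..xk plus all two-factor interaction columns xij
    cols = {f"x{i + 1}": [r[i] for r in runs] for i in range(k)}
    for i, j in combinations(range(k), 2):
        cols[f"x{i + 1}{j + 1}"] = [r[i] * r[j] for r in runs]
    return cols


def correlation(a, b):
    # Pearson correlation coefficient r between two columns
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    sab = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    saa = sum((x - ma) ** 2 for x in a)
    sbb = sum((y - mb) ** 2 for y in b)
    return sab / sqrt(saa * sbb)


runs = list(product([-1, 1], repeat=5))  # standard order, x1 changes slowest
cols = model_columns(runs, 5)
off_diagonal = [correlation(cols[p], cols[q]) for p, q in combinations(cols, 2)]
print(len(cols), max(abs(r) for r in off_diagonal))
```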
Tables 10.4 and 10.5 show the design matrix and corresponding correlation matrix
for a 16-run experiment where the 16 runs were taken randomly from the full 32-run
experiment. The correlation matrix shows the consequence of using this method to iden-
tify the experimental runs. The off-diagonal terms in the correlation matrix are no longer
all zeros. In fact, there are few terms in the model that are completely independent of
any others. This indicates that there is substantial confounding of what were supposed
to be independent variables. The 16 runs selected here are not unique and other sets of
16 runs will give other confounding patterns, some better and some worse than the one
shown here. Of the $\binom{32}{16} = 601{,}080{,}390$ possible 16-run subsets, we have to hope that at
least some of them behave as we want them to. Thankfully, there is such a solution and
we don’t have to randomly search through the hundreds of millions of possible subsets
to find it.
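
The confounding in the randomly chosen subset can be checked the same way. This sketch assumes the run numbers listed in Table 10.4 and the run order of Table 10.2; it reproduces one entry of Table 10.5 and the count of possible subsets.

```python
from itertools import combinations, product
from math import comb, sqrt


def model_columns(runs, k):
    # Main-effect columns x1..xk plus all two-factor interaction columns xij
    cols = {f"x{i + 1}": [r[i] for r in runs] for i in range(k)}
    for i, j in combinations(range(k), 2):
        cols[f"x{i + 1}{j + 1}"] = [r[i] * r[j] for r in runs]
    return cols


def correlation(a, b):
    # Pearson correlation coefficient r between two columns
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    sab = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    saa = sum((x - ma) ** 2 for x in a)
    sbb = sum((y - mb) ** 2 for y in b)
    return sab / sqrt(saa * sbb)


full = list(product([-1, 1], repeat=5))  # run n of Table 10.2 is full[n - 1]
table_10_4_runs = [4, 5, 6, 8, 9, 19, 20, 21, 22, 23, 25, 26, 28, 30, 31, 32]
subset = [full[n - 1] for n in table_10_4_runs]

cols = model_columns(subset, 5)
r_12 = correlation(cols["x1"], cols["x2"])
worst = max(abs(correlation(cols[p], cols[q])) for p, q in combinations(cols, 2))
n_subsets = comb(32, 16)
print(round(r_12, 2), round(worst, 2), n_subsets)
```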
When we try to build just a fraction of the full 32-run experiment, some correlations
between variables apparently are inevitable. Suppose that we try to select the runs to

Table 10.4 Experiment of 16 random runs from the full 32-run experiment.
Run x1 x2 x3 x4 x5 x12 x13 x14 x15 x23 x24 x25 x34 x35 x45
4 – – – + + + + – – + – – – – +
5 – – + – – + – + + – + + – – +
6 – – + – + + – + – – + – – + –
8 – – + + + + – – – – – – + + +
9 – + – – – – + + + – – – + + +
19 + – – + – – – + – + – + – + –
20 + – – + + – – + + + – – – – +
21 + – + – – – + – – – + + – – +
22 + – + – + – + – + – + – – + –
23 + – + + – – + + – – – + + – –
25 + + – – – + – – – – – – + + +
26 + + – – + + – – + – – + + – –
28 + + – + + + – + + – + + – – +
30 + + + – + + + – + + – + – + –
31 + + + + – + + + – + + – + – –
32 + + + + + + + + + + + + + + +

Table 10.5 Correlation matrix of experiment of 16 random runs from full 32-run experiment.
x1 x2 x3 x4 x5 x12 x13 x14 x15 x23 x24 x25 x34 x35 x45
x1 1.00 0.32 –0.05 0.13 –0.05 –0.24 0.13 –0.05 0.13 0.24 0.05 0.40 0.05 –0.13 –0.32
x2 0.32 1.00 –0.24 –0.13 0.02 0.42 0.13 0.02 0.38 0.10 –0.02 0.13 0.49 0.13 0.02
x3 –0.05 –0.24 1.00 –0.13 –0.02 0.10 0.38 –0.02 –0.13 –0.10 0.52 0.13 0.02 0.13 –0.27
x4 0.13 –0.13 –0.13 1.00 0.13 0.00 0.00 0.38 –0.25 0.52 –0.13 0.00 0.13 –0.25 0.13
x5 –0.05 0.02 –0.02 0.13 1.00 0.36 –0.13 –0.27 0.38 0.16 0.02 –0.13 –0.24 0.13 –0.02
x12 –0.24 0.42 0.10 0.00 0.36 1.00 –0.26 –0.16 0.00 0.07 0.16 0.00 0.16 0.00 0.10
x13 0.13 0.13 0.38 0.00 –0.13 –0.26 1.00 –0.13 0.00 0.26 0.13 0.00 0.13 0.00 –0.13
x14 –0.05 0.02 –0.02 0.38 –0.27 –0.16 –0.13 1.00 0.13 0.16 0.27 0.13 0.02 –0.13 –0.02
x15 0.13 0.38 –0.13 –0.25 0.38 0.00 0.00 0.13 1.00 0.00 0.13 0.25 –0.13 0.00 0.13
x23 0.24 0.10 –0.10 0.52 0.16 0.07 0.26 0.16 0.00 1.00 –0.16 0.00 –0.16 0.00 –0.10
x24 0.05 –0.02 0.52 –0.13 0.02 0.16 0.13 0.27 0.13 –0.16 1.00 0.13 –0.27 –0.13 0.02
x25 0.40 0.13 0.13 0.00 –0.13 0.00 0.00 0.13 0.25 0.00 0.13 1.00 –0.13 –0.25 –0.13
x34 0.05 0.49 0.02 0.13 –0.24 0.16 0.13 0.02 –0.13 –0.16 –0.27 –0.13 1.00 0.13 0.02
x35 –0.13 0.13 0.13 –0.25 0.13 0.00 0.00 –0.13 0.00 0.00 –0.13 –0.25 0.13 1.00 –0.13
x45 –0.32 0.02 –0.27 0.13 –0.02 0.10 –0.13 –0.02 0.13 –0.10 0.02 –0.13 0.02 –0.13 1.00

control the correlations so that they behave in a tolerable manner. Since we don’t expect
to observe four-factor interactions, and don’t even plan on looking for them anyway,
let’s perfectly correlate or confound x5 with the four-factor interaction x1234 by using
only those experimental runs that satisfy the condition:

$$x_5 = x_1 x_2 x_3 x_4 = x_{1234} \tag{10.3}$$

Only 16 of the 32 runs from the full experiment satisfy this condition. This provides a
scheme to select 16 of 32 runs from the full experiment with the penalty that x5 is
confounded with x1234, but what other consequences are there? To consider this question,
the 16-run experiment is shown in Table 10.6 where only those 16 runs satisfying
Equation 10.3 are included. Note that Equation 10.3 is satisfied for each run. The two-
factor interactions are also shown in Table 10.6 and the correlation matrix is shown in
Table 10.7. The correlation matrix shows that all of the off-diagonal terms are now zeros
just as we wanted! How did this happen? This is exactly what we were after—a 16-run
experiment that can model main effects and two-factor interactions. But at what price?
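
The selection rule of Equation 10.3 can be applied mechanically. The sketch below filters the 32 runs of Table 10.2, confirms that the 15 model terms remain uncorrelated, and shows that the x15 column is identical to the x234 column. The helper names are illustrative only.

```python
from itertools import combinations, product
from math import sqrt


def model_columns(runs, k):
    # Main-effect columns x1..xk plus all two-factor interaction columns xij
    cols = {f"x{i + 1}": [r[i] for r in runs] for i in range(k)}
    for i, j in combinations(range(k), 2):
        cols[f"x{i + 1}{j + 1}"] = [r[i] * r[j] for r in runs]
    return cols


def correlation(a, b):
    # Pearson correlation coefficient r between two columns
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    sab = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    saa = sum((x - ma) ** 2 for x in a)
    sbb = sum((y - mb) ** 2 for y in b)
    return sab / sqrt(saa * sbb)


full = list(product([-1, 1], repeat=5))  # run n of Table 10.2 is full[n - 1]

# Keep only the runs that satisfy Equation 10.3: x5 = x1 * x2 * x3 * x4
kept = [(n, r) for n, r in enumerate(full, start=1) if r[4] == r[0] * r[1] * r[2] * r[3]]
run_numbers = [n for n, _ in kept]
half = [r for _, r in kept]

cols = model_columns(half, 5)
worst = max(abs(correlation(cols[p], cols[q])) for p, q in combinations(cols, 2))
x234 = [r[1] * r[2] * r[3] for r in half]
print(run_numbers, worst, cols["x15"] == x234)
```

The price of the reduction shows up in the last comparison: the two columns cannot be distinguished from one another in this design.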
To understand the downside of the design in Table 10.6, it’s necessary to consider
the confounding that occurs between the other potential terms in the model. To check this,
try multiplying Equation 10.3 through by x1. This corresponds to just multiplying the
indicated columns together on a row by row basis. This yields:

$$x_1 x_5 = x_1 x_{1234} \tag{10.4}$$

Since $x_1 x_1 = x_1^2 = 1$, this reduces to:

$$x_{15} = x_{234} \tag{10.5}$$

That is, the two-factor interaction x15 is perfectly confounded with the three-factor inter-
action x234. This is an acceptable risk because we don’t usually expect to see three-
factor interactions in our experiments. All the other two-factor interactions can be
generated by multiplying Equation 10.3 through by x2, x3, and x4. When this is done, it’s

Table 10.6 Experiment of 16 runs with x5 = x1234 from full 32-run experiment.
Run x1 x2 x3 x4 x5 x12 x13 x14 x15 x23 x24 x25 x34 x35 x45
2 – – – – + + + + – + + – + – –
3 – – – + – + + – + + – + – + –
5 – – + – – + – + + – + + – – +
8 – – + + + + – – – – – – + + +
9 – + – – – – + + + – – – + + +
12 – + – + + – + – – – + + – – +
14 – + + – + – – + – + – + – + –
15 – + + + – – – – + + + – + – –
17 + – – – – – – – – + + + + + +
20 + – – + + – – + + + – – – – +
22 + – + – + – + – + – + – – + –
23 + – + + – – + + – – – + + – –
26 + + – – + + – – + – – + + – –
27 + + – + – + – + – – + – – + –
29 + + + – – + + – – + – – – – +
32 + + + + + + + + + + + + + + +

Table 10.7 Correlation matrix for experiment of 16 runs with x5 = x1234 from full
32-run experiment.
x1 x2 x3 x4 x5 x12 x13 x14 x15 x23 x24 x25 x34 x35 x45
x1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0
x2 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0
x3 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0
x4 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0
x5 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0
x12 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0
x13 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0
x14 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0
x15 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0
x23 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0
x24 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0
x25 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0
x34 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0
x35 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0
x45 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1

found that each two-factor interaction is confounded with a particular three-factor inter-
action. But if we don’t expect three-factor interactions to be significant then this com-
promise is tolerable.
Since the subscripts in the confounding relations such as Equation 10.3 carry all of
the information about the confounding between terms, it is common to write the con-
founding relations as 5 = 1234 instead of x5 = x1234. All of the confounding relations for
the five-variable 16-run experiment implied by Equation 10.3 are shown in the follow-
ing table:

1 = 2345    12 = 345    23 = 145    34 = 125    45 = 123
2 = 1345    13 = 245    24 = 135    35 = 124
3 = 1245    14 = 235    25 = 134
4 = 1235    15 = 234
5 = 1234

Although the correlation matrices in Tables 10.3 and 10.7 appear to be identical, they
are not. Both tables are incomplete—they don’t show the three-, four-, and five-factor
interactions—but if they were expanded to show those interactions, the differences would
be apparent. Whereas Table 10.3 would be diagonal, with ones on the diagonal and zeros
everywhere else, Table 10.7 would have some additional ones in off-diagonal positions
because of the confounding between the low-order and high-order terms in the model.
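
The confounding relations in the table above can be generated by treating each term as a set of subscripts. Multiplying two terms cancels any subscript that appears twice, because the square of a ±1 column is a column of ones. The sketch below assumes the generator 5 = 1234, rewritten as the single word 12345 by multiplying both sides by 5; this rewriting is a convenience of the sketch and the names are hypothetical.

```python
from itertools import combinations


def multiply(word_a, word_b):
    # Subscripts that appear in both words cancel because xi * xi = 1
    return "".join(sorted(set(word_a) ^ set(word_b)))


defining_word = multiply("5", "1234")  # 5 = 1234 multiplied through by 5 gives 12345

main_effects = [str(i) for i in range(1, 6)]
two_factor = ["".join(pair) for pair in combinations("12345", 2)]
aliases = {term: multiply(term, defining_word) for term in main_effects + two_factor}

for term, alias in aliases.items():
    print(f"{term} = {alias}")
```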
Equation 10.3 is clearly the key to determining how this 16-run experiment is constructed
and how it behaves. One way to generate the design of Table 10.6 is to construct
a 16-run $2^4$ factorial experiment in variables x1, x2, x3, and x4, and then to determine the
required levels for the last variable x5 using Equation 10.3. Since Equation 10.3 determines
the levels of x5 from the first four variables, it is called the design generator. Designs
constructed this way are called fractional factorial designs. The five-variable design in
16 runs described here is designated a $2^{5-1}$ design. The –1 in the exponent indicates that
only one half of the original $2^5 = 32$ runs of the experiment are used since $2^{-1} = 1/2$. For
this reason we call this experiment a half-fractional factorial. The remaining 16 runs
make up the complementary half-fraction and satisfy the generator:

5 = −1234 (10.6)

The complementary half-fraction experiment is just as useful and valid as the experiment
defined by the generator of Equation 10.3; there is no preference between the two. For
this reason, generators for fractional factorial designs are usually written with a ± oper-
ator to indicate that there are two equivalent choices for the generator, such as:

5 = ±1234 (10.7)

10.3 OTHER FRACTIONAL FACTORIAL DESIGNS


The half-fractional factorial design introduced in the previous section is only one
possible fraction that can be defined. Quarter-, eighth-, and higher-order fractional
experiments can also be designed. The designations for the half-, quarter-, and eighth-fractional
designs are $2^{k-1}$, $2^{k-2}$, and $2^{k-3}$, respectively, where k is the number of variables
and the terms –1, –2, and –3 in the exponents indicate the degree of fractionation.
(Notice that $2^{-1} = 1/2$, $2^{-2} = 1/4$, and $2^{-3} = 1/8$.) In addition to indicating the degree of
fractionation present in a fractional factorial design, the number following the minus sign in
the exponent of the design designation indicates the number of generators required
for the design, so half-fractional experiments will have one generator, quarter-fractional
designs will have two generators, eighth-fractional designs will have three generators,
and so on. The generators for the quarter- and higher-order fractional factorial designs
are selected on the basis of the same arguments that we used to determine the runs for
the half-fractional design but the rules for generators can get quite complicated. It’s not
worth taking the time to learn how the generators for the higher-order fractional designs
are constructed—just refer to an appropriate table of designs and generators to find
them. Table 10.8 shows the generators for some of the most common fractional factor-
ial designs.
Fractional factorial designs are commonly used to reduce the number of runs
required to build an experiment. They also provide a powerful tool for blocking large
experiments. For example, suppose a large $2^k$ experiment cannot be completed in a single
day. Which runs should be made on each day? Your first thought might be to randomly
assign the runs of the experiment to days, but we already saw the risk of that
choice. If the experiment must be built roughly in quarters, say spread over four days,
then a logical choice would be to build a quarter-fraction of the full experiment each

Table 10.8 Fractional factorial designs with number of runs and generators.
k   Design Resolution   Design              Runs   Generators
3   III                 $2^{3-1}_{III}$        4   3 = ±12
4   IV                  $2^{4-1}_{IV}$         8   4 = ±123
5   III                 $2^{5-2}_{III}$        8   4 = ±12, 5 = ±13
    V                   $2^{5-1}_{V}$         16   5 = ±1234
6   III                 $2^{6-3}_{III}$        8   4 = ±12, 5 = ±13, 6 = ±23
    IV                  $2^{6-2}_{IV}$        16   5 = ±123, 6 = ±234
    VI                  $2^{6-1}_{VI}$        32   6 = ±12345
7   III                 $2^{7-4}_{III}$        8   4 = ±12, 5 = ±13, 6 = ±23, 7 = ±123
    IV                  $2^{7-3}_{IV}$        16   5 = ±123, 6 = ±234, 7 = ±134
    IV                  $2^{7-2}_{IV}$        32   6 = ±1234, 7 = ±1245
    VII                 $2^{7-1}_{VII}$       64   7 = ±123456
8   IV                  $2^{8-4}_{IV}$        16   5 = ±234, 6 = ±134, 7 = ±123, 8 = ±124
    IV                  $2^{8-3}_{IV}$        32   6 = ±123, 7 = ±124, 8 = ±2345
    V                   $2^{8-2}_{V}$         64   7 = ±1234, 8 = ±1256
    VIII                $2^{8-1}_{VIII}$     128   8 = ±1234567

day. Each quarter-fraction of the full experiment, while having some potentially unde-
sirable confounding between variables, can be analyzed by itself. Then, after each
day’s data become available, the data can be combined and used to analyze the system
more completely. If it isn’t necessary to build the full experiment and only a half-frac-
tion of the full experiment is required, then two quarter-fractions or four eighth-frac-
tions can be built, yielding the required half-fractional factorial experiment when the
data are combined on the last day. This approach also permits you to treat the differ-
ent days as blocks so that day-to-day differences can be identified and removed.

10.4 DESIGN RESOLUTION


The generators used to construct a fractional factorial design determine the confounding
that will be present among the design variables and their various interactions. For
the $2^{5-1}$ design considered earlier, the generator involves five variables: x1, x2, x3, x4, and
x5, and defines the confounding for a main effect and a four-factor interaction. When the
generator is used to determine the confounding for the two-factor interactions, it shows
that they are each confounded with a single three-factor interaction. Again this confounding
relationship involves five terms: two in the two-factor interaction and three
in the three-factor interaction. This observation, that every confounding relation in the
$2^{5-1}$ design involves five terms, is a fundamental characteristic of the design, so the design
is referred to as a resolution V design where the roman numeral V for five is used. The
design designation $2^{5-1}$ is enhanced to reflect the design resolution: $2^{5-1}_{V}$, where the
subscript indicates the resolution. This notation is summarized in Figure 10.1. Table 10.9
summarizes the most common fractional factorial designs by the number of design
variables and the design resolution.

Figure 10.1 Fractional factorial design designation. [Diagram: in the designation 2^(k–p)_R,
2 is the number of levels of each variable, k is the number of variables, p is the degree of
fractionation, and R is the design resolution.]

Table 10.9 Fractional factorial designs by number of variables and design resolution.

                                   Design Resolution
k   III            IV                        V            VI            VII            VIII
3   2^(3–1)_III    -                         -            -             -              -
4   -              2^(4–1)_IV                -            -             -              -
5   2^(5–2)_III    -                         2^(5–1)_V    -             -              -
6   2^(6–3)_III    2^(6–2)_IV                -            2^(6–1)_VI    -              -
7   2^(7–4)_III    2^(7–3)_IV, 2^(7–2)_IV    -            -             2^(7–1)_VII    -
8   -              2^(8–4)_IV, 2^(8–3)_IV    2^(8–2)_V    -             -              2^(8–1)_VIII

Each fractional factorial design has its own inherent design resolution. For example,
the 2^(4–1) half-fractional factorial design has a generator given by:

4 = 123 (10.8)

The generator can be manipulated to show that all main effects are confounded with
three-factor interactions and all two-factor interactions are confounded with other
two-factor interactions. All of the confounding relations for the 2^(4–1) design are shown in the
following table:
1 = 234 12 = 34
2 = 134 13 = 24
3 = 124 14 = 23
4 = 123

In every case there are four variables in each confounding relation, indicating that this
is a resolution IV design, so the design is designated 2^(4–1)_IV. The design and correlation
matrices for the 2^4 and 2^(4–1)_IV designs are shown in Tables 10.10, 10.11, 10.12, and 10.13.
The correlation matrices nicely summarize the situation but the simple statement that
the design is resolution IV provides an even more succinct summary.
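
The manipulation of the generator can be sketched in a few lines of Python. This is only an illustration, not part of MINITAB: the helper names multiply, label, and alias are hypothetical, and each effect is represented as a set of factor numbers. Because every coded column squares to +1, multiplying two effects amounts to cancelling the factors they share.

```python
from itertools import combinations


def multiply(word_a, word_b):
    """Multiply two effects written as sets of factor numbers.

    Any factor that appears in both words cancels (x_i squared is +1),
    so the product is the symmetric difference of the two sets.
    """
    return frozenset(word_a) ^ frozenset(word_b)


def label(word):
    """Write an effect in the book's compact notation, e.g. {1, 2} -> '12'."""
    return "".join(str(f) for f in sorted(word)) or "I"


# The generator 4 = 123 is equivalent to the defining relation I = 1234.
DEFINING_WORD = frozenset({1, 2, 3, 4})


def alias(effect):
    """Return the effect that is confounded with `effect` in this design."""
    return multiply(effect, DEFINING_WORD)


if __name__ == "__main__":
    factors = [1, 2, 3, 4]
    effects = [frozenset(c) for r in (1, 2) for c in combinations(factors, r)]
    for effect in effects:
        print(f"{label(effect)} = {label(alias(effect))}")
```

Every relation printed by the loop involves four factor numbers in total, which is the resolution IV property described above.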

Table 10.10 2^4 design in 16 runs.


Run x1 x2 x3 x4 x12 x13 x14 x23 x24 x34
1 – – – – + + + + + +
2 – – – + + + – + – –
3 – – + – + – + – + –
4 – – + + + – – – – +
5 – + – – – + + – – +
6 – + – + – + – – + –
7 – + + – – – + + – –
8 – + + + – – – + + +
9 + – – – – – – + + +
10 + – – + – – + + – –
11 + – + – – + – – + –
12 + – + + – + + – – +
13 + + – – + – – – – +
14 + + – + + – + – + –
15 + + + – + + – + – –
16 + + + + + + + + + +

Table 10.11 Correlation matrix for 2^4 full-factorial experiment.


x1 x2 x3 x4 x12 x13 x14 x23 x24 x34
x1 1 0 0 0 0 0 0 0 0 0
x2 0 1 0 0 0 0 0 0 0 0
x3 0 0 1 0 0 0 0 0 0 0
x4 0 0 0 1 0 0 0 0 0 0
x12 0 0 0 0 1 0 0 0 0 0
x13 0 0 0 0 0 1 0 0 0 0
x14 0 0 0 0 0 0 1 0 0 0
x23 0 0 0 0 0 0 0 1 0 0
x24 0 0 0 0 0 0 0 0 1 0
x34 0 0 0 0 0 0 0 0 0 1

Table 10.12 2^(4–1)_IV half-fractional factorial design.

Run x1 x2 x3 x4 x12 x13 x14 x23 x24 x34


1 – – – – + + + + + +
2 – – + + + – – – – +
3 – + – + – + – – + –
4 – + + – – – + + – –
5 + – – + – – + + – –
6 + – + – – + – – + –
7 + + – – + – – – – +
8 + + + + + + + + + +

Table 10.13 Correlation matrix for 2^(4–1)_IV half-fractional factorial design.

x1 x2 x3 x4 x12 x13 x14 x23 x24 x34


x1 1 0 0 0 0 0 0 0 0 0
x2 0 1 0 0 0 0 0 0 0 0
x3 0 0 1 0 0 0 0 0 0 0
x4 0 0 0 1 0 0 0 0 0 0
x12 0 0 0 0 1 0 0 0 0 1
x13 0 0 0 0 0 1 0 0 1 0
x14 0 0 0 0 0 0 1 1 0 0
x23 0 0 0 0 0 0 1 1 0 0
x24 0 0 0 0 0 1 0 0 1 0
x34 0 0 0 0 1 0 0 0 0 1

The advantage of understanding the design resolution is that if a model has partic-
ular requirements then a design of a certain resolution can be specified. If it is desired
that a model resolve main effects and two-factor interactions independently, then a res-
olution V design is needed. If it’s acceptable to have main effects confounded with
three-factor interactions, and two-factor interactions confounded with other two-factor
interactions, then a resolution IV design is appropriate. If main effects and two-factor inter-
actions can be confounded, then a resolution III design is appropriate. Each resolution has
its own problems and an associated strategy for managing them.
What if the generator of Equation 10.8 is multiplied through by x4? Since x4^2 = 1 we
will get:

1 = x1234 (10.9)

that is, the four-factor interaction is constant. Try it out. The product of x1, x2, x3, and x4
for all of the runs in the 2^(4–1) design is one. This means that the four-factor interaction is
confounded with the constant of the model. This still follows the rule defined by design
resolution IV, namely that each confounding relation must involve four variables.
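
One way to "try it out" without a worksheet is the short Python sketch below. It assumes the positive generator x4 = x1 x2 x3 of Equation 10.8; the function names are illustrative only.

```python
from itertools import product
from math import prod


def half_fraction():
    """Build the 8-run design: a full 2^3 in x1..x3, then x4 = x1*x2*x3."""
    runs = []
    for x1, x2, x3 in product((-1, 1), repeat=3):
        runs.append((x1, x2, x3, x1 * x2 * x3))
    return runs


def four_factor_products(runs):
    """Return x1*x2*x3*x4 for every run."""
    return [prod(run) for run in runs]


if __name__ == "__main__":
    design = half_fraction()
    print(four_factor_products(design))  # the same value for every run
    # The x12 and x34 columns are also identical, as expected from 12 = 34.
    print(all(r[0] * r[1] == r[2] * r[3] for r in design))
```

Since the x1234 column never changes, it cannot be distinguished from the column of ones that carries the model constant.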
When higher fractions than one-half are considered, more generators are required to
specify a design. Some designs have generators that contain different numbers of
variables. The shortest generator (or confounding relation implied by the generators)
determines the design resolution. For example, a 2^(7–4) design requires four generators. From
Table 10.8 they are:

x4 = x12
x5 = x13
x6 = x23
x7 = x123

Note that in this case, three of the generators involve three variables and the fourth one
involves four. Since the design resolution is determined by the length of the shortest
generator, which in this case involves three variables, this design is designated 2^(7–4)_III.
There is no confounding at all in the 2^k full-factorial experiments; estimates for
all main effects, two-factor, three-factor, and higher-order interactions up to the single
k-factor interaction can be resolved separately from each other. Confounding only
happens in experiments that are fractionated.
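
The rule that the shortest generator or implied confounding relation sets the resolution can be checked mechanically. The following Python sketch is an illustration under stated assumptions: the functions defining_words and resolution are hypothetical helpers, signs of the generators are ignored because they do not affect word length, and each generator such as 4 = 12 is converted to its defining word 124.

```python
from itertools import combinations


def defining_words(generators):
    """Return every word of the defining relation.

    `generators` maps each generated factor to the factors that define it,
    e.g. {4: (1, 2)} for 4 = 12, which contributes the word 124. Products of
    the generator words are formed by cancelling repeated factors.
    """
    base_words = [frozenset(parents) | {child} for child, parents in generators.items()]
    words = set()
    for r in range(1, len(base_words) + 1):
        for subset in combinations(base_words, r):
            word = frozenset()
            for w in subset:
                word = word ^ w  # symmetric difference cancels squared factors
            words.add(word)
    return words


def resolution(generators):
    """Design resolution: the length of the shortest defining word."""
    return min(len(word) for word in defining_words(generators))


if __name__ == "__main__":
    seven_four = {4: (1, 2), 5: (1, 3), 6: (2, 3), 7: (1, 2, 3)}
    print(resolution(seven_four))          # resolution of the 2^(7-4) design
    print(resolution({4: (1, 2, 3)}))      # resolution of the 2^(4-1) design
```

Checking the implied words, and not only the generators as written, matters because a product of two long generators can in principle be shorter than either one.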

10.5 THE CONSEQUENCES OF CONFOUNDING


Every 2^k full-factorial experiment has the desirable quality that all of its main effects,
two-factor interactions, and all higher-order interactions up to the single k-factor interaction
are independent of each other; consequently, every one of these terms can be included in
the model even if they are not expected to be important. In contrast, the 2^(k–p) fractional
factorial designs take advantage of the relative rarity of significant higher-order interactions
and intentionally confound them with simpler terms like main effects and two-factor
interactions that are more likely to be important. Whenever potential model terms are
confounded, there are constraints on which terms can be included in the model.
The degree of agreement between two potential terms in a model is measured by the
correlation coefficient r introduced in Chapter 8. A convenient way to present the cor-
relation coefficients of the many possible pairs of terms in a model is with a square
matrix of correlation coefficients such as were used earlier in this chapter. Those pairs
of terms in the matrix with r = 0 are independent of each other but in the fractional
factorial designs there are frequently cases where r = ±1, which indicates that the two
relevant terms are perfectly correlated. When two terms are perfectly correlated or
confounded like this, the columns of their ±1 values in the design matrix will be exactly
the same when r = 1, and exactly opposite each other when r = –1. That is, the levels of
the confounded terms are effectively locked together. Then if one or the other or both
terms has a significant effect on the response, it will be impossible to determine which
term or terms was the true cause of the effect. The mathematical consequence of this
relationship is that only one of the confounded terms can be included in the model and
the effect attributed to the included term will actually be a combination of the effects
due to both confounded terms. In general, when two or more model terms are con-
founded with each other, only one of the involved terms can be included in the model
but the effect attributed to that term actually will be a combination of the effects of all
of the confounded terms.

Example 10.1
Analyze the data from the 2^3 full-factorial experiment with two replicates in Figure
10.2. Then extract and reanalyze those runs that correspond to the one-half fractional
factorial design with x3 = x12 and compare the two models.

Row y x1 x2 x3 x12 x13 x23


1 77.274 -1 -1 -1 1 1 1
2 78.882 -1 -1 1 1 -1 -1
3 67.014 -1 1 -1 -1 1 -1
4 58.279 -1 1 1 -1 -1 1
5 85.340 1 -1 -1 -1 -1 1
6 130.299 1 -1 1 -1 1 -1
7 63.342 1 1 -1 1 -1 -1
8 112.042 1 1 1 1 1 1
9 70.145 -1 -1 -1 1 1 1
10 80.777 -1 -1 1 1 -1 -1
11 70.425 -1 1 -1 -1 1 -1
12 64.060 -1 1 1 -1 -1 1
13 80.620 1 -1 -1 -1 -1 1
14 131.356 1 -1 1 -1 1 -1
15 67.897 1 1 -1 1 -1 -1
16 108.460 1 1 1 1 1 1

Figure 10.2 Data from a 2^3 full-factorial experiment.

Solution: The experimental data were loaded into a MINITAB worksheet and analyzed.
The correlation matrix and the multiple regression analysis output from the
full-factorial experiment are shown in Figure 10.3. Figure 10.4 shows the runs and analysis
from the indicated 2^(3–1)_III design where the generator 3 = 12 was used to select those eight
runs to be retained from the original 16-run experiment. The correlation matrix clearly
shows the confounding between main effects and two-factor interactions as expected in
this resolution III design and the expected pairs of columns are exactly the same in the
data table.

Example 10.2
Use the confounding relations to compare the regression coefficients of the models
in Figures 10.3 and 10.4.
Solution: Figure 10.3 shows the analysis of the full-factorial design with all of the
main effects and two-factor interactions intact. The confounding relations for the 2^(3–1)_III
experiment extracted from the full-factorial experiment are: 1 = 23, 2 = 13, and 3 = 12.
The regression coefficients for x1 and x23 from the full-factorial experiment are b1 = 13.28
and b23 = –2.11. In the fractional factorial experiment, since x1 and x23 are confounded,
only x1 can be retained in the model but its regression coefficient is a combination of
the effects of both terms. That is, b1 from the fractional factorial experiment equals b1 +
b23 from the full-factorial experiment:

(b1 + b23)_full = (b1)_fractional
13.28 − 2.11 = 11.17

which is in perfect agreement with the coefficient of x1 reported in Figure 10.4. From
the other confounding relations:

MTB > corr c2-c7;


SUBC> nopvalues.

Correlations (Pearson)
x1 x2 x3 x12 x13
x2 0.000
x3 0.000 0.000
x12 0.000 0.000 0.000
x13 0.000 0.000 0.000 0.000
x23 0.000 0.000 0.000 0.000 0.000

MTB > regress c1 6 c2-c7

Regression Analysis
The regression equation is
y = 84.1 + 13.3 x1 - 7.70 x2 + 11.4 x3 - 1.79 x12 + 11.7 x13 - 2.11 x23

Predictor Coef StDev T P


Constant 84.1381 0.8561 98.28 0.000
x1 13.2813 0.8561 15.51 0.000
x2 -7.6984 0.8561 -8.99 0.000
x3 11.3811 0.8561 13.29 0.000
x12 -1.7858 0.8561 -2.09 0.067
x13 11.7388 0.8561 13.71 0.000
x23 -2.1107 0.8561 -2.47 0.036

S = 3.425 R-Sq = 98.7% R-Sq(adj) = 97.9%

Analysis of Variance

Source DF SS MS F P
Regression 6 8170.1 1361.7 116.11 0.000
Residual Error 9 105.5 11.7
Total 15 8275.6

Source DF Seq SS
x1 1 2822.3
x2 1 948.2
x3 1 2072.5
x12 1 51.0
x13 1 2204.8
x23 1 71.3

Figure 10.3 Analysis of 2^3 full-factorial experiment.

(b2 + b13)_full = (b2)_fractional
−7.70 + 11.74 = 4.04

(b3 + b12)_full = (b3)_fractional
11.38 − 1.79 = 9.59

which are also in perfect agreement with the two figures. The final result of confounding
in the 2^(3–1)_III fractional factorial design is that the model constant is confounded with

MTB > print c1-c7

Data Display
Row y x1 x2 x3 x12 x13 x23

1 78.882 -1 -1 1 1 -1 -1
2 67.014 -1 1 -1 -1 1 -1
3 85.340 1 -1 -1 -1 -1 1
4 112.042 1 1 1 1 1 1
5 80.777 -1 -1 1 1 -1 -1
6 70.425 -1 1 -1 -1 1 -1
7 80.620 1 -1 -1 -1 -1 1
8 108.460 1 1 1 1 1 1

MTB > corr c2-c7;


SUBC> nopvalues.

Correlations (Pearson)
x1 x2 x3 x12 x13
x2 0.000
x3 0.000 0.000
x12 0.000 0.000 1.000
x13 0.000 1.000 0.000 0.000
x23 1.000 0.000 0.000 0.000 0.000

MTB > regress c1 6 c2-c7

Regression Analysis

* x12 is highly correlated with other X variables


* x12 has been removed from the equation

* x13 is highly correlated with other X variables


* x13 has been removed from the equation

* x23 is highly correlated with other X variables


* x23 has been removed from the equation

The regression equation is


y = 85.4 + 11.2 x1 + 4.04 x2 + 9.60 x3

Predictor Coef StDev T P


Constant 85.4448 0.8869 96.34 0.000
x1 11.1705 0.8869 12.60 0.000
x2 4.0404 0.8869 4.56 0.010
x3 9.5953 0.8869 10.82 0.000

S = 2.509 R-Sq = 98.7% R-Sq(adj) = 97.7%

Analysis of Variance

Source DF SS MS F P
Regression 3 1865.41 621.80 98.81 0.000
Residual Error 4 25.17 6.29
Total 7 1890.58

Source DF Seq SS
x1 1 998.25
x2 1 130.60
x3 1 736.56

Figure 10.4 Analysis of 2^(3–1)_III half-fractional factorial experiment.

the three-factor interaction so (b0 + b123)_full = (b0)_fractional. Although the three-factor
interaction wasn’t reported in Figure 10.3, it must be given by:

(b123)_full = (b0)_fractional − (b0)_full
= 85.44 − 84.14
= 1.30

which could be confirmed by fitting the full model to the full-factorial experiment. This
demonstrates, by example, that the consequence of confounding in fractional factorial
designs is that the regression coefficients from the full-factorial experiments are liter-
ally added together according to the confounding relations and reported as the coeffi-
cients of the fractional factorial experiment.
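
The additions in Example 10.2 can be reproduced outside MINITAB with the data from Figure 10.2. The Python sketch below is a hedged illustration, not the analysis used in the text: it relies on the fact that when all of the ±1 model columns are orthogonal, each least-squares coefficient reduces to the sum of column times response divided by the number of runs. The names TERMS, fit, full, and half are hypothetical, and the x123 term is added to the full model here (Figure 10.3 omitted it), which leaves the other coefficients unchanged because the columns are orthogonal.

```python
# Responses from Figure 10.2, in row order (two replicates of the 2^3 design).
Y = [77.274, 78.882, 67.014, 58.279, 85.340, 130.299, 63.342, 112.042,
     70.145, 80.777, 70.425, 64.060, 80.620, 131.356, 67.897, 108.460]

# Coded levels (x1, x2, x3) for rows 1-8; rows 9-16 repeat the same pattern.
LEVELS = [(-1, -1, -1), (-1, -1, 1), (-1, 1, -1), (-1, 1, 1),
          (1, -1, -1), (1, -1, 1), (1, 1, -1), (1, 1, 1)] * 2

# Each model term as a function of the coded levels of one run.
TERMS = {
    "b0": lambda x: 1,
    "b1": lambda x: x[0],
    "b2": lambda x: x[1],
    "b3": lambda x: x[2],
    "b12": lambda x: x[0] * x[1],
    "b13": lambda x: x[0] * x[2],
    "b23": lambda x: x[1] * x[2],
    "b123": lambda x: x[0] * x[1] * x[2],
}


def fit(rows, names):
    """Coefficients for orthogonal +/-1 columns: sum(column * y) / number of runs."""
    n = len(rows)
    return {name: sum(TERMS[name](x) * y for x, y in rows) / n for name in names}


full_rows = list(zip(LEVELS, Y))
# Keep only the runs that satisfy the generator x3 = x1*x2.
half_rows = [(x, y) for x, y in full_rows if x[2] == x[0] * x[1]]

full = fit(full_rows, TERMS)
half = fit(half_rows, ["b0", "b1", "b2", "b3"])

if __name__ == "__main__":
    for kept, partner in [("b0", "b123"), ("b1", "b23"), ("b2", "b13"), ("b3", "b12")]:
        total = full[kept] + full[partner]
        print(f"{kept}: fractional {half[kept]:.4f}, full {kept} + {partner} = {total:.4f}")
```

The agreement is exact and not merely approximate: within the retained runs the confounded columns are identical, and within the discarded runs they are opposite, so the discarded runs contribute nothing to the sum of the two full-factorial coefficients.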

10.6 FRACTIONAL FACTORIAL DESIGNS IN MINITAB


The methods for creating and analyzing fractional factorial designs in MINITAB are
substantially the same as those presented in Section 9.7 for the full-factorial designs.
The only modifications to those methods address the issues associated with confound-
ing between model terms.

10.6.1 Creating Fractional Factorial Designs in MINITAB


The methods for creating fractional factorial designs in MINITAB are very similar to
the methods for creating 2^k full-factorial designs presented in Section 9.7.1, with a few
extra steps:
• Copy the design from an existing file.
• Manually enter all of the ±1 values for each column into the worksheet for the
base design. Then use the let command (or the Calc> Calculator menu) and
the design generators to create the remaining columns.
• Use the set command (or the Calc> Make Patterned Data> Simple Set of
Numbers menu) to create the necessary pattern of ±1 values for each column
of the base design. Then use the design generators and the let command to
create the remaining columns.
• Use MINITAB’s Stat> DOE> Factorial> Create Factorial Design menu to
specify and create the design.
If you use one of the first three methods, you should make and check the correlation
matrix including all of the main effects and two-factor interactions to confirm that the
run matrix was created correctly. The correlation matrix should have r = 1 on the diag-
onal and r = 0 everywhere for the off-diagonal entries except for certain r = ±1 values
expected from the confounding relations.

Example 10.3
Use MINITAB’s Calc> Make Patterned Data> Simple Set of Numbers and Calc>
Calculator dialogs to create the 2^(6–2)_IV experiment design. Construct the correlation matrix
to confirm that the design was created correctly.
Solution: The base design of the 2^(6–2)_IV experiment is a 2^4 design with 16 runs. The
instructions for creating this design manually are given in Example 9.8. The Calc>
Make Patterned Data> Simple Set of Numbers dialog was used to recreate the 2^4 base
design and then the Calc> Calculator dialog, with the generators taken from Table 10.8,
was used to determine x5 and x6. Then the Calc> Calculator dialog was used again to
create all of the two-factor interactions. The MINITAB commands and output are shown
in Figure 10.5. If you are mouse-impaired you can type these commands directly at the
command prompt instead of using the mouse/menu environment. The correlation matrix
was reformatted to fit better in the Session window but the default output by MINITAB
is very similar.
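
For readers without MINITAB at hand, the same construction and check can be sketched in Python. This is an illustration only, assuming the positive generators 5 = 123 and 6 = 234 from Table 10.8; the function names are hypothetical. Because every main-effect and two-factor interaction column in this design is balanced (equal numbers of +1 and –1), the Pearson correlation of two columns reduces to the average of their products.

```python
from itertools import combinations, product


def build_design():
    """16-run 2^(6-2) design: full 2^4 in x1..x4, then x5 = x1*x2*x3, x6 = x2*x3*x4."""
    runs = []
    for x1, x2, x3, x4 in product((-1, 1), repeat=4):
        runs.append((x1, x2, x3, x4, x1 * x2 * x3, x2 * x3 * x4))
    return runs


def columns(runs):
    """Main-effect columns '1'..'6' followed by two-factor columns '12'..'56'."""
    cols = {str(i + 1): [run[i] for run in runs] for i in range(6)}
    for i, j in combinations(range(6), 2):
        cols[f"{i + 1}{j + 1}"] = [run[i] * run[j] for run in runs]
    return cols


def correlation(a, b):
    """Pearson r for two balanced +/-1 columns (zero mean, unit variance)."""
    return sum(p * q for p, q in zip(a, b)) / len(a)


def confounded_pairs(cols):
    """List every pair of columns whose correlation is +1 or -1."""
    return [(s, t) for s, t in combinations(cols, 2)
            if abs(correlation(cols[s], cols[t])) == 1]


if __name__ == "__main__":
    for s, t in confounded_pairs(columns(build_design())):
        print(f"{s} = {t}")
```

If the design was built correctly, no main effect should appear in the printed list, since a resolution IV design confounds two-factor interactions only with each other.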

The easiest way to create a fractional factorial design in MINITAB is from the
Stat> DOE> Factorial> Create Factorial Design menu. Designs created in this way have
the added advantage that they’re automatically recognized by MINITAB when you’re
ready to analyze the experiment using Stat> DOE> Factorial> Analyze Factorial

Figure 10.5 MINITAB commands to create the 2^(6–2)_IV design and correlation matrix.

Figure 10.6 Creating the 2^(6–2)_IV design using Stat> DOE> Factorial> Create Factorial Design.

Design. The steps required to create a fractional factorial design using the Create
Factorial Design menu are essentially the same as the steps used to create a full-factorial
design as described in Section 9.7.1. There are several options available to customize
the design but the default settings will be applicable in most cases.

Example 10.4
Use MINITAB’s Stat> DOE> Factorial> Create Factorial Design menu to recreate the
2^(6–2)_IV experiment design from Example 10.3. Confirm that the design was created correctly.
Solution: The experiment was created in MINITAB and MINITAB’s output is shown
in Figure 10.6. The matrix of experimental runs is randomized so it is difficult to
compare it to the result from Example 10.3. The runs could be sorted by the standard order
and then checked to see if they match, but the output in the Session window indicates
that the same design generators were used to create the fifth and sixth variables so we
can be confident that the two methods for creating the 2^(6–2)_IV designs are equivalent.

10.6.2 Analysis of Fractional Factorial Designs with MINITAB


The analysis methods for full-factorial designs presented in Section 9.7.2 are still
applicable for the fractional factorial designs. Those methods were:
• Manual analysis with Stat> Regression> Regression.
• Analysis with the custom mlrk.mac macros.
• Analysis with Stat> DOE> Factorial> Analyze Factorial Design.
For manual analysis, the Stat> Regression> Regression method is a bit tedious
because you have to create all of the interaction columns and indicator columns for the
blocks. But once those steps are complete, the Stat> Regression> Regression method is
quite flexible and easy to use. The Stat> ANOVA> General Linear Model method can
also be used to analyze fractional factorial designs, but specify all of the model terms
except blocks as covariates to get the regression coefficients in the MINITAB output.
The mlrk.mac macros work the same way for fractional factorial designs as they do
for full-factorial designs. They use MINITAB’s regress command instead of the GLM
(general linear model) command because the regress command automatically retains the
first of every confounded set of terms and drops the others from the model. MINITAB
prints a warning in the Session window when it must drop a term from the model. For
example, if x4 is correlated to another term already included in the model, you would
see the following statements appear in the Session window:

* x4 is highly correlated with other x variables


* x4 has been removed from the equation

The GLM command doesn’t have this capability; when confounded terms are included
in the model it generates an error and stops.
The Stat> DOE> Factorial> Analyze Factorial Design menu works exactly as it
did for full-factorial designs. You will still have to specify the response, the terms to be
included in the model, and the residuals diagnostic graphs to be constructed. MINITAB
automatically includes all of the possible terms that the design allows to be fitted so you
shouldn’t have to make many changes to the model. MINITAB also reports the con-
founding relations to assist in the interpretation of the regression coefficients.
Usually with the full-factorial designs, and always when they are replicated, there
are enough total degrees of freedom in an experiment to fit an appropriate model and
still have degrees of freedom left over to estimate the error. But sometimes, especially
when an experiment has very few runs and a large model, the model consumes all
available degrees of freedom and there are none left over to estimate the error. Such
designs are called saturated designs. Unreplicated fractional factorial designs are
often saturated designs. When the analysis of these designs is performed in MINITAB,
MINITAB completes as much of the analysis as it can before it has to stop. Part of
the analysis that it does complete is the calculation of the regression coefficients, but
without error degrees of freedom it cannot determine their associated standard errors,
t values, or p values. One method to continue the analysis is to construct the normal
probability plot of the regression coefficients and use it to determine which model
terms are the weakest. After the weakest terms are dropped from the model they are
used to estimate the error and MINITAB can once again complete the rest of the
analysis.

One of the most important saturated designs is the 2^(5–1)_V design, which has sixteen
runs, dftotal = 15, dfmodel = 15, and dfe = 0. But with so many plotted points in the normal plot of the
regression coefficients, it’s often quite easy to determine which terms are important and
which can safely be dropped from the model. Of course a follow-up experiment should
be performed to confirm any conclusions drawn from such a risky design and analysis.

Example 10.5
Perform the analysis of the 2^(5–1)_V design formed from the 16 runs of the 32-run
experiment in Example 9.10, using the generator 5 = 1234. How well does the half-fractional
factorial design duplicate the results of the full-factorial experiment?
Solution: The 16 runs of the original 32-run experiment were copied into a new
MINITAB worksheet and the analysis was performed using Stat> Regression> Regres-
sion. MINITAB’s Session window output (after some minor edits) is shown in Figure
10.7. The regression analysis is incomplete because the experiment is saturated—all of
the available degrees of freedom are consumed by the model. In order to distinguish
significant regression coefficients from insignificant ones, a normal probability plot of

Data Display

Row StdOrder RunOrder Y A B C D E


1 14 2 150 -1 1 1 -1 1
2 15 3 284 -1 1 1 1 -1
3 29 5 287 1 1 1 -1 -1
4 2 6 149 -1 -1 -1 -1 1
5 23 7 53 1 -1 1 1 -1
6 20 11 76 1 -1 -1 1 1
7 22 14 -32 1 -1 1 -1 1
8 3 15 142 -1 -1 -1 1 -1
9 17 16 121 1 -1 -1 -1 -1
10 8 17 -43 -1 -1 1 1 1
11 32 18 200 1 1 1 1 1
12 5 21 1 -1 -1 1 -1 -1
13 26 23 187 1 1 -1 -1 1
14 12 25 233 -1 1 -1 1 1
15 27 29 207 1 1 -1 1 -1
16 9 32 266 -1 1 -1 -1 -1

Correlations: A, B, C, D, E, AB, AC, AD, AE, BC, BD, BE, CD, CE, DE

A B C D E AB AC AD AE BC BD BE CD CE
B 0
C 0 0
D 0 0 0
E 0 0 0 0
AB 0 0 0 0 0
AC 0 0 0 0 0 0
AD 0 0 0 0 0 0 0
AE 0 0 0 0 0 0 0 0
BC 0 0 0 0 0 0 0 0 0
BD 0 0 0 0 0 0 0 0 0 0
BE 0 0 0 0 0 0 0 0 0 0 0
CD 0 0 0 0 0 0 0 0 0 0 0 0
CE 0 0 0 0 0 0 0 0 0 0 0 0 0
DE 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Figure 10.7 Analysis of a 2^(5–1)_V saturated experiment.

Regression Analysis: Y versus A, B, . . .

The regression equation is


Y = 143 - 5.19 A + 84.2 B - 30.1 C + 1.44 D - 27.6 E - 1.31 AB + 19.7 AC
- 4.81 AD - 2.06 AE + 33.6 BC + 2.81 BD - 6.69 BE + 9.56 CD - 16.2 CE
+ 0.0625 DE

Predictor Coef SE Coef T P


Constant 142.563 0.000 * *
A -5.18750 0.00000 * *
B 84.1875 0.0000 * *
C -30.0625 0.0000 * *
D 1.43750 0.00000 * *
E -27.5625 0.0000 * *
AB -1.31250 0.00000 * *
AC 19.6875 0.0000 * *
AD -4.81250 0.00000 * *
AE -2.06250 0.00000 * *
BC 33.5625 0.0000 * *
BD 2.81250 0.00000 * *
BE -6.68750 0.00000 * *
CD 9.56250 0.00000 * *
CE -16.1875 0.0000 * *
DE 0.0625000 0.0000000 * *

S = *

Analysis of Variance

Source DF SS MS F P
Regression 15 171667.9 11444.5 * *
Residual Error 0 * *
Total 15 171667.9

Source DF Seq SS
A 1 430.6
B 1 113400.6
C 1 14460.1
D 1 33.1
E 1 12155.1
AB 1 27.6
AC 1 6201.6
AD 1 370.6
AE 1 68.1
BC 1 18023.1
BD 1 126.6
BE 1 715.6
CD 1 1463.1
CE 1 4192.6
DE 1 0.1

the regression coefficients was created. This plot is shown in Figure 10.8. (The plot
was created by copying the regression coefficients and their associated terms/labels
from the Session window into a MINITAB worksheet. Then the regression coefficients
were plotted with the custom plotnorm macro using the label subcommand.) The
plot indicates that there are many terms of near-zero magnitude but that B, C, E, AC,
BC, CE, and perhaps CD are outliers that should be retained in the regression model.
To preserve model hierarchy, the main effects A and D also need to be retained in
the model.
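
The coefficients and their plotting positions can also be computed directly from the runs listed in Figure 10.7. The Python sketch below is offered as an illustration only and is not the plotnorm macro: the function names are hypothetical, the coefficient formula (sum of column times response divided by the number of runs) is valid here because all fifteen model columns are mutually orthogonal, and the plotting positions (rank – 0.375)/(m + 0.25) are one common convention that may differ from the one MINITAB uses.

```python
from itertools import combinations
from statistics import NormalDist

# (Y, A, B, C, D, E) for the 16 runs listed in Figure 10.7.
RUNS = [
    (150, -1, 1, 1, -1, 1), (284, -1, 1, 1, 1, -1), (287, 1, 1, 1, -1, -1),
    (149, -1, -1, -1, -1, 1), (53, 1, -1, 1, 1, -1), (76, 1, -1, -1, 1, 1),
    (-32, 1, -1, 1, -1, 1), (142, -1, -1, -1, 1, -1), (121, 1, -1, -1, -1, -1),
    (-43, -1, -1, 1, 1, 1), (200, 1, 1, 1, 1, 1), (1, -1, -1, 1, -1, -1),
    (187, 1, 1, -1, -1, 1), (233, -1, 1, -1, 1, 1), (207, 1, 1, -1, 1, -1),
    (266, -1, 1, -1, -1, -1),
]
NAMES = "ABCDE"


def coefficients(runs):
    """Main-effect and two-factor interaction coefficients for orthogonal columns."""
    n = len(runs)
    coefs = {}
    for i, name in enumerate(NAMES):
        coefs[name] = sum(r[0] * r[i + 1] for r in runs) / n
    for i, j in combinations(range(5), 2):
        coefs[NAMES[i] + NAMES[j]] = sum(r[0] * r[i + 1] * r[j + 1] for r in runs) / n
    return coefs


def normal_scores(coefs):
    """Pair each coefficient, sorted ascending, with an approximate normal quantile."""
    ordered = sorted(coefs.items(), key=lambda kv: kv[1])
    m = len(ordered)
    return [(name, value, NormalDist().inv_cdf((rank - 0.375) / (m + 0.25)))
            for rank, (name, value) in enumerate(ordered, start=1)]


if __name__ == "__main__":
    for name, value, z in normal_scores(coefficients(RUNS)):
        print(f"{name:>2} {value:9.4f} {z:6.2f}")
```

Terms whose coefficients fall far from the straight line through the near-zero cluster are the candidates to retain; sorting by absolute value gives a quick, if cruder, version of the same screening.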

[Plot: regression coefficient (vertical axis) versus normal probability (horizontal axis,
0.10 to 0.90), with each point labeled by its model term.]

Figure 10.8 Normal plot of the regression coefficients from a saturated experiment.

The refined model is shown in Figure 10.9. The analysis shows that the CD inter-
action is barely significant (p = 0.041) so that dropping it and the insignificant D main
effect (p = 0.710) might not be a serious compromise. But the AC interaction is highly
significant (p = 0.002), so it and the A main effect should be retained in the model.
Comparison of this refined model and the refined model determined from the analysis
of the full-factorial experiment in Example 9.10 shows that they are substantially the
same with comparable regression coefficients. Regression diagnostics for the model in
Figure 10.9 (not shown) indicate that the residuals are normally distributed and
homoscedastic as required by the analysis method. This example clearly shows that the
16-run 2^(5–1)_V design delivers substantially the same information as the 32-run 2^5
experiment even though the 2^(5–1)_V experiment is saturated.

10.7 INTERPRETATION OF FRACTIONAL FACTORIAL DESIGNS

10.7.1 Resolution V Designs

Of the fractional factorial designs, designs of resolution V and higher are the easiest to
interpret. Resolution V designs confound main effects with four-factor interactions, and
two-factor interactions with three-factor interactions. This means that the model for a
resolution V design can contain all of the main effects and two-factor interactions so that
the usual methods of Chapter 9 can be used to analyze the data. As long as the assump-
tion that three-factor and higher-order interactions are insignificant is true, resolution V
designs should provide a safe model. In the absence of hard evidence that three-factor
and higher order interactions are insignificant, we rely on Occam to protect us.
In my experience, based on many years of building about one experiment a week,
I’ve encountered only a handful of experiments where I detected a significant
three-factor interaction. In each case, the magnitude of the three-factor interaction was

Regression Analysis: Y versus A, B, C, D, E, AC, BC, CD, CE

The regression equation is


Y = 143 - 5.19 A + 84.2 B - 30.1 C + 1.44 D - 27.6 E + 19.7 AC + 33.6 BC
+ 9.56 CD - 16.2 CE

Predictor Coef SE Coef T P


Constant 142.563 3.692 38.62 0.000
A -5.187 3.692 -1.41 0.210
B 84.187 3.692 22.80 0.000
C -30.063 3.692 -8.14 0.000
D 1.438 3.692 0.39 0.710
E -27.562 3.692 -7.47 0.000
AC 19.688 3.692 5.33 0.002
BC 33.563 3.692 9.09 0.000
CD 9.563 3.692 2.59 0.041
CE -16.188 3.692 -4.38 0.005

S = 14.77 R-Sq = 99.2% R-Sq(adj) = 98.1%

Analysis of Variance

Source DF SS MS F P
Regression 9 170360 18929 86.80 0.000
Residual Error 6 1308 218
Total 15 171668

Source DF Seq SS
A 1 431
B 1 113401
C 1 14460
D 1 33
E 1 12155
AC 1 6202
BC 1 18023
CD 1 1463
CE 1 4193

Figure 10.9 Refined model from the 2^(5–1)_V saturated experiment.

relatively small so that it wasn’t necessary to worry about or bother with it. It was also
so hard to imagine a physical mechanism that might cause the three-factor interaction
that it was most likely a Type 1 error—an artifact of the particular data set. Further-
more, I never wanted to even attempt to try to explain the significance of a three-factor
interaction—especially a weak one—to anyone. The point is that Occam and experi-
ence both suggest that three-factor and higher-order interactions are rare so it is gener-
ally safe to ignore them.

10.7.2 Resolution IV Designs


Resolution IV designs confound main effects with three-factor interactions, and two-
factor interactions with other two-factor interactions. Since three-factor and higher-order
interactions should be rare, we can expect to safely recover the main effects; however,
the confounding between two-factor interactions can present a problem. A first choice
has to be made to decide which two-factor interactions should be included in the model.
In the absence of any prior knowledge of which interactions might be significant, the
choice is arbitrary. For each set of confounded interactions, only one of them can be
included in the model. When a two-factor interaction is found to be significant, it is up
to us to decide which interaction or interactions of each set to attribute the effect to.
Occam’s razor and the concept of effect heredity can provide some guidance for
deciding which confounded terms should be retained in a model. Generally, if two
variables are going to have a significant two-factor interaction then both or at least
one of their main effects should be substantial. Consequently, by comparing the list
of significant main effects to the pair of two-factor interactions that might be the
cause of the effect, it’s often possible to rule out one of the pair of confounded
two-factor interactions.
Despite these difficulties, the good news is that even though only one of each pair
of confounded interactions can be included in the model, the effects of both interactions
will still be accounted for by that one term. The bad news is that when we can’t be cer-
tain which of the confounded terms is the real cause of the observed effect, it will be
necessary to build a follow-up experiment that resolves the ambiguity.

Example 10.6
An experiment was performed using a 2^(4–1)_IV design with generator 4 = 123. A model
including the main effects and three of the six possible two-factor interactions was fitted
to the response:

y = b0 + b1 x1 + b2 x2 + b3 x3 + b12 x12 + b13 x13 + b14 x14

The model showed that only coefficients b2, b3, and b14 were significant. Describe how
the model should be refined.
Solution: It doesn’t make sense that variables x1 and x4 would be insignificant by
themselves but have a significant two-factor interaction. It’s more likely that the effect
attributed to x14 is actually due to x23 with which it is confounded. This suggests that the
model should actually be:

y = b0 + b2 x2 + b3 x3 + b23 x23

This is significantly simpler and makes much more sense. Of course in any future exper-
iments, it would be a good idea to try to resolve this issue and pin down the true cause
of the interaction: x14 or x23.
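
The heredity argument used in this example can be written as a small decision rule. The Python sketch below is a simplified illustration, not a substitute for engineering judgment or a follow-up experiment: the ALIASES table lists the two-factor confounding pairs of the design with generator 4 = 123, and the helper names are hypothetical.

```python
# Two-factor interaction aliases for the half-fraction with generator 4 = 123.
ALIASES = {"12": "34", "13": "24", "14": "23",
           "34": "12", "24": "13", "23": "14"}


def parents_significant(interaction, significant_mains):
    """Count how many of the interaction's parent main effects are significant."""
    return sum(1 for factor in interaction if factor in significant_mains)


def prefer_by_heredity(interaction, significant_mains):
    """Of an interaction and its alias, return the one with more significant parents.

    If the two are tied, the interaction that was fitted is kept, since the
    data alone cannot separate them.
    """
    partner = ALIASES[interaction]
    if (parents_significant(partner, significant_mains)
            > parents_significant(interaction, significant_mains)):
        return partner
    return interaction


if __name__ == "__main__":
    # Example 10.6: b2 and b3 are significant and the fitted x14 term is significant.
    print(prefer_by_heredity("14", {"2", "3"}))
```

A tie, for example when exactly one parent of each interaction is significant, is a signal that the ambiguity has to be resolved by prior knowledge or additional runs.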

Sometimes there is an opportunity to limit the risks associated with confounding by
carefully selecting the terms that are confounded with each other when you design an
experiment. If you have prior experience with a system, knowledge of applicable first
principles, or just an accurate opinion about which variables are likely to interact with
each other, you may be able to structure the confounding so that terms that are expected
to be significant are confounded with terms that are not expected to be significant. That
way, when a significant interaction term is detected, it should be easier to decide which
of the confounded terms is the real cause of the observed effect. Although this trick
might appear to be useful, it is relatively rare that sufficient information of the neces-
sary accuracy is available to make it effective.

Example 10.7
An experiment is planned using a 2^(6–2)_IV design. The design variables are operator,
machine, part design, temperature, lubricant, and assembly torque. Prior experience
suggests that there will be interactions between: operator and machine, part design and
lubricant, part design and assembly torque, and lubricant and assembly torque. No
other interactions are expected. How should the variables be assigned to resolve the
expected significant interactions?
Solution: If the $2^{6-2}_{IV}$ design uses the generators E = ABC and F = BCD, then the
defining relation is I = ABCE = BCDF = ADEF and the implied confounding relations
between two-factor interactions are: AB = CE, AC = BE, AD = EF, AE = BC = DF,
AF = DE, BD = CF, and BF = CD. The four-factor and higher-order interactions that are
also confounded with these terms are assumed to be insignificant.
If the variables are assigned in the order given in the problem statement, then the
operator/machine interaction (AB) will be confounded with the part design/lubricant
interaction (CE), which is not acceptable. Under the alternative assignment A: Operator,
B: Machine, C: Design, D: Lubricant, E: Temperature, and F: Torque, each suspected
significant two-factor interaction (AB, CD, CF, and DF) is paired with one or two
insignificant ones.
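
The confounding check in Example 10.7 can be automated. The following is a minimal sketch, not MINITAB's method: it multiplies each two-factor interaction by the words of the defining relation implied by E = ABC and F = BCD and reports any pair of expected interactions that end up confounded with each other. The function and variable names are illustrative assumptions.

```python
from itertools import combinations

def multiply(word_a, word_b):
    """Multiply two effect words; repeated letters cancel (A*A = I)."""
    return "".join(sorted(set(word_a) ^ set(word_b)))

# Generators E = ABC and F = BCD give the defining words ABCE and BCDF;
# their product completes the defining relation.
words = ["ABCE", "BCDF"]
words.append(multiply(words[0], words[1]))

def two_factor_aliases(term):
    """Two-factor interactions confounded with the given two-factor term."""
    aliases = [multiply(term, w) for w in words]
    return sorted(a for a in aliases if len(a) == 2)

alias_map = {"".join(p): two_factor_aliases("".join(p))
             for p in combinations("ABCDEF", 2)}

def clashes(expected):
    """Pairs of expected interactions that are confounded with each other."""
    return [(t, a) for t in expected for a in alias_map[t]
            if a in expected and t < a]

# Problem order: A operator, B machine, C design, D temperature,
# E lubricant, F torque.
problem_order = ["AB", "CE", "CF", "EF"]
# Alternative assignment: D lubricant, E temperature.
alternative = ["AB", "CD", "CF", "DF"]

print("problem order clashes:", clashes(problem_order))
print("alternative clashes:  ", clashes(alternative))
```

An empty list for an assignment means that no two of the expected interactions share an alias chain, which is the property the example is looking for.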

Example 10.8
When ultrasonic (acoustic) energy causes gas bubbles in a liquid to resonate, the
gas bubbles can collapse and form a light-emitting plasma. This process, called sono-
luminescence, has applications in the dissolution of materials and possibly in cold
fusion. An experiment was performed to identify the most important variables in a
device designed to produce sonoluminescence and determine the variable settings that
maximize the sonoluminescent light intensity.* Seven variables from a very long list of
variables were selected for the study. Due to time and cost limitations the experiment
was limited to one replicate of a $2^{7-3}_{IV}$ design. The seven variables and their levels are
shown in Table 10.14. The sixteen runs of the experiment were performed in completely
random order. The data are shown in Table 10.15. Analyze the experimental data and
try to refine the model. Use the refined model to determine the settings of the variables
that maximize the intensity response.
Solution: The experimental design and response data were entered into a MINITAB
worksheet. Then the experiment design was specified to MINITAB using Stat> DOE>
Factorial> Define Custom Factorial Design. To get a preliminary view of the data,
main effects and interaction plots were created using Stat> DOE> Factorial> Factorial
Plots. The main effects plots are shown in Figure 10.10 and the interaction plots are

* Eva Wilcox and Ken Inn, NIST Physics Laboratory, 1999, www.itl.nist.gov/div898/handbook/pri/section6/
pri621.htm.

Table 10.14 Variables and their levels for the NIST sonoluminescence study.
Variable             –1          +1         Units   Description
x1: Molarity         0.10        0.33       mole    Amount of the solute
x2: Solute type      Sugar       Glycerol   NA      Type of solute
x3: pH               3           11         NA      pH of the solution
x4: Gas type         Helium      Air        NA      Type of gas dissolved in the water
x5: Water depth      Half        Full       NA      Depth of the water in the flask
x6: Horn depth       5           10         mm      Depth of the ultrasonic horn in the solution
x7: Flask clamping   Unclamped   Clamped    NA      Method of clamping the flask

Table 10.15 Experimental data from NIST sonoluminescence experiment.
Std Run x1 x2 x3 x4 x5 x6 x7 Y: Intensity
1 15 – – – – – – – 80.6
2 4 + – – – – + + 66.1
3 3 – + – – + – + 59.1
4 10 + + – – + + – 68.9
5 9 – – + – + + + 75.1
6 1 + – + – + – – 373.8
7 14 – + + – – + – 66.8
8 11 + + + – – – + 79.6
9 5 – – – + + + – 114.3
10 8 + – – + + – + 84.1
11 2 – + – + – + + 68.4
12 16 + + – + – – – 88.1
13 12 – – + + – – + 78.1
14 6 + – + + – + – 327.2
15 13 – + + + + – – 77.6
16 7 + + + + + + + 61.9

[Main effects plot: mean response y (vertical axis, about 70 to 150) at the –1 and +1 levels of each of x1 through x7.]

Figure 10.10 Main effects plot from the NIST sonoluminescence experiment.

[Interaction plot matrix: mean response (vertical axes, about 100 to 200) for each pair of variables x1 through x7 at their –1 and +1 levels.]

Figure 10.11 Interaction effects plot from the NIST sonoluminescence experiment.

shown in Figure 10.11. The main effects plots suggest that variables x1, x2, x3, and x7 have
much stronger effects than do x4, x5, and x6. The diverging line segments in the interac-
tion plots suggest that there are significant interactions due to x12, x13, x17, x23, x27, x37, x45,
x46, and x56; however, since two-factor interactions are confounded with other two-factor
interactions, a shorter list of interactions might explain these observations.
The experiment was analyzed using Stat> DOE> Factorial> Analyze Factorial
Design. The output from MINITAB is shown in Figure 10.12. All seven of the main
effects and the twenty-one two-factor interactions were entered into the model; however,
MINITAB recognized that some of the two-factor interactions were confounded with
other two-factor interactions and it was only able to retain seven of them in the model.
From the confounding relations shown in the MINITAB output we can see that the first
terms from each set of three confounded terms were the ones that MINITAB retained in
the model.
Figure 10.12 shows that there are several terms in the model with p ≈ 0.05 but
there is only a single degree of freedom for the error estimate. Some of the surviving
interactions are not significant and can be dropped from the model. This simplifies the
model and frees up degrees of freedom to improve the error estimate. Three of the inter-
actions, x12, x13, and x17, have nearly significant p values and should be retained in the
model, at least until the error estimate is improved. From the confounding relations it
can be seen that these three interactions are sufficient to explain all of the interactions
that appear to be significant in Figure 10.11. Then it becomes apparent that all of the

Alias Information for Terms in the Model.

Totally confounded terms were removed from the analysis.

x1*x2 + x3*x7 + x5*x6
x1*x3 + x2*x7 + x4*x6
x1*x4 + x3*x6 + x5*x7
x1*x5 + x2*x6 + x4*x7
x1*x6 + x2*x5 + x3*x4
x1*x7 + x2*x3 + x4*x5
x2*x4 + x3*x5 + x6*x7

Fractional Factorial Fit: Y versus x1, x2, x3, x4, x5, x6, x7

Estimated Effects and Coefficients for Y (coded units)

Term        Effect     Coef  SE Coef       T      P
Constant             110.61    2.919   37.90  0.017
x1           66.21    33.11    2.919   11.34  0.056
x2          -78.61   -39.31    2.919  -13.47  0.047
x3           63.81    31.91    2.919   10.93  0.058
x4            3.71     1.86    2.919    0.64  0.639
x5            7.49     3.74    2.919    1.28  0.422
x6           -9.04    -4.52    2.919   -1.55  0.365
x7          -78.11   -39.06    2.919  -13.38  0.047
x1*x2       -59.56   -29.78    2.919  -10.20  0.062
x1*x3        70.01    35.01    2.919   11.99  0.053
x1*x4       -10.49    -5.24    2.919   -1.80  0.323
x1*x5        -0.56    -0.28    2.919   -0.10  0.939
x1*x6       -16.34    -8.17    2.919   -2.80  0.218
x1*x7       -63.46   -31.73    2.919  -10.87  0.058
x2*x4         1.69     0.84    2.919    0.29  0.821

Analysis of Variance for Y (coded units)

Source               DF  Seq SS   Adj SS   Adj MS      F      P
Main Effects          7   83557  83556.6  11936.7  87.57  0.082
2-Way Interactions    7   51428  51428.0   7346.9  53.90  0.105
Residual Error        1     136    136.3    136.3
Total                15  135121

Figure 10.12 Analysis of the NIST sonoluminescence data.

main effects and two-factor interactions involving x4, x5, and x6 can be dropped from the
model without compromising its quality. That is, a simplified model involving x1, x2, x3,
and x7 and their interactions explains most of the variation accounted for by the model
involving all seven variables and their interactions. Remembering Occam, the former
model with four predictor variables is preferred.
The refined model, including only x1, x2, x3, x7, x12, x13, and x17, is shown in Figure
10.13. Residuals diagnostic plots (not shown) indicate that the residuals are normal
and homoscedastic with respect to the run order, the fitted values, and the predictors
used in the model. All of the terms in the model are highly significant (MINITAB reports p = 0.000, that is, p < 0.0005).
By inspecting the signs of the model’s regression coefficients, the settings (x1, x2, x3,
x7) = (1, –1, 1, –1) should maximize the intensity response. Under these settings, the pre-
dicted value of the response is:

Fractional Factorial Fit: Y versus x1, x2, x3, x7

Estimated Effects and Coefficients for Y (coded units)

Term        Effect     Coef  SE Coef      T      P
Constant             110.61    4.204  26.31  0.000
x1           66.21    33.11    4.204   7.87  0.000
x2          -78.61   -39.31    4.204  -9.35  0.000
x3           63.81    31.91    4.204   7.59  0.000
x7          -78.11   -39.06    4.204  -9.29  0.000
x1*x2       -59.56   -29.78    4.204  -7.08  0.000
x1*x3        70.01    35.01    4.204   8.33  0.000
x1*x7       -63.46   -31.73    4.204  -7.55  0.000

Analysis of Variance for Y (coded units)

Source               DF  Seq SS  Adj SS   Adj MS      F      P
Main Effects          4   82950   82950  20737.6  73.32  0.000
2-Way Interactions    3   49908   49908  16635.9  58.82  0.000
Residual Error        8    2263    2263    282.8
  Pure Error          8    2263    2263    282.8
Total                15  135121

Alias Structure

I + x1*x2*x3*x7
x1 + x2*x3*x7
x2 + x1*x3*x7
x3 + x1*x2*x7
x7 + x1*x2*x3
x1*x2 + x3*x7
x1*x3 + x2*x7
x1*x7 + x2*x3

Figure 10.13 Refined model for NIST sonoluminescence data.

$$
\begin{aligned}
Y(x_1, x_2, x_3, x_7) &= 110.6 + 33.1x_1 - 39.3x_2 + 31.9x_3 - 39.1x_7 - 29.8x_{12} + 35.0x_{13} - 31.7x_{17} \\
Y(1, -1, 1, -1) &= 110.6 + 33.1(1) - 39.3(-1) + 31.9(1) - 39.1(-1) - 29.8(1)(-1) \\
&\quad + 35.0(1)(1) - 31.7(1)(-1) \\
&= 350.5
\end{aligned}
$$

This predicted value is consistent with the values of the two observations from the data
set (373.8, 327.2) that were taken using these settings.
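
As a cross-check on the MINITAB output, the coefficients of the refined model can be recomputed directly from the data in Table 10.15. The sketch below is an illustration rather than MINITAB's algorithm: it relies on the fact that in a balanced two-level design with coded ±1 levels each coefficient is the average of the signed responses. All function and variable names are assumptions made for the example.

```python
# Rows of Table 10.15 in standard order: signs of x1..x7, then the response.
data = [
    ("-------", 80.6), ("+----++", 66.1), ("-+--+-+", 59.1), ("++--++-", 68.9),
    ("--+-+++", 75.1), ("+-+-+--", 373.8), ("-++--+-", 66.8), ("+++---+", 79.6),
    ("---+++-", 114.3), ("+--++-+", 84.1), ("-+-+-++", 68.4), ("++-+---", 88.1),
    ("--++--+", 78.1), ("+-++-+-", 327.2), ("-++++--", 77.6), ("+++++++", 61.9),
]
X = [[1 if c == "+" else -1 for c in signs] for signs, _ in data]
y = [resp for _, resp in data]
n = len(y)

def column(*factors):
    """Product of the coded columns for the given 1-based factor numbers."""
    col = [1] * n
    for f in factors:
        col = [c * row[f - 1] for c, row in zip(col, X)]
    return col

def coef(*factors):
    """Coefficient in coded units (half of the effect); no factors gives the mean."""
    return sum(c * yi for c, yi in zip(column(*factors), y)) / n

# Terms of the refined model: constant, x1, x2, x3, x7, x1*x2, x1*x3, x1*x7.
terms = [(), (1,), (2,), (3,), (7,), (1, 2), (1, 3), (1, 7)]
coefs = {t: coef(*t) for t in terms}

def predict(x1, x2, x3, x7):
    """Predicted response of the refined model at the given coded settings."""
    level = {1: x1, 2: x2, 3: x3, 7: x7}
    total = 0.0
    for t, b in coefs.items():
        prod = 1
        for f in t:
            prod *= level[f]
        total += b * prod
    return total

for t, b in coefs.items():
    print(t, round(b, 2))
print("prediction at (1, -1, 1, -1):", round(predict(1, -1, 1, -1), 1))
```

Because x7 = x1x2x3 in these data, the eight terms of the refined model saturate the eight distinct combinations of x1, x2, x3, and x7, so the prediction at any of those settings is the mean of the two observations taken there.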
The adjusted coefficient of determination of the refined model is:

$$
r^2_{adj} = 1 - \frac{df_{total}\, SS_{\varepsilon}}{df_{\varepsilon}\, SS_{total}}
= 1 - \frac{15 \times 2263}{8 \times 135121}
= 0.969
$$

and the standard error of the model is:

$$s_{\varepsilon} = \sqrt{MS_{\varepsilon}} = \sqrt{282.8} = 16.8$$
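
Both summary statistics can be reproduced from the sums of squares and degrees of freedom in the ANOVA table of Figure 10.13. This is a small illustrative sketch; the variable names are assumptions.

```python
import math

# Values read from the ANOVA table in Figure 10.13.
ss_error, df_error = 2263.0, 8
ss_total, df_total = 135121.0, 15

ms_error = ss_error / df_error
r2_adj = 1 - (df_total * ss_error) / (df_error * ss_total)
s_error = math.sqrt(ms_error)

print("adjusted r^2:", round(r2_adj, 3))
print("standard error:", round(s_error, 1))
```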

10.7.3 Resolution III Designs


Resolution III designs are considerably harder to interpret than designs of higher
resolution. Consider a model for a $2^{3-1}_{III}$ design that can only include main effects:

$$y = b_0 + b_1 x_1 + b_2 x_2 + b_3 x_3 \tag{10.10}$$

The confounding relations are: 1 = 23, 2 = 13, and 3 = 12. Suppose all three main effects
are found to be significant. What’s really going on here? The true behavior of the system
might indeed be as in Equation 10.10 or it could be any one of the following:

$$y = b_0 + b_1 x_1 + b_2 x_2 + b_{12} x_{12} \tag{10.11}$$

$$y = b_0 + b_1 x_1 + b_3 x_3 + b_{13} x_{13} \tag{10.12}$$

$$y = b_0 + b_2 x_2 + b_3 x_3 + b_{23} x_{23} \tag{10.13}$$

Occam’s guidance is useless here because it’s unclear if the main effects–only model is
more likely than one of the two-variable models with an interaction. Without any other
knowledge about the effects of the variables and their interactions there’s no reason to
pick one of these models over the others—they are all equally likely. Although all
resolution III designs suffer from this ambiguity, they are still used for screening exper-
iments with many variables when only a few of the variables are expected to be signif-
icant. If, from the beginning, most of the variables in an experiment are expected to be
important then a design of resolution IV or higher should be used.
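
The ambiguity can be seen by writing out the four runs of the half fraction. In the sketch below, which assumes the generator 3 = 12 and uses made-up coefficient values purely for illustration, the main effects–only model of Equation 10.10 and the interaction model of Equation 10.11 give identical predictions at every run, so the data cannot tell them apart.

```python
from itertools import product

# Half fraction of the 2^3 design built with the generator 3 = 12.
runs = [(x1, x2, x1 * x2) for x1, x2 in product((-1, 1), repeat=2)]

# Each main effect column equals a two-factor interaction column.
confounded = all(x1 == x2 * x3 and x2 == x1 * x3 and x3 == x1 * x2
                 for x1, x2, x3 in runs)

# Hypothetical coefficients, chosen only for the demonstration.
def main_effects_model(x1, x2, x3):
    return 50 + 4 * x1 - 3 * x2 + 6 * x3        # form of Equation 10.10

def interaction_model(x1, x2, x3):
    return 50 + 4 * x1 - 3 * x2 + 6 * x1 * x2   # form of Equation 10.11

same_predictions = all(main_effects_model(*r) == interaction_model(*r)
                       for r in runs)
print(confounded, same_predictions)
```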
Many resolution III designs give ambiguous results that have to be clarified with
one or more follow-up experiments. The type of follow-up experiment required depends
on the results of the original experiment. If only a few variables in the original experi-
ment are found to be significant, then those variables can be used to build a full-factorial
or higher-resolution fractional factorial design. If so many of the variables in the orig-
inal experiment are found to be significant that none of them can be eliminated from
consideration, then the follow-up experiment should be another resolution III design
that is complementary to the original design. This complementary design, called the
fold-over design, is created by inverting the signs in all of the columns of the original
design. When the results from two resolution III fold-over designs are combined, they
always yield a design of resolution IV that provides a better chance of figuring out
which variables and interactions are really important. MINITAB will create the fold-
over design for a fractional factorial experiment if Fold on all factors is selected in the
Create Factorial Design> Options menu.

Example 10.9
A $2^{7-4}_{III}$ experiment was performed using generators 4 = 12, 5 = 13, 6 = 23, and 7 = 123.
When the experimental data were analyzed, the significant variables were found to be
1, 2, 3, and 5. Identify an appropriate follow-up experiment to resolve the ambiguities
from the original experiment.
Solution: Because of the confounding between variables 1, 3, and 5, the following
models might explain the experimental results: y(1, 2, 3, 5), y(1, 2, 3, 13), y(1, 2, 5, 15),
and y(2, 3, 5, 35). (This assumes that three-factor and higher-order interactions are not
significant.) Appropriate follow-up experiments to distinguish between these models are
$2^4$ and $2^{4-1}_{IV}$ designs. The $2^{4-1}_{IV}$ design would be more economical. It is also sufficient to
resolve the ambiguity of the original experiment because there is evidence that two of
the relevant interactions, 12 and 23, are not significant: they are confounded with
variables 4 and 6, neither of which was found to be significant.

Example 10.10
A $2^{7-4}_{III}$ experiment was performed using generators D = AB, E = AC, F = BC, and G =
ABC. When the experimental data were analyzed, all of the variables were found to be
significant. Create the fold-over design and show that when the two designs are combined
they yield a resolution IV design.
Solution: The eight-run $2^{7-4}_{III}$ design was created using Stat> DOE> Factorial>
Create Factorial Design in MINITAB. This design is shown in the first eight rows in
Figure 10.14. Columns A–C contain the base $2^3$ design and the remaining four columns
were generated using the default generators: D = AB, E = AC, F = BC, and G = ABC.
The settings for the fold-over design, shown in rows 9 to 16, were determined by changing
all of the signs of A–G from the original design. The 16-run experiment created
from the two combined eight-run experiments was analyzed (Stat> DOE> Factorial>
Analyze Factorial Design) using a simulated response not shown in the figure. The con-
founding relations from the analysis of the combined designs confirm that the 16-run
experiment is a resolution IV design. Notice that although we say that the design is res-
olution IV, each main effect is confounded with four three-factor interactions and each
two-factor interaction is confounded with two other two-factor interactions. The fold-over
design also could have been created by specifying the $2^{7-4}_{III}$ design and then selecting
Fold on all factors in the Create Factorial Design> Options menu.
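
The resolution claim in Example 10.10 can also be checked numerically without a response. The sketch below, written in Python rather than with MINITAB commands, builds the eight-run design from the generators given in the example, folds it, and measures how strongly the design columns are correlated. The helper names are illustrative assumptions.

```python
from itertools import combinations, product

# Base 2^3 design in A, B, C (standard order) with D = AB, E = AC, F = BC, G = ABC.
original = [(a, b, c, a * b, a * c, b * c, a * b * c)
            for c, b, a in product((-1, 1), repeat=3)]
foldover = [tuple(-x for x in run) for run in original]
combined = original + foldover

def col(design, *idx):
    """Elementwise product of the chosen factor columns."""
    out = [1] * len(design)
    for i in idx:
        out = [o * run[i] for o, run in zip(out, design)]
    return out

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def main_vs_2fi(design):
    """Largest |correlation| between a main effect and a two-factor interaction."""
    n = len(design)
    return max(abs(dot(col(design, m), col(design, i, j))) / n
               for m in range(7) for i, j in combinations(range(7), 2))

def twofi_vs_2fi(design):
    """Largest |correlation| between two different two-factor interactions."""
    pairs = list(combinations(range(7), 2))
    return max(abs(dot(col(design, *p), col(design, *q))) / len(design)
               for p, q in combinations(pairs, 2))

print("original, main vs 2FI:", main_vs_2fi(original))
print("combined, main vs 2FI:", main_vs_2fi(combined))
print("combined, 2FI vs 2FI: ", twofi_vs_2fi(combined))
```

A value of 1 means that two columns are completely confounded and a value of 0 means that they are orthogonal. Main effects that are free of two-factor interactions while some two-factor interactions remain confounded with each other is the signature of a resolution IV design.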

10.7.4 Designs of Resolution VI and Higher


Because of the rarity of higher-order interactions, designs of resolution VI and higher
generally don’t present any serious difficulties in analysis. Main effects will be con-
founded with five-factor or higher-order interactions and two-factor interactions will be
confounded with four-factor or higher-order interactions so both types of terms are very
safe from confounding issues. For the same reason (the rarity of high-order interac-
tions), designs of resolution VI and higher can be used to study three-factor interactions
when such interactions are expected. The resolution V designs are really the threshold

Fractional Factorial Design


Factors: 7 Base Design: 7, 8 Resolution: III
Runs: 8 Replicates: 1 Fraction: 1/16
Blocks: 1 Center pts (total): 0

* NOTE * Some main effects are confounded with two-way interactions.

Design Generators: D = AB, E = AC, F = BC, G = ABC

Alias Structure (up to order 3)


I + ABD + ACE + AFG + BCF + BEG + CDG + DEF
A + BD + CE + FG + BCG + BEF + CDF + DEG
B + AD + CF + EG + ACG + AEF + CDE + DFG
C + AE + BF + DG + ABG + ADF + BDE + EFG
D + AB + CG + EF + ACF + AEG + BCE + BFG
E + AC + BG + DF + ABF + ADG + BCD + CFG
F + AG + BC + DE + ABE + ACD + BDG + CEG
G + AF + BE + CD + ABC + ADE + BDF + CEF

MTB > let c12=-c5
MTB > let c13=-c6
MTB > let c14=-c7
MTB > let c15=-c8
MTB > let c16=-c9
MTB > let c17=-c10
MTB > let c18=-c11
MTB > Stack (c1-c11) (c1-c4 c12-c18) (c1-c11).
MTB > print c1-c18

Data Display
Row Std Run CP Blo A B C D E F G fA fB fC fD fE fF fG
1 1 1 1 1 -1 -1 -1 1 1 1 -1 1 1 1 -1 -1 -1 1
2 2 2 1 1 1 -1 -1 -1 -1 1 1 -1 1 1 1 1 -1 -1
3 3 3 1 1 -1 1 -1 -1 1 -1 1 1 -1 1 1 -1 1 -1
4 4 4 1 1 1 1 -1 1 -1 -1 -1 -1 -1 1 -1 1 1 1
5 5 5 1 1 -1 -1 1 1 -1 -1 1 1 1 -1 -1 1 1 -1
6 6 6 1 1 1 -1 1 -1 1 -1 -1 -1 1 -1 1 -1 1 1
7 7 7 1 1 -1 1 1 -1 -1 1 -1 1 -1 -1 1 1 -1 1
8 8 8 1 1 1 1 1 1 1 1 1 -1 -1 -1 -1 -1 -1 -1

9 1 1 1 1 1 1 1 -1 -1 -1 1
10 2 2 1 1 -1 1 1 1 1 -1 -1
11 3 3 1 1 1 -1 1 1 -1 1 -1
12 4 4 1 1 -1 -1 1 -1 1 1 1
13 5 5 1 1 1 1 -1 -1 1 1 -1
14 6 6 1 1 -1 1 -1 1 -1 1 1
15 7 7 1 1 1 -1 -1 1 1 -1 1
16 8 8 1 1 -1 -1 -1 -1 -1 -1 -1

Alias Structure (up to order 3)


A + B*C*G + B*E*F + C*D*F + D*E*G
B + A*C*G + A*E*F + C*D*E + D*F*G
C + A*B*G + A*D*F + B*D*E + E*F*G
D + A*C*F + A*E*G + B*C*E + B*F*G
E + A*B*F + A*D*G + B*C*D + C*F*G
F + A*B*E + A*C*D + B*D*G + C*E*G
G + A*B*C + A*D*E + B*D*F + C*E*F
A*B + C*G + E*F
A*C + B*G + D*F
A*D + C*F + E*G
A*E + B*F + D*G
A*F + B*E + C*D
A*G + B*C + D*E
B*D + C*E + F*G

Figure 10.14 Original and fold-over $2^{7-4}_{III}$ designs and confounding relations when they
are combined.

between designs that are difficult to interpret—designs of resolution III and IV—and
designs that are very easy to interpret—designs of resolution VI and higher.

10.8 PLACKETT-BURMAN DESIGNS


The Plackett-Burman designs are a special set of highly fractionated two-level factorial
designs. They have an integer multiple of four, for example, 8, 12, 16, . . . , for their
number of runs and can be used to study one variable fewer than the number of runs.
For example, the 12-run Plackett-Burman design can include at most eleven variables.
If fewer than the maximum number of variables are used in a Plackett-Burman experi-
ment then the unused variables are not included in the model and their degrees of free-
dom just contribute to the error estimate.
When all possible n – 1 variables are included in an n-run single-replicate Plackett-
Burman experiment, the experiment design is saturated. That is, the model consumes
all of the available degrees of freedom so there are no remaining degrees of freedom to
estimate the error. The usual analysis strategy in this case is to drop from the model the
variable with the smallest regression coefficient to begin building an error estimate.
Then more weak terms can be dropped from the model one by one until a satisfactory
model is reached. A normal probability plot of the regression coefficients from the ini-
tial model is often helpful in determining which terms to keep and which to drop.
Plackett-Burman designs are resolution III designs—their main effects are con-
founded with two-factor interactions. Each main effect is usually confounded with sev-
eral two-factor interactions. Like other resolution III designs, a Plackett-Burman design
can be folded to create a complementary design that, when combined with the original
design, gives a resolution IV design. Although the resulting experiment is a resolution
IV design, each main effect will be confounded with several three-factor interactions,
and several two-factor interactions will be confounded with each other.
Create a Plackett-Burman design using MINITAB from the Stat> DOE> Factorial>
Create Factorial Design menu using the Plackett-Burman Design option. Analyze the
design using Stat> DOE> Factorial> Analyze Factorial Design. If you want to create
the folded Plackett-Burman design, MINITAB will not do it for you so you will have to
do it yourself. To create the folded design, copy the original matrix of runs into a new
worksheet, invert all of the signs using the Calc> Calculator menu or with let com-
mands (for example, mtb> let c3=-c3), and then copy and append the new runs
onto the original design. You will have to use Stat> DOE> Factorial> Define Custom
Factorial Design to define the experiment in MINITAB so that it will perform the
analysis of the resulting resolution IV design.
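
The same folding procedure can be sketched outside of MINITAB. The example below assumes the commonly tabulated first row (+ + – + + + – – – + –) for the 12-run Plackett-Burman design, builds the remaining rows by cyclic shifts plus a final row of all –1, and then appends the sign-inverted runs. Whether MINITAB generates its 12-run design from this same row is not confirmed here, and the helper names are illustrative.

```python
from itertools import combinations

# Commonly tabulated first row of the 12-run Plackett-Burman design (assumption).
first_row = [1, 1, -1, 1, 1, 1, -1, -1, -1, 1, -1]
k = len(first_row)

# Eleven cyclic shifts of the first row plus a final row of all -1.
design = [first_row[-s:] + first_row[:-s] for s in range(k)]
design.append([-1] * k)

# Fold-over: invert every sign and append the new runs to the original ones.
folded = [[-x for x in run] for run in design]
combined = design + folded

def column(d, *idx):
    """Elementwise product of the chosen factor columns."""
    out = [1] * len(d)
    for i in idx:
        out = [o * run[i] for o, run in zip(out, d)]
    return out

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def max_main_vs_2fi(d):
    """Largest |dot product| between a main effect and a two-factor interaction."""
    return max(abs(dot(column(d, m), column(d, i, j)))
               for m in range(k) for i, j in combinations(range(k), 2))

orthogonal = all(dot(column(design, i), column(design, j)) == 0
                 for i, j in combinations(range(k), 2))
print("main effects orthogonal:", orthogonal)
print("original, main vs 2FI:  ", max_main_vs_2fi(design))
print("combined, main vs 2FI:  ", max_main_vs_2fi(combined))
```

A nonzero value for the original design reflects the confounding of main effects with two-factor interactions, and a zero value for the combined design shows that folding has cleared the main effects of two-factor interactions.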

10.9 SAMPLE-SIZE CALCULATIONS


The sample-size and power calculations for fractional factorial designs are carried out
the same way as they were for the two-level factorial designs as described in Chapter 9.
MINITAB supports calculations for both the fractional factorial designs and the
Plackett-Burman designs. Perform sample-size and power calculations for fractional
factorial designs in MINITAB from the Stat> Power and Sample Size> 2 Level
Factorial Design menu. MINITAB anticipates the model that you will probably use so
leave the Design> Number of terms omitted from model field set to its default zero
value. Perform sample-size and power calculations for Plackett-Burman designs from
the Stat> Power and Sample Size> Plackett-Burman Design menu.

Example 10.11
A screening experiment is to be performed to study five variables in a $2^{5-2}_{III}$ design.
The experiment will have four replicates, which will be built in blocks. The standard
error of the model is expected to be $s_{\varepsilon} = 80$. Use MINITAB to determine the power of
the experiment to detect a difference of $\delta = 100$ between the ±1 levels of the design
variables and then confirm the value of the power by direct calculation.
Solution: The experiment design was created in MINITAB using Stat> DOE>
Factorial> Create Factorial Design and then Stat> Power and Sample Size> 2 Level
Factorial Design was used to perform the power calculation. Figure 10.15 shows
MINITAB’s output from creating the design and calculating the power, and the windows
used to set up the power calculation. MINITAB reports that the power of the experiment
to detect an effect of size $\delta = 100$ is P = 0.9207.

Figure 10.15 Power calculation for $2^{5-2}_{III}$ design.

In order to confirm the power, we need to know how many error degrees of freedom
there will be for the model. MINITAB’s output in the Session window indicates the alias
structure of the experiment. MINITAB will include a term in the model for each of the
main effects, but it will also include terms for the BC and BE interactions and three
terms to account for the four blocks. Then the model will have $df_{model} = 5 + 2 + 3 = 10$
degrees of freedom and there will be $df_{\varepsilon} = 32 - 1 - 10 = 21$ error degrees of freedom.
The F distribution noncentrality parameter will be:

$$
\lambda = \frac{N}{2a}\left(\frac{\delta}{\sigma_{\varepsilon}}\right)^2
= \frac{4(8)}{2(2)}\left(\frac{100}{80}\right)^2
= 12.5
$$

The power is given by the condition $F_{\alpha} = F_{P,\lambda}$ where the central and noncentral F distributions
have one numerator and $df_{\varepsilon} = 21$ denominator degrees of freedom. MINITAB’s
Calc> Probability Distributions> F function was used to obtain the solution:

$$F_{0.05} = 4.3248 = F_{0.9207,\,12.5}$$

which confirms that the power of the experiment is P = 0.9207 for an effect size $\delta = 100$.
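
The direct calculation can also be reproduced without MINITAB. The sketch below is one possible approach, not the method MINITAB uses: with one numerator degree of freedom, the noncentral F variable can be written as $(Z + \sqrt{\lambda})^2 / (V/df_{\varepsilon})$, where Z is standard normal and V is chi-square, so the power is obtained by integrating the normal tail probabilities over the chi-square density with Simpson's rule. The critical value 4.3248 is taken from the text; the function names, integration limit, and step count are assumptions.

```python
import math

def normal_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def chi2_pdf(v, nu):
    """Density of a chi-square variable with nu > 2 degrees of freedom."""
    if v <= 0.0:
        return 0.0
    log_pdf = ((nu / 2 - 1) * math.log(v) - v / 2
               - (nu / 2) * math.log(2.0) - math.lgamma(nu / 2))
    return math.exp(log_pdf)

def power_one_df(f_crit, df_error, lam, upper=400.0, steps=8000):
    """P(noncentral F(1, df_error, lam) > f_crit) by Simpson's rule (steps even)."""
    shift = math.sqrt(lam)

    def integrand(v):
        c = math.sqrt(f_crit * v / df_error)
        tail = normal_cdf(shift - c) + normal_cdf(-shift - c)
        return tail * chi2_pdf(v, df_error)

    h = upper / steps
    total = integrand(0.0) + integrand(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3

n_runs = 4 * 8                      # four replicates of the eight-run design
levels = 2                          # two levels of each design variable
effect, sigma = 100.0, 80.0
lam = n_runs / (2 * levels) * (effect / sigma) ** 2
df_error = n_runs - 1 - (5 + 2 + 3)
power = power_one_df(4.3248, df_error, lam)

print("lambda:", lam, "df_error:", df_error, "power:", round(power, 4))
```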

10.10 DESIGN CONSIDERATIONS FOR FRACTIONAL FACTORIAL EXPERIMENTS

• Use a fractional factorial design as a screening experiment (step 4 of the 11-
step process: preliminary experimentation) before attempting a full-factorial
or more complex experiment. This is a low-risk, systematic way of confirming
that the levels of all of the variables are safe to use.
• Where possible, use higher-resolution designs rather than designs of low
resolution. If necessary, consider removing a variable from an experiment by
holding it fixed to increase the design resolution of an experiment.
• Reserve substantial time and resources for follow-up experiments to resolve the
ambiguities of a low-resolution design.
• Only use resolution III designs if most of the design variables are likely to
be insignificant and if it’s safe to assume that two-factor interactions are not
significant. Otherwise use a design of higher resolution or plan to do a follow-
up experiment to resolve the ambiguities of the resolution III design.
• Combine a resolution III design with its fold-over design to form a design of
resolution IV.
• Add a variable to a $2^4$ design to create a $2^{5-1}_{V}$ design using the same number of
runs. The new variable should be chosen to add little risk to the experiment
and should have a reasonably high probability of being insignificant.
• Build a full-factorial experiment by combining complementary fractions of
fractional factorial designs. Treat the fractional replicates as blocks to test for
and control block effects. When possible, analyze the data as the blocks are
completed and suspend the test early if the experiment is conclusive to conserve
time and resources.
• Be careful which generators you use for a highly fractionated factorial design.
Some sets of generators will allow more model terms to be resolved than other
sets even though both deliver the same design resolution. Experiment designs
that resolve the most model terms are said to be minimum aberration designs.
MINITAB uses minimum aberration designs.
11 Response-Surface Experiments

11.1 INTRODUCTION
The two-level factorial designs of Chapters 9 and 10 provide a powerful set of experi-
ment designs for studying complex responses; however, our collection of designs is
incomplete. Consider the response space shown in Figure 11.1. The horizontal and ver-
tical axes indicate values of independent variables x1 and x2, respectively, and the con-
tours correspond to constant values (10, 20, . . . , 100) of the response. The five squares
in the figure represent five special regions where experiments might be performed.
Some of the regions show supplemental contours to clarify the behavior of the response.
Regions 1 and 2 are each sensitive to only one variable, x1 and x2, respectively. Region
3 is sensitive to both variables but the parallel contours indicate that there is no inter-
action between x1 and x2. The divergence between the response contours in region 4
indicates that, in addition to the main effects of x1 and x2, there is a significant interac-
tion between them. The contours in region 5 indicate that with x2 held constant, the
response increases, reaches a maximum value, and then decreases as x1 increases. The
response also shows curvature with respect to x2 when x1 is held constant.
Two-level factorial and fractional factorial designs are suitable for studying regions
1–4; however, they are not capable of quantifying or even detecting curvature in the
response such as in region 5. The weakness of these designs is due to their use of just
two levels of each design variable. As we saw in Chapter 8, a variable must have at least
three levels in order to fit a model that can resolve curvature in the response. The pur-
pose of this chapter is to present experiment designs that are capable of resolving curva-
ture in the response associated with each design variable. These designs are called
response-surface designs or designs for quadratic models.

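[Contour plot of the response over x1 (horizontal axis) and x2 (vertical axis), with contours labeled from 10 to 100 and five numbered square regions.]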

Figure 11.1 Different regions of experimental interest in a response space.

11.2 TERMS IN QUADRATIC MODELS


If we consider an experiment with just one independent quantitative variable x1, then a
model that includes curvature will take the form:

$$y(x_1) = b_0 + b_1 x_1 + b_{11} x_1^2 \tag{11.1}$$

The term $x_1^2$ may also be written as if it were a two-factor interaction of x1 and x1, or
$x_1^2 = x_{11}$.
The fits provided by a simple linear model and the quadratic model of Equation
11.1 are shown for data with curvature in Figure 11.2. Notice that the coefficients b0 and
b1 will be different in the two models. The quadratic model is clearly superior in this
case although with just three levels of x1 the quadratic model must pass exactly through
the response means at each level of x1.
When three levels of each variable are incorporated into an appropriate experi-
ment design such that curvature due to each variable can be quantified, the model has
the form:

$$y = b_0 + b_1 x_1 + b_2 x_2 + \cdots + b_{12} x_{12} + \cdots + b_{11} x_1^2 + b_{22} x_2^2 + \cdots \tag{11.2}$$

where three-factor and higher-order interactions have been ignored. This equation
defines the response surface, that is, how y depends on x1, x2, . . . , which can be repre-
sented or thought of as a surface in a multidimensional graph. Designs that can deliver
quadratic terms for all of their design variables are called response-surface designs.

[Plot of y versus x1 showing a straight-line fit and a quadratic fit to the same curved data.]

Figure 11.2 Linear and quadratic fits to data with curvature.

Example 11.1
Write out the full model that will be fitted for a four-variable response surface
experiment.
Solution: The full model will include main effects, two-factor interactions, and
squared terms. For four variables the model will be:

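$$
\begin{aligned}
y = {} & b_0 + b_1 x_1 + b_2 x_2 + b_3 x_3 + b_4 x_4 \\
& + b_{12} x_{12} + b_{13} x_{13} + b_{14} x_{14} + b_{23} x_{23} + b_{24} x_{24} + b_{34} x_{34} \\
& + b_{11} x_1^2 + b_{22} x_2^2 + b_{33} x_3^2 + b_{44} x_4^2
\end{aligned}
$$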
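
The same bookkeeping works for any number of variables. As a minimal sketch (the function name and the text format of the terms are illustrative choices), the terms of the full quadratic model can be enumerated as follows; for k variables there are 1 + 2k + k(k – 1)/2 of them.

```python
from itertools import combinations

def quadratic_model_terms(k):
    """Terms of the full quadratic model in k variables."""
    names = [f"x{i}" for i in range(1, k + 1)]
    terms = ["1"]                                             # constant
    terms += names                                            # main effects
    terms += [f"{a}*{b}" for a, b in combinations(names, 2)]  # two-factor interactions
    terms += [f"{a}^2" for a in names]                        # squared terms
    return terms

terms = quadratic_model_terms(4)
print(len(terms))
print(" + ".join(terms))
```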

The magnitudes of the regression coefficients in Equation 11.2 indicate the strength
of the various terms in the model, but the signs of the regression coefficients also play
an important role in determining the general shape of the response surface. Figure 11.3
shows surface plots for six different response surfaces where the response Y depends on
the variables A and B. (The following analysis still applies to problems that involve
more than two design variables but those cannot be easily drawn and are difficult to
describe. In such cases, analyze two variables at a time using the method that follows.)
In each case the variables A and B are considered over the range from –1.5 to +1.5—a
meaningful range for coded variables in a designed experiment. Figure 11.3a shows the
response surface for Y (A, B) = 20 – 5A + 8B, which is just a flat plane. Notice that the
various contours for constant A are all parallel to each other as are the contours for con-
stant B. Figure 11.3b shows the response surface for Y (A, B) = 20 – 5A + 8B + 6AB.
This response surface looks somewhat similar to the previous one except that the plane

[Six surface plots of Y versus A and B, each over the range –1.5 to +1.5:
a: $Y(A, B) = 20 - 5A + 8B$
b: $Y(A, B) = 20 - 5A + 8B + 6AB$
c: $Y(A, B) = 20 - 5A + 8B + 6AB - 32A^2$
d: $Y(A, B) = 20 - 5A + 8B + 6AB - 32A^2 - 20B^2$
e: $Y(A, B) = 20 - 5A + 8B + 6AB + 32A^2 + 20B^2$
f: $Y(A, B) = 20 - 5A + 8B + 6AB - 32A^2 + 20B^2$]

Figure 11.3 Examples of different response surfaces.

is twisted instead of flat. The twisting is caused by the AB interaction term. Notice that
even with the twisting of the plane, all of the contours for constant A and B are still
straight lines; they just aren’t parallel to each other any more. Figure 11.3c shows the
response surface for $Y(A, B) = 20 - 5A + 8B + 6AB - 32A^2$. The quadratic term causes
the curvature in the response surface for changes made in the A direction, but notice that
the B contours are still straight lines because there is no $B^2$ term in Y(A, B). This surface
shows that for a specified value of B there is a unique value of A that maximizes
Y. Figure 11.3d shows the response surface for $Y(A, B) = 20 - 5A + 8B + 6AB - 32A^2 - 20B^2$.
This response surface has downward curvature in both the A and B directions
causing a maximum to occur for Y for a unique choice of A and B. Figure 11.3e shows
the response surface for $Y(A, B) = 20 - 5A + 8B + 6AB + 32A^2 + 20B^2$. The equation
for Y is very similar to the equation from Figure 11.3d but the signs of the quadratic
terms are both positive. This causes the curvature with respect to both A and B to be
upward so there is a minimum in Y for a unique choice of A and B. Figure 11.3f shows
the response surface for $Y(A, B) = 20 - 5A + 8B + 6AB - 32A^2 + 20B^2$. The signs on the
quadratic terms cause this saddle-shaped response surface to be curved downward with
respect to A and curved upward with respect to B. For a fixed value of A there is a value
of B that minimizes Y and for a fixed value of B there is a value of A that maximizes Y.
Designed experiments to study responses that have surfaces shaped as in Figures
11.3a and b need only two levels of each design variable because there is no curvature
in Y due to A or B. Experiments to study responses with surfaces shaped as in Figures
11.3c–f require more elaborate designs with more variable levels to resolve the complex
curvature. These are the response-surface designs.
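
The maximum, minimum, and saddle shapes described for Figures 11.3d through 11.3f can be checked numerically. The following is a minimal sketch, not part of the original text, that assumes NumPy is available; the function name `classify_quadratic` is hypothetical. It locates the stationary point of a two-variable quadratic surface and classifies it from the signs of the eigenvalues of the matrix of second derivatives, assuming that matrix is nonsingular.

```python
import numpy as np

def classify_quadratic(b1, b2, b12, b11, b22):
    """Stationary point and shape of
    Y = b0 + b1*A + b2*B + b12*A*B + b11*A**2 + b22*B**2.
    Assumes the matrix of second derivatives is nonsingular."""
    hessian = np.array([[2.0 * b11, b12],
                        [b12, 2.0 * b22]])
    gradient_at_origin = np.array([b1, b2], dtype=float)
    # Setting the gradient to zero gives: hessian @ point = -gradient_at_origin
    point = np.linalg.solve(hessian, -gradient_at_origin)
    eigenvalues = np.linalg.eigvalsh(hessian)
    if np.all(eigenvalues < 0):
        kind = "maximum"
    elif np.all(eigenvalues > 0):
        kind = "minimum"
    else:
        kind = "saddle"
    return point, kind

# Coefficients (b1, b2, b12, b11, b22) taken from the equations in the text.
surfaces = {
    "11.3d": (-5, 8, 6, -32, -20),
    "11.3e": (-5, 8, 6, 32, 20),
    "11.3f": (-5, 8, 6, -32, 20),
}
for label, coefficients in surfaces.items():
    point, kind = classify_quadratic(*coefficients)
    print(label, kind, np.round(point, 3))
```

The constant term does not affect the location or type of the stationary point, so it is omitted from the function arguments.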

11.3 $2^k$ DESIGNS WITH CENTERS


Center points can be added to the $2^k$ and $2^{k-p}$ designs when all k of the design variables
are quantitative. If the low and high coded levels of each variable are –1 and +1, respectively,
then the center points will have coded level zero for each variable. For example,
the center points in a $2^3$ plus centers experiment will have (x1, x2, x3) = (0, 0, 0). Figure
11.4 shows a three-dimensional drawing of the $2^3$ plus centers design.
There are two reasons for adding center points to a $2^k$ design. First, adding center
points to a design increases the number of error degrees of freedom for the analysis
without unbalancing the design. The additional error degrees of freedom increase the
power of the design to detect small effects, especially when there are few error degrees
of freedom before the center cells are added.

Figure 11.4 $2^3$ plus centers design (three-dimensional drawing with axes X1, X2, and X3).


Table 11.1 Runs of the $2^2$ plus centers design.

Run   $x_1$   $x_2$   $x_{12}$   $x_1^2$   $x_2^2$
1     –       –       +          +         +
2     –       +       –          +         +
3     +       –       –          +         +
4     +       +       +          +         +
5     0       0       0          0         0

The second and usually more important reason to add center points to a $2^k$ design is
to add a third level of each design variable. Although this provides some information
about curvature, unfortunately the information is incomplete. To understand why, consider
the matrix of experimental runs for the $2^2$ plus centers design in Table 11.1.
Notice that the columns for $x_1^2$ and $x_2^2$ are identical, that is, $x_1^2 = x_2^2$. This means that the
quadratic terms in the model are confounded so it will not be possible to include both
of them in the model. The model that can be fitted to the data can have only one generic
quadratic term:
$$y = b_0 + b_1 x_1 + b_2 x_2 + b_{12} x_{12} + b_{**} x_*^2 \tag{11.3}$$

where the * is used to indicate the ambiguity of the source of the quadratic effect. The
model that we really want to fit is:

$$y = b_0 + b_1 x_1 + b_2 x_2 + b_{12} x_{12} + b_{11} x_1^2 + b_{22} x_2^2 \tag{11.4}$$

which resolves the curvature into its two possible sources. When $x_1^2$ and $x_2^2$ are confounded,
the $b_{**}$ coefficient will be a combination of the desired $b_{11}$ and $b_{22}$ coefficients,
that is:

$$b_{**} = b_{11} + b_{22} \tag{11.5}$$

The statistical significance of $b_{**}$ provides a linear lack of fit test for two-level factorial
plus centers designs. If the analysis indicates that $b_{**}$ is not statistically significant then
there is a good chance that there is no curvature in the response. (There’s only a “good
chance” that there’s no curvature because there’s a small chance that $b_{11}$ and $b_{22}$ are both
large and approximately equal in magnitude but opposite in sign so that $b_{**} \approx 0$.) If, however,
$b_{**}$ is statistically significant then one or both terms cause significant curvature in
the response but we cannot tell which variable is responsible. This also means that when
$b_{**}$ is significant, Equation 11.3 cannot be used to predict the response because the
contribution of $b_{**} x_*^2$ to y cannot be determined. (To understand this problem, consider the
case y(x1, x2) = y(0, 1).)
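
The confounding of the two quadratic columns can be confirmed directly from the design matrix. This is an illustrative sketch, assuming NumPy; the variable names are hypothetical. It builds the model matrix for the $2^2$ plus centers design of Table 11.1 and compares the rank of the matrix with one quadratic column against the rank with both.

```python
import numpy as np

# The five runs of the 2^2 plus centers design (Table 11.1).
runs = np.array([[-1, -1], [-1, 1], [1, -1], [1, 1], [0, 0]], dtype=float)
x1, x2 = runs[:, 0], runs[:, 1]

# Columns: intercept, x1, x2, x1*x2, x1^2, x2^2
full_model = np.column_stack([np.ones(len(runs)), x1, x2, x1 * x2, x1**2, x2**2])
one_quadratic = full_model[:, :5]          # drop the x2^2 column

quadratic_columns_identical = np.array_equal(x1**2, x2**2)
rank_full = np.linalg.matrix_rank(full_model)
rank_one_quadratic = np.linalg.matrix_rank(one_quadratic)

print("x1^2 and x2^2 columns identical:", quadratic_columns_identical)
print("columns / rank with both quadratic terms:", full_model.shape[1], rank_full)
print("columns / rank with one quadratic term:", one_quadratic.shape[1], rank_one_quadratic)
```

Because the six-column matrix has rank five, only five coefficients can be estimated, which is why the model is limited to a single generic quadratic term.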
All of the $2^k$ and $2^{k-p}$ plus centers designs suffer from the problem of confounded
quadratic effects. This means that these designs are not true response-surface designs.
Despite this weakness, the $2^k$ and $2^{k-p}$ plus centers designs are still very powerful and
popular. They can also be supplemented with additional runs that permit the quadratic
model terms to be fully resolved. Typically, these experiments are built in two blocks.
The first block consists of a $2^k$ or $2^{k-p}$ plus centers design that is built and checked for
linear lack of fit. If there is evidence of lack of fit, then the second block of supplemental
runs is built and combined with the first block to fully resolve the quadratic
terms. These designs, called central composite designs, are one of several types of very
powerful response-surface designs that we will consider in this chapter.

11.4 $3^k$ FACTORIAL DESIGNS


If a factorial experiment is constructed in k quantitative variables with each variable
appearing at three evenly spaced levels and all possible combinations of levels are used,
then we have a 3 × 3 × . . . × 3 or a $3^k$ factorial experiment. The total number of runs in
the experiment, if no replication is performed, is given by $3^k$. The $3^k$ experiments are not
used very often because they require so many runs. As k increases, the number of runs
$3^k$ grows much faster than the number of terms in the model. For this reason, other more
efficient designs are usually used instead of the $3^k$ designs.
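
The gap between the number of runs and the number of model terms is easy to tabulate. The short sketch below is illustrative only; the function name `quadratic_model_terms` is hypothetical. It counts the coefficients of the full quadratic model (constant, k main effects, k(k – 1)/2 two-factor interactions, and k quadratic terms) and compares that count to $3^k$.

```python
def quadratic_model_terms(k):
    """Coefficients in the full quadratic model, including the constant."""
    main_effects = k
    interactions = k * (k - 1) // 2
    quadratics = k
    return 1 + main_effects + interactions + quadratics

for k in range(2, 7):
    print(f"k = {k}: runs = {3**k:4d}, model terms = {quadratic_model_terms(k):3d}")
```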

Example 11.2
Write out the matrix of experimental runs for the $3^3$ experiment. Draw the design in
three dimensions and describe where the experimental runs fall. Indicate the model that
can be fitted with this design and how the degrees of freedom are distributed.
Solution: The $3^3$ experiment has three variables, each at three levels, and $3^3 = 27$
total runs in one replicate. The experimental runs are indicated in standard order in
Table 11.2. The runs are shown in the three-dimensional drawing in Figure 11.5. There
is a run at every corner of the cube, at the center of every edge, at the middle of every
cube face, and one run at the center of the cube. The model that can be fitted to this
design is the full quadratic model:

$$y = b_0 + b_1 x_1 + b_2 x_2 + b_3 x_3 + b_{12} x_{12} + b_{13} x_{13} + b_{23} x_{23} + b_{11} x_1^2 + b_{22} x_2^2 + b_{33} x_3^2 \tag{11.6}$$

Table 11.2 Table of runs for the $3^3$ experiment design.


Std x1 x2 x3 Std x1 x2 x3 Std x1 x2 x3
1 – – – 10 0 – – 19 + – –
2 – – 0 11 0 – 0 20 + – 0
3 – – + 12 0 – + 21 + – +
4 – 0 – 13 0 0 – 22 + 0 –
5 – 0 0 14 0 0 0 23 + 0 0
6 – 0 + 15 0 0 + 24 + 0 +
7 – + – 16 0 + – 25 + + –
8 – + 0 17 0 + 0 26 + + 0
9 – + + 18 0 + + 27 + + +
Figure 11.5 $3^3$ factorial experiment design (three-dimensional drawing with axes X1, X2, and X3).

If the experiment has only one replicate then the distribution of the degrees of freedom
is $df_{total} = 27 - 1 = 26$, $df_{model} = 9$, and $df_{error} = 26 - 9 = 17$. If some of the model terms
are not significant they can be dropped from the model and their degrees of freedom
used to improve the error estimate.
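
The run matrix and the degrees-of-freedom accounting of Example 11.2 can be reproduced with a few lines of code. This is a sketch using only the Python standard library; the names are hypothetical. A run's position on the cube follows from how many of its coordinates are zero.

```python
from collections import Counter
from itertools import product

# All 27 runs in standard order: x1 changes slowest, x3 fastest (as in Table 11.2).
design = list(product((-1, 0, 1), repeat=3))

# Number of zero coordinates determines where the run sits on the cube.
location = {0: "corner", 1: "edge center", 2: "face center", 3: "cube center"}
counts = Counter(location[run.count(0)] for run in design)

n_runs = len(design)
df_total = n_runs - 1
df_model = 3 + 3 + 3            # main effects + two-factor interactions + quadratics
df_error = df_total - df_model

print(dict(counts))
print("df_total, df_model, df_error =", df_total, df_model, df_error)
```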

11.5 BOX-BEHNKEN DESIGNS


Although the $3^k$ designs deliver a full quadratic model, they aren’t used very often
because there are other designs that are much more efficient. One such family of
designs is the Box-Behnken designs. These designs are essentially fractions of the $3^k$
designs with additional center points to preserve the balance of the design. Table 11.3
shows the matrix of runs for Box-Behnken designs with three to seven variables. The
original Box-Behnken paper describes designs for up to twelve variables (Box and
Behnken 1960).
Table 11.3 uses a special shorthand notation to simplify the presentation of the
experimental runs. In this notation, the matrix of four runs for a $2^2$ experiment is written
(±1 ±1), that is:

(±1 ±1) =
    x1   x2
    –    –
    –    +
    +    –
    +    +
Table 11.3 Box-Behnken design catalog.

BB(3)
x1   x2   x3   Runs
±1   ±1   0    4
±1   0    ±1   4
0    ±1   ±1   4
0    0    0    3
Total Runs     15

BB(4)
Block   x1   x2   x3   x4   Runs
1       ±1   ±1   0    0    4
1       0    0    ±1   ±1   4
1       0    0    0    0    1
2       ±1   0    0    ±1   4
2       0    ±1   ±1   0    4
2       0    0    0    0    1
3       ±1   0    ±1   0    4
3       0    ±1   0    ±1   4
3       0    0    0    0    1
Total Runs                  27

BB(5)
Block   x1   x2   x3   x4   x5   Runs
1       ±1   ±1   0    0    0    4
1       0    0    ±1   ±1   0    4
1       0    ±1   0    0    ±1   4
1       ±1   0    ±1   0    0    4
1       0    0    0    ±1   ±1   4
1       0    0    0    0    0    3
2       0    ±1   ±1   0    0    4
2       ±1   0    0    ±1   0    4
2       0    0    ±1   0    ±1   4
2       ±1   0    0    0    ±1   4
2       0    ±1   0    ±1   0    4
2       0    0    0    0    0    3
Total Runs                       46

BB(6)
x1   x2   x3   x4   x5   x6   Runs
±1   ±1   0    ±1   0    0    8
0    ±1   ±1   0    ±1   0    8
0    0    ±1   ±1   0    ±1   8
±1   0    0    ±1   ±1   0    8
0    ±1   0    0    ±1   ±1   8
±1   0    ±1   0    0    ±1   8
0    0    0    0    0    0    6
Total Runs                    54

BB(7)
x1   x2   x3   x4   x5   x6   x7   Runs
0    0    0    ±1   ±1   ±1   0    8
±1   0    0    0    0    ±1   ±1   8
0    ±1   0    0    ±1   0    ±1   8
±1   ±1   0    ±1   0    0    0    8
0    0    ±1   ±1   0    0    ±1   8
±1   0    ±1   0    ±1   0    0    8
0    ±1   ±1   0    0    ±1   0    8
0    0    0    0    0    0    0    6
Total Runs                         62
where all permutations of the signs are considered. Similarly, the set of four runs
designated by (0 ±1 ±1) corresponds to:

(0 ±1 ±1) =
    x1   x2   x3
    0    –    –
    0    –    +
    0    +    –
    0    +    +
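
The shorthand notation is mechanical enough to expand in code. The sketch below is an illustration only; the marker `PM` and the function `expand_shorthand` are hypothetical names, not something defined by the book or by MINITAB. Each ± entry is replaced by both signs and all sign combinations are generated.

```python
from itertools import product

PM = "+/-1"   # hypothetical marker standing for a ±1 entry in the shorthand

def expand_shorthand(row):
    """Expand one shorthand row, e.g. (0, PM, PM), into its explicit runs."""
    choices = [(-1, 1) if entry == PM else (entry,) for entry in row]
    return list(product(*choices))

print(expand_shorthand((PM, PM)))
print(expand_shorthand((0, PM, PM)))
```

A row with three ± entries, as in the BB(6) and BB(7) designs, expands to eight runs in the same way.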

Example 11.3
Write out the matrix of experimental runs for the three-variable Box-Behnken
experiment. Plot the design in three dimensions and describe where the observations fall.
Write out the model that can be fitted with this design and calculate the degrees of free-
dom for the model and error.
Solution: The matrix of experimental runs for the BB(3) design was determined
from Table 11.3 and is shown in standard order in Table 11.4. The three-dimensional
drawing of the design shown in Figure 11.6 indicates that the experimental runs fall at
the midpoints of the edges of the cube and that there are multiple observations at the
design center. The model that can be fitted with this experiment is the full quadratic model
given in Equation 11.6. The experiment has $df_{total} = 15 - 1 = 14$, $df_{model} = 9$, and
$df_{error} = 14 - 9 = 5$.
This is a significant savings in runs compared to the $3^3$ experiment (15 versus 27).
Furthermore, it’s likely that when the full model is fitted, some of the model terms will
be weak and can be dropped from the model. This simplifies the model and adds
degrees of freedom for error estimation.

Inspection of the Box-Behnken designs in Table 11.3 reveals the rationale used to
determine the experimental runs. These designs are expected to resolve main effects,
interactions, and quadratic terms in the model. If there are k main effects, then there will
be $\binom{k}{2}$ two-factor interactions. These interactions are 12, 13, 14, and so on. Inspection
of Table 11.3 shows that for the smaller designs with k ≤ 5 variables, each noncenter
row of each BB(k) design consists of a four-run $2^2$ factorial experiment that resolves
one of the two-factor interactions while the remaining k – 2 variables are held at their zero
levels. Then center cells are added to the experiment to complete the runs required to
resolve the quadratic terms. The number of center cells is determined by considerations
affecting the estimation of the regression coefficients. This issue is too complex for this
book so just use the number of center cells specified in the design catalog. The larger
Box-Behnken experiments with k > 5 variables use eight-run $2^3$ factorial designs involving
three variables while the other variables are held at their zero levels.

Example 11.4
Explain how the 46 runs of the BB(5) design are determined.
Table 11.4 Table of runs for the Box-Behnken three-variable experiment.

BB(3) in shorthand notation:
x1   x2   x3   Runs
±1   ±1   0    4
±1   0    ±1   4
0    ±1   ±1   4
0    0    0    3

Equivalent matrix of runs in standard order:
Std   x1   x2   x3
1     –    –    0
2     –    +    0
3     +    –    0
4     +    +    0
5     –    0    –
6     –    0    +
7     +    0    –
8     +    0    +
9     0    –    –
10    0    –    +
11    0    +    –
12    0    +    +
13    0    0    0
14    0    0    0
15    0    0    0

Figure 11.6 Box-Behnken three-variable design: BB(3) (three-dimensional drawing with axes X1, X2, and X3).

Solution: The BB(5) design is expected to resolve $\binom{5}{1} = 5$ main effects, $\binom{5}{2} = 10$
two-factor interactions, and five quadratic terms. The 10 two-factor interactions are: AB,
AC, AD, AE, BC, BD, BE, CD, CE, DE. A four-run $2^2$ experiment must be created in
each of these pairs of columns while the remaining three variables are held at their zero
levels. For example, the runs associated with the AB interaction correspond to the row
(±1, ±1, 0, 0, 0) of the BB(5) design matrix in Table 11.3, the runs associated with the
AC interaction correspond to the row (±1, 0, ±1, 0, 0), and the other rows in the matrix
are determined in a similar manner. This means that the design will contain $10 \times 2^2 = 40$
runs plus some center cells to resolve the quadratic terms. The BB(5) matrix calls
for six center runs (0, 0, 0, 0, 0) so the design requires a total of 46 runs.
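
The construction described in Example 11.4 can be sketched in code. This is an illustration under stated assumptions: it applies only to the smaller designs (k ≤ 5) that pair variables two at a time, it ignores blocking and run order, and the function name `box_behnken` is hypothetical. The number of center runs is supplied by the user from the design catalog.

```python
from itertools import combinations, product

def box_behnken(k, n_center):
    """Runs of a BB(k) design for k <= 5: a 2^2 factorial in every pair of
    variables with the remaining variables at zero, plus n_center center runs."""
    runs = []
    for i, j in combinations(range(k), 2):
        for level_i, level_j in product((-1, 1), repeat=2):
            run = [0] * k
            run[i], run[j] = level_i, level_j
            runs.append(tuple(run))
    runs.extend([(0,) * k] * n_center)
    return runs

bb5 = box_behnken(5, n_center=6)
print("BB(5) runs:", len(bb5))
print("BB(3) runs:", len(box_behnken(3, n_center=3)))
print("BB(4) runs:", len(box_behnken(4, n_center=3)))
```

For k = 3 this ordering happens to reproduce the standard order of Table 11.4.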

Box-Behnken experiments tend to be large so it is usually necessary to block them.*


Blocking plans for the smaller Box-Behnken designs (k ≤ 5) are shown in Table 11.3.
The runs of the larger Box-Behnken designs (k ≥ 6) can be broken into two blocks by
splitting each row of eight runs like (±1 ±1 0 ±1 0 0) into two half-fractions of the
implied $2^3$ design. The resulting two sets of four runs are then assigned to different
blocks. The shorthand notation for the Box-Behnken designs breaks down here: there’s
no easy way to indicate how the runs behave and the full matrix of runs is too long to
display. Details about blocking the Box-Behnken designs are given in Box and Behnken
(1960) and MINITAB can create both the unblocked and blocked designs.

11.6 CENTRAL COMPOSITE DESIGNS


Adding center cells to the two-level factorial designs ($2^k$ and $2^{k-p}$) is a good attempt to
give them the ability to account for curvature, but the fix that they provide is incomplete.
Additional runs can be added to these experiments to give them full quadratic
modeling capabilities. These designs are the central composite or Box-Wilson designs
designated CC($2^k$) or CC($2^{k-p}$) where $2^k$ and $2^{k-p}$ indicate the two-level factorial design
that is the basis of the central composite design. The runs that must be added to the
two-level factorial plus centers designs fall at extreme points outside the usual –1 and +1
levels of the two-level factorial part of the design. These points are referred to as star
points. The quantity η (eta) is the distance that the star points are located from the center
of the design. The value used for η depends on the number of points in the two-level
factorial or cube part of the experiment. η is given by:

$$\eta = n_{cube}^{1/4} \tag{11.7}$$

where $n_{cube}$ is the number of points in a single replicate of the $2^k$ or $2^{k-p}$ design. This
condition gives the design a special characteristic called rotatability. Two star points are
added to the original experiment for each variable, one star point at the –η level and one
at the +η level, while all other variables are held constant at their zero level. This gives
the central composite designs five levels of each variable: –η, –1, 0, 1, η.
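
Equation 11.7 is simple to evaluate for the cube sizes that appear in this chapter. The sketch below is illustrative only and the function name `star_distance` is hypothetical.

```python
def star_distance(n_cube):
    """Star point distance from Equation 11.7: the fourth root of the number
    of points in one replicate of the cube (factorial) part of the design."""
    return n_cube ** 0.25

for n_cube in (4, 8, 16, 32, 64):
    print(f"n_cube = {n_cube:2d}: eta = {star_distance(n_cube):.3f}")
```

Note that a fractional factorial cube uses its own run count, so a design built on a sixteen-run half fraction of five variables has the same star distance as one built on the full sixteen-run four-variable factorial.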

* These blocking patterns also provide a way to introduce a qualitative variable into a Box-Behnken experiment. To
allow safe conclusions to be drawn about differences between the levels of the qualitative variable, the runs must be
randomized over all blocks.
In theory, the number of center points in the central composite designs is determined by:

$$n_0 = 4\sqrt{n_{cube}} - 2k + 4 \tag{11.8}$$

although in practice the exact number used can vary. There are two common conditions
used to determine the number of center points. One condition makes the design orthog-
onal and the other condition gives the design a characteristic called uniform precision.
The uniform precision designs provide equal error variance at the design center and at
unit (±1) distance from the design center. The uniform precision designs are the ones
that are usually used.
A catalog of central composite designs is shown in Table 11.5. The catalog is not
complete: in some cases where there are two designs available for the same number of
variables, the smaller of the two designs is shown [for example, CC($2_{VI}^{6-1}$) is shown but
not CC($2^6$)]. The catalog shows the matrix of runs using the shorthand notation for
factorial designs, the number of center points required, the star point coordinates, and the
total number of runs.

Example 11.5
Write out the matrix of experimental runs for the three-variable central composite
design. Sketch the design in three dimensions and describe where the observations fall.
Write out the model that can be fitted with this design and calculate the degrees of free-
dom for the model and error.
Solution: The matrix of runs for the CC($2^3$) design is shown in Table 11.6 and the
design is drawn in three dimensions in Figure 11.7. The figure shows that the central
composite design has a run at every corner of the cube, there is a star point above each
face of the cube, and there are several points at the center of the cube. The star point
positions are outside the cube at (x1, x2, x3) = (–1.682, 0, 0), (+1.682, 0, 0), (0, –1.682,
0), (0, +1.682, 0), (0, 0, –1.682), and (0, 0, +1.682). The experiment has twenty
observations so $df_{total} = 20 - 1 = 19$. The model that can be fitted to this experiment is the full
quadratic model with three main effects, three two-factor interactions, and three
quadratic terms so it has $df_{model} = 9$. By subtraction there are $df_{error} = 19 - 9 = 10$ degrees
of freedom to estimate the error and more degrees of freedom for error may become
available if insignificant terms can be dropped from the model.
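
The run matrix of Example 11.5 can be generated as a check. This is a minimal sketch that assumes a full $2^k$ factorial for the cube part and takes the number of center points from the catalog; the function name `central_composite` is hypothetical and blocking is ignored.

```python
from itertools import product

def central_composite(k, n_center):
    """Cube, center, and star runs of CC(2^k) built on a full 2^k factorial."""
    eta = (2 ** k) ** 0.25                      # Equation 11.7
    cube = [tuple(float(v) for v in run) for run in product((-1, 1), repeat=k)]
    center = [(0.0,) * k] * n_center
    star = []
    for i in range(k):
        for sign in (-1, 1):
            run = [0.0] * k
            run[i] = sign * eta
            star.append(tuple(run))
    return cube + center + star

design = central_composite(3, n_center=6)
df_total = len(design) - 1
df_model = 3 + 3 + 3          # main effects + two-factor interactions + quadratics
df_error = df_total - df_model

for std, run in enumerate(design, start=1):
    print(std, tuple(round(v, 3) for v in run))
print("df_total, df_model, df_error =", df_total, df_model, df_error)
```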

Central composite designs can be large and difficult to build but there are excel-
lent plans available for breaking them into blocks of practical size. All of the central
composite designs can be broken into two blocks, the first block consisting of the
points from the cube plus some center points and the second block consisting of the
star points and some center points. The specific number of center points for each block
is prescribed and there may be a slight change in the star point position if the total
number of center points changes from the original design. If the block of points from
the cube is still too large, it may be further broken up into smaller blocks defined by
Table 11.5 Central composite design catalog.

CC($2^2$)
x1      x2      Runs
±1      ±1      4
0       0       5
±1.41   0       2
0       ±1.41   2
Total Runs      13

CC($2^3$)
x1      x2      x3      Runs
±1      ±1      ±1      8
0       0       0       6
±1.68   0       0       2
0       ±1.68   0       2
0       0       ±1.68   2
Total Runs              20

CC($2^4$)
x1   x2   x3   x4   Runs
±1   ±1   ±1   ±1   16
0    0    0    0    7
±2   0    0    0    2
.    .    .    .    .
0    0    0    ±2   2
Total Runs          31

CC($2^5$)
x1      x2   x3   x4   x5      Runs
±1      ±1   ±1   ±1   ±1      32
0       0    0    0    0       10
±2.38   0    0    0    0       2
.       .    .    .    .       .
0       0    0    0    ±2.38   2
Total Runs                     52

CC($2_V^{5-1}$)
x1   x2   x3   x4   x5     Runs
±1   ±1   ±1   ±1   1234   16
0    0    0    0    0      6
±2   0    0    0    0      2
.    .    .    .    .      .
0    0    0    0    ±2     2
Total Runs                 32

CC($2_{VI}^{6-1}$)
x1      x2   x3   x4   x5   x6      Runs
±1      ±1   ±1   ±1   ±1   12345   32
0       0    0    0    0    0       9
±2.38   0    0    0    0    0       2
.       .    .    .    .    .       .
0       0    0    0    0    ±2.38   2
Total Runs                          53

CC($2_{VII}^{7-1}$)
x1      x2   x3   x4   x5   x6   x7       Runs
±1      ±1   ±1   ±1   ±1   ±1   123456   64
0       0    0    0    0    0    0        14
±2.83   0    0    0    0    0    0        2
.       .    .    .    .    .    .        .
0       0    0    0    0    0    ±2.83    2
Total Runs                                92

CC($2_V^{8-2}$)
x1      x2   x3   x4   x5   x6   x7     x8      Runs
±1      ±1   ±1   ±1   ±1   ±1   1234   1256    64
0       0    0    0    0    0    0      0       10
±2.83   0    0    0    0    0    0      0       2
.       .    .    .    .    .    .      .       .
0       0    0    0    0    0    0      ±2.83   2
Total Runs                                      90
Table 11.6 Matrix of experimental runs for the CC($2^3$) design.

CC($2^3$) in shorthand notation:
x1      x2      x3      Runs
±1      ±1      ±1      8
0       0       0       6
±1.68   0       0       2
0       ±1.68   0       2
0       0       ±1.68   2

Equivalent matrix of runs in standard order:
Std   x1      x2      x3
1     –       –       –
2     –       –       +
3     –       +       –
4     –       +       +
5     +       –       –
6     +       –       +
7     +       +       –
8     +       +       +
9     0       0       0
10    0       0       0
11    0       0       0
12    0       0       0
13    0       0       0
14    0       0       0
15    –1.68   0       0
16    +1.68   0       0
17    0       –1.68   0
18    0       +1.68   0
19    0       0       –1.68
20    0       0       +1.68

Figure 11.7 Central composite three-variable design: CC($2^3$) (three-dimensional drawing with axes X1, X2, and X3).


complementary fractional factorial designs. As before, some center points are run with
each block and the star point position might change a bit if the total number of center
points deviates from the original plan. Table 11.7 shows practical blocking plans for
some of the central composite designs. In the Definition column, the symbol (*) indicates
star points and the symbol (0) indicates center points. Where blocks are fractional fac-
torials, complementary fractions must be used. Different references may show slightly

Table 11.7 Blocking plans for central composite designs.

Design                Number      Block   Definition                            Runs   Total   η
                      of Blocks                                                        Runs
CC($2^2$)             1                   $2^2$ + 4 (*) + 5 (0)                        13      1.414
CC($2^2$)             2           1       $2^2$ + 3 (0)                         7      14      1.414
                                  2       4 (*) + 3 (0)                         7
CC($2^3$)             1                   $2^3$ + 6 (*) + 6 (0)                        20      1.682
CC($2^3$)             2           1       $2^3$ + 4 (0)                         12     20      1.633
                                  2       6 (*) + 2 (0)                         8
CC($2^3$)             3           1       $2^{3-1}$ + 2 (0)                     6      20      1.633
                                  2       $2^{3-1}$ + 2 (0)                     6
                                  3       6 (*) + 2 (0)                         8
CC($2^4$)             1                   $2^4$ + 8 (*) + 7 (0)                        31      2.000
CC($2^4$)             2           1       $2^4$ + 4 (0)                         20     30      2.000
                                  2       8 (*) + 2 (0)                         10
CC($2^4$)             3           1       $2^{4-1}$ + 2 (0)                     10     30      2.000
                                  2       $2^{4-1}$ + 2 (0)                     10
                                  3       8 (*) + 2 (0)                         10
CC($2_V^{5-1}$)       1                   $2^{5-1}$ + 10 (*) + 6 (0)                   32      2.000
CC($2_V^{5-1}$)       2           1       $2^{5-1}$ + 6 (0)                     22     33      2.000
                                  2       10 (*) + 1 (0)                        11
CC($2_{VI}^{6-1}$)    1                   $2^{6-1}$ + 12 (*) + 9 (0)                   53      2.378
CC($2_{VI}^{6-1}$)    2           1       $2^{6-1}$ + 8 (0)                     40     54      2.366
                                  2       12 (*) + 2 (0)                        14
CC($2_{VI}^{6-1}$)    3           1       $2^{6-2}$ + 4 (0)                     20     54      2.366
                                  2       $2^{6-2}$ + 4 (0)                     20
                                  3       12 (*) + 2 (0)                        14
CC($2_{VII}^{7-1}$)   1                   $2^{7-1}$ + 14 (*) + 14 (0)                  92      2.828
CC($2_{VII}^{7-1}$)   2           1       $2^{7-1}$ + 8 (0)                     72     90      2.828
                                  2       14 (*) + 4 (0)                        18
CC($2_{VII}^{7-1}$)   3           1       $2^{7-2}$ + 4 (0)                     36     90      2.828
                                  2       $2^{7-2}$ + 4 (0)                     36
                                  3       14 (*) + 4 (0)                        18
CC($2_{VII}^{7-1}$)   5           1       $2^{7-3}$ + 2 (0)                     18     90      2.828
                                  2       $2^{7-3}$ + 2 (0)                     18
                                  3       $2^{7-3}$ + 2 (0)                     18
                                  4       $2^{7-3}$ + 2 (0)                     18
                                  5       14 (*) + 4 (0)                        18
different preferences for the number of center points used and the values of η, but
these differences are generally negligible for practical applications. MINITAB offers
several blocking plans for some of the larger designs and recommends the best values
for the star point position and the number of center points for each block.

Example 11.6
Describe a blocking plan to build the CC($2^4$) experiment in three blocks. Write out
the model that can be fitted and describe the distribution of the degrees of freedom if a
term for the blocks is included in the model.
Solution: From Table 11.7, the CC($2^4$) experiment can be built in three blocks of
size 10 each. The three blocks are:
• Block 1: Eight runs from the $2^{4-1}$ design with 4 = +123 plus two center points.
• Block 2: Eight runs from the complementary $2^{4-1}$ design with 4 = –123 plus
two center points.
• Block 3: Eight star points with η = 2 plus two center points.
The model will be:

$$
\begin{aligned}
y = {} & b_0 + b_1 x_1 + b_2 x_2 + b_3 x_3 + b_4 x_4 \\
& + b_{12} x_{12} + b_{13} x_{13} + b_{14} x_{14} + b_{23} x_{23} + b_{24} x_{24} + b_{34} x_{34} \\
& + b_{11} x_1^2 + b_{22} x_2^2 + b_{33} x_3^2 + b_{44} x_4^2 \\
& + d_2 (\text{block} = 2) + d_3 (\text{block} = 3)
\end{aligned}
$$

where terms for the blocks ($d_2$ and $d_3$) have been explicitly included in the model. The
experiment will have 30 observations so $df_{total} = 29$. The model has four main effects, six
two-factor interactions, four quadratic terms, and two terms for the blocks so $df_{model} = 16$.
This leaves $df_e = 13$ degrees of freedom for the error estimate.
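
The three blocks of Example 11.6 can be written out explicitly as a check on the run counts and degrees of freedom. This is a sketch using only the standard library; the variable names are hypothetical and run order within blocks is not randomized here.

```python
from itertools import product

cube = list(product((-1, 1), repeat=4))

# Complementary half fractions defined by the generators 4 = +123 and 4 = -123.
half_plus = [run for run in cube if run[3] == run[0] * run[1] * run[2]]
half_minus = [run for run in cube if run[3] == -run[0] * run[1] * run[2]]

eta = 2
star = []
for i in range(4):
    for sign in (-1, 1):
        run = [0] * 4
        run[i] = sign * eta
        star.append(tuple(run))

center = (0, 0, 0, 0)
blocks = {
    1: half_plus + [center] * 2,
    2: half_minus + [center] * 2,
    3: star + [center] * 2,
}

n_runs = sum(len(runs) for runs in blocks.values())
df_total = n_runs - 1
# 4 main effects + 6 interactions + 4 quadratics + (number of blocks - 1) block terms
df_model = 4 + 6 + 4 + (len(blocks) - 1)
df_error = df_total - df_model

print({block: len(runs) for block, runs in blocks.items()})
print("df_total, df_model, df_error =", df_total, df_model, df_error)
```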

11.7 COMPARISON OF THE RESPONSE-SURFACE DESIGNS

Since all three families of true response-surface designs, $3^k$, BB(k), and CC($2^k$), deliver
models with main effects, two-factor interactions, and quadratic terms, other criteria
besides which model can be fitted must be considered in deciding which design to use
for a response surface experiment. There are three criteria used to compare the design
families:
1. The number of observations in the design and the number of error degrees
of freedom.
2. The number of levels required of each design variable.
3. The safety of the highest and lowest variable levels.


As will be seen, these criteria are important enough that different designs tend to be pre-
ferred under different circumstances. While one criterion might be more important than
the others for a particular design problem, all three criteria must be considered and man-
aged simultaneously.

11.7.1 Number of Observations and Error Degrees of Freedom


As with any other type of experiment design, response-surface designs can be made
very sensitive to small effects by building enough replicates of the design. Response-
surface designs tend to be large and expensive, however, so they are frequently built
using just a single replicate. In this case, the experiment should have enough runs so
that the full model can be constructed leaving enough degrees of freedom to provide a
good error estimate. Models that have fewer than about eight error degrees of freedom are
often considered to be too risky, and models that have more than about twenty error
degrees of freedom are often considered to be wasteful of resources. Frequently, one
response-surface design will deliver a model that has error degrees of freedom that fall
within this range when the other designs don’t.
Table 11.8 shows a comparison of the number of runs (N) and the number of error
degrees of freedom provided by all of the response surface designs for two to six vari-
ables. Each case in the table considers just one replicate of the experiment design. The
table shows that:
• When an experiment has just two variables there are only two designs to choose
from: the $3^2$ and the CC($2^2$) designs. The $3^2$ design is very efficient although
with a single replicate there are only $df_e = 3$ error degrees of freedom. This
design should be replicated to have sufficient degrees of freedom for the error
estimate or the CC($2^2$) design should be used instead. Both of these strategies
are frequently used.
• Of the three variable experiments, the $3^3$ design with $df_e = 17$ is comparatively
wasteful of resources. It has too many runs to justify its use over the other two
three-variable designs. Compared to the very efficient BB(3) design, even the
CC($2^3$) experiment seems wasteful. The BB(3) is a bit short on error degrees of
freedom, but most of these experiments have several terms that can be dropped
from the model to improve the error estimate. Of the three variable experiments,
the BB(3) design is probably used most often.
• Of the four variable experiments, the $3^4$ is definitely too large compared to the
other two designs. The BB(4) and CC($2^4$) experiments are comparable in their
total number of runs and both have plenty of degrees of freedom for the error
estimate. These two designs are probably used with approximately equal frequency.
• Of the five variable experiments, the $3^5$ with 243 runs is impractical and the
BB(5) design requires 44 percent more runs than the CC($2_V^{5-1}$). The CC($2_V^{5-1}$),
Table 11.8 Comparison of the response-surface designs.

k   Design               N     $df_{total}$   $df_{model}$   $df_e$
2   $3^2$                9     8              5              3
    CC($2^2$)            13    12             5              7
3   $3^3$                27    26             9              17
    BB(3)                15    14             9              5
    CC($2^3$)            20    19             9              10
4   $3^4$                81    80             14             66
    BB(4)                27    26             14             12
    CC($2^4$)            31    30             14             16
5   $3^5$                243   242            20             222
    BB(5)                46    45             20             25
    CC($2_V^{5-1}$)      32    31             20             11
6   $3^6$                729   728            27             701
    BB(6)                54    53             27             26
    CC($2_{VI}^{6-1}$)   53    52             27             25

which is very efficient, starts out with 11 error degrees of freedom but this
number usually grows nicely as terms are dropped from the model. More
CC($2_V^{5-1}$) designs are used than BB(5) experiments. In fact, since a fifth variable
can usually be found for most problems that start off with four variables, more
CC($2_V^{5-1}$) designs are built than either the BB(4) or CC($2^4$) designs, which are
comparable in size. The CC($2_V^{5-1}$) provides a great opportunity to study a fifth
variable in an experiment that starts out with four variables, with little extra
time or expense.
• Response-surface designs with six or more variables require so many runs
and are so difficult to build and manage that they are rarely built. When they
are built, the CC($2_{VI}^{6-1}$) design is often used because of its convenient blocking.
The $2_{VI}^{6-1}$ plus centers design is usually built, often in two blocks, and analyzed
first to see if there is curvature in the design space and if one or more variables
can be dropped from further experiments. If there is evidence of curvature
and if none of the design variables can be dropped from the experiment, then
the block of star points is run and combined with the first block(s). Then the
full model with main effects, two-factor interactions, and quadratic terms can
be fitted.
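
The degrees-of-freedom columns of Table 11.8 follow from the run counts alone. The sketch below recomputes them for one replicate of each design; the run counts are copied from the design catalogs in this chapter, and the function and dictionary names are hypothetical.

```python
def df_breakdown(n_runs, k):
    """(df_total, df_model, df_error) for a full quadratic model in k variables."""
    df_total = n_runs - 1
    df_model = k + k * (k - 1) // 2 + k   # main effects + interactions + quadratics
    return df_total, df_model, df_total - df_model

# Runs in one replicate, taken from Tables 11.3 and 11.5 and from 3**k.
run_counts = {
    2: {"3^2": 9, "CC(2^2)": 13},
    3: {"3^3": 27, "BB(3)": 15, "CC(2^3)": 20},
    4: {"3^4": 81, "BB(4)": 27, "CC(2^4)": 31},
    5: {"3^5": 243, "BB(5)": 46, "CC(2^(5-1))": 32},
    6: {"3^6": 729, "BB(6)": 54, "CC(2^(6-1))": 53},
}

table = {
    (k, name): df_breakdown(n, k)
    for k, designs in run_counts.items()
    for name, n in designs.items()
}
for (k, name), dfs in table.items():
    print(k, name, run_counts[k][name], *dfs)

# Extra runs required by BB(5) relative to the half-fraction central composite.
extra_runs_percent = round(100 * (46 / 32 - 1))
print("BB(5) needs about", extra_runs_percent, "percent more runs")
```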

11.7.2 Number of Levels of Each Variable


The BB(k) and $3^k$ experiments have three levels of each variable whereas the central
composite designs have five levels of each variable. In many situations, because of the nature
of some of the design variables, getting the necessary five levels for all of the variables
for a central composite design is impossible or impractical. This might happen because
five levels just aren’t available or sometimes the star point positions can’t be made with
the correct ±η values. In the latter case, if the central composite design is the right
design for the problem except that the star point positions can’t be achieved exactly,
then by all means compromise the η value so that the central composite design can
still be used. Incorrect star point positions will degrade the quality of the design, but
the compromise is often relatively minor and worth the risk. When there are just too
many compromises that have to be made to get the central composite design to work, it
might be time to turn to the three-level designs as an alternative.
Of the central composite designs, the CC($2^4$) and the CC($2_V^{5-1}$) deserve some special
attention because their five levels (–η, –1, 0, +1, +η) are all spaced one coded unit apart
because they have η = 2. This might not seem like such a big deal at first, but there are
many cases in which a quantitative variable’s levels are easiest to achieve if they take
on integer values. For example, if the amount of some material is a variable in a central
composite design and the material is only available in some quantized form (for
example, pill, tablet, and so on) that cannot be easily and accurately subdivided, then
the CC($2^4$) or the CC($2_V^{5-1}$) designs might be much easier to build than one of the other
designs that has fractional η values. Of course the Box-Behnken designs, with their
three evenly spaced levels, also have this advantage and should also be considered.

11.7.3 Uncertainty About the Safety of Variable Levels


There are two forces that affect the choice of the extreme quantitative variable levels for
all experiments. The first force is sensitivity—the farther apart the levels of a variable
can be spaced, the more sensitive the experiment will be for that variable. The second
force is safety—if the levels of a variable are chosen to be too far apart then one or both
of the extreme levels may be lost. These issues become more difficult to manage as the
number of variables in an experiment increases because there are more opportunities to
screw up.
Sometimes enough is known about the safe limits for each variable that there is lit-
tle to no concern about the choice of levels, but most experiments worth doing have one
or more variables that little is known about and there is corresponding uncertainty about
the safety of its levels. When safe limits are known for all of the variables in an exper-
iment then the three-level experiments are excellent choices. They also put most of their
observations far from the design center so the experiment has high sensitivity. If, how-
ever, one or more of the variable levels are chosen inappropriately then a significant
fraction of the experimental runs could be lost. So many runs can be lost from one poor
choice of variable level that it might be impossible to salvage any model from the runs
that survive.
When safe limits are not known, the central composite designs are an excellent
choice. Their star points can be placed in questionable territory, leaving the points of
the factorial design and center points in complete safety. Even if all of the star points
are lost from the experiment, the surviving two-level factorial plus centers experiment can
still be analyzed for main effects, two-factor interactions, and lack of fit. This strategy
is especially good for large experiments where there are just too many (that is, 2k)
opportunities to make a mistake picking a variable level.
Although the $3^k$ and BB(k) designs both use three levels of each variable, if their ±1
coded levels have the same physical values then the BB(k) design is somewhat safer
than the $3^k$ design. This is because the BB(k) designs don’t use the ±1 variable levels
for all k variables simultaneously like the $3^k$ designs do. This means that the $3^k$
experiments have observations that fall farther from the design center than the BB(k) designs.
This puts some of the runs of the $3^k$ designs at greater risk but it also makes the $3^k$
designs more sensitive to variable effects.
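
The difference in reach between the two families can be quantified as the largest distance of any run from the design center, in coded units. The sketch below is illustrative only: it covers the smaller Box-Behnken designs (k ≤ 5), whose noncenter runs set exactly two variables to ±1, and the function names are hypothetical.

```python
import math
from itertools import combinations, product

def max_distance(runs):
    """Largest Euclidean distance of any run from the design center."""
    return max(math.sqrt(sum(v * v for v in run)) for run in runs)

def three_level_factorial(k):
    return list(product((-1, 0, 1), repeat=k))

def box_behnken_noncenter(k):
    """Noncenter runs of BB(k) for k <= 5: a 2^2 factorial in each pair of variables."""
    runs = []
    for i, j in combinations(range(k), 2):
        for level_i, level_j in product((-1, 1), repeat=2):
            run = [0] * k
            run[i], run[j] = level_i, level_j
            runs.append(tuple(run))
    return runs

for k in (3, 4, 5):
    d_factorial = max_distance(three_level_factorial(k))
    d_bb = max_distance(box_behnken_noncenter(k))
    print(f"k = {k}: 3^k max distance = {d_factorial:.3f}, BB(k) max distance = {d_bb:.3f}")
```

The corner runs of the three-level factorial sit at a distance of √k from the center, while the Box-Behnken runs in this range of k never exceed √2.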

Example 11.7
Compare the sensitivity and safety of the CC(2^2) and 3^2 designs if: 1) safe limits for
both design variables are known and 2) safe limits for one or both design variables are
not known.
Solution: Figure 11.8 shows the CC(2^2) and 3^2 designs and the surrounding danger
zones due to unsafe variable levels. If safe limits for both variables are known with
certainty, then the 3^2 experiment would be the preferred design because more of its
observations fall farther from the center of the design space than for the CC(2^2) design.
This gives the 3^2 design greater sensitivity to x1 and x2 effects. If safe limits for one or
both variables are not known with certainty, then there is an excellent chance that at
least part of the 3^2 design will wander into the dangerous part of the design space so
that some if not many of the experimental runs will be lost. Even if just one of the four
choices of variable levels is picked incorrectly, one third of the runs will be lost and
the surviving runs will be difficult to analyze.
By comparison, the CC(2^2) design only puts its star points at risk. If the experiment
is placed correctly in the design space, the full model will be obtained, but even if the
CC(2^2) wanders off so that some star points are lost, it's likely that the 2^2 plus centers
design will survive and can still be analyzed for main effects, two-factor interactions,
and lack of fit. Of course the ±1 levels for the 3^2 design could be chosen to fall closer
together so that there is less risk of losing observations, but that is all part of the game
of picking a design and its variable levels.

Figure 11.8 Comparison of the risks associated with extreme variable levels for the CC(2^2)
and 3^2 designs. [Panels a and b: X2 versus X1, with the surrounding danger zones marked.]

Example 11.8
Compare the risks of the 3^3 and BB(3) designs if safe variable levels are not known
and both experiments use the same ±1 and zero variable levels.
Solution: Figures 11.5, page 444, and 11.6, page 447, show the 3^3 and BB(3)
designs, respectively, plotted in three dimensions. The 3^3 experiment has a run at every
corner, on every edge, and at every face of the cube in addition to the single center
point. The BB(3) design only has runs on the cube edges and at the center. Since the
cube corner points fall farther from the design center than the edge points, the 3^3 design
offers greater sensitivity to variable effects but those points are also at greater risk of
being lost.
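
The distance argument in Examples 11.7 and 11.8 can be checked numerically. The sketch
below is only an illustration: it enumerates the coded runs of the 3^3 design and of a
BB(3) design built, by assumption, as the twelve cube-edge midpoints plus a single center
point, and reports the farthest distance of any run from the design center. The function
names are hypothetical.

```python
from itertools import product
from math import sqrt

def three_level_factorial(k):
    """All 3^k runs with coded levels -1, 0, +1."""
    return list(product((-1, 0, 1), repeat=k))

def box_behnken_3():
    """BB(3) in coded units: 12 cube-edge midpoints plus one center point (assumed)."""
    edges = [run for run in three_level_factorial(3) if run.count(0) == 1]
    return edges + [(0, 0, 0)]

def max_distance(design):
    """Largest Euclidean distance of any run from the design center."""
    return max(sqrt(sum(x * x for x in run)) for run in design)

full = three_level_factorial(3)
bb3 = box_behnken_3()
print(len(full), round(max_distance(full), 3))  # corner runs sit at sqrt(3)
print(len(bb3), round(max_distance(bb3), 3))    # edge runs sit at sqrt(2)
```

The corner runs of the 3^3 design lie at a coded distance of sqrt(3) from the center,
while the farthest BB(3) runs lie at sqrt(2), which is the source of both the extra
sensitivity and the extra risk described above.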

11.8 RESPONSE-SURFACE DESIGNS IN MINITAB


Response-surface designs can be created and analyzed in MINITAB using methods
similar to those described in Section 9.7, with appropriate modifications to account for
the quadratic terms. Since the 2^k plus centers designs are not true response-surface
designs, use the usual Stat> DOE> Factorial tools to create and analyze them.

11.8.1 Creating Response-Surface Designs in MINITAB


A response-surface design can be created in MINITAB by:
1. Manually entering the matrix of experimental runs into a new worksheet.
2. Opening a worksheet that already contains the desired design. Many common
designs can be found in the worksheets provided on the CD-ROM included
with this book (for example, cc(2^5h).mtw).
3. Using MINITAB’s Stat> DOE> Response Surface> Create Response
Surface Design menu to specify and create the design. This menu is very
similar to the Stat> DOE> Factorial> Create Factorial Design menu.
MINITAB provides some options to fine-tune some of the response-surface
designs but the default settings are usually appropriate.

11.8.2 Analysis of Response-Surface Designs in MINITAB


Once a response-surface design has been created in MINITAB and the responses have
been entered into a column of the worksheet, the data can be analyzed by the same
methods described in Section 9.7.2. If the analysis is to be done manually using Stat>
Regression> Regression it will be necessary to create columns for the quadratic terms
in addition to the columns for the main effects and interactions. If you choose to use
Stat> ANOVA> General Linear Model, you don’t have to create columns for the inter-
actions and quadratic terms in the worksheet but you will have to add them to the Model
and Covariates windows.
The mlrk.mac macros automatically create the necessary columns for the quadratic
terms and include them in the model. For a 2^k design with centers, MINITAB will only
include the first quadratic term in the model and you are responsible for interpreting it
as the combined effect of all of the quadratic terms.
MINITAB’s Stat> DOE> Response Surface> Analyze Response Surface Design
menu works just like the Stat> DOE> Factorial> Analyze Factorial Design menu. If
you didn’t use Stat> DOE> Response Surface> Create Response Surface Design to
create the design, you will have to specify the response-surface design to MINITAB
using Stat> DOE> Response Surface> Define Custom Response Surface Design first.
The Responses, Terms, and Graphs menus all work as before. MINITAB automatically
includes all of the possible terms in the model, including the quadratic terms.
Response-surface models often have many insignificant terms. It is your responsi-
bility to use Occam’s razor to refine the model. This is especially important when the
model has few error degrees of freedom but is not so important for experiments that
already have plenty of error degrees of freedom. Sometimes an automated form of
Occam called stepwise regression can be used to refine the model. Stepwise regression
comes in two forms: stepwise backward and stepwise forward. In the backward
method, all of the possible model terms are included in the regression model and the
model is refined by eliminating the least significant terms one by one. The refining
process is stopped when the weakest term is still statistically significant. In stepwise
forward, the initial model is small, usually consisting only of main effects, and then
terms are added to the model one by one in order of their strength. The addition of
terms stops when the next term to be added is not statistically significant. MINITAB’s
Stat> Regression> Stepwise menu supports stepwise forward and backward regression
but because of problems with preserving the hierarchy of model terms this method is
not recommended.
Many response-surface designs are performed for the purpose of identifying the
variable levels that maximize or minimize the response. When none of the variables in
a response-surface experiment has a significant quadratic term, the response will be
maximized and minimized for specific choices of extreme levels of the important
design variables. In contrast, if a response-surface design reveals two or more variables
that cause strong curvature in the response, a local maximum or minimum may exist
within the range of the experimental variables. Such a maximum or minimum might be
found by observation using post-regression diagnostic plots such as the contour and
wire-frame plots that MINITAB provides from the Stat> DOE> Response Surface>
Contour/Surface Plots menu.
For more complicated optimization problems there are analytical and software-
based hill-climbing methods to find the design variable conditions that maximize or
minimize the response. MINITAB provides such an optimization tool from its Stat>
DOE> Response Surface> Response Optimizer menu. The response optimizer can
simultaneously solve problems involving several design variables and one or more
responses. The response optimizer gets its input from the last run of Stat> DOE>
Response Surface> Analyze Response Surface Design, so if you’ve attempted several
different models make sure that you run your favorite again before running the opti-
mizer. And sometimes when the response changes very quickly with relatively small
changes to the design variables, different models will give substantially different opti-
mized results. The response optimizer also includes an interactive graphic mode which
you can use to investigate the local behavior around an optimized solution but this topic,
which deserves a whole chapter of its own, is outside the scope of this book. See
MINITAB’s Help menu for assistance with running the response optimizer.

Example 11.9
The basic geometry of a metal halide arctube produced by GE Lighting in Cleveland,
Ohio, is shown in Figure 11.9.* The arc chamber is made from quartz tubing blow-molded
into an ellipsoidal shape. The length and diameter of the arc chamber are determined by
physical calculation of the power input and lumen output requirements of the lamp. A
metal halide compound is dosed into the arc chamber, tungsten electrodes are sealed
into each end, and a reflective coating or end coat is painted over the arctube’s ends.
Among others, there are three important factors that affect the lumen output of the fin-
ished arctube: the electrode insertion length (EIL), the end coat height (ECH), and the
metal halide dose amount measured in milligrams relative to the arc chamber surface
area (HAD or metal halide density). The geometry of the EIL and ECH are shown in the
figure and the same values are used for both the top and bottom ends of the arctube.

Figure 11.9 Metal halide arctube geometry. [Labeled parts: quartz to metal seal, ECH,
quartz arc chamber, arc, EIL, end coat, electrode lead.]

*Source: Simulation and analysis courtesy of General Electric Company.

During lamp operation, the metal halide dose melts and covers the coldest areas of
the bulb wall. Some of the liquefied metal halide evaporates and enters the arc core
where the metal halide molecules dissociate and the metal atoms radiate their
characteristic atomic spectra. The combined metal spectra give the lamp its highly efficient
light output—about 100 lumens per watt versus about 18 lumens per watt for a stan-
dard incandescent light bulb. Generally, the light output from the lamp increases as
more of the wall area is covered by liquid metal halide. Since the ends of the arc cham-
ber tend to be cold, EIL and ECH are adjusted to keep them warm and force the liquid
metal halide out of the end chambers and onto the bulb wall. The end chambers become
warmer as EIL decreases because the arc gets closer to the end chambers, but if the
arctube wall gets too hot the lamp will fail prematurely. The end chambers also become
warmer as ECH increases because the reflective end coat traps heat there; however, the
end coat also prevents some light from escaping from the arctube so the light output
falls if the end coat gets too high. The light output tends to increase with the addition
of metal halide; however, if too much metal halide is added to the lamp it tends to form
puddles that roll down the inner bulb wall and strike the hot electrode shank causing
undesirable flares in the light output.
An experiment was performed to determine the appropriate values of EIL, ECH,
and HAD for a new arctube design. The experimental variables and results are shown
in Table 11.9. The ±1 variable levels were chosen to be near their extreme allowable
physical values. The experiment is a BB(3) design with two replicates and the experiment
was blocked on replicates. Analyze the experimental data and determine the settings for
the three design variables that maximize the light output.
Solution: The experiment design was entered into a MINITAB worksheet and
defined using Stat> DOE> Response Surface> Define Custom Response Surface
Design. The experimental lumen response was entered into a column of the worksheet
and then analyzed using Stat> DOE> Response Surface> Analyze Response Surface
Design. The residuals diagnostic plots shown in Figure 11.10 indicate that the residu-
als are normally distributed and homoscedastic with respect to the design variables, the
run order, and the fitted values as required by the analysis method.
Figure 11.10 also contains a normal probability plot of the regression coefficient t
values. The plot shows reference lines at t_i = 0 and at t_i = ±2.09, where the latter
correspond to the critical t value that distinguishes significant from insignificant
regression coefficients at α = 0.05: t_{α/2,df_ε} = t_{0.025,19} = 2.09. This plot suggests
that A, B, AA, and CC are all statistically significant (p < 0.05) and that AB, BB, and
blocks are marginal (p ≈ 0.05).
The results of the MINITAB analysis performed using Stat> DOE> Response
Surface> Analyze Response Surface Design are shown in Figure 11.11. The model p
values confirm that all three design variables have significant main effects and/or qua-
dratic terms, the two-factor interactions are all statistically insignificant or marginal,
and the blocks are not significant. The model has enough error degrees of freedom that
there’s no pressing reason to refine it. The three quadratic terms have mixed signs,
which means that the lumen response surface is very complex.
Table 11.9 Variables and design matrices from arc lamp design experiment.

Code  Variable   –1     0      +1    Units
A     ECH        1      2      3     mm
B     EIL        2.0    2.75   3.5   mm
C     HAD        1.5    2.75   4.0   mg/cm^2

Std  Run  Block    A    B    C   Lumens
  1    9      1   –1   –1    0     4010
  2    8      1   –1    1    0     5135
  3   10      1    1   –1    0     5879
  4   12      1    1    1    0     6073
  5    3      1   –1    0   –1     3841
  6   15      1   –1    0    1     4933
  7    6      1    1    0   –1     5569
  8   14      1    1    0    1     5239
  9   13      1    0   –1   –1     5017
 10    5      1    0   –1    1     5243
 11    1      1    0    1   –1     6412
 12   11      1    0    1    1     6210
 13    4      1    0    0    0     5805
 14    7      1    0    0    0     5624
 15    2      1    0    0    0     5843
 16   26      2   –1   –1    0     4746
 17   16      2   –1    1    0     6052
 18   27      2    1   –1    0     6105
 19   19      2    1    1    0     6232
 20   23      2   –1    0   –1     4549
 21   30      2   –1    0    1     4080
 22   22      2    1    0   –1     5006
 23   18      2    1    0    1     5438
 24   17      2    0   –1   –1     4903
 25   25      2    0   –1    1     6129
 26   20      2    0    1   –1     6234
 27   29      2    0    1    1     6860
 28   21      2    0    0    0     6794
 29   24      2    0    0    0     5780
 30   28      2    0    0    0     6053

MINITAB's Stat> DOE> Response Surface> Response Optimizer was used to
determine the A: ECH, B: EIL, and C: HAD settings that maximize the lumens for the
full model in Figure 11.11. The response optimizer's output is shown at the end of
that figure. MINITAB found that the optimal settings (A: ECH, B: EIL, C: HAD) =
(0.81, 1.0, –0.023) deliver a maximum response of 6435 lumens. The corresponding
mechanical settings for the three variables are ECH = 2.8 mm, EIL = 3.5 mm, and HAD
= 2.7 mg/cm^2.
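
The optimizer's reported numbers can be cross-checked by hand. The following sketch is an
illustration only, not MINITAB's algorithm: it evaluates the fitted quadratic model from
Figure 11.11 at the reported coded settings, converts those settings to physical units
using the levels in Table 11.9, and computes a linear larger-is-better desirability. Two
assumptions are made here: the block term is left out of the prediction (averaged over the
two blocks), and the desirability uses the lower bound of 6000 and target of 8000 from the
optimizer output with weight 1. All function names are hypothetical.

```python
# Coded-unit coefficients copied from the MINITAB output in Figure 11.11.
COEF = {
    "const": 5983.17,
    "A": 512.19, "B": 448.50, "C": 162.56,
    "AA": -749.15, "BB": 294.98, "CC": -402.15,
    "AB": -263.75, "AC": -65.13, "BC": -128.50,
}

# (center, half-range) of each variable in physical units, from Table 11.9.
LEVELS = {"ECH": (2.0, 1.0), "EIL": (2.75, 0.75), "HAD": (2.75, 1.25)}

def predict_lumens(a, b, c):
    """Fitted quadratic model in coded units; block term omitted (assumption)."""
    k = COEF
    return (k["const"]
            + k["A"] * a + k["B"] * b + k["C"] * c
            + k["AA"] * a * a + k["BB"] * b * b + k["CC"] * c * c
            + k["AB"] * a * b + k["AC"] * a * c + k["BC"] * b * c)

def to_physical(a, b, c):
    """Convert coded settings to (ECH mm, EIL mm, HAD mg/cm^2)."""
    return tuple(center + half * x
                 for (center, half), x in zip(LEVELS.values(), (a, b, c)))

def desirability(y, lower=6000.0, target=8000.0):
    """Linear larger-is-better desirability clipped to [0, 1] (assumed form)."""
    return min(1.0, max(0.0, (y - lower) / (target - lower)))

solution = (0.81173, 1.00000, -0.02306)   # "Global Solution" from Figure 11.11
lumens = predict_lumens(*solution)
ech, eil, had = to_physical(*solution)
print(round(lumens, 1), round(desirability(lumens), 4))
print(round(ech, 1), round(eil, 1), round(had, 1))
```

Under these assumptions the prediction and desirability agree with the values MINITAB
reports (about 6435 lumens and 0.217), and the decoded settings round to the 2.8 mm,
3.5 mm, and 2.7 mg/cm^2 quoted in the text.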
Contour and response-surface plots were created from the Stat> DOE> Response
Surface> Contour/Surface Plots menu and are shown in Figure 11.12. Each plot
shows the response surface with respect to a pair of design variables while the third
variable is held constant at its optimum level. The contour and surface plots of lumens
versus A: ECH and C: HAD (the middle pair of plots) clearly show that there is a local
maximum in the lumens within the experimental range of these variables. The other
plots, for lumens versus A: ECH and B: EIL and lumens versus B: EIL and C: HAD,
show saddle-type surfaces. Steep slopes in the vicinity of the optimum solution imply
that the arctube will demand quite tight manufacturing tolerances to deliver any
consistency in lumens.

Figure 11.10 Residuals analysis from arc lamp design example. [Panels: histogram of the
residuals; normal probability plot of the residuals; residuals versus the fitted values;
residuals versus the order of the data; normal probability plot of the coefficient t
values; residuals versus A, B, and C.]

Response Surface Regression: Lumens versus Block, A, B, C

The analysis was done using coded units.

Estimated Regression Coefficients for Lumens

Term Coef SE Coef T P


Constant 5983.17 154.95 38.613 0.000
Block -137.60 69.30 -1.986 0.062
A 512.19 94.89 5.398 0.000
B 448.50 94.89 4.727 0.000
C 162.56 94.89 1.713 0.103
A*A -749.15 139.67 -5.364 0.000
B*B 294.98 139.67 2.112 0.048
C*C -402.15 139.67 -2.879 0.010
A*B -263.75 134.19 -1.965 0.064
A*C -65.13 134.19 -0.485 0.633
B*C -128.50 134.19 -0.958 0.350

S = 379.6 R-Sq = 84.8% R-Sq(adj) = 76.7%

Analysis of Variance for Lumens

Source DF Seq SS Adj SS Adj MS F P


Blocks 1 568013 568013 568013 3.94 0.062
Regression 9 14649728 14649728 1627748 11.30 0.000
Linear 3 7838638 7838638 2612879 18.14 0.000
Square 3 6088550 6088550 2029517 14.09 0.000
Interaction 3 722541 722541 240847 1.67 0.207
Residual Error 19 2737225 2737225 144064
Lack-of-Fit 15 2159234 2159234 143949 1.00 0.564
Pure Error 4 577991 577991 144498
Total 29 17954965

Unusual Observations for Lumens

Obs StdOrder Lumens Fit SE Fit Residual St Resid


21 21 4080.000 4684.975 242.541 -604.975 -2.07 R

R denotes an observation with a large standardized residual.

Response Optimization
Parameters

Goal Lower Target Upper Weight Import


Lumens Maximum 6000 8000 8000 1 1

Global Solution

A = 0.81173
B = 1.00000
C = -0.02306

Predicted Responses

Lumens = 6434.92, desirability = 0.21746

Composite Desirability = 0.21746

Figure 11.11 Analysis and optimization of BB(3) arc lamp design example.

Figure 11.12 Contour and response-surface plots for all pairs of design variables.
[Panels: contour and surface plots of Lumens = f(A = 0.81, B, C), Lumens = f(A, B = 1, C),
and Lumens = f(A, B, C = –0.023).]

11.9 SAMPLE-SIZE CALCULATIONS


Separate sample-size calculations can be considered to determine the model constant, the
main effects, and the quadratic terms for the experiment designs considered in this chapter;
however, it’s most likely that the experimenter will be interested in the sample size neces-
sary to quantify one of the main effects. The development of a condition to determine the
required sample size (that is, number of replicates) to determine main effects for the designs
in this chapter is similar to the development of the sample-size calculation for the slope of
a linear regression problem in Chapter 8. The key difference between the calculations done
for linear regression and those necessary here is that the 3^k, BB(k), and CC(2^k) designs
have unique forms for their equations for SS_x. All three experiment designs will be
considered. Be aware that the calculation of sample size for the 3^k, BB(k), and CC(2^k)
designs is not supported in MINITAB so you will have to complete these calculations yourself.
There were two different goals considered in the sample-size calculations of other
chapters: we either sought to determine the number of design replicates required to
detect a significant effect due to a design variable or we sought to determine the number
of design replicates required to quantify the regression coefficient associated with a
design variable with specified confidence. Generally, sample-size calculations for
response surface designs are done for the latter reason—to quantify a regression coef-
ficient that is already known to be significantly different from zero. This is the approach
that will be emphasized for the true response surface designs but be aware that the former
goal is still possible.
The goal of the sample-size calculation is to determine the number of replicates (n)
of the experiment required to determine the regression coefficient associated with one of
the variables to within some specified range of values, as in:

    P(b - \delta < \beta < b + \delta) = 1 - \alpha                                   (11.9)

where b is the regression coefficient, β is the true slope parameter, and 1 – α is the
confidence level. Here, and in all of the following calculations, b and β are slopes defined
in terms of coded units for x. Since all of the variables use the same ±1 coded levels,
the same sample-size calculation applies for all of them.
The value of δ in Equation 11.9 is given by:

    \delta = t_{\alpha/2} \sigma_b                                                     (11.10)

where the number of degrees of freedom associated with the t distribution is given by:

    df_\varepsilon = n N_{design} - 1 - df_{model}                                     (11.11)

where N_design is the number of runs in one replicate of the experiment design. The value
of σ_b is given by:

    \sigma_b = \frac{\sigma_\varepsilon}{\sqrt{n SS_x}}                                (11.12)

where

    SS_x = \sum (x_i - \bar{x})^2                                                      (11.13)

is the x sum of squares for one replicate of the design and n is the number of replicates.
The solution of this system of equations for n gives:

    n \geq \frac{1}{SS_x} \left( \frac{t_{\alpha/2} \sigma_\varepsilon}{\delta} \right)^2     (11.14)

The smallest value of n that meets this condition is the minimum number of replicates
of the design that will deliver a confidence interval for β that is as narrow or narrower
than required. If more replicates are used, the resulting confidence interval will be
narrower than necessary and the experiment may become wasteful of resources.
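
The algebra behind Equation 11.14 is short enough to spell out. The sketch below
substitutes Equation 11.12 into Equation 11.10 and solves for n, using the same symbols
as the surrounding equations:

```latex
\begin{align*}
\delta &= t_{\alpha/2}\,\sigma_b
        = t_{\alpha/2}\,\frac{\sigma_\varepsilon}{\sqrt{n\,SS_x}} \\
\sqrt{n\,SS_x} &= \frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta} \\
n &= \frac{1}{SS_x}\left(\frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta}\right)^{2}
\end{align*}
```

Because n must be a whole number of replicates, and because t_{α/2} itself depends on n
through df_ε in Equation 11.11, the equality becomes the inequality of Equation 11.14 and
is solved by trying successive values of n.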

11.9.1 Sample Size for 2^k and 2^(k–p) Plus Centers Designs


The sample-size calculations for 2^k and 2^(k–p) plus centers designs are almost identical
to the calculations for these designs without centers, which were presented in Chapters 9
and 10. Although center cells do not make any contribution to SS_x or the noncentrality
parameter, they do add error degrees of freedom. When there are very few error degrees
of freedom in a 2^k or 2^(k–p) experiment, the addition of a few center cells can improve
the power of the design but the benefit diminishes very quickly. The only way to
significantly improve the sensitivity of an experiment that already has more than about ten
error degrees of freedom is to add replicates.

Sample Size to Detect Significant Effects


The sample-size calculations shown in the next two examples address the goal of
detecting significant variable effects, which is the more common goal for the 2^k and
2^(k–p) plus centers designs. These calculations are substantially the same as those
described in Section 9.10.1 with the exception that the model consumes an additional
degree of freedom to estimate the generic curvature term.

Example 11.10
Calculate the power to detect a difference of δ = σ_ε between the ±1 levels of one of
the variables for one replicate of a 2^3 design with four added center cells. Confirm the
answer using MINITAB.
Solution: The noncentrality parameter is:

    \lambda = \frac{N}{2a} \left( \frac{\delta}{\sigma_\varepsilon} \right)^2
            = \frac{8}{2(2)} (1)^2
            = 2

where N = 2^3 = 8 is the number of runs in the cube of the experiment. There are twelve
runs in the experiment so there will be df_total = 11 total degrees of freedom. The model
will include three main effects, three two-factor interactions, and one generic curvature
term for df_model = 7. By subtraction, the error degrees of freedom will be df_ε = 4. With
α = 0.05, the power is given by the condition:

    F_{0.05,1,4} = F_{P,1,4,2}

which gives:

    F_{0.05,1,4} = F_{0.195,1,4,2} = 7.709

The power to detect the effect is P = 0.195 or only about 19.5 percent. This answer was
confirmed using Stat> Power and Sample Size> 2-Level Factorial Design. MINITAB's
output is shown in Figure 11.13. Notice that without the additional center cells there
would have only been df_ε = 7 – 6 = 1 error degree of freedom and the power would have
been much worse.

MTB > Power;


SUBC> FFDesign 3 8;
SUBC> Replicates 1;
SUBC> Effect 1;
SUBC> CPBlock 4;
SUBC> Sigma 1;
SUBC> Omit 1;
SUBC> FitC;
SUBC> FitB.

Power and Sample Size

2-Level Factorial Design

Alpha = 0.05 Assumed standard deviation = 1

Factors: 3 Base Design: 3, 8


Blocks: none

Number of terms omitted from model: 1


Including a term for center points in model.

Center Total
Points Effect Reps Runs Power
4 1 1 12 0.195232

Figure 11.13 Power calculation for 2^3 plus four centers design.
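
The power in Example 11.10 can also be approximated without MINITAB. The sketch below is
a numerical illustration rather than MINITAB's method. It uses the fact that a noncentral
F statistic with one numerator degree of freedom can be written as (Z + sqrt(λ))^2 / (V/df_ε),
with Z standard normal and V chi-square with df_ε degrees of freedom, and integrates the
rejection probability over V with Simpson's rule. The critical value 7.709 is taken from
the example. The function name, step count, and integration limit are assumptions.

```python
from math import exp, lgamma, log, sqrt
from statistics import NormalDist

def power_one_df(f_crit, df_err, ncp, steps=4000, s_max=12.0):
    """Approximate P(F' > f_crit) for a noncentral F with 1 and df_err degrees of freedom.

    Integrates over s = sqrt(V / df_err), where V is chi-square with df_err df.
    steps must be even (Simpson's rule).
    """
    phi = NormalDist().cdf
    c, d = sqrt(f_crit), sqrt(ncp)
    half = df_err / 2.0
    log_norm = log(2.0) + half * log(half) - lgamma(half)  # log constant of the density of s

    def integrand(s):
        density = exp(log_norm - half * s * s) * s ** (df_err - 1)
        reject = 1.0 - phi(c * s - d) + phi(-c * s - d)     # P(|Z + d| > c * s)
        return density * reject

    h = s_max / steps
    total = integrand(0.0) + integrand(s_max)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0

power = power_one_df(f_crit=7.709, df_err=4, ncp=2.0)
print(round(power, 3))
```

The result should land close to the 0.195 that MINITAB reports in Figure 11.13. The same
function can be called with a different critical value and df_ε to explore the effect of
adding or removing center cells.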



Example 11.11
Construct a plot of the power versus the total number of runs for 2^3 factorial designs
with n = 1, 2, 4, and 8 replicates and additional center cells for up to N = 80 total runs.
Include main effects, two-factor interactions, and the lack of fit term in the models. The
smallest difference to be detected between ±1 levels is δ = σ_ε. Use α = 0.05. When does
the use of center cells improve the power of a 2^3 experiment?
Solution: The power calculations were done from Stat> Power and Sample Size>
2-Level Factorial Design. The replicates were not blocked, a term for the center cells
was included in the model, and one term corresponding to the three-factor interaction
was omitted from the model. The resulting power values are plotted against the total
number of experimental runs in Figure 11.14. The circles show the 2^3 designs without
center cells. No lack of fit term can be included in their model. The plot shows that when
there are relatively few runs in an experiment, such as when there is just one replicate,
the addition of center cells can increase the power; however, the main factor that
determines the power is the number of replicates. This analysis confirms that the primary
reason for adding center cells to 2^k designs is to allow for a test of lack of linear fit.

Sample Size to Quantify Effects


Sample-size calculations to quantify effects for the 2^k and 2^(k–p) plus centers designs are very
similar to those described in Section 9.10.2. A single example is offered here that only
differs from the calculations shown in that section in the management of the center cells.

Figure 11.14 Power versus total number of runs for one to eight replicates of the 2^3 plus
centers design. [Axes: power (0.0 to 1.0) versus total number of runs (0 to 80); curves
labeled 1(2^3), 2(2^3), 4(2^3), and 8(2^3) + centers.]

Example 11.12
How many replicates of a 2^4 design with three centers (19 runs total) are required
to quantify the slope associated with a design variable to within ±0.01 with 95 percent
confidence if the standard error is expected to be σ_ε = 0.02?
Solution: Since the center cells don't contribute to SS_x then for a single replicate of
the 19-run design:

    SS_x = 2^{4-1} (-1)^2 + 3 (0)^2 + 2^{4-1} (+1)^2 = 16

Then the number of replicates (n) must meet the condition given by Equation 11.14. As
a first guess, if there are enough error degrees of freedom that t_{0.025} ≈ z_{0.025}, then:

    n \geq \frac{1}{SS_x} \left( \frac{t_{\alpha/2} \sigma_\varepsilon}{\delta} \right)^2
      \geq \frac{1}{16} \left( \frac{1.960 \times 0.02}{0.01} \right)^2
      \geq 0.96

which of course rounds up to n = 1. The model including main effects, two-factor
interactions, and a generic curvature term will require df_model = 4 + 6 + 1 = 11 degrees of
freedom which, with just n = 1 replicate, would only leave df_ε = 19 – 1 – 11 = 7 error
degrees of freedom. This is too few degrees of freedom to satisfy t_{0.025} ≈ z_{0.025} so
further iterations are required. With n = 2 replicates there will be df_ε = 2(19) – 1 – 11 = 26
error degrees of freedom so t_{0.025,26} = 2.056 which gives:

    n \geq \frac{1}{16} \left( \frac{2.056 \times 0.02}{0.01} \right)^2
      \geq 1.06

Consequently, n = 2 replicates will be sufficient to determine the slope of a design
variable to within ±0.01 with 95 percent confidence.
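
The trial-and-error iteration in this example can be automated. The sketch below is one
possible implementation under stated assumptions, not a MINITAB feature: it steps n upward
until Equation 11.14 is satisfied, recomputing df_ε from Equation 11.11 each time. To keep
it dependency-free, the t critical value comes from a series expansion in 1/df around the
normal quantile, which is an approximation that is adequate for the degrees of freedom met
here. The function names and the n_max limit are hypothetical.

```python
from statistics import NormalDist

def t_quantile(p, df):
    """Approximate t quantile from a series in 1/df around the normal quantile.

    An approximation, not an exact inverse CDF; accuracy degrades for very small df.
    """
    z = NormalDist().inv_cdf(p)
    g1 = (z**3 + z) / 4
    g2 = (5 * z**5 + 16 * z**3 + 3 * z) / 96
    g3 = (3 * z**7 + 19 * z**5 + 17 * z**3 - 15 * z) / 384
    g4 = (79 * z**9 + 776 * z**7 + 1482 * z**5 - 1920 * z**3 - 945 * z) / 92160
    return z + g1 / df + g2 / df**2 + g3 / df**3 + g4 / df**4

def replicates_needed(ss_x, sigma, delta, n_design, df_model, alpha=0.05, n_max=1000):
    """Smallest n with n >= (t * sigma / delta)^2 / SS_x (Equation 11.14)."""
    for n in range(1, n_max + 1):
        df_err = n * n_design - 1 - df_model      # Equation 11.11
        if df_err < 1:
            continue                              # no error degrees of freedom yet
        t = t_quantile(1 - alpha / 2, df_err)
        if n >= (t * sigma / delta) ** 2 / ss_x:
            return n, df_err, t
    raise ValueError("no solution found up to n_max replicates")

# Example 11.12: 2^4 plus three centers, SS_x = 16, 19 runs per replicate, df_model = 11.
n, df_err, t = replicates_needed(ss_x=16, sigma=0.02, delta=0.01,
                                 n_design=19, df_model=11)
print(n, df_err, round(t, 3))
```

With the inputs of Example 11.12 the search rejects n = 1 and stops at n = 2 with 26 error
degrees of freedom, matching the hand calculation.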

11.9.2 Sample Size for 3^k Designs


Consider a 3^k experiment where all k of the variables are quantitative with three evenly
spaced levels in coded units –1, 0, and +1. One replicate of the experiment will have
N_design = 3^k runs so the error degrees of freedom for the experiment will be:

    df_\varepsilon = n 3^k - 1 - df_{model}                                            (11.15)

Since the design is balanced with –1, 0, and +1 for levels of x, then x̄ = 0 and the
spacing between the levels is Δx = 1. In one replicate, the number of observations at each
level is (1/3) 3^k = 3^(k–1). Then SS_x for one replicate is given by:

    SS_x = \sum (x_i - \bar{x})^2
         = 3^{k-1} (-1)^2 + 3^{k-1} (0)^2 + 3^{k-1} (+1)^2
         = 2 \times 3^{k-1}                                                            (11.16)

Then from Equation 11.14:

    n \geq \frac{1}{2 \times 3^{k-1}} \left( \frac{t_{\alpha/2} \sigma_\varepsilon}{\delta} \right)^2     (11.17)

As in other cases, this expression is transcendental and must be solved by iteration;
however, since the 3^k designs involve so many runs and a comparatively small model,
the number of error degrees of freedom is usually large enough that t_{α/2} ≈ z_{α/2}.

Example 11.13
How many replicates of a 3^3 factorial design are required to determine the slope
associated with one of the design variables to within δ = ±0.05 if the standard error of
the model is expected to be σ_ε = 0.20? Use α = 0.05.
Solution: The full quadratic model will require df_model = 3 + 3 + 3 = 9 degrees of
freedom and since one replicate of the design will contain 3^3 = 27 runs, the number
of error degrees of freedom should be large enough that t_{0.025} ≈ z_{0.025}. Then the
number of replicates is given by:

    n \geq \frac{1}{2 \times 3^{k-1}} \left( \frac{t_{\alpha/2} \sigma_\varepsilon}{\delta} \right)^2
      \geq \frac{1}{2 \times 3^2} \left( \frac{1.96 \times 0.20}{0.05} \right)^2
      \geq 3.4

This indicates that n = 4 replicates will be required to determine the slope to within
the specified range of values. The total number of runs in the experiment will be 4 ×
27 = 108.
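
The arithmetic of this example can be reproduced in a few lines. The sketch below is an
illustration only: it obtains SS_x by enumerating the coded runs of the 3^k design rather
than by trusting Equation 11.16, and it uses the normal quantile in place of t, as the
example does. The function names are hypothetical.

```python
from itertools import product
from math import ceil
from statistics import NormalDist

def ss_x_three_level(k):
    """SS_x of one design variable over one replicate of a 3^k design (its mean is 0)."""
    return sum(run[0] ** 2 for run in product((-1, 0, 1), repeat=k))

def replicate_bound(ss_x, sigma, delta, alpha=0.05):
    """Right-hand side of Equation 11.14 with z used in place of t (large-df assumption)."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return (z * sigma / delta) ** 2 / ss_x

k = 3
ss_x = ss_x_three_level(k)                 # enumeration agrees with 2 * 3^(k-1)
bound = replicate_bound(ss_x, sigma=0.20, delta=0.05)
n = ceil(bound)
total_runs = n * 3**k
df_err = total_runs - 1 - 9                # full quadratic model uses 9 df
print(ss_x, round(bound, 1), n, total_runs, df_err)
```

With n = 4 there are 98 error degrees of freedom, which supports the use of z in place of t.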

11.9.3 Sample Size for Box-Behnken Designs


The total number of runs in one replicate of a k variable Box-Behnken design where
3 ≤ k ≤ 5 is given by:

    N_{design} = 4 \binom{k}{2} + n_0                                                  (11.18)

where n_0 is the number of center points. One third of the non-center points will be run
at each of the three levels of x. Then SS_x for one replicate is given by:

    SS_x = \sum (x_i - \bar{x})^2
         = \frac{4}{3} \binom{k}{2} (-1)^2
           + \left( \frac{4}{3} \binom{k}{2} + n_0 \right) (0)^2
           + \frac{4}{3} \binom{k}{2} (+1)^2
         = \frac{8}{3} \binom{k}{2}                                                    (11.19)

The number of replicates is given by:

    n \geq \frac{1}{\frac{8}{3} \binom{k}{2}} \left( \frac{t_{\alpha/2} \sigma_\varepsilon}{\delta} \right)^2     (11.20)

where the number of degrees of freedom for the t distribution is:

    df_\varepsilon = n \left( 4 \binom{k}{2} + n_0 \right) - 1 - df_{model}            (11.21)


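The sample-size condition is transcendental but when the number of error degrees of
freedom is large enough it is safe to take t_{α/2} ≈ z_{α/2}.
Equation 11.19 relies on one third of the non-center points falling at each level of x,
which is exact for k = 3. For k = 4 and 5 each variable sits at ±1 in only 4(k – 1) of the
runs counted by Equation 11.18, so confirm SS_x from the design matrix before using
Equation 11.20.
When a Box-Behnken design with six or more variables is used, Equation 11.18 no
longer gives the total number of runs in the design and unique calculations for SS_x and
df_ε must be done.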

Example 11.14
Find the number of replicates required if a Box-Behnken design is used for the
situation presented in Example 11.13.
Solution: One replicate of the BB(3) design requires only 15 runs, but if three or
more replicates are required there will be enough error degrees of freedom that it is
safe to take t_{0.025} ≈ z_{0.025} = 1.96. Then the number of replicates is given by:

    n \geq \frac{1}{\frac{8}{3} \binom{k}{2}} \left( \frac{t_{\alpha/2} \sigma_\varepsilon}{\delta} \right)^2
      \geq \frac{1}{\frac{8}{3} \binom{3}{2}} \left( \frac{1.96 \times 0.20}{0.05} \right)^2
      \geq 7.7

This indicates that the experiment will require n = 8 replicates. The total number of runs
in the experiment will be 8 × 15 = 120.
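
The BB(3) result can be checked by building the design matrix directly. The sketch below
is an illustration under stated assumptions: it constructs the design from all pairs of
variables at ±1 with the remaining variable at zero plus three center points (an assumed
n_0 that gives the 15 runs quoted above), computes SS_x from the runs, and uses z in place
of t as the example does. It is exercised here only for k = 3. The function names are
hypothetical.

```python
from itertools import combinations, product
from math import ceil
from statistics import NormalDist

def box_behnken_pairs(k, n_center):
    """Runs with each pair of variables at +/-1 and the others at 0, plus center points."""
    runs = []
    for i, j in combinations(range(k), 2):
        for a, b in product((-1, 1), repeat=2):
            run = [0] * k
            run[i], run[j] = a, b
            runs.append(tuple(run))
    return runs + [(0,) * k] * n_center

def ss_x(design, column=0):
    """Sum of squares of one design variable about its mean over one replicate."""
    xs = [run[column] for run in design]
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** 2 for x in xs)

design = box_behnken_pairs(k=3, n_center=3)
z = NormalDist().inv_cdf(0.975)
bound = (z * 0.20 / 0.05) ** 2 / ss_x(design)   # Equation 11.14 with z for t
n = ceil(bound)
print(len(design), ss_x(design), round(bound, 1), n, n * len(design))
```

For k = 3 the enumerated SS_x equals the value given by Equation 11.19, and the bound of
about 7.7 rounds up to the 8 replicates and 120 runs found above.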

Example 11.15
How many replicates of a BB(6) design are required to determine the slope associated
with one of the design variables to within δ = 4 if the standard error of the model
is expected to be σ_ε = 6? Use α = 0.05.
Solution: From the design matrix for the BB(6) design in Table 11.3, page 445,
there are 24 runs with each design variable at its –1 level, 24 runs with each design
variable at its +1 level, and 32 runs with each design variable at its zero level. [Each
row like (+1 +1 0 +1 0 0) consists of eight runs and there are eight center cells.] SS_x for
one replicate is given by:

    SS_x = 24 (-1)^2 + 32 (0)^2 + 24 (1)^2 = 48

The number of replicates must meet the condition:

    n \geq \frac{1}{48} \left( \frac{t_{\alpha/2} \sigma_\varepsilon}{\delta} \right)^2

With t_{0.025} ≈ z_{0.025} the sample size is:

    n \geq \frac{1}{48} \left( \frac{1.96 \times 6}{4} \right)^2 = 0.18

which of course rounds up to n = 1. The model will consume 6 + 15 + 6 = 27 degrees
of freedom so with n = 1 replicate there will be df_ε = 80 – 1 – 27 = 52 error
degrees of freedom. This means the t_{0.025} ≈ z_{0.025} condition is satisfied and the
sample-size calculation is valid.

11.9.4 Sample Size for Central Composite Designs


The number of runs in one replicate of a central composite design is given by:

N design = ncube + n0 + nstar (11.22)

With respect to one design variable, there will be one star point at its –h level, one
star point at its +h level, n0 + nstar – 2 points at its center (0) level, ncube/2 points at its
–1 level, and ncube/2 points at its +1 level. Then SSx for one replicate of a central com-
posite design is given by:

$$
\begin{aligned}
SS_x &= \sum (x_i - \bar{x})^2 \\
     &= (-\eta)^2 + \tfrac{1}{2} n_{cube} (-1)^2 + (n_0 + n_{star} - 2)(0)^2 + \tfrac{1}{2} n_{cube} (+1)^2 + (+\eta)^2 \\
     &= 2\eta^2 + n_{cube}
\end{aligned}
\tag{11.23}
$$

and the sample size must meet the condition:


$$
n \ge \frac{1}{2\eta^2 + n_{cube}} \left( \frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta} \right)^2
\tag{11.24}
$$
This condition is transcendental, because tα/2 itself depends on n through the error
degrees of freedom, and n must be rounded up to the nearest integer. When the experiment
design is large enough and the error degrees of freedom are large it is safe
to take tα/2 ≈ zα/2.

Example 11.16
Find the number of replicates required if a central composite design is used for the
situation presented in Example 11.13.
Solution: The CC(2^3) design requires ncube = 8, nstar = 6, and n0 = 6 for a total of
Ndesign = 20 runs in one replicate. The star point position is given by η = 1.682. If the
number of replicates is large enough then it is safe to take t0.025 ≈ 1.96 so that the
number of replicates must meet the condition:

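$$
\begin{aligned}
n &\ge \frac{1}{2\eta^2 + n_{cube}} \left( \frac{t_{\alpha/2}\,\sigma_\varepsilon}{\delta} \right)^2 \\
  &\ge \frac{1}{2(1.682)^2 + 8} \left( \frac{1.96 \times 0.20}{0.05} \right)^2 \\
  &\ge 4.5
\end{aligned}
$$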

This indicates that the experiment will require n = 5 replicates. The total number of runs
in the experiment will be 5 × 20 = 100 and there will be dfε = 100 – 1 – 9 = 90 error
degrees of freedom. Notice that where the 3^3 and BB(3) designs had their extreme levels
at x = ±1, the central composite design has its cube points at x = ±1 but its star points
lie further outside the cube at x = ±η. If these extreme levels are too far away from the
design center, it may be necessary to redefine the actual physical levels corresponding
to the coded levels of x. This will have an effect on the ability of the experiment to
determine the slopes associated with the different variables.
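
As a cross-check on Equation 11.23, SSx can also be computed directly from the coded design matrix rather than from the closed form. The following Python sketch builds one replicate of a central composite design and repeats the replicate calculation; the helper names `central_composite_points` and `ss_x` are hypothetical, the values σε = 0.20 and δ = 0.05 are taken from the worked equation above, and t0.025 is again replaced by z0.025 as an assumption.

```python
import math
from itertools import product
from statistics import NormalDist


def central_composite_points(k, eta, n_center):
    """Coded runs for one replicate: 2^k cube points, 2k star points, centers."""
    cube = [list(p) for p in product((-1.0, 1.0), repeat=k)]
    star = []
    for i in range(k):
        for level in (-eta, eta):
            point = [0.0] * k
            point[i] = level          # only variable i leaves the center
            star.append(point)
    centers = [[0.0] * k for _ in range(n_center)]
    return cube + star + centers


def ss_x(points, column):
    """Sum of squared deviations of one design variable about its mean."""
    values = [p[column] for p in points]
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


runs = central_composite_points(k=3, eta=1.682, n_center=6)
ss = ss_x(runs, column=0)                 # compare with 2*eta^2 + n_cube
z = NormalDist().inv_cdf(0.975)           # large-sample stand-in for t_0.025
n_exact = (z * 0.20 / 0.05) ** 2 / ss
print(len(runs), round(ss, 3), round(n_exact, 1), math.ceil(n_exact))
```

Computing SSx from the design matrix has the advantage that it still works if the coded levels are later redefined or if some runs are dropped.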

11.10 DESIGN CONSIDERATIONS FOR RESPONSE-SURFACE EXPERIMENTS

• An experiment should have enough error degrees of freedom to provide a good
error estimate but not so many that it is wasteful of resources. Often a single
replicate of the appropriate design is sufficient.
• If it is difficult to obtain the required five levels for a central composite design
then one of the three-level designs (3^k or BB(k)) might be easier to build.
• If your experiment has three variables, a Box-Behnken design (15 runs) is more
economical than a central composite design (20 runs).
• If your experiment has five variables, a central composite design (32 runs) is
more economical than a Box-Behnken design (43 runs).
• If you don’t know safe upper and lower limits for all of the design variables,
use a central composite design instead of a Box-Behnken design. Position the
cube points inside the known safe region of the design space and let the star
points fall in questionable territory. If any or all of the star points of the central
composite design are lost, the remaining factorial or fractional factorial design
with centers can still be analyzed for main effects, two-factor interactions,
and lack of fit.
• If you know the safe upper and lower limits for all of the design variables, use
a Box-Behnken design instead of a central composite design. The Box-Behnken
design puts points further from the center of the design space so the power of
the design to detect small effects is greater than the power provided by the
central composite design.
• Build a central composite design in blocks. If you are uncertain about the safety
of the extreme levels of the design variables, build the star points first to
demonstrate that they are all safe. If you don’t know that a quadratic model is
really necessary, build the factorial plus centers block first, test it for lack of
linear fit (that is, curvature), and then decide whether it’s necessary to build
the block of star points. Get commitment from management to build the full
experiment at the beginning of the project but build it in blocks anyway. If you
have to build the full experiment then it’s planned and budgeted for. If you
don’t have to build the block of star points, you can declare victory sooner
and at a lower cost.
• Use blocks to introduce a qualitative variable into a Box-Behnken design,
but randomize over all runs so that it is safe to make claims about differences
between the levels of the variable.
Bibliography

Agresti, A. 2002. Categorical Data Analysis. 2nd ed. Hoboken, NJ: John Wiley & Sons.
AIAG. 2002. Measurement Systems Analysis. 3rd ed. Automotive Industry Action Group.
Bhote, K., and A. Bhote. 2000. World Class Quality: Using Design of Experiments to Make It
Happen. 2nd ed. New York: AMACOM.
Box, G. E. P., and D. W. Behnken. 1960. Some New Three Level Designs for the Study of
Quantitative Variables. Technometrics 2, no. 4: 455–475.
Box, G. E. P., W. G. Hunter, and J. S. Hunter. 1978. Statistics for Experimenters: An
Introduction to Design, Data Analysis, and Model Building. New York: John Wiley &
Sons.
Christensen, R. 1997. Log-Linear Models and Logistic Regression. New York: Springer-
Verlag.
Davies, O. 1963. The Design and Analysis of Industrial Experiments, 2nd ed. New York:
Hafner.
Freund, J. E., and R. E. Walpole. 1980. Mathematical Statistics. 3rd ed. New Jersey: Prentice-
Hall.
Hicks, C. R. 1993. Fundamental Concepts in the Design of Experiments. 4th ed. New York:
Saunders College Publishing.
Hoaglin, D. C., F. Mosteller, and J. W. Tukey. 1991. Fundamentals of Exploratory Analysis of
Variance. New York: John Wiley & Sons.
Kevles, D. J. 1998. The Baltimore Case. New York: W. W. Norton and Company.
Montgomery, D. C. 1991. Design and Analysis of Experiments. 3rd ed. New York: John Wiley
& Sons.
———. 1997. Introduction to Statistical Quality Control. 3rd ed. New York: John Wiley &
Sons.
Neter, J., M. H. Kutner, C. J. Nachtsheim, and W. Wasserman. 1996. Applied Linear Statistical
Models. 4th ed. Boston: McGraw-Hill.
Ostle, B., K. Turner, C. Hicks, and G. McElrath. 1996. Engineering Statistics: The Industrial
Experience. Belmont, CA: Duxbury Press.

Ross, P. J. 1988. Taguchi Techniques for Quality Engineering. New York: McGraw-Hill.
Sagan, C. 1996. A Demon-Haunted World: Science as a Candle in the Dark. New York:
Random House.
Sokal, R. R., and F. J. Rohlf. 1995. Biometry: The Principles and Practice of Statistics in
Biological Research. 3rd ed. New York: W. H. Freeman and Co.
Tufte, E. 1983. The Visual Display of Quantitative Information. Cheshire, CT: Graphics Press.
Weaver et al. 1986. Altered Repertoire of Endogenous Immunoglobulin Gene Expression in
Transgenic Mice Containing a Rearranged Mu Heavy Chain Gene. Cell 45: 247–59.
Appendix A
Statistical Tables

Table A.1 Greek characters.

Name      Lower Case   Upper Case   Arabic
alpha     α            Α            a
beta      β            Β            b
gamma     γ            Γ            g
delta     δ            Δ            d
epsilon   ϵ or ε       Ε            e
zeta      ζ            Ζ            z
eta       η            Η            h
theta     θ            Θ            y
iota      ι            Ι            i
kappa     κ            Κ            k
lambda    λ            Λ            l
mu        μ            Μ            m
nu        ν            Ν            n
xi        ξ            Ξ            x
pi        π            Π            p
rho       ρ            Ρ            r
sigma     σ            Σ            s
tau       τ            Τ            t
upsilon   υ            ϒ            u
phi       φ or ϕ       Φ            f
chi       χ            Χ            q
psi       ψ            Ψ            c
omega     ω            Ω            w


Table A.2 Normal Distribution: Values of p = Φ (–∞ < z < zp).


0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
–3.00 0.0013 0.0013 0.0013 0.0012 0.0012 0.0011 0.0011 0.0011 0.0010 0.0010
–2.90 0.0019 0.0018 0.0018 0.0017 0.0016 0.0016 0.0015 0.0015 0.0014 0.0014
–2.80 0.0026 0.0025 0.0024 0.0023 0.0023 0.0022 0.0021 0.0021 0.0020 0.0019
–2.70 0.0035 0.0034 0.0033 0.0032 0.0031 0.0030 0.0029 0.0028 0.0027 0.0026
–2.60 0.0047 0.0045 0.0044 0.0043 0.0041 0.0040 0.0039 0.0038 0.0037 0.0036
–2.50 0.0062 0.0060 0.0059 0.0057 0.0055 0.0054 0.0052 0.0051 0.0049 0.0048
–2.40 0.0082 0.0080 0.0078 0.0075 0.0073 0.0071 0.0069 0.0068 0.0066 0.0064
–2.30 0.0107 0.0104 0.0102 0.0099 0.0096 0.0094 0.0091 0.0089 0.0087 0.0084
–2.20 0.0139 0.0136 0.0132 0.0129 0.0125 0.0122 0.0119 0.0116 0.0113 0.0110
–2.10 0.0179 0.0174 0.0170 0.0166 0.0162 0.0158 0.0154 0.0150 0.0146 0.0143
–2.00 0.0228 0.0222 0.0217 0.0212 0.0207 0.0202 0.0197 0.0192 0.0188 0.0183

–1.90 0.0287 0.0281 0.0274 0.0268 0.0262 0.0256 0.0250 0.0244 0.0239 0.0233
–1.80 0.0359 0.0351 0.0344 0.0336 0.0329 0.0322 0.0314 0.0307 0.0301 0.0294
–1.70 0.0446 0.0436 0.0427 0.0418 0.0409 0.0401 0.0392 0.0384 0.0375 0.0367
–1.60 0.0548 0.0537 0.0526 0.0516 0.0505 0.0495 0.0485 0.0475 0.0465 0.0455
–1.50 0.0668 0.0655 0.0643 0.0630 0.0618 0.0606 0.0594 0.0582 0.0571 0.0559
–1.40 0.0808 0.0793 0.0778 0.0764 0.0749 0.0735 0.0721 0.0708 0.0694 0.0681
–1.30 0.0968 0.0951 0.0934 0.0918 0.0901 0.0885 0.0869 0.0853 0.0838 0.0823
–1.20 0.1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985
–1.10 0.1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 0.1170
–1.00 0.1587 0.1562 0.1539 0.1515 0.1492 0.1469 0.1446 0.1423 0.1401 0.1379

–0.90 0.1841 0.1814 0.1788 0.1762 0.1736 0.1711 0.1685 0.1660 0.1635 0.1611
–0.80 0.2119 0.2090 0.2061 0.2033 0.2005 0.1977 0.1949 0.1922 0.1894 0.1867
–0.70 0.2420 0.2389 0.2358 0.2327 0.2296 0.2266 0.2236 0.2206 0.2177 0.2148
–0.60 0.2743 0.2709 0.2676 0.2643 0.2611 0.2578 0.2546 0.2514 0.2483 0.2451
–0.50 0.3085 0.3050 0.3015 0.2981 0.2946 0.2912 0.2877 0.2843 0.2810 0.2776
–0.40 0.3446 0.3409 0.3372 0.3336 0.3300 0.3264 0.3228 0.3192 0.3156 0.3121
–0.30 0.3821 0.3783 0.3745 0.3707 0.3669 0.3632 0.3594 0.3557 0.3520 0.3483
–0.20 0.4207 0.4168 0.4129 0.4090 0.4052 0.4013 0.3974 0.3936 0.3897 0.3859
–0.10 0.4602 0.4562 0.4522 0.4483 0.4443 0.4404 0.4364 0.4325 0.4286 0.4247
–0.00 0.5000 0.4960 0.4920 0.4880 0.4840 0.4801 0.4761 0.4721 0.4681 0.4641
0.00 0.5000 0.5040 0.5080 0.5120 0.5160 0.5199 0.5239 0.5279 0.5319 0.5359
0.10 0.5398 0.5438 0.5478 0.5517 0.5557 0.5596 0.5636 0.5675 0.5714 0.5753
0.20 0.5793 0.5832 0.5871 0.5910 0.5948 0.5987 0.6026 0.6064 0.6103 0.6141
0.30 0.6179 0.6217 0.6255 0.6293 0.6331 0.6368 0.6406 0.6443 0.6480 0.6517
0.40 0.6554 0.6591 0.6628 0.6664 0.6700 0.6736 0.6772 0.6808 0.6844 0.6879
0.50 0.6915 0.6950 0.6985 0.7019 0.7054 0.7088 0.7123 0.7157 0.7190 0.7224
0.60 0.7257 0.7291 0.7324 0.7357 0.7389 0.7422 0.7454 0.7486 0.7517 0.7549
0.70 0.7580 0.7611 0.7642 0.7673 0.7704 0.7734 0.7764 0.7794 0.7823 0.7852
0.80 0.7881 0.7910 0.7939 0.7967 0.7995 0.8023 0.8051 0.8078 0.8106 0.8133
0.90 0.8159 0.8186 0.8212 0.8238 0.8264 0.8289 0.8315 0.8340 0.8365 0.8389

1.00 0.8413 0.8438 0.8461 0.8485 0.8508 0.8531 0.8554 0.8577 0.8599 0.8621
1.10 0.8643 0.8665 0.8686 0.8708 0.8729 0.8749 0.8770 0.8790 0.8810 0.8830
1.20 0.8849 0.8869 0.8888 0.8907 0.8925 0.8944 0.8962 0.8980 0.8997 0.9015
1.30 0.9032 0.9049 0.9066 0.9082 0.9099 0.9115 0.9131 0.9147 0.9162 0.9177
1.40 0.9192 0.9207 0.9222 0.9236 0.9251 0.9265 0.9279 0.9292 0.9306 0.9319
1.50 0.9332 0.9345 0.9357 0.9370 0.9382 0.9394 0.9406 0.9418 0.9429 0.9441
1.60 0.9452 0.9463 0.9474 0.9484 0.9495 0.9505 0.9515 0.9525 0.9535 0.9545
1.70 0.9554 0.9564 0.9573 0.9582 0.9591 0.9599 0.9608 0.9616 0.9625 0.9633
1.80 0.9641 0.9649 0.9656 0.9664 0.9671 0.9678 0.9686 0.9693 0.9699 0.9706
1.90 0.9713 0.9719 0.9726 0.9732 0.9738 0.9744 0.9750 0.9756 0.9761 0.9767

2.00 0.9772 0.9778 0.9783 0.9788 0.9793 0.9798 0.9803 0.9808 0.9812 0.9817
2.10 0.9821 0.9826 0.9830 0.9834 0.9838 0.9842 0.9846 0.9850 0.9854 0.9857
2.20 0.9861 0.9864 0.9868 0.9871 0.9875 0.9878 0.9881 0.9884 0.9887 0.9890
2.30 0.9893 0.9896 0.9898 0.9901 0.9904 0.9906 0.9909 0.9911 0.9913 0.9916
2.40 0.9918 0.9920 0.9922 0.9925 0.9927 0.9929 0.9931 0.9932 0.9934 0.9936
2.50 0.9938 0.9940 0.9941 0.9943 0.9945 0.9946 0.9948 0.9949 0.9951 0.9952
2.60 0.9953 0.9955 0.9956 0.9957 0.9959 0.9960 0.9961 0.9962 0.9963 0.9964
2.70 0.9965 0.9966 0.9967 0.9968 0.9969 0.9970 0.9971 0.9972 0.9973 0.9974
2.80 0.9974 0.9975 0.9976 0.9977 0.9977 0.9978 0.9979 0.9979 0.9980 0.9981
2.90 0.9981 0.9982 0.9982 0.9983 0.9984 0.9984 0.9985 0.9985 0.9986 0.9986
3.00 0.9987 0.9987 0.9987 0.9988 0.9988 0.9989 0.9989 0.9989 0.9990 0.9990

p 0.001 0.0025 0.005 0.01 0.025 0.05 0.10


Zp 3.09 2.81 2.575 2.33 1.96 1.645 1.28
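
When a value falls between the tabulated entries, the same quantities can be computed rather than interpolated. The sketch below uses Python's standard-library `statistics.NormalDist` to reproduce a few body entries (left-tail areas) and the critical values zp for right-tail areas; the particular z and p values chosen are only illustrations.

```python
from statistics import NormalDist

std_normal = NormalDist()  # mean 0, standard deviation 1

# Left-tail areas p = Phi(z), as tabulated in the body of Table A.2
for z in (-1.96, 0.00, 1.00, 1.645):
    print(f"z = {z:6.3f}  p = {std_normal.cdf(z):.4f}")

# Critical values z_p for selected right-tail areas p
for p in (0.001, 0.0025, 0.005, 0.01, 0.025, 0.05, 0.10):
    print(f"p = {p:<6}  z_p = {std_normal.inv_cdf(1 - p):.3f}")
```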

Table A.3 Student’s t Distribution: Values of tp where P (tp < t < ∞) = p.


p
m 0.001 0.0025 0.005 0.01 0.025 0.05 0.10
1 318.289 127.321 63.656 31.821 12.706 6.314 3.078
2 22.328 14.089 9.925 6.965 4.303 2.920 1.886
3 10.214 7.453 5.841 4.541 3.182 2.353 1.638
4 7.173 5.598 4.604 3.747 2.776 2.132 1.533
5 5.894 4.773 4.032 3.365 2.571 2.015 1.476
6 5.208 4.317 3.707 3.143 2.447 1.943 1.440
7 4.785 4.029 3.499 2.998 2.365 1.895 1.415
8 4.501 3.833 3.355 2.896 2.306 1.860 1.397
9 4.297 3.690 3.250 2.821 2.262 1.833 1.383
10 4.144 3.581 3.169 2.764 2.228 1.812 1.372
11 4.025 3.497 3.106 2.718 2.201 1.796 1.363
12 3.930 3.428 3.055 2.681 2.179 1.782 1.356
13 3.852 3.372 3.012 2.650 2.160 1.771 1.350
14 3.787 3.326 2.977 2.624 2.145 1.761 1.345
15 3.733 3.286 2.947 2.602 2.131 1.753 1.341
16 3.686 3.252 2.921 2.583 2.120 1.746 1.337
17 3.646 3.222 2.898 2.567 2.110 1.740 1.333
18 3.610 3.197 2.878 2.552 2.101 1.734 1.330
19 3.579 3.174 2.861 2.539 2.093 1.729 1.328
20 3.552 3.153 2.845 2.528 2.086 1.725 1.325
21 3.527 3.135 2.831 2.518 2.080 1.721 1.323
22 3.505 3.119 2.819 2.508 2.074 1.717 1.321
23 3.485 3.104 2.807 2.500 2.069 1.714 1.319
24 3.467 3.091 2.797 2.492 2.064 1.711 1.318
25 3.450 3.078 2.787 2.485 2.060 1.708 1.316
26 3.435 3.067 2.779 2.479 2.056 1.706 1.315
27 3.421 3.057 2.771 2.473 2.052 1.703 1.314
28 3.408 3.047 2.763 2.467 2.048 1.701 1.313
29 3.396 3.038 2.756 2.462 2.045 1.699 1.311
30 3.385 3.030 2.750 2.457 2.042 1.697 1.310
40 3.307 2.971 2.704 2.423 2.021 1.684 1.303
50 3.261 2.937 2.678 2.403 2.009 1.676 1.299
60 3.232 2.915 2.660 2.390 2.000 1.671 1.296
80 3.195 2.887 2.639 2.374 1.990 1.664 1.292
100 3.174 2.871 2.626 2.364 1.984 1.660 1.290
∞ 3.090 2.807 2.576 2.326 1.960 1.645 1.282

Table A.4 χ² Distribution: Values of χ²p where P (0 < χ² < χ²p) = p.


p
m 0.005 0.01 0.025 0.05 0.1 0.9 0.95 0.975 0.99 0.995
1 0.00 0.00 0.00 0.00 0.02 2.71 3.84 5.02 6.63 7.88
2 0.01 0.02 0.05 0.10 0.21 4.61 5.99 7.38 9.21 10.60
3 0.07 0.11 0.22 0.35 0.58 6.25 7.81 9.35 11.34 12.84
4 0.21 0.30 0.48 0.71 1.06 7.78 9.49 11.14 13.28 14.86
5 0.41 0.55 0.83 1.15 1.61 9.24 11.07 12.83 15.09 16.75
6 0.68 0.87 1.24 1.64 2.20 10.64 12.59 14.45 16.81 18.55
7 0.99 1.24 1.69 2.17 2.83 12.02 14.07 16.01 18.48 20.28
8 1.34 1.65 2.18 2.73 3.49 13.36 15.51 17.53 20.09 21.95
9 1.73 2.09 2.70 3.33 4.17 14.68 16.92 19.02 21.67 23.59
10 2.16 2.56 3.25 3.94 4.87 15.99 18.31 20.48 23.21 25.19
11 2.60 3.05 3.82 4.57 5.58 17.28 19.68 21.92 24.73 26.76
12 3.07 3.57 4.40 5.23 6.30 18.55 21.03 23.34 26.22 28.30
13 3.57 4.11 5.01 5.89 7.04 19.81 22.36 24.74 27.69 29.82
14 4.07 4.66 5.63 6.57 7.79 21.06 23.68 26.12 29.14 31.32
15 4.60 5.23 6.26 7.26 8.55 22.31 25.00 27.49 30.58 32.80
16 5.14 5.81 6.91 7.96 9.31 23.54 26.30 28.85 32.00 34.27
17 5.70 6.41 7.56 8.67 10.09 24.77 27.59 30.19 33.41 35.72
18 6.26 7.01 8.23 9.39 10.86 25.99 28.87 31.53 34.81 37.16
19 6.84 7.63 8.91 10.12 11.65 27.20 30.14 32.85 36.19 38.58
20 7.43 8.26 9.59 10.85 12.44 28.41 31.41 34.17 37.57 40.00
21 8.03 8.90 10.28 11.59 13.24 29.62 32.67 35.48 38.93 41.40
22 8.64 9.54 10.98 12.34 14.04 30.81 33.92 36.78 40.29 42.80
23 9.26 10.20 11.69 13.09 14.85 32.01 35.17 38.08 41.64 44.18
24 9.89 10.86 12.40 13.85 15.66 33.20 36.42 39.36 42.98 45.56
25 10.52 11.52 13.12 14.61 16.47 34.38 37.65 40.65 44.31 46.93
26 11.16 12.20 13.84 15.38 17.29 35.56 38.89 41.92 45.64 48.29
27 11.81 12.88 14.57 16.15 18.11 36.74 40.11 43.19 46.96 49.65
28 12.46 13.56 15.31 16.93 18.94 37.92 41.34 44.46 48.28 50.99
29 13.12 14.26 16.05 17.71 19.77 39.09 42.56 45.72 49.59 52.34
30 13.79 14.95 16.79 18.49 20.60 40.26 43.77 46.98 50.89 53.67
35 17.19 18.51 20.57 22.47 24.80 46.06 49.80 53.20 57.34 60.27
40 20.71 22.16 24.43 26.51 29.05 51.81 55.76 59.34 63.69 66.77
45 24.31 25.90 28.37 30.61 33.35 57.51 61.66 65.41 69.96 73.17
50 27.99 29.71 32.36 34.76 37.69 63.17 67.50 71.42 76.15 79.49
55 31.73 33.57 36.40 38.96 42.06 68.80 73.31 77.38 82.29 85.75
60 35.53 37.48 40.48 43.19 46.46 74.40 79.08 83.30 88.38 91.95
70 43.28 45.44 48.76 51.74 55.33 85.53 90.53 95.02 100.43 104.21
80 51.17 53.54 57.15 60.39 64.28 96.58 101.88 106.63 112.33 116.32
90 59.20 61.75 65.65 69.13 73.29 107.57 113.15 118.14 124.12 128.30
100 67.33 70.06 74.22 77.93 82.36 118.50 124.34 129.56 135.81 140.17
Table A.5 F Distribution: Values of Fp where P (Fp < F < ∞) = p and F = s1²/s2².
F Distribution: Values of F0.05
m1
m2 1 2 3 4 5 6 7 8 9 10 12 15 20 24 30 40 60 120 ∞
1 161.4 199.5 215.7 224.6 230.2 234.0 236.8 238.9 240.5 241.9 243.9 245.9 248.0 249.1 250.1 251.1 252.2 253.3 254.3
2 18.51 19.00 19.16 19.25 19.30 19.33 19.35 19.37 19.38 19.40 19.41 19.43 19.45 19.45 19.46 19.47 19.48 19.49 19.50
3 10.13 9.55 9.28 9.12 9.01 8.94 8.89 8.85 8.81 8.79 8.74 8.70 8.66 8.64 8.62 8.59 8.57 8.55 8.53
4 7.71 6.94 6.59 6.39 6.26 6.16 6.09 6.04 6.00 5.96 5.91 5.86 5.80 5.77 5.75 5.72 5.69 5.66 5.63
5 6.61 5.79 5.41 5.19 5.05 4.95 4.88 4.82 4.77 4.74 4.68 4.62 4.56 4.53 4.50 4.46 4.43 4.40 4.36
6 5.99 5.14 4.76 4.53 4.39 4.28 4.21 4.15 4.10 4.06 4.00 3.94 3.87 3.84 3.81 3.77 3.74 3.70 3.67
7 5.59 4.74 4.35 4.12 3.97 3.87 3.79 3.73 3.68 3.64 3.57 3.51 3.44 3.41 3.38 3.34 3.30 3.27 3.23
8 5.32 4.46 4.07 3.84 3.69 3.58 3.50 3.44 3.39 3.35 3.28 3.22 3.15 3.12 3.08 3.04 3.01 2.97 2.93
9 5.12 4.26 3.86 3.63 3.48 3.37 3.29 3.23 3.18 3.14 3.07 3.01 2.94 2.90 2.86 2.83 2.79 2.75 2.71
10 4.96 4.10 3.71 3.48 3.33 3.22 3.14 3.07 3.02 2.98 2.91 2.85 2.77 2.74 2.70 2.66 2.62 2.58 2.54
11 4.84 3.98 3.59 3.36 3.20 3.09 3.01 2.95 2.90 2.85 2.79 2.72 2.65 2.61 2.57 2.53 2.49 2.45 2.40
12 4.75 3.89 3.49 3.26 3.11 3.00 2.91 2.85 2.80 2.75 2.69 2.62 2.54 2.51 2.47 2.43 2.38 2.34 2.30
13 4.67 3.81 3.41 3.18 3.03 2.92 2.83 2.77 2.71 2.67 2.60 2.53 2.46 2.42 2.38 2.34 2.30 2.25 2.21
14 4.60 3.74 3.34 3.11 2.96 2.85 2.76 2.70 2.65 2.60 2.53 2.46 2.39 2.35 2.31 2.27 2.22 2.18 2.13
15 4.54 3.68 3.29 3.06 2.90 2.79 2.71 2.64 2.59 2.54 2.48 2.40 2.33 2.29 2.25 2.20 2.16 2.11 2.07
20 4.35 3.49 3.10 2.87 2.71 2.60 2.51 2.45 2.39 2.35 2.28 2.20 2.12 2.08 2.04 1.99 1.95 1.90 1.84
25 4.24 3.39 2.99 2.76 2.60 2.49 2.40 2.34 2.28 2.24 2.16 2.09 2.01 1.96 1.92 1.87 1.82 1.77 1.71
30 4.17 3.32 2.92 2.69 2.53 2.42 2.33 2.27 2.21 2.16 2.09 2.01 1.93 1.89 1.84 1.79 1.74 1.68 1.62
40 4.08 3.23 2.84 2.61 2.45 2.34 2.25 2.18 2.12 2.08 2.00 1.92 1.84 1.79 1.74 1.69 1.64 1.58 1.51
60 4.00 3.15 2.76 2.53 2.37 2.25 2.17 2.10 2.04 1.99 1.92 1.84 1.75 1.70 1.65 1.59 1.53 1.47 1.39
120 3.92 3.07 2.68 2.45 2.29 2.18 2.09 2.02 1.96 1.91 1.83 1.75 1.66 1.61 1.55 1.50 1.43 1.35 1.25
∞ 3.84 3.00 2.60 2.37 2.21 2.10 2.01 1.94 1.88 1.83 1.75 1.67 1.57 1.52 1.46 1.39 1.32 1.22 1.00
F Distribution: Values of F0.01
m1
m2 1 2 3 4 5 6 7 8 9 10 12 15 20 24 30 40 60 120 ∞
1 4052 4999 5404 5624 5764 5859 5928 5981 6022 6056 6107 6157 6209 6234 6260 6286 6313 6340 6366
2 98.50 99.00 99.16 99.25 99.30 99.33 99.36 99.38 99.39 99.40 99.42 99.43 99.45 99.46 99.47 99.48 99.48 99.49 99.50
3 34.12 30.82 29.46 28.71 28.24 27.91 27.67 27.49 27.34 27.23 27.05 26.87 26.69 26.60 26.50 26.41 26.32 26.22 26.13
4 21.20 18.00 16.69 15.98 15.52 15.21 14.98 14.80 14.66 14.55 14.37 14.20 14.02 13.93 13.84 13.75 13.65 13.56 13.46
5 16.26 13.27 12.06 11.39 10.97 10.67 10.46 10.29 10.16 10.05 9.89 9.72 9.55 9.47 9.38 9.29 9.20 9.11 9.02
6 13.75 10.92 9.78 9.15 8.75 8.47 8.26 8.10 7.98 7.87 7.72 7.56 7.40 7.31 7.23 7.14 7.06 6.97 6.88
7 12.25 9.55 8.45 7.85 7.46 7.19 6.99 6.84 6.72 6.62 6.47 6.31 6.16 6.07 5.99 5.91 5.82 5.74 5.65
8 11.26 8.65 7.59 7.01 6.63 6.37 6.18 6.03 5.91 5.81 5.67 5.52 5.36 5.28 5.20 5.12 5.03 4.95 4.86
9 10.56 8.02 6.99 6.42 6.06 5.80 5.61 5.47 5.35 5.26 5.11 4.96 4.81 4.73 4.65 4.57 4.48 4.40 4.31
10 10.04 7.56 6.55 5.99 5.64 5.39 5.20 5.06 4.94 4.85 4.71 4.56 4.41 4.33 4.25 4.17 4.08 4.00 3.91
11 9.65 7.21 6.22 5.67 5.32 5.07 4.89 4.74 4.63 4.54 4.40 4.25 4.10 4.02 3.94 3.86 3.78 3.69 3.60
12 9.33 6.93 5.95 5.41 5.06 4.82 4.64 4.50 4.39 4.30 4.16 4.01 3.86 3.78 3.70 3.62 3.54 3.45 3.36
13 9.07 6.70 5.74 5.21 4.86 4.62 4.44 4.30 4.19 4.10 3.96 3.82 3.66 3.59 3.51 3.43 3.34 3.25 3.17
14 8.86 6.51 5.56 5.04 4.69 4.46 4.28 4.14 4.03 3.94 3.80 3.66 3.51 3.43 3.35 3.27 3.18 3.09 3.00
15 8.68 6.36 5.42 4.89 4.56 4.32 4.14 4.00 3.89 3.80 3.67 3.52 3.37 3.29 3.21 3.13 3.05 2.96 2.87
20 8.10 5.85 4.94 4.43 4.10 3.87 3.70 3.56 3.46 3.37 3.23 3.09 2.94 2.86 2.78 2.69 2.61 2.52 2.42
25 7.77 5.57 4.68 4.18 3.85 3.63 3.46 3.32 3.22 3.13 2.99 2.85 2.70 2.62 2.54 2.45 2.36 2.27 2.17

30 7.56 5.39 4.51 4.02 3.70 3.47 3.30 3.17 3.07 2.98 2.84 2.70 2.55 2.47 2.39 2.30 2.21 2.11 2.01
40 7.31 5.18 4.31 3.83 3.51 3.29 3.12 2.99 2.89 2.80 2.66 2.52 2.37 2.29 2.20 2.11 2.02 1.92 1.80
60 7.08 4.98 4.13 3.65 3.34 3.12 2.95 2.82 2.72 2.63 2.50 2.35 2.20 2.12 2.03 1.94 1.84 1.73 1.60
120 6.85 4.79 3.95 3.48 3.17 2.96 2.79 2.66 2.56 2.47 2.34 2.19 2.03 1.95 1.86 1.76 1.66 1.53 1.38
∞ 6.63 4.61 3.78 3.32 3.02 2.80 2.64 2.51 2.41 2.32 2.18 2.04 1.88 1.79 1.70 1.59 1.47 1.32 1


Table A.6 Critical Values for Duncan’s Multiple Range Test (r0.05,p,dfe ).
Number of Means (p)
dfd 2 3 4 5 6 7 8 9 10
1 17.97
2 6.09 6.09
3 4.50 4.52 4.52
4 3.93 4.01 4.03 4.03
5 3.64 3.75 3.80 3.81 3.81
6 3.46 3.59 3.65 3.68 3.69 3.70
7 3.34 3.48 3.55 3.59 3.61 3.62 3.63
8 3.26 3.40 3.48 3.52 3.55 3.57 3.57 3.58
9 3.20 3.34 3.42 3.47 3.50 3.52 3.54 3.54 3.55
10 3.15 3.29 3.38 3.43 3.47 3.49 3.51 3.52 3.52
11 3.11 3.26 3.34 3.40 3.44 3.46 3.48 3.49 3.50
12 3.08 3.23 3.31 3.37 3.41 3.44 3.46 3.47 3.48
13 3.06 3.20 3.29 3.35 3.39 3.42 3.46 3.46 3.47
14 3.03 3.18 3.27 3.33 3.37 3.40 3.43 3.44 3.46
15 3.01 3.16 3.25 3.31 3.36 3.39 3.41 3.43 3.45
16 3.00 3.14 3.23 3.30 3.34 3.38 3.40 3.42 3.44
17 2.98 3.13 3.22 3.28 3.33 3.37 3.39 3.41 3.43
18 2.97 3.12 3.21 3.27 3.32 3.36 3.38 3.40 3.42
19 2.96 3.11 3.20 3.26 3.31 3.35 3.38 3.40 3.41
20 2.95 3.10 3.19 3.25 3.30 3.34 3.37 3.39 3.41
24 2.92 3.07 3.16 3.23 3.28 3.31 3.35 3.37 3.39
30 2.89 3.03 3.13 3.20 3.25 3.29 3.32 3.35 3.37
40 2.86 3.01 3.10 3.17 3.22 3.27 3.30 3.33 3.35
60 2.83 2.98 3.07 3.14 3.20 3.24 3.28 3.31 3.33
120 2.80 2.95 3.04 3.12 3.17 3.22 3.25 3.29 3.31
∞ 2.77 2.92 3.02 3.09 3.15 3.19 3.23 3.27 3.29
Source: Reproduced from H. L. Harter, “Critical Values for Duncan’s Multiple Range Test.” This table contains some corrected
values to those given by D. B. Duncan, “Multiple Range and Multiple F Tests,” Biometrics 1, no. 1 (1955): 1–42.

Table A.7 Critical Values of the Studentized Range Distribution (Q0.05 (k)).
Number of Means (k)
dfd 2 3 4 5 6 7 8 9 10
1 17.970 26.980 32.820 37.080 40.410 43.120 45.400 47.360 49.070
2 6.085 8.331 9.798 10.880 11.740 12.440 13.030 13.540 13.990
3 4.501 5.910 6.825 7.502 8.037 8.478 8.853 9.177 9.462
4 3.927 5.040 5.757 6.287 6.707 7.053 7.347 7.602 7.826
5 3.635 4.602 5.218 5.673 6.033 6.330 6.582 6.802 6.995
6 3.461 4.339 4.896 5.305 5.628 5.895 6.122 6.319 6.493
7 3.344 4.165 4.681 5.060 5.359 5.606 5.815 5.998 6.158
8 3.261 4.041 4.529 4.886 5.167 5.399 5.597 5.767 5.918
9 3.199 3.949 4.415 4.756 5.024 5.244 5.432 5.595 5.739
10 3.151 3.877 4.327 4.654 4.912 5.124 5.305 5.461 5.599
11 3.113 3.820 4.256 4.574 4.823 5.028 5.202 5.353 5.487
12 3.082 3.773 4.199 4.508 4.751 4.950 5.119 5.265 5.395
13 3.055 3.735 4.151 4.453 4.690 4.885 5.049 5.192 5.318
14 3.033 3.702 4.111 4.407 4.639 4.829 4.990 5.131 5.254
15 3.014 3.674 4.076 4.367 4.595 4.782 4.940 5.077 5.198
16 2.998 3.649 4.046 4.333 4.557 4.741 4.897 5.031 5.150
17 2.984 3.628 4.020 4.303 4.524 4.705 4.858 4.991 5.108
18 2.971 3.609 3.997 4.277 4.495 4.673 4.824 4.956 5.071
19 2.960 3.593 3.977 4.253 4.469 4.645 4.794 4.924 5.038
20 2.950 3.578 3.958 4.232 4.445 4.620 4.768 4.896 5.008
24 2.919 3.532 3.901 4.166 4.373 4.541 4.684 4.807 4.915
30 2.888 3.486 3.845 4.102 4.302 4.464 4.602 4.720 4.824
40 2.858 3.442 3.791 4.039 4.232 4.389 4.521 4.635 4.735
60 2.829 3.399 3.737 3.977 4.163 4.314 4.441 4.550 4.646
120 2.800 3.356 3.685 3.917 4.096 4.241 4.363 4.468 4.560
∞ 2.772 3.314 3.633 3.858 4.030 4.170 4.286 4.387 4.474
Source: Adapted from H. L. Harter. Order Statistics and Their Use in Testing and Estimation, Volume 1: Tests Based on Range
and Studentized Range of Samples from a Normal Population. Washington, DC: U.S. Government Printing Office, 1969.

Table A.8 Critical Values for the One-Way Analysis of Means (h0.05,k,dfe ).
Number of Treatments (k)
dfd 3 4 5 6 7 8 9 10
3 4.18
4 3.56 3.89
5 3.25 3.53 3.72
6 3.07 3.31 3.49 3.62
7 2.94 3.17 3.33 3.45 3.56
8 2.86 3.07 3.21 3.33 3.43 3.51
9 2.79 2.99 3.13 3.24 3.33 3.41 3.48
10 2.74 2.93 3.07 3.17 3.26 3.33 3.40 3.45
11 2.70 2.88 3.01 3.12 3.20 3.27 3.33 3.39
12 2.67 2.85 2.97 3.07 3.15 3.22 3.28 3.33
13 2.64 2.81 2.94 3.03 3.11 3.18 3.24 3.29
14 2.62 2.79 2.91 3.00 3.08 3.14 3.20 3.25
15 2.60 2.76 2.88 2.97 3.05 3.11 3.17 3.22
16 2.58 2.74 2.86 2.95 3.02 3.09 3.14 3.19
17 2.57 2.73 2.84 2.93 3.00 3.06 3.12 3.16
18 2.55 2.71 2.82 2.91 2.98 3.04 3.10 3.14
19 2.54 2.70 2.81 2.89 2.96 3.02 3.08 3.12
20 2.53 2.68 2.79 2.88 2.95 3.01 3.06 3.11
24 2.50 2.65 2.75 2.83 2.90 2.96 3.01 3.05
30 2.47 2.61 2.71 2.79 2.85 2.91 2.96 3.00
40 2.43 2.57 2.67 2.75 2.81 2.86 2.91 2.95
60 2.40 2.54 2.63 2.70 2.76 2.81 2.86 2.90
120 2.37 2.50 2.59 2.66 2.72 2.77 2.81 2.84
∞ 2.34 2.47 2.56 2.62 2.68 2.72 2.76 2.80
Source: Nelson. “Exact Critical Values for Use with the Analysis of Means.” Journal of Quality Technology 15, no. 1
(January 1983): 40–44. Used with permission.
Table A.9 Fisher’s Z Transformation: values of Z = ½ ln[(1 + r)/(1 − r)].

r 0.000 0.005 0.010 0.015 0.020 0.025 0.030 0.035 0.040 0.045 0.050 0.055 0.060 0.065 0.070 0.075 0.080 0.085 0.090 0.095
0 0.000 0.005 0.010 0.015 0.020 0.025 0.030 0.035 0.040 0.045 0.050 0.055 0.060 0.065 0.070 0.075 0.080 0.085 0.090 0.095
0.1 0.100 0.105 0.110 0.116 0.121 0.126 0.131 0.136 0.141 0.146 0.151 0.156 0.161 0.167 0.172 0.177 0.182 0.187 0.192 0.198
0.2 0.203 0.208 0.213 0.218 0.224 0.229 0.234 0.239 0.245 0.250 0.255 0.261 0.266 0.271 0.277 0.282 0.288 0.293 0.299 0.304
0.3 0.310 0.315 0.321 0.326 0.332 0.337 0.343 0.348 0.354 0.360 0.365 0.371 0.377 0.383 0.388 0.394 0.400 0.406 0.412 0.418
0.4 0.424 0.430 0.436 0.442 0.448 0.454 0.460 0.466 0.472 0.478 0.485 0.491 0.497 0.504 0.510 0.517 0.523 0.530 0.536 0.543
0.5 0.549 0.556 0.563 0.570 0.576 0.583 0.590 0.597 0.604 0.611 0.618 0.626 0.633 0.640 0.648 0.655 0.662 0.670 0.678 0.685
0.6 0.693 0.701 0.709 0.717 0.725 0.733 0.741 0.750 0.758 0.767 0.775 0.784 0.793 0.802 0.811 0.820 0.829 0.838 0.848 0.858
0.7 0.867 0.877 0.887 0.897 0.908 0.918 0.929 0.940 0.950 0.962 0.973 0.984 0.996 1.008 1.020 1.033 1.045 1.058 1.071 1.085
0.8 1.099 1.113 1.127 1.142 1.157 1.172 1.188 1.204 1.221 1.238 1.256 1.274 1.293 1.313 1.333 1.354 1.376 1.398 1.422 1.447
0.9 1.472 1.499 1.528 1.557 1.589 1.623 1.658 1.697 1.738 1.783 1.832 1.886 1.946 2.014 2.092 2.185 2.298 2.443 2.647 2.994

r 0.001 0.002 0.003 0.004 0.005 0.006 0.007 0.008 0.009


0.80 1.101 1.104 1.107 1.110 1.113 1.116 1.118 1.121 1.124
0.81 1.130 1.133 1.136 1.139 1.142 1.145 1.148 1.151 1.154
0.82 1.160 1.163 1.166 1.169 1.172 1.175 1.179 1.182 1.185
0.83 1.191 1.195 1.198 1.201 1.204 1.208 1.211 1.214 1.218
0.84 1.225 1.228 1.231 1.235 1.238 1.242 1.245 1.249 1.253
0.85 1.260 1.263 1.267 1.271 1.274 1.278 1.282 1.286 1.290
0.86 1.297 1.301 1.305 1.309 1.313 1.317 1.321 1.325 1.329
0.87 1.337 1.341 1.346 1.350 1.354 1.358 1.363 1.367 1.371
0.88 1.380 1.385 1.389 1.394 1.398 1.403 1.408 1.412 1.417
0.89 1.427 1.432 1.437 1.442 1.447 1.452 1.457 1.462 1.467
0.90 1.478 1.483 1.488 1.494 1.499 1.505 1.510 1.516 1.522
0.91 1.533 1.539 1.545 1.551 1.557 1.564 1.570 1.576 1.583

0.92 1.596 1.602 1.609 1.616 1.623 1.630 1.637 1.644 1.651
0.93 1.666 1.673 1.681 1.689 1.697 1.705 1.713 1.721 1.730
0.94 1.747 1.756 1.764 1.774 1.783 1.792 1.802 1.812 1.822
0.95 1.842 1.853 1.863 1.874 1.886 1.897 1.909 1.921 1.933
0.96 1.959 1.972 1.986 2.000 2.014 2.029 2.044 2.060 2.076
0.97 2.110 2.127 2.146 2.165 2.185 2.205 2.227 2.249 2.273
0.98 2.323 2.351 2.380 2.410 2.443 2.477 2.515 2.555 2.599

0.99 2.700 2.759 2.826 2.903 2.994 3.106 3.250 3.453 3.800
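
For values of r that are not tabulated, Fisher's Z can be computed directly. The transformation Z = ½ ln[(1 + r)/(1 − r)] is the inverse hyperbolic tangent of r, so a minimal Python sketch needs only the standard `math` module; the function name `fisher_z` is a hypothetical helper chosen for illustration.

```python
import math


def fisher_z(r):
    """Fisher's Z transformation of a correlation coefficient, -1 < r < 1."""
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    return 0.5 * math.log((1.0 + r) / (1.0 - r))


# The explicit formula and math.atanh agree to rounding error
for r in (0.100, 0.500, 0.800, 0.995):
    print(f"r = {r:.3f}  Z = {fisher_z(r):.3f}  atanh(r) = {math.atanh(r):.3f}")
```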
INDEX

Index Terms Links

*Please note that italicized page numbers refer to figures or tables

aberration 435
acceptance interval 44
See also hypothesis tests
aliasing 114
See also confounding
alternative hypothesis 43
analysis of means (ANOM) 176
analysis of variance. See ANOVA
Anderson-Darling test 78 82
ANOM. See analysis of means
ANOVA 101
comparison of treatments following 161
Bonferroni’s method 161
Duncan’s multiple range test 164
Dunnett’s test 167
Sidak’s method 163
Tukey’s multiple comparisons test 166
nested design 248
power and sample-size calculations 185 250
regression, equivalence with 327
ANOVA, one-way
assumptions 152
degrees of freedom 154
F statistic 148
expected value 235
fixed variables 235

This page has been reformatted by Knovel to provide easier navigation.


graphical approach 145
hypotheses 150
with MINITAB 167
plots of the residuals 150
rationale 147
sample-size calculation 185
single random variable 238
sum of squares approach 155
table 154
total variance 154
transformations 177
unbalanced experiments 160
validation requirements 150
ANOVA, two-way
fixed variables 237
interactions 203
interpretation 210
with MINITAB 215
mixed model 241
random variables 242
rationale 192
sums of squares approach 202
appraiser variation (AV) 243

balanced experiment 211 214


Baltimore, David 119
bar charts 2
Bartlett’s test 151 161 168
bell-shaped curve 26 27
between-sample variation 155
binomial distribution 183

blocking 110 114 175


Box-Behnken designs 448
central composite designs 449 450 452 453
2^k designs with centers 442
using fractional factorial designs 406
blocking on replicates 114 175 211 363
blocking plan 128
blocking variables 115
Bonferroni’s method 161
box-and-whisker plot 5
See also boxplots
Box-Behnken design 444
blocking plans 448
sample-size calculations 471
boxplot slippage tests 71 145
boxplots 5
See also box-and-whisker plot
interpretation 21
Box-Wilson designs. See central composite
designs

caret (^) notation 22


cause-and-effect analysis 121
cause-and-effect diagram 94
creating with MINITAB 95
cell 108
center cells
adding to 2^k designs 367
in sample-size and power calculations
for 2^k designs 394
use in test for curvature in 2^k designs 441

central composite designs 443 448


blocking plans 449
sample-size calculations 473
central F distribution 252
central limit theorem 38 52 147
central tendency. See location
chi-square distribution 61
coded variables 107 318
coefficient of determination 293
See also correlation coefficient
coefftnormplot.mac 379
colinearity 321
combinations 32
completely randomized design 110 172
confidence intervals 37
correlation coefficient 295
inverse prediction 289
with MINITAB 79
one population mean 41 55
one variance 61
regression coefficients 287
regression line 289
sample-size calculation
estimating the mean 83 89
estimating regression parameters 337 396
variance components 239
confirmation experiment 130
confounding 114 117
fractional factorial experiments 402
quadratic effects 442
risks, managing 421
correlation coefficient 294
adjusted 298
confidence interval 295

correlation matrix
construction 371 377
effect of missing or extra observations 389
fractional factorial designs 402 403 405 409 410 415
count data, transforming 182
counting 30
covariate 107

.dat files 17
data
analyzing 129
faking 119
integrity and ethics 119
plotting 1
types 1
decision limits. See acceptance interval
degrees of freedom
chi-square distribution 63
F distribution 65
linear regression 276
mean 24
one-way ANOVA 148 154 157
standard deviation 25
Student’s t distribution 53
two-way ANOVA 199
deleted Studentized residuals 284
deleted t residuals. See deleted
Studentized residuals
descriptive statistics 19
design, definition 107
design generators 406 407

design of experiments xiii 93 98 107


definition 93
design, definition 107
process 121
team members and functions 121 122
design matrix 107 108
design resolution 407 434
interpreting Resolution III designs 429
interpreting Resolution IV designs 422
interpreting Resolution V designs 421
designed experiments
documentation 136
11-step procedure 120
purpose 104
report format 131
designs, types 108
dispersion. See variation
distribution of errors 101
distribution of sample means 52
distribution of sample variances 61
distribution shape 19
documentation 123
DOE method. See design of experiments
dot notation 159
dotplot 4 5
Duncan’s multiple range test 164
Dunnett’s test 167

effect heredity 118


eighth-fractional factorial design 406
empirical models 103
error statement 101

errors-in-variables regression analysis 316


expected mean squares 235 249 253
experiment design 107
types 108
experimental data, analyzing 129
experimentation
active 94
general procedure 120
passive 94
preliminary 125
experiments. See also designed experiments
defined 94
designing 126
fractionally replicated 114
mistakes 139
reporting 131
results, interpreting 130
running 128
size 100
types 99
extra values, in two-level factorial designs 389

factorial designs 212 213


incomplete 231
with MINITAB 221
power calculation 252
factorials 31
factors 94
family error rate 161
F distribution 65
first-principles models 103
Fisher’s Z transformation 296

fixed variables 235 241


power calculation 254 257
power calculations with MINITAB 263
fold-over design 429
fraction data, transforming 183
fractional factorial experiments
analyzing with MINITAB 417
blocking large experiments 406
confounding 402 404 407 411 423
creating in MINITAB 415
design considerations 434
design interpretation 421
fold-over design 429
generators 406 407
half-fractional factorial 406
Plackett-Burman designs 432
resolution 407
runs, reducing number 400
sample-size and power calculations 432
fractionally replicated experiments 114 116
frequency 2
full-factorial experiment 214

gage error (GRR) 245


gage error studies, analysis 242
with MINITAB 245
gedanken experiment 38
general linear model 101 327
blocked 2^3 experiment 365
2^k factorial designs 375
generators 406 407

goodness of fit tests 309


with MINITAB 313
Graeco-Latin Square 233

half-fractional factorial design 406


heteroscedasticity 101
ANOVA residuals 151
regression model residuals 317
histogram 1 3
creating in MINITAB 13
interpreting 19
homoscedasticity 101
ANOVA residuals 150
regression model residuals 283
two-sample t test 57
unbalanced one-way classification experiments 160
Hsu’s method 59
hyper-Graeco-Latin Square 233
hypothesis tests 37
conclusions, stating 43
decision limits 44
errors 49
for means and variances 74
with MINITAB 79
for normality 75
for one sample mean 42 54
for one sample variance 63
one-sided 51
for one-way classification by ANOVA 144
for paired samples, location 59
procedure 73

quick tests for location 68
rationale 42
for regression coefficients 286
for two independent means 56
for two variances 65

Imanishi-Kari, Theresa 119


incomplete factorial designs 231
inferences 37
inflection points 30
interaction plots 99 204 205 371 383 389
interactions 98 203 213 355 371 376
interquartile range 6 21
inverse prediction 289

knobs xiii
Kruskal-Wallis test 185

lack of fit 309


with MINITAB 313
lag-one plot 283
Latin Square 232
least significant difference 252

Levene’s test 151 161


modified 168
linear goodness of fit. See lack of fit
linear least squares regression line 275
linear regression
assumptions 282
coded variables 318
coefficient of determination 293
confidence limits, line 289
confidence limits, regression coefficients 287
correlation 293
design considerations 345
errors in variables 316
general linear models 327
goodness of fit tests 309
hypothesis tests for regression coefficients 285
lack of fit tests 312
with MINITAB 299
multiple regression 320
polynomial models 306
prediction limits 290
rationale 273
regression coefficients 277
regression constant, determining 341
regression slope, determining 337
sample-size calculations 337
transformations to linear form 301
weighted regression 317
location 19
measures 20
logarithmic transform 179
lower quartile 6
lurking variables 118 172

.mac files 17
macros (MINITAB). See also specific macro by name
exec 15
local 16
makeoneway.mac 176
Mann-Whitney test 56
mean 20 21
mean deviation 23
mean squares 159
measurement scale 3
median 20
method of least squares 275
.mgf files 17
minimum aberration designs 435
MINITAB
column identifier 12
column names 12
command prompt 11
commands, modifying 11
customizing 11
data, entering 12
data, graphing 13
data, printing 13
descriptive statistics, calculating 34
file extensions 17
graph gallery 13
graphs, customizing 13
graphs, printing 14
Graphs folder 10
History window 10
macros, creating 15

project file 14
Project Manager 10
Related Documents folder 10
Report Pad 10
saving and retrieving 14
Session window 9
shortcut to MINITAB 9
starting 9
toolbars 10
windows 9
Worksheet 12
MINITAB local .mac macros 16 375
coefftnormplot.mac 379
makeoneway.mac 176
mlrk.mac macros 376 378 418 459
randomize.mac 225
unrandomize.mac 225
missing at random 390
missing values in two-level factorial designs 389
mixed model 241
mlrk.mac macros 376 378 418 459
models
purpose 104
types 100
Mood’s median test 185
.mpj files 17
.mtb files 17
.mtw files 17
multiple comparison tests, after ANOVA 161
Bonferroni’s method 161
Duncan’s multiple range test 164
Dunnett’s test 167
with MINITAB 168

Sidak’s method 163
Tukey’s multiple comparison test 166
multiple regression 320
with MINITAB 322
multiplication of choices rule 31
multi-vari charts 7 98
multi-way classification ANOVA, with MINITAB 215
multi-way classification designs 213 227

nested designs 248


analysis with MINITAB 249
ANOVA tables 249
power calculations 261
nested variables 106 248
noncentral F distribution 253
noncentrality parameter 87 237 253
nonlinear problems 301
normal curve 26
standard 29
normal curve amplitude 35
normal curve graphing 35
normal distribution 26
normality test
normal probability plot 75
quantitative tests 78
normal plots. See normal probability plots
normal probabilities, calculating in MINITAB 35

normal probability distribution 26


cumulative 28
probability density function 27
normal probability plots 75
with MINITAB 82
no-way classification 192
nuisance variable 114
null hypothesis 43
acceptance interval 44

Occam’s razor 118


one variable at a time xiii 93 98
one-tailed hypothesis tests 51
one-way classification experiments 143 193
design considerations 188
power calculations 252
for random variables 256
sample-size calculations 185
one-way fixed-effects ANOVA 235
one-way random-effects ANOVA 238
operating characteristic curves 51
optimization problems 459
orthogonality 114 449
outliers 152
identifying 284
OVAT. See one variable at a time

paired-sample t-test 59
parameters 19 37
Pareto charts 2

permutations 31
Plackett-Burman designs 99 432
sample-size calculation 432
point estimate 37
Poisson distribution 182
polynomial regression 306
with MINITAB 307
pooled sample variance 147
pooling 147
population 19
population mean 20 21
confidence interval 41
population standard deviation 23
post-ANOVA analysis methods 161
post-ANOVA comparisons 161
post-experiment power of the ANOVA 253
power calculations 250
See also sample-size calculations
factorial designs with fixed variables 254 263
factorial designs with random variables 256 266
fractional factorial designs 432
nested designs 261
two steps 252
two-level factorial designs 392
two-level factorial designs with centers 467
prediction limits 290
probability density function, normal 27
probability distributions 26
binomial 183
chi-square 61
F 65
normal 26
Poisson 182
Student’s t 52
Weibull 305
problem statement 124


procedure, 11-step DOE 120
process, DOE
documenting 123
11-step procedure 120
inputs 94
model 94
outputs 94
variables 105
project binder, designed experiment 136
propagation of error 390
pure error in linear lack of fit test 312
p value 46
importance 48

quadratic model. See also polynomial model


designs for 437
linear goodness of fit test 309
linear lack of fit test 312
qualitative data 1
displaying 2
qualitative predictor 101
qualitative variable 96
redefining as quantitative 105
qualitative variable levels 105
quantitative data 1
presenting 3
quantitative predictor 100 103
quantitative variable 96
uncontrolled 107
quantitative variable levels 105

quarter-fractional factorial design 406


quick tests for two-sample location 68

random samples 20
random sampling 19
random variables in ANOVA 235 238
power calculation with MINITAB 256 266
randomization, run order 108 109 128 175 225
randomization by blocking on replicates 114
randomization plan 128
validation 113
randomized block design 110 175 210 212
randomized complete block design 211
randomize.mac 225
range 21
rank transform 184
regression analysis. See linear regression
regression coefficients 276 277
confidence limits 287
hypothesis tests 285
regression line, confidence limits 289
with MINITAB 289
regression line, prediction limits 290
with MINITAB 291
regression model 101
repetitions 113
replicates 113
fractional 116
randomization 114
residuals 101
resolution, design 407

responses 94
types 97
response-surface designs 99 100 437
comparisons 453
number of levels of each variable 455
number of observations and error degrees of freedom 454
variable levels, safety 456
design considerations 474
with MINITAB 458
sample-size calculations 467
variable levels, sensitivity 456
response transformations 177 301
rotatability 448
run order, randomization 109 128 175 224 363
with MINITAB 112 224
runs 108
adding center cells 367
extra 389
missing 118 389
order 109
randomization 109

sample 19
sample mean 20 21
distribution 38 52
transformation to standard units 45 53
sample median 20
sample selection 19
sample standard deviation 23
calculating form 25

sample variances, distribution 61


sample-size calculations xv 82 114 127
See also power calculations
Box-Behnken designs 471
central composite designs 473
for confidence intervals for means 83
fractional factorial designs 432
for hypothesis tests for means 86
linear regression 337
for means with MINITAB 89
for 3^k designs 470
for 2^k designs 467
for two-level factorial designs 392
Satterthwaite method 59
saturated designs 418
scatter plot 6
screening experiments 99
shape. See distribution shape
Sidak’s method 163
software, statistical 9
spreadsheet 12
square root transform 179
for count data 182
for fraction data 184
standard deviation 22 30
calculating form 26
standard error of the model 101
standard order 108 176 224 362
star points 448
stem-and-leaf plot 4
stepwise regression 459
Student’s t distribution 52
sums of squares, one-way ANOVA
calculating forms 159

defining forms 155
sums of squares, two-way ANOVA 202
interaction 207

t test statistic 57
t tests
Bonferroni’s method 161
one-sample 54
for outliers 284
paired-sample 59
regression coefficients 287
risk of multiple tests 144
two-sample 57
3^k factorial experiment design 443
total variation 155
transformations 178
choosing 179
to linear form 301
transformed sample mean 45 53
transforming count data 182
transforming fraction data 183
Tukey’s honest significant difference test 166
Tukey’s multiple range test 166
Tukey’s quick test 69
2^k factorial designs
analyzing 370
analyzing with MINITAB 375
center cells, adding 367
confounding 411
creating in MINITAB 372
design considerations 397

observations, extra or missing 389
propagation of error 390
sample size, determining 392
unbalanced experiment 389
2^k fractional factorial design 399
analyzing with MINITAB 417
confounding 411
creating in MINITAB 415
design considerations 434
generators 406 407
interpretation 421 429 430
observations, extra or missing 389
propagation of error 390
resolution 407
sample size, determining 432
two-sample Smirnov test 71
two-sample t test 56
unequal variances 58
two-stage nested design 248
2^3 plus centers design 441
two-way classification 196
power calculations 257
two-way factorial design 212
Type 1 errors 49 50
Type 2 errors 49

uniform precision 449


unrandomize.mac 224
upper quartile 6

variable levels, selection 105


variables 94
nested 106 248
types 96
uncontrolled 107
variables matrix 107
variance 23
variance components 239
confidence intervals 240
gage error studies 243
variation 19
between sample, in one-way ANOVA 155
measures 21
total, in one-way ANOVA 155
within sample, in one-way ANOVA 155

Weibull distribution 305


weighted regression 283 317
Welch method 59
within-sample variation 155
Worksheet 12
Worksheet window 9

z value 45
