Chapter 10
Chapter 10
Chapter 10
In the previous chapter, we introduced some methods to estimate the unknown value of
some population characteristic by sample data. Sample data may also be used to decide
whether some claim or hypothesis about a population characteristic is plausible.
A test of hypothesis (or test procedure) is a method for using sample data to decide
between two competing hypotheses about a population characteristic. One hypothesis
might be = 100 and the other 100.
Definition 10.1 The null hypothesis, denoted by H0, is a claim about a population
characteristic that is initially assumed to be true.
In carrying out a test of H0 versus Ha, the hypothesis H0 will be rejected in favor of Ha only
if sample evidence strongly suggests that H0 is false. If the sample does not contain such
evidence, H0 will not be rejected. The two possible conclusions are then (1) reject H0 or
(2) fail to reject H0.
Once hypotheses have been formulated, we need a method for using sample data to
determine whether H0 should be rejected. The method that we use for this purpose is
called a test procedure. Just as a jury may reach the wrong verdict in a trial, there is some
chance that the use of a test procedure with sample data may lead us to the wrong
conclusion about a population characteristic. There are two different types of errors that
might be made when making a decision in a hypothesis-testing problem.
The only way to guarantee that neither type of error will occur is to make such decisions
on the basis of a census of the entire population.
Definition 10.3 The probability of a type I error is denoted by and is called the level of
significance of the test. Thus, a test with = .01 is said to have a level of significance
of .01 or to be a level .01 test.
The probability of a type II error is denoted by .
The relationship between and : As decreases, will increase, and vice versa.
The ideal test procedure has both = 0 and = 0. However, if our decision must be
based on incomplete information (a sample rather than a census) it is impossible to
achieve this ideal. The standard test procedures do allow the user to control , but they
provide no direct control over . To achieve a smaller probability of making a type I error
the risk of a type II error will increase. In general, there is a compromise between small
and small , leading to the following widely accepted principle for specifying a test
procedure.
After assessing the consequences of type I and type II errors, identify the largest that is
tolerable for the problem. Then employ a test procedure that uses this maximum
acceptable value (rather than any smaller value) as the level of significance (because
using a smaller increases ). In other words, don’t make smaller than it needs to be.
Thus, if you decide that = .05 is tolerable, you should not use a test with = .01,
because the smaller inevitably results in a larger . The choice of in any given
problem depends on the seriousness of a type I error relative to a type II error. If a type II
error has serious consequences, it may be a good idea to select a somewhat larger value
for .
Now we are ready to develop procedures for using sample information to decide between
the null and alternative hypotheses. The fundamental idea behind hypothesis-testing
procedures is: We reject the null hypothesis if the observed sample is very unlikely to
have occurred when H0 is true.
Let
= the proportion of individuals in a population that possess a certain property
p = (number of individuals in the sample that possess the property) / n
By the results in Chapter 8, the sampling distribution of p has the following properties.
1. p =
2.
3. when n is large (n ≥10 and n(1-) ≥10), the sampling distribution of p is
approximately normal.
Example 10.1 Population = {All blood recipients}. = The proportion of all blood
recipients stricken with viral hepatitis = .07. A new treatment is given to n = 200 blood
recipients. Only 6 of the 200 patients contract hepatitis. The question of interest to
medical researchers is: Is the new treatment effective? (Does the new treatment reduce
the incidence rate of viral hepatitis?)
p= = 0.07, =0.018
Since n = 200 (0.07) = 14 > 10 and n(1-) = 200(1-0.07) = 186 > 10, the sampling
distribution of p is approximately normal. Then
P(p .03 when H0 is true) P(z (0.03 – 0.07) / ) = P (z -2.22) = 0.0132
Since the probability is very small (as means that it is unlikely that a sample
proportion .03 or smaller would be observed if H0 is true), we reject H0 in favor of Ha.
Definition 10.4: A test statistic is the function of sample data on which a conclusion to
reject or fail to reject H0 is based. The P-value (sometimes called the observed
significance level) is a measure of inconsistency between the hypothesized value for a
population characteristic and the observed sample.
A decision as to whether H0 should be rejected based on the P-value and the chosen
:
H0 should be rejected if P-value .
H0 should not be rejected if P-value > .
In example 10.1, if = 0.05, we should reject H0. However, if = 0.01, we should not
reject H0.
Test statistic:
z = (p – hypothesized value) /
Assumptions:
(1) p is the sample proportion from a random sample
(2) The sample size is large (both n(hypothesized value) 10 and n(1-hypothesized
value) 10).
When carrying out a hypothesis test, the following steps are often used.
Steps in a hypothesis-testing analysis
Step 1-4 constitute a statement of the problem, step 5-8 give the analysis that will lead
to a decision, and step 9 provides the conclusion.
We will now turn our attention to developing a method for testing hypotheses about a
population mean.
The test procedures are based on some results about the sampling distribution of .
z= .
Test statistic: z = .
Assumptions:
1. is the sample mean of a random sample.
2. is known.
3. The sample size is large (generally n 30) or the population distribution is at least
approximately normal.
Example 10.3 A soda manufacturer is interested in determining whether its bottling
machine tends to overfill. Each bottle is supposed to contain 12 oz of fluid. A random
sample of size 36 is taken from bottles coming off the production line, and the contents of
each bottle are carefully measured. It is found that the mean amount of soda for the
sample of bottles is 12.1 oz. Suppose it is known that = .4 oz. Is the machine
overfilling? Test the relevant hypotheses at a significance level of 0.05.
t= .
Test statistic: t = .
Alternative hypothesis P-value
Ha: > hypothesized value Area under t curve with df = n – 1 to right of calculated
(Upper-tailed test) t
Ha: < hypothesized value Area under t curve with df = n – 1 to left of calculated t
(Lower-tailed test)
Ha: hypothesized value (i) 2(area to right of calculated t) if t is positive.
(Two-tailed test) (ii) 2(area to left of calculated t) if t is negative
Assumptions:
1. and s are the sample mean and sample standard deviation from a random sample.
2. The sample size is large (generally n 30) or the population distribution is at least
approximately normal.
Example 10.4 Are young women delaying marriage and marrying at a later age? This
question was addressed in a report issued by the Census Bureau. The report stated that in
1970 (based on census results) the mean age of brides marrying for the first time was
20.8 years. In 1990 (based on a random sample, since census results were not yet
available), the mean was 23.9. Suppose that the 1990 sample mean had been based on a
random sample of size 100 and that the sample standard deviation was 6.4. Is there
sufficient evidence to support the claim that women are now marrying later in life? Test
the relevant hypothesis using = .01.