Chapter 15 Chi Square Applications
Chapter 15 Chi Square Applications
Chapter 15 Chi Square Applications
Chi-Square Applications
Statistics and Probability
Learning Objectives
A goodness-of-fit test will show whether an observed set of frequencies could have
come from a hypothesized population distribution
A. The degrees of freedom are k-1, where k is the number of categories.
B. The formula for computing the value of chi-square is
2
(𝑓𝑜 −𝑓 𝑒 )
𝑥2 =
𝑓𝑒
LO2. CRITICAL VALUES OF CHI-SQUARE
LO2. GOODNESS-OF-FIT TEST:
Equal Expected Frequencies
• Illustration:
The human resources director at Georgetown Paper, Inc., is concerned about
absenteeism among hourly workers. She decides to sample the company records to
determine whether absenteeism is distributed evenly throughout the six-day
workweek. The hypotheses are:
𝐻0 : 𝐴𝑏𝑠𝑒𝑛𝑡𝑒𝑒𝑖𝑠𝑚 𝑖𝑠 𝑒𝑣𝑒𝑛𝑙𝑦 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑒𝑑 𝑡ℎ𝑟𝑜𝑢𝑔ℎ𝑜𝑢𝑡 𝑡ℎ𝑒 𝑤𝑜𝑟𝑘𝑤𝑒𝑒𝑘.
𝐻1 : 𝐴𝑏𝑠𝑒𝑛𝑡𝑒𝑒𝑖𝑠𝑚 𝑖𝑠 𝑛𝑜𝑡 𝑒𝑣𝑒𝑛𝑙𝑦 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑒𝑑 𝑡ℎ𝑟𝑜𝑢𝑔ℎ𝑜𝑢𝑡 𝑡ℎ𝑒 𝑤𝑜𝑟𝑘𝑤𝑒𝑒𝑘.
The sample results are: (use 1 percent significant level)
Number of Absent Number of Absent
Monday 12 Thursday 10
Tuesday 9 Friday 9
Wednesday 11 Saturday 9
LO2. GOODNESS-OF-FIT TEST:
EQUAL EXPECTED FREQUENCIES
• Illustration: The human resources director at Georgetown Step 1. State the null and alternate hypothesis
Paper, Inc., is concerned about absenteeism among hourly H0: Absenteeism is evenly distributed
workers. She decides to sample the company records to throughout the workweek.
determine whether absenteeism is distributed evenly H1: Absenteeism is not evenly distributed
throughout the six day workweek. The hypotheses are: throughout the workweek.
𝐻0: 𝐴𝑏𝑠𝑒𝑛𝑡𝑒𝑒𝑖𝑠𝑚 𝑖𝑠 𝑒𝑣𝑒𝑛𝑙𝑦 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑒𝑑 𝑡ℎ𝑟𝑜𝑢𝑔ℎ𝑜𝑢𝑡 𝑡ℎ𝑒 𝑤𝑜𝑟𝑘𝑤𝑒𝑒𝑘.
𝐻1: 𝐴𝑏𝑠𝑒𝑛𝑡𝑒𝑒𝑖𝑠𝑚 𝑖𝑠 𝑛𝑜𝑡 𝑒𝑣𝑒𝑛𝑙𝑦 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑒𝑑 𝑡ℎ𝑟𝑜𝑢𝑔ℎ𝑜𝑢𝑡 𝑡ℎ𝑒 𝑤𝑜𝑟𝑘𝑤𝑒𝑒𝑘. Step 2. Select the level of significance
𝑇ℎ𝑒 𝑠𝑎𝑚𝑝𝑙𝑒 𝑟𝑒𝑠𝑢𝑙𝑡𝑠 𝑎𝑟𝑒: (𝑢𝑠𝑒 1 𝑝𝑒𝑟𝑐𝑒𝑛𝑡 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑡 𝑙𝑒𝑣𝑒𝑙) We selected the 0.01 significance level. The
probability is 0.01 that a true null hypothesis
Number of Number of will be rejected.
Absent Absent
Monday 12 Thursday 10 Step 3. Select the test statistics
The test statistic follows the chi-square
Tuesday 9 Friday 9 distribution, designated as
Wednesday 11 Saturday 9 2
(𝑓𝑂 − 𝑓𝑒 )2
𝑥 =
𝑓𝑒
LO2. GOODNESS-OF-FIT TEST:
EQUAL EXPECTED FREQUENCIES
• Illustration: The human resources director at Georgetown Step 4. Formulate the decision rule
Paper, Inc., is concerned about absenteeism among hourly 𝑑𝑓 = 𝑘 − 1 = 6 − 1 = 5
workers. She decides to sample the company records to
determine whether absenteeism is distributed evenly
throughout the six day workweek. The hypotheses are:
𝐻0: 𝐴𝑏𝑠𝑒𝑛𝑡𝑒𝑒𝑖𝑠𝑚 𝑖𝑠 𝑒𝑣𝑒𝑛𝑙𝑦 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑒𝑑 𝑡ℎ𝑟𝑜𝑢𝑔ℎ𝑜𝑢𝑡 𝑡ℎ𝑒 𝑤𝑜𝑟𝑘𝑤𝑒𝑒𝑘.
𝐻1: 𝐴𝑏𝑠𝑒𝑛𝑡𝑒𝑒𝑖𝑠𝑚 𝑖𝑠 𝑛𝑜𝑡 𝑒𝑣𝑒𝑛𝑙𝑦 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑒𝑑 𝑡ℎ𝑟𝑜𝑢𝑔ℎ𝑜𝑢𝑡 𝑡ℎ𝑒 𝑤𝑜𝑟𝑘𝑤𝑒𝑒𝑘.
𝑇ℎ𝑒 𝑠𝑎𝑚𝑝𝑙𝑒 𝑟𝑒𝑠𝑢𝑙𝑡𝑠 𝑎𝑟𝑒: (𝑢𝑠𝑒 1 𝑝𝑒𝑟𝑐𝑒𝑛𝑡 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑡 𝑙𝑒𝑣𝑒𝑙) Critical value = 15.086
Number of Number of
Absent Absent
Monday 12 Thursday 10
Tuesday 9 Friday 9
Wednesday 11 Saturday 9
Decision Rule: Reject Null Hypothesis if the computed
value of chi-square is greater than 15.086. Otherwise,
do not reject.
LO2. GOODNESS-OF-FIT TEST:
EQUAL EXPECTED FREQUENCIES
Step 5. Compute the value of chi-square and make a decision.
South 155 2
𝑓𝑂 − 𝑓𝑒 2
𝑥 =
𝑓𝑒
West 73
Total 400
LO2. GOODNESS-OF-FIT TEST:
UNEQUAL EXPECTED FREQUENCIES
The owner of a mail-order catalog would like to compare her Step 4. Formulate the decision rule
sales with the geographic distribution of the population. 𝑑𝑓 = 𝑘 − 1 = 4 – 1 = 3
According to the United States Bureau of the Census, 21 percent
of the population lives in the Northeast, 24 percent in the
Midwest, 35 percent in the South, and 20 percent in the West.
Listed below is the breakdown of a sample of 400 orders
randomly selected from those shipped last month. At the 0.01
significance level, does the distribution of the orders reflect the Critical value = 11.345
population?
Region Frequency
Northeast 68
Midwest 104
South 155
West 73 Decision Rule: Reject Null Hypothesis if the
computed value of chi-square is greater than
Total 400 11.345. Otherwise, do not reject.
LO2. GOODNESS-OF-FIT TEST:
UNEQUAL EXPECTED FREQUENCIES
Step 5. Compute the value of chi-square and make a decision.
ILLUSTRATION A social scientist sampled Step 1. State the null and alternate hypothesis
140 people and classified them according H0: There is no relationship between income and
whether the person played the lottery.
to income level and whether or not they H1: There is a relationship between income and
played a state lottery in the last month. whether the person played the lottery.
The sample information is reported below.
Is it reasonable to conclude that playing Step 2. Select the level of significance
the lottery is related to income level? Use We selected the 0.05 significance level. The
probability is 0.05 that a true null hypothesis will be
the 0.05 significance level. rejected.
Low Income Middle Income High Income Total
Played 46 28 21 95
Did not play 14 12 19 45
Total 60 40 40 140 Decision Rule: Reject Null Hypothesis if the computed
value of chi-square is greater than 5.991. Otherwise,
do not reject.
LO3. CONTINGENCY TABLE ANALYSIS
EXERCISE A study regarding the relationship between age and the amount of pressure
sales personnel feel in relation to their jobs revealed the following sample
information. At the .01 significance level, is there a relationship between job
pressure and age?
Degrees of Job Pressure
Age Low Medium High Total
Less than 25 20 18 22 60
25 up to 40 50 46 44 140
40 up to 60 58 63 59 180
60 and older 34 43 43 120
Total 162 170 168 500