Chapter 3
Chapter 3
Chapter 3
Section 3.1
3.1 For a data set with an odd number of observations, first we rank the data set in increasing (or
decreasing) order and then find the value of the middle term. This value is the median. For a data set
with an even number of observations, first we rank the data set in increasing (or decreasing) order and
then find the average of the two middle terms. The average gives the median.
3.2 A few values that are either very small or very large relative to the majority of the values in a data set
are called outliers or extreme values. Suppose the exam scores for seven students are 73, 82, 95, 79,
22, 86, and 91. Then, 22 is an outlier because this value is very small compared to the other values.
The median is a better measure of central tendency as compared to the mean for a data set that contains
an outlier because the mean is affected much more by outliers than is the median.
3.3 Suppose the exam scores for seven students are 73, 82, 95, 79, 22, 86, and 91 points. Then,
Mean = (73 + 82 + 95 + 79 + 22 + 86 + 91)/7 = 75.43 points. If we drop the outlier (22),
Mean = (73 + 82 + 95 + 79 + 86 + 91)/6 = 84.33 points. This shows how an outlier can affect the
value of the mean.
3.4 All three measures of central tendency (the mean, the median, and the mode) can be calculated for
quantitative data. Note that the mode may or may not exist for a data set. However, only the mode (if
it exists) can be found for a qualitative data set. Examples given in Sections 3.1.1, 3.1.2, and 3.1.3 of
the text show these cases.
3.5 The mode can assume more than one value for a data set. Examples 3–8 and 3–9 of the text present
such cases.
3.6 A quantitative data set will definitely have a mean and a median but it may or may not have a mode.
Example 3–7 of the text presents a data set that has no mode.
3.7 For a symmetric histogram (with one peak), the values of the mean, median, and mode are all equal.
Figure 3.2 of the text shows this case. For a histogram that is skewed to the right, the value of the mode
is the smallest and the value of the mean is the largest. The median lies between the mode and the
39
40 Chapter Three
mean. Such a case is presented in Figure 3.3 of the text. For a histogram that is skewed to the left, the
value of the mean is the smallest, the value of the mode is the largest, and the value of the median lies
between the mean and the mode. Figure 3.4 of the text exhibits this case.
3.8 The median is the best measure to summarize this data set since it is not influenced by outliers.
3.10 ∑x = 14 + 18 – 10 + 8 + 8 –16 = 22
x = (∑x)/n = 22/6 = 3.67
Median = value of the 3.5th term in ranked data = (8+8)/2 = 8
Mode = 8
This data set has no mode because no value appears more than once.
n1 x1 + n 2 x 2 (18)(144) + ( 20)( x 2 )
x= = = 150 . Then,
n1 + n 2 18 + 20
3.29 Sum of the ages of six persons = (6)(46) = 276 years, so the age of sixth
person = 276 – (57 + 39 + 44 + 51 + 37) = 48 years.
3.30 Sum of the prices paid by the seven passengers = (7)(361) = $2527
Total price paid by the couple = 2527 – (402 + 210 + 333 + 695 + 485) = $384
Price paid by each of the couple = 384/2 = $192
3.31 For Data Set I: Mean = 123/5 = 24.6 For Data Set II: Mean = 158/5 = 31.6
The mean of the second data set is greater than the mean of the first data set by 7.
3.32 For Data Set I: Mean = 47/5 = 9.4 For Data Set II: Mean = 94/5 = 18.8
The mean of the second data set is twice the mean of the first data set.
3.34 The ranked data are: 184 195 245 259 271 297 307 309 323 365
369 387 390 410 438 457 578 590 679 795
To calculate the 20% trimmed mean, drop 20% of the smallest values and 20% of the largest values.
This data set contains 20 values, and 20% of 20 is 4. Hence, drop the 4 smallest values and the 4
largest values. By dropping 184, 195, 245, 259, 578, 590, 679, and 795, we obtain
∑x = 271 + 297 + 307 + 309 + 323 + 365 + 369 + 387 + 390 + 410 + 438 + 457 = 4323
20% Trimmed Mean = (∑x)/n = 4323/12 = $306.25 thousand = $306,250
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 43
Weighted mean =
∑ xw = (73)(1) + (67)(1) + (85)(2) = 310 = 77.5
∑w 4 4
3.36 Geometric mean = n x1 ⋅ x2 ⋅ x3 ⋅ ... ⋅ xn = 5 1.04 ⋅ 1.03 ⋅ 1.05 ⋅ 1.06 ⋅ 1.08 = 5 1.287625248 = 1.05
Then, 1– Geometric mean = 1.05 –1= .05, so the mean inflation rate is 5%.
Section 3.2
3.37 Suppose the exam scores for seven students are 73, 82, 95, 79, 22, 86, and 91.
Then, Range = Largest value – Smallest value = 95 – 22 = 73 points.
If we drop the outlier (22) and calculate the range,
Range = Largest value – Smallest value = 95 – 73 = 22 points.
Thus, when we drop the outlier, the range decreases from 73 to 22 points.
3.38 No, the value of the standard deviation cannot be negative, because the deviations from the mean are
squared and, therefore, either positive or zero.
3.39 The value of the standard deviation is zero when all values in a data are the same. For example,
suppose the exam scores of a sample of seven students are 82, 82, 82, 82, 82, 82, and 82. As this data
set has no variation, the value of the standard deviation is zero for these observations. This is shown
below:
∑x = 574 and ∑x2 = 47,068
(∑ x) 2 (574) 2
∑ x2 − n
47,068 −
7 47,068 − 47,068
s= = = =0
n −1 7 −1 6
3.40 A summary measure calculated for a population data set is called a population parameter. If the
average exam score for all students enrolled in a statistics class is 75.3, then 75.3 is a population
parameter. A summary measure calculated for a sample data set is called a sample statistic. If we
took a random sample of 10 students in the statistics class and found the average exam score to be
77.1, this would be an example of a sample statistic.
3.41 Range = Largest value – Smallest value = 16 – (–9) = 25, ∑x = 24, ∑x2 = 564 and N = 8
(∑ x) 2 (24) 2
∑ x2 − N
564 −
8 = 564 − 72 = 61.5
σ2 = = σ = 61.5 = 7.84
N 8 8
3.42 Range = Largest value – Smallest value = 18 – (–16) = 34, ∑x = 22, ∑x2 = 1004, and n = 6
44 Chapter Three
2 (∑ x) 2 (22) 2
∑x − 1004 −
s2 = n = 6 = 1004 − 80.6667 = 184.6667 s = 184.6667 = 13.59
n −1 6 −1 5
(∑ x) 2 (72) 2
∑ x2 − 748 −
s2 = n = 8 = 14.2857 s = 14.2857 = 3.78
n −1 8 −1
( ∑ x) 2 (81) 2
∑ x2 − 699 −
s2 = n = 12 = 13.8409 s = 13.8409 = 3.72 thefts
n −1 12 − 1
(∑ x) 2 (176) 2
∑ x2 − 3872 −
n 8 3872 − 3872
s= = = =0
n −1 8 −1 7
The standard deviation is zero because all these data values are the same and there is no variation
among them.
(∑ x) 2 (114)2
∑ x2 − 2166 −
s= n = 6 2166 − 2166
= =0
n −1 6 −1 5
The standard deviation is zero because all these data values are the same and there is no variation
among them.
3.57 For the yearly salaries of all employees, CV = (σ/μ) × 100% = (6,820/62,350) × 100 = 10.94%
For the years of experience of these employees, CV = (σ/μ) × 100% = (2/15) × 100 = 13.33%
The relative variation in salaries is lower than that in years of experience.
3.58 For the SAT scores of the 100 students, CV = (s/ x ) × 100% = (105/975) × 100 = 10.77%
For the GPAs of these students, CV = (s/ x ) × 100% = (.22/3.16) × 100 = 6.96%
The relative variation in SAT scores is higher than that in GPAs.
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 47
( ∑ x) 2 (123) 2
∑ x2 − 3883 −
s= n = 5 = 214.300 = 14.64
n −1 5 −1
For Data Set II: ∑x = 158, ∑x2 = 5850, and n = 5
(∑ x) 2 (158) 2
∑ x2 − 5850 −
s= n = 5 = 214.300 = 14.64
n −1 5 −1
The standard deviations of the two data sets are equal.
(∑ x) 2 (47) 2
∑ x2 − 507 −
σ= N = 5 = 13.04 = 3.61
N 5
For Data Set II: ∑x = 94, ∑x2 = 2028, and N = 5
(∑ x) 2 (94) 2
∑ x2 − 2028 −
σ= N = 5 = 52.16 = 7.22
N 5
The standard deviation of the second data set is twice the standard deviation of the first data set.
Section 3.3
3.61 The values of the mean and standard deviation for a grouped data set are the approximate values of the
mean and standard deviation. The exact values of the mean and standard deviation are obtained only
when ungrouped data are used.
3.62
x f m mf m2f
2– 4 5 3 15 45
5– 7 9 6 54 324
8 – 10 14 9 126 1134
11 – 13 7 12 84 1008
14 – 16 5 15 75 1125
N = ∑f = 40 ∑mf = 354 ∑m2f = 3636
3.63
x f m mf m2f
0 to less than 4 17 2 34 68
4 to less than 8 23 6 138 828
8 to less than 12 15 10 150 1500
12 to less than 16 11 14 154 2156
16 to less than 20 8 18 144 2592
20 to less than 24 6 22 132 2904
n = ∑f = 80 ∑mf = 752 ∑m2f = 10,048
x = (∑mf)/n = 752/80 = 9.40
2
2 (∑ mf ) (752) 2
∑m f − 10,048 −
s2 = n = 80 = 37.7114 s = 37.7114 = 6.14
n −1 80 − 1
3.64
Amount of Telephone Number of
Bill (dollars) Families m mf m2f
40 to less than 70 9 55 495 27,225
70 to less than 100 11 85 935 79,475
100 to less than 130 16 115 1840 211,600
130 to less than 160 10 145 1450 210,250
160 to less than 190 4 175 700 122,500
n = ∑f = 50 ∑mf = 5420 ∑m2f = 651,050
3.65
Hours Per Week Number of Students m mf m2f
0 to less than 5 7 2.5 17.5 43.75
5 to less than 10 12 7.5 90.0 675.00
10 to less than 15 15 12.5 187.5 2343.75
15 to less than 20 13 17.5 227.5 3981.25
20 to less than 25 8 22.5 180.0 4050.00
25 to less than 30 5 27.5 137.5 3781.25
N = ∑f = 60 ∑mf = 840 ∑m2f = 14,875
(∑ mf ) 2 (840)2
∑ m2 f − N
14,875 −
60 = 51.9167
σ2 = = σ = 51.9167 = 7.21 hours
N −1 60
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 49
3.66
Weight (pounds) Number of Babies m mf m2f
3 to less than 5 5 4 20 80
5 to less than 7 30 6 180 1080
7 to less than 9 40 8 320 2560
9 to less than 11 20 10 200 2000
11 to less than 13 5 12 60 720
N = ∑f = 100 ∑mf = 780 ∑m2f = 6440
3.67
Miles Driven in 2009 Number of
(in thousands) Car Owners m mf m2f
0 to less than 5 7 2.5 17.5 43.75
5 to less than 10 26 7.5 195.0 1462.50
10 to less than 15 59 12.5 737.5 9218.75
15 to less than 20 71 17.5 1242.5 21,743.75
20 to less than 25 62 22.5 1395.0 31,387.50
25 to less than 30 39 27.5 1072.5 29,493.75
30 to less than 35 22 32.5 715.0 23,237.50
35 to less than 40 14 37.5 525.0 19,687.50
n = ∑f = 300 ∑mf = 5900 ∑m2f = 136,275
3.68
Amount of Electric Number of
Bill (dollars) Families m mf m2f
0 to less than 40 5 20 100 2,000
40 to less than 80 16 60 960 57,600
80 to less than 120 11 100 1100 110,000
120 to less than 160 10 140 1400 196,000
160 to less than 200 8 180 1440 259,200
n = ∑f = 50 ∑mf = 5000 ∑m2f = 624,800
∑ m2 f − ∑ n
( mf )2 (5000) 2
624,800 −
s2 = = 50 = 2546.9388 s = 2546.9388 = $50.47
n −1 50 − 1
The values in the column labeled mf give the approximate total amounts of electric bills for the
families belonging to corresponding classes. For example, the five families belonging to the first class
paid a total of $100 for electricity in August 2009. The value ∑mf = $5000 is the approximate total
amount of the electric bills for all 50 families included in the sample.
3.69
x f m mf m2f
0 to less than 20 14 10 140 1400
20 to less than 40 18 30 540 16,200
40 to less than 60 9 50 450 22,500
60 to less than 80 5 70 350 24,500
80 to less than 100 4 90 360 32,400
n = ∑f = 50 ∑mf = 1840 ∑m2f = 97,000
3.70
Number of Errors f m mf m2f
0 11 0 0 0
1 14 1 14 14
2 9 2 18 36
3 7 3 21 63
4 3 4 12 48
5 1 5 5 25
2
N = ∑f = 45 ∑mf = 70 ∑m f = 186
∑ m2 f − ∑ N
( mf ) 2 (70) 2
186 −
σ2 = = 45 = 1.7136 σ = 1.7136 = 1.31 errors
N 45
Section 3.4
3.72 Chebyshev’s theorem is applied to find a lower bound for the area under a distribution curve between
two points that are on opposite sides of the mean and at the same distance from the mean. According to
this theorem, for any number k greater than 1, at least (1 – (1/k2)) of the data values lie within k
standard deviations of the mean.
3.73 The empirical rule is applied to a bell–shaped distribution. According to this rule, approximately
(1) 68% of the observations lie within one standard deviation of the mean.
(2) 95% of the observations lie within two standard deviations of the mean.
(3) 99.7% of the observations lie within three standard deviations of the mean.
1 1
3.74 For the interval x ± 2s : k = 2, and 1 – =1– = 1 – .25 = .75 or 75%. Thus, at least 75% of the
k2 ( 2) 2
3.75 For the interval μ ± 2σ : k = 2, and 1 – 1 = 1 – 1 = 1 – .25 = .75 or 75%. Thus, at least 75% of
k2 ( 2) 2
the observations fall in the interval μ ± 2σ .
For the interval μ ± 2.5σ : k = 2.5, and 1 – 1 = 1 – 1 = 1 – .16 = .84 or 84%. Thus, at least
k2 ( 2.5) 2
84% of the observations fall in the interval μ ± 2.5σ .
For the interval μ ± 3σ : k = 3, and 1 – 1 = 1 – 1 = 1 –.11 = .89 or 89%. Thus, at least 89% of the
k2 (3) 2
observations fall in the interval μ ± 3σ .
3.76 Approximately 68% of the observations fall in the interval μ ± σ , approximately 95% fall in the
interval μ ± 2σ , and about 99.7% fall in the interval μ ± 3σ .
52 Chapter Three
3.77 Approximately 68% of the observations fall in the interval x ± s , approximately 95% fall in the
interval x ± 2s , and about 99.7% fall in the interval x ± 3s .
Thus, at least 75% of the runners ran the race in 180 to 260 minutes.
b. Each of the two values is 60 minutes from μ = 220. Hence,
1 1
k = 50/20 = 2.5 and 1 – =1– = 1 – .16 = .84 or 84%.
k2 ( 2.5) 2
Thus, at least 84% of the runners ran this race in 170 to 270 minutes.
3.79 a. Each of the two values is $1.2 million from μ = $2.3 million. Hence,
1
k = 1.2/.6 = 2 and 1 – 1 = 1 – = 1 – .25 = .75 or 75%.
k 2 ( 2) 2
Thus, at least 75% of all companies had 2009 gross sales of $1.1 to $3.5 million.
b. Each of the two values is $1.5 million from μ = $2.3 million. Hence,
1
k = 5200/2600 = 2 and 1 – 1 = 1 – = 1 – .25 = .75 or 75%.
k 2 ( 2) 2
Thus, at least 75% of all households have credit card debt between $4300 and $14,700.
ii. Each of the two values is $6500 from μ = $9500. Hence,
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 53
1
k = 680/340 = 2 and 1 – 1 = 1 – = 1 – .25 = .75 or 75%.
k 2 ( 2) 2
Thus, at least 75% of all homeowners pay a monthly mortgage of $1685 to $3045.
ii. Each of the two values is $1020 from μ = $2365. Hence,
1
k = 1020/340 = 3 and 1 – 1 = 1 – = 1 – .11 = .89 or 89%.
k 2 (3)2
Thus, at least 89% of all homeowners pay a monthly mortgage of $1345 to $3385.
1
b. 1 – 1 = .84 gives 1 = 1 – .84 = .16 or k2 = so k = 2.5.
k 2
k 2 .16
Section 3.5
3.87 The interquartile range (IQR) is given by Q3 – Q1, where Q1 and Q3 are the first and third quartiles,
respectively. Examples 3–20 and 3–21 of the text show how to find the IQR for a data set.
3.88 Given a data set of n values, to find the kth percentile (Pk):
1. Rank the given data in increasing order.
2. Calculate kn/ 100. Then, Pk is the term that is approximately (kn/100) in the ranking. If kn/ 100 falls
between two consecutive integers a and b, it may be necessary to average the ath and bth values in
the ranking to obtain Pk.
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 55
3.89 If xi is a particular observation in the data set, the percentile rank of xi is the percentage of the values
in the data set that are less than xi. Thus,
Number of values less than xi
Percentile rank of xi = × 100
Total number of values in the data set
3.93 The ranked data are: 318 336 337 339 362 363 366 369 372 375
378 381 384 385 386 387 390 393 395 403
405 409 417 431 433 434 438 444 461 480
a. The quartiles are Q1 = 369, Q2 = (386 + 387)/2 = 386.5, and Q3 = 417
IQR = Q3 – Q1 = 417 – 369 = 48
b. kn/100 = 57(30)/100 = 17.1 ≈ 17
56 Chapter Three
Thus, the 57th percentile can be approximated by the value of the 17th term in the ranked data,
which is 390. Therefore, P57 = 390.
c. Twenty-two values in the given data are smaller than 417. Hence, the percentile rank of
417 = (22/30) × 100 = 73.33%.
Thus, the 25th percentile may be approximated by the value of the fifth term in the ranked data,
which is 5. Therefore, P25 = 5. Thus, the number of new cars sold at this dealership is less than or
equal to 5 for approximately 25% of the days in this sample.
c. Thirteen values in the given data are less than 10. Hence, the percentile rank of
10 = (13/20) × 100 = 65%. Thus, on 65% of the days in the sample, this dealership sold fewer than
10 cars.
3.97 The ranked data are: 356 422 430 468 494 533 572 600 604 617 625
628 639 647 690 702 728 747 749 772 797 805
a. The three quartiles are
Q1 = 533, Q2 = (625 + 628)/2 = 626.5, and Q3 = 728
IQR = Q3 – Q1 = 728 – 533 = 195
The value 617 lies between Q1 and Q2, which indicates that it is in the second 25% group from the
bottom in the (ranked) data set.
b. kn/100 = 30(22)/100 = 6.6 ≈ 6.5
Thus, the 30th percentile may be approximated by the average of the 6th and 7th terms in the ranked
data. Therefore, P30 = (533 + 572)/2 = 552.5. Thus, approximately 30% of the credit scores of
applicants are at or below 552.5.
c. Five values in the given data are smaller than 533. Hence, the percentile rank of
533 = (5/22) × 100 = 22.73%. Thus, 22.73% of the credit scores of applicants are below 533.
Section 3.6
3.98 A box–and–whisker plot is based on five summary measures: the median, the first quartile, the third
quartile, and the smallest and largest value in the data set between the lower and upper inner fences.
0 20 40 60 80 100 120
-20 0 20 40 60 80
0 10 20 30 40
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 59
3.102 The ranked data are: 356 422 430 468 494 533 572 600 604 617 625
628 639 647 690 702 728 747 749 772 797 805
Median = (625 + 628)/2 = 626.5, Q1 = 533, and Q3 = 728
IQR = Q3 – Q1 = 728 – 533 = 195, 1.5 × IQR = 1.5 × 195 = 292.5,
Lower inner fence = Q1 – 292.5 = 533 – 292.5 = 240.5,
Upper inner fence = Q3 + 292.5 = 728 + 292.5 = 1020.5
The smallest and the largest values within the two inner fences are 356 and 805, respectively. The data
set contains no outliers.
3.103 The ranked data are: 22.5 25.0 26.8 27.0 29.8 51.2 64.0 80.3 94.4 97.7 112.0 261.7
Median = (51.2 + 64.0)/2 = 57.6, Q1 = (26.8 + 27.0)/2 = 26.9, and Q3 = (94.4 + 97.7)/2 = 96.05,
IQR = Q3 – Q1 = 96.05 – 26.9 = 69.15, 1.5 × IQR = 1.5 × 69.15 = 103.725,
Lower inner fence = Q1 – 103.725 = 26.9 – 103.725 = –76.825,
Upper inner fence = Q3 + 103.725 = 96.05 + 103.725 = 199.775
The smallest and largest values within the two inner fences are 22.5 and 112, respectively. The value
261.7 is an outlier.
40 45 50 55 60
3.105 The ranked data are: 318 336 337 339 362 363 366 369 372 375
378 381 384 385 386 387 390 393 395 403
405 409 417 431 433 434 438 444 461 480
Median = (386 + 387)/2 = 386.5, Q1 = 369, and Q3 = 417,
IQR = Q3 – Q1 = 417 – 369 = 48, 1.5 × IQR = 1.5 × 48 = 72,
Lower inner fence = Q1 – 72 = 369 – 72 = 297,
Upper inner fence = Q3 + 72 = 417 + 72 = 489
The smallest and largest values within the two inner fences are 318 and 480, respectively. There are no
outliers.
2 4 6 8 10 12 14 16
15 20 25 30 35 40
2 4 6 8 10 12 14 16 18
Supplementary Exercises
(∑ x) 2 (46.22) 2
∑ x2 − n
228.5904 −
10
s2 = = = 1.6624 s = 1.6624 = 1.29 seconds
n −1 10 − 1
3.113
Rainfall Number of Cities m mf m2f
0 to less than 2 6 1 6 6
2 to less than 4 10 3 30 90
4 to less than 6 20 5 100 500
6 to less than 8 7 7 49 343
8 to less than 10 4 9 36 324
10 to less than 12 3 11 33 363
N = ∑f = 50 ∑mf = 254 ∑m2f = 1626
Thus, at least 89% of the students will learn the basics in 140 to 260 minutes.
1
b. 1 – 1 = .75 gives 1 = 1 – .75 = .25 or k2 = , so k = 2
k 2
k 2 .25
64 Chapter Three
3.116 a. i. Each of the two values is 900 hours from µ = $1750 hours. Hence,
Thus, at least 89% of households watched between 400 and 3100 hours of television.
1
b. 1 – 1 = .84 gives 1 = 1 – .84 = .16 or k2 = , so k = 2.5.
k 2
k 2 .16
μ − 2.5σ = 1750 – 2.5(450) = 625 hours and μ + 2.5σ = 1750 + 2.5(450) = 2875 hours
Thus, the required interval is 625 to 2875 hours.
b. kn/100 = 70(10)/100 = 7
Thus, the 70th percentile occurs at the seventh term in the ranked data, which is 84. Therefore,
P70 = 84. This means that about 70% of the values in the data set are smaller than or equal to 84.
c. Seven values in the given data are smaller than 97. Hence, the percentile rank of
97 = (7/10) × 100 = 70%. This means approximately 70% of the values in the data set are less than
97.
3.120 The ranked data are: 1779 1788 1791 1817 1857 1978 1989 2164 2250 2323
a. The three quartiles are Q1 = 1791, Q2 = (1857 + 1978)/ 2 = 1917.5, and Q3 = 2164
IQR = Q3 – Q1 = 2164 – 1791 = 373
The value 1978 is between Q2 and Q3, which indicates it is in the third 25% group from the bottom
in this (ranked) data set.
b. kn/100 = 70(10) / 100 = 7. Thus, the 70th percentile occurs at the seventh term in the ranked data,
which is 1989. Therefore, P70 = 1989. Thus, approximately 70% of these players scored less than
or equal to 1989 total points during the 2007-08 regular season.
c. One value in the given data is less than 1788. Hence, the percentile rank of
1788 = (1/10) × 100 = 10%. Thus, 10% of the players scored fewer than 1788 total points during
the 2007-08 regular season.
The smallest and largest values in the data set within the two inner fences are 4 and 65, respectively.
The value 81 is an outlier.
-20 0 20 40 60 80 100
75 + 69 + 87 + y
3.123 Let y = Melissa’s score on the final exam. Then, her grade is . To get a B, she needs
5
this to be at least 80. So we solve,
75 + 69 + 87 + y
80 =
5
5(80) = 75 + 69 + 87 + y
400 = 231 + y
y = 169
Thus, the minimum score that Melissa needs on the final exam in order to get a B grade is 169 out of
200 points.
3.124 a. Let y = amount that Jeffery suggests. Then, to insure the outcome Jeffery wants, we need
y + 12,000(5)
= 20,000
6
y + 12,000(5) = 6(20,000)
y + 60,000 = 120,000
y = 60,000
So, Jeffery would have to suggest $60,000 be awarded to the plaintiff.
b. To prevent a juror like Jeffery from having an undue influence on the amount of damage to be
awarded to the plaintiff, the jury could revise its procedure by throwing out any amounts that are
outliers and then recalculate the mean, or by using the median, or by using a trimmed mean.
3.125 a. Since x = (∑x)/n, we have 76 = (∑x)/5, so ∑ x = 5(76) = 380 inches. If we replace the tallest
player by a substitute who is two inches taller, the sum of the new heights is 380 + 2 = 382 inches.
Thus, the new mean is x = 382/5 = 76.4 inches. Since Range = Largest value – Smallest value, and
the largest value has increased by two while the smallest value is unchanged, the range has
increased by two. Thus, the new range is 11 + 2 = 13 inches. The median is the height of the third
player (if their heights are ranked) and this does not change. So, the median remains 78 inches.
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 67
b. If we replace the tallest player by a substitute who is four inches shorter, then by reasoning similar
to that in part a, we have a new mean of x = 376/5 = 75.2 inches. You cannot determine the new
median or range with only the information given. We do not know how the new player’s height
compares to the rest of the players on the team; we have no knowledge of whether the substitute is
now the tallest player or not.
3.126 a. To calculate how much time the trip requires, divide miles driven by miles per hour for each 100
mile segment. Then, time = 100/52 + 100/65 + 100/58 = 1.92 + 1.54 + 1.72 = 5.18 hours.
b. Linda’s average speed for the 300 mile trip is not equal to (52 + 65 + 58)/3 = 58.33 mph. This
would assume that she spent an equal amount of time on each 100 mile segment, which is not true,
because her average speed is different on each segment. Linda’s average speed for the entire 300
mile trip is given by (miles driven)/(elapsed time) = 300/5.18 = 57.92 mph.
3.127 The mean price per barrel of oil purchased in that week is
Mean = [(1000)(51) + (200)(64) + (100)(70)]/1300 = 70,800/1300 = $54.46 per barrel
3.128 The method of calculating the mean is wrong in this case because it does not take into account the fact
that the homeowner bought different amounts of heating oil in the four deliveries. The correct method
of calculating the mean is
µ = (∑mf)/n = [(2.22)(209) + (2.34)(182) + (2.41)(157) + (2.43)(149)]/(209 + 182 + 157 + 149)
= $2.34 per gallon.
3.130 a. Total amount spent per month by the 2000 shoppers = (14)(8)(1100) + (18)(11)(900) = $301,400
b. Total number of trips per month by the 2000 shoppers = (8)(1100) + (11)(900) = 18,700
Mean number of trips per month per shopper = 18,700/2000 = 9.35 trips
c. Mean amount spent per person per month by shoppers aged 12-17 = 301,400/2000 = $150.70
3.131 a. For people age 30 and under, we have the following death rates from heart attack:
68 Chapter Three
number of deaths 1
Country A: × 1000 = × 1000 = 25
population 40
number of deaths .5
Country B: × 1000 = × 1000 = 20
population 25
Thus, the death rate for Country A is greater than that for Country B for people age 31 and older.
c. The overall death rates are as follows:
number of deaths 3
Country A: × 1000 = × 1000 = 50
population 60
number of deaths 3.5
Country B: × 1000 = × 1000 = 58.3
population 60
Thus, overall the death rate for country A is lower than the death rate for Country B.
d. In both countries people age 30 and under have a lower percentage of death due to heart attack
than people age 31 and over. Country A has 2/3 of its population age 30 and under while more
than 1/2 of the people in Country B are age 31 and over. Thus, more people in Country B than in
County A fall into the higher risk group which drives up Country B’s overall death rate from heart
attacks.
3.132 Total distance for the first 100 students = (100)(8.73) = 873 miles
Total distance for all 103 students = 873 + 11.5 + 7.6 + 10.0 = 902.1 miles
Mean distance for all 103 students = 902.1/103 = 8.76 miles
Thus, at least 50% of the scores are within 1.4 standard deviations of the mean.
b. Using Chebyshev’s theorem, we first find k so that at least 1 –.20 = .80 of the scores are within k
standard deviations of the mean.
1
1 – 1 = .80 gives 1 = 1 – .80 = .20 or k2 = = 5, so k = 5 ≈ 2.2.
k 2
k 2 .20
Thus, at least 80% of the scores are within 2.2 standard deviations of the mean, but this means that
at most 10% of the scores are greater than 2.2 standard deviations above the mean.
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 69
3.134 a. Since we are dealing with a bell-shaped distribution and we know that 16% of all students scored
above 85, which is µ + 15, we must also have that 16% of all students scored below µ – 15 = 55.
Therefore, the remaining 68% of students scored between 55 and 85. By the empirical rule, we
know that approximately 68% of the scores fall in the interval μ − σ to μ + σ , so we have
μ − σ = 70 − σ = 55 and μ + σ = 70 + σ = 85. Thus, σ = 15.
b. We know that 95% of the scores are between 60 and 80 and that µ = 70. By the empirical rule, 95%
of the scores fall in the interval μ − 2σ to μ + 2σ . Then
60 = μ − 2σ = 70 − 2σ and 80 = μ + 2σ = 70 + 2σ . Therefore, 10 = 2σ and σ = 5.
The vacation expenditures are strongly skewed to the right. Most of the expenditures are relatively
small ($3400 or less) but there are two extreme outliers ($8200 and $321,500).
b. Neither the mode ($0) nor the mean ($13,872) are typical of these expenditures. Thus, the median
($500) is the best indicator of the average family’s vacation expenditures.
3.137 a.
Men
Women
60 70 80 90 100 110 120
The box–and–whisker plots show that the men’s scores tend to be lower and more varied than the
women’s scores. The men’s scores are skewed to the right, while the women’s are more nearly
symmetric.
b. Men Women
x = 82 x = 97.53
Median = 79 Median = 98
Modes = 75, 79, and 92 Modes = 94 and 100
Range = 45 Range = 36
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 71
s2 = 145.8750 s2 = 71.2667
s = 12.08 s = 8.44
Q1 = 73.5 Q1 = 94
Q3 = 89.5 Q3 = 101
IQR = 16 IQR = 7
These numerical measures confirm the observations based on the box–and–whisker plots.
3.139 a. The total enrollment in the 25 freshman engineering classes is (24)(25) + 150 = 750. Then, the
mean size of these 25 classes is 750/25 = 30.
b. Each student attends five classes with total enrollment of 25 + 25 + 25 + 25 + 150 = 250. Then, the
mean size of the class is 250/5 = 50.
The means in parts a and b are not equal because:
1) From the college’s point of view, the large class of 150 is just one of 25 classes, so its influence
on the mean is strongly offset by the 24 small classes. This leads to a relatively small mean of
30 students per class.
2) From the point of view of each student, the larger class is one of just five, so it has a stronger
influence on the mean. This results in a larger mean of 50.
(∑ x) 2 (6597 ) 2
∑ x2 − 1,030,639 −
s= n = 44 = 31.0808 pounds
n −1 44 − 1
For men only:
n = 22, ∑x = 3848, ∑x2 = 680,724 and Median = 179 pounds
x = (∑x)/n = 3848/22 = 174.91 pounds
(∑ x) 2 (3848) 2
∑ x2 − 680,724 −
s= n = 22 = 19.1160 pounds
n −1 22 − 1
For women:
n = 22, ∑x = 2749, ∑x2 = 349,915 and Median = 123 pounds
x = (∑x)/n = 2749/22 = 124.95 pounds
72 Chapter Three
(∑ x) 2 (2749) 2
∑ x2 − 349,915 −
s= n = 22 = 17.4778 pounds
n −1 22 − 1
In this case, the median may be more informative than the mean, since it is less influenced by
extremely high or low weights. As one might expect, the mean and median weights for men are higher
than those of women. For the entire group, the mean and median weights are about midway between
the corresponding values for men and women. The standard deviations are roughly the same for men
and women. The standard deviation for the whole group is much larger than for men or women only,
due to the fact that it includes the lower weights of women and the heavier weights of men.
Thus, at least 55.6% of the fish are between 3 and 9 inches in length.
1
b. 1 – 1 = .84 gives 1 = 1 – .84 = .16 or k2 = , so k = 2.5
k 2
k 2 . 16
3.143 a. The mean, median, and standard deviations of the weights of males and females in grams are:
Men Women
x = 76,188.65 x = 54,428.95
Median = 77,970.61 Median = 53,577.57
s = 8326.7209 s = 7613.1618
The mean, median, and standard deviations of the weights of males and females in stones are:
Men Women
x = 12.49 x = 8.93
Median = 12.79 Median = 8.79
s = 1.3654 s = 1.2484
b. Converting the answers from Problem 3.140 to grams yields:
Men Women
x = 76,189.05 x = 54,426.97
Median = 77,970.61 Median = 53,577.57
s = 8326.7384 s = 7613.1549
Converting the answers from Problem 3.140 to stones yields:
Men Women
x = 12.49 x = 8.93
Median = 12.79 Median = 8.79
s = 1.3686 s = 1.2484
The answers are the same as those from part a with the exception of rounding error.
c. When converting from a larger unit to a smaller unit, the summary measures get larger by the
conversion factor. When converting from a smaller unit to a larger unit, the summary measures get
smaller by the conversion factor.
d. The distribution in units of pounds has more variability than that of stones. This is so because we
converted from a smaller unit to a larger unit; hence, the standard deviation was reduced by the
amount of the conversion factor (s for stones = s for pounds/14).
- - 2, 5, 8, 1 1 1 2 2 2 2 3 3 3 4 4 4 5 5 5 5 6 6 6 7 7 7 8 8 8 8 Pounds
4, 1, 0 0 0 1, 4, 7, 0, 3, 6, 9, 2, 5, 8, 1, 4, 7, 0, 3, 6, 9, 2, 5, 8, 1, 4, 7, 0, 3, 6, 9,
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
e.
- - 2, 5, 8, 1 1 1 2 2 2 2 3 3 3 4 4 4 5 5 5 5 6 6 6 7 7 7 8 8 8 8
Grams
4, 1, 0 0 0 1, 4, 7, 0, 3, 6, 9, 2, 5, 8, 1, 4, 7, 0, 3, 6, 9, 2, 5, 8, 1, 4, 7, 0, 3, 6, 9,
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
74 Chapter Three
The distribution in units of grams has more variability than that of pounds. This is so because we
converted from a larger unit to a smaller unit; hence, the standard deviation was increased by the
amount of the conversion factor (s for grams = s for pounds × 435.59). Because of the large
difference in units, retaining the same scale for the stacked plots displays all of the data values for
pounds in one location with no variability visible.
3.145 This golfer's score was not an outlier; therefore, her score must be less than the value of the upper inner
fence. From Exercise 3.137, Q3 = 101 and IQR = 7. Then, 1.5 × IQR = 1.5 × 7 = 10.5 and Q3 + 10.5 =
Introductory Statistics, Mann, Seventh Edition - Instructor’s Solutions Manual 75
111.5. Since this golfer had the uniquely highest score, and the next highest score was 107, she shot
between 108 and 111.
3.146 ∑x/10 = –645.5/10 = –64.55. This number represents the average returns for the stocks of these 10
companies for October 2008.
Self–Review Test
1. b 2. a and d 3. c 4. c 5. b 6. b 7. a
16. Suppose the exam scores for seven students are 73, 82, 95, 79, 22, 86, and 91 points. Then,
Mean = (73 + 82 + 95 + 79 + 22 + 86 + 91)/7 = 75.43 points. If we drop the outlier (22),
Mean = (73 + 82 + 95 + 79 + 86 + 91)/6 = 84.33 points. This shows how an outlier can affect the
value of the mean.
17. Suppose the exam scores for seven students are 73, 82, 95, 79, 22, 86, and 91 points. Then,
Range = Largest value – Smallest value = 95 – 22 = 73 points.
If we drop the outlier (22) and calculate the range,
Range = Largest value – Smallest value = 95 – 73 = 22 points.
Thus, when we drop the outlier, the range decreases from 73 to 22 points.
18. The value of the standard deviation is zero when all the values in a data set are the same. For example,
suppose the heights (in inches) of five women are: 67 67 67 67 67
This data set has no variation. As shown below the value of the standard deviation is zero for this
data set. For these data: n = 5, ∑x = 335, and ∑x2 = 22,445.
(∑ x) 2 (335) 2
∑ x2 − 22,445 −
n 5 22,445 − 22,445
s= = = =0
n −1 5 −1 4
76 Chapter Three
19. a. The frequency column gives the number of weeks for which the number of computers sold was in
the corresponding class.
b. For the given data: n = 25, ∑mf = 486.50, and ∑m2f = 10,524.25
x = (∑mf)/n = 486.50/25 = 19.46 computers
(∑ mf ) 2 (486.50) 2
∑ m2 f − 10,524.25 −
s2 = n = 25 = 44.0400 , s = 44.0400 = 6.64 computers
n −1 25 − 1
20. a. i. Each of the two values is 5.5 years from µ = 7.3 years. Hence,
1
k = 5.5/2.2 = 2.5 and 1 – 1 = 1 – = 1 – .16 = .84 or 84%
k 2 (2.5) 2
Thus, at least 84% of the cars are 1.8 to 12.8 years old.
ii. Each of the two values is 6.6 years from µ = 7.3 years. Hence
1
k = 6.6/2.2 = 3 and 1 – 1 = 1 – 2 = 1 – .11 = .89 or 89%
k 2 (3)
c. Ten values in the given data are less than 12. Hence, the percentile rank of
12 is (10/15) × 100 = 66.67%. Thus, 66.67% of the half hour time periods had fewer than 12
passengers set off the metal detectors during this day.
-5 0 5 10 15 20 25
26. The ranked data are: 207 238 287 293 349 366 463 479 538 2534
Thus, to find the 10% trimmed mean, we drop the smallest value and the largest value (10% of 10 is 1)
and find the mean of the remaining 8 values. For these 8 values,
∑x = 238 + 287 + 293 + 349 + 366 + 463 + 479 + 538 = 3013
10% trimmed mean = (∑x)/8 = 3013/8 = $376.63 thousand = $376,630. The 10% trimmed mean is a
better summary measure for these data than the mean of all 10 values because it eliminates the effect of
the outlier, 2534.
(∑ x) 2 (79) 2
∑ x2 − 1945 −
s= n = 4 = 11.32
n −1 4 −1
c. For Data Set II: ∑x = 67, ∑x2 = 1507, and n = 4
(∑ x) 2 (67) 2
∑ x2 − 1507 −
s= n = 4 = 11.32
n −1 4 −1
The standard deviations of the two data sets are equal.