CH 4 Estimation.
CH 4 Estimation.
CH 4 Estimation.
STATISTICAL ESTIMATION
o The sampling distribution of the mean shows how far sample means
could be from a known population mean.
o Similarly, the sampling distribution of the proportion shows how far
sample proportions could be from a known population proportion.
o In estimation, our aim is to determine how far an unknown
population mean could be from the mean of a simple random
sample selected from that population; or how far an unknown
population proportion could be from a sample proportion.
1
Basic concepts:
Estimation: is the process of using statistics as estimates of parameters.
2
Important Properties of Estimators
3
3. Consistency:
A statistic is a consistent estimator of a population
parameter if, as the sample size increases, it becomes
almost certain that the value of the statistic comes very
close to the value of the population parameter.
4. Sufficiency:
A sufficient statistic is an estimator that utilizes all the
information a sample contains about the parameter to
be estimated. For example, the sample mean is a
sufficient estimator of the population mean.
4
Types of Estimates
1. A point estimate: -
Is a single number that is used to estimate an
unknown population parameter.
The most important point estimates (given that they are single values) are:
5
2. An interval estimate
It is a range of values used to estimate a population parameter.
It describes the range of values with in which a parameter might lie.
Example:
Suppose we have the sample 10, 20, 30, 40 and 50 selected randomly from a population whose
mean is unknown.
xi 10 20 30 40 50
The sample Mean, x 30 is a point estimate of .
n 5
On the other hand, if we state that the mean, is between x 10 , the range of values from 20 (30-
7
Cont’d…..
8
Confidence interval for population mean is affected by
1)The population distribution: whether the population is
normally distributed or not.
2) The standard deviation: whether it is known or not.
3) The sample size: whether n is large or not.
9
10
To find the interval estimate of population mean, we have the
following steps.
1. Compute the standard error of the mean
2. Compute from the confidence coefficient.
3. Find the Z value for the from the table
4. Construct the confidence interval
5. Interpret the results
11
EXAMPLE 1:
Suppose that the standard deviation of the tube life for a particular brand of TV picture tube is
known to be σ = 500, but that the mean operating life is not known. Overall, the operating life of
the tubes is assumed to be approximately normally distributed. For a sample of n = 15, the mean
operating life is X = 8,900 hr. Determine (a) the 95 percent and (b) the 90 percent confidence
The normal probability distribution can be used in this case because the population is normally
13
Example 2:
With respect to the above example 1, suppose that the population of tube life cannot be assumed to
be normally distributed. However, the sample mean of X = 8,900 is based on a sample of n = 35.
Determine the 95 percent confidence interval for estimating the population mean.
500
Ans X Z α
2
σ 8,900 1.96
n 35
500
8,900 1.96( ) 8,900 1.96(84.46) 8,734 to 9,066
5.92
14
Confidence interval estimate of mean
Normal population, s.d unknown, n large
The confidence interval to estimate mean when
population standard deviation is unknown,
population normal and n is large is
15
Example:
For a given week, a random sample of 30 hourly employees selected from a very large number of
employees in a manufacturing firm has a sample mean wage of X = $180, with a sample standard
deviation of s = $14. We estimate the mean wage for all hourly employees in the firm with an
interval estimate such that we can be 95 percent confident that the interval includes the value of
Given:
X = $180 s = $14
Thus, we can state that the mean wage level for all
employees is between $174.98 and $185.02, with a 95
percent level of confidence in this estimate 17
Example 2:
A study is being conducted in a company that has 800
engineers. A random sample of 50 of these engineers reveals
that the average sample age is 34.3 years, and the sample
standard deviation is 8 years. Assuming normality, construct
a 98% confidence interval to estimate the average age of all
engineers in this company.
Given:
18
S N n
Solution : S x *
n N 1
8 800 50
i. = * 1.10
50 800 1
2 = 0.02/2 = 0.01
iii. Z / 2 Z 0.01 2.33
N n
iv. X Z / 2 s *
n N 1
= 34.3 ± 2.33(1.10)
= 34.3 ± 2.56
31.74 ≤ ≤ 36.86
19
Confidence interval for mean, unknown sd, n-small, population normal
20
X X
t
SX s
n
This formula is essentially the same as the z-formula, but the distribution table
values are not. The confidence interval to estimate mean becomes:
X t / 2,v s
n
Where: X = sample mean
α=1–C
ν = n – 1 (degrees of freedom)
n = sample size
iii. Look up t / 2 ,V
iv. Construct the confidence interval
A. Interpret results
22
Example:
In example 1 above we constructed confidence intervals for estimating the
mean operating life of a particular brand of TV picture tube based on the
assumption that the operating life of all tubes is approximately normally
distributed & s.d = 500, and given a sample of n = 15 withmean = 8,900 hr.
Suppose that s.d is not known, but rather, that the sample standard
deviation is S = 500.
2 = 0.05/2 = 0.025
= 22.4 ± 1.25
21.15 ≤ ≤ 23.65 26
B) Assume that the population of weekly contact data has a normal
distribution. Use the t distribution to develop a 95% confidence
interval for the mean number of weekly customer contacts.
C) Compare your answer for parts (a) and (b).
What do you conclude from your results?
Given
n= 61 weekly contact reports X = 22.4 contacts
S = 5 contacts C = 0.95
27
S 5
i. SX = = 0.64
n 61
ν = n – 1 = 61 – 1 = 60
2 = 0.05/2 = 0.025
iv. X t / 2,v s
n
= 22.4 ± 2.00 (0.64)
= 22.4 ± 1.28
21.12 ≤ ≤ 23.68
28
Interval Estimation of the Population Proportion
PP
PP
Z
p Pq
n
P pZ p q
n
Since Z can assume both positive and negative values, it becomes
P pZ pq pZ Sp
/2 n /2
29
Example 1:
Recently, a study of 87 randomly selected companies with telemarketing
operation was completed. The study revealed that 39% of the sampled
companies had used telemarketing to assist them in order processing. Using
this information estimate the population proportion of telemarketing companies
who use their telemarketing operation to assist them in order processing taking
a 95% confidence level.
Given:
n= 87 p = 0.39
q = 0.61 C = 0.95
30
Solution:
0.73 0.87
Pointes estimate 0.80 or
2
32
0.73 p Z / 2 s p
0.87 p Z / 2 s p
1.60 = 2 p
p 0.80
33
B.
P p Z / 2 S p
0.87 = 0.8+ Z / 2 S p
35
Solution:
P p Z / 2 S p
0.25 = 0.30 - Z / 2 S p
38
Sample size for estimating population mean
The confidence interval for mean is
X Z / 2
n
From the above expression Z /2 is called error of estimation (e). So
n
σ
e Zα 2
n
Zα 2 σ 2
n μ
e
39
Example:
A gasoline service station shows a standard deviation of Birr 6.25 for the changes
made by the credit card customers. Assume that the station’s management would like
to estimate the population mean gasoline bill for its credit card customers to be with
in ± Birr 1.00. For a 95% confidence level, how large a sample would be necessary?
Given:
n
e
1.96 * 6.25
2
n 151
1
Example 2:
The National Travel and Tour Organization (NTO) would like to estimate the
mean amount of money spent by a tourist to be with in Birr 100 with 95%
confidence. If the amount of money spent by tourist is considered to be
normally distributed with a standard deviation of Br 200, what sample size
would be necessary for the NTO to meet their objective in estimating this mean
amount?
41
Given:
e = Birr 100 σ = Birr 200
Z / 2
2
n
e
2
1.96 * 200
n
100
15.37 16
42
Sample size for estimating population proportion, p .
The confidence interval for p is P p Z pq
α/2
n
The expression pq is called the error term (e).
Z α/2
n
pq
That is, e Z / 2
n
pq
Squaring both sides e 2
Z / 2
2
2 n
Z / 2 pq Z / 2
2
np 2 np pq
then e e
43
Example
Suppose that a production facility purchases a particular
component parts in large lots from a supplier. The production
manager wants to estimate the proportion of defective parts
received from this supplier. She believes that the proportion of
defects is no more than 0.2 and wants to be with in 0.02 of the
true proportion of defects with a 90% level of confidence. How
large a sample should she take?
Given :
e = 0.02 p = 0.2 q =0.8
C = 0.90 Z / 2 Z 0.05 1.64
44
Solution:
2
Z /
np 2
pq
e
2
1.64
np 0.2 * 0.8
0.02
1075.84 1076
45
Example 2:
What is the largest sample size that would be needed in estimating a
population proportion to with in ± 0.02, with a confidence coefficient of 0.95?
Given:
e = 0.02 C = 0.95
Z / 2 Z 0.025 1.96
Solution:
The largest sample size would be obtained when p = 0.5. So,
2
Z / 2
2 1.96
np np 0.5 * 0.5
pq 0.02
e
2401
46
EXERCISE
Determine the sample size necessary to estimate p for the
following information.
a). e = .02, p is approximately 0.40, and confidence level is 96%
b). e is to be within .04, p is unknown, & confidence level is 95%
c). e is to be within 5%, p is approximately 55%, and confidence
level is 90%
d). e is to be no more than .01, p is unknown, and confidence
level is 99%
a. 2522 b. 601 c. 268 d. 16,577
47
END OF CHAPTER
4
48