Sampling Distributions
Sampling Distributions
Sampling Distributions
Sampling Distribution
Introduction
• In real life calculating parameters of
populations is prohibitive because
populations are very large.
• Rather than investigating the whole
population, we take a sample, calculate a
statistic related to the parameter of interest,
and make an inference.
• The sampling distribution of the statistic is
the tool that tells us how close is the statistic
to the parameter.
Sample Statistics as Estimators
of Population Parameters
• A sample statistic is a A population parameter
numerical measure of a is a numerical measure of
summary characteristic a summary characteristic
of a sample. of a population.
E(X) = 1(1/6) +
x 1 2 3 4 5 6 2(1/6) + 3(1/6)+
………………….= 3.5
p(x) 1/6 1/6 1/6 1/6 1/6 1/6
V(X) = (1-3.5)2(1/6) +
(2-3.5)2(1/6) +
…………. …= 2.92
Throwing a dice twice – sampling
distribution of sample mean
E( x) =1.0(1/36)+
6/36 1.5(2/36)+….=3.5
5/36
V(X) = (1.0-3.5)2(1/36)+
4/36 (1.5-3.5)2(2/36)... = 1.46
3/36
2/36
1/36
1 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 x
Sampling Distribution of the
Mean
n5 n 10 n 25
x 3.5 x 3.5 x 3.5
2x 2x 2x
.5833 ( )
2
x x .2917 ( )
2
.1167 ( )
2
x
5 6 10 25
Sampling Distribution of the
Mean
n5 n 10 n 25
x 3.5 x 3.5 x 3.5
2 2x 2x
2x .5833 ( x
) .2917 ( )
2
x .1167 ( )
2
x
5 10 25
The expected value of the sample mean is equal to the population mean:
E( X )
X X
The variance of the sample mean is equal to the population variance divided by
the sample size:
2
V(X) 2
X
X
n
The standard deviation of the sample mean, known as the standard error of
the mean, is equal to the population standard deviation divided by the square
root of the sample size:
X
s.e. SD( X ) X
n
The Central Limit Theorem
n=5
When sampling from a population 0.25
P(X)
0.15
0.10
mean and standard deviation n as
P(X)
0.1
(n >30). Large n
0.4
0.3
f(X)
For “large enough” n: X ~ N ( , / n)
0.2
2
0.1
0.0
-
X
The Central Limit Theorem Applies to
Sampling Distributions from Any Population
Normal Uniform Skewed General
Population
n=2
n = 30
X X X X
The Central Limit Theorem
(Example)
Mercury makes a 2.4 liter V-6 engine, used in speedboats. The company’s
engineers believe the engine delivers an average horsepower of 220 HP and
that the standard deviation of power delivered is 15 HP. A potential buyer
intends to sample 100 engines. What is the probability that the sample mean
will be less than 217 HP?
X 217
P( X 217) P
n n
217 220 217 220
P Z P Z
15 15
100 10
P( Z 2) 0.0228
Student’s t Distribution
If the population standard deviation, , is unknown, replace with
the sample standard deviation, s. If the population is normal, the
t X
resulting statistic:
s/ n
has a t distribution with (n - 1) degrees of freedom.
• The t is a family of bell-shaped and
symmetric distributions, one for each
number of degree of freedom.
• The expected value of t is 0. Standard normal
• The variance of t is greater than 1, but t, df=20
approaches 1 as the number of degrees of t, df=10
freedom increases.
• The t distribution approaches a standard
normal as the number of degrees of
freedom increases.
• When the sample size is small (<30) we use
t distribution.
Sampling Distributions
N n
x
n N 1
Sampling Distribution of x
Standard Deviation of x
Finite Population Infinite Population
N n
x ( ) x
n N 1 n
• A finite population is treated as being
infinite if n/N < .05.
• ( N n) / ( N 1) is the finite correction factor.
• x is referred to as the standard error of the
mean.
Sampling Distribution of the Sample Mean
• The amount of soda pop in each bottle is normally
distributed with a mean of 32.2 ounces and a
standard deviation of 0.3 ounces.
• Find the probability that a carton of four bottles will
have a mean of more than 32 ounces of soda per
bottle.
• Solution
– Define the random variable as the mean amount of soda per
bottle.
x 32 32.2
P( x 32) P( ) 0.9082
x .3 4
P( z 1.33) 0.9082
x = 32
x 32 = 32.2
x 32.2
Sampling Distribution of the
Sample Mean
• Example
– Dean’s claim: The average weekly income of
M.B.A graduates one year after graduation is
$600.
– Suppose the distribution of weekly income has a
standard deviation of $100. What is the
probability that 25 randomly selected graduates
have an average weekly income of less than
$550?
– Solution x 550 600
P( x 550) P( )
x 100 25
P( z 2.5) 0.0062
The Sampling Distribution of the Sample
Proportion, p
n= 2 , p = 0 .3
0 .4
P(X)
0 .2
0 .0
number of trials, n. 0 1 2
n=10,p=0.3
X 0.3
P(X)
0.1
0.0
0 1 2 3 4 5 6 7 8 9 10
X
0.2
P(X)
deviation p(1 p) 0.1
n 0.0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 X
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
15 15 15 15 15 15 15 15 15 15 15 15 15 15 15 15 ^p
Normal approximation to the
Binomial
– Normal approximation to the binomial works
best when
• the number of experiments (sample size) is
large, and
• the probability of success, p, is close to 0.5.
np 5; n(1 - p) 5
• Example
– A state representative received 52% of the
votes in the last election.
– One year later the representative wanted
to study his popularity.
– If his popularity has not changed, what is
the probability that more than half of a
sample of 300 voters would vote for him?
• Example
– Solution
• The number of respondents who prefer the
representative is binomial with n = 300 and p =
.52. Thus, np = 300(.52) = 156 and
n(1-p) = 300(1-.52) = 144 (both greater than 5)
pˆ p .50 .52
P( pˆ .50 ) P .7549
p (1 p) n (.52 )(1 .52 ) 300
SHAPE OF THE SAMPLING
DISTRIBUTION OF x
25
Sampling From a Normally
Distributed Population
If the population from which the samples
are drawn is normally distributed with mean
μ and standard deviation σ , then the
sampling distribution of the sample mean,
x ,will also be normally distributed with the
following mean and standard deviation,
irrespective of the sample size:
x and x
n 26
Figure 7.2 Population distribution and sampling
distributions of .
x
Normal distribution
27
Figure 7.2 Population distribution and sampling
distributions of .
x
Normal distribution
28
Figure 7.2 Population distribution and sampling
distributions of .
x
Normal distribution
29
Figure 7.2 Population distribution and sampling
distributions of .
x
Normal distribution
x 30
Figure 7.2 Population distribution and sampling
distributions of .
x
Normal distribution
31
x
Example 7-3
a)
x 1020
153
x 38.250
n 16
33
Figure 7.3
Sampling
x = 38.250 distribution of x for
n = 16
Population
distribution
σ = 153
b)
x 1020
153
x 21.637
n 50
35
Figure 7.4
Sampling
x = 21.637 distribution of x for
n = 50
Population
distribution
σ = 153
36
Solution 7-3
c)
x 1020
153
x 4.838
n 1000
37
Figure 7.5
Sampling
x = 4.838 distribution of x for
n = 1000
Population
σ = 153 distribution
38
x = μ = 1020 SAT scores
Sampling From a Population That
Is Not Normally Distributed
40
Figure 7.6 Population distribution and sampling
distributions of .
x
41
Figure 7.6 Population distribution and sampling
distributions of .
x
42
Figure 7.6 Population distribution and sampling
distributions of .
x
43
Figure 7.6 Population distribution and sampling
distributions of .
x
x
44
Example 7-4
45
Solution 7-4
x $1550
225
x $41.079
n 30
46
Figure 7.7
σ = $225
μ = $1550 x
47
Figure 7.7
x = $41.079
x = $1550 x 48
Solution 7-4
x $1550
225
x $22.500
n 100
49
Figure 7.8
σ = $225
μ = $1550 x
50
Figure 7.8
x = $22.500
x = $1550 x 51
APPLICATIONS OF THE
SAMPLING DISTRIBUTION OF x
52
Figure 7.9
P( 1 x x 1 x )
Shaded area
is .6826
.3413 .3413
1 x 1 x x
53
APPLICATIONS OF THE SAMPLING
DISTRIBUTION OF x cont.
54
Figure 7.10
P( 2 x x 2 x )
Shaded area
is .9544
.4772 .4772
2 x 2 x x
55
APPLICATIONS OF THE SAMPLING
DISTRIBUTION OF x cont.
56
Figure 7.11
P( 3 x x 3 x )
Shaded area
is .9974
.4987 .4987
3 x 3 x x
57
Example 7-5
x 32 ounces
.3
x .06708204 ounce
n 20
59
z Value for a Value of x
x
z
x
60
Solution 7-5
31.8 32
• For x = 31.8: z 2.98
.06708204
31.9 32
• Forx = 31.9: z 1.49
.06708204
Shaded are is
.0667
31.8 31.9 x = 32 x
-2.98 -1.49 0
z
62
Example 7-6
64
Solution 7-6
x $18,273
2100
x $300
n 49
65
Solution 7-6
a)
x 17,723 18,273
• For = $17,723: z
x 1.83
x 300
x 18,823 18,273
• For x = $18,823: z 1.83
x 300
• P (17,723 ≤ x≤ 18,823)
= P (-1.83 ≤ z ≤ 1.83) = .4664 + .4664
= .9328
66
Solution 7-6
a)
Therefore, the probability that the 2002-
2003 mean tuition and fees for this
sample of 49 four-year private U.S.
colleges was within $550 of the
population mean is .9328.
67
Figure 7.13
P($17,723 x $18,823)
Shaded area
is .9328
.4664 .4664
b)
• For x = $17,873:
x 17,873 18,273
z 1.33
x 300
• P ( x ≤ $17,873) = P (z ≤ -1.33)
= .5 - .4082
= .0918
69
Solution 7-6
b)
Therefore, the probability that the 2002-
2003 mean tuition and fees for this
sample of 49 four-year private U.S.
colleges was lower than the population
mean by $400 or more is .0918.
70
Figure 7.14
P( x $17,873)
The required
probability is
.0918
.4082
$17,873 x = $18,273 x
-1.33 0 z
71
POPULATION AND SAMPLE
PROPORTIONS
The population and sample proportions,
denoted by p and p̂ are
, respectively,
calculated as
X x
p and pˆ
N n
72
POPULATION AND SAMPLE
PROPORTIONS cont.
where
– N = total number of elements in the population
– n = total number of elements in the sample
– X = number of elements in the population that
possess a specific characteristic
– x = number of elements in the sample that
possess a specific characteristic
73
Example 7-7
74
Solution 7-7
X 563,282
p .71
N 789,654
x 158
pˆ .66
n 240
Sampling error pˆ p .66 .71 .05
75
MEAN, STANDARD DEVIATION, AND
SHAPE OF THE SAMPLING
p̂ DISTRIBUTION OF
• Sampling Distribution of p̂
• Mean and Standard Deviation of p̂
• Shape of the Sampling Distribution of p̂
76
Sampling Distribution of
p̂
Definition
The probability distribution of the sample
proportion, p̂ , is called its sampling
distribution. It gives various values that
p̂ can assume and their probabilities.
77
Example 7-8
78
Table 7.6 Information on the Five Employees of Boe
Consultant Associates
79
Example 7-8
80
Example 7-8
5! 5 4 3 2 1
Total number of samples 5 C3 10
3!(5 3)! 3 2 1 2 1
81
Table 7.7 All Possible Samples of Size 3 and the Value
of for Each Sample
p̂
Relative
p̂ f Frequency
.33 3 3/10 = .30
.67 6 6/10 = .60
1.00 1 1/10 = .10
Σf = 10 Sum = 1.00
83
Table 7.9 Sampling Distribution of When the
Sample Size is p̂3
p̂ P ( p̂ )
.33 .30
.67 .60
1.00 .10
ΣP ( p̂ ) = 1.00
84
Mean and Standard Deviation of
p̂
Mean of the Sample Proportion
The mean of the sample proportion, ,p̂is
denoted by and p̂
is equal to the population
proportion, p. Thus,
pˆ p
85
Mean and Standard Deviation of
p̂ cont.
Standard Deviation of the Sample Proportion
The standard deviation of the sample
proportion, p̂, is denoted by and
p̂ is given by
the formula pq
pˆ
n
90
Solution 7-9
91
Figure 7.15
Approximately
normal p̂ =.011
p̂ = .87 p̂
92