Sampling Distribution Web
Sampling Distribution Web
Sampling Distribution Web
Suppose that in a population consisting of 5 elements and one wishes to take a random sample of 2. There are 10 possible samples which might be selected. Population (N=5) 1 2 3 4 5 Since we know the entire population, we can compute the population parameters: = 3.0 =
( X ) = 2 = 1.41
2
[Note: N, not n-1. This is the formula for computing the population standard deviation, .]
p. 1
All possible samples of size n=2: Sampl e (1,2) (1,3) (1,4) (1,5) (2,3) (2,4) (2,5) (3,4) (3,5) (4,5) Mean ( X ) 1.5 2.0 2.5 3.0 2.5 3.0 3.5 3.5 4.0 4.5 30.0
The average of all the possible sample means E( X ) = 30.0 / 10 = 3.0. E( X ) =. This property is called unbiasedness. Also notice that the variability in the sample means is smaller than the variability in the original population. For this simple example it is best to use the (population) range as a measure of dispersion: 5-1 = 5 vs. 4.5 1.5 = 4. If the sample size were larger, the range of the sample means would be even smaller. Notice that if we graph the distribution of all possible sample means, it looks almost normal symmetric, mean=median=mode, etc.
1.5
2.0
2.5
3.5
4.0
4.5
p. 2
p. 3
x =
p. 4
The Sampling Distribution of the Mean Since this is a normal distribution, we can standardize it (transform to Z) just like any other normal distribution.
Z = X E( X )
Z=
X / n
Z =
X s/ n
p. 6
Example: A manufacturer claims that its bulbs have a mean life of 15,000 hours and a standard deviation of 1,000 hours. A quality control expert takes a random sample of 100 bulbs and finds a mean of X = 14,000 hours. Should the manufacturer revise the claim? Z=
14 ,000 15 ,000 1,000 100
1,000 100
= -10
Thats 10 standard deviations away from the mean! Yes, the claim should definitely be revised.
14,000
15,000
p. 7
Example: Suppose you have steel chains with an average breaking strength of =200 lbs with a =10 lbs, and you take a sample of n=100 chains. What is the probability that the sample mean breaking strength will be: (a) 195 lbs or less (b) 201 lbs or more
195
200
(a)
Z=
= -5
(b) Z =
=1
prob = .1587
p. 8
This is an important example that illustrates the difference between two situations: (1) Situation 1: We are examining individual observations when the population mean and population standard deviation are known. (2) Situation 2: We are examining a sample mean (or means) when the population mean and population standard deviation are known. Do not confuse the two. What you have to know is that the standard deviation of the sample mean is not the same as the standard deviation of an individual observation from the population. Sample means, as explained above, have a smaller standard deviation. In fact, as the sample size gets larger, the standard deviation of the sample mean (known as the standard error of the mean) gets smaller and smaller. With a very large sample, the distribution of the sample means has a narrow range around the population mean. Example: In a large automobile manufacturing company, the mean life of hybrid motors is normally distributed with a mean of 100,000 miles and a standard deviation of 10,000 miles. (a) What is the probability that a randomly selected hybrid motor has a life between 90,000 miles and 110,000 miles per year? (b) If a random sample of 100 motors is selected, what is the probability that the sample mean will be below 98,000 miles per year? Note that to solve the answer to (a), one simply converts to Z by using the following formula: Z = ( Xi ) / Z= Z=
90 ,000 100 ,000 10 ,000 110 ,000 100 ,000 10 ,000
= 1 = +1
Thus, we have to find how much area lies between 1 and +1 of the Z-distribution (answer = .6826) With (b), we are looking at the sampling distribution of the mean. Sample means follow a different distribution and to convert to Z, we use the following formula.
p. 9
Z=
X / n
Z =
= 2
Thus, the answer is .5 .4772 = .0228. Example: Carp raised in a fishery have an average weight () of 30 pounds and a population standard deviation of 3 pounds. (a) What proportion of carp from this fishery will weigh less than 26 pounds? (b) A random sample of 25 carp is taken at this fishery? What is the probability that the sample mean will be below 28.60 pounds?
Solutions, carp problem: (a) Z = 1.33 probability = .50 .4082 = .0918 (b) Z = 2.33 probability = .50 .4901 = .0099
Example: A candy manufacturer produces bags of jelly beans. The weight of a bag of jelly beans is normally distributed with a mean of 12 ounces and a standard deviation of 0.4 ounces. (a) What is the probability that a randomly selected bag weighs between 11.62 and 12.3 ounces? (b) 96% of the bags of jelly beans weight more than __ ounces ?
p. 10
If a random sample of 16 bags of jelly beans is selected ... (c) .... what is the probability that the sample mean will be between 11.62 and 12.3 ounces? (d) .... what is the probability that the sample mean will be above 12.2 ounces?
p. 11