Epi X Sampling
Sampling provides
•Availability of elements
•Lower cost
•Greater speed
•Greater accuracy
Population vs. Sample
[Figure labels: Population of Interest; Population; Sample]
•A population is the set of all the elements of interest.
•A sample is a subset of the population.
•The sample results provide only estimates of the values of the
population characteristics.
•With proper sampling methods, the sample results can provide "good"
estimates of the population characteristics.
Sampling Frame and Sampling Plan
•Sampling frame: A listing of every member of the population,
using the sampling criteria to define membership in the
population
•Subjects are selected from the sampling frame
•Sampling plan: Outlines strategies used to obtain a sample for
a study
◦ Probability sampling plans
◦ Nonprobability sampling plans
Types of Sampling Methods
•Probability samples
◦ Simple random
◦ Stratified
◦ Systematic
◦ Cluster
•Non-probability samples
◦ Quota
◦ Judgemental
◦ Convenience
Probability Sampling Method
In probability sampling:
•The random ("equal chance") and "independent" components
of random sampling are what make us confident that the
sample has a reasonable chance of representing the population.
•A sampling frame exists or can be compiled.
•Every element of the population should have an equal, or at
least a known and nonzero, chance of being included in the sample.
•Generalization is possible (from sample to population).
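The equal-chance idea above can be illustrated with a minimal Python sketch. The sampling frame of 500 ID numbers, the sample size of 25, and the helper name simple_random_sample are all assumptions made for illustration; they do not come from the slides.

```python
import random

def simple_random_sample(frame, n, seed=None):
    """Draw n distinct units from the sampling frame, each with an equal chance."""
    rng = random.Random(seed)      # seeded only so the sketch is repeatable
    return rng.sample(frame, n)    # sampling without replacement

# Hypothetical sampling frame: subject ID numbers 1..500
frame = list(range(1, 501))
sample = simple_random_sample(frame, n=25, seed=42)

# Every unit in the frame has the same known, nonzero chance of selection.
inclusion_probability = 25 / len(frame)
```

Because the selection depends only on the random draw, no unit in the frame is favoured, which is what supports generalization from the sample to the population.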
Types of Probability Sampling
Simple random sampling
Cluster sampling
Systematic sampling
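As a rough sketch of how two of these designs differ in practice, the Python below draws a systematic sample (random start, then every k-th unit) and a cluster sample (whole groups chosen at random). The function names, the frame of 100 IDs, and the ward names are hypothetical and used only for illustration.

```python
import random

def systematic_sample(frame, n, seed=None):
    """Pick a random start in the first interval, then take every k-th unit."""
    k = len(frame) // n                 # sampling interval
    rng = random.Random(seed)
    start = rng.randrange(k)            # random start between 0 and k - 1
    return [frame[start + i * k] for i in range(n)]

def cluster_sample(clusters, n_clusters, seed=None):
    """Randomly select whole clusters and keep every unit inside them."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(clusters), n_clusters)
    return [unit for name in chosen for unit in clusters[name]]

# Hypothetical frame of 100 subject IDs and three hypothetical wards
frame = list(range(1, 101))
every_tenth = systematic_sample(frame, n=10, seed=1)

wards = {"ward_A": [1, 2, 3], "ward_B": [4, 5], "ward_C": [6, 7, 8, 9]}
two_wards = cluster_sample(wards, n_clusters=2, seed=1)
```

In the systematic design the randomness lies only in the starting point, while in the cluster design the randomness lies in which groups are chosen, not in which individuals.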
Simple Random Sampling
Errors
•Sampling or random
•Non-sampling or systematic
What is a sampling error?
•Difference between the population mean and the mean
of the sample
•Random variation is:
◦ The expected difference in values that occurs when
different subjects from the same sample are examined
◦ Difference is random because some values will
be higher and others lower than the average
population values
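A small simulation can make the definition above concrete. This is only a sketch: the population of 10,000 values (mean 120, SD 15), the sample size of 50, and the function name sampling_error are assumptions chosen for illustration.

```python
import random
import statistics

rng = random.Random(0)

# Hypothetical population of 10,000 measurements
population = [rng.gauss(120, 15) for _ in range(10_000)]
population_mean = statistics.fmean(population)

def sampling_error(sample_size):
    """Sample mean minus population mean for one simple random sample."""
    sample = rng.sample(population, sample_size)
    return statistics.fmean(sample) - population_mean

# Repeat the draw many times to see the random variation
errors = [sampling_error(50) for _ in range(1000)]
average_error = statistics.fmean(errors)
```

Some of the errors come out positive and others negative, and under these assumptions they tend to cancel out on average, which is what distinguishes random sampling error from systematic bias.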
Non-Sampling Error or Bias
•Error in the design or conduct of a sampling procedure that results in
distortion of the sample, so that it is no longer representative of the reference
population
•A non-response error occurs when units selected as part of the sampling
procedure do not respond in whole or in part (non-response rate >15%)
•A response or data error is any systematic bias that occurs during
data collection, analysis or interpretation
◦ Respondent error (e.g., lying, forgetting)
◦ Interviewer bias
◦ Recording errors
◦ Poorly designed questionnaires
What is attrition and retention in reference to research samples?
•Sample attrition is the withdrawal or loss of
subjects from a study
◦Attrition rate = number of subjects withdrawing
÷ number of study subjects × 100
•Sample retention is the number of subjects who
remain in and complete a study
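The attrition formula above can be written as a short Python sketch. The function names and the example figures (200 subjects enrolled, 30 withdrawing) are hypothetical.

```python
def attrition_rate(n_withdrawn, n_subjects):
    """Attrition rate (%) = subjects withdrawing / study subjects x 100."""
    if n_subjects <= 0:
        raise ValueError("n_subjects must be positive")
    return 100 * n_withdrawn / n_subjects

def retained(n_withdrawn, n_subjects):
    """Sample retention: subjects who remain in and complete the study."""
    return n_subjects - n_withdrawn

# Hypothetical study: 200 subjects enrolled, 30 withdraw
rate = attrition_rate(30, 200)    # 15.0 (percent)
remaining = retained(30, 200)     # 170 subjects
```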
Random Samples
•Study population (N = 20,000) consisting of five subgroups based on
their preferred lunch times, each a 30-minute interval.
•Four samples of 10 subjects. Each mean is a point estimate, the best
predictor of the unknown population mean.
•None of the samples shown is a perfect duplication because no sample
perfectly replicates its population.
•We cannot judge which estimate is the true mean of the population, but
we can estimate the interval in which the true mean will fall by using
any of the samples.
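The example can be imitated with a short Python sketch. The slide does not give the actual lunch intervals or subgroup sizes, so the start times below (coded as decimal hours) and the equal subgroup sizes of 4,000 are assumptions made purely for illustration.

```python
import random
import statistics

rng = random.Random(7)

# Assumed: five 30-minute lunch intervals, coded by start hour,
# with 4,000 people preferring each one (N = 20,000 in total).
interval_starts = [11.0, 11.5, 12.0, 12.5, 13.0]
population = [start for start in interval_starts for _ in range(4000)]
population_mean = statistics.fmean(population)

# Four simple random samples of 10 subjects; each mean is a point estimate.
sample_means = [
    statistics.fmean(rng.sample(population, 10)) for _ in range(4)
]
```

Under these assumptions the four sample means would typically differ from one another and from the population mean, which is the point the example makes about point estimates.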
Standard Error of the Mean (Standard Error)
•The SD describes variability in a data set, whereas the SE refers to the
variability in the means of repeated samples from the same
population.
•The SE quantifies the variation in the sampled means (SE = SD/√n,
where n is the sample size)
•Assumption
◦ An infinite number of possible samples can be drawn from the source
population
•Utility
◦ Helps in calculating confidence intervals
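A minimal Python sketch of the standard error calculation, SE = SD/√n, follows. The eight data values and the function name standard_error are hypothetical, and the sample SD (n − 1 denominator) is assumed.

```python
import math
import statistics

def standard_error(values):
    """Standard error of the mean: sample SD divided by the square root of n."""
    return statistics.stdev(values) / math.sqrt(len(values))

# Hypothetical sample of eight measurements
values = [2, 4, 4, 4, 5, 5, 7, 9]
sd = statistics.stdev(values)    # variability within this data set
se = standard_error(values)      # expected variability of the sample mean
```

The SE is smaller than the SD whenever n is greater than 1, and it shrinks as the sample size grows.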
Confidence Intervals
•Assumption (Central Limit Theorem): averaged over repeated samples,
the sample estimates of an attribute center on the true value of the
population.
•Values obtained from the samples are distributed normally
about the true value; some will be higher and some lower
•In a normal distribution, approximately 95% of the means
would be expected to fall within the range of two standard
errors above and two below the mean of these means.
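The 95% statement can be checked with a rough simulation. Everything specific here is an assumption for illustration: a uniform (non-normal) population of 20,000 values, 2,000 repeated samples, and 30 subjects per sample.

```python
import random
import statistics

rng = random.Random(2024)

# Hypothetical, deliberately non-normal population of 20,000 values
population = [rng.uniform(0, 100) for _ in range(20_000)]

# Means of 2,000 repeated random samples of 30 subjects each
sample_means = [
    statistics.fmean(rng.sample(population, 30)) for _ in range(2_000)
]

mean_of_means = statistics.fmean(sample_means)
standard_error = statistics.stdev(sample_means)   # spread of the sample means

# Share of sample means within two standard errors of the mean of means
lower = mean_of_means - 2 * standard_error
upper = mean_of_means + 2 * standard_error
share_within = sum(lower <= m <= upper for m in sample_means) / len(sample_means)
```

Under these assumptions share_within should come out close to 0.95, and mean_of_means should sit close to the population mean. For a single sample, the same reasoning gives an approximate 95% confidence interval of the sample mean plus or minus two standard errors (1.96 is the more exact multiplier for large samples).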