BIOS O6S A4
OUTLINE
Estimation
Regression
Linear regression
Logistic regression
Survival analysis
Correlation
Proportions
AN OBSERVED DIFFERENCE IN A PARAMETER BETWEEN TWO GROUPS
E.g. treated vs. control, may be a result of:
Sampling variation or inherent differences between the two groups
Differences in the handling and end point evaluation (ascertainment) of the two groups during the course of the investigation
Chance
The true effects of the new procedure/drug.
HOW DO WE DECIDE IF THE EFFECT IS TRUE OR DUE
TO CHANCE ?
Statistical Inference
Two main techniques are used:
1. Significance testing & Test of Association (the p-value)
Null hypothesis vs. Alternate hypothesis
2. Confidence intervals
TEST OF SIGNIFICANCE & P-VALUE
Test of significance
Test of significance is the method to rule out chance as an
explanation of the observed difference.
Used more for numerical data
P-value
The probability that random sampling from the population would produce a sample statistic (e.g. the mean) as deviant as, or more deviant than, the statistic observed, if the null hypothesis were true.
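The p-value definition above can be sketched in a few lines of Python. This is a minimal illustration only, assuming the test statistic is a z-score that follows the standard normal distribution under the null hypothesis; the function name `p_value_from_z` is hypothetical and not part of the slides.

```python
from statistics import NormalDist


def p_value_from_z(z: float, two_sided: bool = True) -> float:
    """Probability, under the null, of a z at least as deviant as the one observed.

    Assumes z is standard normal when the null hypothesis is true.
    """
    # Area in the tail beyond |z| on one side of the distribution.
    one_tail = 1.0 - NormalDist().cdf(abs(z))
    # A two-sided test counts deviations in both directions.
    return 2.0 * one_tail if two_sided else one_tail


# z = 1.96 is the familiar two-sided cut-off for the 5% level.
print(round(p_value_from_z(1.96), 3))
```

A small p-value means the observed statistic would rarely arise from sampling variation alone, which is the sense in which chance is "ruled out".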
CHARACTERISTICS OF STANDARD DEVIATIONS & VARIANCES
Variance of 2 variables/proportions
Discrete or continuous variables: computed as the addition of the two variances of the variables of interest.
Standard deviation formula when combined
Proportion: $sd = \sqrt{\frac{pq}{n}}$, where (1-p) = q
2 discrete or continuous variables together: $sd = \sqrt{s_1^2 + s_2^2}$
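As a rough numerical illustration of these two rules, the sketch below computes the standard deviation of a proportion and of two combined variables. The proportion formula sqrt(pq/n) is assumed here as the standard one (the slide only preserves "(1-p)=q"), and the function names are hypothetical.

```python
from math import sqrt


def sd_proportion(p: float, n: int) -> float:
    """Standard deviation (standard error) of a proportion p from n subjects.

    Assumed formula: sqrt(p * q / n), with q = 1 - p.
    """
    q = 1.0 - p
    return sqrt(p * q / n)


def sd_combined(s1: float, s2: float) -> float:
    """Standard deviation of two variables together: variances add, then root."""
    return sqrt(s1 ** 2 + s2 ** 2)


# Illustrative values only: a 40% proportion in 60 subjects.
print(round(sd_proportion(0.40, 60), 4))
# Two standard deviations of 3 and 4 combine to 5, not 7.
print(sd_combined(3.0, 4.0))
```

The second call shows why variances, not standard deviations, are what get added.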
Type II error
This is committed if we accept (fail to reject) the null hypothesis when it is false and the alternate hypothesis is true.
The probability of type II error is denoted by the symbol β.
It is not possible to control both errors at the same time in real life.
Typically one (β) is compromised for the other (α, the probability of a type I error).
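The trade-off between α and β can be made concrete with a small sketch. It assumes a one-sided z-test and a hypothetical true difference and standard error (0.2 and 0.1, chosen purely for illustration); none of these numbers come from the slides.

```python
from statistics import NormalDist


def type_ii_error(alpha: float, effect: float, se: float) -> float:
    """Beta for a one-sided z-test when the true difference equals `effect`.

    We reject the null when z exceeds the critical value for `alpha`;
    beta is the chance of falling short of it even though the effect is real.
    """
    z_crit = NormalDist().inv_cdf(1.0 - alpha)
    return NormalDist().cdf(z_crit - effect / se)


# Hypothetical effect and standard error, for illustration only.
for alpha in (0.10, 0.05, 0.01):
    beta = type_ii_error(alpha, effect=0.2, se=0.1)
    print(f"alpha={alpha:.2f}  beta={beta:.3f}")
```

With the sample size and effect held fixed, tightening α pushes β up, which is the compromise described above.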
Common Test Statistics               Situation                                   Example
Student's t-test                     For group sample data                       Mean haemoglobin level in mothers who attended antenatal clinic versus non-attendants
Student's t-test                     For paired data                             Change in haemoglobin levels after iron supplementation
Z-score test                         For proportions                             Proportion of children with parasitaemia in urban population and rural population
Chi-square test                      Categorical variables                       Association of antenatal attendance and outcome of labour
F-test for analysis of variance      Comparison of means in more than            Mean haemoglobin levels in urban, peri-urban and rural mothers
(ANOVA)                              two groups
Correlation coefficient              Strength of association                     Blood sugar levels and systolic blood pressure level
These are the values of t for which a given percentage, P, of the t-distribution lies outside the range -t to +t.
As the number of degrees of freedom increases, the distribution becomes closer to the standard normal
distribution.
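[Figure: t-distribution centred on 0, with an area of P/2 in each tail beyond -t and +t. The table is tabulated for P (%) = 50, 20, 10, 5, 2, 1, 0.2, 0.1.]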
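For the difference between two means:
$$t = \frac{\text{difference between the means}}{\text{standard error of the difference between the means}}$$
For a single sample mean, or the mean of paired differences:
$$t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}$$
the df is (n-1)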
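For the paired-data situation (change in haemoglobin after iron supplementation), the t statistic with n-1 degrees of freedom might be computed as in the sketch below. The haemoglobin values are invented for illustration and the function name `paired_t` is hypothetical; the formula assumed is the usual mean difference divided by its standard error.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(before: list[float], after: list[float]) -> tuple[float, int]:
    """Return (t, df) for paired measurements on the same subjects.

    t = mean difference / (sd of differences / sqrt(n)), with df = n - 1.
    """
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / sqrt(n))
    return t, n - 1


# Hypothetical haemoglobin levels (g/dl) for five mothers, before and after.
before = [9.8, 10.1, 9.5, 10.4, 9.9]
after = [10.6, 10.9, 10.1, 11.0, 10.8]

t, df = paired_t(before, after)
print(f"t = {t:.2f} on {df} degrees of freedom")
```

The resulting t is then compared with the tabulated value for the chosen P and the same degrees of freedom.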
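Hypothesis
Ho: (p1-p2) = 0 and Ha: (p1-p2) ≠ 0
Test statistic
$$z = \frac{p_1 - p_2}{\sqrt{\frac{p_1 q_1}{n_1} + \frac{p_2 q_2}{n_2}}}$$
where q = (1-p) and n1, n2 are the two group sizes.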
EXAMPLE 1
An attack rate of para-influenza virus type II in nurses is known to be 40%. In a sample of 60 nurses an attack rate of 56% was detected. Is this attack rate out of the ordinary?
p1 = 0.56, p2 = 0.40
Hypothesis
Ho: (p1-p2) = 0 and Ha: (p1-p2) ≠ 0 (2-tail test, i.e. 0.025 in each tail)
Test statistic
z = 1.985
EXAMPLE 2
Survival rates in the 67 patients of the treated group and 24 of the control group were 84.4% and 63% respectively. Is the survival rate in the treated group better than in the control group? Use α = .05
Hypothesis
Ho: (p1-p2) = 0 and Ha: (p1-p2) > 0 (1-tail test, i.e. p < 0.05)
Test statistic
z = 1.981
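The arithmetic of Example 2 can be checked with a short sketch. It assumes the standard error is the unpooled form sqrt(p1q1/n1 + p2q2/n2), which is an assumption on our part because the intermediate steps are not preserved; that form gives a z of about 1.98, close to the value quoted. The function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z(p1: float, n1: int, p2: float, n2: int) -> float:
    """z statistic for the difference of two proportions (unpooled SE, assumed)."""
    se = sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    return (p1 - p2) / se


# Example 2: treated group (n = 67, 84.4%) vs. control group (n = 24, 63%).
z = two_proportion_z(0.844, 67, 0.63, 24)

# One-tailed p-value, since Ha is (p1 - p2) > 0.
p_one_sided = 1.0 - NormalDist().cdf(z)

print(f"z = {z:.2f}, one-sided p = {p_one_sided:.3f}")
print("reject Ho at alpha = 0.05" if p_one_sided < 0.05 else "do not reject Ho")
```

Because z exceeds the one-tailed 5% critical value of 1.645, the null hypothesis of no difference is rejected at α = .05.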