Parametric Test

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 49

SEMINAR ON

PARAMETRIC TEST
SUBMITTED BY SUPERVISED BY
R.SUGANTHI RAJESH DR.U.RAMYA
PHD SCHOLAR ASSOCIATE PROFESSOR

MAHER MAHER
CHENNAI CHENNAI
INTRODUCTION

• A population is the entire set of individuals or objects the researcher is studying. A


sample is a smaller group within the population that is studied to make inferences about
the larger population. Measures describing the characteristics of a sample are called
statistics. However, measures when they describe the characteristics of a population are
called parameters.
DEFINITION

• Statistics is a branch of science that deals with the collection, organization, analysis of
data and drawing of inferences from the samples to the whole population
• Statistical tests are intended to decide whether a hypothesis about distribution of one or
more populations or samples should be rejected or accepted.
• Inferential statistics is the statistics that permit inferences on whether the results observed
in a sample are likely to occur in the larger population
TERMINOLOGIES IN STATISTICS

• Data are measurements or observations that are collected as a source of information. There are a
variety of different types of data, and different ways to represent data.
• Population means is the mean or average of all values in the given population. Denoted by µ
• sample mean is the average of the values obtained in the sample. Denoted by x¯ .
• Standard deviation is the average distance from the mean value of all values in a set of data.
• Level of significance is the probability of concluding that a sequence which is non random when it
is in fact random.

• Variance defines a measure of the spread or dispersion within a set of data. There are two types: the
population variance, usually denoted by 2 and the sample variance is usually denoted by S2
• Degree of freedom is the number of bits or free or unconstrained data used in calculating a sample
statistic or test statistic. Denoted by df. Sample mean has n degree of freedom. Sample variance has n-1
degree of freedom
• Critical region also known as rejection rejoin is a set of values for which the null hypothesis is rejected
• Null hypothesis is a comprehensive statement or default status that there is zero happening or nothing
happening. Denoted by H0
• Alternate hypothesis is a statistically important relationship between two variables. Denoted by H1
• Sample error is the difference between a sample statistics used to estimate a population parameter
• Type 1 error is rejecting the true null hypothesis and denoted by α
• Type 2 error is accepting the false null hypothesis and denoted by β
• Power of a test is rejecting the false null hypothesis and denoted by 1-β
• Directional test is known as one tailed test tests because all of the error is one tail of the
distribution
• Non directional test are called two tailed tests which is used to determine if the difference
between groups is statistically significant in either direction.
PARAMETRIC TEST

• Parametric test is a class of statistical tests that involve assumptions about the distribution
of the variables and estimation of the parameter
• It is a statistical test that makes assumptions about the parameters of the population
distributions from which one’s data is drawn.
• Numerical data that are normally distributed are analyzed with parametric test
• Parametric tests are done on the basis of mean and standard deviation
ASSUMPTION

It requires 3 assumptions for using


• The sample was drawn from a population for which the variance can be calculated. It is
expected to be in normal distribution
• The levels of measurement should be at least interval data or ordinal data with an
approximately normal distribution
• The data can be treated as random samples
APPLICATION OF PARAMETRIC TEST

• Used for quantitative data


• Used for continuous variables
• Used when data are measure on approximate interval or ratio scales of measurement
• Data should follow normal distribution
PARAMETRIC STATISTICAL
ANALYSIS
• T test
• Z test
• F test , Analysis of variance(ANOVA)
• Analysis of covariance(ANCOVA)
STUDENT’S ‘T’ TEST

• Developed by Prof.W.S.Gossett
• Synonyms of t distribution
• It is applied to find the significant difference between two means.
• Used to test the null hypothesis that there is no difference between the means of the two
groups
Definition:
Student t test is the probability distribution that is used to estimate population parameters
when the sample size is small and the population variance is unknown
Indication for ‘t’test
 When sample are small
 Randomly selected homogenous sample
 Measures on interval or ratio scale
 Variability normally distribute
 Population variance are not known
TYPES OF ‘T’ TEST

• One sample t-test


• Independence two sample t test (unpaired t-test)
• The paired t-test
ONE-SAMPLE T-TEST

• To test if a sample mean differs significantly from a given population mean


• The mean of one sample is compared with population mean

• Where Ẋ means sample mean, µ means population mean, S is standard deviation and n is
sample size
EXAMPLE

• A random sample of size 20 from a normal population gives a sample mean of 40, standard deviation of 6.
Test the hypothesis if population mean is 44 with one tail test
• H0 : There is no significant difference between sample mean and population mean
• H1: There is significant difference between sample mean and population mean
• Here x = 40, µ = 70. SD = 6 and n = 20
• t calculated value is 2.981
• t tabulated value is 2.093
• t calculated value  t tabulated value
• Rejected null hypothesis
INDEPENDENT TWO SAMPLE ‘T’
TEST
• To test if the population means estimated by two independent samples differ significantly
• Two different samples with same mean at initial point and compare mean at the end
PAIRED ‘T’ TEST

• To test if the population means estimated by two dependent samples differ significantly
• A usual setting for paired t test is when measurements are made on the same subjects before
and after a treatment

Where d is the mean difference and S d denotes standard deviation of the difference.
EXAMPLE OF PAIRED T TEST

• Systolic BB of 5 Patients before and after drug therapy


• Before – 160, 150, 170,130,140
• After – 140,110,120,140,130
• Test whether there is any significant difference between BP level

H0 : there is no significant difference between BP level before and after drug therapy
H1: there is significant difference between BP level before and after drug therapy
Z TEST

• Z test is a statistical test that is conducted on data that approximately follows a normal
distribution. The z test can be performed on one sample, two samples, or on proportions for
hypothesis testing. It checks if the means of two large samples are different or not when the
population variance is known.
• z test can further be classified into left-tailed, right-tailed, and two-tailed hypothesis tests
depending upon the parameters of the data.
DEFINITION;
A z test is conducted on a population that follows a normal distribution with independent data
points and has a sample size that is greater than or equal to 30.
TYPES OF Z TEST

• ONE SAMPLE Z TEST


• TWO SAMPLE Z TEST
• ONE PROPORTION Z TEST
• TWO PROPORTION Z TEST
ONE SAMPLE Z TEST

• A one-sample z test is used to check if there is a difference between the sample mean and
the population mean when the population standard deviation is known. The formula for
the z test statistic is given as follows:
• The algorithm to set a one sample z test based on the z test statistic is given as follows:
• Left Tailed Test:
• Null Hypothesis: H0:0 : μ=μ0
• Alternate Hypothesis: H1 : μ<μ0
• Decision Criteria: If the z statistic < z critical value then reject the null hypothesis.
• Right Tailed Test:
• Null Hypothesis: H0 : μ=μ0
• Alternate Hypothesis: H1 : μ>μ0
• Decision Criteria: If the z statistic > z critical value then reject the null hypothesis.
• Two Tailed Test:
• Null Hypothesis: H0 : μ=μ0
• Alternate Hypothesis: H1 : μ≠μ0
• Decision Criteria: If the z statistic > z critical value then reject the null hypothesis.
• A two sample z test is used to check if there is a difference between the means of two
samples. The z test statistic formula is given as follows:
Z TEST FOR PROPORTION

• A z test for proportions is used to check the difference in proportions. A z test can either
be used for one proportion or two proportions. The formulas are given as follows.
• One Proportion Z Test
• A one proportion z test is used when there are two groups and compares the value of an
observed proportion to a theoretical one. The z test statistic for a one proportion z test is
given as follows:
• Two Proportion Z Test
• A two proportion z test is conducted on two proportions to check if they are the same or
not. The test statistic formula is given as follows:
Z TEST CALCULATION

• a teacher claims that his section's students will score higher than his colleague's section.
The mean score is 22.1 for 60 students belonging to his section with a standard deviation
of 4.8. For his colleague's section, the mean score is 18.8 for 40 students and the standard
deviation is 8.1. Test his claim at α = 0.05.
STEPS OF Z TEST CALCULATION

• The steps to calculate the z test statistic are as follows:


• Identify the type of test. In this example, the means of two populations have to be
compared in one direction thus, the test is a right-tailed two-sample z test.
• Set up the hypotheses. H0: μ1=μ2, H1: μ1>μ2
• Find the critical value at the given alpha level using the z table. The critical value is
1.645.
• Determine z value using this formula

• Substitute values in this equation. x1¯ = 22.1, σ1 = 4.8, n1 = 60, x2¯ = 18.8, σ2=
8.1, n2 = 40 and μ1−μ2=0. Thus, z = 2.32
• Compare the critical value and test statistic to arrive at a conclusion. As 2.32 > 1.645
thus, the null hypothesis can be rejected. It can be concluded that there is enough
evidence to support the teacher's claim that the scores of students are better in his class.
DIFFERENCE BETWEEN Z TEST & T TEST

Z test T test
A z test is a statistical test that is used to check if A t-test is used to check if the means of two data
the means of two data sets are different when the sets are different when the population variance is
population variance is known. not known.

The sample size is greater than or equal to 30. The sample size is lesser than 30.

The data follows a normal distribution. The data follows a student-t distribution.

The one-sample z test statistic is given The t test statistic is given as t=x−μ/s√n where s
by z=x−μ/σ √n where  is population standard is the sample standard deviation
deviation
ANALYSIS OF VARIANCE(ANOVA)

• Founder R.A.FISCHER
• The student’s t test cannot be used for comparison of three or more groups
• The purpose of ANOVA is to test if there is any significant difference between the means of two or more groups
• The analysis of variance is the systematic algebraic procedure of decomposing the overall variation in the
responses observed in an experiment into variation
• Two variances –between group variability and within group variability that is variation existing between the
samples and variations existing within the sample
• The within group variability[error variance] is the variation that cannot be accounted for in the study design
• The between group (effect variance) is the result of treatment
ASSUMPTION OF ANOVA

• The population in which samples are drawn should be normal


• The samples observation are independent of each other
• The samples are selected at random
• The samples are drawn from population having equal variance
• The sample size should not differ widely
• The various effects{treatment and errors) ar additive in nature
• The experimental error are normally and independently distributed with mean zero
ONE WAY ANOVA

• It compares three or more unmatched groups when date are categorized in one way
• Total sum of square(TSS)=Treatment sum of square(TSS)+ Error sum of square(SSE)
Example
1. Compare control group with three different doses of aspirin in rats
2. Effect of supplementation of vit c in each subject before and after the treatment
TWO WAY ANOVA

• It is used to determine the effect of two nominal predictor variables on a continuous


outcome variables
• A two type ANOVA test analyzes the effect of the independent variables on the expected
outcome along with their relationship to the outcome itself
Example
1. Effect of two antihypertensive drugs in two different doses
2. Comparing the employees productivity based on the working hours and working
conditions
DIFFERENCE BETWEEN ONE AND
TWO WAY ANOVA
• One way ANOVA is used to determine if there is a difference in the mean height of stalks
of three different types of seeds
• Since only one factor that could be making the height different
• If three different types of seeds, and then add the possibility that three types of fertilizer is
used
• The mean height of the stalks could be different for a combination of several reasons
• Two factors(type of seed and type of fertilizer), use a two way ANOVA
STEPS OF APPLICATION OF ANOVA
A researcher finds out the difference of mean job satisfaction score among four different groups of
employees {doctors, nurses, technicians and ministerial staffs) in a hospital
Satisfaction score of four different category of employees in a hospital(N=52)
DOCTORS(X1) NURSES(X2) TECHNICIANS(X3) MINISTRIAL
STAFFS(X4)
18 17 19 20
16 19 23 21
17 18 13 23
20 15 19 27
12 22 17 30
19 12 32 20
16 14 21 22
14 23 18 24
23 17 20 17
16 16 22 16
13 15 25
21 22 27
20 19
14 15
14 16

X1=253 X2=260 X3=256 X4=220


• Apply all the values in formula f=MST/MSE
• MST=265.25/3=88.42
• MSE=720.73/48=15.02
• Hence F= 88.42/15.02
• F=5.887
• Refer to the tabulated F value for horizontal (df=3) at the 0.05 level of significance and vertical
df(df=48) at the 0.05 level of significance.The calculated F value (5.887) is greater than tabulated f
value(2.52) at 0.05 level of significance(p=0.05). Therefore null hypothesis is rejected and it is inferred
that there is statistical difference in job satisfaction score among four different groups of employees.
ANALYSIS OF COVARIANCE(ANCOVA)

• ANCOVA is a technique that combines the features of analysis of variance and regression
• It is used to increase the precision of treatment comparison
• This method is based on the fact that there are some extraneous sources of variation
which also contribute to the experimental error but are not controlled. These additional
variations are known as the ancillary or concomitant variates
DEFINITION OF ANCOVA

ANCOVA is defined as the very logical procedure, which reduces the experimental error by
eliminating from it the effects of variations in the concomitant variate and thus increase the
precision of the main variate on the concomitant variate is known as analysis of covariance
Example
Effect of 3 diet on gaining weight of animal with different age group and different initial
weight will influence animal performance and precision of experiment
ASSUMPTION

All assumption of ANOVA is applicable here too. In addition, it is assumed that


• The relationship between x and y is linear
• The relationship is same for each treatment
• The covariates are not affected by treatment
• The observations are from normal populations
USES OF ANCOVA TEST

• To increase the precision in a randomized experiment


• To remove the effect of disturbing variables in observational studies
• To throw light on the nature of treatment effects
• To analyze the data when some observations are missing
• To fit regression in multiple classification
ADVANTAGES OF ANCOVA

• Better power
• Improved ability to detect and estimate interactions
• The availability of extensions to deal with measurement error in the covariates
DISADVANTAGES

• There will be cost of introducing the blocking factor


• It may be difficult to find blocking factors that are highly correlated with the dependent
variable
• Loss of power may occur if a poorly correlated blocking factor is used
SUMMARY OF PARAMETRIC TESTS
APPLIED FOR DIFFERENT TYPES OF DATA
TYPE OF GROUPS PARAMETRIC TEST
COMPARISON OF TWO PAIRED GROUPS PAIRED T TEST
COMPARISON OF UNPAIRED GROUPS UNPAIRED TWO SAMPLE T TEST
COMPARISION OF POPULATION AND ONE SAMPLE T TEST
SAMPLE DRAWN FROM THE POPULATION

COMPARISON OF 3 OR MORE MATCHED ONE WAY ANOVA


GROUPS BUT VARIED IN 1 FACTOR

COMPARISON OF 3 OR MORE MATCHED TWO WAY ANOVA


GROUPS BUT VARIED IN 2 FACTORS
THANK YOU

You might also like