Parametric Test
Parametric Test
Parametric Test
PARAMETRIC TEST
SUBMITTED BY SUPERVISED BY
R.SUGANTHI RAJESH DR.U.RAMYA
PHD SCHOLAR ASSOCIATE PROFESSOR
MAHER MAHER
CHENNAI CHENNAI
INTRODUCTION
• Statistics is a branch of science that deals with the collection, organization, analysis of
data and drawing of inferences from the samples to the whole population
• Statistical tests are intended to decide whether a hypothesis about distribution of one or
more populations or samples should be rejected or accepted.
• Inferential statistics is the statistics that permit inferences on whether the results observed
in a sample are likely to occur in the larger population
TERMINOLOGIES IN STATISTICS
• Data are measurements or observations that are collected as a source of information. There are a
variety of different types of data, and different ways to represent data.
• Population means is the mean or average of all values in the given population. Denoted by µ
• sample mean is the average of the values obtained in the sample. Denoted by x¯ .
• Standard deviation is the average distance from the mean value of all values in a set of data.
• Level of significance is the probability of concluding that a sequence which is non random when it
is in fact random.
•
• Variance defines a measure of the spread or dispersion within a set of data. There are two types: the
population variance, usually denoted by 2 and the sample variance is usually denoted by S2
• Degree of freedom is the number of bits or free or unconstrained data used in calculating a sample
statistic or test statistic. Denoted by df. Sample mean has n degree of freedom. Sample variance has n-1
degree of freedom
• Critical region also known as rejection rejoin is a set of values for which the null hypothesis is rejected
• Null hypothesis is a comprehensive statement or default status that there is zero happening or nothing
happening. Denoted by H0
• Alternate hypothesis is a statistically important relationship between two variables. Denoted by H1
• Sample error is the difference between a sample statistics used to estimate a population parameter
• Type 1 error is rejecting the true null hypothesis and denoted by α
• Type 2 error is accepting the false null hypothesis and denoted by β
• Power of a test is rejecting the false null hypothesis and denoted by 1-β
• Directional test is known as one tailed test tests because all of the error is one tail of the
distribution
• Non directional test are called two tailed tests which is used to determine if the difference
between groups is statistically significant in either direction.
PARAMETRIC TEST
• Parametric test is a class of statistical tests that involve assumptions about the distribution
of the variables and estimation of the parameter
• It is a statistical test that makes assumptions about the parameters of the population
distributions from which one’s data is drawn.
• Numerical data that are normally distributed are analyzed with parametric test
• Parametric tests are done on the basis of mean and standard deviation
ASSUMPTION
• Developed by Prof.W.S.Gossett
• Synonyms of t distribution
• It is applied to find the significant difference between two means.
• Used to test the null hypothesis that there is no difference between the means of the two
groups
Definition:
Student t test is the probability distribution that is used to estimate population parameters
when the sample size is small and the population variance is unknown
Indication for ‘t’test
When sample are small
Randomly selected homogenous sample
Measures on interval or ratio scale
Variability normally distribute
Population variance are not known
TYPES OF ‘T’ TEST
• Where Ẋ means sample mean, µ means population mean, S is standard deviation and n is
sample size
EXAMPLE
• A random sample of size 20 from a normal population gives a sample mean of 40, standard deviation of 6.
Test the hypothesis if population mean is 44 with one tail test
• H0 : There is no significant difference between sample mean and population mean
• H1: There is significant difference between sample mean and population mean
• Here x = 40, µ = 70. SD = 6 and n = 20
• t calculated value is 2.981
• t tabulated value is 2.093
• t calculated value t tabulated value
• Rejected null hypothesis
INDEPENDENT TWO SAMPLE ‘T’
TEST
• To test if the population means estimated by two independent samples differ significantly
• Two different samples with same mean at initial point and compare mean at the end
PAIRED ‘T’ TEST
• To test if the population means estimated by two dependent samples differ significantly
• A usual setting for paired t test is when measurements are made on the same subjects before
and after a treatment
Where d is the mean difference and S d denotes standard deviation of the difference.
EXAMPLE OF PAIRED T TEST
H0 : there is no significant difference between BP level before and after drug therapy
H1: there is significant difference between BP level before and after drug therapy
Z TEST
• Z test is a statistical test that is conducted on data that approximately follows a normal
distribution. The z test can be performed on one sample, two samples, or on proportions for
hypothesis testing. It checks if the means of two large samples are different or not when the
population variance is known.
• z test can further be classified into left-tailed, right-tailed, and two-tailed hypothesis tests
depending upon the parameters of the data.
DEFINITION;
A z test is conducted on a population that follows a normal distribution with independent data
points and has a sample size that is greater than or equal to 30.
TYPES OF Z TEST
• A one-sample z test is used to check if there is a difference between the sample mean and
the population mean when the population standard deviation is known. The formula for
the z test statistic is given as follows:
• The algorithm to set a one sample z test based on the z test statistic is given as follows:
• Left Tailed Test:
• Null Hypothesis: H0:0 : μ=μ0
• Alternate Hypothesis: H1 : μ<μ0
• Decision Criteria: If the z statistic < z critical value then reject the null hypothesis.
• Right Tailed Test:
• Null Hypothesis: H0 : μ=μ0
• Alternate Hypothesis: H1 : μ>μ0
• Decision Criteria: If the z statistic > z critical value then reject the null hypothesis.
• Two Tailed Test:
• Null Hypothesis: H0 : μ=μ0
• Alternate Hypothesis: H1 : μ≠μ0
• Decision Criteria: If the z statistic > z critical value then reject the null hypothesis.
• A two sample z test is used to check if there is a difference between the means of two
samples. The z test statistic formula is given as follows:
Z TEST FOR PROPORTION
• A z test for proportions is used to check the difference in proportions. A z test can either
be used for one proportion or two proportions. The formulas are given as follows.
• One Proportion Z Test
• A one proportion z test is used when there are two groups and compares the value of an
observed proportion to a theoretical one. The z test statistic for a one proportion z test is
given as follows:
• Two Proportion Z Test
• A two proportion z test is conducted on two proportions to check if they are the same or
not. The test statistic formula is given as follows:
Z TEST CALCULATION
• a teacher claims that his section's students will score higher than his colleague's section.
The mean score is 22.1 for 60 students belonging to his section with a standard deviation
of 4.8. For his colleague's section, the mean score is 18.8 for 40 students and the standard
deviation is 8.1. Test his claim at α = 0.05.
STEPS OF Z TEST CALCULATION
• Substitute values in this equation. x1¯ = 22.1, σ1 = 4.8, n1 = 60, x2¯ = 18.8, σ2=
8.1, n2 = 40 and μ1−μ2=0. Thus, z = 2.32
• Compare the critical value and test statistic to arrive at a conclusion. As 2.32 > 1.645
thus, the null hypothesis can be rejected. It can be concluded that there is enough
evidence to support the teacher's claim that the scores of students are better in his class.
DIFFERENCE BETWEEN Z TEST & T TEST
Z test T test
A z test is a statistical test that is used to check if A t-test is used to check if the means of two data
the means of two data sets are different when the sets are different when the population variance is
population variance is known. not known.
The sample size is greater than or equal to 30. The sample size is lesser than 30.
The data follows a normal distribution. The data follows a student-t distribution.
The one-sample z test statistic is given The t test statistic is given as t=x−μ/s√n where s
by z=x−μ/σ √n where is population standard is the sample standard deviation
deviation
ANALYSIS OF VARIANCE(ANOVA)
• Founder R.A.FISCHER
• The student’s t test cannot be used for comparison of three or more groups
• The purpose of ANOVA is to test if there is any significant difference between the means of two or more groups
• The analysis of variance is the systematic algebraic procedure of decomposing the overall variation in the
responses observed in an experiment into variation
• Two variances –between group variability and within group variability that is variation existing between the
samples and variations existing within the sample
• The within group variability[error variance] is the variation that cannot be accounted for in the study design
• The between group (effect variance) is the result of treatment
ASSUMPTION OF ANOVA
• It compares three or more unmatched groups when date are categorized in one way
• Total sum of square(TSS)=Treatment sum of square(TSS)+ Error sum of square(SSE)
Example
1. Compare control group with three different doses of aspirin in rats
2. Effect of supplementation of vit c in each subject before and after the treatment
TWO WAY ANOVA
• ANCOVA is a technique that combines the features of analysis of variance and regression
• It is used to increase the precision of treatment comparison
• This method is based on the fact that there are some extraneous sources of variation
which also contribute to the experimental error but are not controlled. These additional
variations are known as the ancillary or concomitant variates
DEFINITION OF ANCOVA
ANCOVA is defined as the very logical procedure, which reduces the experimental error by
eliminating from it the effects of variations in the concomitant variate and thus increase the
precision of the main variate on the concomitant variate is known as analysis of covariance
Example
Effect of 3 diet on gaining weight of animal with different age group and different initial
weight will influence animal performance and precision of experiment
ASSUMPTION
• Better power
• Improved ability to detect and estimate interactions
• The availability of extensions to deal with measurement error in the covariates
DISADVANTAGES