Research Unit 4
Research Unit 4
Research Unit 4
Field research is used extensively to study flora and fauna. A major use case is scientists
monitoring and studying animal migration patterns with the change of seasons. Field
research helps collect data across years and that helps draw conclusions about how to
safely expedite the safe passage of animals.
● The studies are expensive and time-consuming and can take years to complete.
● It is very difficult for the researcher to distance themselves from a bias in the research
study.
● It is an interpretive method and this is subjective and entirely dependent on the ability
of the researcher.
● In this method, it is impossible to control external variables and this constantly alters
the nature of the research.
● A series of actions or steps performed on
data to verify, organize, transform,
integrate, and extract data in an appropriate
output form for subsequent use.
● Methods of processing must be rigorously
documented to ensure the utility and
integrity of the data.
1. Editing of data
2. Coding of data
3. Classification of data
4. Tabulation of data
5. Data diagrams
classification and tabulation
1. Geographical characteristics
2. Chronological characteristics
4. Quantitative characteristics
2)Inferential Analysis
Descriptive analysis
● Also known as Quantitative analysis.
● It is used for elaborating the data which is under the sampling observation either
graphically or numerically.
● Based on the number of variables data analysis can be defined as:
1)Univariate Analysis
2)Bi-Variate Analysis
3)Multivariate Analysis
Inferential Analysis
● Also known as qualitative analysis.
● Used by the researchers when they have acquired the data from the sample
through a random procedure(using probability method) and with a high
response rate .
● Examples for Qualitative data : interview notes, transcripts of focus groups,
answer to open ended questions, transcriptions of video recordings, news
articles, etc.
2) Hypothesis Testing.
Phases of Data Analysis
1) Preliminary data analysis
● Applied before hypothesis testing.
● Clarifies how well the coding, inputting, scaling are done.
● Outcome influences the result and conclusion.
1) Hypothesis testing
● Finds the validity of the assumption with a view to choose between two
opposite hypothesis about the population parameter.
Data interpretation
●Process of reviewing data through some predefined processes.
●Assigns a meaning to the information analyzed and
determines its signification and implications.
●To help people make sense of numerical data that has been
collected, analyzed and presented.
● Regression analysis
● Cohort analysis
● Predictive and prescriptive analysis
Qualitative data Interpretation
● Categorical
● data is described through the use of descriptive context.
● Observations
● Documents
● Interviews
TESTING OF HYPOTHESIS
Hypothesis testing is an act in statistics
whereby an analyst tests an assumption
regarding a population parameter. The
methodology employed by the analyst depends
on the nature of the data used and the reason
for the analysis.
STEPS IN HYPOTHESIS
● State the hypothesis
● Formulate analysis plan
● Carry out the plan
● Analysis of the result
Parametric Test
● The statistical tests based on the assumption that population or population parameter
is normally distributed are called parametric tests.
● This data in this test is derived from interval and ratio measurement in
parametric tests.
● The important parametric tests are :
Z-test
T-test
F-test
ANOVA
Z-test
● It is a parametric test of hypothesis testing.
● It is used to determine whether the means are different when the population
variance is known and the sample size is large.
T-test
● Non-parametric tests are experiments that do not require the underlying population
for assumptions.
● It does not rely on any data referring to any particular parametric group
● The non-parametric test is used when there are skewed data.
● The most common non-parametric tests are :
Chi-Square Test
Mann-Whitney U Test
● A chi-square test is a statistical test used to compare observed results with expected
results.
● The Chi-Square test is a statistical procedure used by researchers to examine the
differences between categorical variables in the same population.
● The Chi-Square test is most useful when analyzing cross tabulations of survey
response data.
Mann-Whitney U Test
● Kruskal Wallis test is used to compare the continuous outcome in greater than two
independent samples.
● The Kruskal-Wallis Test is a nonparametric alternative to the one-way ANOVA.
● The Kruskal-Wallis test is used to compare more than two independent groups
with ordinal data.
Essential ideas of multivariate analysis of Data
● Multivariate means involving multiple dependent variables resulting in one outcome.
● This explains that the majority of the problems in the real world are Multivariate.
● For example, we cannot predict the weather of any year based on the season. There are multiple
factors like pollution, humidity, precipitation, etc.
● Here, we will introduce you to multivariate analysis, its history, and its application in different fields.
● Multivariate analysis (MVA) is a Statistical procedure for analysis of data involving more than one
type of measurement or observation. It may also mean solving problems where more than one
dependent variable is analyzed simultaneously with other variables.
The Objective of multivariate analysis
Advantages
● The main advantage of multivariate analysis is that since it considers more than one factor of independent
variables that influence the variability of dependent variables, the conclusion drawn is more accurate.
● The conclusions are more realistic and nearer to the real-life situation.
Disadvantages
● The main disadvantage of MVA includes that it requires rather complex computations to arrive at a
satisfactory conclusion.
● Many observations for a large number of variables need to be collected and tabulated; it is a rather time-
consuming process.
Popcorn time!
Scientific experiment
Comedy Horror
Your friend wins!!! 🥳 (Unless, you're a stress eater!) 😅
Variable
● Something that can either be changed or measured in an experiment.
● A variable is any entity that can take on different values.
● Variables aren’t always ‘quantitative’ or numerical.
● In an experiment, the researcher is looking for the possible effect on the dependent
variable that might be caused by changing the independent variable.
● The dependent variable is the variable a researcher is interested in. The changes to
the dependent variable are what the researcher is trying to measure with all their
fancy techniques.
Popcorn variables
● In the example,
What is it? - Dependent variable is what is being measured or what changes with the change in
independent variable.
Reflects - Response
Denoted by - Y-axis
Interdependence Method
1.Factor Analysis
2.Cluster Analysis
3.Multidimensional Scaling
4.Correspondence Analysis
Difference Between Dependence &
Interdependence Method
About SPSS
● SPSS stands for Statistical Package for the Social Sciences.
● The company announced on July 28, 2009 that it was being acquired by
IBM.
Introduction to
SPSS is a powerful and flexible system for statistical and information analysis.
With SPSS we can analyze data in three basic ways:
❖ Statistics Program
❖ Modeler Program
❖ Text Analytics for Surveys Program
❖ Visualization Designer
THANK YOU