G3 Correlation Analysis
G3 Correlation Analysis
G3 Correlation Analysis
Bi c o l Bn i v e r s i t y G u i n o b a t a n
A g r i c u l t u r a l a n d Bi o sy s t e m s En g i n e e r i n g D e p a r t m e n t
CORRELATION
ANALYSIS
MATH 4: Engineering Data Analysis
Presentation by
GROUP 3 | III-ABE2a
Page 02 of 61
OVERVIEW
Introduction Lesson 3: Differences of the
Lesson 1: Correlation Analysis Correlation Coefficients
Lesson 2: Correlation Coefficient Lesson 4: Uses and Advantages
Spearman's Rank Correlation Coefficient References
Kendall Rank Correlation Thank You
Pearson Product-Moment Coefficient
CORRELATION ANALYSIS
CORRELATION ANALYSIS
Introduction
Correlation analysis is a statistical method used to evaluate the
strength and direction of the linear relationship between two
quantitative variables. In other words, it helps to assess
whether and how changes in one variable are associated with
changes in another variable. This analysis is crucial in various
fields, including economics, biology, psychology, finance, and
many others.
SON
ES
CORRELATION
L
1
ANALYSIS
CORRELATION ANALYSIS
MATH 4: ENGINEERING DATA ANALYSIS
WHAT IS CORRELATION
ANALYSIS?
Correlation analysis in research is a statistical method used to
measure the strength of the linear relationship between two
variables and compute their association. Simply put - correlation
analysis calculates the level of change in one variable due to
the change in the other.
Page 05 of 61
Page 06 of 61
SON
ES
CORRELATION
L
2
COEFFICIENT
CORRELATION ANALYSIS
LESSON 2
CORRELATION
COEFFICIENT
Unit of measurement used to calculate the intensity in the linear
relationship between the variables involved in a correlation
analysis, this is easily identifiable since it is represented with the
symbol r and is usually a value without units which is located
between 1 and -1.
Page 07 of 61
Page 08 of 61
There are usually three different ways of ranking statistical correlation according to:
Each coefficient will represent the end result as ‘r’. Spearman’s Rank and Pearson’s
Coefficient are the two most widely used analytical formulae depending on the
types of data researchers.
LESSON 2
Page 09 of 61
Descriptive Statistic
it summarizes the characteristics of a dataset
Inferential Statistic
we can test whether there is a significant
relationship between two variables.
PEARSON PRODUCT-MOMENT COEFFICIENT
VISUALIZING THE
Pearson correlation coefficient
Given:
r = 0.47
n = 10
SOLUTION:
4. Rejection Region
If the t value is greater than the critical value, then
the relationship is statistically significant (p < α).
If the t value is less than the critical value, then the
relationship is not statistically significant (p > α).
Note that a sample size of 10 is very small. It’s possible that you would find a significant relationship if
you increased the sample size.
Spearman's Rank
Correlation Coefficient
The Spearman’s rank coefficient of correlation or Spearman correlation coefficient is
a nonparametric measure of rank correlation (statistical dependence of ranking
between two variables).
Named after Charles Spearman, it is often denoted by the Greek letter ‘ρ’ (rho) and
is primarily used for data analysis.
It measures the strength and direction of the association between two ranked
variables.
Here,
n= number of data points of the two variables
d= difference in ranks of the “th” element
LESSON 2
Spearman's Rank
Correlation Coefficient
The Spearman Coefficient, ⍴, can take a value between +1 to -1 where,
A ⍴ value of +1 means a perfect association of rank
A ⍴ value of 0 means no association of ranks
A ⍴ value of -1 means a perfect negative association between ranks.
Closer the ⍴ value to 0, the weaker is the association between the two ranks.
LESSON 2
Spearman's Rank
Correlation Coefficient
Sample Problem
We must be able to rank the data before proceeding with the Spearman’s Rank
Coefficient of Correlation. It is important to observe if increasing one variable, the
other variable follows a monotonic relation.
At every level, you will need to compare the values of the two variables. Here is how
the calculations work:
The scores of 9 students in History and Geography are mentioned in the table
below.
LESSON 2
Spearman's Rank
Correlation Coefficient
Sample Problem
Step 1 - Create a table of the data obtained.
Step 2- Start by ranking the two data sets. Data ranking can be achieved by
assigning the ranking “1” to the biggest number in the column, “2” to the second
biggest number and so forth. The smallest value will usually get the lowest ranking.
This should be done for both sets of measurements.
Step 3- Add a third column d to your data set, d here denotes the difference
between ranks. For example, if the first student’s physics rank is 3 and the math rank
is 5 then the difference in the rank is 3. In the fourth column, square your d values.
LESSON 2
Spearman's Rank
Correlation Coefficient
Sample Problem
Step 1 - Create a table of the data obtained.
Step 2- Start by ranking the two data sets. Data ranking can be achieved by
assigning the ranking “1” to the biggest number in the column, “2” to the second
biggest number and so forth. The smallest value will usually get the lowest ranking.
This should be done for both sets of measurements.
Step 3- Add a third column d to your data set, d here denotes the difference
between ranks. For example, if the first student’s physics rank is 3 and the math rank
is 5 then the difference in the rank is 2. In the fourth column, square your d values.
LESSON 2
Spearman's Rank
Correlation Coefficient
Sample Problem
LESSON 2
Spearman's Rank
Correlation Coefficient
Sample Problem
Step 4- Add up all your d square values, which is 12 (∑d square)
Step 5- Insert these values in the formula
LESSON 2
Spearman's Rank
Correlation Coefficient
Sample Problem
=1 - [(6*12)/(9(81-1))]
=1 - (72/720)
=1-01
=0.9
The Spearman’s Rank Correlation for this data is 0.9 and as mentioned above if the
⍴ value is nearing +1 then they have a perfect association of rank.
LESSON 2
Page 36 of 61
Kendall Rank
Correlation
Also commonly known as “Kendall’s tau coefficient”.
Continuous or ordinal
Monotonicity
KENDALL RANK CORRELATION
Continuous or Ordinal
Continuous means that the variable
can take on any reasonable value.
Where C is the number of concordant pairs and D is the number of discordant pairs.
-
++
- + -
++++
++++-
Example Kendall's Tau
Suppose two doctors rank 6 patients by descending physical health. One of the two
doctors, in this case the female, is now defined as the reference and the patients are
sorted from 1 to 6.
C is 11 and D is 4
Alternate formula:
Kendall's Tau
Significance
In the case of Kendall's Tau, the null and alternative hypotheses result in:
SON
ES
DIFFERENCES OF
L
3 THE CORRELATION
COEFFICIENTS
CORRELATION ANALYSIS
Pearson Correlation Spearman's Rank Correlation
Kendall's Tau Significance
Coefficient Coefficient
Appropriate for
continuous, numerical non-parametric measure suitable for ordinal, interval, or
data that is approximately ratio data. It does not assume a specific distribution.
normally distributed.
It assumes a linear
does not assume a linear relationship and is based on
relationship between the
the ranks of the data.
variables.
LESSON 3
Page 53 of 61
Pearson Correlation Spearman's Rank Correlation
Kendall's Tau Significance
Coefficient Coefficient
LESSON 3
Page 54 of 61
Page 55 of 61
SON
ES
USES AND
L
4
ADVANTAGES
CORRELATION ANALYSIS
USES OF CORRELATION
ANALYSIS
Correlation analysis is used to study practical cases. Here, the researcher
can't manipulate individual variables. For example, correlation analysis is
used to measure the correlation between the patient's blood pressure and
the medication used.
ADVANTAGES
Awareness of the behavior between two variables: A
01 correlation helps to identify the absence or presence of
a relationship between two variables. It tends to be more
relevant to everyday life.
LESSON 4
Page 06 of 15
ADVANTAGES
Uses for further studies: Researchers can identify the
03 direction and strength of the relationship between two
variables and later narrow the findings down in later
studies.
LESSON 4
Presentation by Group 3 | III-ABE2 Page 14 of 15
REFERENCES
https://www.questionpro.com/features/correlation-
analysis.html#:~:text=What%20is%20correlation%20analysis%3F,the%20change%20in%20the%20other.
https://blog.flexmr.net/correlation-analysis-definition-exploration
https://sphweb.bumc.bu.edu/otlt/mph-modules/bs/bs704_multivariable/bs704_multivariable5.html
https://www.questionpro.com/blog/spearmans-rank-coefficient-of-
correlation/#:~:text=For%20example%2C%20if%20the%20first,column%2C%20square%20your%20d%
20values.&text=The%20Spearman's%20Rank%20Correlation%20for,a%20perfect%20association%20of
%20rank.
CORRELATION ANALYSIS
Presentation by
GROUP 3 | III-ABE2a
Caryl R. Loterte
THANK
Kimberly Jane M. Mitra
Emmanuel John D. Olitan
William C. Pamparo
YOU!