Activity #3' Answer (Yellchin P. Semblante)
Activity #3' Answer (Yellchin P. Semblante)
Main Campus
Corner M.J. Cuenco Ave.
R.Palma St., Cebu City 6000
Graduate School Studies
Answer: The difference between correlation from regression is that the strength
of the linear link between two variables is measured by correlation. As implied by the
does not distinguish between independent and dependent values whereas regression
2. What is the difference between Pearson Product Moment Correlation from
Answer: A statistic that assesses the linear correlation between two variables
the bivariate correlation. Its value ranges from +1 to -1. The Pearson correlation can
linear only when a change in one variable is associated with a proportional change in
the other variable), whereas the Spearman's rank correlation coefficient, also known
two variables). It evaluates how well a monotonic function can capture the
relationship between two variables. The Spearman correlation uses the ranking
values for each variable rather than the raw data to assess a monotonic relationship
Answer: The statistic known as Kendall's coefficient, also known as the Kendall rank
statistical dependence. The rank correlation coefficient from Kendall (1955) measures
how close two sets of ranks given to the same collection of objects. This coefficient is
dependent on the quantity of object pair inversions required to change one rank order
into the other. To do this, each rank order is represented by the set of all pairs of
objects (for example, [a,b] and [b,a] are the two pairings that represent the items a
and b), and a value of 1 or 0 is used to indicate whether the pair is a pair of objects.
coefficient, rrb, is used (ordinal). The formula is typically expressed as rrb = 2 •(Y1 -
Y0)/n, where n is the number of data pairs and Y0 and Y1, respectively, are the Y
score means for data pairs with an x score of 0 and 1. These Y values are ranked.
This formula assumes there are no tied ranks. This could be the same as a Somer's
c. Regression Analysis
independent variables. A regression analysis allows you to predict the effects of the
independent variable on the dependent variable. For example, age and height can be
described using a linear regression model. Because a person's height increases with
d. Contingency Coefficient
In other words, it reflects how similar two or more variables' measurements are
That is, it summarizes sample data without allowing you to draw conclusions about
the population. When there are two variables, a correlation coefficient is a bivariate
statistic, and when there are more than two variables, it is a multivariate statistic.
required to generalize your findings to the population. To compute a, you can use a F
test or a t test to calculate a test statistic that indicates the statistical significance of
your discovery.
Answer: The Linear Probability Model (LPM) The Linear Probability Model (LPM) is
simply the application of ordinary least squares. (OLS) to binary rather than
continuous outcomes. Equation 1 illustrates the LPM in the context of. Y is the
In this case, the dependent variable for each observation is either 0 or 1. The
linear regression is used. When you want to know how strong the relationship
between two variables is (for example, the relationship between rainfall and soil
In business, linear regressions can be used to evaluate trends and make estimates or
forecasts. For example, if a company's sales have steadily increased every month for
the past few years, the company could forecast sales in future months by conducting
h. Concordant
Answer: Concordant is simply the percentage of data pairs that behave as expected
(assuming no ties in relative risk values between any pairs of observations). Another
and nurture. Because the word concordance means 'to agree,' the concordance rate
i. Coefficient of Correlation
that quantifies the strength of the linear relationship between two variables. In a
The formula compares the distance of each datapoint from the variable mean and
uses this to determine how well the relationship between the variables can be fit to an
imaginary line drawn through the data for two variables. When we say that
j. Coefficient of Determination
model. The coefficient of determination can also be found using the following formula:
R2 = MSS/TSS = (TSS RSS)/TSS, where MSS is the model sum of squares (also
known as ESS, or explained sum of squares), which is the sum of the squares of the
prediction from the linear regression minus the mean for that variable; TSS is the total
sum of squares, which is the sum of the squares of the measurements minus the
k. Confidence Intervals
Answer: Confidence intervals are one way to express how "good" an estimate is; the
larger the 90% confidence interval for a specific estimate, the greater the caution
required when using the estimate. Confidence intervals serve as a helpful reminder of
range of values for our estimated population parameter rather than a single value or a
point estimate. The estimated confidence interval provides us with a range of values
within which we believe the true population value falls, with varying degrees of
certainty.
l. Method of Least Square
Answer: The least squares method is a type of mathematical regression analysis that
is used to determine the best fit line for a set of data points, providing a visual
representation of the relationship between the data points. Each data point
dependent variable. The regression analysis method begins with a set of data points
that will be plotted on an x- and y-axis graph. Using the least squares method, an
analyst will generate a best-fit line that explains the potential relationship between
independent and dependent variables. The least squares method provides the overall
rationale for locating the line of best fit among the data points under consideration.
The most common application of this method, which is also known as "linear" or
"ordinary," aims to generate a straight line that minimizes the sum of the squares of
the errors generated by the results of the associated equations, such as the squared
residuals resulting from differences in the observed and anticipated values based on
that model.
Answer: A straight line with the best fit is one that minimizes the distance between it
and some data. In a scatter plot of different data points, the line of best fit is used to
forecast indicators and price movements. The regression line is chosen using the
least squares criterion. The regression line is also known as the "line of best fit"
because it is the best fit when drawn through the points. It is a line that minimizes the
the model. The variance that the model does not predict is represented by the
If you prefer, you can express the R2 as a percentage rather than a proportion.
model fits the observed data. A coefficient of determination of 60%, for example,
indicates that 60% of the data fit the regression model. A higher coefficient, in