CORRELATION
CONSTANTS
presenters:
Aleeza
Huzaira Fatima
Correlation is a statistical measure
that expresses the extent to which
WHAT IS two variables are linearly related.
CORRELA It's a common tool for describing
simple relationships without
TION? making a statement about cause
and effect.
WHAT IS CORRELATION
CONSTANT?
A correlation coefficient is a number between -1 and
1 that tells you the strength and direction of a
relationship between variables.
In other words, it reflects how similar the
measurements of two or more variables are across a
dataset.
WHAT DOES A
CORRELATION COEFFICIENT
TELL YOU?
•Summarizing data
•Compare studies
SUMMARIZING DATA
A correlation coefficient is a descriptive statistic. That
means that it summarizes sample data without letting
you infer anything about the population.
A correlation coefficient is a bivariate statistic when it
summarizes the relationship between two variables,
and it’s a multivariate statistic when you have more
than two variables.
COMPARING STUDIES
A correlation coefficient is also an effect size
measure, which tells you the practical significance of
a result.
Correlation coefficients are unit-free, which makes it
possible to directly compare coefficients between
studies.
TYPES OF CORRELATION
•Positive Correlation – when the values of the two
variables move in the same direction so that an
increase/decrease in the value of one variable is
followed by an increase/decrease in the value of the
other variable.
•Negative Correlation – when the values of the two
variables move in the opposite direction so that an
increase/decrease in the value of one variable is
followed by decrease/increase in the value of the other
variable.
•No Correlation – when there is no linear dependence or
no relation between the two variables.
TYPES OF CORRELATION
USING A CORRELATION COEFFICIENT
In correlational research, you investigate whether changes in
one variable are associated with changes in other variables.
After data collection, you can visualize your data with a
scatterplot by plotting one variable on the x-axis and the other
on the y-axis. It doesn’t matter which variable you place on
either axis.
Visually inspect your plot for a pattern and decide whether
there is a linear or non-linear pattern between variables. A
linear pattern means you can fit a straight line of best fit
between the data points, while a non-linear or curvilinear
pattern can take all sorts of different shapes, such as a U-
shape or a line with a curve.
USING A CORRELATION
COEFFICIENT
There are many different correlation coefficients that
you can calculate. After removing any outliers, select
a correlation coefficient that’s appropriate based on
the general shape of the scatter plot pattern. Then
you can perform a correlation analysis to find the
correlation coefficient for your data.
You calculate a correlation coefficient to summarize
the relationship between variables without drawing
any conclusions about causation.
TYPES OF CORRELATION
COEFFICIENTS
•Pearson’s r correlation coefficient
•Spearman rho correlation coefficient
PEARSON’S R CORRELATION
COEFFICIENT
The Pearson’s product-moment correlation coefficient, also
known as Pearson’s r, describes the linear relationship
between two quantitative variables.
These are the assumptions your data must meet if you want
to use Pearson’s r:
•Both variables are on an interval or ratio level of
measurement
•Data from both variables follow normal distributions
•Data have no outliers
•Linear relationship between the two variables
FORMULA
r= strength of the
correlation between
variables x and y
n = sample size
Σxy = the sum of the
products of paired scores
Σx = the sum of x scores
Σy = the sum of y scores
Σx2 = the sum of squared x
scores
Σy2 = the sum of squared y
EXAMPLE
Marks obtained by 5 students in algebra and
trigonometry as given below:
Science 16,15,12,10,8
Geometry 11,18,10,20,17
Calculate the Pearson correlation coefficient.
EXAMPLE
Solution:
Construct the following table:
EXAMPLE
SPEARMAN’S RANK
CORRELATION COEFFICIENT
Spearman’s rho, or Spearman’s rank correlation
coefficient, is the most common alternative to
Pearson’s r. It’s a rank correlation coefficient because
it uses the rankings of data from each variable (e.g.,
from lowest to highest) rather than the raw data itself.
You should use Spearman’s rho when your data fail to
meet the assumptions of Pearson’s r. This happens
when at least one of your variables is on an ordinal
level of measurement or when the data from one or
both variables do not follow normal distributions.
SPEARMAN’S RANK
CORRELATION COEFFICIENT
While the Pearson correlation coefficient measures
the linearity of relationships, the Spearman
correlation coefficient measures the monotonicity of
relationships.
In a linear relationship, each variable changes in one
direction at the same rate throughout the data
range. In a monotonic relationship, each variable
also always changes in only one direction but not
necessarily at the same rate.
TYPES OF
MONOTON
IC
•Positive
monotonic: when
one variable
increases, the
other also
increases.
•Negative
monotonic: when
one variable
increases, the
other decreases.
SPEARMAN’S RANK
CORRELATION COEFFICIENT
FORMULA
ρ= strength of the rank
correlation between variables
di = the difference between
the x-variable rank and the y-
variable rank for each pair of
data
∑di2= sum of the squared
differences between x- and y-
variable ranks
n = sample size
EXAMPLE 1
EXAMPLE 2
ANY QUESTIONS?