Correlation
Correlation
Dr. N. D. VOHRA
Chapter 12
Correlation analysis
INTRODUCTION
Classification of
Statistical data Bivariate (Two
Variables)
More than one
variable
Multivariate
(more than two
variables)
INTRODUCTION
• Bivariate Data : When two variables move in sympathy with each other
so that changes in one variable are associated with changes in the other
variable in the same, or in the opposite direction, they are said to be
correlated.
• When the variables move in same direction, then the correlation is said
to be positive while if they are in the opposite directions, the
correlation is said to be negative.
• Remember that the direction of movement indicated is in general. It
means that it is not necessary that in positive correlation a higher value
of one variable shall necessarily be accompanied by a higher value of
the other.
DIRECTION AND DEGREES OF
CORRELATION
DIRECTION DEGREE
• The more scattered are the dots, the smaller is the degree of correlation
between the variables.
• There is no correlation between the variables when
❑ the dots are so scattered that there is no clear direction of their slope, and
❑ the dots are falling on a line that is parallel to the X-axis or the Y-axis.
A line parallel to the X-axis implies that the variable Y is not responsive to
changes in X whereas a line parallel to the Y-axis implies that X is not
sensitive to changes in Y.
Hence there is no correlation in either case.
SOME SELECTED SCATTER DIAGRAMS
EXAMPLE
Salesman: 1 2 3 4 5 6 7 8 9 10
Test scores: 18 20 21 22 27 27 28 29 29 29
Sales (000 Rs): 23 27 29 28 28 31 35 30 36 33
SOLUTION
Line through
the points
KARL PEARSON’S
COEFFICIENT OF CORRELATION
•
KARL PEARSON’S
COEFFICIENT OF CORRELATION
•
KARL PEARSON’S
COEFFICIENT OF CORRELATION
• This coefficient may assume negative as well as positive values and its
value can lie only within ±1.
• The negative sign of the correlation coefficient implies negative
correlation between the variables and positive sign implies a positive
correlation.
• Ignoring sign, closer the coefficient to zero, smaller the degree of
correlation and closer is the value to 1, higher is the degree of
correlation.
• However, the correlation coefficient should always be interpreted taking
in to account the sample size.
AN EXAMPLE
•
CALCULATION OF
COEFFICIENT OF CORRELATION
By Measuring Deviations From Mean Values
CALCULATION OF
COEFFICIENT OF CORRELATION
By Measuring Deviations From Assumed Mean Values
CALCULATION OF
COEFFICIENT OF CORRELATION
Without Measuring Deviations
ASSUMPTIONS OF THE COEFFICIENT
OF CORRELATION, r
•
TIED RANKS
•
MULTIPLE AND PARTIAL
CORRELATION
• When the data involve two variables, the correlation between the variables
is called simple correlation
• When they involve more than two variables, then we study multiple and
partial correlations.
• In such data, there are two or more independent variables which affect a
dependent variable.
• Multiple correlation is used to study the joint or cumulative effect of all the
given independent variables on the dependent variable.
• The partial correlation involves a study of correlation between one
independent variable and the dependent variable holding the other
independent variable(s) constant statistically.
END OF CHAPTER 12