MANSCI Midterm Correlation
MANSCI Midterm Correlation
. .
Lesson 1: Correlation
Bivariate Data
Bivariate data is data in which two variables are measured on an individual.
Collecting information ex: weight and height
The response variable (dependent/outcome/resulting variable/y) (variable with interest
of investigator) is the variable whose value can be explained or determined based
upon the value of the predictor variable (independent/x/criterion variable) presumed
to affect other variables.
Usually one dependent and many independent
Attitude (dependent) on premarital sex; factors that influence attitude (independent): family,
media exposure, sex, cultural backgrounds
A lurking variable is one that is related to the response and/or predictor variable,
but is excluded from the analysis. Hypothesize on relationship between age and reading
habit. Older the person the more he reads.
What comes first is the independent/predictor/response variable = age
The older the person, with adequate education (LV because not included in the analysis), the
more he reads
Unit 2: Probability Distributions t z f
Lesson 1: Correlation
Scatter Diagrams
A scatter diagram shows the relationship between two quantitative
variables measured on the same individual.
The value of the predictor is read on the
horizontal axis and the response variable
on the vertical axis.
Each individual in the data set is
represented by a point in the scatter
diagram.
Do not connect the points when drawing
a scatter diagram.
30 2520 28
3065 20
25 3600 18
3300 19
20
3625 19
3590 19
15
2000 2500 3000 3500 4000 2605 23
Linear
(Increasing) Nonlinear
3. 4.
r ≈ .9 r ≈ .4
r ≈ –.9 r ≈ –.4
(a) r = –0.969
(b) r = –0.049
(c) r = –1
(d) r = –0.992
(d) Price of a Big Mac and the number of MacDonald’s french fries
sold in a week.
(e) Shoe size and IQ.
S xy
r
S xx S yy
x
2
S xx ( xi x ) x
2 i
where 2
i
n
y
2
S yy ( yi y ) y
i
2 2
i
n
S xy ( xi x )( yi y ) xi yi
x y
i i
2520 28
30
negative linear 3065 20
25 relationship 3600 18
20
between 3300 19
3625 19
weight and 3590 19
15
2000 2500 3000 3500 400 mileage. 2605 23
0
Weight (lbs) 2370 28
30 2520 28
3065 20
25 3600 18
3300 19
20
3625 19
15 3590 19
2000 2500 3000 3500 400 2605 23
0
Weight (lbs) 2370 28