0% found this document useful (0 votes)

15 views51 pages

07 Linear Regression Jan30

The document discusses linear regression and correlation. It provides information on interpreting Pearson's r values, calculating the regression line and slope, and how r values indicate the proportion of variability in the outcome (Y) that is accounted for by the predictor (X). Pearson's r is best used for describing linear relationships between two interval/ratio variables, while eta (η) can describe strength of curvilinear relationships.

Uploaded by

Vanessa Wong

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views51 pages

07 Linear Regression Jan30

Uploaded by

Vanessa Wong

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 51

PSYC 218 006 (Dr.

Chen)
Lecture 7
January 30, 2024
Linear Regression

THESE SLIDES ARE PROVIDED AS A COURTESY AND STUDY AID FOR YOUR PERSONAL USE.
DO NOT REPOST OR REDISTRIBUTE ANY PART OF THESE SLIDES WITHOUT YOUR INSTRUCTOR’S PERMISSION.
Today’s Topics
• Interpreting and using r values
• Calculating the regression line

2
The equation for calculating Pearson r using z-
scores is

OR…
Don’t forget the
square root!

Pagano, p. 133 3
In general, for relationships found in the
behavioural sciences:
If r is… Interpretation
Equal to 0 No relationship
Leading zeros
(before the
decimal point)
Between 0 and .10 Trivial
are not used in
APA style when Between .10 and .30 Small to medium
reporting
Pearson r values
Between .30 and .50 Medium to large

Greater than .50 Large to very large

4
A strong positive relationship (r = .73) exists
between the variables X and Y. This relationship
could exist because:

A. X causes Y
B. Y causes X
C. A third variable causes both X and Y
D. Any of the above
E. None of the above

5
A strong positive relationship (r = .73) exists
between the variables X and Y. This relationship
could exist because:

A. X causes Y
B. Y causes X
C. A third variable causes both X and Y
D. Any of the above
E. None of the above

6
7
https://www.statology.org/correlation-does-not-imply-causation-examples

8
https://dayoftheshirt.com/shirts/157048/correlation-does-not-imply-causation-snorgtees 9
https://xkcd.com/552/

https://thequalityadvisor.blogs
pot.com/2016/03/correlation-
does-not-equal-causation.html

10
https://www.cnn.com/2014/09/04/health/no-sleep-brain-size/index.html
https://www.cbsnews.com/news/sugar-rush-to-prison-study-says-lots-of- 11
candy-could-lead-to-violence/
https://www.cnn.com/2014/09/04/health/no-sleep-brain-size/index.html
12
Correlation ≠ Causation, but…
Sometimes we care more about prediction than
causation.
For example, if you want to know how hot it is in
Vancouver, it’s more useful to know…

whether it’s July or than whether it’s sunny

December (calendar or cloudy right now
month does not cause (sunshine does cause
heat, but is highly heat, but is only
correlated to moderately correlated to
temperature)… temperature).
13
Correlation and Prediction
Let’s start with an example of two perfectly correlated
variables. How would you predict Yi given Xi?

14
Pagano, p. 124
Y = bX + a

a = Y intercept
b = slope

Find a and b.

A. 500; 1000
B. 0.40; 500
C. 0; 500
D. 500; 0.40
E. None of the above

15
Pagano, p. 124
Y = bX + a

a = Y intercept
b = slope

Find a and b.

A. 500; 1000
B. 0.40; 500
C. 0; 500
D. 500; 0.40
E. None of the above

For calculating the slope,

“Rise” = 900 - 500 = 400
“Run” = 1000 - 0 = 1000 16
Pagano, p. 124
Y = 0.4000X + 500.0000

You can use this formula

to predict Y from any
given value of X.
Regression constants
(ay and by) should be
reported to 4 decimal
places. This helps
retain accuracy in
your final answer
when using the
equation to predict Y.

17
Pagano, p. 124
• Most variables in psychology will not be
perfectly correlated.
• However, as long as the relationship is linear, a
“line of best fit” can still be calculated.
• This line can help us make predictions about Yi
given Xi.

18
Pagano, p. 131
Which regression line would give the smallest
errors when predicting Yi given Xi?

A B C

19
Which regression line would give the smallest
errors when predicting Yi given Xi?

A B C

Note that this relationship

also has the highest r value
20
Pearson r tells us how helpful the regression line
will be in predicting Yi given Xi.

When r = 1, the When r = 0, For r values in

regression line the regression between 0 and 1,
will produce line will not the regression
perfect help at all in line will produce
predictions (no predicting Yi moderate errors
errors) 21
Pearson r also tells us the extent to which
differences in Y can be explained
(mathematically) by differences in X.

or in technical language…

Pearson r also tells us something about how

much of the variability in Y is accounted for by
(the variability in) X.
22
Example: A large cheese pizza costs $20. Each
additional topping costs $2.

$20 $22 $24 $26

The number of pizza toppings accounts for 100%

of the differences (variability) in pizza
prices…but NOT 100% of the total price.
23
Pagano, p. 124
Merchandise sold ($) accounts
for 100% of the differences
(variability) in salary, but not
100% of the total salary 24
Most of the variability in the number of fingers
people have can be accounted for by…
A. Genetics
B. Environmental factors
C. I know you’re trying to trick me, but I’m not sure
how.

25
Most of the variability in the number of fingers
people have can be accounted for by…
A. Genetics
B. Environmental factors
C. I know you’re trying to trick me, but I’m not sure
how.

Although genes determine the

average (modal) number of fingers
that humans have, environmental
factors (injuries; accidents) account
for most of the variability. 26
“Proportion of variability accounted for” is a
statement about a correlation.
– It is not necessarily a statement about a causal
relationship.
– It is a statement about variability (differences
between values in a dataset), not average values.

27
r Proportion
of variability
r2 = proportion of explained
the variability of Y .10 .01
accounted for by X .20 .04
.30 .09

See p. 137-139 of your textbook for a

.40 .16
derivation. (But, you won’t be tested .50 .25
on how this formula is derived.)
.60 .36
.70 .49
.80 .64
.90 .81
1.00 1.00
28
r2 = proportion of
the variability of Y
accounted for by X

See p. 137-139 of your textbook for a

derivation. (But, you won’t be tested
on how this formula is derived.)

Pagano, p.140
(Same information as previous slide
but expressed in percentages)

29
In a sample of students,
height and weight are
correlated with r = .65.
What percentage of the
variability in weight is
accounted for by height
in this sample?

A. .65
B. .42
C. 65.00 Pagano, p.140
(Same information as previous slide
D. 42.25 but expressed in percentages)

30
In a sample of students,
height and weight are
correlated with r = .65.
What percentage of the
variability in weight is
accounted for by height
in this sample?

A. .65
B. .42
C. 65.00 Pagano, p.140
(Same information as previous slide
D. 42.25 but expressed in percentages)

31
Pearson r is used for describing linear
relationships, when X and Y are both measured
on interval or ratio scales

32
If the relationship is curvilinear, the correlation
coefficient eta (η) can be used to describe the
strength of the relationship

33
Other linear correlation coefficients:
– If one or both variables are measured on an
ordinal scale, the Spearman rank order correlation
coefficient rho (rs) can be used
– If one of the variables is interval or ratio and the
other is dichotomous, the biserial correlation
coefficient (rb) can be used
– If both variables are dichotomous, the phi
coefficient (Φ) can be used

SPSS calculates the correlation coefficients rb or Φ automatically

using modified versions of the formula for Pearson r
34
You do NOT need to know how to calculate the
Spearman rank order correlation coefficient (p.
141), or the other types of correlation
coefficients, by hand

You should be able to recognize these other

correlation coefficients, understand when
they’re used, generate them using SPSS (for your
next assignment), and report them properly

35
Today’s Topics
• Interpreting and using r values
• Calculating the regression line

36
Let’s say we want to use IQ to predict GPA

The line of best fit

is the one that
minimizes the
overall prediction
error for GPA

How do we
calculate the line
of best fit?

37
Pagano, p. 161
Finding the line of best fit

Pagano, p. 162
We want to find a line that minimizes the total error (deviation of each
of the points from the line).
Same logic as for calculating variance – square the deviations first so 38
that the positive and negative values don’t just cancel each other out
Least-squares regression line
The least-squares regression line is the
prediction line that minimizes the total error of
prediction, according to the least-squares
criterion of ∑ (Y −Y ')2

For any linear relationship, there is only one line

that will minimize ∑ (Y −Y ')2

39
Least-squares regression line
Like any line, the regression line can be defined
with an equation in the general form
Y = bX + a

Let’s first
calculate by
40
Pagano, p. 163
Least-squares regression line
Calculate by
Use when r, sy, and sx
with one of have already been
these two calculated
formulas:

Use with raw

data

41
Pagano, p. 163, p.173
To calculate by from raw data:
Use the same strategy that we used for calculating Pearson r

Step 1: Calculate X2, Y2, and XY, as needed, for all raw values

42
Pagano, p. 134
To calculate by from raw data:
Use the same strategy that we used for calculating Pearson r

Step 1: Calculate X2, Y2, and XY, as needed, for all raw values

43
Pagano, p. 134
To calculate by from raw data:
Use the same strategy that we used for calculating Pearson r

Step 1: Calculate X2, Y2, and XY, as needed, for all raw values
Step 2: Calculate the sum (Σ) for all columns
44
Pagano, p. 134
To calculate by from raw data:
Use the same strategy that we used for calculating Pearson r

Step 1: Calculate X2, Y2, and XY, as needed, for all raw values
Step 2: Calculate the sum (Σ) for all columns
45
Pagano, p. 134
To calculate by from raw data:
Use the same strategy that we used for calculating Pearson r

Step 1: Calculate X2, Y2, and XY, as needed, for all raw values
Step 2: Calculate the sum (Σ) for all columns
Step 3: You’re ready to use your formula!
46
Pagano, p. 134
Least-squares regression line

After calculating by,

we can calculate ay

47
Least-squares regression line

X=
∑ X
N

Y=
∑ Y
Already
N calculated!

Practice on your own: textbook p.164-169

48
Least-squares regression line

Now plug by and ay back

into your formula…

49
Least-squares regression line
For your assignments
and exams, report
regression constants
(ay and by) to 4 decimal
places.

…and you have your

regression line!

Pagano, p. 164 50
Recommended Homework
Problems at end of textbook Ch. 7 (p. 179-182):
– 1-9, 13
– For extra practice: 10-11, 14, 16-18

Econometrics: A Simple Introduction
From Everand
Econometrics: A Simple Introduction
K.H. Erickson
3.5/5 (5)
Nordli Buying Guide A4
No ratings yet
Nordli Buying Guide A4
8 pages
ASD The Meltdown
No ratings yet
ASD The Meltdown
4 pages
Stat Chapter 9
No ratings yet
Stat Chapter 9
34 pages
How Can We Explore The Association Between Two Quantitative Variables?
No ratings yet
How Can We Explore The Association Between Two Quantitative Variables?
7 pages
Handout 5 Correlation and Regression (Recovered)
No ratings yet
Handout 5 Correlation and Regression (Recovered)
6 pages
Correlation
100% (1)
Correlation
29 pages
Linear Regression II
No ratings yet
Linear Regression II
54 pages
Correlation Simple Regression
No ratings yet
Correlation Simple Regression
26 pages
06 Correlation and Regression
No ratings yet
06 Correlation and Regression
63 pages
Lecture Week 12 - Intro To Regression
No ratings yet
Lecture Week 12 - Intro To Regression
5 pages
Correlation and Regression Original
No ratings yet
Correlation and Regression Original
44 pages
Week 8 2025 - Correlation and Regression
No ratings yet
Week 8 2025 - Correlation and Regression
47 pages
Psych Stat Reviewer Midterms
No ratings yet
Psych Stat Reviewer Midterms
10 pages
Final Project: Raiha, Maheen, Fabiha Mahnoor, Zara
No ratings yet
Final Project: Raiha, Maheen, Fabiha Mahnoor, Zara
14 pages
Linear Regression Analysis - 1
No ratings yet
Linear Regression Analysis - 1
18 pages
19 - Correlation and Regression
No ratings yet
19 - Correlation and Regression
7 pages
Relationship - Correlation and Regression
No ratings yet
Relationship - Correlation and Regression
42 pages
Correlation & Regression
No ratings yet
Correlation & Regression
65 pages
Lectures 14 15
No ratings yet
Lectures 14 15
66 pages
R22 Pagano P 128-130 Explanation of Correlation Theory
No ratings yet
R22 Pagano P 128-130 Explanation of Correlation Theory
3 pages
Class 11 12 Lecture Slides JB Fall 2020 Student and Instructor Version
No ratings yet
Class 11 12 Lecture Slides JB Fall 2020 Student and Instructor Version
41 pages
Simple Linear Regression and Correlation
No ratings yet
Simple Linear Regression and Correlation
77 pages
Quantitative Analysis III Slides (Printer-Friendly) - 2023.03.15
No ratings yet
Quantitative Analysis III Slides (Printer-Friendly) - 2023.03.15
52 pages
Parametric Test
No ratings yet
Parametric Test
49 pages
Corr and Regress
No ratings yet
Corr and Regress
42 pages
06 Regression
No ratings yet
06 Regression
18 pages
5 - Chapter9-Linear Regression
No ratings yet
5 - Chapter9-Linear Regression
15 pages
Topic 1: Investigating Relationships Between Two Numerical Variables
No ratings yet
Topic 1: Investigating Relationships Between Two Numerical Variables
8 pages
Chapter 4: of Tests and Testing 12 Assumptions in Psychological Testing and Assessment
No ratings yet
Chapter 4: of Tests and Testing 12 Assumptions in Psychological Testing and Assessment
5 pages
Correlation and Regression 2020
No ratings yet
Correlation and Regression 2020
63 pages
Correlation and Regression: Associate Professor Georgi Iskrov, PHD Department of Social Medicine and Public Health
No ratings yet
Correlation and Regression: Associate Professor Georgi Iskrov, PHD Department of Social Medicine and Public Health
28 pages
Stats10 - Chapter+4 2
No ratings yet
Stats10 - Chapter+4 2
54 pages
Review: I Am Examining Differences in The Mean Between Groups
100% (2)
Review: I Am Examining Differences in The Mean Between Groups
44 pages
Correlation and Regression Analysis Using SPSS
No ratings yet
Correlation and Regression Analysis Using SPSS
102 pages
Correlation and Regression Analysis: C H A P T E R 5
No ratings yet
Correlation and Regression Analysis: C H A P T E R 5
11 pages
Chapter 9: Correlation and Regression: Solutions
No ratings yet
Chapter 9: Correlation and Regression: Solutions
8 pages
SEE5211 Chapter3-P2017
No ratings yet
SEE5211 Chapter3-P2017
58 pages
Introduction To Correlation and Regression Analysis
No ratings yet
Introduction To Correlation and Regression Analysis
14 pages
Correlation and Regression Analysis
No ratings yet
Correlation and Regression Analysis
8 pages
Second Stats Packet 24
No ratings yet
Second Stats Packet 24
100 pages
ASS#1-FINALS Doromal
No ratings yet
ASS#1-FINALS Doromal
8 pages
Raghunath Chatterjee Correlation Lecture
No ratings yet
Raghunath Chatterjee Correlation Lecture
40 pages
Lecture 4
No ratings yet
Lecture 4
60 pages
Chapter 8
No ratings yet
Chapter 8
45 pages
Correlation and Regression 2
No ratings yet
Correlation and Regression 2
24 pages
Correlation and Regression Analysis
No ratings yet
Correlation and Regression Analysis
37 pages
Regression: Leech N L, Barret K C & Morgan G A (2011)
No ratings yet
Regression: Leech N L, Barret K C & Morgan G A (2011)
35 pages
Psych Assess Chap 4
No ratings yet
Psych Assess Chap 4
5 pages
8614.educational Statitics Unit 7
No ratings yet
8614.educational Statitics Unit 7
39 pages
Lecture 6 Linear Regression
No ratings yet
Lecture 6 Linear Regression
8 pages
Correlation Regression Tutorial
No ratings yet
Correlation Regression Tutorial
42 pages
Econometrics For Finance
100% (1)
Econometrics For Finance
54 pages
Lesson 12 - Introduction To Regression and Correlation Analysis Regression Analysis
No ratings yet
Lesson 12 - Introduction To Regression and Correlation Analysis Regression Analysis
39 pages
Correlation & Regression Analysis
100% (1)
Correlation & Regression Analysis
39 pages
Prediction and Regression
No ratings yet
Prediction and Regression
19 pages
ECN 652 Handout 9 Student
No ratings yet
ECN 652 Handout 9 Student
46 pages
Prediction Is A Key Task of Statistics
No ratings yet
Prediction Is A Key Task of Statistics
18 pages
Math 100: Mathematics in The Modern World (MMW) Data Management
No ratings yet
Math 100: Mathematics in The Modern World (MMW) Data Management
32 pages
Reg Lin
No ratings yet
Reg Lin
73 pages
Math for Computer Applications
From Everand
Math for Computer Applications
The Editors of REA
No ratings yet
Bell's Inequality Untwisted
From Everand
Bell's Inequality Untwisted
Jim Spinosa
No ratings yet
Procedure For Calibrating, Standardizing or Checking Equipment
No ratings yet
Procedure For Calibrating, Standardizing or Checking Equipment
2 pages
BDC - Sap Abap Questionnare
No ratings yet
BDC - Sap Abap Questionnare
6 pages
Software Engineering Course Outline
No ratings yet
Software Engineering Course Outline
3 pages
Final Analysis
No ratings yet
Final Analysis
29 pages
Connecting VB and MS Access Tutorial
No ratings yet
Connecting VB and MS Access Tutorial
10 pages
Longitudinal Studies
100% (1)
Longitudinal Studies
11 pages
Capsule 2 Do It Ypurself Durgesha Dalvi
No ratings yet
Capsule 2 Do It Ypurself Durgesha Dalvi
8 pages
Corrugating Industry - Controlling Warp: Key Words: Warp, Flatness, Moisture Control, Temperature, Curl
No ratings yet
Corrugating Industry - Controlling Warp: Key Words: Warp, Flatness, Moisture Control, Temperature, Curl
5 pages
Mark Connelly, Jo Fox, Stefan Goebel, Ulf Schmidt (Ed.) - Propaganda and Conflict. War, Media and Shaping The Twentieth Century-Bloomsbury Academic (2019)
No ratings yet
Mark Connelly, Jo Fox, Stefan Goebel, Ulf Schmidt (Ed.) - Propaganda and Conflict. War, Media and Shaping The Twentieth Century-Bloomsbury Academic (2019)
367 pages
Five Axis Articulated Robot (Scorbot)
No ratings yet
Five Axis Articulated Robot (Scorbot)
2 pages
Roll Forming Technology
No ratings yet
Roll Forming Technology
24 pages
Vector Addition
No ratings yet
Vector Addition
23 pages
1.1COD Method
No ratings yet
1.1COD Method
2 pages
Art Rubric
No ratings yet
Art Rubric
1 page
L'Oréal 2020 Digital Strategy
No ratings yet
L'Oréal 2020 Digital Strategy
12 pages
Munar, Ronald C-WPS Office
No ratings yet
Munar, Ronald C-WPS Office
6 pages
Certin 3rd Round Machine Ubuntu
No ratings yet
Certin 3rd Round Machine Ubuntu
60 pages
Ta-Rw244 Manual e
No ratings yet
Ta-Rw244 Manual e
16 pages
Enterprise and Global Management of E-Business Technology
No ratings yet
Enterprise and Global Management of E-Business Technology
12 pages
EEE4119F Project Brief - M3
No ratings yet
EEE4119F Project Brief - M3
2 pages
840-2 CST-2 2025
No ratings yet
840-2 CST-2 2025
2 pages
12IP and CS BOTH - 100 - VIVA Qs - CS 12 by Lovejeet Arora
No ratings yet
12IP and CS BOTH - 100 - VIVA Qs - CS 12 by Lovejeet Arora
8 pages
Jbel Lahdid (Essaouira) - Schedule 24 - Maintenance Plan Dm037065-En R2
No ratings yet
Jbel Lahdid (Essaouira) - Schedule 24 - Maintenance Plan Dm037065-En R2
34 pages
Grade 7 Agric, Scie & Tech Paper 2
No ratings yet
Grade 7 Agric, Scie & Tech Paper 2
6 pages
Oxyfuel Combustion in Rotary Kiln Lime Production PDF
No ratings yet
Oxyfuel Combustion in Rotary Kiln Lime Production PDF
12 pages
9 889 435 03+SS EC69+SS EC69i+SS EC79
No ratings yet
9 889 435 03+SS EC69+SS EC69i+SS EC79
2 pages
CDPC Placement Policy 2024-25 - 20th June
No ratings yet
CDPC Placement Policy 2024-25 - 20th June
6 pages
Python - E.balagurusamy Pages 1-50 - Flip PDF Download - FlipHTML5
No ratings yet
Python - E.balagurusamy Pages 1-50 - Flip PDF Download - FlipHTML5
336 pages

07 Linear Regression Jan30

Uploaded by

07 Linear Regression Jan30

Uploaded by

PSYC 218 006 (Dr.

Greater than .50 Large to very large

whether it’s July or than whether it’s sunny

For calculating the slope,

You can use this formula

Note that this relationship

When r = 1, the When r = 0, For r values in

Pearson r also tells us something about how

$20 $22 $24 $26

The number of pizza toppings accounts for 100%

Although genes determine the

See p. 137-139 of your textbook for a

See p. 137-139 of your textbook for a

SPSS calculates the correlation coefficients rb or Φ automatically

You should be able to recognize these other

The line of best fit

For any linear relationship, there is only one line

Use with raw

After calculating by,

*Practice on your own: textbook p.164-169*

Now plug by and ay back

…and you have your

You might also like

Practice on your own: textbook p.164-169