Properties and LINE Conditions
Further Topics...
1 Four “LINE” conditions of a simple linear regression model
2 Math Formulas of b0 (intercept) and b1 (slope)
3 Properties of b0 and b1
4 Estimation of σ² (population variance)
Simple Linear Regression Model Four “LINE” Conditions
A simple linear regression model for a data set (xi , Yi ) is defined as
Yi = β0 + β1 xi + εi, i = 1, …, n.
Four conditions for a simple linear regression model:
1 The mean of the response, E(Yi ) = β0 + β1 xi is a Linear function of the xi .
2 The errors, εi , are Independent.
3 The errors, εi , at each value of the predictor, xi , are Normally distributed.
4 The errors, εi , at each value of the predictor, xi , have Equal variances (denoted σ²).
We are studying “LINE” in this course.
Least Squares Estimates: b0 (Estimate of β0) and b1 (Estimate of β1)
In the previous lecture, we talked about a data set of 10 students, and we have
heights (h) and weights (w) of the 10 students.
The “best fitting line” is shown in the following plot: the intercept b0 = −266.53 and
the slope b1 = 6.14.
[Figure: scatterplot of Weight versus Height with the fitted line. The vertical dashed line marks x̄ = 69.3 and the horizontal dashed line marks Ȳ = 158.8; labeled points include (63, 127), (64, 121), (73, 181), and (75, 208).]
By differentiation of the least squares criterion

Q = Σᵢ₌₁ⁿ [Yi − (b0 + b1 xi)]²

we can get the two normal equations

Σᵢ₌₁ⁿ (Yi − b0 − b1 xi) = 0

Σᵢ₌₁ⁿ xi (Yi − b0 − b1 xi) = 0
Solving the two equations in the previous slide, we get

b1 = Σᵢ₌₁ⁿ (xi − x̄)(Yi − Ȳ) / Σᵢ₌₁ⁿ (xi − x̄)² = Sxy / Sxx

b0 = Ȳ − b1 x̄
1 Because the formulas for b0 and b1 are derived using the least squares
criterion, the resulting equation
Ŷi = b0 + b1 xi
is often referred to as the “least squares regression line,” or simply the
“least squares line.”
2 Re-arranging the terms in the formula
b0 = Ȳ − b1 x̄,
we can get
Ȳ = b0 + b1 x̄,
which means that the least squares line passes through the point (x̄, Ȳ ).
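The formulas for b1 and b0 can be sketched directly in Python. The numbers below are hypothetical heights and weights for illustration only, not the lecture's 10-student data set.

```python
# Hypothetical (x, Y) data for illustration.
x = [63, 64, 66, 69, 73, 75]
y = [127, 121, 142, 157, 181, 208]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# b1 = Sxy / Sxx  and  b0 = Y-bar - b1 * x-bar
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
s_xx = sum((xi - x_bar) ** 2 for xi in x)
b1 = s_xy / s_xx
b0 = y_bar - b1 * x_bar

# The least squares line passes through the point (x-bar, Y-bar).
assert abs((b0 + b1 * x_bar) - y_bar) < 1e-9
```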
Some Notations
We use the notations:
1 Sum of squares for x:

Sxx = Σᵢ₌₁ⁿ (xi − x̄)² = Σᵢ₌₁ⁿ xi² − n x̄²

2 Sum of squares for Y:

Syy = Σᵢ₌₁ⁿ (Yi − Ȳ)² = Σᵢ₌₁ⁿ Yi² − n Ȳ²

3 Cross-product sum of squares:

Sxy = Σᵢ₌₁ⁿ (xi − x̄)(Yi − Ȳ) = Σᵢ₌₁ⁿ xi Yi − n x̄ Ȳ

4 Sample mean for x:

x̄ = (Σᵢ₌₁ⁿ xi) / n

5 Sample mean for Y:

Ȳ = (Σᵢ₌₁ⁿ Yi) / n
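The "shortcut" forms on the right-hand sides of the sums of squares can be checked numerically. A minimal sketch with hypothetical numbers:

```python
# Verify that the definitional and shortcut forms of Sxx, Syy, Sxy agree.
x = [1.0, 2.0, 3.0, 4.0]
y = [2.1, 3.9, 6.2, 7.8]
n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Sxx: sum of squared deviations vs. sum of squares minus n * x-bar^2
s_xx_def = sum((xi - x_bar) ** 2 for xi in x)
s_xx_alt = sum(xi ** 2 for xi in x) - n * x_bar ** 2

# Syy: same identity applied to the responses
s_yy_def = sum((yi - y_bar) ** 2 for yi in y)
s_yy_alt = sum(yi ** 2 for yi in y) - n * y_bar ** 2

# Sxy: cross-product form
s_xy_def = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
s_xy_alt = sum(xi * yi for xi, yi in zip(x, y)) - n * x_bar * y_bar
```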
What Do b0 and b1 Tell Us?
1 b0 is the predicted response value when x = 0.
1. In the example of 10 students’ height and weight, b0 tells us that a person who
is 0 inches tall is predicted to weigh -267 pounds, which is not meaningful.
2. This happens because we “extrapolated” beyond the “scope of the model”
(the range of the observed x values).
2 b1 is the estimate of the change in mean response value E(Y ) for every
additional one-unit increase in the predictor x.
1. In the example of 10 students’ height and weight, b1 tells us that we predict the
mean weight to increase by 6.14 pounds for every additional one-inch increase
in height.
2. In general, we can expect the mean response to increase or decrease by b1
units for every one unit increase in the predictor x.
Understanding the Slope b1
1 If we study the formula for the slope b1 :

b1 = Σᵢ₌₁ⁿ (xi − x̄)(Yi − Ȳ) / Σᵢ₌₁ⁿ (xi − x̄)²

we see that the denominator is necessarily positive (as long as the xi are not
all equal), since it only involves summing squared terms.
2 Therefore, the sign of the slope b1 is solely determined by the numerator.
3 The numerator tells us, for each data point, to sum up the product of two
distances – the distance of the x value from x̄ (the mean of all of the x values)
and the distance of the Y value from Ȳ (the mean of all of the Y values).
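Since the denominator is never negative, the sign of b1 matches the sign of the numerator Sxy. A quick sketch with made-up data:

```python
def slope(x, y):
    """Least squares slope b1 = Sxy / Sxx."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    return s_xy / s_xx

# Increasing trend: positive numerator, hence positive slope.
assert slope([1, 2, 3, 4], [2, 4, 5, 8]) > 0
# Decreasing trend: negative numerator, hence negative slope.
assert slope([1, 2, 3, 4], [8, 5, 4, 2]) < 0
```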
When is the Slope b1 > 0?
1 Is the trend in the following plot positive, i.e., as x increases, Y tends to increase?
2 If the trend is positive, then the slope b1 must be positive.
3 The vertical dashed line is x̄. The horizontal dashed line is Ȳ .
[Figure: the same Weight versus Height scatterplot as before. The vertical dashed line marks x̄ = 69.3 and the horizontal dashed line marks Ȳ = 158.8; labeled points include (63, 127), (64, 121), (73, 181), and (75, 208).]
When is the Slope b1 < 0?
1 Is the trend in the following plot negative, i.e., as x increases, Y tends to decrease?
2 If the trend is negative, then the slope b1 must be negative.
3 The vertical dashed line is x̄. The horizontal dashed line is Ȳ .
[Figure: scatterplot titled “Skin Cancer Mortality versus Latitude,” with Mortality (Deaths per 10 million) on the vertical axis and Latitude (at center of state) on the horizontal axis. The vertical dashed line marks x̄ = 39.5 and the horizontal dashed line marks Ȳ = 152.9; labeled points include (33, 219), (34.5, 160), (43, 134), and (44.8, 86).]
Estimation of σ² (Unknown Population Variance)
Why should we care about σ²? One reason is that we want to predict future
responses from an estimated regression line.
We have two thermometer brands, (A) and (B). The predictor is Celsius and the
response is Fahrenheit. Will thermometer brand (A) or brand (B) yield more
precise future predictions?
[Figure: side-by-side scatterplots of Fahrenheit versus Celsius readings for brand (A) and brand (B), with Celsius ranging from 0 to 50 on both panels.]
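The comparison can be mimicked with simulated data: two hypothetical thermometers share the true line F = 32 + 1.8 C but have different error variances, and the one with the smaller σ² scatters less around the line. (The brands, the two sigma values, and the seed below are all made up for illustration.)

```python
import random

random.seed(0)
celsius = list(range(0, 51, 5))

def readings(sigma):
    """Simulated Fahrenheit readings: true line plus N(0, sigma^2) error."""
    return [32 + 1.8 * c + random.gauss(0, sigma) for c in celsius]

brand_a = readings(sigma=0.5)   # small error variance: precise predictions
brand_b = readings(sigma=5.0)   # large error variance: imprecise predictions

# Deviations from the true line; brand A's are much smaller overall.
err_a = [f - (32 + 1.8 * c) for c, f in zip(celsius, brand_a)]
err_b = [f - (32 + 1.8 * c) for c, f in zip(celsius, brand_b)]
```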
Review of Sample Variance
When there is no predictor x, we use Ȳ to estimate E(Y ), and we use the sample
variance s² to estimate σ².
The sample variance:

s² = Σᵢ₌₁ⁿ (Yi − Ȳ)² / (n − 1)
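As a sketch, the sample variance formula in Python, on hypothetical numbers:

```python
# Sample variance with the n - 1 divisor.
y = [4.0, 7.0, 6.0, 5.0, 8.0]
n = len(y)
y_bar = sum(y) / n                                   # 6.0
s2 = sum((yi - y_bar) ** 2 for yi in y) / (n - 1)    # 10 / 4 = 2.5
```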
[Figure: a probability density curve.]
In the simple linear regression setting, there is a predictor x. At each x
value, there is a sub-group of data points, and we use
Ŷi = b0 + b1 xi
to estimate
E(Yi ) = β0 + β1 xi .
[Figure: two scatterplots of College entrance test score versus High school gpa (1.0 to 4.0). The left panel, “Population of 200 Students,” shows all points with the population regression line; the right panel, “Sample of 20 Students,” shows points plotted as + with the sample regression line.]
Mean Square Error MSE in Simple Linear Regression
The mean square error:

MSE = Σᵢ₌₁ⁿ (Yi − Ŷi)² / (n − 2)
1 The numerator again adds up, in squared units, how far each response Yi is
from its estimated mean Ŷi .
2 The denominator divides the sum by n − 2, because we effectively estimate
two parameters: the population intercept β0 and the population slope β1 .
That is, we lose two degrees of freedom.
3 It can be shown that E(MSE) = σ², i.e., MSE is an unbiased estimator of σ².
We can write it as σ̂² = MSE.
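Putting the pieces together, the MSE can be computed from the residuals of a fitted least squares line. The data below are hypothetical:

```python
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.1, 5.9, 8.2, 9.8]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n

# Least squares estimates: b1 = Sxy / Sxx, b0 = Y-bar - b1 * x-bar.
b1 = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
      / sum((xi - x_bar) ** 2 for xi in x))
b0 = y_bar - b1 * x_bar

# Residuals Yi - Yhat_i; they sum to zero by the first normal equation.
residuals = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]

# Divide by n - 2 (not n - 1): two parameters were estimated.
mse = sum(e ** 2 for e in residuals) / (n - 2)
sigma_hat = mse ** 0.5    # estimate of sigma itself
```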