
Chapter 1 Simple Linear Regression

(Part 2)

1 Software R and regression analysis

R can be downloaded from http://www.r-project.org/. Some useful commands:

• setwd(’path..’) ... to change the directory for data loading and saving

• read.table .... for reading/loading data

• data$variable .... variable in the data

• plot(X, Y) ... plotting Y against X (starting a new plot);

• lines(X, Y)... to add lines on an existing plot.

• object = lm(y ~ x) ... to call “lm” to estimate a model and store the calculation
results in “object”

• Exporting the plotted figure (save as .pdf, .ps or other files)

Example 1.1 Suppose we have 10 observations for (X, Y ): (1.2, 1.91), (2.3, 4.50), (3.5,
2.13), (4.9, 5.77), (5.9, 7.40), (7.1, 6.56), (8.3, 8.79), (9.2, 6.56), (10.5, 11.14), (11.5, 9.88).
They are stored in the file data010201.dat. We hope to fit the linear regression model

Yi = β0 + β1 Xi + εi , i = 1, ..., n

R code (the words after # are comments only):

mydata = read.table(’data010201.dat’) # read the data from the file

X = mydata$V1 # select X
Y = mydata$V2 # select Y
plot(X, Y) # plot the observations (data)
myreg = lm(Y ~ X) # do the linear regression
summary(myreg) # output the estimation

Coefficients:
            Estimate Std. Error t value Pr(>|t|)
(Intercept)   1.3931     0.9726   1.432 0.189932
X             0.7874     0.1343   5.862 0.000378 ***

Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 1.406 on 8 degrees of freedom


Multiple R-squared: 0.8111, Adjusted R-squared: 0.7875
F-statistic: 34.36 on 1 and 8 DF, p-value: 0.0003778

lines(X, myreg$fitted) # plot the fitted


title("Scatter of (X,Y) and fitted linear regression model") # add title
# Please get to know how to make a figure file for latter use
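
For example, a minimal sketch of saving the figure as a PDF file (the file name fig010201.pdf is only an illustration):

pdf("fig010201.pdf") # open a PDF graphics device (file name is arbitrary)
plot(X, Y) # redraw the scatter plot
lines(X, myreg$fitted) # add the fitted regression line
title("Scatter of (X,Y) and fitted linear regression model")
dev.off() # close the device and write the file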

Figure 1: Scatter of (X, Y) and the fitted linear regression line

The fitted regression line/model is

Ŷ = 1.3931 + 0.7874X

For any new subject/individual with X = X∗, the prediction of E(Y) is

Ŷ = b0 + b1 X∗.

For the above data,

• If X∗ = −3, then we predict Ŷ = −0.9690

• If X∗ = 3, then we predict Ŷ = 3.7553

• If X∗ = 0.5, then we predict Ŷ = 1.7868
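
These predictions can also be computed in R from the fitted object; a minimal sketch (the name newX is arbitrary, and the column must be called X to match the regressor):

newX = data.frame(X = c(-3, 3, 0.5)) # the new X values
predict(myreg, newdata = newX) # predicted E(Y); reproduces the values above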

2 Properties of Least squares estimators

Statistical properties in theory

• LSE is unbiased: E{b1 } = β1 , E{b0 } = β0 .

Proof: By the model, we have

Ȳ = β0 + β1 X̄ + ε̄

and
b1 = Σ_{i=1}^n (Xi − X̄)(Yi − Ȳ) / Σ_{i=1}^n (Xi − X̄)²

   = Σ_{i=1}^n (Xi − X̄)(β0 + β1 Xi + εi − β0 − β1 X̄ − ε̄) / Σ_{i=1}^n (Xi − X̄)²

   = β1 + Σ_{i=1}^n (Xi − X̄)(εi − ε̄) / Σ_{i=1}^n (Xi − X̄)²

   = β1 + Σ_{i=1}^n (Xi − X̄)εi / Σ_{i=1}^n (Xi − X̄)²

Recall that Eεi = 0. It follows that

Eb1 = β1 .

For b0 ,

E(b0 ) = E(Ȳ − b1 X̄) = β0 + β1 X̄ − E(b1 )X̄ = β0 + β1 X̄ − β1 X̄

= β0

• Variance of the estimators

Var(b1) = σ² / Σ_{i=1}^n (Xi − X̄)²,    Var(b0) = σ² [ 1/n + X̄² / Σ_{i=1}^n (Xi − X̄)² ]

[Proof:

Var(b1) = Var( Σ_{i=1}^n (Xi − X̄)εi / Σ_{i=1}^n (Xi − X̄)² )

        = { Σ_{i=1}^n (Xi − X̄)² }⁻² Var{ Σ_{i=1}^n (Xi − X̄)εi }

        = { Σ_{i=1}^n (Xi − X̄)² }⁻² Σ_{i=1}^n (Xi − X̄)² σ²

        = σ² / Σ_{i=1}^n (Xi − X̄)².

We shall prove the second equation later.]

• Estimated (fitted) regression function Ŷi = b0 + b1 Xi ; Ŷi is also called the fitted value. It satisfies

E{Ŷi} = EYi

[Proof:
E(Ŷi) = E(b0 + b1 Xi) = E(b0) + E(b1)Xi = β0 + β1 Xi = EYi ]
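
Unbiasedness can also be illustrated by simulation: generate many samples from a model with known coefficients and average the slope estimates. A minimal sketch, where the true values β0 = 1, β1 = 2, σ = 1 and the X grid are arbitrary choices:

set.seed(1) # for reproducibility
beta0 = 1; beta1 = 2; sigma = 1 # true parameter values (chosen arbitrarily)
x = seq(1, 10, length.out = 20) # fixed design points
b1s = replicate(5000, { # repeat the experiment 5000 times
  y = beta0 + beta1 * x + rnorm(length(x), 0, sigma)
  coef(lm(y ~ x))[2] # the slope estimate b1
})
mean(b1s) # close to beta1 = 2, illustrating E(b1) = beta1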

Numerical properties of fitted regression line


Recall the normal equations

−2 Σ_{i=1}^n (Yi − b0 − b1 Xi) = 0

−2 Σ_{i=1}^n Xi (Yi − b0 − b1 Xi) = 0

and ei = Yi − Ŷi = Yi − b0 − b1 Xi . It follows that

Σ_{i=1}^n ei = 0

Σ_{i=1}^n Xi ei = 0

The following properties follow (a numerical check in R is sketched after this list):

• Σ_{i=1}^n ei = 0

• Σ_{i=1}^n Yi = Σ_{i=1}^n Ŷi

• Σ_{i=1}^n ei² = min_{b0, b1} {Q}

• Σ_{i=1}^n Xi ei = 0

• Σ_{i=1}^n Ŷi ei = 0

• The regression line always passes through (X̄, Ȳ)

• Yi − Ȳ = β1 (Xi − X̄) + ε̃i , where ε̃i = εi − ε̄.

• The slope and the correlation coefficient satisfy

b1 = rX,Y (sY / sX)

where

sX = sqrt( Σ_{i=1}^n (Xi − X̄)² / (n − 1) ),   sY = sqrt( Σ_{i=1}^n (Yi − Ȳ)² / (n − 1) ),

rX,Y = Σ_{i=1}^n (Xi − X̄)(Yi − Ȳ) / sqrt( Σ_{i=1}^n (Xi − X̄)² Σ_{i=1}^n (Yi − Ȳ)² )
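
These properties are easy to check numerically for the fitted model above; a minimal sketch (small deviations from zero are rounding error):

e = myreg$residuals # fitted residuals
sum(e) # approximately 0
sum(X * e) # approximately 0
sum(myreg$fitted * e) # approximately 0
cor(X, Y) * sd(Y) / sd(X) # equals the slope estimate b1 = 0.7874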

2.1 Estimation of Error Terms Variance σ²

• Sum of squares of residuals or error sum of squares (SSE)

SSE = Σ_{i=1}^n (Yi − Ŷi)² = Σ_{i=1}^n ei²

• Estimate σ² by

s² = Σ_{i=1}^n (Yi − Ŷi)² / (n − 2) = Σ_{i=1}^n ei² / (n − 2)

called the mean squared error (MSE), i.e.

MSE = Σ_{i=1}^n ei² / (n − 2)

or denoted by σ̂².

Why is it divided by n − 2? Because there are TWO constraints on the ei , i = 1, ..., n, namely the two normal equations.

• s² is an unbiased estimator of σ², i.e. E(s²) = σ²

[Proof: For any ξ1 , ..., ξn IID with mean μ and variance σ², we have

E Σ_{i=1}^n (ξi − ξ̄)² = E Σ_{i=1}^n [(ξi − μ) − (ξ̄ − μ)]²

                      = E{ Σ_{i=1}^n (ξi − μ)² − n(ξ̄ − μ)² }

                      = Σ_{i=1}^n Var(ξi) − n Var(ξ̄)

                      = nσ² − σ²

                      = (n − 1)σ²

This is why we estimate σ² by

σ̂² = Σ_{i=1}^n (ξi − ξ̄)² / (n − 1).
Consider

Var(ξ1 − ξ̄) = Var{ (1 − 1/n)ξ1 − (1/n)ξ2 − ... − (1/n)ξn }

            = (1 − 1/n)²σ² + (1/n²)σ² + ... + (1/n²)σ²    (n − 1 terms of (1/n²)σ²)

            = (1 − 2/n + 1/n²)σ² + ((n − 1)/n²)σ²

            = (1 − 1/n)σ².

Similarly, for any i,

Var(ξi − ξ̄) = (1 − 1/n)σ².

Now turn to the estimator s². Since Yi − Ŷi = Yi − Ȳ − b1 (Xi − X̄) and E(Yi − Ŷi) = 0, consider

E{ Σ_{i=1}^n (Yi − Ŷi)² } = Σ_{i=1}^n E(Yi − Ŷi)² = Σ_{i=1}^n [ Var(Yi − Ŷi) + {E(Yi − Ŷi)}² ]

= Σ_{i=1}^n Var{ Yi − Ȳ − b1 (Xi − X̄) }

= Σ_{i=1}^n { Var(Yi − Ȳ) − 2 Cov(Yi − Ȳ, b1 (Xi − X̄)) + Var(b1)(Xi − X̄)² }

= Σ_{i=1}^n { Var(Yi − Ȳ) − 2 Cov((Yi − Ȳ)(Xi − X̄), b1) + Var(b1)(Xi − X̄)² }

= Σ_{i=1}^n { Var(εi − ε̄) − 2 Cov((Yi − Ȳ)(Xi − X̄), b1) + Var(b1)(Xi − X̄)² }

= (n − 1)σ² − 2 Cov( Σ_{i=1}^n (Yi − Ȳ)(Xi − X̄), b1 ) + Var(b1) Σ_{i=1}^n (Xi − X̄)²

= (n − 1)σ² − 2 Cov( b1 Σ_{i=1}^n (Xi − X̄)², b1 ) + Var(b1) Σ_{i=1}^n (Xi − X̄)²

= (n − 1)σ² − Var(b1) Σ_{i=1}^n (Xi − X̄)² = (n − 2)σ².

Thus

E(s²) = σ²

Example For the above example, the MSE (the estimator of σ² = Var(εi)) is

MSE = Σ_{i=1}^n ei² / (n − 2) = 1.975997,

or

σ̂ = √MSE = 1.405702,

which is also called the Residual standard error.


How do we find this value in the output of R?
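
A minimal sketch of recovering these quantities from the fitted object (summary(myreg)$sigma is the “Residual standard error”):

sum(myreg$residuals^2) / (10 - 2) # MSE = 1.975997
summary(myreg)$sigma # residual standard error = 1.405702
summary(myreg)$sigma^2 # equals the MSE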

3 Regression Without Predictors

At first glance, it doesn’t seem that studying regression without predictors would be very
useful. Certainly, we are not suggesting that using regression without predictors is a major
data analysis tool. We do think that it is worthwhile to look at regression models without
predictors to see what they can tell us about the nature of the constant. Understanding the
regression constant in these simpler models will help us to understand both the constant
and the other regression coefficients in later more complex models.
Model
Yi = β0 + εi , i = 1, 2, ..., n.

where as before, we assume

εi , i = 1, 2, ..., n are IID with E(εi ) = 0 and V ar(εi ) = σ 2

(We shall call this model Regression Without Predictors)


The least squares estimator b0 is the minimizer of

Q = Σ_{i=1}^n {Yi − b0}²

Note that

dQ/db0 = −2 Σ_{i=1}^n {Yi − b0}

Letting it equal 0, we have the normal equation

Σ_{i=1}^n {Yi − b0} = 0,

which leads to the (ordinary) least squares estimator

b0 = Ȳ.

The fitted model is

Ŷi = b0 .

The fitted residuals are

ei = Yi − Ŷi = Yi − Ȳ

• Can you prove the estimator is unbiased, i.e. Eb0 = β0 ?

• How do we estimate σ²?

σ̂² = (1/(n − 1)) Σ_{i=1}^n ei²

Why is it divided by n − 1?
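
In R, this regression without predictors can be fitted by regressing Y on the constant 1; a minimal sketch using the data above (the object name myreg0 is arbitrary):

myreg0 = lm(Y ~ 1) # regression without predictors (intercept only)
coef(myreg0) # b0, which equals mean(Y)
mean(Y)
summary(myreg0)$sigma^2 # equals sum(myreg0$residuals^2)/(n - 1)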

4 Inference in regression

Next, we consider the simple linear regression model

Y1 = β0 + β1 X1 + ε1

Y2 = β0 + β1 X2 + ε2
...                                                        (1)

Yn = β0 + β1 Xn + εn

under assumptions of normal random errors.

• Xi is known, observed, and nonrandom

• ε1 , ..., εn are independent N(0, σ²); thus Yi is random

• β0 , β1 and σ 2 are parameters.

By the assumption, we have

E(Yi ) = β0 + β1 Xi

and
V ar(Yi ) = σ 2

4.1 Inference of β1

We need to check whether β1 = 0 (or equals any other specified value, say −1.5). Why?

• To check whether X and Y have a linear relationship

• To see whether the model can be simplified (if β1 = 0, the model becomes Yi = β0 + εi ,
a regression model without predictors). For example, Hypotheses H0 : β1 = 0 v.s.
Ha : β1 ≠ 0

Sample distribution of b1 . Recall

b1 = Σ_{i=1}^n (Xi − X̄)(Yi − Ȳ) / Σ_{i=1}^n (Xi − X̄)²

Theorem 4.1 For model (1) with the normal assumption on εi ,

b1 ∼ N( β1 , σ² / Σ_{i=1}^n (Xi − X̄)² )

Proof Recall the fact that any linear combination of independent normally distributed random variables is still normal. To find the distribution of b1 , we only need to find its mean and variance. Since the Yi are normal and independent, b1 is normal, and

Eb1 = β1

and (we have proved that)

Var(b1) = σ² / Σ_{i=1}^n (Xi − X̄)²

The theorem follows.

Question: what is the distribution of b1 /√Var(b1) under H0 ? Can we use this Theorem
to test the hypothesis H0 ? Why?
Estimated Variance of b1 (estimating σ² by MSE):

s²(b1) = MSE / Σ_{i=1}^n (Xi − X̄)² = [ Σ_{i=1}^n ei² / (n − 2) ] / Σ_{i=1}^n (Xi − X̄)²

s(b1) is the Standard Error (or S.E.) of b1 (also called the standard deviation).
Sample distribution of (b1 − β1)/s(b1):

(b1 − β1)/s(b1) follows t(n − 2) for model (1)

Confidence interval for β1 . Let t(1 − α/2, n − 2) (also written t1−α/2 (n − 2)) denote the (1 − α/2)-quantile of t(n − 2). Then

P( t(α/2, n − 2) ≤ (b1 − β1)/s(b1) ≤ t(1 − α/2, n − 2) ) = 1 − α

By symmetry of the distribution, we have

t(1 − α/2, n − 2) = −t(α/2, n − 2)

Thus, with confidence 1 − α, we have

−t(1 − α/2, n − 2) ≤ (b1 − β1 )/s(b1 ) ≤ t(1 − α/2, n − 2)

i.e.
b1 − t(1 − α/2, n − 2) ∗ s(b1 ) ≤ β1 ≤ b1 + t(1 − α/2, n − 2) ∗ s(b1 )

Example 4.2 For the example above, find the 95% confidence interval for β1 .
Solution: Since n = 10, we have t(1 − 0.05/2, 8) = 2.306; the S.E. for b1 is s(b1) = 0.1343.
Thus the confidence interval is

b1 ± t(1 − 0.05/2, 8) ∗ s(b1 ) = 0.7874 ± 2.306 ∗ 0.1343 = [0.4777, 1.0971]
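
The same interval can be computed in R; a minimal sketch:

qt(1 - 0.05/2, df = 8) # t(0.975, 8) = 2.306
0.7874 + c(-1, 1) * 2.306 * 0.1343 # interval from the summary output
confint(myreg, "X", level = 0.95) # interval computed directly by R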

Test of β1

• Two-sided Test: to check whether β1 is 0

H0 : β1 = 0, Ha : β1 ≠ 0

Under H0 , we have the random variable

t = b1 / s(b1) ∼ t(n − 2)

Suppose the significance level is α (usually, 0.05, 0.01). Calculate t, say t∗

– If |t∗ | ≤ t(1 − α/2; n − 2), then accept H0 .

– If |t∗ | > t(1 − α/2; n − 2), then reject H0 .

The test can also be done based on the p-value, defined as p = P(|T| > |t∗|), where T ∼ t(n − 2). It is
easy to see that

p-value < α ⇐⇒ |t∗| > t(1 − α/2; n − 2)

Thus

– If p-value ≥ α, then accept H0 .

– If p-value < α, then reject H0 .

• One-sided test: for example to check whether β1 is positive (or negative)

H0 : β1 ≥ 0, Ha : β1 < 0

Under H0 , we have

t = b1 /s(b1) = (b1 − β1)/s(b1) + β1 /s(b1) ∼ t(n − 2) + a nonnegative term
Suppose the significance level is α (usually, 0.05, 0.01). Calculate t, say t∗

– If t∗ ≥ t(α; n − 2), then accept H0 .

– If t∗ < t(α; n − 2), then reject H0 .
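
For the example above, a minimal sketch of computing the two-sided and one-sided p-values for the slope with pt():

tstar = 0.7874 / 0.1343 # t statistic for H0: beta1 = 0
2 * (1 - pt(abs(tstar), df = 8)) # two-sided p-value, about 0.000378
pt(tstar, df = 8) # one-sided p-value for Ha: beta1 < 0 (large here, so H0 is kept)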

4.2 Inference about β0

Sample distribution of b0 :

b0 = Ȳ − b1 X̄

Theorem 4.3 For model (1) with the normal assumption on εi ,

b0 ∼ N( β0 , σ² [ 1/n + X̄² / Σ_{i=1}^n (Xi − X̄)² ] )

[Proof The expectation is

Eb0 = E{Ȳ} − E(b1)X̄ = (β0 + β1 X̄) − β1 X̄ = β0

Let ki = (Xi − X̄) / Σ_{j=1}^n (Xj − X̄)². Then (see the proof at the beginning of this part)

b1 = β1 + Σ_{i=1}^n ki εi .

Thus

b0 = Ȳ − b1 X̄ = β0 + (1/n) Σ_{i=1}^n εi − X̄ Σ_{i=1}^n ki εi = β0 + Σ_{i=1}^n [1/n − ki X̄] εi

The variance is

Var(b0) = Σ_{i=1}^n [1/n − ki X̄]² σ² = [ 1/n + X̄² / Σ_{i=1}^n (Xi − X̄)² ] σ²

(using Σ_{i=1}^n ki = 0 and Σ_{i=1}^n ki² = 1 / Σ_{i=1}^n (Xi − X̄)²).

Therefore the Theorem follows.]
Estimated Variance of b0 (by replacing σ² with MSE):

s²(b0) = MSE [ 1/n + X̄² / Σ_{i=1}^n (Xi − X̄)² ]

s(b0) is the Standard Error (or S.E.) of b0 (also called the standard deviation).
Sample distribution of (b0 − β0)/s(b0):

(b0 − β0)/s(b0) follows t(n − 2) for model (1)

Confidence interval for β0 : with confidence 1 − α, we have the confidence interval

b0 − t(1 − α/2, n − 2) ∗ s(b0) ≤ β0 ≤ b0 + t(1 − α/2, n − 2) ∗ s(b0)
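
A minimal sketch of computing s(b0) and this interval in R for the example above:

MSE = sum(myreg$residuals^2) / (10 - 2)
sb0 = sqrt(MSE * (1/10 + mean(X)^2 / sum((X - mean(X))^2))) # s(b0) = 0.9726
1.3931 + c(-1, 1) * qt(0.975, df = 8) * sb0 # confidence interval for beta0
confint(myreg, "(Intercept)", level = 0.95) # the same interval from R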

Test of β0

• Two-sided Test: to check whether β0 is 0

H0 : β0 = 0, Ha : β0 ≠ 0

Under H0 , we have

t = b0 / s(b0) ∼ t(n − 2)

Suppose the significance level is α (usually, 0.05, 0.01). Calculate t, say t∗ :

– If |t∗ | ≤ t(1 − α/2; n − 2), then accept H0 .

– If |t∗ | > t(1 − α/2; n − 2), then reject H0 .

Similarly, the test can also be done based on the p-value, defined as p = P(|T| > |t∗|), where T ∼ t(n − 2).
It is easy to see that

p-value < α ⇐⇒ |t∗ | > t(1 − α/2; n − 2)

Thus

– If p-value ≥ α, then accept H0 .

– If p-value < α, then reject H0 .

• One-sided test: to check whether β0 is positive (or negative)

H0 : β0 ≤ 0, Ha : β0 > 0

Example 4.4 For the example above, with significance level 0.05,

1. Test H0 : β0 = 0 versus H1 : β0 ≠ 0

2. Test H0 : β1 = 0 versus H1 : β1 ≠ 0

3. Test H0 : β0 ≥ 0 versus H1 : β0 < 0

Answer:

1. Since n = 10, t(0.975, 8) = 2.306 and |t∗| = 1.432 < 2.306. Thus, we accept H0

(another approach: p-value = 0.1899 > 0.05, we accept H0 )

2. The t-value is |t∗| = 5.862 > 2.306, thus we reject H0 , i.e. β1 is significantly different
from 0.

(another approach: p-value = 0.000378 < 0.05, we reject H0 )

3. t(0.05, 8) = −1.860; since t∗ = 1.432 > −1.860, we accept H0

How do we find these tests in the output of the R code?
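
A minimal sketch of pulling the relevant estimates, standard errors, t-values and two-sided p-values straight from the coefficient table, together with the critical values used above:

summary(myreg)$coefficients # Estimate, Std. Error, t value, Pr(>|t|) for (Intercept) and X
qt(0.975, df = 8) # two-sided critical value 2.306
qt(0.05, df = 8) # one-sided critical value -1.860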
