Multiple Linear Regression
The General Idea
Simple regression considers the relation between a single explanatory variable and a response variable
The General Idea
Multiple regression simultaneously considers the
influence of multiple explanatory variables on a
response variable Y
The intent is to look at the independent effect of each variable while "adjusting out" the influence of potential confounders
Y = β0 + β1X1 + β2X2 + ... + βpXp + ε
Regression Modeling
A simple regression model (one independent variable) fits a regression line in 2-dimensional space
A multiple regression
model with two
explanatory variables fits
a regression plane in 3-
dimensional space
Y = β0 + β1X1 + β2X2 + ... + βpXp + ε
β0 is the intercept; β1, ..., βp are the partial regression coefficients (slopes); ε is the residual.
Partial regression coefficient of X: the regression coefficient of X after controlling for (holding all other predictors constant) the influence of the other variables on both X and Y.
Multiple Regression Model
Intercept α predicts
where the regression
plane crosses the Y
axis
Slope for variable X1
(β1) predicts the
change in Y per unit X1
holding X2 constant
The slope for variable
X2 (β2) predicts the
change in Y per unit X2
holding X1 constant
[Venn diagram: unique variance in Y explained by X1, unique variance explained by X2, common variance explained by X1 and X2, and variance of Y NOT explained by X1 and X2]
Simple Regression Model
Regression coefficients are estimated by minimizing ∑residuals² (i.e., the sum of the squared residuals) to derive this model:
ŷ = b0 + b1x
The standard error of the regression (sY|x) is based on the squared residuals:
sY|x = √( ∑residuals² / (n − 2) )
Multiple Regression Model
Again, estimates for the multiple slope coefficients are derived by minimizing ∑residuals² to derive this multiple regression model:
ŷ = b0 + b1x1 + b2x2 + ... + bkxk
Again, the standard error of the regression is based on the ∑residuals²:
sY|x = √( ∑residuals² / (n − k − 1) )
Polynomial Model
ŷ = b0 + b1x + b2x² + ... + brx^r
Linear in the parameters, so it is still a linear model
A special case of multiple linear regression, obtained by setting x1 = x, x2 = x², ..., xr = x^r
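As a rough illustration (in Python, with made-up data), a polynomial model can be fit by building the powers of x as separate columns and running ordinary multiple linear regression on them:

```python
import numpy as np

# Made-up data following a rough quadratic trend.
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([1.2, 2.1, 5.3, 10.4, 17.0, 26.2])

r = 2  # degree of the polynomial
# Treat x, x^2, ..., x^r as separate predictors x1, ..., xr.
X = np.column_stack([x**j for j in range(r + 1)])  # columns: 1, x, x^2

# Ordinary least squares fit of the linear-in-parameters model.
b, *_ = np.linalg.lstsq(X, y, rcond=None)
print("b0, b1, ..., br =", b)
```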
Estimating coefficients
The matrix algebra of Ordinary Least Squares
Design matrix:
X = [ 1  x11  x21  …  xk1
      1  x12  x22  …  xk2
      ⋮   ⋮    ⋮         ⋮
      1  x1n  x2n  …  xkn ]
Intercept and slopes:  β = (X′X)⁻¹ X′Y
Predicted values:      Y′ = Xβ
Residuals:             Y − Y′
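A minimal numpy sketch of these matrix formulas, using small made-up data (x1, x2, y are hypothetical):

```python
import numpy as np

# Hypothetical data: n = 6 observations, k = 2 predictors.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
x2 = np.array([2.0, 1.0, 4.0, 3.0, 6.0, 5.0])
y  = np.array([3.1, 3.9, 7.2, 7.8, 11.1, 11.9])

# Design matrix: a column of ones (intercept) plus one column per predictor.
X = np.column_stack([np.ones_like(x1), x1, x2])

# beta = (X'X)^-1 X'Y; solve() is more stable than forming the inverse explicitly.
beta = np.linalg.solve(X.T @ X, X.T @ y)

y_pred = X @ beta        # predicted values  Y' = X beta
residuals = y - y_pred   # residuals         Y - Y'

print("coefficients:", beta)
print("residuals:", residuals)
```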
Example 12.3
Regression Statistics
How good is our model?
SST = ∑(Y − Ȳ)²
SSR = ∑(Y′ − Ȳ)²
SSE = ∑(Y − Y′)²
SST = SSR + SSE
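A short sketch (with hypothetical data) showing how SST, SSR and SSE are computed from a least squares fit, and that they satisfy SST = SSR + SSE:

```python
import numpy as np

# Hypothetical data and a least squares fit (intercept + one predictor).
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([3.1, 3.9, 7.2, 7.8, 11.1, 11.9])
X = np.column_stack([np.ones_like(x), x])
b, *_ = np.linalg.lstsq(X, y, rcond=None)
y_hat = X @ b  # fitted values Y'

sst = np.sum((y - y.mean()) ** 2)      # SST: total sum of squares
ssr = np.sum((y_hat - y.mean()) ** 2)  # SSR: regression sum of squares
sse = np.sum((y - y_hat) ** 2)         # SSE: error (residual) sum of squares

print(sst, ssr + sse)          # SST = SSR + SSE for a least squares fit with intercept
print("R^2 =", ssr / sst)
```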
The Regression Picture
ŷi = βxi + α
[Figure: for each observation, A = distance of yi from the mean ȳ, B = distance of ŷi from ȳ, C = distance of yi from the regression line.]
Least squares estimation gave us the line (β) that minimized C²:
∑(yi − ȳ)² = ∑(ŷi − ȳ)² + ∑(ŷi − yi)²
A² = SStotal: total squared distance of observations from the naïve mean of y (total variation)
B² = SSreg: distance from the regression line to the naïve mean of y (variability due to x, the regression)
C² = SSresidual: variance around the regression line; additional variability not explained by x, which the least squares method aims to minimize
R² = SSreg / SStotal
ANOVA
H0: β1 = β2 = ... = βk = 0
HA: βi ≠ 0 for at least one i

Source       df          SS     MS        F          P-value
Regression   k           SSR    SSR/df    MSR/MSE    P(F)
Residual     n − k − 1   SSE    SSE/df
Total        n − 1       SST
If P(F) < α then we know that we get significantly better prediction of Y from the regression model than by just predicting the mean of Y.
ANOVA to test significance of regression
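A sketch of this F test in Python, assuming y are the observed responses, y_hat the fitted values from a least squares fit with an intercept, and k the number of predictors (scipy is used for the F tail probability):

```python
import numpy as np
from scipy import stats

def regression_anova(y, y_hat, k):
    """Overall F test of H0: beta_1 = ... = beta_k = 0 for a least squares
    fit with an intercept. y: observed responses, y_hat: fitted values,
    k: number of predictors."""
    n = len(y)
    ssr = np.sum((y_hat - np.mean(y)) ** 2)   # regression sum of squares
    sse = np.sum((y - y_hat) ** 2)            # residual sum of squares
    msr = ssr / k
    mse = sse / (n - k - 1)
    f = msr / mse
    p_value = stats.f.sf(f, k, n - k - 1)     # P(F > f)
    return f, p_value

# Usage (with y and y_hat from a fitted model): f, p = regression_anova(y, y_hat, k=3)
```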
If we revisit Example 12.3 and carry out the ANOVA:
f = 30.98 and the p-value is less than 0.0001
How to interpret the result:
The regression is significant
This model is not the only model that can be used to explain the data
The model may have been more effective with inclusion or deletion of variables
Regression Statistics
R² = 1 − SSE/SST = SSR/SST
Coefficient of multiple determination, used to judge the adequacy of the regression model.
Drawback of this concept: one can always increase the value of the coefficient of determination by including more independent variables.
Regression Statistics
R²adj = 1 − [SSE/(n − k − 1)] / [SST/(n − 1)] = 1 − (1 − R²)(n − 1)/(n − k − 1)
n = sample size
k = number of independent variables
Adjusted R² is not biased!
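A one-line helper for the adjusted R² formula above (the numbers in the example call are made up):

```python
def adjusted_r2(r2, n, k):
    """Adjusted coefficient of determination for n observations and
    k independent variables."""
    return 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)

# Example with made-up numbers: R^2 = 0.95, n = 13 observations, k = 3 predictors.
print(adjusted_r2(0.95, 13, 3))   # about 0.933
```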
Revisit example 12.3
Properties of the least squares estimator
Under the model assumption that the random errors ε1, ε2, ..., εn are iid, we have that b0, b1, ..., bk are unbiased estimators of the regression coefficients β0, β1, ..., βk.
The elements of the matrix (X′X)⁻¹σ² give the variances of the estimators on the main diagonal and the covariances on the off-diagonal:
σ²bi = Cii σ²
σbibj = cov(bi, bj) = Cij σ², for i ≠ j
where C = (X′X)⁻¹.
Regression Statistics
Standard Error for the regression model
Se² = σ̂² = SSE/(n − k − 1), where SSE = ∑(Y − Y′)²
Se² = MSE
Hypothesis Tests for Regression Coefficients
H0: βi = βi0
H1: βi ≠ βi0
t(n−k−1) = (bi − βi0) / se(bi) = (bi − βi0) / √(Cii Se²)
(for simple regression, se(b1) = √(Se² / Sxx))
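A sketch of this t test in Python, where c_ii is the i-th diagonal element of (X'X)⁻¹ and se2 is Se² (all argument names are illustrative):

```python
import numpy as np
from scipy import stats

def coefficient_t_test(b_i, beta_i0, c_ii, se2, df):
    """Two-sided t test of H0: beta_i = beta_i0.
    b_i: estimated coefficient, c_ii: i-th diagonal element of (X'X)^-1,
    se2: S_e^2 = SSE/(n-k-1), df: n - k - 1 degrees of freedom."""
    t = (b_i - beta_i0) / np.sqrt(c_ii * se2)
    p_value = 2 * stats.t.sf(abs(t), df)   # two-sided p-value
    return t, p_value
```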
Considering the importance of X3 in Example 12.3:
H0: β3 = 0
H1: β3 ≠ 0
We test using the t-distribution with 9 degrees of freedom.
We cannot reject the null hypothesis: the variable is insignificant in the presence of the other regressors in the model.
Confidence Interval on Regression Coefficients
bi − tα/2,(n−k−1) √(Se² Cii) ≤ βi ≤ bi + tα/2,(n−k−1) √(Se² Cii)
Confidence interval for βi
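A short sketch of computing this interval (argument names are illustrative, as above):

```python
import numpy as np
from scipy import stats

def coefficient_ci(b_i, c_ii, se2, df, alpha=0.05):
    """100(1 - alpha)% confidence interval for beta_i.
    c_ii: i-th diagonal element of (X'X)^-1, se2: S_e^2, df: n - k - 1."""
    half_width = stats.t.ppf(1 - alpha / 2, df) * np.sqrt(se2 * c_ii)
    return b_i - half_width, b_i + half_width
```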
Hypothesis Tests for Regression Coefficients: F test
Regression sum of squares if one variable, X1, is removed from the regression model:
SSR(β1 | β2, β3, ..., βk) = SSR(β1, β2, β3, ..., βk) − SSR(β2, β3, ..., βk)
H0: β1 = 0
H1: β1 ≠ 0    (Example 12.3)
F = SSR(β1 | β2, β3, ..., βk) / Se²
Compare it with Fα,1,n−k−1.
Hypothesis Tests for Regression Coefficients: F test
H0: β1 = β2 = 0
H1: β1 ≠ 0 or β2 ≠ 0
F = [SSR(β1, β2 | β3, ..., βk) / 2] / Se²
Comparing it with Fα,2,n−k−1.
Confidence Interval on the mean response
t-statistic with n − k − 1 degrees of freedom.
At x0 = (1, x10, x20, ..., xk0)′ the estimated mean response is ŷ0 = x0′β.
A 100(1 − α)% confidence interval for the mean response:
ŷ0 − tα/2,n−k−1 √(Se² x0′(X′X)⁻¹x0) ≤ μY|x0 ≤ ŷ0 + tα/2,n−k−1 √(Se² x0′(X′X)⁻¹x0)
Confidence Interval on an observed response
t-statistic with n − k − 1 degrees of freedom.
A 100(1 − α)% prediction interval for a future observation at x0:
ŷ0 − tα/2,n−k−1 √(Se² (1 + x0′(X′X)⁻¹x0)) ≤ y0 ≤ ŷ0 + tα/2,n−k−1 √(Se² (1 + x0′(X′X)⁻¹x0))
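A sketch combining both intervals, assuming X is the design matrix (with the leading column of ones), b the estimated coefficients, se2 = Se², and x0 the new predictor vector including the leading 1 (all names illustrative):

```python
import numpy as np
from scipy import stats

def response_intervals(x0, X, b, se2, alpha=0.05):
    """Confidence interval for the mean response and prediction interval for a
    new observation at x0. X: design matrix with leading column of ones,
    b: estimated coefficients, se2: S_e^2, x0: new row including the leading 1."""
    n, p = X.shape                          # p = k + 1 columns
    df = n - p                              # n - k - 1
    y0_hat = x0 @ b
    h = x0 @ np.linalg.solve(X.T @ X, x0)   # x0'(X'X)^-1 x0
    t = stats.t.ppf(1 - alpha / 2, df)
    mean_hw = t * np.sqrt(se2 * h)          # half-width for the mean response
    pred_hw = t * np.sqrt(se2 * (1 + h))    # half-width for a new observation
    return (y0_hat - mean_hw, y0_hat + mean_hw), (y0_hat - pred_hw, y0_hat + pred_hw)
```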
Orthogonality
In a designed experiment in which the variables Xp and Xq are orthogonal, the contribution of each individual variable in explaining the variance is readily given.
Qualitative variables
Qualitative variables provide information on discrete characteristics
The number of categories taken by a qualitative variable is generally small.
These can be numerical values, but each number denotes an attribute – a characteristic.
A qualitative variable may have several categories
Two categories: male – female
Three categories: nationality (French, German, Turkish)
More than three categories: sectors (car, chemical, steel, electronic equip., etc.)
Qualitative variables
There are several ways to code qualitative variables with n categories:
Using one categorical variable
Producing n − 1 dummy variables
A dummy variable is a variable that takes the value 0 or 1.
We also call them binary variables, or dichotomous variables.
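A small illustration of producing n − 1 dummy variables with pandas (the "sector" column is made-up data):

```python
import pandas as pd

# Made-up qualitative variable with three categories.
df = pd.DataFrame({"sector": ["car", "chemical", "steel", "car", "steel"]})

# n - 1 = 2 dummy (0/1) variables; the dropped category "car" is the reference level.
dummies = pd.get_dummies(df["sector"], drop_first=True)
print(dummies)
```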
Stepwise regression
Avoiding predictors (Xs) that do not contribute significantly to model prediction
- Forward selection
The ‘best’ predictor variables are entered, one by one.
- Backward elimination
The ‘worst’ predictor variables are eliminated, one by one.
Forward selection
Step 1: Do simple linear regressions of y vs. each x variable individually. Select the x variable with the largest value of R². (Suppose it is X1.)
Step 2: Do all possible 2-variable regressions in which one of the two variables is X1. Choose the variable that, when inserted, gives the largest increase in R². (Suppose it is X2.)
Step 3: Do all possible 3-variable regressions in which two of the three variables are X1 and X2. Choose the variable that gives the largest increase in R².
Repeat the process until the most recently inserted variable fails to induce a significant increase in the explained regression. Such an increase can be determined by using an appropriate F-test or t-test. A sketch of this procedure in code appears below.
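A rough sketch of forward selection in Python. It greedily adds the predictor with the largest R² increase and stops when the increase falls below a threshold; the threshold (min_increase) is a simplification standing in for the F-test or t-test mentioned above:

```python
import numpy as np

def r2(X, y):
    """R^2 of a least squares fit with an intercept."""
    X1 = np.column_stack([np.ones(len(y)), X])
    b, *_ = np.linalg.lstsq(X1, y, rcond=None)
    y_hat = X1 @ b
    return 1 - np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)

def forward_selection(X, y, min_increase=0.01):
    """Greedily add the column of X giving the largest increase in R^2,
    stopping when the increase drops below min_increase (a simple threshold
    standing in for the F test / t test)."""
    selected, best_r2 = [], 0.0
    remaining = list(range(X.shape[1]))
    while remaining:
        new_r2, best_j = max((r2(X[:, selected + [j]], y), j) for j in remaining)
        if new_r2 - best_r2 < min_increase:
            break
        selected.append(best_j)
        remaining.remove(best_j)
        best_r2 = new_r2
    return selected, best_r2
```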
Why use logistic regression?
There are many important research topics for which the dependent variable is "limited."
For example: voting, marketing, and participation data are not continuous or normally distributed.
Binary logistic regression is a type of regression analysis where the dependent variable is a dummy variable: coded 0 (did not vote) or 1 (did vote).
The Linear Probability Model
In the ordinary least squares regression:
Y = α + βX + ε, where Y = (0, 1)
ε is not normally distributed because Y takes on only two values
The predicted probabilities can be greater
than 1 or less than 0
The Logistic Regression Model
The "logit" model solves these problems:
ln[p/(1-p)] = α + βX + e
p is the probability that the event Y occurs,
p(Y=1)
p/(1-p) is the "odds ratio"
ln[p/(1-p)] is the log odds ratio, or "logit"
More:
The logistic distribution constrains the
estimated probabilities to lie between 0 and 1.
The estimated probability is:
p = 1/[1 + exp(−α − βX)]
If you let α + βX = 0, then p = 0.50
As α + βX gets really big, p approaches 1
As α + βX gets really small, p approaches 0
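A tiny numerical check of these statements (the alpha and beta values are arbitrary):

```python
import numpy as np

def logistic_p(alpha, beta, x):
    """Estimated probability p = 1 / (1 + exp(-(alpha + beta*x)))."""
    return 1.0 / (1.0 + np.exp(-(alpha + beta * x)))

print(logistic_p(0.0, 1.0, 0.0))     # alpha + beta*x = 0            -> p = 0.5
print(logistic_p(0.0, 1.0, 10.0))    # large alpha + beta*x          -> p near 1
print(logistic_p(0.0, 1.0, -10.0))   # very negative alpha + beta*x  -> p near 0
```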
What if β = 0 or infinity?
Maximum Likelihood Estimation
(MLE)
MLE is a statistical method for estimating the
coefficients of a model.
The likelihood function (L) measures the probability of observing the particular set of dependent variable values (p1, p2, ..., pn) that occur in the sample:
L = Prob(p1 * p2 * ... * pn)
The higher the L, the higher the probability of observing the ps in the sample.
MLE involves finding the coefficients (α, β) that make the log of the likelihood function (LL < 0) as large as possible
Or, it finds the coefficients that make −2 times the log of the likelihood function (−2LL) as small as possible
The maximum likelihood estimates can be obtained by differentiating the log-likelihood function with respect to α and β and setting the partial derivatives equal to zero
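A minimal sketch of this idea: write −2LL for the binary logit model and minimize it numerically with scipy (the x and y data are made up; in practice the derivative conditions above are solved by such numerical optimizers):

```python
import numpy as np
from scipy.optimize import minimize

# Made-up binary data (not from the slides).
x = np.array([0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0])
y = np.array([0,   0,   0,   1,   0,   1,   1,   1])

def neg2_log_likelihood(params):
    alpha, beta = params
    p = 1.0 / (1.0 + np.exp(-(alpha + beta * x)))
    # log L = sum[ y*ln(p) + (1-y)*ln(1-p) ]; we minimize -2LL.
    return -2.0 * np.sum(y * np.log(p) + (1 - y) * np.log(1 - p))

result = minimize(neg2_log_likelihood, x0=np.array([0.0, 0.0]))
print("alpha, beta =", result.x)
```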
Interpreting Coefficients
Since:
ln[p/(1-p)] = α + βX + e
the slope coefficient (β) is interpreted as the rate of change in the "log odds" as X changes … not very useful.
Since:
p = 1/[1 + exp(−α − βX)]
the marginal effect of a change in X on the probability is:
∂p/∂X = β p (1 − p)
An interpretation of the logit coefficient which
is usually more intuitive is the "odds ratio"
Since:
p/(1-p) = exp(α + βX)
exp(β) is the effect of the independent variable on the "odds ratio"