LINEAR REGRESSION
Nehal Khosla, Priyanshu Parida
National Institute of Science Education and Research
January 30, 2023
PART I: LINEAR REGRESSION: PROBLEM & SOLUTION
1 Regression
2 Linear Regression: The Problem
2.1 Introduction
2.2 Regression Line
2.3 Types
2.4 Mathematical Representation
2.5 Loss Function: Mean Squared Error
3 Linear Regression: The Solution
3.1 Solution
3.2 Quality of Fit
PART II: LINEAR REGRESSION: THE INDUCTIVE BIAS
1 Inductive Bias
1.1 A List
1.2 Choice of Loss Function
PART III: LINEAR REGRESSION: APPLICATIONS AND SHORTCOMINGS
1 Applications and Shortcomings
Part I
LINEAR REGRESSION: THE PROBLEM & SOLUTION
REGRESSION
▶ Regression is a statistical method that estimates the strength and nature of the relationship
between a dependent variable and one or more independent variables.
▶ It does so by finding a curve that minimizes the error between the actual and predicted values of
the dependent variable on the training data set.
▶ For proper interpretation of regression, several assumptions (inductive bias) about the data and
the model must hold.
▶ Linear regression is one of the most common forms of this method. It assumes a linear
relationship between the dependent and independent variables.
LINEAR REGRESSION
INTRODUCTION
▶ Linear regression is a supervised machine learning algorithm.
▶ The model tries to find a best-fit line that captures the linear relationship between the dependent
(y) and independent (x) variables.
▶ The model then uses this fit to predict the appropriate y-values for unseen x-values.
▶ The best-fit line is obtained by minimizing the error between the predicted and actual values,
i.e. by minimizing a loss function.
LINEAR REGRESSION
REGRESSION LINE
The line showing the linear relationship between the dependent and independent variables is called
a regression line. An example of a regression line is shown below:
[Figure: regression line for log R (dependent variable) vs log d (independent variable).]
The regression line may be positive, where the dependent variable increases as the independent
variable increases; or it may be negative, where the dependent variable decreases as the
independent variable increases (as in the figure above).
LINEAR REGRESSION: THE PROBLEM
TYPES
Linear regression may be classified further into the following two types:
▶ Simple Linear Regression: Assumes a linear relationship between a single independent
variable and a dependent variable.
▶ Multiple Linear Regression: Assumes a linear relationship between two or more independent
variables and a dependent variable.
LINEAR REGRESSION: THE PROBLEM
MATHEMATICAL REPRESENTATION
Once a linear relationship has been determined by the algorithm, the general form of each model
may be represented as follows:
▶ Simple Linear Regression
y = ax + b + u
▶ Multiple Linear Regression
y = a1x1 + a2x2 + ... + anxn + b + u
where:
y = Dependent variable
x = Independent variable
a = Slope(s) of the variable(s)
b = The y-intercept
u = The regression residual/error term
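As a quick illustration, here is a minimal sketch (with made-up coefficient values, not taken from the slides) of how the two model forms produce predictions once the parameters a and b are known:

    import numpy as np

    # Simple linear regression: y = a*x + b (assumed slope a = 2, intercept b = 1)
    a, b = 2.0, 1.0
    x = np.array([0.0, 1.0, 2.0])
    y_simple = a * x + b                      # -> [1. 3. 5.]

    # Multiple linear regression: y = a1*x1 + a2*x2 + a3*x3 + b
    coeffs = np.array([0.5, -1.2, 3.0])       # assumed slopes a1, a2, a3
    X = np.array([[1.0, 2.0, 0.5],
                  [0.0, 1.0, 1.5]])           # two samples, three features
    y_multiple = X @ coeffs + b               # one prediction per sample

    print(y_simple)
    print(y_multiple)

The residual term u is not computed here: it represents the part of y that the model does not explain.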
LINEAR REGRESSION: THE PROBLEM
LOSS FUNCTION: MEAN SQUARED ERROR
▶ The regression line is obtained by minimizing the mean squared error (the loss function) over
all points in the training set. The loss function is given as:
MSE = (1/N) Σ (yi − f(xi))²
where f(x) = a1x1 + a2x2 + ... + anxn + b
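A minimal sketch of this loss on made-up data (the predictions are assumed to come from an already-fitted model):

    import numpy as np

    y_true = np.array([1.1, 2.0, 2.9, 4.2])   # observed values y
    y_pred = np.array([1.0, 2.0, 3.0, 4.0])   # model outputs f(x)

    mse = np.mean((y_true - y_pred) ** 2)     # (1/N) * sum of squared errors
    print(mse)                                # 0.015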
LINEAR REGRESSION: THE SOLUTION
SOLUTION
The best-fit line may be found in the following two ways:
▶ Closed form (Exact form) Solution:
• It solves the problem in terms of simple functions and mathematical operators.
• The closed form solution for linear regression is as follows:
B = (X′X)⁻¹ X′Y
where B = Matrix of regression parameters
X = Matrix of X values
X’ = Transpose of X
Y = Matrix of Y values
• Although this method gives an exact solution, inverting X′X is computationally expensive when
the number of dimensions (independent variables) grows large.
▶ Gradient Descent:
• It is an iterative optimization algorithm that minimizes the MSE by repeatedly stepping against
the gradient of the loss function (a sketch of both approaches is given below).
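The following is a minimal sketch of both approaches on synthetic one-dimensional data; the true slope 3, intercept 2, learning rate, and iteration count are all assumptions made for illustration:

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.uniform(0, 10, size=50)
    y = 3.0 * x + 2.0 + rng.normal(scale=1.0, size=50)   # noisy line

    X = np.column_stack([x, np.ones_like(x)])            # column of ones -> intercept

    # Closed-form (normal equation): B = (X'X)^(-1) X'Y
    B_closed = np.linalg.solve(X.T @ X, X.T @ y)

    # Gradient descent: repeatedly step against the gradient of the MSE
    B = np.zeros(2)
    learning_rate = 0.01
    for _ in range(5000):
        gradient = (2.0 / len(y)) * X.T @ (X @ B - y)    # d(MSE)/dB
        B -= learning_rate * gradient

    print(B_closed)   # roughly [3, 2]
    print(B)          # should be close to the closed-form result

Using np.linalg.solve rather than an explicit matrix inverse is standard numerical practice; both give the same normal-equation solution.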
LINEAR REGRESSION: THE SOLUTION
QUALITY OF FIT
▶ The goodness of the fit measures how strongly the variables are linearly correlated.
▶ The goodness of fit may be calculated using the Pearson correlation coefficient, which is given
by:
r = Σ(xi − x̄)(yi − ȳ) / √( Σ(xi − x̄)² Σ(yi − ȳ)² )
▶ The closer |r| is to 1, the better the fit.
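A minimal sketch of this coefficient on made-up data, computed directly from the formula and cross-checked against NumPy's built-in np.corrcoef:

    import numpy as np

    x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
    y = np.array([2.1, 3.9, 6.2, 8.0, 9.8])

    num = np.sum((x - x.mean()) * (y - y.mean()))
    den = np.sqrt(np.sum((x - x.mean()) ** 2) * np.sum((y - y.mean()) ** 2))
    r = num / den

    print(r)                        # close to 1: strong linear relationship
    print(np.corrcoef(x, y)[0, 1])  # same value from NumPy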
Part II
LINEAR REGRESSION: THE INDUCTIVE BIAS
INDUCTIVE BIAS
A LIST
Linear regression makes the following assumptions, or inductive biases:
▶ The assumption that the dependent and independent variables are linearly related.
▶ Homoscedasticity: The assumption that the variance of the error term is the same for all points.
▶ The assumption that MSE is the most appropriate loss function for linear regression.
CHOICE OF LOSS FUNCTION
Let us analyse some loss functions to justify the choice of MSE as an appropriate loss function.
▶ L1 = (y − f(x)): This loss takes both positive and negative values, which cancel out and give a
near-zero total error even on a poorly fitting line.
▶ L2 = |y − f(x)|: Errors no longer cancel out, but outliers are penalised only in proportion to their
size, the same as ordinary points.
▶ L3 = (y − f(x))²: Errors do not cancel out, and outliers are penalised more heavily, giving a
more appropriate regression line.
Hence, MSE is an appropriate choice for loss function.
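A minimal sketch of the three candidate losses on made-up residuals (y − f(x) values), illustrating the cancellation and outlier behaviour described above:

    import numpy as np

    residuals = np.array([2.0, -2.0, 0.5, -0.5, 10.0])   # last value is an outlier

    l1 = np.sum(residuals)             # signed errors: positives and negatives cancel
    l2 = np.sum(np.abs(residuals))     # absolute errors: outlier counted once
    l3 = np.sum(residuals ** 2)        # squared errors: outlier dominates the total

    print(l1, l2, l3)                  # 10.0 15.0 108.5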
Part III
LINEAR REGRESSION: APPLICATIONS AND SHORTCOMINGS
APPLICATIONS AND SHORTCOMINGS
▶ Linear regression finds its applications in several fields, like market analysis, financial analysis,
environmental health, and medicine.
▶ However, it does leave some things to be desired. A linear correlation does not indicate
causation, i.e. a connection between two variables does not imply that one causes the other.
▶ Linear regression is sensitive to noise and prone to overfitting.
▶ It is prone to multicollinearity, i.e. the occurrence of correlation between two or more independent
variables, which reduces the statistical significance of the individual variables.