Regularization in Machine Learning
Introduction
Regularization is a technique used in machine learning to prevent overfitting by adding a penalty on the
model's complexity to the training objective. This improves the model's generalization performance on
unseen data.
Types of Regularization
L1 Regularization (Lasso)
L1 regularization adds the sum of the absolute values of the coefficients as a penalty to the loss function.
This encourages sparsity by shrinking some weights to exactly zero, which can effectively perform feature
selection.
Loss = Original Loss + λ Σ_{j=1}^{n} |w_j|    (1)
Properties of L1 Regularization:
• Promotes sparsity in the model parameters, leading to a sparse solution where some weights are exactly
zero.
• Performs feature selection, as irrelevant features will have zero coefficients.
• Useful in high-dimensional datasets with many irrelevant or redundant features.
• The penalty term is not differentiable at zero, making the objective slightly harder to optimize than
with L2 regularization.
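A minimal sketch of this sparsity effect, assuming scikit-learn (its alpha parameter plays the role of λ,
and the synthetic data are purely illustrative):

import numpy as np
from sklearn.linear_model import Lasso

# Synthetic data: 10 features, of which only the first 3 are informative.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))
true_w = np.array([3.0, -2.0, 1.5] + [0.0] * 7)
y = X @ true_w + 0.1 * rng.normal(size=100)

# alpha corresponds to λ in equation (1).
lasso = Lasso(alpha=0.1)
lasso.fit(X, y)

# The coefficients of the irrelevant features are typically shrunk to exactly zero.
print(lasso.coef_)

Inspecting lasso.coef_ shows which features survive the penalty, which is the feature-selection behaviour
described above.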
L2 Regularization (Ridge)
L2 regularization adds the sum of the squared values of the coefficients as a penalty to the loss function. It
encourages smaller coefficients by penalizing large weights, without driving them to exactly zero.
Loss = Original Loss + λ Σ_{j=1}^{n} w_j^2    (2)
Properties of L2 Regularization:
• Shrinks weights towards zero but does not eliminate them (no sparsity).
• Does not perform feature selection, since weights are never exactly zero.
• Can result in complex models in high-dimensional settings.
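For contrast, a sketch of ridge regression on the same kind of data (again assuming scikit-learn, with
alpha playing the role of λ in equation (2)):

import numpy as np
from sklearn.linear_model import Ridge

# Same setup as the Lasso sketch: only the first 3 features are informative.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))
true_w = np.array([3.0, -2.0, 1.5] + [0.0] * 7)
y = X @ true_w + 0.1 * rng.normal(size=100)

ridge = Ridge(alpha=1.0)
ridge.fit(X, y)

# All 10 coefficients are shrunk towards zero, but none is exactly zero.
print(ridge.coef_)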
Tuning Regularization
The strength of regularization is controlled by a hyperparameter λ, which determines the trade-off between
model complexity and fit to the data. Higher λ leads to stronger regularization (simpler models), while lower
λ allows for more complex models.
Hyperparameter tuning is typically done with cross-validation: candidate values of λ are evaluated on
held-out folds, and the value with the best validation performance is selected, as in the sketch below.
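A minimal sketch of this selection procedure, assuming scikit-learn's LassoCV (the grid of candidate
values is arbitrary):

import numpy as np
from sklearn.linear_model import LassoCV

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 0.1 * rng.normal(size=100)

# Evaluate a grid of candidate λ (alpha) values with 5-fold cross-validation.
model = LassoCV(alphas=np.logspace(-4, 1, 50), cv=5)
model.fit(X, y)

# The λ with the best average validation performance across folds.
print("selected lambda:", model.alpha_)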
Conclusion
Regularization is essential for building machine learning models that generalize well. Ridge (L2) and Lasso
(L1) regularization are widely used techniques:
• L1 Regularization (Lasso): Promotes sparsity, useful for feature selection.
• L2 Regularization (Ridge): Shrinks weights evenly, handles multicollinearity.
Elastic Net offers a compromise between L1 and L2 regularization, making it useful for datasets with
many correlated features.
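A sketch of Elastic Net, again assuming scikit-learn (l1_ratio=0.5 weights the two penalties equally and
is an arbitrary choice here):

import numpy as np
from sklearn.linear_model import ElasticNet

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 10))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 0.1 * rng.normal(size=100)

# l1_ratio interpolates between pure L2 (0.0) and pure L1 (1.0) penalties.
enet = ElasticNet(alpha=0.1, l1_ratio=0.5)
enet.fit(X, y)
print(enet.coef_)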