Gradient Descent

Gradient descent is a first-order optimization algorithm. To find a local minimum of a function using gradient descent, one takes steps proportional to the negative of the gradient (or of the approximate gradient) of the function at the current point. If instead one takes steps proportional to the positive of the gradient, one approaches a local maximum of that function; the procedure is then known as gradient ascent. Gradient descent is also known as steepest descent, or the method of steepest descent. When known by the latter name, gradient descent should not be confused with the method of steepest descent for approximating integrals.

Gradient descent is based on the observation that if the multivariable function F(x) is defined and differentiable in a neighborhood of a point a, then F(x) decreases fastest if one goes from a in the direction of the negative gradient of F at a, −∇F(a). It follows that, if

b = a − γ∇F(a)

for γ small enough, then F(a) ≥ F(b). With this observation in mind, one starts with a guess x_0 for a local minimum of F, and considers the sequence x_0, x_1, x_2, … such that

x_{n+1} = x_n − γ_n ∇F(x_n),   n ≥ 0.

We have

F(x_0) ≥ F(x_1) ≥ F(x_2) ≥ ⋯,

so hopefully the sequence (x_n) converges to the desired local minimum. Note that the value of the step size γ is allowed to change at every iteration. With certain assumptions on the function F and particular choices of γ, convergence to a local minimum can be guaranteed.
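As a concrete illustration of the iteration x_{n+1} = x_n − γ_n ∇F(x_n), here is a minimal sketch in Python; the example function F(x) = (x − 3)^2, the fixed step size, and the stopping tolerance are illustrative assumptions rather than anything specified in the text.

```python
# Minimal sketch of gradient descent: repeatedly step against the gradient.
# F(x) = (x - 3)**2 and its gradient 2*(x - 3) are illustrative choices.

def gradient_descent(grad_F, x0, gamma=0.1, tol=1e-8, max_iter=1000):
    """Iterate x <- x - gamma * grad_F(x) until the update is below tol."""
    x = x0
    for _ in range(max_iter):
        step = gamma * grad_F(x)
        x = x - step
        if abs(step) < tol:      # updates have become negligible
            break
    return x

# F(x) = (x - 3)^2 has its minimum at x = 3.
minimum = gradient_descent(lambda x: 2.0 * (x - 3.0), x0=0.0)
print(minimum)   # prints a value very close to 3.0
```

Because this F is convex and the fixed step size is small enough, each iterate decreases F, mirroring the chain of inequalities above.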

When the function F is convex, all local minima are also global minima, so in this case gradient descent can converge to the global solution. To visualize the process, suppose F is defined on the plane and its graph has a bowl shape. The contour lines are the curves along which the value of F is constant, and the negative gradient at a point is orthogonal to the contour line passing through that point. Following the negative gradient step by step therefore leads to the bottom of the bowl, that is, to the point where the value of F is minimal.
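To make the bowl picture concrete, the short sketch below runs the same update on a convex quadratic of two variables; the particular function F(x, y) = x^2 + 2y^2, the starting point, and the step size are assumptions chosen only for illustration.

```python
# F(x, y) = x**2 + 2*y**2 is convex, so its only local minimum, at the
# origin, is also the global minimum. Gradient descent should reach it
# from any starting point, given a small enough step size.

def grad_F(x, y):
    return 2.0 * x, 4.0 * y    # gradient of F(x, y) = x^2 + 2y^2

x, y = 3.0, -2.0               # arbitrary starting point (assumption)
gamma = 0.1                    # fixed step size (assumption)

for _ in range(200):
    gx, gy = grad_F(x, y)
    x, y = x - gamma * gx, y - gamma * gy

print(x, y)   # both coordinates end up very close to 0
```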

Applications

Gradient descent is a popular algorithm for training a wide range of models in machine learning, including (linear) support vector machines, logistic regression and graphical models. It competes with the L-BFGS algorithm, which is also widely used. Stochastic gradient descent (SGD) has been used since at least 1960 for training linear regression models, originally under the name ADALINE. When combined with the backpropagation algorithm, it is the de facto standard algorithm for training (shallow) artificial neural networks. Another popular stochastic gradient descent algorithm is the least mean squares (LMS) adaptive filter.
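As a sketch of the stochastic variant mentioned above, the following snippet trains a linear regression model with single-sample SGD updates, in the spirit of ADALINE; the synthetic data, learning rate, and epoch count are assumptions made only to keep the example self-contained.

```python
import random

# Stochastic gradient descent for linear regression: update the parameters
# after each individual sample rather than after a full pass over the data.

random.seed(0)
# Synthetic data roughly following y = 2*x + 1 with a little noise (assumption).
data = [(i / 100.0, 2.0 * (i / 100.0) + 1.0 + random.uniform(-0.1, 0.1))
        for i in range(100)]

w, b = 0.0, 0.0    # model: y_hat = w * x + b
lr = 0.05          # learning rate (step size)

for epoch in range(100):
    random.shuffle(data)           # visit the samples in a random order
    for x, y in data:
        error = (w * x + b) - y
        # Gradient of the single-sample squared error (y_hat - y)^2.
        w -= lr * 2.0 * error * x
        b -= lr * 2.0 * error

print(w, b)   # approximately 2 and 1, the parameters used to generate the data
```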
