Gradient Descent for Beginners

The document explains the concept of Gradient Descent as an iterative optimization algorithm used in machine learning to minimize the cost function, which evaluates the performance of a model. It emphasizes the importance of small, user-guided iterations in both Agile software development and Gradient Descent, highlighting the role of learning rates and derivatives in finding the minimum value of a function. The goal is to achieve the lowest error in predictions by adjusting parameters effectively through the algorithm.

Gradient Descent

Prakash P
Background
• Agile is a well-known term in the software development process. The basic idea behind it is simple:
• build something quickly ➡️ get it out there ➡️ get some feedback ➡️ make changes depending upon the feedback ➡️ repeat the process.
• The goal is to get the product in front of users early and let their feedback guide you toward the best possible product with the least error.
• The steps taken for improvement need to be small and should constantly involve the user.
• In this way, an Agile software development process relies on rapid iterations.
• This idea, start with a solution as soon as possible, then measure and iterate as frequently as possible, is essentially Gradient Descent under the hood.
Objective
• The Gradient Descent algorithm is an iterative process that takes us to the minimum of a function.
• The formula below sums up the entire Gradient Descent algorithm in a single line:

θ_new = θ_old − α · (dJ/dθ)

where θ is a parameter, α is the learning rate and dJ/dθ is the derivative of the cost function with respect to θ.
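That one-line update can be sketched in code. This is a minimal illustration only: the quadratic cost J(θ) = θ² and all the numbers below are assumptions chosen for the example, not part of the original slides.

```python
# One gradient descent step: theta_new = theta_old - alpha * dJ/dtheta.
# Illustrative cost J(theta) = theta**2, whose derivative is 2*theta.
def gd_step(theta, alpha=0.1):
    gradient = 2 * theta          # dJ/dtheta at the current position
    return theta - alpha * gradient

theta = 5.0
for _ in range(50):               # repeat the one-line update
    theta = gd_step(theta)
print(theta)                      # approaches 0, the minimum of J
```

Each step shrinks θ toward the minimum; repeating the single-line update is the whole algorithm.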
A Machine Learning Model
• Consider a set of data points in a 2D space. Assume the data relates the height and weight of a group of students.
• We want to find a relationship between these quantities so that we can predict the weight of new students later.
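A linear model of the kind described can be sketched as follows; the slope and intercept here are made-up values for illustration, not fitted parameters.

```python
# Hypothetical linear model y = m*x + b relating height (cm) to weight (kg).
def predict_weight(height_cm, m=0.9, b=-90.0):
    # m and b are illustrative guesses; training would tune them.
    return m * height_cm + b

print(predict_weight(170.0))      # predicted weight for a 170 cm student
```

Training, described in the following slides, is the process of finding the m and b that fit the known students best.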
Predictions
• Given a known set of inputs and their corresponding outputs, a machine learning model tries to make predictions for a new set of inputs.
• The error is the difference between the predicted values and the actual values.
• This leads to the idea of a Cost function, or Loss function.

Cost Function
• A Cost function (Loss function) evaluates the performance of our machine learning algorithm.
• The Loss function computes the error for a single training example, while the Cost function is the average of the loss over all the training examples.
• A Cost function tells us how good our model is at making predictions for a given value of m and b.
• If there are a total of ‘N’ points in the dataset, we want to minimize the error over all N of them, so the Cost function is the mean squared error:

J(m, b) = (1/N) · Σᵢ (yᵢ − (m·xᵢ + b))²
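The cost can be computed directly. The heights and weights below are illustrative numbers, chosen so that the line y = 0.6·x − 40 fits them exactly.

```python
# Mean squared error cost for the line y = m*x + b over N data points.
def mse_cost(m, b, xs, ys):
    n = len(xs)
    return sum((y - (m * x + b)) ** 2 for x, y in zip(xs, ys)) / n

xs = [150.0, 160.0, 170.0, 180.0]    # heights (cm), illustrative
ys = [50.0, 56.0, 62.0, 68.0]        # weights (kg), exactly 0.6*x - 40

print(mse_cost(0.6, -40.0, xs, ys))  # near zero: a perfect fit
print(mse_cost(0.5, -40.0, xs, ys))  # much larger: a poor fit
```

A lower cost means a better choice of m and b, which is exactly what the next slides set out to minimize.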
Minimizing the Cost Function

• The goal of any machine learning algorithm is to minimize the Cost Function.
• A lower error between the actual and predicted values signifies that the algorithm has done a good job of learning.
• Since we want the lowest error, we want the ‘m’ and ‘b’ values that give the smallest possible error.
How do we actually minimize any
function?
• Suppose the cost function is of the form Y = X².
• In a Cartesian coordinate system, this is the equation of a parabola.
• To minimize the function, we need to find the value of X that produces the lowest value of Y (the red dot in the plot).
• It is quite easy to locate the minimum visually here since the graph is 2D, but that may not always be the case, especially in higher dimensions.
• For those cases, we need an algorithm to locate the minimum, and that algorithm is called Gradient Descent.
Gradient Descent
• Gradient Descent is one of the most popular optimization algorithms and by far the most common way to optimize neural networks.
• It is an iterative optimization algorithm used to find the minimum of a function.

Intuition:

• Consider that you are walking along the graph below, and you are currently at the ‘green’ dot.
• Your aim is to reach the minimum, i.e. the ‘red’ dot, but from your position you are unable to see it.
• Possible actions would be:
• You might go upward or downward.
• Once you decide which way to go, you might take a bigger step or a smaller step to reach your destination.
• The Gradient Descent algorithm helps us make these decisions efficiently and effectively with the use of derivatives.
The Minimum Value

• A derivative is a term from calculus, calculated as the slope of the graph at a particular point.
• The slope at the blue point is less steep than that at the green point, which means gradient descent takes much smaller steps toward the minimum from the blue point than from the green point.
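The point about step sizes can be checked numerically on Y = X²; the positions chosen for the ‘green’ and ‘blue’ points below are assumptions for illustration.

```python
# Slope of Y = X**2 at a point x is 2*x: steep far from the minimum,
# gentle close to it, so gradient steps shrink as we approach.
def slope(x):
    return 2 * x

green, blue = 4.0, 1.0                # illustrative positions on the curve
learning_rate = 0.1
print(learning_rate * slope(green))   # larger step taken far from the minimum
print(learning_rate * slope(blue))    # smaller step taken near the minimum
```

With the same learning rate, the step size is proportional to the slope, which is why the algorithm naturally slows down as it nears the bottom.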
Mathematical Interpretation of the Cost Function
• In the equation y = mX + b, ‘m’ and ‘b’ are its parameters. During the training process, there will be a small change in their values.
• Let that small change be denoted by δ.
• The values of the parameters will be updated as m = m − δm and b = b − δb respectively.
• Our aim is to find the values of m and b in y = mX + b for which the error is minimum, i.e. the values that minimize the cost function.
The Learning Rate
• The size of the steps taken to reach the minimum is called the Learning Rate.
• With larger steps (a higher learning rate) we can cover more ground, but we risk overshooting the minimum.
• With smaller steps (a lower learning rate), on the other hand, we will take a lot of time to reach the lowest point.
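This trade-off can be demonstrated on Y = X²; the three learning rates below are arbitrary illustrative choices.

```python
# How the learning rate affects gradient descent on Y = X**2.
def distance_after_descent(lr, steps=20, x=1.0):
    for _ in range(steps):
        x = x - lr * (2 * x)      # gradient step with rate lr
    return abs(x)                 # distance from the minimum at 0

print(distance_after_descent(0.01))  # too small: still far after 20 steps
print(distance_after_descent(0.4))   # reasonable: very close to 0
print(distance_after_descent(1.1))   # too large: overshoots and diverges
```

A rate that is too small wastes iterations, while one that is too large bounces past the minimum and can move further away with every step.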
Calculating Gradient Descent

m¹, b¹ = next position parameters; m⁰, b⁰ = current position parameters. The updates are:

m¹ = m⁰ − α · ∂J/∂m   and   b¹ = b⁰ − α · ∂J/∂b

where, for the mean squared error cost, the partial derivatives are

∂J/∂m = −(2/N) · Σᵢ xᵢ · (yᵢ − (m·xᵢ + b))   and   ∂J/∂b = −(2/N) · Σᵢ (yᵢ − (m·xᵢ + b))

The 2 in these equations isn’t that significant: it can be absorbed into the learning rate, which then simply becomes twice as big.
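Putting the update rules together, a full run can be sketched end to end. The data points are fabricated from y = 2x + 1 purely for illustration, and the learning rate and iteration count are arbitrary choices.

```python
# Gradient descent for y = m*x + b using the partial derivatives
#   dJ/dm = -(2/N) * sum(x_i * (y_i - (m*x_i + b)))
#   dJ/db = -(2/N) * sum(y_i - (m*x_i + b))
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]        # fabricated from y = 2*x + 1

m, b = 0.0, 0.0                  # start from arbitrary parameters
lr, n = 0.05, len(xs)
for _ in range(5000):
    errors = [y - (m * x + b) for x, y in zip(xs, ys)]
    dm = -(2 / n) * sum(x * e for x, e in zip(xs, errors))
    db = -(2 / n) * sum(errors)
    m, b = m - lr * dm, b - lr * db   # move against the gradient

print(m, b)                      # close to the true values 2 and 1
```

Each iteration computes the gradient at the current (m, b) and steps against it; after enough iterations the parameters recover the line the data was generated from.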
Conclusion
• Hence, to solve for the gradient, we iterate through our data points, using the current m and b values to compute the partial derivatives.
• This gradient tells us the slope of the cost function at our current position and the direction we should move in to update our parameters.
• The size of our update is controlled by the learning rate.
References
• https://machinelearningmastery.com/gradient-descent-for-machine-learning/
• https://towardsdatascience.com/understanding-the-mathematics-behind-gradient-descent-dde5dc9be06e
