0% found this document useful (0 votes)

110 views16 pages

MAT6007 - Session8 - Gradient Descent

Gradient descent is an iterative algorithm used to find the minimum of a cost function. It works by taking successive small steps in the direction of the negative gradient of the function at the current point. The gradient provides information about the direction and steepness of the cost function. The learning rate determines the step size in each iteration and affects how quickly the minimum is reached. The goal of gradient descent is to update the parameters like weights and biases in a neural network to minimize the overall cost function.

Uploaded by

manojnaidu yandrapu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

110 views16 pages

MAT6007 - Session8 - Gradient Descent

Uploaded by

manojnaidu yandrapu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

MAT6007 Deep Learning

Gradient Descent

Prakash P VIT , Chennai

Road Map
 Understanding the Mathematics
behind Gradient Descent
 Gradient Descent

Prakash VIT Chennai

Understanding the Mathematics behind Gradient
Descent
 Agile is a pretty well-known term in the software development
process.
 The basic idea behind it is simple:
 build something quickly, ➡️get it out there, ➡️get some feedback ➡️make
changes depending upon the feedback ➡️repeat the process.
 The goal is to get the product near the user and guide you
with feedback to obtain the best possible product with the least
error.
 Also, the steps taken for improvement need to be small and should
constantly involve the user
The idea of — start with a solution as soon as possible, measure and iterate as frequently as possible, is
Gradient descent under the hood.
Prakash VIT Chennai
Understanding the Mathematics behind Gradient
Descent
Objective
Gradient descent algorithm is an iterative process that takes us to the minimum of a
function.

The formula below sums up the entire Gradient Descent algorithm in a single line

Prakash VIT Chennai

Understanding the Mathematics behind Gradient
Descent
A Machine Learning Model
arbitrary line in space that passes through
Consider a bunch of data points in a 2 D some of these data points
space. Assume that the data is related to
the height and weight of a group of
students.

We are trying to predict some

relationship between these quantities to
predict the weight of some new
students afterward.

This is essentially a simple example of a

supervised Machine Learning technique
Prakash VIT Chennai
Understanding the Mathematics behind Gradient
Descent
Predictions
Given a known set of inputs and their corresponding outputs, A machine learning model
tries to make some predictions for a new set of inputs.

This relates to the idea of a Cost

The Error would be the difference between the function or Loss function.
two predictions.

Prakash VIT Chennai

Understanding the Mathematics behind Gradient
Descent
Cost Function
A Cost Function/Loss Function evaluates the performance of our Machine Learning
Algorithm
The Loss function computes the error for a single training example, while the Cost
function is the average of the loss functions for all the training examples
Let’s say there are a total of ’N’ points in the dataset, and for all those ’N’ data
points, we want to minimize the error.

So the Cost function would be the total squared error

The goal of any Learning Algorithm is to

minimize the Cost Function.
Prakash VIT Chennai
Understanding the Mathematics behind Gradient
Descent
How do we minimize any function?

Cost function is of the form Y = X²

To minimize the function above, need to find
that value of X that produces the lowest value
of Y which is the red dot in the above figure
It is pretty easy to locate the minima here since it is a 2D graph, but this may not
always be the case, especially in higher dimensions

Devise an algorithm to locate the minima, and that algorithm is called Gradient Descent

Prakash VIT Chennai

Understanding the Mathematics behind Gradient
Descent
Consider that you are walking along with the graph below, and you are currently at the
‘green’ dot. You aim to reach the minimum, i.e., the ‘red’ dot, but from your position, you
are unable to view it.
Possible actions would be:
You might go upward or downward

If you decide which way to go, you might take a bigger step or a little
step to reach your destination

Essentially, there are two things that you should know to

reach the minima, i.e. which way to go and how big a step
to take.
Understanding the Mathematics behind Gradient
Descent
The Minimum Value
Tangent at the green point, know that if we are
moving upwards, we are moving away from the
minima and vice versa.

Also, the tangent gives us a sense of the

steepness of the slope

The slope at the blue point is less steep

than that at the green point, which means
it will take much smaller steps to reach the
minimum from the blue point than from
the green point
Understanding the Mathematics behind Gradient
Descent
Mathematical Interpretation of Cost Function
• Let us now put all these learnings into a mathematical formula.
• In the equation, y = mX+b ‘m’ and ‘b’ are its parameters.
• During the training process, there will be a small change in their values.
• Let that small change be denoted by δ.
• The value of parameters will be updated as m=m-δm and b=b-δb, respectively.
• Aim here is to find those values of m and b in y = mx+b
• For which the error is minimum, i.e., values that minimize the cost function.

The idea is that by being able to compute the derivative/slope of the function, find the minimum of a function
Understanding the Mathematics behind Gradient
The Learning rate Descent
This size of steps taken to reach the minimum or bottom is called Learning Rate.

Derivatives

Use derivates to decide whether to increase or decrease the weights to increase

or decrease any objective function

Two concepts from calculus

Chain Rule

Power Rule
Understanding the Mathematics behind Gradient
Calculating Gradient Descent Descent
apply these rules of calculus in our original equation and find the derivative of the Cost
Function w.r.t to both ‘m’ and ‘b’.
Calculate the gradient of Error w.r.t to both m and b

m¹,b¹ = next position

parameters;
m⁰,b⁰ = current position
parameters
Example

 Find the local minima of the function y=(x+5)² starting from the point x=3

Step 1 : Initialize x =3. Then, find the gradient

of the function, dy/dx = 2*(x+5). learning rate → 0.01

https://
gist.github.com/rohanjoseph93/
ecbbb9fb1715d5c248bcad0a7d
3bffd2#file-gradient_descent-ip
ynb

Prakash VIT Chennai

References
 https://www.youtube.com/watch?v=jc2IthslyzM

 Reducing Loss: Gradient Descent | Machine Learning Crash Course (google.com)

 Understanding the Mathematics behind Gradient Descent. | by Parul

Pandey | Towards Data Science

 https://
towardsdatascience.com/implement-gradient-descent-in-python-9b93ed7108d1

Prakash VIT Chennai

Thanks

Prakash VIT Chennai

Understanding Gradient Descent in ML
No ratings yet
Understanding Gradient Descent in ML
15 pages
Gradient Descent Deep Learning: by T.K. Damodharan Vice President, RBS Reg - No: PC2013003013008
No ratings yet
Gradient Descent Deep Learning: by T.K. Damodharan Vice President, RBS Reg - No: PC2013003013008
37 pages
Gradient Descent
No ratings yet
Gradient Descent
9 pages
Gradient Descent
No ratings yet
Gradient Descent
6 pages
Understanding Cost Function & Gradient Descent
No ratings yet
Understanding Cost Function & Gradient Descent
142 pages
Adam Optimizer
No ratings yet
Adam Optimizer
22 pages
AI33
No ratings yet
AI33
6 pages
Assignment 4
No ratings yet
Assignment 4
8 pages
Understanding Gradient Descent Algorithm
No ratings yet
Understanding Gradient Descent Algorithm
64 pages
Understanding Gradient Descent in ML
No ratings yet
Understanding Gradient Descent in ML
20 pages
Understanding Gradient Descent in ML
No ratings yet
Understanding Gradient Descent in ML
8 pages
Assignment B 4 GradientDescent
No ratings yet
Assignment B 4 GradientDescent
5 pages
Gradient Descent in Machine Learning
No ratings yet
Gradient Descent in Machine Learning
65 pages
ML Lec 08 Gradient Descent
No ratings yet
ML Lec 08 Gradient Descent
37 pages
Gradient Descent
No ratings yet
Gradient Descent
13 pages
Understanding Gradient Descent in ML
No ratings yet
Understanding Gradient Descent in ML
4 pages
Gradient Descent
No ratings yet
Gradient Descent
5 pages
Understanding Gradient Descent in ML
No ratings yet
Understanding Gradient Descent in ML
62 pages
LInear
No ratings yet
LInear
14 pages
Assignment No 3
No ratings yet
Assignment No 3
7 pages
Gradient Descent Explained
No ratings yet
Gradient Descent Explained
9 pages
Slides-4 Optimization Extra Gradient Descent
No ratings yet
Slides-4 Optimization Extra Gradient Descent
67 pages
Gradient Descent in Machine Learning
No ratings yet
Gradient Descent in Machine Learning
3 pages
Analytical vs Numerical Solutions in Optimization
No ratings yet
Analytical vs Numerical Solutions in Optimization
14 pages
Gradient Descent
No ratings yet
Gradient Descent
7 pages
Gradient Descent in Machine Learning
No ratings yet
Gradient Descent in Machine Learning
27 pages
Gradient Descent in Logistic Regression
No ratings yet
Gradient Descent in Logistic Regression
16 pages
Understanding Gradient Descent Algorithm
No ratings yet
Understanding Gradient Descent Algorithm
5 pages
What Is Gradient Descent - Built in
No ratings yet
What Is Gradient Descent - Built in
11 pages
Gradient Descent in Machine Learning
No ratings yet
Gradient Descent in Machine Learning
27 pages
MScFE 650 MLF - Video - Transcripts - M3
No ratings yet
MScFE 650 MLF - Video - Transcripts - M3
19 pages
Mscfe XXX (Course Name) - Module X: Collaborative Review Task
No ratings yet
Mscfe XXX (Course Name) - Module X: Collaborative Review Task
19 pages
Unit 2-DLV
No ratings yet
Unit 2-DLV
84 pages
Gradient Descent
No ratings yet
Gradient Descent
55 pages
Understanding Gradient Descent Techniques
No ratings yet
Understanding Gradient Descent Techniques
25 pages
Machine Learning Optimization Techniques
No ratings yet
Machine Learning Optimization Techniques
37 pages
Understanding Gradient Descent in ML
No ratings yet
Understanding Gradient Descent in ML
15 pages
Gradient Descent for ML Beginners
No ratings yet
Gradient Descent for ML Beginners
11 pages
11 Gradient Descent
No ratings yet
11 Gradient Descent
58 pages
Understanding Gradient Descent in ML
No ratings yet
Understanding Gradient Descent in ML
9 pages
Understanding Gradient Descent Basics
No ratings yet
Understanding Gradient Descent Basics
12 pages
UNIT III Part-2
No ratings yet
UNIT III Part-2
39 pages
AIMLB PGP 2025 Session 5
No ratings yet
AIMLB PGP 2025 Session 5
67 pages
Notes Unit 1-3 Part-III
No ratings yet
Notes Unit 1-3 Part-III
25 pages
05 Gradient Descent
No ratings yet
05 Gradient Descent
23 pages
Gradient Descent Algorithm.Y...
No ratings yet
Gradient Descent Algorithm.Y...
10 pages
Gradient Descent in Linear Regression
No ratings yet
Gradient Descent in Linear Regression
30 pages
Unit VI Optimization Techniques Question Bank Solved Answer
No ratings yet
Unit VI Optimization Techniques Question Bank Solved Answer
20 pages
Gradient Descent Algorithm in Machine Learning: Dr. P. K. Chaurasia
No ratings yet
Gradient Descent Algorithm in Machine Learning: Dr. P. K. Chaurasia
24 pages
Understanding Gradient Descent in ML
No ratings yet
Understanding Gradient Descent in ML
16 pages
Neural Network Optimization Techniques
No ratings yet
Neural Network Optimization Techniques
65 pages
Gradient Descent and Cost Function
No ratings yet
Gradient Descent and Cost Function
14 pages
Gradient Descent Algorithm in Machine Learning
No ratings yet
Gradient Descent Algorithm in Machine Learning
21 pages
Understanding Gradient Descent Techniques
No ratings yet
Understanding Gradient Descent Techniques
40 pages
Unit3 Rev3
No ratings yet
Unit3 Rev3
201 pages
Chapter 4
No ratings yet
Chapter 4
33 pages
4 - Gradient Descent and Stochastic GD
No ratings yet
4 - Gradient Descent and Stochastic GD
37 pages
Linear Regression by IntuitiveAI v2.5
No ratings yet
Linear Regression by IntuitiveAI v2.5
5 pages
MAT6007 - Session1 - History of Deep Learning
No ratings yet
MAT6007 - Session1 - History of Deep Learning
22 pages
Deep Learning: Sigmoid Neurons & Gradient Descent
No ratings yet
Deep Learning: Sigmoid Neurons & Gradient Descent
19 pages
McCulloch-Pitts Neuron vs Perceptron
No ratings yet
McCulloch-Pitts Neuron vs Perceptron
15 pages
Perceptron Learning Algorithm Explained
No ratings yet
Perceptron Learning Algorithm Explained
19 pages
Understanding Multilayer Perceptrons
No ratings yet
Understanding Multilayer Perceptrons
13 pages
NetSuite Suite Training ERP Fundamentals
100% (2)
NetSuite Suite Training ERP Fundamentals
496 pages
Calculator Design Process Overview
No ratings yet
Calculator Design Process Overview
41 pages
Project Management Basics
No ratings yet
Project Management Basics
14 pages
Exception Handling Try Throw Catch (T)
No ratings yet
Exception Handling Try Throw Catch (T)
22 pages
Xlri MDP On Data Analysis and Financial Modeling Using Excel 0 30 Yrs
No ratings yet
Xlri MDP On Data Analysis and Financial Modeling Using Excel 0 30 Yrs
6 pages
Windows Registry Keys Guide
No ratings yet
Windows Registry Keys Guide
5 pages
Engineering Services and Consultancy Brochure
No ratings yet
Engineering Services and Consultancy Brochure
80 pages
Unit V DBMS
No ratings yet
Unit V DBMS
24 pages
How To Install Libpango-1.0-0 Ubuntu Package On Ubuntu 20.04 - Ubuntu 18.04 - Ubuntu 19.04 - Ubuntu 16.04
No ratings yet
How To Install Libpango-1.0-0 Ubuntu Package On Ubuntu 20.04 - Ubuntu 18.04 - Ubuntu 19.04 - Ubuntu 16.04
2 pages
Secabo CIII
No ratings yet
Secabo CIII
27 pages
Computer Science SL Paper 1 Markscheme
No ratings yet
Computer Science SL Paper 1 Markscheme
11 pages
VNX - VNX 5200 Procedures-VNX5200 File Installation Guide
No ratings yet
VNX - VNX 5200 Procedures-VNX5200 File Installation Guide
95 pages
Activity The Evaluation Usability Test
No ratings yet
Activity The Evaluation Usability Test
3 pages
A Generalized Framework For Opening Doors and Drawers in Kitchen Environments
No ratings yet
A Generalized Framework For Opening Doors and Drawers in Kitchen Environments
7 pages
Meeting Management for Leaders
No ratings yet
Meeting Management for Leaders
7 pages
College Network Redesign Plan 2022
No ratings yet
College Network Redesign Plan 2022
26 pages
B2B Integration Strategy
No ratings yet
B2B Integration Strategy
20 pages
L3WS Products License
No ratings yet
L3WS Products License
6 pages
Final Test Module 9
No ratings yet
Final Test Module 9
12 pages
Understanding EVM
No ratings yet
Understanding EVM
13 pages
IFCD Application Form for Projects
No ratings yet
IFCD Application Form for Projects
9 pages
Narayan DF-Platter Multi-Face Heterogeneous Deepfake Dataset CVPR 2023 Paper
No ratings yet
Narayan DF-Platter Multi-Face Heterogeneous Deepfake Dataset CVPR 2023 Paper
10 pages
Introduction to Regression Analysis
No ratings yet
Introduction to Regression Analysis
15 pages
Đề 04
No ratings yet
Đề 04
40 pages
Non-Data Collection Statement for Research
No ratings yet
Non-Data Collection Statement for Research
2 pages
RHB R6.3 Point Release
No ratings yet
RHB R6.3 Point Release
20 pages
GASCO e-Registration System Overview
No ratings yet
GASCO e-Registration System Overview
11 pages
Report
0% (1)
Report
5 pages
RohiniSri Resume
No ratings yet
RohiniSri Resume
2 pages
4-CH 24-Bit 128kS/s Dynamic Signal Acquisition USB 2.0 Module
No ratings yet
4-CH 24-Bit 128kS/s Dynamic Signal Acquisition USB 2.0 Module
3 pages