
Homework 3

Sarthak Jain

2018/10/18

1 Running the Algorithms on a Random Data Set

1.1 Problem Formulation

min_x (1/2)||Ax − b||²    (1)

where A is a randomly generated 50×10 matrix and b is a random 50×1 vector. The largest eigenvalue of AᵀA is 61.9 (i.e., L = 61.9).
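The report does not include its code, so the following is a minimal NumPy sketch of this setup; the random seed and the names f, grad and L are illustrative, not the author's.

```python
import numpy as np

# Minimal sketch of the problem setup; the seed and names are illustrative.
rng = np.random.default_rng(0)
A = rng.standard_normal((50, 10))   # random 50x10 feature matrix
b = rng.standard_normal(50)         # random 50x1 target vector

def f(x):
    """Objective f(x) = (1/2)||Ax - b||^2."""
    r = A @ x - b
    return 0.5 * (r @ r)

def grad(x):
    """Gradient of f: A^T (Ax - b)."""
    return A.T @ (A @ x - b)

# L is the largest eigenvalue of A^T A (the Lipschitz constant of the gradient).
L = np.linalg.eigvalsh(A.T @ A).max()
```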

1.2 Data Preprocessing

The feature matrix A is first "scaled and shifted" according to the following equation:

A = (A − mean(A))/σ (2)

This standardization improves the condition number of AᵀA, so the algorithms converge faster.
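A one-line NumPy sketch of Eq. (2), assuming mean(A) and σ are taken column-wise, which is the usual convention:

```python
# Column-wise standardization per Eq. (2): subtract each column's mean and
# divide by its standard deviation.
A = (A - A.mean(axis=0)) / A.std(axis=0)
```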

1.3 Using Steepest Descent

Figure 1 shows the convergence of steepest descent with various step sizes in close proximity to 1/L. In Figure 1 we notice that as the step size increases, the convergence time increases up to a step size of 0.015 and then decreases again.
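A minimal sketch of fixed-step steepest descent, reusing grad from the setup sketch above; the step size 0.015 mirrors the range studied in Figure 1:

```python
def steepest_descent(grad, x0, step, iters=1000):
    """Gradient descent with a fixed step size; returns the iterate history."""
    x = x0.copy()
    history = [x.copy()]
    for _ in range(iters):
        x -= step * grad(x)
        history.append(x.copy())
    return history

# With L = 61.9, 1/L is about 0.016, so step sizes near 0.015 are of interest.
iterates = steepest_descent(grad, np.zeros(10), step=0.015)
```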

Figure 1: log(Error) vs Number of iterations for Steepest Descent with various stepsizes.

1.4 Diminishing Step Size

Figure 2 shows the result of diminishing step size (α/r). We see that as α increases, the initial divergence grows, but the rate of convergence improves too; there is thus a trade-off between initial divergence and rate of convergence. We find that this algorithm performs best at α = 0.1.
Figure 3 shows the result of diminishing step size (α/√r). The initial divergence of this version is larger than that of the earlier version, so it works better for lower values of α than the previous version does. The best performance of this version is observed at α = 0.05.
Note that α/r² never actually reaches the optimum, because the step sizes are summable: Σ 1/r² converges to π²/6 ≈ 1.64 (and not to infinity), so the total distance the iterates can travel is finite.
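A sketch covering all three diminishing schedules (α/r, α/√r, α/r²) through a single power parameter; this is an illustration of the schedules discussed above, not the author's code:

```python
def diminishing_descent(grad, x0, alpha, power=1.0, iters=1000):
    """Gradient descent with step alpha / r**power at iteration r.

    power=1.0 gives alpha/r and power=0.5 gives alpha/sqrt(r). power=2.0
    gives alpha/r^2, whose steps are summable, so the iterates can stall
    short of the optimum no matter how long they run.
    """
    x = x0.copy()
    for r in range(1, iters + 1):
        x -= (alpha / r ** power) * grad(x)
    return x
```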

Figure 2: log(Error) vs Number of iterations for Diminishing stepsize=α/r.


Figure 3: log(Error) vs Number of iterations for Diminishing stepsize=α/√r.

1.5 Armijo Rule

Figures 4 and 5 display the performance of the Armijo rule for various values of s, β and σ, keeping two of them fixed at a time. We find that this algorithm performs best close to β = 0.5, s = 0.2 and σ = 0.1.
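A sketch of gradient descent with the Armijo rule as used above: each iteration starts from step s and shrinks it by β until the sufficient-decrease test with parameter σ passes. The lower bound on the step is an added safety guard, not part of the rule itself.

```python
def armijo_descent(f, grad, x0, s=0.2, beta=0.5, sigma=0.1, iters=200):
    """Gradient descent with the Armijo rule: start from step s and shrink it
    by beta until f(x) - f(x - step*g) >= sigma * step * ||g||^2 holds."""
    x = x0.copy()
    for _ in range(iters):
        g = grad(x)
        step = s
        # Backtrack until sufficient decrease; the step floor is a guard.
        while step > 1e-12 and f(x) - f(x - step * g) < sigma * step * (g @ g):
            step *= beta
        x -= step * g
    return x
```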

Figure 4: log(Error) vs Number of iterations for Armijo Rule, varying β (s = 1 and σ = 0.1).

Figure 5: log(Error) vs Number of iterations for Armijo Rule, varying s (β = 0.5 and σ = 0.1).

2 Running the Algorithms on the Digits Data Set

2.1 Problem Formulation

min_x (1/2)||Ax − b||²    (3)

where A is a 7291×3 matrix whose first two columns contain the features "intensity" and "symmetry" and whose third column consists of ones (the intercept term); b is a 7291×1 vector with label +1 if the digit is 1 and −1 otherwise.
The eigenvalues of AᵀA are 3634, 7291 and 10946.
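A hedged sketch of how such a design matrix might be assembled; the images and digits arrays and the particular intensity/symmetry definitions are assumptions, since the report does not show its feature-extraction code.

```python
import numpy as np

def build_design_matrix(images, digits, target=1):
    """Assemble the 7291x3 matrix A and label vector b; `images` (n x h x w
    grayscale) and `digits` (n labels) are hypothetical inputs."""
    intensity = images.mean(axis=(1, 2))        # average pixel value
    # One common symmetry measure: negated difference from the mirror image.
    symmetry = -np.abs(images - images[:, :, ::-1]).mean(axis=(1, 2))
    A = np.column_stack([intensity, symmetry, np.ones(len(images))])
    b = np.where(digits == target, 1.0, -1.0)   # +1 for the target digit
    return A, b
```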

2.2 Data Preprocessing

The feature matrix A is first scaled and shifted according to the following equation:

A = (A − mean(A))/σ (4)

This standardization improves the condition number of AᵀA, so the algorithms converge faster.

2.3 Using Steepest Descent

Figure 6 shows the convergence of steepest descent with various step sizes. Since L = 10946, we look for values in the range 10⁻⁴ to 10⁻⁵. In Figure 6 we notice that as the step size decreases, the convergence time increases, which is expected. For α > 0.0001 the iterates diverge. We should therefore choose a step size smaller than 0.0001, but not a very small one, otherwise convergence takes longer. Hence α = 0.0001 is close to the best step size for steepest descent.

Figure 6: log(Error) vs Number of iterations for Steepest Descent with various stepsizes.

2.4 Diminishing Step Size

Figure 7 shows the result of diminishing step size (α/r). We see that as α increases over the interval 0.0006 to 0.00095, the performance improves. However, above 0.0001 the algorithm takes forever to converge: the initial error shoots up to a very high value (the iterates initially diverge), and the diminishing step size then takes a long time to bring the error back toward 0. This is consistent with the fixed step size results, where we found that the algorithm diverges for step sizes greater than 0.0001.

Figure 8 shows the result of diminishing step size (α/√r). The performance of this version is much better than the previous version, because the step size decreases more slowly in each iteration and the algorithm therefore converges faster. Here we get the best performance at α = 0.0005 (compared to higher values), because for lower α the initial overshoot is much smaller than for higher α. The algorithm becomes very slow for α > 0.001 because of the very large initial divergence.

Figure 7: log(Error) vs Number of iterations for Diminishing stepsize=α/r.


Figure 8: log(Error) vs Number of iterations for Diminishing stepsize=α/√r.

2.5 Armijo Rule

Figures 9, 10 and 11 display the performance of the Armijo rule for various values of s, β and σ, keeping two of them fixed at a time. For the first two plots, σ = 0.1. We find that this algorithm performs best close to β = 0.5, s = 0.2 and σ = 0.1.

2.6 Clash of the Best

In Figure 12 we take the best performers from the above three step size rules and compare them. For fixed step size we take α = 0.0001; for the Armijo rule we take β = 0.5, σ = 0.1 and s = 0.8. We find that the Armijo rule is the best performer; however, the diminishing step size becomes almost equal to Armijo asymptotically.
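A short sketch of this comparison, reusing the routines sketched in Section 1 with the best parameters reported above; f and grad are assumed to be rebuilt here from the digits data.

```python
# Comparison sketch: f and grad are assumed rebuilt from the 7291x3 problem.
x0 = np.zeros(3)
x_fixed = steepest_descent(grad, x0, step=0.0001)[-1]
x_dimin = diminishing_descent(grad, x0, alpha=0.0005, power=0.5)
x_armijo = armijo_descent(f, grad, x0, s=0.8, beta=0.5, sigma=0.1)
```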
Figure 9: log(Error) vs Number of iterations for Armijo Rule, varying β (s = 1 and σ = 0.1).

Figure 10: log(Error) vs Number of iterations for Armijo Rule, varying s (β = 0.5 and σ = 0.1).

Figure 11: log(Error) vs Number of iterations for Armijo Rule, varying σ (β = 0.5 and s = 0.8).


Figure 12: Comparison between the best performers of the above three algorithms.

2.7 Regression Lines for Classification

Figures 13 and 14 show the result of our algorithm for separating one digit from the rest. As the error is negligible with all three algorithms, the classifying hyperplane is the same irrespective of the algorithm used.
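A sketch of how the separating line can be drawn from the solution x = (w1, w2, w0); the plotting details are illustrative, not the report's code.

```python
import numpy as np
import matplotlib.pyplot as plt

def plot_boundary(A, b, x):
    """Scatter the two classes and draw the boundary line
    w1*intensity + w2*symmetry + w0 = 0, where x = (w1, w2, w0)."""
    pos, neg = b > 0, b < 0
    plt.scatter(A[pos, 0], A[pos, 1], marker='o', label='target digit')
    plt.scatter(A[neg, 0], A[neg, 1], marker='x', label='rest')
    xs = np.linspace(A[:, 0].min(), A[:, 0].max(), 100)
    plt.plot(xs, -(x[0] * xs + x[2]) / x[1], 'k-', label='boundary')
    plt.xlabel('intensity')
    plt.ylabel('symmetry')
    plt.legend()
    plt.show()
```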

Figure 13: Separating 1 and rest.

Figure 14: Separating 0 and rest.
