Hyperparameter Tuning
PRESENTED BY: SRUTHY P L
ROLL NO: 11
M.TECH VLSI & ES
MODEL ENGINEERING COLLEGE
THRIKKAKARA
Hyperparameters vs. Parameters
• Parameters: Internal variables learned by the model (e.g., weights,
biases).
• Hyperparameters: External configurations that control training (e.g.,
learning rate, dropout rate).
• Example: In a neural network,
- Parameters: weights, biases.
- Hyperparameters: number of layers, activation function, learning rate.
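A minimal PyTorch sketch of this distinction (assuming PyTorch is available; the layer sizes and learning rate below are illustrative choices, not recommendations):

import torch
import torch.nn as nn

# Hyperparameters: chosen before training, not learned from data
num_hidden = 64        # number of hidden units (illustrative)
learning_rate = 0.01   # step size used by the optimizer

# The weights and biases inside these layers are the parameters,
# learned from data during training
model = nn.Sequential(
    nn.Linear(10, num_hidden),
    nn.ReLU(),
    nn.Linear(num_hidden, 1),
)
optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)

# Parameters are exactly what the optimizer updates
for name, p in model.named_parameters():
    print(name, p.shape)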
HYPERPARAMETERS
• Hyperparameters are settings that control the learning process of a
machine learning model.
• Unlike parameters (e.g., weights in neural networks),
hyperparameters are not learned from data.
• They are set before training starts and influence model performance.
• Examples: Learning rate, batch size, number of hidden layers, number
of trees in a random forest.
Why is Hyperparameter Tuning Important?
• Poor hyperparameters can lead to underfitting or overfitting.
• The right hyperparameters improve model accuracy and efficiency.
• Helps in optimizing training time and computational resources.
• Essential for deep learning models where training costs are high.
Common Hyperparameters in Machine Learning & Deep Learning
• Learning Rate (α): Controls how much to adjust weights in each step.
• Batch Size: Number of samples per update.
• Number of Epochs: Number of complete passes through the dataset.
• Number of Layers: Defines depth of deep learning models.
• Activation Functions: ReLU, Sigmoid, Tanh, etc.
• Dropout Rate: Prevents overfitting by randomly dropping
connections.
• Regularization Parameters: L1, L2 norms to control model complexity.
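As an illustration, these settings are often gathered into a single configuration before training starts; the names and values below are hypothetical defaults, not tuned recommendations:

# Hypothetical hyperparameter configuration for one training run
config = {
    "learning_rate": 1e-3,   # step size for weight updates
    "batch_size": 32,        # samples per gradient update
    "num_epochs": 20,        # full passes through the dataset
    "num_layers": 4,         # depth of the network
    "activation": "relu",    # ReLU, Sigmoid, Tanh, ...
    "dropout_rate": 0.5,     # fraction of connections randomly dropped
    "weight_decay": 1e-4,    # L2 regularization strength
}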
Learning Rate
• The learning rate is a hyperparameter that determines the step size at
which the network updates its parameters during training.
• A large learning rate can lead to rapid convergence but may result in
unstable and oscillating training.
• A small learning rate can ensure stable and smooth training but may
result in slower convergence.
• Therefore, it is important to experiment with different learning rates
and choose the one that gives the best trade-off between training speed
and stability.
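A small sketch of this trade-off on a toy one-dimensional objective (the quadratic below is only an illustration of how the step size scales each update):

# Toy objective f(w) = w**2 with gradient 2*w; the minimum is at w = 0
def run_gradient_descent(learning_rate, steps=10, w=5.0):
    for _ in range(steps):
        grad = 2 * w
        w = w - learning_rate * grad   # update scaled by the learning rate
    return w

print(run_gradient_descent(0.01))  # small rate: stable but slow progress
print(run_gradient_descent(0.4))   # moderate rate: fast, stable convergence
print(run_gradient_descent(1.1))   # too large: updates overshoot and diverge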
Number of Layers
• The number of layers in a CNN is a critical
hyperparameter that determines the depth of the
network.
• A deeper network can learn more complex features and
patterns from the data, but it is also more prone to
overfitting.
• Therefore, it is important to strike a balance between the
number of layers and the complexity of the problem.
• A good starting point is to use a small number of layers
and gradually increase the depth until the desired
performance is achieved.
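A sketch of how the depth can be exposed as a single hyperparameter in PyTorch (channel counts and the 64x64 input size are illustrative):

import torch
import torch.nn as nn

def build_cnn(num_conv_layers, channels=16):
    # Stack a configurable number of conv blocks; depth is the hyperparameter
    layers = []
    in_ch = 3  # RGB input
    for _ in range(num_conv_layers):
        layers += [nn.Conv2d(in_ch, channels, kernel_size=3, padding=1),
                   nn.ReLU(),
                   nn.MaxPool2d(2)]
        in_ch = channels
    return nn.Sequential(*layers)

# Start shallow and deepen only if validation performance demands it
shallow = build_cnn(num_conv_layers=2)
deeper = build_cnn(num_conv_layers=4)
print(shallow(torch.randn(1, 3, 64, 64)).shape)  # torch.Size([1, 16, 16, 16])
print(deeper(torch.randn(1, 3, 64, 64)).shape)   # torch.Size([1, 16, 4, 4])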
Filter Size
• The filter size is another important hyperparameter that determines
the receptive field of each convolutional layer.
• A larger filter size can capture more information from the input
image, but it also increases the number of parameters in the network.
• A smaller filter size can reduce the number of parameters, but it may
not be able to capture all the relevant features in the image.
• Therefore, it is important to experiment with different filter sizes
and choose the one that gives the best performance.
• A common starting point is a 3x3 filter, as in the sketch below.
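A sketch comparing filter sizes in PyTorch; the channel counts are illustrative, and the parameter counts follow directly from filter size x channels plus biases:

import torch.nn as nn

# Same in/out channels, different receptive fields
conv3 = nn.Conv2d(3, 32, kernel_size=3)  # 32*3*3*3 + 32 = 896 parameters
conv5 = nn.Conv2d(3, 32, kernel_size=5)  # 32*3*5*5 + 32 = 2432 parameters

def count_params(layer):
    return sum(p.numel() for p in layer.parameters())

print(count_params(conv3))  # smaller filter: fewer parameters
print(count_params(conv5))  # larger filter: wider receptive field, more parameters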
Stride
• The stride is a hyperparameter that determines the number of pixels by
which the filter moves across the input image.
• A larger stride can reduce the size of the output feature maps, but it can
also lead to information loss.
• A smaller stride can preserve more information, but it also increases the
computation time and memory requirements.
• Therefore, it is important to choose an appropriate stride that balances the
trade-off between information loss and computational efficiency.
• The default stride in a CNN is 1, as in the sketch below.
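A sketch of how the stride changes the output feature-map size, using the usual relation output = floor((input - kernel) / stride) + 1 with no padding (sizes are illustrative):

import torch
import torch.nn as nn

x = torch.randn(1, 3, 32, 32)  # a 32x32 RGB input

stride1 = nn.Conv2d(3, 8, kernel_size=3, stride=1)  # default stride of 1
stride2 = nn.Conv2d(3, 8, kernel_size=3, stride=2)  # larger stride downsamples

print(stride1(x).shape)  # (32 - 3)/1 + 1 = 30  -> torch.Size([1, 8, 30, 30])
print(stride2(x).shape)  # (32 - 3)//2 + 1 = 15 -> torch.Size([1, 8, 15, 15])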
Padding
• Padding is a technique used to preserve the spatial dimensions of the
input image while applying convolutional layers.
• It involves adding zeros around the border of the input image to create
a padded image that can be convolved with the filter.
• Padding can help preserve the information at the edges of the image
and prevent the loss of spatial resolution.
• However, it also increases the memory requirements and computation
time of the network.
• Therefore, it is important to experiment with different padding
techniques and choose the one that gives the best performance.
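A sketch of zero padding in PyTorch; with a 3x3 filter, padding=1 preserves the spatial dimensions of the input (sizes are illustrative):

import torch
import torch.nn as nn

x = torch.randn(1, 3, 32, 32)

no_pad = nn.Conv2d(3, 8, kernel_size=3, padding=0)    # no padding: edges are lost
same_pad = nn.Conv2d(3, 8, kernel_size=3, padding=1)  # zeros added around the border

print(no_pad(x).shape)    # torch.Size([1, 8, 30, 30]) - spatial size shrinks
print(same_pad(x).shape)  # torch.Size([1, 8, 32, 32]) - spatial size preserved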
Batch Size
• The batch size is a hyperparameter that determines the number of
samples that are processed by the network in each training iteration.
• A larger batch size can reduce the variance of the gradient estimates and
improve the stability of the training.
• However, it also increases the memory requirements and may lead to
slower convergence.
• A smaller batch size can reduce the memory requirements and improve
the convergence speed but may lead to noisy gradient estimates.
• Therefore, it is important to experiment with different batch sizes and
choose the one that gives the best trade-off between stability and speed.
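A sketch of the batch size as a data-loading hyperparameter in PyTorch (the random tensors stand in for a real training dataset):

import torch
from torch.utils.data import DataLoader, TensorDataset

# Stand-in dataset: 1000 samples with 20 features each
data = TensorDataset(torch.randn(1000, 20), torch.randint(0, 2, (1000,)))

batch_size = 32  # hyperparameter: samples processed per training iteration
loader = DataLoader(data, batch_size=batch_size, shuffle=True)

for features, labels in loader:
    print(features.shape)  # torch.Size([32, 20]) per iteration (last batch may be smaller)
    break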
Methods of Hyperparameter Tuning
• 1. Grid Search: Exhaustive search over a predefined hyperparameter
space.
• 2. Random Search: Randomly samples hyperparameter settings, often more
efficient than an exhaustive grid.
• 3. Bayesian Optimization: Uses probability models to find best
hyperparameters.
• 4. Hyperband: Optimizes computational budget using adaptive
resource allocation.
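A sketch of grid search and random search with scikit-learn, assuming scikit-learn and SciPy are available; the estimator and the search ranges below are illustrative:

from scipy.stats import randint
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# Grid search: exhaustively tries every combination in the grid
grid = GridSearchCV(RandomForestClassifier(random_state=0),
                    param_grid={"n_estimators": [50, 100], "max_depth": [3, 5, None]},
                    cv=3)
grid.fit(X, y)
print(grid.best_params_)

# Random search: samples a fixed number of configurations, often more efficient
rand = RandomizedSearchCV(RandomForestClassifier(random_state=0),
                          param_distributions={"n_estimators": randint(50, 200),
                                               "max_depth": [3, 5, None]},
                          n_iter=5, cv=3, random_state=0)
rand.fit(X, y)
print(rand.best_params_)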
Challenges in Hyperparameter Tuning
• Large search space makes tuning computationally expensive.
• Overfitting can occur if tuned too aggressively.
• Requires deep knowledge of the model and data.
• Trade-off between performance improvement and computational
cost.
Conclusion
• Hyperparameter tuning is crucial for optimizing machine learning &
deep learning models.
• Choosing the right tuning method improves performance and
efficiency.
• Understanding hyperparameters helps in better model design and
training.
• Use systematic approaches and tools to automate tuning for large-
scale projects.