Chapter 1

The document outlines an intermediate course on deep learning with PyTorch, focusing on training robust models using optimizers, addressing vanishing and exploding gradients, and implementing CNNs and RNNs. It covers prerequisites such as neural network training and PyTorch basics, and introduces object-oriented programming concepts to define datasets and models. Additionally, it discusses various optimizers, model evaluation techniques, and solutions for unstable gradients, including weight initialization and batch normalization.


PyTorch and object-oriented programming
INTERMEDIATE DEEP LEARNING WITH PYTORCH

Michal Oleszak
Machine Learning Engineer
What we will learn
How to train robust deep learning models:

Improving training with optimizers

Mitigating vanishing and exploding gradients

Convolutional Neural Networks (CNNs)

Recurrent Neural Networks (RNNs)

Multi-input and multi-output models


Prerequisites
The course assumes you are comfortable with the following topics:

Neural network training:
  Forward pass
  Loss calculation
  Backward pass (backpropagation)

Training models with PyTorch:
  Datasets and DataLoaders
  Model training loop
  Model evaluation

Prerequisite course: Introduction to Deep Learning with PyTorch


Object-Oriented Programming (OOP)
We will use OOP to define:

PyTorch Datasets

PyTorch Models

In OOP, we create objects with:

Abilities (methods)

Data (attributes)


Object-Oriented Programming (OOP)
class BankAccount:
    def __init__(self, balance):
        self.balance = balance

__init__ is called when a BankAccount object is created

balance is an attribute of the BankAccount object

account = BankAccount(100)
print(account.balance)

100


Object-Oriented Programming (OOP)
Methods: Python functions that perform tasks

The deposit method increases the balance

class BankAccount:
    def __init__(self, balance):
        self.balance = balance

    def deposit(self, amount):
        self.balance += amount

account = BankAccount(100)
account.deposit(50)
print(account.balance)

150


Water potability dataset

[Figure slide: preview of the water potability dataset — image not included]


PyTorch Dataset
__init__: load the data and store it as a NumPy array; super().__init__() ensures WaterDataset behaves like a torch Dataset

__len__: return the size of the dataset

__getitem__: take one argument called idx and return the features and label for a single sample at index idx

import pandas as pd
from torch.utils.data import Dataset

class WaterDataset(Dataset):
    def __init__(self, csv_path):
        super().__init__()
        df = pd.read_csv(csv_path)
        self.data = df.to_numpy()

    def __len__(self):
        return self.data.shape[0]

    def __getitem__(self, idx):
        features = self.data[idx, :-1]
        label = self.data[idx, -1]
        return features, label


PyTorch DataLoader
from torch.utils.data import DataLoader

dataset_train = WaterDataset(
    "water_train.csv"
)

dataloader_train = DataLoader(
    dataset_train,
    batch_size=2,
    shuffle=True,
)

features, labels = next(iter(dataloader_train))
print(f"Features: {features},\nLabels: {labels}")

Features: tensor([
    [0.4899, 0.4180, 0.6299, 0.3496, 0.4575,
     0.3615, 0.3259, 0.5011, 0.7545],
    [0.7953, 0.6305, 0.4480, 0.6549, 0.7813,
     0.6566, 0.6340, 0.5493, 0.5789]
]),
Labels: tensor([1., 0.])


PyTorch Model
Sequential model definition:

net = nn.Sequential(
    nn.Linear(9, 16),
    nn.ReLU(),
    nn.Linear(16, 8),
    nn.ReLU(),
    nn.Linear(8, 1),
    nn.Sigmoid(),
)

Class-based model definition:

class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(9, 16)
        self.fc2 = nn.Linear(16, 8)
        self.fc3 = nn.Linear(8, 1)

    def forward(self, x):
        x = nn.functional.relu(self.fc1(x))
        x = nn.functional.relu(self.fc2(x))
        x = nn.functional.sigmoid(self.fc3(x))
        return x

net = Net()
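A minimal sketch tying the pieces together (assuming the dataloader_train and Net defined above): a single forward pass on one batch. Note that the Dataset returns NumPy float64 values, so the batch is cast to float32 before it enters the model.

features, labels = next(iter(dataloader_train))
features = features.float()   # cast from float64 (NumPy default) to the model's float32
outputs = net(features)       # shape (batch_size, 1), probabilities in (0, 1)
print(outputs)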


Let's practice!
INTERMEDIATE DEEP LEARNING WITH PYTORCH

Optimizers, training, and evaluation
INTERMEDIATE DEEP LEARNING WITH PYTORCH

Michal Oleszak
Machine Learning Engineer
Training loop
Define loss function and optimizer:
  BCELoss for binary classification
  SGD optimizer

Iterate over epochs and training batches:
  Clear gradients
  Forward pass: get model's outputs
  Compute loss
  Compute gradients
  Optimizer's step: update params

import torch.nn as nn
import torch.optim as optim

criterion = nn.BCELoss()
optimizer = optim.SGD(net.parameters(), lr=0.01)

for epoch in range(1000):
    for features, labels in dataloader_train:
        optimizer.zero_grad()
        outputs = net(features)
        loss = criterion(
            outputs, labels.view(-1, 1)
        )
        loss.backward()
        optimizer.step()


How an optimizer works

[Figure slides: step-by-step illustration of how an optimizer works — images not included]


Stochastic Gradient Descent (SGD)
optimizer = optim.SGD(net.parameters(), lr=0.01)

Update depends on learning rate

Simple and efficient, for basic models

Rarely used in practice



Adaptive Gradient (Adagrad)
optimizer = optim.Adagrad(net.parameters(), lr=0.01)

Adapts learning rate for each parameter

Good for sparse data

May decrease the learning rate too fast



Root Mean Square Propagation (RMSprop)
optimizer = optim.RMSprop(net.parameters(), lr=0.01)

Update for each parameter based on the size of its previous gradients



Adaptive Moment Estimation (Adam)
optimizer = optim.Adam(net.parameters(), lr=0.01)

Arguably the most versatile and widely used

RMSprop + gradient momentum

Often used as the go-to optimizer

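A minimal sketch of dropping Adam into the training loop shown earlier; lr=0.001 is Adam's commonly used default, an assumption rather than a value from the course.

import torch.optim as optim

# Only the optimizer line of the earlier training loop changes;
# zero_grad, forward pass, loss, backward, and step stay the same.
optimizer = optim.Adam(net.parameters(), lr=0.001)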


Model evaluation
Set up the accuracy metric

Put the model in eval mode and iterate over test data batches with no gradients

Pass data to the model to get predicted probabilities

Compute predicted labels

Update the accuracy metric

from torchmetrics import Accuracy

acc = Accuracy(task="binary")

net.eval()
with torch.no_grad():
    for features, labels in dataloader_test:
        outputs = net(features)
        preds = (outputs >= 0.5).float()
        acc(preds, labels.view(-1, 1))

accuracy = acc.compute()
print(f"Accuracy: {accuracy}")

Accuracy: 0.6759443283081055


Let's practice!
INTERMEDIATE DEEP LEARNING WITH PYTORCH

Vanishing and exploding gradients
INTERMEDIATE DEEP LEARNING WITH PYTORCH

Michal Oleszak
Machine Learning Engineer
Vanishing gradients
Gradients get smaller and smaller during the backward pass

Earlier layers get small parameter updates

Model doesn't learn


Exploding gradients
Gradients get bigger and bigger

Parameter updates are too large

Training diverges
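A rough diagnostic sketch, assuming the Net class defined earlier in the chapter: run one forward/backward pass on random data and inspect per-layer gradient norms to see whether gradients are vanishing or exploding.

import torch
import torch.nn as nn

net = Net()
criterion = nn.BCELoss()
features = torch.rand(4, 9)                   # dummy batch: 4 samples, 9 features
labels = torch.randint(0, 2, (4, 1)).float()  # dummy binary labels
loss = criterion(net(features), labels)
loss.backward()
for name, param in net.named_parameters():
    print(f"{name}: gradient norm = {param.grad.norm().item():.2e}")
# Norms that shrink sharply toward the earlier layers suggest vanishing gradients;
# norms that keep growing across batches suggest exploding gradients.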


Solutions to unstable gradients
1. Proper weights initialization

2. Good activations

3. Batch normalization


Weights initialization
layer = nn.Linear(8, 1)
print(layer.weight)

Parameter containing:
tensor([[-0.0195, 0.0992, 0.0391, 0.0212,
-0.3386, -0.1892, -0.3170, 0.2148]])



Weights initialization
Good initialization ensures:

Variance of layer inputs = variance of layer outputs

Variance of gradients the same before and after a layer

How to achieve this depends on the activation:

For ReLU and similar, we can use He/Kaiming initialization



Weights initialization
import torch.nn.init as init

init.kaiming_uniform_(layer.weight)
print(layer.weight)

Parameter containing:
tensor([[-0.3063, -0.2410, 0.0588, 0.2664,
0.0502, -0.0136, 0.2274, 0.0901]])



He / Kaiming initialization
init.kaiming_uniform_(self.fc1.weight)
init.kaiming_uniform_(self.fc2.weight)
init.kaiming_uniform_(
    self.fc3.weight,
    nonlinearity="sigmoid",
)


He / Kaiming initialization
import torch.nn as nn
import torch.nn.init as init

class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(9, 16)
        self.fc2 = nn.Linear(16, 8)
        self.fc3 = nn.Linear(8, 1)

        init.kaiming_uniform_(self.fc1.weight)
        init.kaiming_uniform_(self.fc2.weight)
        init.kaiming_uniform_(
            self.fc3.weight,
            nonlinearity="sigmoid",
        )

    def forward(self, x):
        x = nn.functional.relu(self.fc1(x))
        x = nn.functional.relu(self.fc2(x))
        x = nn.functional.sigmoid(self.fc3(x))
        return x


Activation functions
nn.functional.relu()
  Often used as the default activation
  Zero for negative inputs - dying neurons

nn.functional.elu()
  Non-zero gradients for negative values - helps against dying neurons
  Average output around zero - helps against vanishing gradients
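A quick sketch of the difference on a few sample values; the inputs here are arbitrary, chosen only to show how each activation treats negatives.

import torch
import torch.nn.functional as F

x = torch.tensor([-2.0, -0.5, 0.0, 1.0])
print(F.relu(x))   # negatives become exactly 0, so their gradient is 0 (dying neurons)
print(F.elu(x))    # negatives map smoothly into (-1, 0), keeping gradients non-zero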


Batch normalization
After a layer:

1. Normalize the layer's outputs by:
  Subtracting the mean
  Dividing by the standard deviation

2. Scale and shift the normalized outputs using learned parameters

The model learns the optimal input distribution for each layer:
  Faster loss decrease
  Helps against unstable gradients


Batch normalization
class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(9, 16)
        self.bn1 = nn.BatchNorm1d(16)

        ...

    def forward(self, x):
        x = self.fc1(x)
        x = self.bn1(x)
        x = nn.functional.elu(x)
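A small standalone sketch of what the normalization step does; the batch here is random data with an arbitrary mean and spread, used only to show that BatchNorm1d brings each feature back to roughly zero mean and unit variance in training mode.

import torch
import torch.nn as nn

bn = nn.BatchNorm1d(16)
bn.train()                       # training mode: normalize with batch statistics
x = torch.randn(32, 16) * 5 + 3  # a batch whose features have mean ~3 and std ~5
out = bn(x)
print(out.mean(dim=0))           # per-feature means close to 0
print(out.std(dim=0))            # per-feature stds close to 1 (learned scale/shift start at 1 and 0)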


Let's practice!
INTERMEDIATE DEEP LEARNING WITH PYTORCH
