Introduction to Deep Learning for Image Classification
Outline
• Training linear classifiers
• Softmax classification
• Neural networks
• Convolutional neural networks
• Generative adversarial networks
Machine Learning and Deep Learning Libraries in Python
• Scikit-learn (machine learning: regression, classification, …)
• Keras (deep learning: neural networks, …)
• TensorFlow (deep learning: production and deployment)
• PyTorch (deep learning: regression, classification, …)
Logistic Regression
The decision boundary is the hyperplane $\mathbf{w}^T\mathbf{x} = 0$. A sample is classified as $\hat{y} = 0$ when $\sigma(\mathbf{w}^T\mathbf{x}) < 0.5$ and as $\hat{y} = 1$ when $\sigma(\mathbf{w}^T\mathbf{x}) > 0.5$: points with $\mathbf{w}^T\mathbf{x}_1 > 0$ lie on one side of the plane, and points with $\mathbf{w}^T\mathbf{x}_2 < 0$ on the other.
Logistic Regression
With $\mathbf{w} = [1, -1]^T$ and the boundary $\mathbf{w}^T\mathbf{x} = 0$, consider the points $\mathbf{x}_1 = [2, 0]^T$ and $\mathbf{x}_3 = [4, -4]^T$ in region $R_1$ and $\mathbf{x}_2 = [-3, 0]^T$ in region $R_2$. For $\mathbf{x}_1$:
$p(y_1 = 1 \mid \mathbf{x}_1, \mathbf{w}) = \sigma(\mathbf{w}^T\mathbf{x}_1) = \sigma(2) = 0.881$
Logistic Regression
For $\mathbf{x}_2 = [-3, 0]^T$ in region $R_2$:
$p(y_2 = 1 \mid \mathbf{x}_2, \mathbf{w}) = \sigma(\mathbf{w}^T\mathbf{x}_2) = \sigma(-3) = 0.047$
$p(y_2 = 0 \mid \mathbf{x}_2, \mathbf{w}) = 1 - \sigma(\mathbf{w}^T\mathbf{x}_2) = 0.953$
Logistic Regression
Evaluating all three points against the boundary $\mathbf{w}^T\mathbf{x} = 0$ (each row sums to 1):

x_n    P(y=1|x)    P(y=0|x)
x1     0.881       0.119
x2     0.047       0.953
x3     0.997       0.003
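A minimal PyTorch sketch to check the first two rows of the table, using the weights and points defined above:

import torch

w = torch.tensor([1.0, -1.0])
for name, x in [("x1", [2.0, 0.0]), ("x2", [-3.0, 0.0])]:
    p = torch.sigmoid(w @ torch.tensor(x))            # P(y = 1 | x, w)
    print(name, round(p.item(), 3), round(1 - p.item(), 3))
# x1 0.881 0.119
# x2 0.047 0.953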
Hyperplane
The hyperplane $\mathbf{w}^T\mathbf{x} = 0$ with $\mathbf{w} = [1, -1]^T$ separates region $R_1$ (containing $\mathbf{x}_1$ and $\mathbf{x}_3$) from region $R_2$ (containing $\mathbf{x}_2$). How do we find $\mathbf{w}$? What is the best $\mathbf{w}$? Fitting it to data is the training process.
The Big Picture
$\hat{y} = f(x \mid W)$
We train the parameters $W, b$ on labeled data (e.g. the Iris dataset); the trained classifier then predicts $\hat{y}$ for an unknown sample $x$.
Training: Cost Function
• Let's try to train a model to predict $y = 1$.
• Let's forget about the logistic function and try a simple model: $\hat{y} = b$.
• We use a cost function to see how good our model is:
$C(b) = (y - b)^2 = (1 - b)^2$
What's the best value for $b$?
Training
• Say we started with a guess $b = -2$: $C(-2) = 9$.
• The best value is $b = 1$, where $C(1) = 0$ and $\hat{y} = 1$.
Training
Gradient descent updates the parameter against the slope, scaled by the learning rate $\eta$:
$b^{1} = b^{0} - \eta \, \frac{dC(b^{0})}{db}$
Where the slope $\frac{dC(b^{0})}{db}$ is negative, the update adds a positive number to $b^{0}$, moving it toward the minimum, where the slope is 0.
Training
$C(-2) = 9$. With $C(b) = (1 - b)^2$, the slope at $b = -2$ is $-2(1 - b) = -6$, so with $\eta = 0.5$:
$b^{1} = -2 - 0.5 \cdot (-6) = -2 + 3 = 1$
and $C(1) = 0$, giving $\hat{y} = 1$.
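The whole procedure fits in a few lines; a minimal sketch, assuming the cost and learning rate above:

# Gradient descent on C(b) = (1 - b)**2, starting from the guess b = -2
b = -2.0
eta = 0.5
for _ in range(10):
    slope = -2 * (1 - b)   # dC/db
    b = b - eta * slope
print(b)                   # 1.0, where C(1) = 0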
Training: Cost
• For logistic regression all we do is add the logistic function to the same cost:
$c(w, b) = (y - b)^2$
$c(w, b) = (y - \sigma(wx + b))^2$
$c(w, b) = (y_1 - \sigma(wx_1 + b))^2 + \ldots + (y_N - \sigma(wx_N + b))^2$
Training
[Figure: gradient descent on the cost surface: successive parameters $\mathbf{w}^0, \mathbf{w}^1, \mathbf{w}^2$ each step in the direction of the negative gradient $-\nabla f(\boldsymbol{\theta})$, and the cost $c(\mathbf{w}^k)$ decreases at each step.]
Training: logistic
The same update rule applies:
$b^{1} = b^{0} - \eta \, \frac{dC(b^{0})}{db}$
Where $\sigma(w^{0}x + b)$ is steep, the update makes progress (e.g. $b^{1} = -1 + 2$); in the flat regions of the sigmoid the derivative is nearly zero and the update stalls (e.g. $b^{1} = -1 + 0$), even though the fit is still poor.
Training: Threshold
Instead of squared error, we can score candidate parameters by the likelihood of the training labels:
$p(y_0 = 0 \mid \mathbf{x}_0, b) \times p(y_1 = 0 \mid \mathbf{x}_1, b) \times \cdots \times p(y_N = 1 \mid \mathbf{x}_N, b) \approx 0.3378$

Parameter b    0.69     0.339    0
Likelihood     0.445    0.46     0.47

Trying several values of $b$, the likelihood increases as $b$ moves toward 0, so $b = 0$ is the best of these candidates.
Logistic Regression
The likelihood of the whole dataset under parameters $\boldsymbol{\theta} = (\mathbf{w}, b)$ is
$p(Y \mid \boldsymbol{\theta}, \mathbf{X}) = \prod_{n=1}^{N} \sigma(\mathbf{w}^T\mathbf{x}_n + b)^{y_n} \, \big(1 - \sigma(\mathbf{w}^T\mathbf{x}_n + b)\big)^{1 - y_n}$
We take $\ln(p(Y \mid \boldsymbol{\theta}, \mathbf{X}))$ and minimize $-\ln(p(Y \mid \boldsymbol{\theta}, \mathbf{X}))$, which is equivalent to
$\hat{\theta} = \arg\max_{\theta} P(Y \mid \theta)$
Cross Entropy
$\ell(\boldsymbol{\theta}) = -\frac{1}{N} \sum_{n=1}^{N} \big[ y_n \ln \sigma(\mathbf{w}\mathbf{x}_n + b) + (1 - y_n) \ln\big(1 - \sigma(\mathbf{w}\mathbf{x}_n + b)\big) \big]$
def criterion(yhat, y):
    out = -1 * torch.mean(y * torch.log(yhat) + (1 - y) * torch.log(1 - yhat))
    return out
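For example, confident correct predictions give a loss near zero (tensor values chosen for illustration):

yhat = torch.tensor([0.99, 0.01])   # predicted P(y = 1) for two samples
y = torch.tensor([1.0, 0.0])        # the true labels
print(criterion(yhat, y))           # tensor(0.0101)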
For three classes $y = 0, 1, 2$ we fit one line per class:
$z_0 = w_0 x + b_0$
$z_1 = w_1 x + b_1$
$z_2 = w_2 x + b_2$
The predicted class is the one whose line is largest at the given $x$: class 0 where $z_0 > z_1, z_2$; class 1 where $z_1 > z_0, z_2$; class 2 where $z_2 > z_0, z_1$.
Review of Argmax
i:   0  1  2  3  4  5  6  7  8  9
z = [1  3  2  4  3  2  0 10  9  7]
$\hat{y} = \arg\max_i \{z_i\} = 7$, the index of the largest entry (10).
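In PyTorch this is torch.argmax:

z = torch.tensor([1, 3, 2, 4, 3, 2, 0, 10, 9, 7])
yhat = torch.argmax(z)
print(yhat.item())   # 7, the index of the largest value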
Given the three lines
$z_0 = -1.7x - 0.36$
$z_1 = 1.9x - 0.34$
$z_2 = 1.3x + 1.3$
we evaluate all three at an input $x$ and predict $\hat{y} = \arg\max_i \{z_i\}$:

i:      0       1       2
z_i:    2.33    1.15   -3.3     →  ŷ = 0
z_i:   -1.25    1.36    0.64    →  ŷ = 1
z_i:   -3.05    1.46    2.6     →  ŷ = 2
Targets: MNIST Dataset
The targets are the ten digits $y = 0, 1, 2, \ldots, 9$. Each image is flattened into a vector $\mathbf{x}$ with $28 \times 28 = 784$ dimensions (the figures show 2D for illustration, not 784D), and the prediction is
$\hat{y} = \arg\max_i \{\mathbf{w}_i \mathbf{x} + b_i\}$
Each class $i$ has a weight vector $\mathbf{w}_i$ that produces a score $z_i = \mathbf{w}_i^T \mathbf{x}$; the softmax function converts the scores into probabilities:
$P(y = 0 \mid \mathbf{x}) = \frac{e^{z_0}}{e^{z_0} + e^{z_1} + e^{z_2}}$
$P(y = 1 \mid \mathbf{x}) = \frac{e^{z_1}}{e^{z_0} + e^{z_1} + e^{z_2}}$
$P(y = 2 \mid \mathbf{x}) = \frac{e^{z_2}}{e^{z_0} + e^{z_1} + e^{z_2}}$
[Figure: three weight vectors, e.g. $\mathbf{w}_1 = (\tfrac{1}{2}, \tfrac{1}{2})$, $\mathbf{w}_2 = (-\tfrac{1}{2}, \tfrac{1}{2})$, $\mathbf{w}_3 = (0, -1)$, each scoring the input $(x_1, x_2)$.]
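A sketch with torch.nn.functional.softmax, using the scores from the first argmax example above:

import torch.nn.functional as F

z = torch.tensor([2.33, 1.15, -3.3])
p = F.softmax(z, dim=0)      # probabilities, summing to 1
print(p, torch.argmax(p))    # the largest probability is at index 0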
class SoftMax(nn.Module):
    def __init__(self, in_size, out_size):
        super(SoftMax, self).__init__()
        self.linear = nn.Linear(in_size, out_size)

    def forward(self, x):
        out = self.linear(x)
        return out

input_dim = 28 * 28
output_dim = 10
model = SoftMax(input_dim, output_dim)
criterion = nn.CrossEntropyLoss()
The loss expects the targets y as a torch.LongTensor of shape (N,), not (N, 1).
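For example (shapes chosen for illustration):

z = torch.randn(4, 10)            # logits for 4 samples and 10 classes
y = torch.tensor([0, 3, 9, 1])    # LongTensor of shape (4,), not (4, 1)
loss = criterion(z, y)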
import torch.optim as optim

optimizer = optim.SGD(model.parameters(), lr=0.01)
n_epochs = 100
accuracy_list = []
train_loader = torch.utils.data.DataLoader(dataset=train_dataset, batch_size=100)
validation_loader = torch.utils.data.DataLoader(dataset=validation_dataset, batch_size=5000)
N_test = len(validation_dataset)
for epoch in range(n_epochs):
    for x, y in train_loader:
        optimizer.zero_grad()
        z = model(x.view(-1, 28 * 28))
        loss = criterion(z, y)
        loss.backward()
        optimizer.step()
    correct = 0
    for x_test, y_test in validation_loader:
        z = model(x_test.view(-1, 28 * 28))
        _, yhat = torch.max(z.data, 1)
        correct = correct + (yhat == y_test).sum().item()
    accuracy = correct / N_test
    accuracy_list.append(accuracy)
What Is a Neural Network?
• A neural network is a function that can be used to approximate most functions using a set of parameters.
Linear Classifiers vs Neural Networks
A linear classifier separates classes with a single line; a neural network composes linear functions $z$ with nonlinear activations, so it can also fit data that is not linearly separable.
Neural Networks
Each neuron applies a linear function $z(x)$ followed by an activation function $a(z)$.
[Figure: two hidden activations $a_1^1$ and $a_2^1$ are combined with weights $1$ and $-1$ (i.e. $a_1^1 - a_2^1$) to form the output $z$.]
Neural Networks
A network consists of a hidden layer and an output layer, each built from artificial neurons. Neuron $j$ of layer $l$ applies weights $w_{ij}^{l}$ and a bias $b_j^{l}$ to its inputs.
For example, with one input $x$ and two hidden neurons:
$z_1^1 = x + 5, \quad z_2^1 = x - 5$
After the sigmoid activation:
$a_1^1 = \sigma(x + 5), \quad a_2^1 = \sigma(x - 5)$
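A sketch that reproduces this pair of neurons by setting the weights of an nn.Linear layer by hand (values taken from the example above):

import torch
import torch.nn as nn

layer = nn.Linear(1, 2)
with torch.no_grad():
    layer.weight.copy_(torch.tensor([[1.0], [1.0]]))   # both neurons weight x by 1
    layer.bias.copy_(torch.tensor([5.0, -5.0]))        # biases +5 and -5
x = torch.tensor([[0.0]])
print(torch.sigmoid(layer(x)))   # [sigma(5), sigma(-5)] ≈ [0.993, 0.007]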
PyTorch

import torch
import torch.nn as nn
from torch import sigmoid

class Net(nn.Module):
    def __init__(self, D_in, H, D_out):
        super(Net, self).__init__()
        self.linear1 = nn.Linear(D_in, H)
        self.linear2 = nn.Linear(H, D_out)

    def forward(self, x):
        x = sigmoid(self.linear1(x))
        x = sigmoid(self.linear2(x))
        return x
Setting D_in=1 (one input), H=2 (two hidden neurons), and D_out=1 (one output) constructs the network and makes a prediction:

model = Net(1, 2, 1)
x = torch.tensor([0.0])
yhat = model(x)
Neural Networks
[Figure: a network with inputs $x_1, \ldots, x_D$ feeding Neurons 0, 1, and 2 through first-layer weights $w_{ij}^{1}$; their activations $\sigma(z)$ feed the next layer through weights $w_{ij}^{2}$.]
[Figure: for inputs $x_1 = 10$ and $x_2 = 4$, the three output neurons take the values 1, 3, and 9; the argmax of these gives the prediction $\hat{y} = 2$.]
For deeper networks with many inputs $x_1, \ldots, x_D$ we stack more layers, using the ReLU activation in the hidden layers:

class Net(nn.Module):
    def __init__(self, D_in, H1, H2, D_out):
        super(Net, self).__init__()
        self.linear1 = nn.Linear(D_in, H1)
        self.linear2 = nn.Linear(H1, H2)
        self.linear3 = nn.Linear(H2, D_out)

    def forward(self, x):
        x = torch.relu(self.linear1(x))
        x = torch.relu(self.linear2(x))
        x = self.linear3(x)
        return x
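A usage sketch, with layer sizes assumed to match the two-input, three-output figure above:

model = Net(D_in=2, H1=3, H2=3, D_out=3)   # hidden sizes assumed for illustration
x = torch.tensor([[10.0, 4.0]])            # the inputs x1 = 10, x2 = 4
z = model(x)
yhat = torch.argmax(z, dim=1)              # predicted class index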
A grayscale image is an array of intensity values indexed by row and column, e.g.:

255 255 255 255   0
255 255 255 235   0
255 255   0   0   0
255   0   0   0   0

A larger image is the same structure at scale, e.g. 500 rows by 500 columns of intensities.
Image from: Rafael C. Gonzalez, Richard E. Woods, Digital Image Processing (2008, Prentice Hall)
CNN for Image Classification
Feature learning: Input → Convolution + ReLU → Pooling → Convolution + ReLU → Pooling
Classification: fully connected layer
The convolution slides a kernel $\mathbf{W}$ over the image $\mathbf{X}$ (here kernel_size=3):

image X:        kernel W:
0 0 1 0 0       1 0 -1
0 0 1 0 0       2 0 -2
0 0 1 0 0       1 0 -1
0 0 1 0 0
0 0 1 0 0
Z (the activation map)
At the first window position, the ones in the image line up with the kernel's right column:
(0·1 + 0·0 + 1·(−1)) + (0·2 + 0·0 + 1·(−2)) + (0·1 + 0·0 + 1·(−1)) = −4
Sliding the kernel one step right gives 0 (the ones line up with the kernel's zero column), and one more step gives 4. Repeating over all window positions produces the activation map:

Z =
-4 0 4
-4 0 4
-4 0 4
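A minimal sketch reproducing this hand computation with nn.Conv2d (kernel weights set by hand, bias disabled):

import torch
import torch.nn as nn

conv = nn.Conv2d(in_channels=1, out_channels=1, kernel_size=3, bias=False)
with torch.no_grad():
    conv.weight.copy_(torch.tensor([[[[1., 0., -1.],
                                      [2., 0., -2.],
                                      [1., 0., -1.]]]]))
X = torch.zeros(1, 1, 5, 5)
X[0, 0, :, 2] = 1.0    # the image above: a vertical line of ones
print(conv(X))          # every row of the 3x3 output is [-4, 0, 4]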
Out channels
Applying several kernels to the same input produces one activation map per kernel:
$\mathbf{Z}_0 = \mathbf{W}_0 * \mathbf{X} + b_0, \quad \mathbf{Z}_1 = \mathbf{W}_1 * \mathbf{X} + b_1, \quad \mathbf{Z}_2 = \mathbf{W}_2 * \mathbf{X} + b_2$

X1 (vertical line):    X2 (horizontal line):
0 0 1 0 0              0 0 0 0 0
0 0 1 0 0              0 0 0 0 0
0 0 1 0 0              1 1 1 1 1
0 0 1 0 0              0 0 0 0 0
0 0 1 0 0              0 0 0 0 0

W0:           W1:            W2:
1 0 -1         1  2  1       1 1 1
2 0 -2         0  0  0       1 1 1
1 0 -1        -1 -2 -1       1 1 1

Each kernel responds to a different pattern, so the vertical-line image X1 and the horizontal-line image X2 yield different sets of maps.
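In practice the kernels are learned rather than fixed; a layer with out_channels=3 produces one map per kernel (reusing the input X from the sketch above):

conv = nn.Conv2d(in_channels=1, out_channels=3, kernel_size=3)
Z = conv(X)        # one activation map per kernel
print(Z.shape)     # torch.Size([1, 3, 3, 3])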
Method similar to CNN: https://developer.nvidia.com/blog/deep-learning-nutshell-core-concepts/
CNN Architecture
Input image → Convolution layer (+ ReLU) → Pooling layer → Convolution layer (+ ReLU) → Pooling layer → Output
(Image by Alex Aklson: convolution and max pooling stages followed by a hidden layer.)
TORCHVISION MODELS
CNN Architecture: a pretrained model such as ResNet18 applies the same stack of convolution and max pooling layers to the input image, followed by a fully connected layer that outputs one score per class. The target is encoded as a vector
$\mathbf{y} = [1, 0, \ldots, 0]^T$
with a 1 at the position of the correct class and 0 elsewhere.
import torch
import torchvision.models as models
from torch.utils.data import Dataset, DataLoader
from torchvision import transforms
import torch.nn as nn
import numpy as np

torch.manual_seed(0)

# Pretrained model
model = models.resnet18(pretrained=True)

mean = [0.485, 0.456, 0.406]
std = [0.229, 0.224, 0.225]
composed = transforms.Compose([transforms.Resize(224),
                               transforms.ToTensor(),
                               transforms.Normalize(mean, std)])
# Dataset here stands in for the course's custom dataset class
train_dataset = Dataset(transform=composed, train=True)
validation_dataset = Dataset(transform=composed)
for param in model.parameters():
    param.requires_grad = False
model.fc = nn.Linear(512, 7)
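A quick sketch to confirm that only the replaced output layer remains trainable:

trainable = [name for name, p in model.named_parameters() if p.requires_grad]
print(trainable)   # ['fc.weight', 'fc.bias']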
train_loader = torch.utils.data.DataLoader(dataset=train_dataset, batch_size=15)
validation_loader = torch.utils.data.DataLoader(dataset=validation_dataset, batch_size=10)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam([parameters for parameters in model.parameters() if parameters.requires_grad], lr=0.003)
n_epochs = 20
loss_list = []
accuracy_list = []
correct = 0
n_test = len(validation_dataset)
for epoch in range(n_epochs):
    loss_sublist = []
    for x, y in train_loader:
        model.train()
        optimizer.zero_grad()
        z = model(x)
        loss = criterion(z, y)
        loss_sublist.append(loss.data.item())
        loss.backward()
        optimizer.step()
    loss_list.append(np.mean(loss_sublist))
    correct = 0
    for x_test, y_test in validation_loader:
        model.eval()
        z = model(x_test)
        _, yhat = torch.max(z.data, 1)
        correct += (yhat == y_test).sum().item()
    accuracy = correct / n_test
    accuracy_list.append(accuracy)
Generative Adversarial Networks (GANs)
• A GAN generates new data with the same statistics as the training set.
https://thispersondoesnotexist.com/