
Deep Learning - Week 3

Use the following data to answer questions 1 and 2


A neural network contains an input layer h0 = x, three hidden layers (h1 , h2 , h3 ), and
an output layer O. All the hidden layers use the Sigmoid activation function, and the
output layer uses the Softmax activation function.
Suppose the input x ∈ R^200, and all the hidden layers contain 10 neurons each. The
output layer contains 4 neurons.

1. How many parameters (including biases) are there in the entire network?
Correct Answer: 2274
Solution:
Number of parameters:
Input layer to h1: 200 × 10 + 10 = 2010
h1 to h2: 10 × 10 + 10 = 110
h2 to h3: 10 × 10 + 10 = 110
h3 to output layer: 10 × 4 + 4 = 44
Total parameters: 2010 + 110 + 110 + 44 = 2274
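The same count can be verified programmatically; a minimal Python sketch, where the `sizes` list is simply the layer widths from the question:

```python
# Layer widths: input 200, three hidden layers of 10, output 4
sizes = [200, 10, 10, 10, 4]

# Each layer contributes fan_in * fan_out weights plus fan_out biases
total = sum(n_in * n_out + n_out for n_in, n_out in zip(sizes, sizes[1:]))
print(total)  # 2274
```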

2. Suppose all elements in the input vector are zero, and the corresponding true label is
also 0. Further, suppose that all the parameters (weights and biases) are initialized
to zero. What is the loss value if the cross-entropy loss function is used? Use the
natural logarithm (ln).
Correct Answer: Range(1.317,1.455)
Solution:
Loss with zero inputs and parameters: x = 0, all weights and biases = 0.
Hidden layers: σ(0) = 0.5 for every neuron.
Output layer logits: [0, 0, 0, 0].
Softmax: Softmax(z_i) = 1/4 for all i.
Cross-entropy loss: −ln(1/4) = ln(4) ≈ 1.386.
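The value can also be checked numerically; a minimal NumPy sketch assuming the standard softmax and cross-entropy definitions:

```python
import numpy as np

logits = np.zeros(4)                           # zero weights and biases give zero logits
probs = np.exp(logits) / np.exp(logits).sum()  # softmax -> [0.25, 0.25, 0.25, 0.25]
loss = -np.log(probs[0])                       # cross-entropy with true class 0
print(loss)  # 1.3862943611198906, i.e. ln(4)
```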


Use the following data to answer questions 3 and 4


The diagram below shows a neural network. The network contains two hidden layers
and one output layer. The input to the network is a column vector x ∈ R^3. The first
hidden layer contains 9 neurons, the second hidden layer contains 5 neurons and the
output layer contains 2 neurons. Each neuron in the lth layer is connected to all the
neurons in the (l + 1)th layer. Each neuron has a bias connected to it (not explicitly
shown in the figure).
[Figure: network diagram with input layer (x1, x2, x3), hidden layer 1 (h_1^(1) through h_9^(1)), hidden layer 2 (h_1^(2) through h_5^(2)), and output layer (ŷ1, ŷ2); the weight matrices W1, W2, and W3 connect successive layers.]
In the diagram, W1 is a matrix and x, a1, h1, and O are all column vectors. The notation Wi[j, :] denotes the j-th row of the matrix Wi, Wi[:, j] denotes the j-th column of the matrix Wi, and Wk[i, j] denotes the element at the i-th row and j-th column of the matrix Wk.

3. Choose the correct dimensions of W1 and a1

(a) W1 ∈ R^(3×9)
(b) a1 ∈ R^(9×5)
(c) W1 ∈ R^(9×3)
(d) a1 ∈ R^(1×9)
(e) W1 ∈ R^(1×9)
(f) a1 ∈ R^(9×1)

Correct Answer: (c),(f)


Solution: a1 = W1 · x must be a 9 × 1 column vector (one pre-activation per neuron in hidden layer 1), and for the product W1 · x with x ∈ R^3 to be defined and yield 9 entries, W1 must be 9 × 3.

4. How many learnable parameters (including biases) are there in the network?


Correct Answer: 98
Solution:
Number of parameters in W1: (9 × 3) + 9 = 36
Number of parameters in W2: (5 × 9) + 5 = 50
Number of parameters in W3: (2 × 5) + 2 = 12
Total: 36 + 50 + 12 = 98.
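The same fan-in/fan-out count used in Question 1 reproduces this; a quick sketch with this network's layer widths:

```python
sizes = [3, 9, 5, 2]  # input, hidden layer 1, hidden layer 2, output
total = sum(n_in * n_out + n_out for n_in, n_out in zip(sizes, sizes[1:]))
print(total)  # 98
```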
5. We have a multi-class classification problem that we decide to solve by training a feedforward neural network. Which activation function should we use in the output layer to get the best results?

(a) Logistic
(b) Step function
(c) Softmax
(d) Linear

Correct Answer: (c)


Solution: Softmax works best for multi-class classification problems since it outputs a valid probability distribution over the classes and is invariant to adding the same constant to every logit.
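A small sketch illustrating both properties: the outputs form a probability distribution (non-negative, summing to 1), and adding a constant to every logit leaves them unchanged:

```python
import numpy as np

def softmax(z):
    z = z - z.max()  # subtracting the max improves numerical stability
    e = np.exp(z)
    return e / e.sum()

z = np.array([1.0, 2.0, 3.0])
print(softmax(z), softmax(z).sum())              # probabilities summing to 1
print(np.allclose(softmax(z), softmax(z + 10)))  # True: invariant to a constant shift
```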

6. Which of the following statements about backpropagation is true?

(a) It is used to compute the output of a neural network.


(b) It is used to optimize the weights in a neural network.
(c) It is used to initialize the weights in a neural network.
(d) It is used to regularize the weights in a neural network.

Correct Answer: (b)


Solution: Backpropagation is the standard algorithm for computing the gradients needed to optimize the weights in a neural network. It computes the gradient of the loss function with respect to each weight in the network, and an optimizer such as gradient descent then uses those gradients to update the weights in a way that minimizes the loss function.
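A minimal sketch of this gradient loop for a single linear neuron with squared-error loss; the toy data and learning rate here are illustrative assumptions, not part of the question:

```python
import numpy as np

x, y_true = np.array([1.0, 2.0]), 3.0  # illustrative toy data (assumed)
w, lr = np.zeros(2), 0.05              # weights and learning rate (assumed)

for _ in range(100):
    y_pred = w @ x                     # forward pass
    grad = 2 * (y_pred - y_true) * x   # dL/dw for L = (y_pred - y_true)^2
    w -= lr * grad                     # gradient-descent update
print(w @ x)  # ~3.0: the loss has been driven toward zero
```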

7. Given two probability distributions p and q, under what conditions is the cross entropy
between them minimized?

(a) All the values in p are lower than corresponding values in q


(b) All the values in p are higher than corresponding values in q
(c) p = 0 (0 is a vector)
(d) p = q

Correct Answer: (d)


Solution: Cross-entropy is lowest when both distributions are identical.
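This is Gibbs' inequality: H(p, q) = −Σᵢ pᵢ ln qᵢ ≥ H(p, p), with equality exactly when p = q. A quick numerical check on an arbitrary example distribution:

```python
import numpy as np

def cross_entropy(p, q):
    return -np.sum(p * np.log(q))

p = np.array([0.7, 0.2, 0.1])
q = np.array([0.5, 0.3, 0.2])
print(cross_entropy(p, p))  # ~0.802, the entropy of p (the minimum)
print(cross_entropy(p, q))  # ~0.887, strictly larger since q != p
```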

8. Given that the probability of Event A occurring is 0.18 and the probability of Event
B occurring is 0.92, which of the following statements is correct?

(a) Event A has a low information content


(b) Event A has a high information content
(c) Event B has a low information content
(d) Event B has a high information content

Correct Answer: (b),(c)


Solution: Events with high probability have low information content while events
with low probability have high information content.
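Concretely, with information content I(x) = −ln P(x):

```python
import numpy as np

print(-np.log(0.18))  # ~1.715 nats: the rare Event A carries high information
print(-np.log(0.92))  # ~0.083 nats: the likely Event B carries little information
```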

Use the following data to answer the questions 9 and 10


The following diagram represents a neural network containing two hidden layers and
one output layer. The input to the network is a column vector x ∈ R^3. The activation
function used in the hidden layers is sigmoid. The output layer doesn't contain any
activation function, and the loss used is the squared-error loss (ŷ − y)².
[Figure: network diagram with input layer (x1, x2, x3), hidden layer 1 (h_1^(1), h_2^(1), h_3^(1)), hidden layer 2 (h_1^(2), h_2^(2)), and a single output ŷ1.]

The network doesn't contain any biases, and the weights of the network are given below:

$$W_1 = \begin{bmatrix} 1 & 1 & 3 \\ 2 & -1 & 1 \\ 1 & 2 & -2 \end{bmatrix}, \quad W_2 = \begin{bmatrix} 1 & 1 & 2 \\ 3 & 1 & 1 \end{bmatrix}, \quad W_3 = \begin{bmatrix} 1 & 2 \end{bmatrix}$$

The input to the network is $x = \begin{bmatrix} 1 & 2 & 1 \end{bmatrix}^\top$ and the target value is $y = 5$.

9. What is the predicted output for the given input x after doing the forward pass?
Correct Answer: Range(2.9,3.0)
Solution:
Doing the forward pass in the network we get:

$$h_1 = W_1 x = \begin{bmatrix} 1 & 1 & 3 \\ 2 & -1 & 1 \\ 1 & 2 & -2 \end{bmatrix} \begin{bmatrix} 1 \\ 2 \\ 1 \end{bmatrix} = \begin{bmatrix} 6 \\ 1 \\ 3 \end{bmatrix}$$

$$a_1 = \text{sigmoid}(h_1) = \begin{bmatrix} 0.997 \\ 0.731 \\ 0.952 \end{bmatrix}$$

$$h_2 = W_2 a_1 = \begin{bmatrix} 1 & 1 & 2 \\ 3 & 1 & 1 \end{bmatrix} \begin{bmatrix} 0.997 \\ 0.731 \\ 0.952 \end{bmatrix} = \begin{bmatrix} 3.632 \\ 4.674 \end{bmatrix}$$

$$a_2 = \text{sigmoid}(h_2) = \begin{bmatrix} 0.974 \\ 0.990 \end{bmatrix}$$

$$\hat{y} = W_3 a_2 = \begin{bmatrix} 1 & 2 \end{bmatrix} \begin{bmatrix} 0.974 \\ 0.990 \end{bmatrix} = 2.954$$

10. Compute and enter the loss between the output generated by input x and the true
output y.
Correct Answer: Range(3.97,4.39)
Solution: Loss = (5 − 2.954)² ≈ 4.186.
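Both answers can be reproduced with a short NumPy forward pass; a minimal sketch using the weights from the question:

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

W1 = np.array([[1, 1, 3], [2, -1, 1], [1, 2, -2]])
W2 = np.array([[1, 1, 2], [3, 1, 1]])
W3 = np.array([[1, 2]])
x, y = np.array([1, 2, 1]), 5

a1 = sigmoid(W1 @ x)     # hidden layer 1 activations
a2 = sigmoid(W2 @ a1)    # hidden layer 2 activations
y_hat = (W3 @ a2)[0]     # linear output layer
print(y_hat)             # ~2.956, within the accepted range (2.9, 3.0)
print((y - y_hat) ** 2)  # ~4.179, within the accepted range (3.97, 4.39)
```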
