
The Annotated ResNet-50

Explaining how ResNet-50 works and why it is so popular

ResNet-50 model architecture

Introduction

The ResNet architecture is considered to be among the most popular Convolutional Neural Network architectures around. Introduced by Microsoft Research in 2015, Residual Networks (ResNet for short) broke several records when they were first introduced in the paper by He et al.

Why ResNet?

The requirement for a model like ResNet arose due to a number of pitfalls in modern networks at the time.

1. Difficulty in training deep neural networks: As the number of layers in a model increases, so does its parameter count. Every Convolutional layer adds (kernel_height ⋅ kernel_width ⋅ input_filters + 1) ⋅ output_filters parameters to the bill. To put it into context, a simple 7x7 kernel Convolution layer from 3 channels to 32 channels adds 4,736 parameters. An increase in the number of layers in the interest of experimentation leads to a corresponding increase in the complexity of training the model, which then requires greater computational power and memory. (A quick sanity check of this arithmetic is sketched in code after the figure below.)
2. More expressive, less different: A neural network is often considered to be a function approximator: given inputs, targets, and a way to compare the network's output against the target, it can model the underlying function. Adding more layers makes the network more capable of modelling complex functions. But the results published in the paper showed that an 18-layer plain neural network performs considerably better than a 34-layer plain neural network, as can be seen in the graph below.

(He et al., 2015)
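As a quick sanity check of the parameter arithmetic above, the short snippet below (an illustration added here, not part of the original code) computes the count for an arbitrary convolutional layer:

def conv2d_params(kernel_h, kernel_w, in_filters, out_filters):
    # (kernel_h * kernel_w * in_filters weights + 1 bias) per output filter
    return (kernel_h * kernel_w * in_filters + 1) * out_filters

# the 7x7 convolution from 3 to 32 channels mentioned in point 1
print(conv2d_params(7, 7, 3, 32))  # 4736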

Adding layers can be seen as an expansion of the function space. For example, multiple layers stacked together can be seen as a function F, and the set of functions that F can reach/model forms a function space F`.

Having your desired function inside F` would be a lucky coincidence, but more often than not, it is not the case. Adding layers allows us to expand and shift the function space F`, covering a larger region of the parent space consisting of all possible functions in the conceivable universe. But this method has an inherent pitfall: as the function space becomes larger, there is no guarantee that we get closer to our target function. In fact, there is a good chance that during experimentation you move away from the region that contains the function you actually need.

Did the jargon confuse you? Let’s take an analogy of a needle and a
haystack.
Let the needle be the perfect weights of the neural network, or as
explained before, a function. Let the haystack be all the possible
functions that can be made.

One starts from a single search area and tries to zero in on the needle from there. Adding layers is equivalent to moving your search area and making it bigger. But that comes with the risk of moving away from the place where the needle actually is, as well as making the search more time-consuming and difficult. The larger the haystack, the more difficult it is to find the perfect needle. What is the solution, then?

Quite simple and elegant, actually: nest your function spaces.

This is done for a few simple reasons, the most important one being that it lets you ensure that while the model adds layers to increase the size of the function space, you do not end up degrading the model. It gives the guarantee that while our model can do better with more layers, it will not do any worse.

Coming back to our haystack analogy, this is equivalent to making our search space larger while making sure that we do not move away from our current search area.

(Zhang et al., 2021)

3. Vanishing/Exploding Gradient: This is one of the most common problems plaguing the training of larger/deeper neural networks, and it results from an oversight in the numerical stability of the network's parameters.
During back-propagation, as we keep moving from the deep layers towards the shallow layers, the chain rule of differentiation makes us multiply the gradients together. Often these gradients are small, of the order of 10⁻⁵ or less.
By some simple math, as these small numbers keep getting multiplied with each other, they become infinitesimally smaller, making almost negligible changes to the weights.

On the other end of the spectrum, there are cases when the gradients reach orders of 10⁴ or more. As these large gradients multiply with each other, the values tend towards infinity. Allowing such a large range of values in the numerical domain of the weights makes convergence difficult to achieve.
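To make the scale of the issue concrete, here is a rough back-of-the-envelope illustration (added here, not from the original article) of fifty such per-layer factors being multiplied by the chain rule:

# fifty layers, each contributing a very small or a very large gradient factor
small_factor, large_factor = 1e-5, 1e4
print(small_factor ** 50)  # ~1e-250: updates to the shallow layers all but vanish
print(large_factor ** 50)  # 1e+200: the gradient explodes towards infinity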

This problem is popularly known as the Vanishing/Exploding Gradient problem, and ResNet's architecture largely prevents it from occurring. How so? The skip connections (described ahead) act as gradient super-highways, allowing the gradient to flow back to the shallow layers without being attenuated or amplified by a large magnitude.

What are Skip Connections?

The ResNet paper popularized the approach of using Skip Connections. If you recall, the solution to our function-space problem was to nest the spaces. Applied to our use-case, this takes the form of a simple addition of the identity function to the output of a block of layers.

In mathematical terms, it would mean y = x + F(x), where y is the final output of the layer.

(He et al., 2015)

In terms of architecture, if any layer ends up damaging the performance of the model in a plain network, it gets skipped due to the presence of the skip connections.
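A minimal Keras sketch of such a skip connection, assuming an illustrative input shape of (56, 56, 64) and a two-convolution residual branch F (not the exact block used later in the article), could look like this:

from tensorflow import keras

inputs = keras.Input(shape=(56, 56, 64))
# F(x): a small stack of layers forming the residual branch
f_x = keras.layers.Conv2D(64, 3, padding='same', activation='relu')(inputs)
f_x = keras.layers.Conv2D(64, 3, padding='same')(f_x)
# the skip connection adds the identity back onto the residual: y = x + F(x)
outputs = keras.layers.ReLU()(keras.layers.Add()([inputs, f_x]))
skip_block = keras.Model(inputs, outputs)

Because the gradient of the addition passes through unchanged, the shortcut path gives the gradients a direct route back to the shallow layers, which is exactly the super-highway effect described earlier.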

Architecture

ResNet-50 architecture

The ResNet-50 architecture can be broken down into 6 parts:

1. Input Pre-processing
2. Cfg[0] blocks
3. Cfg[1] blocks
4. Cfg[2] blocks
5. Cfg[3] blocks
6. Fully-connected layer

Different versions of the ResNet architecture use a varying number of Cfg blocks at different levels, as mentioned in the figure above. A detailed, informative listing can be found below.

(He et al., 2015)
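For quick reference, the per-stage block counts from that listing can be written down as a small lookup table (counts as reported in He et al., 2015):

# number of blocks per stage, Cfg[0] .. Cfg[3]
RESNET_BLOCKS = {
    'ResNet-18': [2, 2, 2, 2],    # basic (2-layer) blocks
    'ResNet-34': [3, 4, 6, 3],    # basic (2-layer) blocks
    'ResNet-50': [3, 4, 6, 3],    # bottleneck (3-layer) blocks
    'ResNet-101': [3, 4, 23, 3],  # bottleneck (3-layer) blocks
    'ResNet-152': [3, 8, 36, 3],  # bottleneck (3-layer) blocks
}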

Show me the code!

The best way to understand the concept is through some code. The implementation below is done in Keras and uses the standard ResNet-50 architecture (ResNet has several versions, differing in the depth of the network). We will train the model on the famous Stanford Dogs dataset by Stanford AI.
Import headers
!pip install -q tensorflow_datasets
import tensorflow as tf
from tensorflow import keras
import tensorflow_datasets as tfds
import os
import PIL
import pathlib
import PIL.Image
import warnings
warnings.filterwarnings("ignore")
from datetime import datetime

Dataset download and pre-processing

We download the Stanford Dogs dataset using Tensorflow Datasets (stable) and split it into a training, validation and test set.

Along with the images and labels, we also get some meta-data which
gives us more information about the dataset. That is stored
in ds_info and printed in a human-readable manner.

We also make use of tfds.show_examples() to print some random example images and labels from the dataset.

We run tfds.benchmark() to perform a benchmarking test on the iterator provided by tf.data.Dataset.

We perform the following best-practice steps on the tf.data.Dataset object to make it efficient:
• batch(BATCH_SIZE) : Prepares mini-batches within the dataset. Note that the batching operation requires all images to be of the same size and have the same number of channels.
• map(format_image) : Casts the image into a tf.float32 Tensor, normalizes all values into the range [0, 1], and resizes the image from its original shape to the model-input shape of (224, 224, 3) using the lanczos3 kernel method.
• prefetch(BUFFER_SIZE) : Pre-fetching brings the next batch of the dataset into memory while the current batch is being processed, reducing I/O wait time at the cost of some extra memory.
• cache() : Caches the elements of the dataset after the first pass over the data, so that subsequent epochs read from the cache instead of re-running the earlier pipeline steps.
(train_ds, valid_ds, test_ds), ds_info = tfds.load(
    'stanford_dogs',
    split=['train', 'test[0%:10%]', 'test[10%:]'],
    shuffle_files=True, with_info=True,
    as_supervised=True
)

print("Dataset info: \n")
print(f'Name: {ds_info.name}\n')
print(f'Number of training samples : {ds_info.splits["train"].num_examples}\n')
print(f'Number of test samples : {ds_info.splits["test"].num_examples}\n')
print(f'Description : {ds_info.description}')

tfds.show_examples(train_ds, ds_info)

CLASS_TYPES = ds_info.features['label'].num_classes
BATCH_SIZE = 4

print('Benchmark results')
tfds.benchmark(train_ds)

def format_image(image, label):
    image = tf.cast(image, tf.float32)
    image = image / 255.0
    image = tf.image.resize_with_pad(image, 224, 224, method='lanczos3', antialias=True)
    return image, label

def prepare_ds(ds):
    ds = ds.map(format_image)
    ds = ds.batch(BATCH_SIZE)
    ds = ds.prefetch(tf.data.AUTOTUNE)
    ds = ds.cache()
    return ds

train_ds = prepare_ds(train_ds)
valid_ds = prepare_ds(valid_ds)
test_ds = prepare_ds(test_ds)
Output:
Downloading and preparing dataset 778.12 MiB (download: 778.12 MiB, generated: Unknown size,
total: 778.12 MiB) to /root/tensorflow_datasets/stanford_dogs/0.2.0...
Dataset stanford_dogs downloaded and prepared to /root/tensorflow_datasets/stanford_dogs/0.2.0.
Subsequent calls will reuse this data.

Dataset info:

Name: stanford_dogs
Number of training samples : 12000
Number of test samples : 8580
Description : The Stanford Dogs dataset contains images of 120 breeds of dogs from around
the world. This dataset has been built using images and annotation from
ImageNet for the task of fine-grained image categorization. There are
20,580 images, out of which 12,000 are used for training and 8580 for
testing. Class labels and bounding box annotations are provided
for all the 12,000 images.

Benchmark results
************ Summary ************
Examples/sec (First included) 787.00 ex/sec (total: 12000 ex, 15.25 sec)
Examples/sec (First only) 10.34 ex/sec (total: 1 ex, 0.10 sec)
Examples/sec (First excluded) 791.95 ex/sec (total: 11999 ex, 15.15 sec)

Augmentation
imageAug = keras.Sequential([
keras.layers.RandomFlip("horizontal_and_vertical"),
keras.layers.RandomRotation(0.2),
keras.layers.RandomContrast(0.2)
])

We perform some data augmentation to make our model more robust. A RandomFlip, RandomRotation and RandomContrast are used to make the image set more varied. Note that the arguments passed to these layers are not probabilities: RandomFlip flips each image at random, while the factors given to RandomRotation and RandomContrast control the maximum strength of the rotation (as a fraction of 2π) and of the contrast jitter applied to each image.

Cfg0 Block

This block contains 1 Conv Layer and 2 Identity Layers. To help numerical stability, we specify a kernel constraint which makes sure that all weights are normalized at regular intervals. Between two consecutive layers, we also include a BatchNormalization layer. The code has been written in an explicit way deliberately, to help readers understand what design choices have been made at each stage.

• Input Shape : (56, 56, 64)
• Output Shape : (56, 56, 256)
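Since the block code itself is not reproduced in this extract, the snippet below is a minimal sketch of how such a block can be written with the Keras Functional API. The bottleneck widths (64, 64, 256) follow the standard ResNet-50 first stage, and MaxNorm stands in for the kernel constraint mentioned above; both are assumptions rather than the author's exact choices.

from tensorflow import keras

def conv_bn(x, filters, kernel_size, strides=1):
    # Conv2D followed by BatchNormalization; MaxNorm keeps kernel weights bounded (assumed constraint)
    x = keras.layers.Conv2D(filters, kernel_size, strides=strides, padding='same',
                            kernel_constraint=keras.constraints.MaxNorm(2.0))(x)
    return keras.layers.BatchNormalization()(x)

def bottleneck_block(x, filters, strides=1, project_shortcut=False):
    # one bottleneck residual block: 1x1 -> 3x3 -> 1x1 convolutions plus a skip connection
    f1, f2, f3 = filters
    shortcut = x
    if project_shortcut or strides != 1:
        # "Conv" variant: project the shortcut so the addition has matching shapes
        shortcut = conv_bn(x, f3, 1, strides)
    y = keras.layers.ReLU()(conv_bn(x, f1, 1, strides))
    y = keras.layers.ReLU()(conv_bn(y, f2, 3))
    y = conv_bn(y, f3, 1)
    y = keras.layers.Add()([shortcut, y])  # y = x + F(x)
    return keras.layers.ReLU()(y)

def cfg0_block(x):
    # (56, 56, 64) -> (56, 56, 256): one Conv block followed by two Identity blocks
    x = bottleneck_block(x, (64, 64, 256), project_shortcut=True)
    x = bottleneck_block(x, (64, 64, 256))
    x = bottleneck_block(x, (64, 64, 256))
    return x

The Cfg1, Cfg2 and Cfg3 blocks described next follow exactly the same pattern, only with wider filters and a stride of 2 in their first Conv block to halve the spatial resolution.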

Cfg1 Block

This block contains 1 Conv Layer and 3 Identity Layers. It is similar to the Cfg0 blocks, the main difference being the larger number of out_channels in the Conv and Identity layers.

• Input Shape : (56, 56, 256)
• Output Shape : (28, 28, 512)

Cfg2 Block

This block contains 1 Conv layer and 5 Identity layers. This is one of the more important blocks for ResNet, as most versions of the model differ in the number of blocks used at this stage.

• Input Shape : (28, 28, 512)
• Output Shape : (14, 14, 1024)
Cfg3 Block

This block contains 1 Conv Layer and 2 Identity Layers. This is the last
set of Convolutional Layer blocks present in the network.

• Input Shape : (14, 14, 1024)
• Output Shape : (7, 7, 2048)

Classifier Block

This block contains an AveragePooling layer, a Dropout layer and a Flatten layer. Here, the feature map is finally flattened and pushed into a Fully Connected layer, which is then used for producing predictions. A Softmax activation converts the final logits into class probabilities.

• Input Shape : (7, 7, 2048)
• Output Shape : (1, CLASS_TYPES)
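As with the Cfg blocks, the classifier code is not included in this extract, so here is a minimal sketch matching the description above (the dropout rate is an illustrative assumption):

from tensorflow import keras

def classifier_block(x, num_classes):
    # (7, 7, 2048) -> (num_classes,): pool, regularize, flatten, then predict
    x = keras.layers.AveragePooling2D(pool_size=7)(x)
    x = keras.layers.Dropout(0.3)(x)  # rate chosen for illustration only
    x = keras.layers.Flatten()(x)
    # softmax converts the final logits into class probabilities
    return keras.layers.Dense(num_classes, activation='softmax')(x)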

Build ResNet Model

Now we take all the blocks and join them together to create the final ResNet model. Throughout the process, we have used the Keras Functional API, which is considered best practice for Tensorflow.

We also perform some visualizations, namely model.summary() to print out the structure of the model's layers and keras.utils.plot_model() to plot the model's Directed Acyclic Graph, which Tensorflow uses in the backend to streamline execution.
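Below is a sketch of that assembly, reusing the bottleneck_block and classifier_block helpers sketched earlier; stage widths and strides follow the standard ResNet-50 layout, and the optimizer and loss are illustrative choices rather than necessarily the author's:

from tensorflow import keras

def make_stage(x, filters, n_identity, strides):
    # one Conv (projection) block followed by n_identity Identity blocks
    x = bottleneck_block(x, filters, strides=strides, project_shortcut=True)
    for _ in range(n_identity):
        x = bottleneck_block(x, filters)
    return x

inputs = keras.Input(shape=(224, 224, 3), name='input')
x = imageAug(inputs)                                            # augmentation defined earlier
x = keras.layers.Conv2D(64, 7, strides=2, padding='same')(x)    # 7x7/2 stem -> (112, 112, 64)
x = keras.layers.MaxPooling2D(3, strides=2, padding='same')(x)  # -> (56, 56, 64)
x = make_stage(x, (64, 64, 256), n_identity=2, strides=1)       # Cfg0 -> (56, 56, 256)
x = make_stage(x, (128, 128, 512), n_identity=3, strides=2)     # Cfg1 -> (28, 28, 512)
x = make_stage(x, (256, 256, 1024), n_identity=5, strides=2)    # Cfg2 -> (14, 14, 1024)
x = make_stage(x, (512, 512, 2048), n_identity=2, strides=2)    # Cfg3 -> (7, 7, 2048)
outputs = classifier_block(x, CLASS_TYPES)

model = keras.Model(inputs, outputs, name='resnet50')
model.compile(optimizer=keras.optimizers.Adam(),
              loss=keras.losses.SparseCategoricalCrossentropy(),
              metrics=['accuracy'])
model.summary()
keras.utils.plot_model(model, show_shapes=True)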
Model: "resnet50"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
input (InputLayer) [(None, 224, 224, 3)] 0

sequential (Sequential) (None, 224, 224, 3) 0

conv2d_28 (Conv2D) (None, 112, 112, 64) 9472

max_pooling2d (MaxPooling2D) (None, 56, 56, 64) 0

cfg0_block (Functional) (None, 56, 56, 256) 148480

cfg1_block (Functional) (None, 28, 28, 512) 665600

cfg2_block (Functional) (None, 14, 14, 1024) 2641920

cfg3_block (Functional) (None, 7, 7, 2048) 10526720

classifier (Functional) (None, 120) 3932280

=================================================================
Total params: 17,924,472
Trainable params: 17,893,752
Non-trainable params: 30,720
_________________________________________________________________
None

Defining Callbacks

In model.fit(), we can define callbacks for the model that are invoked
during training at pre-determined intervals. We define a Model
Checkpoint callback that creates a snapshot of the model at the
completion of each epoch.
callbacks_list = [
keras.callbacks.ModelCheckpoint(
filepath='resnet50_model/checkpoint_{epoch:02d}.hdf5',
monitor='val_loss',
verbose=0,
save_best_only=True,
mode='auto',
save_freq='epoch',
options=None,
initial_value_threshold=None
)
]

history = model.fit(
x=train_ds,
validation_data=valid_ds,
callbacks=callbacks_list,
epochs=20
)

If we wish to use a previously-saved model, we can do so too.

## If using Google Colaboratory, one can upload checkpoints onto Google Drive and use them directly.
from google.colab import drive
drive.mount('/content/gdrive')
model = keras.models.load_model('/content/gdrive/My Drive/checkpoint_18.hdf5')

## If using local Jupyter Notebooks, one can use checkpoints from the local drive itself.
model = keras.models.load_model('./resnet50_model/checkpoint_18.hdf5')

Get model history

We print the model's training history to get more information about the training process; the per-epoch metrics are stored in the History object's history attribute.
print(history.history)

Predicting results

We take the trained model, use it to perform predictions on the test set, and calculate metrics such as Loss and Accuracy.
results = model.evaluate(test_ds)
print(f"Results : {results}")
Conclusion

Above, we have visited the Residual Network architecture, gone over its salient features, implemented a ResNet-50 model from scratch, and trained it to get inferences on the Stanford Dogs dataset.

As a model, ResNet brought about a revolution in the fields of Computer Vision and Deep Learning simultaneously. It went on to win the ImageNet Large Scale Visual Recognition Challenge of 2015 as well as the COCO 2015 competition. But it was only a stepping stone to many interesting variations which yielded even better results. Check the Interesting Links section below to find some great blogs and research papers on the same.
