CNN Architectures

The document discusses key innovations in deep learning architectures, focusing on AlexNet's improvements over LeNet-5, including its deeper architecture, ReLU activations, and GPU utilization. It also describes the Inception module in GoogLeNet, highlighting its computational efficiency through parallel paths and parameter reduction. Lastly, it compares DenseNet's dense blocks with ResNet's skip connections, emphasizing differences in connection types, feature reuse, and gradient flow.


1. Explain the architectural innovations of AlexNet and how it improved upon LeNet-5.

Answer:
AlexNet, introduced by Alex Krizhevsky et al. in 2012, revolutionized deep learning by winning the ImageNet ILSVRC 2012 challenge with a top-5 error rate of about 15.3% (compared to 26.2% for the runner-up). Its key innovations over LeNet-5 include:

Deeper Architecture:
8 layers with learned weights (5 convolutional + 3 fully connected) vs. LeNet-5's 5 layers.
Enabled learning of hierarchical features (edges → textures → object parts).

ReLU Activation:
Replaced sigmoid/tanh with Rectified Linear Units (ReLUs).
Advantage: avoided saturating activations and sped up training (roughly 6x faster convergence than tanh in the original experiments).

GPU Utilization:
Trained on dual NVIDIA GTX 580 GPUs (about five to six days instead of weeks on CPUs).

Overlapping Max-Pooling:
Used 3x3 windows with stride 2 (vs. non-overlapping pooling in LeNet-5).
Benefit: reduced spatial dimensions while retaining more information.

Local Response Normalization (LRN):
Mimicked biological lateral inhibition to encourage competition among neurons.
Later replaced by batch normalization in modern networks.

Dropout:
Randomly deactivated 50% of the neurons in the fully connected layers during training.
Reduced overfitting (critical for a network with ~60M parameters).

Comparison with LeNet-5:

Feature LeNet-5 AlexNet

Depth 5 layers 8 layers

Activatio
Sigmoid/Tanh ReLU
n

Small (MNIST Large


Scale
digits) (ImageNet)

Hardwar
CPU GPU-optimized
e
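For concreteness, here is a minimal PyTorch-style sketch of an AlexNet-like network (5 conv + 3 fully connected layers, overlapping 3x3/stride-2 pooling, ReLU, and dropout). The channel counts follow the commonly used single-stream variant and should be read as illustrative, not an exact reproduction of the original two-GPU model:

import torch
import torch.nn as nn

class AlexNetSketch(nn.Module):
    """Minimal AlexNet-style network: 5 conv layers + 3 fully connected layers."""
    def __init__(self, num_classes=1000):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 96, kernel_size=11, stride=4, padding=2),   # conv1
            nn.ReLU(inplace=True),
            nn.MaxPool2d(kernel_size=3, stride=2),                   # overlapping pooling
            nn.Conv2d(96, 256, kernel_size=5, padding=2),            # conv2
            nn.ReLU(inplace=True),
            nn.MaxPool2d(kernel_size=3, stride=2),
            nn.Conv2d(256, 384, kernel_size=3, padding=1),           # conv3
            nn.ReLU(inplace=True),
            nn.Conv2d(384, 384, kernel_size=3, padding=1),           # conv4
            nn.ReLU(inplace=True),
            nn.Conv2d(384, 256, kernel_size=3, padding=1),           # conv5
            nn.ReLU(inplace=True),
            nn.MaxPool2d(kernel_size=3, stride=2),
        )
        self.classifier = nn.Sequential(
            nn.Dropout(p=0.5),                                       # dropout in FC layers
            nn.Linear(256 * 6 * 6, 4096),
            nn.ReLU(inplace=True),
            nn.Dropout(p=0.5),
            nn.Linear(4096, 4096),
            nn.ReLU(inplace=True),
            nn.Linear(4096, num_classes),
        )

    def forward(self, x):
        x = self.features(x)
        x = torch.flatten(x, 1)
        return self.classifier(x)

# Quick shape check with a dummy 227x227 RGB batch.
out = AlexNetSketch()(torch.randn(1, 3, 227, 227))
print(out.shape)  # torch.Size([1, 1000])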

Conclusion: AlexNet’s scale and innovations (ReLU, GPU, dropout) made it the first
modern CNN, paving the way for deeper architectures.

2. Describe the Inception module in GoogLeNet. Why is it computationally efficient?
(10 marks)

Answer:
The Inception module is the core building block of GoogLeNet (2014), which reduced the top-5 error to 6.7% with only about 6M parameters (vs. AlexNet's 60M).

Structure of the Inception Module:

Parallel Paths:
1x1 convolutions: cheap channel-wise transformations.
3x3 and 5x5 convolutions: capture spatial patterns at different scales.
3x3 max-pooling: preserves spatial features.

Filter Concatenation:
Outputs of all paths are depth-concatenated (requires "SAME" padding so every path keeps the same width/height).

1x1 Bottlenecks:
Applied before the 3x3/5x5 convolutions to reduce the number of channels (e.g., 256 → 64).
Example: a 5x5 convolution mapping 256 channels to 64 costs 256×5×5×64 = 409,600 multiplications per output position, but with a 1x1 bottleneck (256×1×1×64 = 16,384 plus 64×5×5×64 = 102,400) the total drops to 118,784, as the short check below confirms.
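A few lines of Python reproduce the bottleneck arithmetic (multiplications per output position, ignoring biases; the channel counts are the illustrative ones from the example above):

# Direct 5x5 convolution, 256 -> 64 channels.
in_ch, out_ch = 256, 64
direct = in_ch * 5 * 5 * out_ch                  # 409,600 multiplications

# Same mapping with a 1x1 bottleneck down to 64 channels first.
bottleneck = 64
reduce_cost = in_ch * 1 * 1 * bottleneck         # 16,384
conv_cost = bottleneck * 5 * 5 * out_ch          # 102,400
total = reduce_cost + conv_cost                  # 118,784

print(direct, total, round(direct / total, 2))   # 409600 118784 3.45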

Computational Efficiency:

Parameter Reduction: 1x1 convolutions compress channels before the expensive operations.
Multi-Scale Processing: fine and coarse features are captured in parallel.
Sparse Connectivity: the module approximates a sparse connection pattern using dense operations that run efficiently on GPUs.

Mathematical Insight:
For an input tensor X of shape (H, W, C), the module computes:

Output = Concat[
    Conv1x1(X),
    Conv3x3(Conv1x1(X)),
    Conv5x5(Conv1x1(X)),
    Conv1x1(MaxPool3x3(X))
]

where the pooling path is followed by a 1x1 projection in the original GoogLeNet design.
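A minimal PyTorch-style sketch of this module follows; the branch channel counts are illustrative assumptions, not the exact GoogLeNet configuration:

import torch
import torch.nn as nn

class InceptionSketch(nn.Module):
    """One Inception-v1-style block: four parallel paths, depth-concatenated."""
    def __init__(self, in_ch, c1, c3_reduce, c3, c5_reduce, c5, pool_proj):
        super().__init__()
        self.branch1 = nn.Conv2d(in_ch, c1, kernel_size=1)            # 1x1 path
        self.branch3 = nn.Sequential(
            nn.Conv2d(in_ch, c3_reduce, kernel_size=1),               # 1x1 bottleneck
            nn.ReLU(inplace=True),
            nn.Conv2d(c3_reduce, c3, kernel_size=3, padding=1),       # 3x3 path
        )
        self.branch5 = nn.Sequential(
            nn.Conv2d(in_ch, c5_reduce, kernel_size=1),               # 1x1 bottleneck
            nn.ReLU(inplace=True),
            nn.Conv2d(c5_reduce, c5, kernel_size=5, padding=2),       # 5x5 path
        )
        self.branch_pool = nn.Sequential(
            nn.MaxPool2d(kernel_size=3, stride=1, padding=1),         # 3x3 max-pool
            nn.Conv2d(in_ch, pool_proj, kernel_size=1),               # pool projection
        )

    def forward(self, x):
        # Padding keeps H and W identical across branches, so the outputs
        # can be depth-concatenated along the channel dimension.
        return torch.cat(
            [self.branch1(x), self.branch3(x), self.branch5(x), self.branch_pool(x)],
            dim=1,
        )

# Example: 256 input channels -> 64 + 128 + 32 + 32 = 256 output channels.
block = InceptionSketch(256, c1=64, c3_reduce=96, c3=128, c5_reduce=16, c5=32, pool_proj=32)
y = block(torch.randn(1, 256, 28, 28))
print(y.shape)  # torch.Size([1, 256, 28, 28])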

Conclusion: Inception balances depth and efficiency via bottlenecks and multi-scale aggregation.
3. What are skip connections in ResNet? Prove mathematically why
they mitigate vanishing gradients.

(15 marks)

Answer:

Skip Connections (Residual Learning):

Definition: Shortcut paths that add input x to the output of a layer block F(x).
Formula: H(x) = F(x) + x, where F(x) learns the residual (difference from identity).

Why They Work:

Gradient Flow:
During backpropagation, the gradient with respect to the input x is
∂Loss/∂x = ∂Loss/∂H(x) × (∂F(x)/∂x + 1)
The "+1" term gives the gradient a direct path, so it never vanishes completely, even if ∂F(x)/∂x ≈ 0.

Identity Mapping:
At initialization, F(x) ≈ 0 ⇒ H(x) ≈ x.
Early training stages therefore benefit from near-identity transformations.

Deep Network Training:
In a 152-layer ResNet, skip connections allow gradients to propagate directly to the early layers.

Mathematical Proof:
Consider a chain of residual blocks H_{i+1} = H_i + F_i(H_i). Unrolling the recursion from block l to block L gives:

H_L = H_l + Σ_{i=l}^{L-1} F_i(H_i)

The gradient with respect to the output of block l is therefore:

∂H_L/∂H_l = 1 + ∂(Σ_{i=l}^{L-1} F_i(H_i))/∂H_l

Even if the summed residual derivatives tend to 0, the constant 1 contributed by the identity path remains, so the gradient cannot vanish.

Visualization:

Input (x) ──→ [Weight Layer → ReLU → Weight Layer] ──→ (+) ──→ Output H(x)
     └──────────────── identity shortcut ────────────────┘
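To illustrate both the block structure and the gradient argument, here is a minimal PyTorch-style sketch; the layer sizes are arbitrary assumptions, and the scalar toy example simply confirms the "+1" identity term in the gradient:

import torch
import torch.nn as nn

class ResidualBlockSketch(nn.Module):
    """H(x) = F(x) + x, where F is a small two-layer transformation."""
    def __init__(self, channels):
        super().__init__()
        self.f = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
        )

    def forward(self, x):
        return self.f(x) + x        # skip connection: identity added to the residual

# Scalar toy version of the gradient argument: H(x) = F(x) + x with F(x) = w*x.
w = torch.tensor(0.0, requires_grad=True)   # residual branch contributes ~0 gradient
x = torch.tensor(3.0, requires_grad=True)
h = w * x + x                               # residual formulation
h.backward()
print(x.grad)                               # tensor(1.) -> dH/dx = w + 1, never 0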

Conclusion: Skip connections enable ultra-deep networks (e.g., ResNet-152) by preserving gradient flow.

4. Compare DenseNet’s dense blocks with ResNet’s skip connections.

(10 marks)

Answer:
Feature         ResNet                           DenseNet
Connection      Additive (H(x) = F(x) + x)       Concatenative ([x, F1(x), F2(x), ...])
Feature Reuse   Single path per block            All previous features reused
Parameters      Moderate (~25M for ResNet-50)    Economical (~8M for DenseNet-121)
Gradient Flow   Preserved via addition           Enhanced via concatenation
Structure       Residual blocks                  Dense blocks + transition layers

Key Differences:

DenseNet:
Growth rate (k): each layer adds k new feature maps (e.g., k = 12).
Bottlenecks: 1x1 convolutions compress channels before the 3x3 convolutions.
Transition Layers: batch norm + 1x1 conv + 2x2 average pooling between dense blocks.

ResNet:
Simpler, but requires careful initialization for very deep networks.

Example:

Dense Block:
Layer1: [x]
Layer2: [x, F1(x)]
Layer3: [x, F1(x), F2(x)]
Res Block:
Layer1: x
Layer2: x + F1(x)
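A minimal PyTorch-style dense block sketch makes the concatenation explicit; the growth rate and layer count here are illustrative assumptions:

import torch
import torch.nn as nn

class DenseBlockSketch(nn.Module):
    """Each layer receives the concatenation of all previous feature maps."""
    def __init__(self, in_ch, growth_rate=12, num_layers=3):
        super().__init__()
        self.layers = nn.ModuleList()
        for i in range(num_layers):
            self.layers.append(nn.Sequential(
                nn.BatchNorm2d(in_ch + i * growth_rate),
                nn.ReLU(inplace=True),
                nn.Conv2d(in_ch + i * growth_rate, growth_rate, kernel_size=3, padding=1),
            ))

    def forward(self, x):
        features = [x]                                 # running list of all feature maps
        for layer in self.layers:
            out = layer(torch.cat(features, dim=1))    # concatenate, then transform
            features.append(out)                       # new features reused by later layers
        return torch.cat(features, dim=1)              # block output: [x, F1, F2, ...]

# 16 input channels + 3 layers x growth rate 12 = 52 output channels.
block = DenseBlockSketch(in_ch=16)
y = block(torch.randn(1, 16, 32, 32))
print(y.shape)  # torch.Size([1, 52, 32, 32])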

Conclusion: DenseNet improves feature reuse but requires more memory; ResNet is
simpler and widely adopted.
