SlideShare a Scribd company logo
2
Most read
3
Most read
14
Most read
Convolution Neural
Network (CNN)
A . Suraj Kumar
Name
Your title here
Presentations are
communication tools
Your title here
Presentations are
communication tools
Your title here
Presentations are
communication tools
Your title here
Presentations are
communication tools
Your title here
Presentations are
communication tools
Your title here
Presentations are
communication tools
01
02
03
04
05
06
2
Convolution Neural Network (CNN)
3
Contents
01
Your title here
02
Your title here
03
Your title here
04
Your title here
05
Your title here
● Computer vision is concerned with the automatic extraction, analysis and
understanding of useful information from a single image or a sequence of images.
- The British Machine Vision Association and Society for Pattern Recognition (BMVA)
(or)
● It is an interdisciplinary field that deals with how computers can be made to gain
high-level understanding from digital images or videos.
- Wikipedia
4
Introduction to Computer Vision
Introduction to Computer
Vision
Computer vision is concerned with the automatic
extraction, analysis and understanding of useful
information from a single image or a sequence of images.
It is an interdisciplinary field that deals with how
computers can be made to gain high-level
understanding from digital images or videos.
6
● It is a class of deep learning.
● Convolutional neural network (ConvNet’s or CNNs) is one of the main
categories to do images recognition, images classifications, objects
detections, recognition faces etc.,
● It is similar to the basic neural network. CNN also have learnable
parameter like neural network i.e., weights, biases etc.
● CNN is heavily used in computer vision
● There 3 basic components to define CNN
○ The Convolution Layer
○ The Pooling Layer
○ The Output Layer (or) Fully Connected Layer
What is CNN(Convolution Neural Network)
7
Architecture of CNN
8
● Computers read images as pixels and it is expressed as matrix (NxNx3)—
(height by width by depth).
● The Convolutional Layer makes use of a set of learnable filters. A filter is
used to detect the presence of specific features or patterns present in the
original image (input).
● It is usually expressed as a matrix (MxMx3), with a smaller dimension but
the same depth as the input file.
● This filter is convolved (slided) across the width and height of the input
file, and a dot product is computed to give an activation map.
Convolution Layer
9
Images source: Analytics Vidhya
Convolution Layer
10
Images source: Analytics
Convolution Layer
The concept of stride and padding:
● The weight of a matrix moves 1 pixel at a time is called as stride 1 (as we did in
above case).
What if we increase the stride value?
11
Images source: Analytics Vidhya
Convolution Layer
● As we can see in above image the increase in the stride value decreases the size of
the image (which may cause in losing the features of the image).
● Padding the input image across it solves our problem, we add more than one layer of
zeros around the image in case of higher stride values.
12
Images source: Analytics Vidhya
Convolution Layer
● when the input of 6x6 is padded around with zeros we get the output with same
dimensions of 6x6 this is known as ‘Same Padding’.
● The middle 4x4 pixel remains the same, here we have retained the more
information from borders and also preserved the size of image.
13
Images source: Analytics Vidhya
Convolution Layer
● It can be seen in between the convolution layers in CNN architecture.
● This layer basically reduces the amount of parameters and computation in the
network.
● Pooling is done for the sole purpose of reducing the spatial size of the image.
● Pooling is done independently on each depth dimension, therefore the depth of
the image remains unchanged. The most common form of pooling layer generally
applied is the max pooling.
14
Images source: Analytics Vidhya
The Pooling Layer
● Let's try to understand from below image how max pooling has affected our image
below.
15
Images source: Analytics Vidhya
The Pooling Layer
● After multiple layers of convolution and padding, we would need the output in the
form of a class.
● The convolution and pooling layers would only be able to extract features and
reduce the number of parameters from the original images.
● However, to generate the final output we need to apply a fully connected layer to
generate an output equal to the number of classes we need.
● Convolution layers generate 3D activation maps while we just need the output as
whether or not an image belongs to a particular class.
● The output layer will have the loss function like categorical cross-entropy, to
compute the error in prediction.
● Once the forward pass is complete the backpropagation begins to update the
weight and biases for error and loss reduction.
16
The Output Layer
How to decide the number of convolution layers and number of filters in
CNN ?
● Deeper networks is always better, at the cost of more data and increased
complexity of learning.
● You should initially use fewer filters and gradually increase and monitor the
error rate to see how it is varying.
● Very small filter sizes will capture very fine details of the image. On the other
hand having a bigger filter size will leave out minute details in the image.
17
CNN
● Based on the problems, we have the different CNN’s which are used in
computer vision.
● The five major computer vision techniques which can be addressed using
CNN.
■ Image Classification
■ Object Detection
■ Object Tracking
■ Semantic Segmentation
■ Instance Segmentation
18
Types of CNN
Image Classification:
● In an image classification we can use the traditional CNN models or there also
many architectures designed by developers to decrease the error rate and
increasing the trainable parameters.
■ LeNet (1998)
■ AlexNet (2012)
■ ZFNet (2013)
■ GoogLeNet19 (2014)
■ VGGNet 16 (2014)
■ ResNet(2015)
19
Types of CNN
Object Detection:
● Here the implementation of CNN is different compared to the previous
image classification.
● Here the task is to identify the objects present in the image, therefore
traditional implementation of CNN may not help.
■ R CNN
■ Fast R CNN
■ Faster R CNN
■ YOLO
20
Types of CNN
● Computer vision, an AI technology that allows computers to understand
and label images, is now used in convenience stores, driverless car
testing, daily medical diagnostics, and in monitoring the health of crops
and livestock.
● Different use cases found in the computer vision as follows
■ Retail and Retail Security
■ Automotive
■ Healthcare
■ Banking
■ Agriculture
■ Industrial
21
Applications of Computer Vision
● Here in this seminar we tried to understand what is Computer
Vision.
● What is CNN and its different layers.
● Different types of CNN and its uses in Computer Vision techniques.
● Different Applications of Computer Vision.
22
Conclusion
● https://www.analyticsvidhya.com/blog/2017/06/architecture-of-convolution
al-neural-networks-simplified-demystified/
● https://towardsdatascience.com/covolutional-neural-network-cb0883dd65
29
● https://medium.com/technologymadeeasy/the-best-explanation-of-convol
utional-neural-networks-on-the-internet-fbb8b1ad5df8
● https://adeshpande3.github.io/A-Beginner%27s-Guide-To-Understanding-C
onvolutional-Neural-Networks/
● http://www.image-net.org/
23
References
24
Thank you!
Name

More Related Content

PPTX
Convolution Neural Network (CNN)
Suraj Aavula
 
PPTX
Convolution Neural Network_Artificial Intelligence_Good.pptx
AnnavarapuRajavarapr
 
PPTX
NMO IE-2 Activity Presentation.pptx
LEGENDARYTECHNICAL
 
PPTX
NMO IE-2 Activity Presentation.pptx
LEGENDARYTECHNICAL
 
PPTX
Introduction to Convolutional Neural Networks (CNNs).pptx
CHRISEVANS269099
 
PPTX
Convolutional Neural Network (CNN)of Deep Learning
alihassaah1994
 
PPTX
build a Convolutional Neural Network (CNN) using TensorFlow in Python
Kv Sagar
 
PDF
Overview of Convolutional Neural Networks
ananth
 
Convolution Neural Network (CNN)
Suraj Aavula
 
Convolution Neural Network_Artificial Intelligence_Good.pptx
AnnavarapuRajavarapr
 
NMO IE-2 Activity Presentation.pptx
LEGENDARYTECHNICAL
 
NMO IE-2 Activity Presentation.pptx
LEGENDARYTECHNICAL
 
Introduction to Convolutional Neural Networks (CNNs).pptx
CHRISEVANS269099
 
Convolutional Neural Network (CNN)of Deep Learning
alihassaah1994
 
build a Convolutional Neural Network (CNN) using TensorFlow in Python
Kv Sagar
 
Overview of Convolutional Neural Networks
ananth
 

Similar to computervisionanditsapplications-190311134821.pptx (20)

PPTX
Convolutional-Neural-Networks-CNNs-Powering-the-Future-of-Computer-Vision (1)...
HarshDeshmukh46
 
PPTX
Mnist report ppt
RaghunandanJairam
 
PPTX
Convolutional Neural Networks
milad abbasi
 
PPTX
Cnn
Mehrnaz Faraz
 
PDF
DL.pdf
ssuserd23711
 
PPTX
Convolutional Neural Network and Its Applications
Kasun Chinthaka Piyarathna
 
PPTX
Introduction to Computer Vision and its Applications
RamSIyer2
 
PPTX
Introduction to convolutional networks .pptx
ArunNegi37
 
PDF
Mnist report
RaghunandanJairam
 
PDF
Convolutional Neural Networks: Part 1
ananth
 
PPTX
11_Saloni Malhotra_SummerTraining_PPT.pptx
SaloniMalhotra23
 
PPT
digital image processing - convolutional networks
Muhammad824617
 
PPTX
Deep learning L1-CO2-session-4 CNN .pptx
Kv Sagar
 
PPTX
CNN-ppt.pptx
vidyamali4
 
PPTX
Illustrative Introductory CNN
YasutoTamura1
 
PPTX
Cnn
rimshailyas1
 
PPTX
Convolutional neural network in deep learning
priyasjcettnj
 
PDF
convolutional_neural_networks in deep learning
ssusere5ddd6
 
PDF
CNN.pptx.pdf
Knoldus Inc.
 
PPTX
Convolutional neural network
Ferdous ahmed
 
Convolutional-Neural-Networks-CNNs-Powering-the-Future-of-Computer-Vision (1)...
HarshDeshmukh46
 
Mnist report ppt
RaghunandanJairam
 
Convolutional Neural Networks
milad abbasi
 
DL.pdf
ssuserd23711
 
Convolutional Neural Network and Its Applications
Kasun Chinthaka Piyarathna
 
Introduction to Computer Vision and its Applications
RamSIyer2
 
Introduction to convolutional networks .pptx
ArunNegi37
 
Mnist report
RaghunandanJairam
 
Convolutional Neural Networks: Part 1
ananth
 
11_Saloni Malhotra_SummerTraining_PPT.pptx
SaloniMalhotra23
 
digital image processing - convolutional networks
Muhammad824617
 
Deep learning L1-CO2-session-4 CNN .pptx
Kv Sagar
 
CNN-ppt.pptx
vidyamali4
 
Illustrative Introductory CNN
YasutoTamura1
 
Convolutional neural network in deep learning
priyasjcettnj
 
convolutional_neural_networks in deep learning
ssusere5ddd6
 
CNN.pptx.pdf
Knoldus Inc.
 
Convolutional neural network
Ferdous ahmed
 
Ad

More from OrxanMirzzad (7)

PPTX
androidos.pptxnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
OrxanMirzzad
 
PPTX
25.Aykhan Multimedia.pptxdddddddddddddddddddddddddddddd
OrxanMirzzad
 
PPTX
Deixtras Algorithm.pptxdjjdjdjdjddddddddddddddd
OrxanMirzzad
 
PPTX
Detailed_Classical_Electronic_Theory.pptx
OrxanMirzzad
 
PPTX
Crisp_and_Fuzzy_Relations_with_Plan.pptx
OrxanMirzzad
 
PPTX
05_ohms_law_.pptxhghhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh
OrxanMirzzad
 
PPT
Ch6b, drude model and conductivity of a metal (1).ppt
OrxanMirzzad
 
androidos.pptxnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn
OrxanMirzzad
 
25.Aykhan Multimedia.pptxdddddddddddddddddddddddddddddd
OrxanMirzzad
 
Deixtras Algorithm.pptxdjjdjdjdjddddddddddddddd
OrxanMirzzad
 
Detailed_Classical_Electronic_Theory.pptx
OrxanMirzzad
 
Crisp_and_Fuzzy_Relations_with_Plan.pptx
OrxanMirzzad
 
05_ohms_law_.pptxhghhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh
OrxanMirzzad
 
Ch6b, drude model and conductivity of a metal (1).ppt
OrxanMirzzad
 
Ad

Recently uploaded (20)

PDF
Choosing Between Flight Cases and Shipping Boxes for Maximum Safety.pdf
Maadhu Flight Case LLP
 
PDF
Between Porchlight and Citylight: The Joseph Kim Journey
Joseph Kim Nolensville Tennessee
 
PDF
📚Student and education importance......
kasimnagori121
 
PDF
Malaria detection through machine learning and deep learning
BhaveshGoyal26
 
PPTX
奎斯特大学文凭办理|办理QUC毕业证学位证书2025年新版学位证书影本
1cz3lou8
 
PPTX
Welcome chiku new07.pptxrfgretgregbdffgfdgfd
xoviva2925
 
PPT
Wayne Thiebaud Not Pop Art Presentation ppt
brwnidjessicaleigh
 
PPTX
Bharatanatyam-Dance-Sculpture-and-Sacred-Spaces (2).pptx
Deepshikha Maurya
 
PPTX
design for presentation purpose can used
vishveshvarvms
 
PPT
preposition-powerpoint-aus-ver1_ver_4.ppt
NhungNgLHoi
 
PPTX
Escalator_21bar064,070,073aeqweqeqe.pptx
Aaravpatel25
 
PDF
Zero no Tsukaima 1 - Zero_s Familiar.pdf
WaldeckFlugelWallens
 
PPTX
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0 (1).pptx
guddipatel10
 
PDF
Your Love Story, Beautifully Told by FliqaIndia’s Creative Experts
sandeep239227
 
DOCX
This is a security test for Inspectiv test
Firsts Lasts
 
PDF
Portfolio_Hu Jiayue .pdf
ahujiayue
 
PDF
Presentación San Patricio Ilustrativo Verde (1).pdf
andressuarezaraya912
 
PPTX
• Hinduism is not founded by any particular prophet. Buddhism was founded by ...
BeshoyGirgis2
 
PPSX
Eagle in Paintings .ppsx
guimera
 
PPTX
ENGLISH 6 WEEK 6 DAY 2.pptxLFLLLLLLLLLLLLLLLLLLLLLLLLLL
DitaSIdnay
 
Choosing Between Flight Cases and Shipping Boxes for Maximum Safety.pdf
Maadhu Flight Case LLP
 
Between Porchlight and Citylight: The Joseph Kim Journey
Joseph Kim Nolensville Tennessee
 
📚Student and education importance......
kasimnagori121
 
Malaria detection through machine learning and deep learning
BhaveshGoyal26
 
奎斯特大学文凭办理|办理QUC毕业证学位证书2025年新版学位证书影本
1cz3lou8
 
Welcome chiku new07.pptxrfgretgregbdffgfdgfd
xoviva2925
 
Wayne Thiebaud Not Pop Art Presentation ppt
brwnidjessicaleigh
 
Bharatanatyam-Dance-Sculpture-and-Sacred-Spaces (2).pptx
Deepshikha Maurya
 
design for presentation purpose can used
vishveshvarvms
 
preposition-powerpoint-aus-ver1_ver_4.ppt
NhungNgLHoi
 
Escalator_21bar064,070,073aeqweqeqe.pptx
Aaravpatel25
 
Zero no Tsukaima 1 - Zero_s Familiar.pdf
WaldeckFlugelWallens
 
Lecture 1b - Diagnostic Analytics Intro and Purpose ver 1.0 (1).pptx
guddipatel10
 
Your Love Story, Beautifully Told by FliqaIndia’s Creative Experts
sandeep239227
 
This is a security test for Inspectiv test
Firsts Lasts
 
Portfolio_Hu Jiayue .pdf
ahujiayue
 
Presentación San Patricio Ilustrativo Verde (1).pdf
andressuarezaraya912
 
• Hinduism is not founded by any particular prophet. Buddhism was founded by ...
BeshoyGirgis2
 
Eagle in Paintings .ppsx
guimera
 
ENGLISH 6 WEEK 6 DAY 2.pptxLFLLLLLLLLLLLLLLLLLLLLLLLLLL
DitaSIdnay
 

computervisionanditsapplications-190311134821.pptx

  • 2. Your title here Presentations are communication tools Your title here Presentations are communication tools Your title here Presentations are communication tools Your title here Presentations are communication tools Your title here Presentations are communication tools Your title here Presentations are communication tools 01 02 03 04 05 06 2 Convolution Neural Network (CNN)
  • 3. 3 Contents 01 Your title here 02 Your title here 03 Your title here 04 Your title here 05 Your title here
  • 4. ● Computer vision is concerned with the automatic extraction, analysis and understanding of useful information from a single image or a sequence of images. - The British Machine Vision Association and Society for Pattern Recognition (BMVA) (or) ● It is an interdisciplinary field that deals with how computers can be made to gain high-level understanding from digital images or videos. - Wikipedia 4 Introduction to Computer Vision
  • 5. Introduction to Computer Vision Computer vision is concerned with the automatic extraction, analysis and understanding of useful information from a single image or a sequence of images. It is an interdisciplinary field that deals with how computers can be made to gain high-level understanding from digital images or videos.
  • 6. 6 ● It is a class of deep learning. ● Convolutional neural network (ConvNet’s or CNNs) is one of the main categories to do images recognition, images classifications, objects detections, recognition faces etc., ● It is similar to the basic neural network. CNN also have learnable parameter like neural network i.e., weights, biases etc. ● CNN is heavily used in computer vision ● There 3 basic components to define CNN ○ The Convolution Layer ○ The Pooling Layer ○ The Output Layer (or) Fully Connected Layer What is CNN(Convolution Neural Network)
  • 8. 8 ● Computers read images as pixels and it is expressed as matrix (NxNx3)— (height by width by depth). ● The Convolutional Layer makes use of a set of learnable filters. A filter is used to detect the presence of specific features or patterns present in the original image (input). ● It is usually expressed as a matrix (MxMx3), with a smaller dimension but the same depth as the input file. ● This filter is convolved (slided) across the width and height of the input file, and a dot product is computed to give an activation map. Convolution Layer
  • 9. 9 Images source: Analytics Vidhya Convolution Layer
  • 11. The concept of stride and padding: ● The weight of a matrix moves 1 pixel at a time is called as stride 1 (as we did in above case). What if we increase the stride value? 11 Images source: Analytics Vidhya Convolution Layer
  • 12. ● As we can see in above image the increase in the stride value decreases the size of the image (which may cause in losing the features of the image). ● Padding the input image across it solves our problem, we add more than one layer of zeros around the image in case of higher stride values. 12 Images source: Analytics Vidhya Convolution Layer
  • 13. ● when the input of 6x6 is padded around with zeros we get the output with same dimensions of 6x6 this is known as ‘Same Padding’. ● The middle 4x4 pixel remains the same, here we have retained the more information from borders and also preserved the size of image. 13 Images source: Analytics Vidhya Convolution Layer
  • 14. ● It can be seen in between the convolution layers in CNN architecture. ● This layer basically reduces the amount of parameters and computation in the network. ● Pooling is done for the sole purpose of reducing the spatial size of the image. ● Pooling is done independently on each depth dimension, therefore the depth of the image remains unchanged. The most common form of pooling layer generally applied is the max pooling. 14 Images source: Analytics Vidhya The Pooling Layer
  • 15. ● Let's try to understand from below image how max pooling has affected our image below. 15 Images source: Analytics Vidhya The Pooling Layer
  • 16. ● After multiple layers of convolution and padding, we would need the output in the form of a class. ● The convolution and pooling layers would only be able to extract features and reduce the number of parameters from the original images. ● However, to generate the final output we need to apply a fully connected layer to generate an output equal to the number of classes we need. ● Convolution layers generate 3D activation maps while we just need the output as whether or not an image belongs to a particular class. ● The output layer will have the loss function like categorical cross-entropy, to compute the error in prediction. ● Once the forward pass is complete the backpropagation begins to update the weight and biases for error and loss reduction. 16 The Output Layer
  • 17. How to decide the number of convolution layers and number of filters in CNN ? ● Deeper networks is always better, at the cost of more data and increased complexity of learning. ● You should initially use fewer filters and gradually increase and monitor the error rate to see how it is varying. ● Very small filter sizes will capture very fine details of the image. On the other hand having a bigger filter size will leave out minute details in the image. 17 CNN
  • 18. ● Based on the problems, we have the different CNN’s which are used in computer vision. ● The five major computer vision techniques which can be addressed using CNN. ■ Image Classification ■ Object Detection ■ Object Tracking ■ Semantic Segmentation ■ Instance Segmentation 18 Types of CNN
  • 19. Image Classification: ● In an image classification we can use the traditional CNN models or there also many architectures designed by developers to decrease the error rate and increasing the trainable parameters. ■ LeNet (1998) ■ AlexNet (2012) ■ ZFNet (2013) ■ GoogLeNet19 (2014) ■ VGGNet 16 (2014) ■ ResNet(2015) 19 Types of CNN
  • 20. Object Detection: ● Here the implementation of CNN is different compared to the previous image classification. ● Here the task is to identify the objects present in the image, therefore traditional implementation of CNN may not help. ■ R CNN ■ Fast R CNN ■ Faster R CNN ■ YOLO 20 Types of CNN
  • 21. ● Computer vision, an AI technology that allows computers to understand and label images, is now used in convenience stores, driverless car testing, daily medical diagnostics, and in monitoring the health of crops and livestock. ● Different use cases found in the computer vision as follows ■ Retail and Retail Security ■ Automotive ■ Healthcare ■ Banking ■ Agriculture ■ Industrial 21 Applications of Computer Vision
  • 22. ● Here in this seminar we tried to understand what is Computer Vision. ● What is CNN and its different layers. ● Different types of CNN and its uses in Computer Vision techniques. ● Different Applications of Computer Vision. 22 Conclusion
  • 23. ● https://www.analyticsvidhya.com/blog/2017/06/architecture-of-convolution al-neural-networks-simplified-demystified/ ● https://towardsdatascience.com/covolutional-neural-network-cb0883dd65 29 ● https://medium.com/technologymadeeasy/the-best-explanation-of-convol utional-neural-networks-on-the-internet-fbb8b1ad5df8 ● https://adeshpande3.github.io/A-Beginner%27s-Guide-To-Understanding-C onvolutional-Neural-Networks/ ● http://www.image-net.org/ 23 References

Editor's Notes

  • #4: Start the discussion with the human eye and take them to the computer vision. Explain about computer vision definition and speak about what are the different fields it deals with. Take the topic to machine learning
  • #6: Say why CNN why not Feed forward NN(example MNIST image 28 x 28 x 1(black & white image contains only 1 channel) Total number of neurons in input layer will 28 x 28 = 784, this can be manageable. What if the size of image is 1000 x 1000, which means you need 10⁶ neurons in input layer.
  • #7: Explain the Architecture of CNN
  • #8: Explain image pixels how pixels are expressed in matrix form And what are filters how are the represented.
  • #9: Explain briefly the image
  • #10: Explain the original image and conoluted image
  • #11: What is stride and explain with image Increase in stride value loss of pixels
  • #12: Discuss the same padding concept: when the input of 6x6 is padded around with zeros we get the output with same dimensions of 6x6. And feature are extracted without loss.
  • #13: The output of the Convolution layer is passes through the activation function
  • #15: As you can see I have taken convoluted image and have applied max pooling on it. The max pooled image still retains the information that it’s a car on a street. If you look carefully, the dimensions if the image have been halved. This helps to reduce the parameters to a great extent.
  • #16: Discuss about the flattening of pixels before sending it to the output layer. Explain the flattening process. When weights are updated, they take place on both convolution layers and fully connected layers.
  • #21: Discuss Amazon Go store for retail and security Google cars for Automotive Cheque sign recognition in banks