Computer Vision: Field of AI That Enables Computers To Derive Meaningful Information From

Computer vision is a field of AI that enables machines to interpret visual data, contrasting with human vision which relies on extensive context. Applications include autonomous vehicles, translation of road signs, and facial recognition. Convolutional Neural Networks (CNNs) are key to computer vision tasks, utilizing layers for feature detection and reducing computation costs while managing image data effectively.

Uploaded by

u1904031

Computer Vision

Field of AI that enables computers to derive meaningful information from images, videos, or other visual inputs.
Human Vision vs. Computer Vision

• Human vision:
– Has a lifetime of context for learning how to tell objects apart, how far away they are, whether they are moving, and whether something is wrong in an image.

• Computer vision:
– Trains machines to perform these functions. Can analyze thousands of products in less than a minute.
Computer Vision Applications

• Tesla: Autonomous cars for hands-free driving

• Google Translate: Converts road signs from one language to another
• Photo scan: Optical Character Recognition (OCR) or QR Code Reader
• Facebook: Face recognition for automatic tagging
• Boston Dynamics: Designing intelligent robots
Basic Components of an Image Processing Task
What is an Image?
Color Channel

[Figure: three example images with 1 channel, 1 channel, and 3 channels]

How does computer vision work?

• How to solve computer vision tasks?

– Computer vision needs lots of data.
– It runs analyses of the data over and over until it discerns distinctions and ultimately recognizes images.

• Machine learning based models can enable a computer to teach itself about the context of visual data.
– One popular ML algorithm used for computer vision tasks is the convolutional neural network (CNN).
Convolutional Neural Network

• Convolutional Neural Networks (CNN) are distinguished from other neural networks
by their superior performance with image or visual input signals.
• A CNN consists of three main types of layers:
1) Convolutional layer
2) Pooling layer
3) Fully-connected layer
Why CNN
• Fully connected ANNs have a high computation cost on images

• CNNs reduce overfitting
• CNNs successfully capture the spatial and temporal dependencies in an image
• The number of trainable parameters depends on the filter size rather than the image size
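The last point can be checked with a little arithmetic. The sketch below compares the parameter count of one fully connected layer with that of one convolutional layer. The image sizes, the 100 hidden units, and the ten 3x3 filters are illustrative assumptions, not values from the slides.

```python
def dense_params(height, width, channels, units):
    """Weights plus biases of a fully connected layer on a flattened image."""
    inputs = height * width * channels
    return inputs * units + units


def conv_params(filter_size, in_channels, num_filters):
    """Weights plus biases of a convolutional layer; image size does not appear."""
    return (filter_size * filter_size * in_channels + 1) * num_filters


# Assumed RGB images of two different sizes.
for side in (32, 256):
    print(side, dense_params(side, side, 3, 100), conv_params(3, 3, 10))
```

The fully connected count grows with the number of pixels, while the convolutional count stays the same for both image sizes.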
Convolutional Layer
• Convolutional layers are the core building block of a CNN.
• They focus on detecting features such as edges in an image.
• A convolutional layer requires a few components: input data, a filter, and a feature map.
Convolution
• Convolution: Expresses how one shape is modified by another
• Below, a matrix is convolved with a filter to obtain an output matrix
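As a minimal sketch of this operation, the function below slides a filter over a matrix and sums the element-wise products at each position. The 4x4 input and 3x3 filter are hypothetical values chosen for illustration. As in most deep learning libraries, the filter is not flipped, so strictly speaking this is a cross-correlation.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image (no padding, stride 1) and sum the products."""
    n_rows, n_cols = len(image), len(image[0])
    f_rows, f_cols = len(kernel), len(kernel[0])
    output = []
    for i in range(n_rows - f_rows + 1):
        row = []
        for j in range(n_cols - f_cols + 1):
            total = 0
            for a in range(f_rows):
                for b in range(f_cols):
                    total += image[i + a][j + b] * kernel[a][b]
            row.append(total)
        output.append(row)
    return output


# Hypothetical 4x4 input and 3x3 filter, chosen only for illustration.
image = [
    [1, 2, 3, 0],
    [0, 1, 2, 3],
    [3, 0, 1, 2],
    [2, 3, 0, 1],
]
kernel = [
    [1, 0, -1],
    [1, 0, -1],
    [1, 0, -1],
]
feature_map = conv2d(image, kernel)
print(feature_map)  # [[-2, -2], [2, -2]]
```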

Input Image
Convolutional Layer (Intuition)

How can we detect these edges from an image?

Convolutional Layer (Intuition)
• To illustrate this, we use a simplified picture
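One way to build the intuition is to convolve a simplified picture with a hand-crafted vertical edge filter. The 6x6 picture (bright left half, dark right half) and the filter values below are assumptions for illustration; the picture on the original slide is not recoverable here.

```python
def conv2d(image, kernel):
    """Valid cross-correlation with stride 1 on nested lists (square kernel)."""
    f = len(kernel)
    out_rows = len(image) - f + 1
    out_cols = len(image[0]) - f + 1
    return [
        [
            sum(image[i + a][j + b] * kernel[a][b] for a in range(f) for b in range(f))
            for j in range(out_cols)
        ]
        for i in range(out_rows)
    ]


# Assumed simplified picture: bright left half (10), dark right half (0).
picture = [[10, 10, 10, 0, 0, 0] for _ in range(6)]

# A common hand-crafted vertical edge filter.
vertical_edge = [[1, 0, -1] for _ in range(3)]

edges = conv2d(picture, vertical_edge)
for row in edges:
    print(row)  # each row is [0, 30, 30, 0]
```

The large values in the middle columns mark the position of the bright-to-dark transition; flat regions produce zeros.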
Convolutional Layer
• We have seen that convolving an input with a filter produces an output that is smaller than the input.

• In general:
– Input: $n \times n$
– Filter size: $f \times f$
– Output: $(n - f + 1) \times (n - f + 1)$

• Disadvantages:
– Every time we apply a convolution operation, the size of the image shrinks
– Pixels in the corners of the image are used only a few times during convolution compared to the central pixels
– Hence, the corners contribute less to the output, which can lead to information loss
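The second disadvantage can be made concrete by counting how many filter positions cover each pixel. This is a small sketch; the 6x6 image and 3x3 filter sizes are assumed for illustration.

```python
def usage_counts(n, f):
    """Count how many filter positions cover each pixel of an n x n image."""
    counts = [[0] * n for _ in range(n)]
    for i in range(n - f + 1):        # top row of the filter window
        for j in range(n - f + 1):    # left column of the filter window
            for a in range(f):
                for b in range(f):
                    counts[i + a][j + b] += 1
    return counts


counts = usage_counts(6, 3)  # assumed sizes: 6x6 image, 3x3 filter
for row in counts:
    print(row)
```

Here a corner pixel is covered by a single window, while a pixel near the center is covered by nine.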
Convolutional Layer (Padding)
• We can pad the image with an additional border (add pixels around the edge of the image)
• In general:
– Input: $n \times n$
– Padding: $p$
– Filter size: $f \times f$
– Output: $(n + 2p - f + 1) \times (n + 2p - f + 1)$
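A minimal sketch of padding follows, assuming the border is filled with zeros (zero padding is a common choice, but the slides do not say which value is used). The helper names are hypothetical.

```python
def zero_pad(image, p):
    """Surround a 2D list with a border of p zeros on every side."""
    width = len(image[0]) + 2 * p
    padded = [[0] * width for _ in range(p)]
    for row in image:
        padded.append([0] * p + list(row) + [0] * p)
    padded.extend([0] * width for _ in range(p))
    return padded


def output_size(n, f, p=0):
    """Side length after convolving an n x n input with an f x f filter."""
    return n + 2 * p - f + 1


padded = zero_pad([[1, 2], [3, 4]], 1)
for row in padded:
    print(row)

# Assumed sizes: a 6x6 input and a 3x3 filter, without and with padding of 1.
print(output_size(6, 3), output_size(6, 3, p=1))  # 4 6
```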
Convolutional Layer (Padding)
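• Hence, we have two choices for padding

• Valid Padding:
– It means no padding.
– If we are using valid padding, the output for an $n \times n$ input and an $f \times f$ filter will be $(n - f + 1) \times (n - f + 1)$

• Same Padding:
– Here, we apply padding so that the output size is the same as the input size
– Need to set $p = (f - 1)/2$, so that $n + 2p - f + 1 = n$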
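The two choices can be compared numerically. The sketch below assumes stride 1 and an odd filter size, so that the same amount of padding can be added on each side; the input size of 32 is an arbitrary example.

```python
def same_padding(f):
    """Padding that keeps the output size equal to the input size (stride 1)."""
    if f % 2 == 0:
        raise ValueError("symmetric 'same' padding needs an odd filter size")
    return (f - 1) // 2


def output_size(n, f, p):
    """Side length after convolving an n x n input with an f x f filter."""
    return n + 2 * p - f + 1


n = 32  # assumed input size
for f in (1, 3, 5, 7):
    valid = output_size(n, f, 0)
    same = output_size(n, f, same_padding(f))
    print(f, valid, same)
```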
Convolutional Layer (Stride)
• Stride is how far the filter moves in every step along one direction.
• If we select a stride of 2, the filter moves two pixels at a time in both the horizontal and vertical directions.
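With a stride $s$, the output side length becomes $\lfloor (n + 2p - f)/s \rfloor + 1$. This standard formula is not stated on the slides, and the 7x7 input with a 3x3 filter below is an assumed example.

```python
def output_size(n, f, p=0, s=1):
    """Output side length: floor((n + 2p - f) / s) + 1."""
    return (n + 2 * p - f) // s + 1


def window_starts(n, f, s):
    """Positions along one axis where the filter window starts (no padding)."""
    return list(range(0, n - f + 1, s))


print(output_size(7, 3, s=2))   # 3
print(window_starts(7, 3, 2))   # [0, 2, 4]
```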
Convolution Over Multiple Channels
• A color image contains three channels
• We can apply a three-dimensional filter to the image
• The last dimension of the filter should be the same as the number of input channels
• We can use multiple filters to capture multiple features
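The points above can be sketched as follows. The image is stored as `image[row][col][channel]`, each filter has shape f x f x C, and every filter produces one feature map. The toy image and the two filters are assumptions for illustration.

```python
def conv_multichannel(image, filters):
    """Convolve an H x W x C image with K filters of shape f x f x C.

    Returns an (H-f+1) x (W-f+1) x K output: one feature map per filter.
    """
    height, width, channels = len(image), len(image[0]), len(image[0][0])
    f = len(filters[0])
    for kernel in filters:
        # The filter's last dimension must equal the number of input channels.
        assert len(kernel[0][0]) == channels
    output = []
    for i in range(height - f + 1):
        out_row = []
        for j in range(width - f + 1):
            pixel = []
            for kernel in filters:
                total = sum(
                    image[i + a][j + b][c] * kernel[a][b][c]
                    for a in range(f) for b in range(f) for c in range(channels)
                )
                pixel.append(total)
            out_row.append(pixel)
        output.append(out_row)
    return output


# Assumed toy data: a 4x4 RGB image of ones and two 3x3x3 filters.
image = [[[1, 1, 1] for _ in range(4)] for _ in range(4)]
all_channels = [[[1, 1, 1] for _ in range(3)] for _ in range(3)]
red_only = [[[1, 0, 0] for _ in range(3)] for _ in range(3)]
result = conv_multichannel(image, [all_channels, red_only])
print(len(result), len(result[0]), len(result[0][0]))  # 2 2 2
print(result[0][0])  # [27, 9]
```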
Pooling Layer
• Pooling layers are generally used to reduce the size of the inputs and hence speed up
the computation.
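As an illustration, here is a sketch of max pooling, one common pooling variant (the slides do not name a specific variant, so this choice is an assumption). A 2x2 window with stride 2 halves each side of the feature map.

```python
def max_pool(feature_map, size=2, stride=2):
    """Keep the largest value in each size x size window."""
    rows = (len(feature_map) - size) // stride + 1
    cols = (len(feature_map[0]) - size) // stride + 1
    return [
        [
            max(
                feature_map[i * stride + a][j * stride + b]
                for a in range(size) for b in range(size)
            )
            for j in range(cols)
        ]
        for i in range(rows)
    ]


# Hypothetical 4x4 feature map.
feature_map = [
    [1, 3, 2, 1],
    [2, 9, 1, 1],
    [1, 3, 2, 3],
    [5, 6, 1, 2],
]
pooled = max_pool(feature_map)
print(pooled)  # [[9, 2], [6, 3]]
```

Pooling has no trainable parameters; it only reduces the amount of data passed to later layers.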
CNN Example
• There is a combination of convolution and pooling layers at the beginning
• A few fully connected layers follow at the end
• Finally, a softmax classifier assigns the input to one of several categories
• There are a lot of hyperparameters in this network which we have to specify as well
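The softmax step at the end of such a network can be sketched in a few lines. The three class scores below are hypothetical; in a real network they would come from the last fully connected layer.

```python
import math


def softmax(scores):
    """Turn raw class scores into probabilities that sum to 1."""
    shift = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


probs = softmax([2.0, 1.0, 0.1])  # hypothetical scores for three classes
predicted = probs.index(max(probs))
print(probs, predicted)
```

The predicted category is the index with the highest probability, here class 0.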
CNN Example
Classical Networks (LeNet-5)

• Parameters: 60k
• Layers flow: Conv -> Pool -> Conv -> Pool -> FC -> FC -> Output
• Activation functions: Sigmoid/tanh and ReLU
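The parameter count can be roughly reproduced by walking through the layer flow above. The layer sizes used here (32x32 grayscale input, 5x5 filters with 6 and 16 channels, dense layers of 120, 84, and 10 units) are the commonly cited LeNet-5 configuration and are not given on the slide. The sketch also assumes every filter sees all input channels, which the original network did not do in its second convolution.

```python
def conv_layer(side, channels, f, filters):
    """Return (new side, new channels, parameter count) for a valid convolution."""
    params = (f * f * channels + 1) * filters
    return side - f + 1, filters, params


def pool_layer(side, channels, size=2):
    """Pooling has no trainable parameters; it only shrinks the side length."""
    return side // size, channels, 0


def dense_layer(inputs, units):
    """Return (output units, parameter count) for a fully connected layer."""
    return units, inputs * units + units


side, channels, total = 32, 1, 0  # assumed 32x32 grayscale input

side, channels, params = conv_layer(side, channels, 5, 6)
total += params
side, channels, params = pool_layer(side, channels)
total += params
side, channels, params = conv_layer(side, channels, 5, 16)
total += params
side, channels, params = pool_layer(side, channels)
total += params

units = side * side * channels  # flatten: 5 * 5 * 16 = 400
for width in (120, 84, 10):
    units, params = dense_layer(units, width)
    total += params

print(total)  # 61706, in line with the roughly 60k stated above
```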
Classical Networks (AlexNet)
More Classical Networks
• VGG-16

• ResNet

• Inception

• And many more…

Some Notes
• Building your own model from scratch can be a tedious and cumbersome
process.
• In many cases, we also face issues like lack of data availability.

• Steps to follow:
– Using Open-Source implementation
– Transfer Learning: we can take a pre-trained network and transfer that to a new
task which we are working on.
– Data Augmentation: Deep learning models perform well when we have a large amount of data, so we can create additional training examples from the images we already have.
– E.g., mirroring, random cropping, rotating
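The three augmentations named above can be sketched on a tiny image stored as a nested list. This is only an illustration with hypothetical helper names; in practice an image library would normally be used, and rotation is restricted here to 90 degrees for simplicity.

```python
import random


def mirror(image):
    """Flip the image left to right."""
    return [row[::-1] for row in image]


def rotate90(image):
    """Rotate the image 90 degrees clockwise."""
    return [list(col) for col in zip(*image[::-1])]


def random_crop(image, size, rng=random):
    """Cut a size x size patch at a random position."""
    top = rng.randint(0, len(image) - size)
    left = rng.randint(0, len(image[0]) - size)
    return [row[left:left + size] for row in image[top:top + size]]


image = [
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9],
]
print(mirror(image))    # [[3, 2, 1], [6, 5, 4], [9, 8, 7]]
print(rotate90(image))  # [[7, 4, 1], [8, 5, 2], [9, 6, 3]]
print(random_crop(image, 2, random.Random(0)))
```

Each transformed copy keeps the original label, so one labeled image yields several training examples.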
Resources
• https://www.youtube.com/playlist?list=PLGP2q2bIgaNzhSv4yMX6yPxwQ0mk4CPwS
