Computer vision

Computer vision technology enables machines to interpret and respond to images and videos, facing challenges such as the need for extensive datasets and time-sensitive decision-making. Key tasks include object classification, identification, detection, and segmentation, utilizing various techniques like geometric transformations and feature detection. The document outlines the importance of deep learning algorithms in enhancing object recognition and detection capabilities.

Computer vision technology is a crucial part of AI that helps build machines with the ability to look at an image or video, understand it, and respond to it.

Challenges in Computer Vision:


 An image is stored digitally as an array of pixel values. Deep
learning techniques are required to extract insights from this data.
 A very large dataset is required to train the system to identify
objects at various angles and under different environmental
conditions.
 Time-sensitive decision-making. Example: a surveillance robot
must generate an alert when someone crosses a railway line
while a train is approaching; otherwise, the crossing should be
considered normal.
 For living objects, the ability to differentiate between the
live object, a statue of the object, and a life-size poster or photo
of the object.
 Understanding the object within its context.

What is Computer Vision?

Computer vision tasks include methods for acquiring, processing,
analyzing, and understanding digital images, and for extracting
high-dimensional data from the real world in order to produce
numerical or symbolic information, e.g. in the form of decisions.
PURPOSE OF COMPUTER VISION
Object Classification
Object Identification
Object Verification
Object Detection
Object Landmark Detection
Object Segmentation
Object Recognition

Visualization
The purpose is to observe objects that are not directly visible in an
image

Image Sharpening and Restoration

The purpose is to create a better image

Image Retrieval

The purpose is to search for the image of interest

Measurement of Pattern

The purpose is to measure various objects in an image

Image Recognition

The purpose is to distinguish the objects in an image


Changing color spaces:
Color spaces define how pixel values are represented for
different mediums. Some widely used color spaces include RGB,
HSV, CMYK, the LAB color space, the YCrCb color space, etc.

Example: By default, digital images are in the RGB (Red, Green,
Blue) color space. In RGB, the pixels carry only color-component
information and have no direct details about brightness or
saturation. This poses complexity if we wish to process the
brightness of the image. Similarly, some kinds of processing can
produce poor results in certain color spaces. Hence, we might
choose to convert the image to a different color space based on
the requirement.

CMYK - Cyan, Magenta, Yellow and blacK - the color space which
is widely used in printers.

HSV - Hue, Saturation, Value - the preferred color space in high-
quality graphics, due to the property that it has separate
components for color (hue), color purity (saturation), and
brightness (value). So it is easy to maintain the color and alter
just the brightness of an image.
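As a concrete illustration of the HSV property described above, here is a minimal sketch using Python's standard-library colorsys module. The pixel values are made up for illustration; a real image would be an array of such pixels, and libraries often use 0-255 integers rather than the normalized 0-1 floats colorsys expects.

```python
import colorsys

# A single RGB pixel, normalized to [0, 1] (illustrative values).
r, g, b = 0.8, 0.4, 0.2

# Convert to HSV: hue (color), saturation (purity), value (brightness).
h, s, v = colorsys.rgb_to_hsv(r, g, b)

# Darken the pixel by halving only the value component, leaving hue
# and saturation untouched -- the property that makes HSV convenient.
darker_rgb = colorsys.hsv_to_rgb(h, s, v * 0.5)

print((h, s, v))
print(darker_rgb)
```

Doing the same darkening directly in RGB would require scaling all three channels while hoping the perceived color stays stable; in HSV it is a single-component change.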
Geometric Transformations:
As the name indicates, geometric transformations refer to altering
the geometric aspects of an image, such as size and orientation,
without affecting the actual contents of the image. Some of the
geometric transformation techniques are listed below:

Scaling:

Images are generally scaled down to minimize computational
time and resources, while taking into consideration that the
feature details are not lost.
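A minimal sketch of scaling down with NumPy, using a tiny made-up 4x4 image; real pipelines would use a library resize with proper interpolation, but the trade-off the text mentions is already visible here:

```python
import numpy as np

# Hypothetical 4x4 single-channel image (arbitrary intensities).
img = np.arange(16, dtype=np.uint8).reshape(4, 4)

# Scale down by a factor of 2 with simple subsampling: keep every
# second pixel in each dimension. Fast, but detail between the kept
# pixels is simply discarded.
half = img[::2, ::2]

# A gentler alternative: average each 2x2 block, so neighbouring
# pixels still contribute to the result.
avg = img.reshape(2, 2, 2, 2).mean(axis=(1, 3))

print(half)
print(avg)
```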

Rotation/Reflection/Translation:

Images can be rotated by different angles so that they can be
analysed from different perspectives.
Mirroring can also be done to obtain insights about the image.

Image translation refers to moving the pixels of an image from one
position to another, with the aim of changing an object's position.
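The mirroring and translation operations described above can be sketched with plain NumPy indexing; the 3x3 image below is invented purely for illustration:

```python
import numpy as np

# Hypothetical 3x3 image used only to illustrate the operations.
img = np.array([[1, 2, 3],
                [4, 5, 6],
                [7, 8, 9]])

# Mirroring (horizontal reflection): reverse the column order.
mirrored = img[:, ::-1]

# Translation by one pixel to the right: shift the columns and fill
# the vacated column with zeros (a common, but not the only, choice
# for pixels that have no source).
translated = np.zeros_like(img)
translated[:, 1:] = img[:, :-1]

print(mirrored)
print(translated)
```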
IMAGE SEGMENTATION

In computer vision, segmentation is the process of extracting


pixels in an image that are related. Segmentation algorithms
usually take an image and produce a group of contours (the
boundary of an object that has well-defined edges in an image) or
a mask where a set of related pixels are assigned to a unique
color value to identify it.

The main purpose of image segmentation is to partition an
image into a collection of sets of pixels and to identify:

– Meaningful regions (coherent objects)

– Linear structures (line, curve, …)

– Shapes (circles, ellipses, …)

A very simple example of image segmentation is the method
based on thresholding.

Binary Segmentation

Suppose you have an image and want to segment it into two parts
(light and dark). You can use a threshold value, for example 100.
After segmenting, the image is split into two parts: the pixels
with intensity higher than 100, which can be set to 255 (the
maximum intensity value, white), and the pixels with intensity
less than or equal to 100, which are set to 0 (the minimum
intensity value, black). This is called binary segmentation. If you
use more thresholds, you get more segments.
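The thresholding just described can be sketched in a few lines of NumPy; the 3x3 image and the threshold of 100 are illustrative:

```python
import numpy as np

# Hypothetical grayscale image with intensities in 0-255.
img = np.array([[ 30, 120, 200],
                [ 90, 150,  60],
                [210,  40, 130]], dtype=np.uint8)

# Binary segmentation at threshold 100: pixels above the threshold
# become white (255), the rest black (0), as described above.
mask = np.where(img > 100, 255, 0).astype(np.uint8)

print(mask)
```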
Feature

A feature is a piece of information which is relevant for solving the


computational task related to a certain application. Features may
be specific structures in the image such as points, edges or
objects. Features may also be the result of a general
neighborhood operation or feature detection applied to the
image.

Main Components of Feature Detection and Matching

Detection:

Identify the interest points in the image.

Features that occur at specific locations in an image, such as
mountain peaks, building corners, doorways, or interestingly
shaped patches of snow, are often called keypoint features (or
even corners) and are often described by the appearance of the
patch of pixels surrounding the point location.
Features that can be matched based on their orientation and
local appearance (edge profiles) are called edges; they can
also be good indicators of object boundaries and occlusion events
in an image sequence.

Description:

The local appearance around each feature point is described in


some way that is (ideally) invariant under changes in illumination,
translation, scale, and in-plane rotation. We typically end up with
a descriptor vector for each feature point.

Matching:

Descriptors are compared across the images to identify similar
features. For two images we may get a set of pairs (Xi, Yi) ↔ (Xi′,
Yi′), where (Xi, Yi) is a feature in one image and (Xi′, Yi′) its
matching feature in the other image.
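A hedged sketch of the matching step: brute-force comparison of descriptor vectors by Euclidean distance. The tiny 2-D descriptors are made up; real descriptors (e.g. SIFT's 128-D vectors) are much longer, but the logic is the same:

```python
import numpy as np

# Hypothetical descriptor vectors for feature points in two images
# (one row per feature point).
desc_a = np.array([[1.0, 0.0],
                   [0.0, 1.0],
                   [0.5, 0.5]])
desc_b = np.array([[0.1, 0.9],
                   [0.9, 0.1],
                   [0.4, 0.6]])

# Pairwise Euclidean distances between every descriptor in A and
# every descriptor in B, then pick the nearest neighbour in B for
# each feature in A.
dists = np.linalg.norm(desc_a[:, None, :] - desc_b[None, :, :], axis=2)
matches = dists.argmin(axis=1)  # matches[i] = index in B matched to A[i]

print(matches)
```

Production matchers add a ratio test or cross-checking to reject ambiguous matches; this sketch shows only the nearest-neighbour core.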

IMAGE RECOGNITION
Recognition is one of the toughest challenges in
computer vision. For the human eye, recognizing an object's
features or attributes is very easy. Humans can recognize
multiple objects with very little effort. However, this does not
apply to a machine. It is very hard for a machine to
recognize or detect an object because objects vary in terms
of viewpoint, size, and scale.

Object Recognition

Object recognition is used to identify an object in an image or
video. It is a product of machine learning and deep learning
algorithms. Object recognition tries to acquire the innate human
ability to understand certain features or visual details of
an image.

The output of object recognition includes the identified object
category along with the probability of correctness.

Object recognition refers to identification of what is present in the


image, while object detection refers to locating where it is present
in the image.

Object recognition through deep learning can be achieved
by training models from scratch or by utilizing pre-trained deep
learning models. To train a model from scratch, the first thing you
need to do is collect a large dataset. Then you need to design
an architecture that will be used for creating the model.

Just as with deep learning, object recognition through algorithmic
approaches is also possible.

The following algorithms are commonly used approaches:

 HOG feature extraction
 Bag of words model
 Viola-Jones algorithm
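To give a flavor of the first approach, here is a heavily simplified sketch of the core HOG idea: a magnitude-weighted histogram of gradient orientations over one cell. It omits the block structure and normalization of real HOG, and the image is invented:

```python
import numpy as np

# Hypothetical single "cell" of an image (arbitrary intensities).
img = np.array([[0,   0,   0, 0],
                [0,  50, 100, 0],
                [0, 100, 200, 0],
                [0,   0,   0, 0]], dtype=float)

gy, gx = np.gradient(img)                    # per-pixel gradients
mag = np.hypot(gx, gy)                       # gradient magnitude
ang = np.degrees(np.arctan2(gy, gx)) % 180   # unsigned orientation

# 9 orientation bins over 0-180 degrees, weighted by magnitude --
# the descriptor for this cell.
hist, _ = np.histogram(ang, bins=9, range=(0, 180), weights=mag)

print(hist.round(1))
```

A full HOG descriptor concatenates such histograms over a grid of cells and normalizes them over overlapping blocks before feeding them to a classifier.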

IMAGE DETECTION
Image or Object Detection is a technique that processes the
image and detects objects in it.

When it comes to applying deep learning to image
detection, developers use Python along with open-source libraries
like OpenCV, Open Detection, Luminoth, ImageAI, and others.
These libraries simplify the learning process
and offer a ready-to-use environment.

The commonly used techniques for object detection are:

• Haar cascades algorithm

• Viola-Jones algorithm

Object detection uses an object's features to classify its class.

For example, when looking for circles in an image, the machine
will detect any round object. To recognize instances of an object
class, the algorithm uses learning techniques and features
extracted from the image.
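A toy sketch of detection by sliding a small template over the image and scoring each window; this illustrates the idea of locating an object by its appearance, not a production detector (the image, template, and scoring choice are all made up):

```python
import numpy as np

# Toy 6x6 image with a 2x2 bright "object" planted at rows 2-3,
# columns 3-4.
img = np.zeros((6, 6), dtype=float)
img[2:4, 3:5] = 1.0
template = np.ones((2, 2))   # what we are looking for

# Slide the template over every 2x2 window and keep the window with
# the lowest sum of absolute differences (lower = better match).
best, best_pos = float("inf"), None
for r in range(img.shape[0] - 1):
    for c in range(img.shape[1] - 1):
        window = img[r:r + 2, c:c + 2]
        score = np.abs(window - template).sum()
        if score < best:
            best, best_pos = score, (r, c)

print(best_pos, best)   # top-left corner and score of the best window
```

Real detectors replace the raw-pixel comparison with learned features (e.g. HOG plus a classifier, or a convolutional network) and scan at multiple scales, but the sliding-window search is the same underlying idea.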
