



Detection and Content Retrieval of Object in an Image using YOLO

¹Vinoth Kumar B, ²Abirami S, ³Bharathi Lakshmi R J, ⁴Lohitha R, ⁵Udhaya R B
¹Associate Professor, ²,³,⁴,⁵UG Scholars, Department of Information Technology, PSG College of Technology, Coimbatore, TN, India.
¹bvk.it@psgtech.ac.in, ²abhishanmugam7@gmail.com, ³bharathiravi0397@gmail.com, ⁴lohithatk@gmail.com, ⁵udhayaramesh1210@gmail.com

Abstract – It is easy for human beings to identify the object in an image; even when the task is complex, they require only minimal effort. Since computer vision aims to replicate the human visual system, the same capability can be achieved in computers when they are trained with large amounts of data, faster GPUs and advanced algorithms. In general terms, object detection can be defined as a technology that detects instances of objects in images and videos by mimicking the functionality of the human visual system. The motivation of this project is to make the search process easier for the user: if an object is completely new to the user, a picture of that object can be uploaded and the algorithm will detect the object and provide a description of it. The objective of the project is to detect the object in an image; once the object is detected, its label (the name of the detected object) is searched on Wikipedia and a few lines of description about that object are retrieved and printed. The label is also searched on Google and the URLs of the top pages with content related to the label are displayed. The detection of the object in an image is done using the YOLO (You Only Look Once) algorithm with pre-trained weights. Previous methods for object detection, such as R-CNN and its variations, use a pipeline that performs this task in multiple steps. This can be slow to execute and may involve complex optimization, because the individual components must be trained separately. YOLO does all of this quickly with a single neural network, and is therefore preferred.

Keywords – object detection, region proposals, optimization, YOLO, Google search, description, Wikipedia, text-to-speech.

I. INTRODUCTION

It is an easy task for a human being to identify the object in an image because of the fast and accurate human visual and neural system. Even when a complex task is given, human beings can perform it with minimal effort. Since computer vision aims to mimic the human visual system, the same capability can be achieved in computers by training them with large amounts of data, faster GPUs and advanced algorithms. In general terms, object detection can be defined as a technology that detects instances of objects in images and videos by mimicking the functionality of the human visual system. The special features that every object has are used to classify the objects [6]. For example, when searching for circular objects, points at a specific distance from a centre are searched for. Likewise, when searching for square-shaped objects, candidates are checked for perpendicularity at the corners and equality of side lengths. Similarly, face detection applications consider standard features such as eyes, lips and nose, along with other features like skin tone and the distance between the eyes. Depending on the circumstances, some challenges are faced during the detection of objects:
· Lighting: the lighting and weather conditions may vary over the course of the day.
· Positioning: the object can appear at different positions within the image.
· Rotation: the object can appear at different orientations in the image.


· Occlusion: some part of the object in the image may not be clearly visible.
· Scale: the size of the object may vary.
These are some of the challenges that should be taken into account while developing an object detection system.

II. ALGORITHMS FOR OBJECT DETECTION

There are several machine learning and deep learning algorithms for object detection. When machine learning approaches are used for detection, the features must be defined first. Deep learning approaches do not require the features to be specified; instead, they perform end-to-end detection. Machine learning methods typically use a Support Vector Machine (SVM), while deep learning methods use a CNN. R-CNN, fast R-CNN and faster R-CNN are some common algorithms for object detection.

A. R-CNN
R-CNN uses selective search to generate bounding boxes. Then, for each bounding box, image classification is done through a CNN. Finally, each bounding box is refined using regression [7]. The problems with R-CNN are:
· It takes a huge amount of time to train the network, as it requires classification of about 2000 region proposals per image.
· It cannot be run in real time because of this time constraint.
· Since the selective search algorithm is fixed, no learning happens at that stage.

B. Fast R-CNN

In fast R-CNN, the input image is fed into the CNN. Then, the regions of proposals are identified and warped into squares [4]. Next, the regions are reshaped using an RoI pooling layer, and a softmax layer is used to predict the class of each proposed region [3]. The problem with fast R-CNN is:
· Performance degradation during testing.

C. Faster R-CNN

Both of the above-mentioned algorithms use selective search, which is a slow and time-consuming process. Faster R-CNN does not use selective search; instead, it uses a separate region proposal network to generate region proposals [5].
All of the previously explained algorithms use a region-based approach to detect the object in the image without looking at the complete image.

D. You Only Look Once (YOLO)

YOLO is also an object detection algorithm, but it uses only one convolutional network to predict the bounding boxes and the class probabilities, and in this way YOLO differs from the region-based algorithms [1].

III. WORKING OF YOLO

YOLO trains and tests on full images and directly optimizes detection performance. The YOLO model has several benefits over traditional methods of object detection:
· First, YOLO is extremely fast. Since detection in YOLO is framed as a regression problem, there is no need for a complex pipeline; the neural network can simply be run on a new image at test time to make predictions.
· Second, YOLO sees the entire image during training and testing, unlike sliding-window algorithms, which require multiple passes to process a single image.
· Third, YOLO learns generalizable representations of objects. When trained on real-world images and then tested, YOLO outperforms top detection methods like DPM and R-CNN.
The YOLO network uses features from the entire image to predict each bounding box. It also predicts all bounding boxes across all classes for an image simultaneously. This means the network reasons globally about the full image and all the objects in it. The YOLO design enables end-to-end training and real-time speeds while maintaining high average precision [2].
The steps of how YOLO works are as follows:
· First, it divides the input image into an S × S grid, as shown in fig. 1.

Fig. 1 Divide the image into S × S grid

· If the center of an object falls into a grid cell, that grid cell is responsible for detecting that object.
· Each grid cell predicts B bounding boxes and confidence scores for those boxes, as shown in fig. 2.
· These confidence scores reflect how confident the model is that the box contains an object. If no object exists in that cell, the confidence scores should be zero.

Fig. 2 Calculate bounding boxes and confidence scores for each box.

· Each grid cell also predicts conditional class probabilities.
· These probabilities are conditioned on the grid cell containing an object. Only one set of class probabilities is predicted per grid cell, regardless of the number of boxes B.
· Finally, the conditional class probabilities, as shown in fig. 3, are multiplied by the individual box confidence predictions, which gives class-specific confidence scores for each box, as shown in fig. 4 (a small sketch of this computation is given after fig. 4).

Fig. 3 Multiply probability and confidence scores.

Fig. 4 Final Output
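
The multiplication step above can be made concrete with a small numerical sketch. The following NumPy snippet is illustrative only and is not the authors' code; the grid size S, number of boxes B and number of classes C are assumptions taken from the original YOLO formulation, and the random arrays merely stand in for real network outputs.

import numpy as np

# Illustrative sketch: combining YOLO grid predictions.
# Assumed values (S x S grid, B boxes per cell, C classes) follow the
# original YOLO paper; they are not taken from this work.
S, B, C = 7, 2, 20

# Per-cell conditional class probabilities Pr(Class_i | Object): shape (S, S, C)
class_probs = np.random.rand(S, S, C)
# Per-box confidence Pr(Object) * IOU: shape (S, S, B)
box_confidence = np.random.rand(S, S, B)

# Class-specific confidence for every box:
# Pr(Class_i | Object) * Pr(Object) * IOU = Pr(Class_i) * IOU
# Resulting shape: (S, S, B, C)
class_scores = class_probs[:, :, None, :] * box_confidence[:, :, :, None]

# For each box, the predicted class is the one with the highest score.
predicted_class = class_scores.argmax(axis=-1)   # (S, S, B)
predicted_score = class_scores.max(axis=-1)      # (S, S, B)

# Boxes whose best score falls below a threshold are discarded before
# non-maximum suppression; 0.2 is an arbitrary illustrative threshold.
keep_mask = predicted_score > 0.2
print("boxes kept:", keep_mask.sum())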

IV. WEB SCRAPING AND TEXT TO SPEECH CONVERSION


Web scraping is a technique used to retrieve content from websites. It consists of two phases: fetching the web page and then extracting the required content from it. Here, two types of web scraping are done: one extracts content from Wikipedia and the other retrieves the top Google search links for the detected label. The required Python modules are installed using pip.

A. Content Retrieval from Wikipedia

After detecting the object in the image, the labelled class is used to retrieve data from Wikipedia, a free online encyclopedia. Extracting data from Wikipedia helps the user get an idea of what the object is and what it is used for. Wikipedia is also the name of a Python library that helps to access and extract data from Wikipedia. In that module, the predefined function Summary() takes the label (object name) and a filter (the number of lines to fetch from Wikipedia) as arguments and returns a string containing the extracted data.
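
A minimal sketch of this step is shown below, assuming the third-party "wikipedia" package (installed with pip install wikipedia) is what the paper refers to; the label and sentence count are hypothetical example values.

import wikipedia

label = "Airplane"      # example object name returned by the detector
num_sentences = 3       # number of summary sentences to fetch

try:
    # wikipedia.summary() returns the first sentences of the article as a string.
    description = wikipedia.summary(label, sentences=num_sentences)
    print(description)
except wikipedia.exceptions.PageError:
    print("No Wikipedia page found for:", label)
except wikipedia.exceptions.DisambiguationError as e:
    # Ambiguous labels (e.g. "Mouse") raise a DisambiguationError listing options.
    print("Ambiguous label, options include:", e.options[:5])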

B. URL Retrieval from Google

Using the label (object name), the top URLs are extracted from Google with the help of the Python module Googlesearch. Its predefined function Search() extracts the required URLs; arguments such as the label (object name) and the number of links to be extracted can be passed to this function. With these links the user can read more about the object beyond the Wikipedia content [9].
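
A minimal sketch of this step is given below. It assumes the third-party "googlesearch" module (for example, pip install google); the exact signature of search() differs between package versions, so the arguments shown are illustrative rather than authoritative, and the label is a hypothetical example.

# Assumes the classic googlesearch module; newer packages such as
# googlesearch-python use search(query, num_results=...) instead.
from googlesearch import search

label = "Airplane"      # example object name returned by the detector

# Fetch the top result URLs for the label; stop after 5 links.
for url in search(label, stop=5, pause=2.0):
    print(url)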

C. Text to Speech Conversion

This step converts the label (object name) and the Wikipedia content to voice so that everybody can understand it better. The module used for text-to-speech conversion is pyttsx, which is platform independent and can also work offline. However, pyttsx supports only Python 2.x, so the pyttsx3 module, which works in both Python 2.x and 3.x, is used instead. In order to use pyttsx3, the init() function is called to initialize the engine, the predefined method say() is called with the text that needs to be converted to voice as its argument [8], and finally runAndWait() is called to run the speech.
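
A minimal sketch of this step using the init() / say() / runAndWait() calls described above is shown below; the label and description strings are example values only.

import pyttsx3

label = "Airplane"                                       # example object name
description = "An airplane is a fixed-wing aircraft."    # example Wikipedia text

engine = pyttsx3.init()      # initialize the speech engine
engine.say(label)            # queue the label
engine.say(description)      # queue the fetched description
engine.runAndWait()          # speak everything in the queue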

V. PERFORMANCE ANALYSIS

To analyze the performance of YOLO, it is compared with algorithms like R-CNN, fast R-CNN and faster R-CNN on various performance measures such as the time taken, accuracy and frames per second. When the analysis is based on the time taken to detect the objects, as listed in table 1, R-CNN takes around 40 to 50 seconds, fast R-CNN takes 2 seconds, faster R-CNN takes 0.2 seconds, and YOLO takes just 0.02 seconds. From this it can be inferred that YOLO is 10 times quicker than faster R-CNN, 100 times quicker than fast R-CNN and more than 1000 times quicker than R-CNN.

TABLE I
PERFORMANCE EVALUATION BASED ON TIME TAKEN

Algorithm        Time taken (in sec)
R-CNN            40-50
Fast R-CNN       2
Faster R-CNN     0.2
YOLO             0.02

When the analysis is based on the number of frames processed per second, YOLO performs far better than all the other algorithms, as shown in fig. 5: it processes 48 fps, whereas R-CNN processes 2 fps, fast R-CNN 5 fps and faster R-CNN 8 fps.

Fig. 5 Performance analysis based on frames per second (R-CNN: 2 fps, fast R-CNN: 5 fps, faster R-CNN: 8 fps, YOLO: 48 fps)

When the analysis is based on accuracy, YOLO is found to be less accurate than the other three algorithms, as shown in fig. 6. So, YOLO is not recommended for applications in which accuracy is the major concern.

Fig. 6 Performance analysis based on accuracy (approximate values from the chart: R-CNN 73%, fast R-CNN 70%, faster R-CNN 65%, YOLO 63%)

The model can be used for tracking objects, for example tracking a ball during a football match, tracking the movement of a cricket bat, tracking a person in a video, video surveillance, smart classes for students, or as an instructor for blind people to get details about unknown objects. It is also used in pedestrian detection.
· Face detection: an everyday example of object detection is that when we upload a new picture to Facebook or Instagram, our face is detected using this method.
· People counting: object detection can also be used for people counting, for example for analyzing store performance or crowd statistics during festivals, where people spend only a limited amount of time. This type of analysis is a little difficult because people move out of the frame.
· Vehicle detection: when the object is a vehicle such as a bicycle, car or bus, object detection combined with tracking can be effective in estimating the speed of the object. The type of ship entering a port can also be determined by object detection based on its shape, size, etc.; this method of detecting ships has been developed in certain European countries.


· Manufacturing industry: object detection is also used in industrial processes to identify products. For instance, if we want a machine to detect only circular products, the Hough circle transform can be used for detection.
· Online images: object detection can also be used for classifying images found online; obscene images are usually filtered out using object detection.
· Security: in the future we might be able to use object detection to identify anomalies in a scene, such as bombs or explosives (for example by making use of a quadcopter).
· Medical diagnosis: object detection and recognition are used in medical diagnosis to analyze X-ray reports and detect brain tumors.

VI. EXPERIMENTAL RESULTS

Since YOLO is used as a pre-trained model, the pre-trained YOLO v3 weights file, the CFG file and a text document containing the object class names must be present in the current program directory. The script needs 4 arguments:
· Input image – airplane.jpg
· YOLO config file – yolov3.cfg (contains details about the layers or hidden layers used in the neural network)
· YOLO pre-trained weights file – yolov3.weights (the first few layers of the neural network have already learned some general features applicable to all classes)
· Text file of object classes – yolov3.txt (contains the object class names)
The model is trained on the COCO dataset, so it is capable of detecting 80 object classes.
The script is executed with the following command:
python yolo1.py --image airplane.jpg --config yolov3.cfg --weights yolov3.weights --classes yolov3.txt
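
The paper does not show the contents of yolo1.py, so the following is only a minimal sketch of how such a script could be written using OpenCV's DNN module; the thresholds, output filename and overall structure are assumptions, not the authors' implementation.

# Hypothetical sketch of a yolo1.py-style script (assumed structure).
import argparse
import cv2
import numpy as np

parser = argparse.ArgumentParser()
parser.add_argument("--image", required=True)
parser.add_argument("--config", required=True)    # yolov3.cfg
parser.add_argument("--weights", required=True)   # yolov3.weights
parser.add_argument("--classes", required=True)   # yolov3.txt
args = parser.parse_args()

with open(args.classes) as f:
    classes = [line.strip() for line in f]

image = cv2.imread(args.image)
h, w = image.shape[:2]

# Load the network and prepare the input blob.
net = cv2.dnn.readNet(args.weights, args.config)
blob = cv2.dnn.blobFromImage(image, 1 / 255.0, (416, 416), swapRB=True, crop=False)
net.setInput(blob)

# Run a forward pass through the YOLO output layers.
layer_names = net.getLayerNames()
out_layers = [layer_names[i - 1] for i in net.getUnconnectedOutLayers().flatten()]
outputs = net.forward(out_layers)

boxes, confidences, class_ids = [], [], []
for output in outputs:
    for detection in output:
        scores = detection[5:]
        class_id = int(np.argmax(scores))
        confidence = float(scores[class_id])
        if confidence > 0.5:          # illustrative confidence threshold
            cx, cy, bw, bh = detection[0:4] * np.array([w, h, w, h])
            boxes.append([int(cx - bw / 2), int(cy - bh / 2), int(bw), int(bh)])
            confidences.append(confidence)
            class_ids.append(class_id)

# Non-maximum suppression removes overlapping boxes for the same object.
idxs = cv2.dnn.NMSBoxes(boxes, confidences, 0.5, 0.4)
for i in np.array(idxs).flatten():
    x, y, bw, bh = boxes[i]
    label = classes[class_ids[i]]
    cv2.rectangle(image, (x, y), (x + bw, y + bh), (0, 255, 0), 2)
    cv2.putText(image, label, (x, y - 5), cv2.FONT_HERSHEY_SIMPLEX, 0.6, (0, 255, 0), 2)
    print("Detected:", label)

cv2.imwrite("object-detection.jpg", image)   # assumed output filename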

Fig. 7 Final output

The label (object class) is printed and converted to voice, then the content retrieved from Wikipedia is displayed, as shown in fig. 7, and similarly converted to voice. Next, the top Google links are shown, and finally the image is displayed with the detected class label and a bounding box around the object. After closing this window, the image with the labelled object is saved in the program directory.

Although the results are encouraging, the model has a few limitations:
· YOLO imposes strong spatial constraints and hence it cannot detect small objects that appear in groups.
· A small error in a small grid cell can have a large impact on the result.
· The model struggles when an object appears in a new aspect ratio or configuration.

VII. CONCLUSION


An efficient model is developed in this research work that generates audio descriptions of the objects present in a given image. Identification of objects is achieved through YOLO, the descriptions of the detected objects are generated using the Wikipedia package, and the related URLs are retrieved using the Google search packages available as Python libraries. The fetched descriptions are read out using the pyttsx3 package. A future avenue is to extend this work into an image-based search engine.

REFERENCES
[1] Redmon, Joseph, Santosh Divvala, Ross Girshick, and Ali Farhadi. "You only look once: Unified, real-time object detection."
In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779-788. 2016.
[2] Impiombato, D., S. Giarrusso, T. Mineo, O. Catalano, C. Gargano, G. La Rosa, F. Russo et al. "You Only Look Once: Unified Real-
Time Object Detection." Nucl. Instruments Methods Phys. Res. Sect. A Accel. Spectrometers, Detect. Assoc. Equip. 794 (2015): 185-192.
[3] Wang, Xiaolong, Abhinav Shrivastava, and Abhinav Gupta. "A-fast-rcnn: Hard positive generation via adversary for object detection."
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2606-2615. 2017.
[4] Girshick, Ross. "Fast r-cnn." In Proceedings of the IEEE international conference on computer vision, pp. 1440-1448. 2015.
[5] Ren, Shaoqing, Kaiming He, Ross Girshick, and Jian Sun. "Faster r-cnn: Towards real-time object detection with region proposal
networks." In Advances in neural information processing systems, pp. 91-99. 2015.
[6] Vondrick, Carl, et al. "Hoggles: Visualizing object detection features." Proceedings of the IEEE International Conference on Computer
Vision. 2013.
[7] Beena, M. V., MN Agnisarman Namboodiri, and P. G. Dean. "Automatic sign language finger spelling using convolution neural network:
analysis." International Journal of Pure and Applied Mathematics 117.20 (2017): 9-15.
[8] Manaswi, Navin Kumar. "Speech to Text and Vice Versa." Deep Learning with Applications Using Python. Apress, Berkeley, CA, 2018.
127-144.
[9] Bharanipriya, V., and V. Kamakshi Prasad. "Web content mining tools: a comparative study." International Journal of Information
Technology and Knowledge Management 4.1 (2011): 211-215.
