Text Detection

The document summarizes a proposed text detection algorithm. It begins with an abstract describing the challenges of text detection under complex backgrounds and variations in font, style, language and orientation. It then discusses existing methods and their limitations in being slow, detecting non-text regions, and requiring hand-tuned parameters. The proposed method aims to address these issues. It uses edge-enhanced Maximally Stable Extremal Regions (MSER) as letter candidates and improves accuracy by filtering regions based on aspect ratio. The objectives are to evaluate performance on standard datasets and the problem is robust text detection from camera images.

Uploaded by

Krishna Shetty


PRESENTATION BY,

ANANTH SHETTY K R
USN: 4JC13LIE02
Under the Guidance of,
Mr. Shreekanth. T
Assistant Professor

ABSTRACT
Text in an image provides context-dependent clues about the objects that appear in it.

Detecting text against a complex background is challenging because of variations in font, style, language, and orientation.

Unlike existing local binarization methods, the proposed method can handle documents with widely varying text sizes.

Existing methods for scene text detection tend to be slow because the image has to be processed at multiple scales.

Algorithms such as MSER detect a large number of non-characters, and rule-based methods generally require hand-tuned parameters, which is time-consuming and error-prone.

Clustering-based methods show good performance but are complicated by a second processing stage after minimum spanning tree clustering.

This motivated us to propose a robust and accurate scene text detection method that aims to avoid the above complications.

INTRODUCTION
Digital cameras are compact, portable, and easy to use, and they offer a high-speed, non-contact mechanism for image acquisition.

Their ability to capture non-paper document images such as scene text has several potential applications, including licence plate recognition, road sign recognition, digital note taking, document archiving, and wearable computing.

Camera images suffer from uneven lighting, low resolution, blur, and perspective distortion.

Overcoming these challenges will make it easier to acquire and manage the information in documents.

In document processing systems, a binarization step precedes the analysis and recognition procedures.

Using two-level information greatly reduces the computational load and the complexity of the analysis algorithms. Robust binarization is critical, since any error introduced at this stage affects the subsequent processing steps.

Text is the most important information in a document.

This project focuses on a novel method to binarize camera-captured color document images, whereby the foreground text is output as black and the background as white, irrespective of the original polarity of the foreground and background shades.

A block diagram of the proposed text detection algorithm is shown in Figure 1.

Figure 1: Block diagram of the proposed text detection algorithm.

LITERATURE REVIEW
One binarization method is global thresholding, which uses a single threshold to classify image pixels into foreground or background classes. Global thresholding techniques are generally based on histogram analysis [4, 6]. They work well for images with well-separated foreground and background intensities.
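
As a rough illustration of histogram-based global thresholding, the sketch below uses Otsu's method, which picks the gray level that maximizes the between-class variance of the histogram. Otsu's method is chosen here only as a well-known example; the slides do not say which global technique was considered. The function names `otsu_threshold` and `binarize` are hypothetical, and the image is assumed to be a flat list of 8-bit gray values.

```python
def otsu_threshold(pixels, levels=256):
    """Return the gray level t that maximizes between-class variance.

    Pixels with value <= t form the dark class, the rest the bright class.
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_dark = 0      # number of pixels in the dark class so far
    sum_dark = 0    # sum of gray values in the dark class so far
    for t in range(levels):
        w_dark += hist[t]
        sum_dark += t * hist[t]
        if w_dark == 0:
            continue            # dark class still empty
        w_bright = total - w_dark
        if w_bright == 0:
            break               # bright class empty from here on
        mean_dark = sum_dark / w_dark
        mean_bright = (sum_all - sum_dark) / w_bright
        between_var = w_dark * w_bright * (mean_dark - mean_bright) ** 2
        if between_var > best_var:
            best_var, best_t = between_var, t
    return best_t


def binarize(pixels, threshold):
    """Map dark pixels to 0 (black) and bright pixels to 255 (white)."""
    return [0 if p <= threshold else 255 for p in pixels]


# Toy example: dark "text" pixels on a bright background.
gray = [10] * 50 + [200] * 50
t = otsu_threshold(gray)
binary = binarize(gray, t)
```

A single threshold like this separates the two classes only when their intensities barely overlap, which is the limitation that local methods try to address.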

Local methods, on the other hand, vary the threshold across the image according to local information. These approaches are generally window-based: the local threshold for a pixel is computed from the gray values of the pixels within a window centered on that pixel.
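
A minimal sketch of a window-based local threshold is given below, assuming the simplest rule: the threshold for a pixel is the mean of its window minus a small offset. The mean rule, the function name `local_mean_binarize`, and the `radius` and `offset` parameters are illustrative assumptions; the slides do not specify which local rule the reviewed methods use. The image is assumed to be a list of rows of 8-bit gray values.

```python
def local_mean_binarize(image, radius=1, offset=5):
    """Binarize with a per-pixel threshold = window mean - offset.

    The window is (2 * radius + 1) pixels wide and is clipped at the
    image borders. The offset keeps flat background regions white.
    """
    height, width = len(image), len(image[0])
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            y0, y1 = max(0, y - radius), min(height, y + radius + 1)
            x0, x1 = max(0, x - radius), min(width, x + radius + 1)
            window = [image[j][i] for j in range(y0, y1) for i in range(x0, x1)]
            local_t = sum(window) / len(window) - offset
            out[y][x] = 255 if image[y][x] > local_t else 0
    return out


# Unevenly lit toy image: a dim left half and a bright right half,
# each containing one darker "text" pixel (values 20 and 120).
image = [
    [50, 50, 50, 200, 200, 200],
    [50, 20, 50, 200, 120, 200],
    [50, 50, 50, 200, 200, 200],
]
binary = local_mean_binarize(image)
```

In this toy image no single global threshold marks both the 20 and the 120 pixel as text without also swallowing the dim background, whereas the local rule marks both as black. Pixels next to the lighting boundary can still be misclassified, which is one reason practical local methods are more elaborate than this sketch.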

MOTIVATION
The methods in the literature described above have some flaws, which this project aims to overcome by employing edge-enhanced Maximally Stable Extremal Regions (MSER) as basic letter candidates.

A disadvantage of MSER is that it detects many false positives, that is, regions that do not contain characters.

To address this problem, the proposed algorithm improves the accuracy of finding character regions.

The main idea is to eliminate regions with a very small or very large aspect ratio.
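
The aspect-ratio filter can be sketched as follows. This is an illustration under stated assumptions, not the project's implementation: candidate regions are represented by bounding boxes `(x, y, width, height)`, the aspect ratio is taken as width divided by height, and the limits `min_ratio` and `max_ratio` are placeholder values, since the slides give no thresholds.

```python
def filter_by_aspect_ratio(boxes, min_ratio=0.2, max_ratio=5.0):
    """Keep boxes whose width/height ratio lies within [min_ratio, max_ratio].

    boxes: iterable of (x, y, width, height) tuples.
    The ratio limits are hypothetical and would need tuning on real data.
    """
    kept = []
    for x, y, w, h in boxes:
        if w <= 0 or h <= 0:
            continue  # skip degenerate boxes
        ratio = w / h
        if min_ratio <= ratio <= max_ratio:
            kept.append((x, y, w, h))
    return kept


candidates = [
    (0, 0, 10, 12),   # letter-like shape
    (5, 5, 100, 2),   # long thin line, ratio 50
    (9, 9, 1, 50),    # tall thin sliver, ratio 0.02
]
letters = filter_by_aspect_ratio(candidates)
```

With these placeholder limits only the first box survives; the long horizontal line and the thin vertical sliver are discarded as unlikely characters.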

OBJECTIVES
To validate the performance of the proposed system, we use the metrics defined in [15] and run our algorithm on the ICDAR competition dataset.

Text detection performance is evaluated by calculating the precision and recall rates and comparing them with those of other methods.
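
For reference, precision and recall can be computed from simple counts as in the sketch below. It assumes the matching between detected and ground-truth boxes has already been done; the matching rule defined in [15] is not reproduced here, and the function name `precision_recall` is hypothetical.

```python
def precision_recall(num_correct, num_detected, num_ground_truth):
    """Return (precision, recall) from detection counts.

    precision = correct detections / all detections
    recall    = correct detections / all ground-truth items
    A zero denominator yields 0.0 for that measure.
    """
    precision = num_correct / num_detected if num_detected else 0.0
    recall = num_correct / num_ground_truth if num_ground_truth else 0.0
    return precision, recall


# Example counts: 10 boxes reported, 8 of them correct, 16 true text boxes.
p, r = precision_recall(num_correct=8, num_detected=10, num_ground_truth=16)
```

A detector that reports many non-text regions lowers precision, while one that misses text lowers recall, so both values are needed to compare methods fairly.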

The performance of our text detection algorithm is evaluated by checking whether the bounding boxes around the title text are correctly detected.

We use a stringent criterion and declare a title correctly detected only when all letters within the title are detected.

PROBLEM STATEMENT
The text detection stage seeks to detect the presence of text in a given image. Connected-component (CC) based methods are used to identify all regions in the image.
A geometrical analysis is needed to merge the text components using their spatial arrangement, so as to filter out non-text components and mark the boundaries of the text regions.
Geometric as well as stroke width information is then applied to filter and pair the CCs.
Finally, letters are clustered into lines, and additional checks are performed to eliminate false positives.
The result may therefore be considered a robust approach to text detection that uses edge-enhanced MSER regions as basic letter candidates.
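
The connected-component step described above can be sketched with a plain breadth-first labelling that returns one bounding box per component. This is a generic illustration, not the project's MATLAB code: it assumes a binary image stored as a list of rows in which 1 marks a candidate pixel, uses 4-connectivity, and the function name `connected_components` is hypothetical. Stroke width analysis and line clustering are not covered.

```python
from collections import deque


def connected_components(binary):
    """Return bounding boxes (x, y, width, height) of 4-connected regions of 1s."""
    height, width = len(binary), len(binary[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if binary[y][x] != 1 or seen[y][x]:
                continue
            # Start a new component and grow it breadth-first.
            queue = deque([(y, x)])
            seen[y][x] = True
            min_x = max_x = x
            min_y = max_y = y
            while queue:
                cy, cx = queue.popleft()
                min_x, max_x = min(min_x, cx), max(max_x, cx)
                min_y, max_y = min(min_y, cy), max(max_y, cy)
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and binary[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((min_x, min_y, max_x - min_x + 1, max_y - min_y + 1))
    return boxes


mask = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 0, 1],
]
regions = connected_components(mask)
```

The resulting boxes are the kind of input on which geometric checks, such as size or aspect-ratio limits, would then operate.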

METHODOLOGY

SOFTWARE REQUIREMENTS
The framework is implemented in MATLAB on a 64-bit system with a 1.8 GHz multi-core processor, and different types of images are considered for the experiment.

The implementation also considers images with a single text region, multiple text regions, text in different font sizes, text on complex and simple backgrounds, text in different languages, and images taken with a camera or mobile phone.

REFERENCES
[1] S. S. Tsai, D. Chen, V. Chandrasekhar, G. Takacs, N. M. Cheung, R. Vedantham, R. Grzeszczuk, and B. Girod, "Mobile product recognition," in Proc. ACM Multimedia 2010, 2010.

[2] D. Chen, S. S. Tsai, C. H. Hsu, K. Kim, J. P. Singh, and B. Girod, "Building book inventories using smartphones," in Proc. ACM Multimedia, 2010.

[3] G. Takacs, Y. Xiong, R. Grzeszczuk, V. Chandrasekhar, W. Chen, L. Pulli, N. Gelfand, T. Bismpigiannis, and B. Girod, "Outdoors augmented reality on mobile phone using loxel-based visual feature organization," in Proc. ACM Multimedia Information Retrieval, 2008, pp. 427-434.

[4] D. G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, vol. 60, pp. 91-110, 2004.

[5] H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool, "Speeded-up robust features (SURF)," Computer Vision and Image Understanding, vol. 110, no. 3, pp. 346-359, 2008.

[6] V. Chandrasekhar, G. Takacs, D. Chen, S. Tsai, R. Grzeszczuk, and B. Girod, "CHoG: Compressed histogram of gradients: a low bit-rate feature descriptor," in CVPR, 2009, pp. 2504-2511.

[7] D. Nister and H. Stewenius, "Scalable recognition with a vocabulary tree," in CVPR, 2006, pp. 2161-2168.

[8] D. M. Chen, S. S. Tsai, V. Chandrasekhar, G. Takacs, R. Vedantham, R. Grzeszczuk, and B. Girod, "Inverted index compression for scalable image matching," in Proc. of IEEE Data Compression Conference (DCC), Snowbird, Utah, March 2010.

[9] J. Liang, D. Doermann, and H. P. Li, "Camera-based analysis of text and documents: a survey," IJDAR, vol. 7, no. 2-3, pp. 84-104, 2005.

[10] K. Jung, K. I. Kim, and A. K. Jain, "Text information extraction in images and video: a survey," Pattern Recognition, vol. 37, no. 5, pp. 977-997, 2004.

[11] Y. Zhong, H. Zhang, and A. K. Jain, "Automatic caption localization in compressed video," IEEE Trans. Pattern Anal. Mach. Intell., vol. 22, no. 4, pp. 385-392, 2000.

[12] Q. Ye, Q. Huang, W. Gao, and D. Zhao, "Fast and robust text detection in images and video frames," Image Vision Comput., vol. 23, pp. 565-576, 2005.

[13] X. Chen and A. L. Yuille, "Detecting and reading text in natural scenes," in CVPR, 2004, vol. 2, pp. II-366 to II-373.

[14] X. Chen and A. L. Yuille, "A time-efficient cascade for real-time object detection: With applications for the visually impaired," in CVPR - Workshops, 2005, p. 28.

[15] S. M. Lucas, "ICDAR 2005 text locating competition results," in ICDAR, 2005, vol. 1, pp. 80-84.

[16] B. Epshtein, E. Ofek, and Y. Wexler, "Detecting text in natural scenes with stroke width transform," in CVPR, 2010, pp. 2963-2970.

[17] P. Shivakumara, T. Q. Phan, and C. L. Tan, "A Laplacian approach to multi-oriented text detection in video," IEEE Trans. Pattern Anal. Mach. Intell., vol. 33, no. 2, pp. 412-419, Feb. 2011.
