Case Study of Lexical Analyzer PDF

Lexical analysis is the first phase of compiling source code. It breaks the source code into tokens by removing whitespace and comments. The lexical analyzer checks for valid tokens and generates errors if invalid tokens are found. It identifies tokens and expands macros while reading input characters from the source code. Common lexical errors include misspelled identifiers or keywords. Error recovery techniques include removing or ignoring characters. Lexical analysis improves compiler efficiency by eliminating unwanted tokens and allowing specialization through a separate lexical analyzer.

Uploaded by

AMIT DHANDE

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

1K views3 pages

Case Study of Lexical Analyzer PDF

Uploaded by

AMIT DHANDE

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Case Study: Lexical Analyzer

Lexical analysis is the first phase of a compiler. It takes the modified source code from language
preprocessors that are written in the form of sentences. The lexical analyzer breaks these
syntaxes into a series of tokens, by removing any whitespace or comments in the source code.

If the lexical analyzer finds a token invalid, it generates an error. The lexical analyzer works
closely with the syntax analyzer. It reads character streams from the source code, checks for
legal tokens, and passes the data to the syntax analyzer when it demands.

Roles of the Lexical analyzer

Lexical analyzer performs below given tasks:

1. Helps to identify token into the symbol table

2. Removes white spaces and comments from the source program
3. Correlates error messages with the source program
4. Helps you to expands the macros if it is found in the source program
5. Read input characters from the source program

Lexical Errors
A character sequence which is not possible to scan into any valid token is a lexical error.
Important facts about the lexical error:

1. Lexical errors are not very common, but it should be managed by a scanner
2. Misspelling of identifiers, operators, keyword are considered as lexical errors
3. Generally, a lexical error is caused by the appearance of some illegal character, mostly at
the beginning of a token.
Error Recovery in Lexical Analyzer
Here, are a few most common error recovery techniques:

1. Removes one character from the remaining input

2. In the panic mode, the successive characters are always ignored until we reach a
well-formed token
3. By inserting the missing character into the remaining input
4. Replace a character with another character
5. Transpose two serial characters

Why separate Lexical and Parser?

1. The simplicity of design: It eases the process of lexical analysis and the syntax analysis
by eliminating unwanted tokens
2. To improve compiler efficiency: Helps you to improve compiler efficiency
3. Specialization: specialized techniques can be applied to improves the lexical analysis
process
4. Portability: only the scanner requires to communicate with the outside world
5. Higher portability: input-device-specific peculiarities restricted to the lexer

Advantages of Lexical analysis

1. Lexical analyzer method is used by programs like compilers which can use the parsed
data from a programmer's code to create a compiled binary executable code
2. It is used by web browsers to format and display a web page with the help of parsed data
from JavaScript, HTML, CSS
3. A separate lexical analyzer helps you to construct a specialized and potentially more
efficient processor for the task
Disadvantage of Lexical analysis

1. You need to spend significant time reading the source program and partitioning it in the
form of tokens
2. Some regular expressions are quite difficult to understand compared to PEG or EBNF
rules
3. More effort is needed to develop and debug the lexer and its token descriptions
4. Additional runtime overhead is required to generate the lexer tables and construct the
tokens

Diagram:

Conclusions: In computer science, lexical analysis, lexing or tokenization is the process of
converting a sequence of characters such as in a computer program or web page into a sequence
of tokens, strings with an assigned and thus identified meaning. A program that performs lexical
analysis may be termed a lexer, tokenizer, or scanner, though scanner is also a term for the first
stage of a lexer.Thus we have successfully studied it.

002chapter 2 - Lexical Analysis
No ratings yet
002chapter 2 - Lexical Analysis
114 pages
6.attributes For Tokens
No ratings yet
6.attributes For Tokens
5 pages
DATA ANALYTICS Syllabus 3 Units
No ratings yet
DATA ANALYTICS Syllabus 3 Units
37 pages
2 Syntax Directed Transiation
No ratings yet
2 Syntax Directed Transiation
9 pages
Chapter 5 - Uncertain Knowledge and Reasoning
No ratings yet
Chapter 5 - Uncertain Knowledge and Reasoning
29 pages
NLP Lab Manual
No ratings yet
NLP Lab Manual
15 pages
Unit 4
No ratings yet
Unit 4
26 pages
LR (0) Parser
No ratings yet
LR (0) Parser
8 pages
CS8602 Compiler Design Two Marks Questions 1
No ratings yet
CS8602 Compiler Design Two Marks Questions 1
22 pages
Chpater 1 - Unit 2
No ratings yet
Chpater 1 - Unit 2
31 pages
NLP Guide for AI Students
No ratings yet
NLP Guide for AI Students
29 pages
F U-4 PDF
No ratings yet
F U-4 PDF
48 pages
Unit-1 Cyber Laws
No ratings yet
Unit-1 Cyber Laws
21 pages
Recognition of Tokens
No ratings yet
Recognition of Tokens
34 pages
QB Solved m3
No ratings yet
QB Solved m3
4 pages
Aecs Lab Manual Final - 2019-20
No ratings yet
Aecs Lab Manual Final - 2019-20
101 pages
Compiler Design Lab Manual
No ratings yet
Compiler Design Lab Manual
36 pages
CD-30 Questions With Solution
No ratings yet
CD-30 Questions With Solution
43 pages
Write A C Program To Simulate Lexical Analyzer To Validating A Given Input String.
No ratings yet
Write A C Program To Simulate Lexical Analyzer To Validating A Given Input String.
8 pages
Unit 4: Symbol Table
No ratings yet
Unit 4: Symbol Table
38 pages
Compiler Design (CS-701) : Develop A Lexical Analyzer To Recognize A Few Patterns in C
No ratings yet
Compiler Design (CS-701) : Develop A Lexical Analyzer To Recognize A Few Patterns in C
17 pages
Madhuri Gupta 7th Sem AI Lab Manual1
No ratings yet
Madhuri Gupta 7th Sem AI Lab Manual1
17 pages
BScCSIT Transaction DBMS
No ratings yet
BScCSIT Transaction DBMS
30 pages
Compiler Design Course Guide
No ratings yet
Compiler Design Course Guide
114 pages
CD Previous Question Papers According To Jntuh Syllabus
No ratings yet
CD Previous Question Papers According To Jntuh Syllabus
16 pages
FLAT Lords Paper
No ratings yet
FLAT Lords Paper
60 pages
Compiler Design Unit-2
No ratings yet
Compiler Design Unit-2
29 pages
CD Sanchit Sir Notes
100% (1)
CD Sanchit Sir Notes
115 pages
NLP Revision Notes and Applications
No ratings yet
NLP Revision Notes and Applications
4 pages
AI Unit - 2 R22
No ratings yet
AI Unit - 2 R22
40 pages
5.2: Closure Properties of Recursive and Recursively Enumerable Languages
No ratings yet
5.2: Closure Properties of Recursive and Recursively Enumerable Languages
14 pages
Tries: - Standard Tries - Compressed Tries - Suffix Tries
No ratings yet
Tries: - Standard Tries - Compressed Tries - Suffix Tries
11 pages
Algorithms & Machine Learning Intro
No ratings yet
Algorithms & Machine Learning Intro
76 pages
Compiler Design Lab Guide
No ratings yet
Compiler Design Lab Guide
59 pages
Time: 3 Hours Total Marks: 100: Printed Page 1 of 2 Sub Code:KNC302
No ratings yet
Time: 3 Hours Total Marks: 100: Printed Page 1 of 2 Sub Code:KNC302
2 pages
Laboratory Record Note Book: Amity University Chhattisgarh
No ratings yet
Laboratory Record Note Book: Amity University Chhattisgarh
21 pages
It - (R22) - 2-2 - Automata and Compiler Design - Digital Notes - (2023-24)
No ratings yet
It - (R22) - 2-2 - Automata and Compiler Design - Digital Notes - (2023-24)
64 pages
Database Indexing and Hashing
No ratings yet
Database Indexing and Hashing
7 pages
NLP Practical
No ratings yet
NLP Practical
27 pages
MCQ On Knowledge Representation 5eea6a0e39140f30f369e525
No ratings yet
MCQ On Knowledge Representation 5eea6a0e39140f30f369e525
21 pages
Python MP Report PDF
No ratings yet
Python MP Report PDF
61 pages
Compiler Design Practical File PDF
No ratings yet
Compiler Design Practical File PDF
33 pages
ATCD-Unit 1
No ratings yet
ATCD-Unit 1
33 pages
Da Unit-2
No ratings yet
Da Unit-2
23 pages
Turing Test in AI, Agents, Environment
No ratings yet
Turing Test in AI, Agents, Environment
17 pages
Unit-3 Notes
No ratings yet
Unit-3 Notes
6 pages
NP-Complete Exam Guide
100% (1)
NP-Complete Exam Guide
7 pages
Chapter 13: Query Processing
No ratings yet
Chapter 13: Query Processing
25 pages
Re To DFA
No ratings yet
Re To DFA
6 pages
Unit 2 Lexical Analyzer
No ratings yet
Unit 2 Lexical Analyzer
30 pages
Compiler Design Lecture Notes (10CS63) : D C S & E
No ratings yet
Compiler Design Lecture Notes (10CS63) : D C S & E
96 pages
2-Regular Expressions, Text Normalization, Edit Distance
No ratings yet
2-Regular Expressions, Text Normalization, Edit Distance
42 pages
DAP Lab Manual
No ratings yet
DAP Lab Manual
20 pages
Lab Assignment1 Mongodb
100% (1)
Lab Assignment1 Mongodb
2 pages
UNIT 2 - CS3401-Algorithms
No ratings yet
UNIT 2 - CS3401-Algorithms
22 pages
CD Unit-V
No ratings yet
CD Unit-V
10 pages
Compiler Lab Guide for Students
No ratings yet
Compiler Lab Guide for Students
47 pages
Lesson 10
No ratings yet
Lesson 10
27 pages
Lexical Analysis in Compiler Design With Example
No ratings yet
Lexical Analysis in Compiler Design With Example
8 pages
Role of A Lexical AN
No ratings yet
Role of A Lexical AN
26 pages
Sharepoint Online and Office 365 Administration
No ratings yet
Sharepoint Online and Office 365 Administration
238 pages
8086 Microprocessor Guide
No ratings yet
8086 Microprocessor Guide
26 pages
Slides 02 Programming Languages - UET CS - Talha Waheed - Classification of PL
No ratings yet
Slides 02 Programming Languages - UET CS - Talha Waheed - Classification of PL
27 pages
Antivirus Report Last
No ratings yet
Antivirus Report Last
102 pages
Unit 3 Flashcards
No ratings yet
Unit 3 Flashcards
15 pages
9050 User's Manual PDF
100% (1)
9050 User's Manual PDF
147 pages
Architecture Advance 3
No ratings yet
Architecture Advance 3
18 pages
Artificial Intelligence Tutelage System
No ratings yet
Artificial Intelligence Tutelage System
5 pages
Advanced Modeling Checklist
No ratings yet
Advanced Modeling Checklist
6 pages
Naukri RAJATPANDEY (3y 4m)
No ratings yet
Naukri RAJATPANDEY (3y 4m)
4 pages
Data Migration
No ratings yet
Data Migration
5 pages
Laboratory Activities Structures
No ratings yet
Laboratory Activities Structures
4 pages
REAA Student Course Booklet (FNS Courses)
No ratings yet
REAA Student Course Booklet (FNS Courses)
17 pages
TLE-TE 9 - Q1 - W5 - Mod5 - ICT CSS
100% (4)
TLE-TE 9 - Q1 - W5 - Mod5 - ICT CSS
31 pages
CFree5: Enabling C++11 Support
No ratings yet
CFree5: Enabling C++11 Support
8 pages
Switch Manager: SM/EN GL/B11
No ratings yet
Switch Manager: SM/EN GL/B11
38 pages
Cloud Healthcare
No ratings yet
Cloud Healthcare
5 pages
Windows XP Command Line Guide
No ratings yet
Windows XP Command Line Guide
4 pages
Thesis Supervisor Recommendation With Representative Content and Information Retrieval
No ratings yet
Thesis Supervisor Recommendation With Representative Content and Information Retrieval
8 pages
BSBTEC404-BSBTWK401 Student Assessment Guide
No ratings yet
BSBTEC404-BSBTWK401 Student Assessment Guide
44 pages
08 - CA (CL) - 20th Batch (Sec-B) - IT - LO6 (Part) - Internet of Things (IoT) - CLass - 10 (29102024)
No ratings yet
08 - CA (CL) - 20th Batch (Sec-B) - IT - LO6 (Part) - Internet of Things (IoT) - CLass - 10 (29102024)
16 pages
Certified Data Entry and Office Assistant (Upskilling) Curriculum
No ratings yet
Certified Data Entry and Office Assistant (Upskilling) Curriculum
9 pages
Salesforce Integration Bootcamp Notes
No ratings yet
Salesforce Integration Bootcamp Notes
22 pages
Bluetooth Barcode Reader Manual
No ratings yet
Bluetooth Barcode Reader Manual
29 pages
Ansible Fundamentals To Advance
No ratings yet
Ansible Fundamentals To Advance
18 pages
Lab 10 Oracle Access Management - Access Manager 11g R2 PS3 OAM Authentication Plug-In
100% (1)
Lab 10 Oracle Access Management - Access Manager 11g R2 PS3 OAM Authentication Plug-In
33 pages
SRT Gemini Traslation Documnetation
No ratings yet
SRT Gemini Traslation Documnetation
10 pages
Academic Research Thesis Guide
No ratings yet
Academic Research Thesis Guide
58 pages
imageRUNNER C3226i - Benefit Datasheet - EM - FINAL - DIGI
No ratings yet
imageRUNNER C3226i - Benefit Datasheet - EM - FINAL - DIGI
2 pages
910-6854-001 Rev B PDF
No ratings yet
910-6854-001 Rev B PDF
22 pages

Case Study of Lexical Analyzer PDF

Uploaded by

Case Study of Lexical Analyzer PDF

Uploaded by

Case Study: Lexical Analyzer

Roles of the Lexical analyzer

1. Helps to identify token into the symbol table

1. Removes one character from the remaining input

Why separate Lexical and Parser?

Advantages of Lexical analysis

You might also like