0% found this document useful (0 votes)

34 views31 pages

Chapter 4

Regular expressions provide a precise way to define formal languages. They use operations like *, +, and () to describe strings of characters. Any language that can be defined by a regular expression is considered a regular language. All finite languages are regular because they can be expressed as a single regular expression listing all possible strings. Regular expressions allow unambiguous definitions of languages in a way that removes all ambiguity.

Uploaded by

ruba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views31 pages

Chapter 4

Uploaded by

ruba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 31

Regular Expressions

Defining Languages Using

Regular Expressions
 Previously, we defined the languages:
• L1 = {Xn for n = 1, 2, 3, . . .}
• L2 = {x, xxx, xxxxx, . . .}
 But these are not very precise ways of
defining languages.
 So, now we want a very precise way of
defining a languages, and we will do this
using regular expressions
Regular Expressions
 Regular expressions are written in bold face letters and
are a way of specifying the language.
 Formal way to define the lexical specifications of a
language
 Remove ambiguity altogether
 Called expressions on account of similarity with
arithmetic expressions
 Use *, + and ()
 * shows repetition
 + presents choice or disjunction (some time authors used
U for this purpose)
 () used for grouping
Language-Defining Symbols:
Star Sign
 We now introduce the use of the Kleene star, applied not
to a set, but directly to the letter x and written as a
superscript: x*.
 This simple expression indicates some sequence of x’s
(may be none at all):
x* = Λ or x or x2 or x3…
= xn for some n = 0, 1, 2, 3, …

 We can think of the star as an unknownpower.

 That is, x* stands for a string of x’s, but we do not
specify how many, and it may be the null string .

4
 The notation x* can be used to define languages
by writing, say L = language (x*)

 Since x* is any string of x’s, L is then the

language of all possible strings of x’s of any
length (including Λ).

5
 Given the alphabet = {a, b}, suppose we wish to define the
language L that contains all words of the form one a followed by
some number of b’s (maybe no b’s at all); that is
L = {a, ab, abb, abbb, abbbb, …}

 Using the language-defining symbol, we may write

L = language (ab*)

 This equation obviously means that L is the language in which the

words are the concatenation of an initial a with some or no b’s.

6
 We can apply the Kleene star to the whole
string ab if we want:
(ab)* = Λ or ab or abab or ababab…
 Observe that
(ab)* ≠ a*b*
 because the language defined by the
expression on the left contains the word
abab, whereas the language defined by
7
the expression on the right does not.
 If we want to define the language L1 = {x; xx; xxx; …}
using the language-defining symbol, we can write
L1 = language(xx*)
which means that each word of L1 must start with an x
followed by some (or no) x’s.

 Note that we can also define L1 using the notation + (as

an exponent) introduced in previous lecture
L1 = language(x+)

8
Plus Sign

 Let us introduce another use of the plus

sign. By the expression
x+y
where x and y are strings of characters
from an alphabet, we mean either x or y.

 Care should be taken so as not to confuse

this notation with the notation + (as an
9
exponent).
Example

 Consider the language T over the alphabet

Σ = {a; b; c}:
 T = {a; c; ab; cb; abb; cbb; abbb; cbbb; abbbb;
cbbbb; …}
 In other words, all the words in T begin with
either an a or a c and then are followed by some
number of b’s.
 Using the above plus sign notation, we may
write this as
10
T = language((a+ c)b*)
Example

 Consider a finite language L that contains

all the strings of a’s and b’s of length
three exactly:
L = {aaa, aab, aba, abb, baa, bab, bba,
bbb}
 Thus, we may write
L = language((a+ b)(a + b)(a + b))

11
Example

 In general, if we want to refer to the set of all possible

strings of a’s and b’s of any length whatsoever, we could
write
language((a+ b)*)

 This is the set of all possible strings of letters from

the alphabet Σ = {a, b}, including the null string.

12
Regular Expressions
 Given  = {a,b}
 a* = {Λ, a,aa,aaa,aaa,aaaa,aaaaa, …}
 ab* = {a, ab,abb,abbb,abbbb, …}
 a+b = {a,b}
 (ab)* = {Λ, ab, abab, ababab, …}
 (a+b)* = {Λ, any string of as and bs}
Formal Definition of Regular
Expressions
 The set of regular expression is defined by
following rules
1. Every letter of  and Λ is a regular
expression
2. If r1 and r2 are regular expressions, then so
are
 (r1)
 r1r2
 r1+r2
 r1*

3.Nothing else is a regular expression

Regular Expressions
 Whether following are RE if so what
languages do they generate
a (b + a)*
 bb(a+b)
 (a+b)(a+b)(a+b)
 (a+b)*ba
 (a+b)*a(a+b)*
 (a+b)*aa(a+b)*
Regular Expressions
 Write RE for the following languages over
the  ={a,b}.
 All words ending with b
 All words that start with a
 All words that start with a double letter
 All words that contain at least one double letter
 All words that start and end with a double letter
 All words of length >=3
 All words that contain exactly one a or exactly
one b
 All words that don’t end at b
Regular Expressions
 Example: Give a regular expression for each of the
following over the alphabet { 0, 1 }:

 { w | w begins with a ‘1’ and ends with a ‘0’ }

 { w | w contains exactly three 1’s}

 { w | w contains at least three 1’s}

 { w | w is a string that begin with a ‘1’ and contain

exactly two 0’s }

 Regular expression definition of a language is not unique.

17
[Section 1.3]
Regular expressions

Examples: give regular expressions for the

following languages:
- { w ε {0,1}* | w contains the substring 01 }
or
- { w ε ∑* | w contains the substring 01 }
- {w in {0,1}* | second symbol of w is a 1}
- { w ε {0,1}* | |w| < 4 }
Some exercises on regular expressions

 Example: What is L((a  b)a(a  b))?

 Ans: {w in {a, b}* | w contains at least one
a}
 Write regular expressions for:
1. {w in {a,b}* | |w| is odd }.
2. {w in {a,b}* | w does not have ab as a substring}.
3. {w in {a,b,c}* | no b in w can come before any c in w}.

19
Regular Expression
Identities
Regular Expression Identities
1. u = u = u
2. * = 
3. u+v=v+u
4. u+u=u
5. u* = (u*)*
6. u (v + w) = uv + uw
7. (u + v) w = uw + vw

20
Languages Associated
with Regular
Expressions
Definition

 The following rules define the language associated

with any regular expression:

 Rule 1: The language associated with the regular

expression that is just a single letter is that one-letter
word alone, and the language associated with Λ is just
{Λ}, a one-word language.

 Rule 2: If r1 is a regular expression associated with the

language L1 and r2 is a regular expression associated
with the language L2, then:
(i) The regular expression (r1)(r2) is associated with the product
L1L2, that is the language L1 times the language L2:

language(r1r2) = L1L2
22
Definition contd.
 Rule 2 (cont.):

(ii) The regular expression r1 + r2 is associated

with the language formed by the union of L1
and L2:
language(r1 + r2) = L1 + L2

(iii) The language associated with the regular

expression (r1)* is L1*, the Kleene closure of
the set L1 as a set of words:
23 language(r1*) = L1*
Languages associated with REs
 r1= a, r2 = b, r3 = Λ
 IfL1 is associated with r1 and L2 is
associated r2
 Language(r1r2)= L1L2
 Language(r1+r2) = L1+L2 = L1 U L2
 Language(r1*) = L1* (Kleen’s Closure of L1)
Regular Languages
 How to tell whether a language is regular
 Define an RE for it, if it is possible the language
is Regular other wise non-regular
 Definition
 Any language that can be represented by a
regular expression is called a regular
language
 It is to be noted that if r1, r2 are regular
expressions, corresponding to the languages L1
and L2 then the languages generated by r1+
r2, r1r2( or r2r1) and r1*( or r2*) are also
regular languages.
Regular Languages
 Example
 Consider the language L, defined over
Σ = {a,b}, of strings of length 2,
starting with a, then
 L = {aa, ab}, may be expressed by
the regular expression aa+ab. Hence
L, by definition, is a regular language.
Regular Languages
 All finite languages are regular
 Example
 Consider the language L, defined over Σ
= {a,b}, of strings of length 2, starting
with a, then L = {aa, ab}, may be
expressed by the regular expression
aa+ab. Hence L, by definition, is a
regular language.
Theorem

 If L is a finite language (a language with only finitely many

words), then L can be defined by a regular expression. In
other words, all finite languages are regular.

Proof

 Let L be a finite language. To make one regular expression that

defines L, we turn all the words in L into boldface type and insert
plus signs between them.

 For example, the regular expression that defines the language

L = {baa, abbba, bababa} is (baa + abbba + bababa)

 This algorithm only works for finite languages because an infinite

language would become a regular expression that is infinitely long,
which is forbidden.
28
Equivalent Regular Expressions
 Definition
 Two regular expressions are said to be
equivalent if they generate the same
language.
 Example
 Consider the following regular
expressions
 r1 = (a + b)* (aa + bb)
 r2 = (a + b)*aa + ( a + b)*bb then both
regular expressions define the language
of strings ending in aa or bb
Example

 The language of all words that have at least two a’s can
be defined by the expression:
(a + b)*a(a + b)*a(a + b)*

 Another expression that defines all the words with at

least two a’s is
b*ab*a(a + b)*

 Hence, we can write

(a + b)*a(a + b)*a(a + b)* = b*ab*a(a + b)*

where by the equal sign we mean that these two

expressions are equivalent in the sense that they
30
describe the same language.
Example

 Let V be the language of all strings of a’s and b’s in

which either the strings are all b’s, or else an a followed
by some b’s. Let V also contain the word Λ. Hence,
V = {Λ, a, b, ab, bb, abb, bbb, abbb, bbbb, …}
 We can define V by the expression
b* + ab*
where Λ is included in b*.
 Alternatively, we could define V by
(Λ + a)b*
which means that in front of the string of some b’s, we
have
either an a or nothing.
31

CS3304 9 LanguageSyntax 2 PDF
No ratings yet
CS3304 9 LanguageSyntax 2 PDF
39 pages
Regular Expressions
No ratings yet
Regular Expressions
31 pages
Lecture 7 & 8 - Regular Expressions
No ratings yet
Lecture 7 & 8 - Regular Expressions
39 pages
Ch3 - Regular Expression: Subhash Sagar Email
No ratings yet
Ch3 - Regular Expression: Subhash Sagar Email
37 pages
Automata 3
No ratings yet
Automata 3
21 pages
TOC-L02-Regular Expressions-S25
No ratings yet
TOC-L02-Regular Expressions-S25
23 pages
Theory of Automata RE 3
No ratings yet
Theory of Automata RE 3
13 pages
Lecture#03,4
No ratings yet
Lecture#03,4
27 pages
Lesson 03
No ratings yet
Lesson 03
11 pages
Writing Regular Expression
No ratings yet
Writing Regular Expression
10 pages
CS273 Theory of Automata & Fomal Languages: (WEEK-2) Lecture-3 & 4
No ratings yet
CS273 Theory of Automata & Fomal Languages: (WEEK-2) Lecture-3 & 4
41 pages
TOA Lesson 04
No ratings yet
TOA Lesson 04
13 pages
LEC-3
No ratings yet
LEC-3
25 pages
Lecture # 2: Automata Theory and Formal Languages (CSC-221)
No ratings yet
Lecture # 2: Automata Theory and Formal Languages (CSC-221)
48 pages
Regular Expression2
No ratings yet
Regular Expression2
12 pages
2. Regular Expressions
No ratings yet
2. Regular Expressions
4 pages
Regular Expressions and Regular Languages
No ratings yet
Regular Expressions and Regular Languages
5 pages
Theory of Automata and Formal Languages
No ratings yet
Theory of Automata and Formal Languages
24 pages
Absent Tha
No ratings yet
Absent Tha
33 pages
Lecture 7 Regular Expression Lec
No ratings yet
Lecture 7 Regular Expression Lec
15 pages
Language Associated With Regular Expressions: FIRST SEM. SY. 2008-2009
No ratings yet
Language Associated With Regular Expressions: FIRST SEM. SY. 2008-2009
12 pages
Theory of Automata (Regular Expression)
No ratings yet
Theory of Automata (Regular Expression)
42 pages
Lesson 03
No ratings yet
Lesson 03
30 pages
Chap-2 2 (RegularExpression)
No ratings yet
Chap-2 2 (RegularExpression)
46 pages
Lesson 3
No ratings yet
Lesson 3
10 pages
2.0+Regular Expression Part 1 MKN
No ratings yet
2.0+Regular Expression Part 1 MKN
33 pages
TOA - Lec4 Regular Expressions
No ratings yet
TOA - Lec4 Regular Expressions
28 pages
Theory of Automata-RE (2)
No ratings yet
Theory of Automata-RE (2)
25 pages
CSC312 Automata Theory: Recursive Definations Regular Expressions
No ratings yet
CSC312 Automata Theory: Recursive Definations Regular Expressions
28 pages
TPL lect 15 - 16
No ratings yet
TPL lect 15 - 16
5 pages
TOA Lecture03
No ratings yet
TOA Lecture03
24 pages
lecture 3, 4 (1)
No ratings yet
lecture 3, 4 (1)
33 pages
chapter two
No ratings yet
chapter two
59 pages
CSC312 Automata Theory: Regular Expressions
No ratings yet
CSC312 Automata Theory: Regular Expressions
20 pages
chapter 3
No ratings yet
chapter 3
10 pages
Chapter No.1
No ratings yet
Chapter No.1
31 pages
Computation Theory: Expressions Languages Grammar
No ratings yet
Computation Theory: Expressions Languages Grammar
51 pages
Chapter 4: Regular Expressions
No ratings yet
Chapter 4: Regular Expressions
12 pages
03-RegularExpression 112422
No ratings yet
03-RegularExpression 112422
22 pages
Regular expression
No ratings yet
Regular expression
89 pages
Compiler - Chap.2.part 3
No ratings yet
Compiler - Chap.2.part 3
85 pages
Chapter 3 - Regular Expression
No ratings yet
Chapter 3 - Regular Expression
16 pages
Lecture 3a and 3b
No ratings yet
Lecture 3a and 3b
21 pages
Language About Complier Construction
No ratings yet
Language About Complier Construction
23 pages
Regular - Expressions For FL & A
No ratings yet
Regular - Expressions For FL & A
34 pages
Lecture 03
No ratings yet
Lecture 03
16 pages
TOA Lecture 03
No ratings yet
TOA Lecture 03
63 pages
Unit Ii
No ratings yet
Unit Ii
25 pages
FL 2
No ratings yet
FL 2
34 pages
Lecture 3 Regular Expressions
No ratings yet
Lecture 3 Regular Expressions
36 pages
Language of Grammars
No ratings yet
Language of Grammars
27 pages
Lecture Slides Regular Expressions
No ratings yet
Lecture Slides Regular Expressions
138 pages
Lesson 03
No ratings yet
Lesson 03
18 pages
Theory of Automata Lecture#2: by Riaz Ahmad Ziar R.ziar@kardan - Edu.af
No ratings yet
Theory of Automata Lecture#2: by Riaz Ahmad Ziar R.ziar@kardan - Edu.af
22 pages
02_Regular Expression and Regular Languages
No ratings yet
02_Regular Expression and Regular Languages
40 pages
TOA - Lecture 3
No ratings yet
TOA - Lecture 3
63 pages
3 RegularExpressions
No ratings yet
3 RegularExpressions
25 pages
Introduction to Formal Languages
From Everand
Introduction to Formal Languages
György E. Révész
2/5 (1)
The Genetic Code of All Languages,(Part-1; An Overview)
From Everand
The Genetic Code of All Languages,(Part-1; An Overview)
Moni Kanchan Panda
No ratings yet
Introduction to Partial Differential Equations: From Fourier Series to Boundary-Value Problems
From Everand
Introduction to Partial Differential Equations: From Fourier Series to Boundary-Value Problems
Arne Broman
2.5/5 (2)
The Genetic Code of All Languages,(Part 2.1; Numerals)
From Everand
The Genetic Code of All Languages,(Part 2.1; Numerals)
Moni Kanchan Panda
No ratings yet
Lecture 2 Data Mining Functions
No ratings yet
Lecture 2 Data Mining Functions
40 pages
Data Mining (DM) : Lecture 3: Know Your Data
No ratings yet
Data Mining (DM) : Lecture 3: Know Your Data
53 pages
DM-BS-lec6-Mining Frequent Patterns
No ratings yet
DM-BS-lec6-Mining Frequent Patterns
37 pages
Wordpress Tutorial
No ratings yet
Wordpress Tutorial
1 page
Lecture 1-Data Mining (Introduction)
No ratings yet
Lecture 1-Data Mining (Introduction)
30 pages
Distributed Database Management Systems: Week-4
No ratings yet
Distributed Database Management Systems: Week-4
24 pages
Distributed Database Management Systems: Week-3
No ratings yet
Distributed Database Management Systems: Week-3
7 pages
WEEK1
No ratings yet
WEEK1
20 pages
Distributed Database Management Systems: Week-4
No ratings yet
Distributed Database Management Systems: Week-4
24 pages
Week 5
No ratings yet
Week 5
23 pages
Distributed Database Management Systems: Week-3
No ratings yet
Distributed Database Management Systems: Week-3
7 pages
Compiler Design - Theory Tools and Examples PDF
No ratings yet
Compiler Design - Theory Tools and Examples PDF
320 pages
Lexical Analysis: 4.1 Motivation of The Chapter
No ratings yet
Lexical Analysis: 4.1 Motivation of The Chapter
2 pages
Brouwer1998 Chapter MythsAndFactsAboutTheEfficient PDF
No ratings yet
Brouwer1998 Chapter MythsAndFactsAboutTheEfficient PDF
15 pages
Lecture 2
No ratings yet
Lecture 2
29 pages
Nanomaterials PPT 2
No ratings yet
Nanomaterials PPT 2
23 pages
EE 331 - Signals and Systems: - A Must For All EE Engineers/researchers
No ratings yet
EE 331 - Signals and Systems: - A Must For All EE Engineers/researchers
29 pages
Destiny Harvest Centre: Prayer, Fasting & Revival
No ratings yet
Destiny Harvest Centre: Prayer, Fasting & Revival
2 pages
Is GPS 200
No ratings yet
Is GPS 200
226 pages
Cycle Inventory
No ratings yet
Cycle Inventory
40 pages
A - A - Anime of The Year Poll - Anime & Manga - 4chan
No ratings yet
A - A - Anime of The Year Poll - Anime & Manga - 4chan
71 pages
Grade 9 If Comprehension Check
No ratings yet
Grade 9 If Comprehension Check
3 pages
Case Ih Dx31 Dx34 Tractors Repair Manual
No ratings yet
Case Ih Dx31 Dx34 Tractors Repair Manual
8 pages
Week 3_Trims & inprocess inspection-1
No ratings yet
Week 3_Trims & inprocess inspection-1
31 pages
ACH File Format
No ratings yet
ACH File Format
2 pages
Booths Algorithm
No ratings yet
Booths Algorithm
23 pages
Xerox B1025 MFP Sag En-Us PDF
No ratings yet
Xerox B1025 MFP Sag En-Us PDF
123 pages
Climbing Film Evaporator
0% (1)
Climbing Film Evaporator
8 pages
Smoke Dampers: MODELS SD35, SD36, SD37, SD50, SD60 (M), SD60-2, SD60V, SD35SS, SD36SS, SD37SS, SDRS25 (M), SDRS25SS (M) and
No ratings yet
Smoke Dampers: MODELS SD35, SD36, SD37, SD50, SD60 (M), SD60-2, SD60V, SD35SS, SD36SS, SD37SS, SDRS25 (M), SDRS25SS (M) and
2 pages
Sathottari Kahanokar Manika Mohini Ke Katha Sahitya Me Mulya Vighatan
No ratings yet
Sathottari Kahanokar Manika Mohini Ke Katha Sahitya Me Mulya Vighatan
6 pages
Assignment Cover Page: RMIT International University Vietnam
No ratings yet
Assignment Cover Page: RMIT International University Vietnam
4 pages
Hegde Visual Speech Enhancement Without A Real Visual Stream WACV 2021 Paper
No ratings yet
Hegde Visual Speech Enhancement Without A Real Visual Stream WACV 2021 Paper
10 pages
Software Project Management (Department Elective Û I)
No ratings yet
Software Project Management (Department Elective Û I)
3 pages
Asso CET 2014 Syllabus
No ratings yet
Asso CET 2014 Syllabus
21 pages
Nguyen Van Tuan - CV
No ratings yet
Nguyen Van Tuan - CV
3 pages
Đọc hiểu tiếng Anh
No ratings yet
Đọc hiểu tiếng Anh
13 pages
Ad 19123
No ratings yet
Ad 19123
1 page
Backer Material For Use With Cold-And Hot-Applied Joint Sealants in Portland-Cement Concrete and Asphalt Joints
No ratings yet
Backer Material For Use With Cold-And Hot-Applied Joint Sealants in Portland-Cement Concrete and Asphalt Joints
4 pages
Living City 2.7 CHANGELOG
No ratings yet
Living City 2.7 CHANGELOG
9 pages
Thesis For Information Technology Free
100% (2)
Thesis For Information Technology Free
5 pages
4) Lecture_4_Unit 1 - PAYMENT VOUCHER
No ratings yet
4) Lecture_4_Unit 1 - PAYMENT VOUCHER
15 pages
Glistening Upturn in Branded Luxury Jewellery
No ratings yet
Glistening Upturn in Branded Luxury Jewellery
3 pages
Kalpesh Project 27
No ratings yet
Kalpesh Project 27
35 pages
NiQ Brochure EN
No ratings yet
NiQ Brochure EN
16 pages
Autocorrelation-Applied Tests
No ratings yet
Autocorrelation-Applied Tests
16 pages

Chapter 4

Uploaded by

Chapter 4

Uploaded by

Regular Expressions

Defining Languages Using

 We can think of the star as an unknownpower.

 Since x* is any string of x’s, L is then the

 Using the language-defining symbol, we may write

 This equation obviously means that L is the language in which the

 Note that we can also define L1 using the notation + (as

 Let us introduce another use of the plus

 Care should be taken so as not to confuse

 Consider the language T over the alphabet

 Consider a finite language L that contains

 In general, if we want to refer to the set of all possible

 This is the set of all possible strings of letters from

3.Nothing else is a regular expression

 { w | w begins with a ‘1’ and ends with a ‘0’ }

 { w | w contains exactly three 1’s}

 { w | w contains at least three 1’s}

 { w | w is a string that begin with a ‘1’ and contain

 Regular expression definition of a language is not unique.

Examples: give regular expressions for the

 Example: What is L((a  b)*a(a  b)*)?

 The following rules define the language associated

 Rule 1: The language associated with the regular

 Rule 2: If r1 is a regular expression associated with the

(ii) The regular expression r1 + r2 is associated

(iii) The language associated with the regular

 If L is a finite language (a language with only finitely many

 Let L be a finite language. To make one regular expression that

 For example, the regular expression that defines the language

 This algorithm only works for finite languages because an infinite

 Another expression that defines all the words with at

 Hence, we can write

where by the equal sign we mean that these two

 Let V be the language of all strings of a’s and b’s in

You might also like

 Example: What is L((a  b)a(a  b))?