String Naive and KMP

The document outlines a C program for string manipulation that reads a main string, a pattern string, and a replacement string, performing pattern matching to replace occurrences of the pattern in the main string. It describes the algorithm and implementation details, including a naive method and the KMP (Knuth-Morris-Pratt) algorithm for efficient pattern searching. Additionally, it provides code snippets for both the naive approach and the KMP algorithm, detailing how to compute the longest prefix suffix (lps) array for improved matching efficiency.

Uploaded by

colabpython39

EXPERIMENT - 02

Design, develop and implement a program in C for the following
operations on strings:
• Read a main string (STR), a pattern string (PAT) and a replace string
(REP).
• Perform the pattern matching operation: find and replace all occurrences
of PAT in STR with REP if PAT exists in STR. Report a suitable message
if PAT does not exist in STR.
Support the program with functions for each of the above operations.
Don’t use built-in functions.
ALGORITHM:
Step 1: Start.
Step 2: Read main string STR, pattern string PAT and replace string REP.
Step 3: Search / find the pattern string PAT in the main string STR.
Step 4: If PAT is found then replace all occurrences of PAT in main string
STR with REP string.
Step 5: If PAT is not found give a suitable error message.
Step 6: Stop.
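#include <stdio.h>

// Declarations
char str[100], pat[50], rep[50];
char ans[5000]; // room for up to 99 matches, each replaced by up to 49 characters
int i, j, c, m, k, flag = 0;

// Reads one line into buf and drops the trailing newline.
// gets() is unsafe and was removed in C11, so fgets() is used instead.
void readline(char buf[], int size) {
    int n = 0;
    if (fgets(buf, size, stdin) == NULL)
        buf[0] = '\0';
    while (buf[n] != '\0' && buf[n] != '\n')
        n++;
    buf[n] = '\0';
}

void stringmatch() {
    i = m = c = j = 0;
    while (str[c] != '\0') {
        if (str[m] == pat[i]) { // matching
            i++;
            m++;
            if (pat[i] == '\0') { // found an occurrence
                flag = 1;
                // copy the replace string into the ans string
                for (k = 0; rep[k] != '\0'; k++, j++)
                    ans[j] = rep[k];
                i = 0;
                c = m;
            }
        } // if ends
        else { // mismatch: copy one character and restart the pattern
            ans[j] = str[c];
            j++;
            c++;
            m = c;
            i = 0;
        } // else ends
    } // end of while
    ans[j] = '\0';
} // end stringmatch()

int main() {
    printf("\nEnter a main string \n"); readline(str, sizeof str);
    printf("\nEnter a pattern string \n"); readline(pat, sizeof pat);
    printf("\nEnter a replace string \n"); readline(rep, sizeof rep);
    stringmatch();
    if (flag == 1)
        printf("\nThe resultant string is\n %s", ans);
    else
        printf("\nPattern string NOT found\n");
    return 0;
} // end of main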
Naïve Method:
str = abcdefgh
pat = def

str = aaaaaaaaab
pat = aaab

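Generating lps:
pat = a b c d a b c
Proper prefixes = a, ab, abc, abcd, abcda, abcdab
Proper suffixes = c, bc, abc, dabc, cdabc, bcdabc
Longest proper prefix that is also a suffix = abc (length 3)

p1  = a b c d a b e a b f
lps = 0 0 0 0 1 2 0 1 2 0

p2  = a b c d e a b f a b c
lps = 0 0 0 0 0 1 2 0 1 2 3

p3  = a a a a b a a c d
lps = 0 1 2 3 0 1 2 0 0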

Str: a b a b c a b a b a b d
Pat: a b a b d
KMP (Knuth-Morris-Pratt) Pattern Searching: The naive pattern-searching
algorithm performs poorly when many matching characters are followed by
a mismatching character, for example:
1) txt[] = “AAAAAAAAAAAAAAAAAB”, pat[] = “AAAAB”
2) txt[] = “ABABABCABABABCABABABC”, pat[] = “ABABAC”
(a bad case for the naive method, though not its worst case).
The KMP matching algorithm uses the degenerating property of the pattern
(the same sub-patterns appear more than once in the pattern) and
improves the worst-case complexity to O(n+m), where n is the length of
the text and m is the length of the pattern. The basic idea behind KMP
is: whenever we detect a mismatch (after some matches), we already know
some of the characters in the text of the next window. We use this
information to avoid re-matching characters that we know will match
anyway.

Preprocessing Overview: The KMP algorithm preprocesses pat[] and
constructs an auxiliary array lps[] of size m (the same as the size of
the pattern), which is used to skip characters while matching.
• The name lps stands for the longest proper prefix which is also a
suffix. A proper prefix is any prefix other than the whole string. For
example, prefixes of “ABC” are “”, “A”, “AB” and “ABC”. Proper prefixes
are “”, “A” and “AB”. Suffixes of the string are “”, “C”, “BC”, and “ABC”.
• We search for lps in sub-patterns. More precisely, we focus on
substrings of the pattern that are both a prefix and a suffix.
• For each sub-pattern pat[0..i] where i = 0 to m-1, lps[i] stores the
length of the longest proper prefix of pat[0..i] which is also a suffix
of pat[0..i].
Note: lps[i] could also be defined as the longest prefix which is also a
proper suffix. The word “proper” is needed in at least one place to make
sure that the whole substring is not considered.
For the pattern “AAAA”, lps[] is [0, 1, 2, 3]
For the pattern “ABCDE”, lps[] is [0, 0, 0, 0, 0]
For the pattern “AABAACAABAA”, lps[] is [0, 1, 0, 1, 2, 0, 1, 2, 3, 4, 5]
For the pattern “AAACAAAAAC”, lps[] is [0, 1, 2, 0, 1, 2, 3, 3, 3, 4]
For the pattern “AAABAAA”, lps[] is [0, 1, 2, 0, 1, 2, 3]

In the preprocessing part:
• We calculate the values in lps[]. To do that, we keep track of the
length of the longest prefix suffix value for the previous index (we use
the variable len for this purpose).
• We initialize lps[0] and len as 0.
• If pat[len] and pat[i] match, we increment len by 1, assign the
incremented value to lps[i] and move to the next i.
• If pat[i] and pat[len] do not match and len is not 0, we update len to
lps[len-1] and compare again without advancing i.
• If pat[i] and pat[len] do not match and len is 0, we set lps[i] to 0
and move to the next i.
How to use lps[] to decide the next positions (or to know the number
of characters to be skipped)?
• We start comparing pat[j], with j = 0, against the characters of the
current window of text.
• We keep matching characters txt[i] and pat[j] and keep incrementing i
and j while pat[j] and txt[i] keep matching.

When we see a mismatch:
• We know that the characters pat[0..j-1] match txt[i-j..i-1] (note that
j starts at 0 and is incremented only when there is a match).
• We also know (from the above definition) that lps[j-1] is the number of
characters of pat[0..j-1] that form both a proper prefix and a suffix.
• From the above two points, we can conclude that the first lps[j-1]
characters of the pattern already match the text just before position i
(txt[i-lps[j-1]..i-1]), so we do not need to compare them again. We set
j to lps[j-1] and continue comparing from txt[i]; if j is already 0, we
simply move to the next character of the text.
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

// Fills lps[] for given pattern pat
void computeLPSArray(const char* pat, int M, int* lps) {
    int len = 0; // Length of the previous longest prefix suffix
    lps[0] = 0;  // lps[0] is always 0
    int i = 1;   // Loop calculates lps[i] for i = 1 to M-1
    while (i < M) {
        if (pat[i] == pat[len]) {
            len++;
            lps[i] = len;
            i++;
        }
        else {
            if (len != 0) {
                len = lps[len - 1];
            }
            else {
                lps[i] = 0;
                i++;
            }
        }
    }
}
// Returns a malloc'd array holding the 1-based start index of every
// occurrence of pat in txt and stores the number of occurrences in *count.
// Returns NULL if the pattern is empty or memory cannot be allocated.
// The caller must free() the returned array.
int* KMPSearch(const char* pat, const char* txt, int* count) {
    int M = (int)strlen(pat);
    int N = (int)strlen(txt);
    // Number of occurrences found
    *count = 0;
    if (M == 0)
        return NULL; // lps[] needs at least one element
    // Create lps[] that will hold the longest prefix suffix values for pattern
    int* lps = (int*)malloc(M * sizeof(int));
    int* result = (int*)malloc((N + 1) * sizeof(int));
    if (lps == NULL || result == NULL) {
        free(lps);
        free(result);
        return NULL;
    }
    // Preprocess the pattern (calculate lps[] array)
    computeLPSArray(pat, M, lps);
    int i = 0; // index for txt
    int j = 0; // index for pat
    while ((N - i) >= (M - j)) {
        if (pat[j] == txt[i]) {
            j++;
            i++;
        }
        if (j == M) {
            // Record the occurrence (1-based index)
            result[*count] = i - j + 1;
            (*count)++;
            j = lps[j - 1];
        }
        else if (i < N && pat[j] != txt[i]) {
            if (j != 0) {
                j = lps[j - 1];
            }
            else {
                i = i + 1;
            }
        }
    }
    free(lps);
    return result;
}
// Driver code
int main() {
    const char txt[] = "geeksforgeeks";
    const char pat[] = "geeks";
    int count;
    // Call KMPSearch and get the array of occurrences
    int* result = KMPSearch(pat, txt, &count);
    // Print all the occurrences (1-based indices)
    for (int i = 0; i < count; i++) {
        printf("%d ", result[i]);
    }
    printf("\n");
    // Free the allocated memory
    free(result);
    return 0;
}
