0% found this document useful (0 votes)
42 views3 pages

Imports used by the code: nltk.tokenize, nltk.corpus, nltk.stem, itertools, numpy.

This document contains code to perform sentence similarity comparison between two sentences. It tokenizes the sentences, removes stopwords, lemmatizes the words, calculates the WordNet similarity between each pair of words using WUP similarity, takes the maximum similarity value and calculates the average similarity score between 0-1 to classify the sentence pairs as similar, somewhat similar or not similar.

Uploaded by

femi
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
42 views3 pages

Imports used by the code: nltk.tokenize, nltk.corpus, nltk.stem, itertools, numpy.

This document contains code to perform sentence similarity comparison between two sentences. It tokenizes the sentences, removes stopwords, lemmatizes the words, calculates the WordNet similarity between each pair of words using WUP similarity, takes the maximum similarity value and calculates the average similarity score between 0-1 to classify the sentence pairs as similar, somewhat similar or not similar.

Uploaded by

femi
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

"""Sentence similarity via WordNet.

Tokenizes two sentences, removes stopwords and punctuation, lemmatizes the
remaining words, scores every cross-sentence word pair with Wu-Palmer (WUP)
synset similarity, and averages each word's best match into a 0-1 index used
to classify the pair as Similar / Somewhat Similar / Not Similar.
"""

from nltk.tokenize import sent_tokenize, word_tokenize
from nltk.corpus import stopwords, wordnet
from nltk.stem import WordNetLemmatizer
from itertools import product
import numpy

##--------------- Sample sentence pairs (swap in any pair to test) ---------------##
# str1 = "Abhishek is a good boy."
# str2 = "Abhishek is not a bad boy."
# str1 = "Cat is drinking water."
# str2 = "Lions eat flesh."
# str1 = "He loves to play football."
# str2 = "Football is his favourite sport."
# str1 = "Many consider Maradona as the best player in soccer history."
# str2 = "Maradona is one of the best soccer player."
# str1 = "Ballmer has been vocal in the past warning that Linux is a threat to Microsoft."
# str2 = "In the memo, Ballmer reiterated the open-source threat to Microsoft."
# str1 = "The boy is fetching water from the well."
# str2 = "The lion is running in the forest."
# str1 = "A school is a place where kids go to study."
# str2 = "School is an institution for children who want to study."
# str1 = "The world knows it has lost a heroic champion of justice and freedom."
# str2 = "The earth recognizes the loss of a valiant champion of independence and justice."
# str1 = "A cemetery is a place where dead people's bodies or their ashes are buried."
# str2 = "A graveyard is an area of land ,sometimes near a church, where dead people are buried."

str1 = "I was given a card by her in the garden."
str2 = "In the garden, she gave me a card."

##--------------- Stopwords ---------------##
# BUG FIX: NLTK stopword corpora are keyed by lowercase language names —
# stopwords.words("English") raises an error; it must be "english".
stop_words = set(stopwords.words("english"))

##--------------- WordNet lemmatizer ---------------##
lemmatizer = WordNetLemmatizer()


def preprocess(sentence):
    """Return lemmatized, stopword-free, alphanumeric tokens of *sentence*."""
    return [
        lemmatizer.lemmatize(token)
        for token in word_tokenize(sentence)
        if token not in stop_words and token.isalnum()
    ]


def best_wup_similarity(word1, word2):
    """Maximum WUP similarity over every synset pair of the two words.

    Returns None when no synset pair yields a score (e.g. a word is not
    in WordNet, or wup_similarity is None for every pairing).
    """
    scores = []
    for sense1, sense2 in product(wordnet.synsets(word1), wordnet.synsets(word2)):
        score = wordnet.wup_similarity(sense1, sense2)
        # wup_similarity returns None for incomparable synsets; skip those.
        if score is not None:
            scores.append(score)
    return max(scores) if scores else None


##--------------- Tokenizing, stopword removal & lemmatizing s1/s2 ---------------##
lemm_sentence1 = preprocess(str1)
print(lemm_sentence1)

lemm_sentence2 = preprocess(str2)
print(lemm_sentence2)

##--------------- Similarity check for each word in s1 & s2 ---------------##
# For each word of sentence 1, keep its best similarity against any word of
# sentence 2; the sentence score is the mean of these per-word maxima.
final = []
for word1 in lemm_sentence1:
    per_word = [
        best for best in (best_wup_similarity(word1, word2) for word2 in lemm_sentence2)
        if best is not None
    ]
    if per_word:
        final.append(max(per_word))

##--------------- Classification output ---------------##
# Guard the empty case: numpy.mean([]) returns NaN with a RuntimeWarning,
# which would make the classification below meaningless.
similarity_index = round(float(numpy.mean(final)), 2) if final else 0.0
print("Sentence 1: ", str1)
print("Sentence 2: ", str2)
print("Similarity index value : ", similarity_index)

if similarity_index > 0.8:
    print("Similar")
elif similarity_index >= 0.6:
    print("Somewhat Similar")
else:
    print("Not Similar")

You might also like