0% found this document useful (0 votes)

84 views13 pages

A Soft Introduction To NLP - Semantic Similarity Calculations Using Python - Medium

This document summarizes an article that introduces semantic similarity calculations using Python and natural language processing (NLP). It explains that semantic similarity measures how alike the meanings of two words or phrases are by converting them into vectors and using a similarity function. The document provides an example Python script that imports a pre-trained model and calculates similarity scores between 0-1 for some sample sentences to demonstrate how semantic similarity can be measured programmatically. It also notes that different models may produce varying results, so testing multiple models is important.

Uploaded by

Tridiv

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

84 views13 pages

A Soft Introduction To NLP - Semantic Similarity Calculations Using Python - Medium

Uploaded by

Tridiv

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Open in app

Get unlimited access to all of Medium for less than $1/week. Become a member

Semantic Similarity Calculations Using NLP

and Python: A Soft Introduction
Tanner Overcash · Follow
5 min read · Mar 21

Listen Share More

This article covers at a very high level what semantic similarity is and demonstrates a
quick example of how you can take advantage of open-source tools and pre-trained models
in your Python scripts. I hope you like the word ‘similarity’ because you’re about to read it
a thousand times.

Follow along with the code, available here.

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 1/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Similar Houses? Semantic Similarity? Get it? Photo by Maria Orlova: https://www.pexels.com/photo/similar-
houses-in-highland-valley-covered-with-dense-forest-4946680/

Introduction
Semantic Similarity is a field of Artificial Intelligence (AI), specifically Natural
Language Processing (NLP), that creates a quantitative measure of the meaning
likeness between two words or phrases. At a high level, this is done by converting
words, sentences, or phrases into a vector — a mathematical representation of that
word, sentence, or phrase— using a process called sentence embedding. Using a
function of similarity (great post about different functions of similarity here by
Ashutosh Kumar), these embeddings are used to find that quantitative measure of
similarity.

This measure of similarity can be attributed to different aspects of the subjects

you’re comparing, so be sure you fully understand what your end goals are. For
example, some similarity models compare the lengths of words or phrases to
determine a similarity measure. Others work best for comparing words in specific
lexicons, such as medical texts. The language, context, and content of what you’re
trying to compare are all considerations you need to take when choosing a tool or
model.

Environment Setup

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 2/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

To start using semantic similarity with Python, we’re going to use the sentence-
transformers library, which is a framework for state-of-the-art sentence, text, and
image embeddings. One of the reasons I’m pointing this framework out is because
of the range of different models it supports. This support is important since we’ll
not be training a model and will need iteration to find which model best suits our
needs.

Let’s get started! First, we need to install the library. I’m on an Apple Silicon
MacBook, so I needed a few prerequisites before setting up my virtual environment.
If you’re not using a Mac, you can skip ahead to creating the virtual environment.

First, we’ll need to install Rust:

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh

Next, we need to install cmake. I recommend using Homebrew to make this simple:

brew install cmake

Next, let’s set up a virtual environment. I like to use Pyenv, but from the sentence-
transformers installation notes, if you plan on using your GPU, you’ll need to use
PyTorch. I won’t cover using PyTorch here, but let me know if you’d be interested in
an article on that!

pyenv install 3.11.1

pyenv virtualenv 3.11.1 learning_nlp
pyenv activate learning_nlp
pip install -U sentence-transformers

Example Script
Now that we have the library installed let’s look at how we can use the library to
compare two sentences using a simple script.

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 3/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

from sentence_similarity import sentence_similarity

def compare_sentences(sentence_1=str, sentence_2=str, model_name=str, embedding

"""Utilizes an NLP model that calculates the similarity between
two sentences or phrases."""

model = sentence_similarity(model_name=model_name, embedding_type=embedding

score = model.get_score(sentence_1, sentence_2, metric=metric)
return(f"Comparison Score between '{sentence_1}' and '{sentence_2}': {score

model_1 = "sentence-transformers/all-MiniLM-L6-v2"

sentence_1 = "rivers woods and hills"

sentence_2 = "streams forests and mountains"
sentence_3 = "deserts sand and shrubs"

print(compare_sentences(sentence_1=sentence_1, sentence_2=sentence_2, model_nam

print(compare_sentences(sentence_1=sentence_1, sentence_2=sentence_3, model_nam
print(compare_sentences(sentence_1=sentence_2, sentence_2=sentence_3, model_nam

Here, we’ve created a function that requires two ‘sentences’ and a model name. The
function creates sentence embeddings for each sentence (the variable named
model) and calculates the similarity difference between the two (the variable named
score). I’ve predefined two variables in the function: embedding type, which is the
methodology being used to create the sentence embeddings, and the metric, or the
function of similarity. For our embedding type, I selected cls_token_embedding,
with cls standing for classification. This tells the model we’re creating embeddings
of sentences rather than full words. The metric we’re using is cosine similarity, one
of several measures.

The score returned by the function will be a number (specifically a float) between 0
and 1. You can see the different scores for each sentence, of which I created two that
are semantically similar and one that is semantically different but contextually
similar: I did this on purpose to further illustrate the importance of knowing what
you’re attempting to measure here. Each sentence describes features within a
biome, but only sentences one and two describe similar biomes. This similarity is
what we’re trying to measure, and we can see a positive correlation to our desired
output from the scores we get back:

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 4/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Comparison Score between 'rivers woods and hills' and 'streams forests and
mountains': 0.84
Comparison Score between 'rivers woods and hills' and 'deserts sand and
shrubs': 0.631
Comparison Score between 'streams forests and mountains' and 'deserts sand
and shrubs': 0.576

Our function worked!… at least, it worked relative to the sentences we put into it. It
will be dependent on what you’re trying to accomplish if 0.84 crosses your threshold
for similar enough to mark as a positive correlation or not. This is where having
multiple models to test comes into play. Using

Now using model: 'sentence-transformers/all-MiniLM-L6-v2'

Now using model: 'sentence-transformers/all-mpnet-base-v2'

Comparison Score between 'rivers woods and hills' and 'streams forests and
mountains': 0.781
Comparison Score between 'rivers woods and hills' and 'deserts sand and
shrubs': 0.501
Comparison Score between 'streams forests and mountains' and 'deserts sand
and shrubs': 0.572

Now using model: 'sentence-transformers/paraphrase-MiniLM-L12-v2'

Comparison Score between 'rivers woods and hills' and 'streams forests and
mountains': 0.922
Comparison Score between 'rivers woods and hills' and 'deserts sand and
shrubs': 0.764
Comparison Score between 'streams forests and mountains' and 'deserts sand
and shrubs': 0.722

Now using model: 'sentence-transformers/multi-qa-MiniLM-L6-cos-v1'

Comparison Score between 'rivers woods and hills' and 'streams forests and
mountains': 0.805
Comparison Score between 'rivers woods and hills' and 'deserts sand and
shrubs': 0.662
Comparison Score between 'streams forests and mountains' and 'deserts sand

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 5/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

and shrubs': 0.613

Now using model: 'sentence-transformers/nli-mpnet-base-v2'

Comparison Score between 'rivers woods and hills' and 'streams forests and
mountains': 0.823
Comparison Score between 'rivers woods and hills' and 'deserts sand and
shrubs': 0.446
Comparison Score between 'streams forests and mountains' and 'deserts sand
and shrubs': 0.398

As you can see, changing the model will drastically change the score you get back
after comparing the model. By changing the model you’re using (and using a MUCH
larger sample size than three sentences), you can tune the function to work for what
you need. Or, you may learn you need to train a model!

Summary
Through the colossal efforts of the open-source community, the barrier to entry for
working with NLP in Python isn’t as high as it may feel. The HuggingFace
community and sentence-similarity library offer a range of options for
quantitatively calculating the semantic similarity between words, sentences, or
phrases. By spending some time reviewing and testing different pertained models,
you can implement semantic similarity into your applications and reap the benefits
of machine learning.

NLP Semantic Similarity Python Machine Learning Artificial Intelligence

Written by Tanner Overcash

27 Followers

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 6/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Geospatial Data and Python enthusiast. Probably somewhere in the Rocky Mountains right now!

More from Tanner Overcash

Tanner Overcash

Python For Spatial Analysis

Learning To Use Geopandas

7 min read · Feb 4

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 7/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Tanner Overcash

Introduction to Remote Sensing: Part One

A brief introduction to what Remote Sensing is, how it’s performed, and some of its many uses.

9 min read · Mar 29

5 1

See all from Tanner Overcash

Recommended from Medium

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 8/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Christian Bernecker

NLP SIMILARITY: Use pretrained word embeddings for semantic

similarity search with BERT
Use pretrained word embeddings to measure document similarity and do semantic similarity
search with a BERT Transformer.

5 min read · Mar 1

68 2

Ruben Winastwan in Towards Data Science

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 9/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Semantic Textual Similarity with BERT

How to use BERT to calculate the semantic similarity between two texts

· 11 min read · Feb 15

190

Lists

Natural Language Processing

508 stories · 134 saves

Predictive Modeling w/ Python

20 stories · 267 saves

Practical Guides to Machine Learning

10 stories · 280 saves

ChatGPT
21 stories · 109 saves

dominiconorton

Optimizing Similarity Search with OpenAI’s Word Embeddings for

Pinecone Database

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 10/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

In today’s data-driven world, many businesses and organizations rely on machine learning to
process and analyze large amounts of data…

4 min read · Mar 1

Kshitiz Sahay

Fine-tuning Llama 2 for news category prediction: A step-by-step

comprehensive guide to…
A step-by-step comprehensive guide to fine-tuning any LLM.

14 min read · Aug 7

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 11/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Abdulkader Helwan

Introduction to Word and Sentence Embedding

In the field of Natural Language Processing (NLP), the use of word and sentence embeddings
has revolutionized the way we analyze and…

8 min read · Feb 25

Rijul Dahiya

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 12/13
17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

How to Build a Question-Answering App with LangChain and OpenAI

Langchain is a natural language processing library that provides various tools and models for
working with text data. In this blog post, we…

2 min read · May 3

See more recommendations

https://medium.com/@tanner.overcash/semantic-similarity-calculations-using-nlp-and-python-a-soft-introduction-1f31df965e40 13/13

Deep Learning For Semantic Similarity
No ratings yet
Deep Learning For Semantic Similarity
7 pages
Nlp Project[1]
No ratings yet
Nlp Project[1]
16 pages
Sun 等 - 2022 - Sentence Similarity Based on Contexts
No ratings yet
Sun 等 - 2022 - Sentence Similarity Based on Contexts
16 pages
10 1002@cpe 5971
No ratings yet
10 1002@cpe 5971
17 pages
Semantic Similarity Between Medium-Sized Texts
No ratings yet
Semantic Similarity Between Medium-Sized Texts
13 pages
Evolution of Semantic Similarity - A Survey
No ratings yet
Evolution of Semantic Similarity - A Survey
35 pages
A Cognitive Study On Semantic Similarity Analysis
No ratings yet
A Cognitive Study On Semantic Similarity Analysis
6 pages
NLP Final Review
No ratings yet
NLP Final Review
32 pages
A Hybrid Approach of Weighted Fine Tuned BERT Extraction With Deep Siamese Bi - LSTM Model For Semantic Text Similarity Identification
No ratings yet
A Hybrid Approach of Weighted Fine Tuned BERT Extraction With Deep Siamese Bi - LSTM Model For Semantic Text Similarity Identification
27 pages
genaii
No ratings yet
genaii
5 pages
Gen AI lab
No ratings yet
Gen AI lab
22 pages
Semantic Textual Similarity
No ratings yet
Semantic Textual Similarity
39 pages
Semantic Text Similarity
No ratings yet
Semantic Text Similarity
2 pages
2020.lrec-1.851
No ratings yet
2020.lrec-1.851
6 pages
If you
No ratings yet
If you
2 pages
Semantic Similarity
No ratings yet
Semantic Similarity
14 pages
NLP - Experiment - 8 - A10
No ratings yet
NLP - Experiment - 8 - A10
16 pages
Reference Material NLP - 2
No ratings yet
Reference Material NLP - 2
40 pages
Gen AI Micro
No ratings yet
Gen AI Micro
15 pages
NLP Lab Manual-1
No ratings yet
NLP Lab Manual-1
18 pages
A Survey of Numerous Text Similarity Approach
No ratings yet
A Survey of Numerous Text Similarity Approach
10 pages
nlp file
No ratings yet
nlp file
21 pages
8-Measuring Text Similarity Based On Structure and Word Embedding
No ratings yet
8-Measuring Text Similarity Based On Structure and Word Embedding
20 pages
Text Semantic Similarity
No ratings yet
Text Semantic Similarity
17 pages
week2and3
No ratings yet
week2and3
76 pages
Published Paper
No ratings yet
Published Paper
12 pages
NLP Lecture2 Text Pre Processing
No ratings yet
NLP Lecture2 Text Pre Processing
54 pages
cs224n lecture notes
No ratings yet
cs224n lecture notes
35 pages
Lab
No ratings yet
Lab
8 pages
AAAI06-123 (Revisar para Referencias)
No ratings yet
AAAI06-123 (Revisar para Referencias)
6 pages
Measure Term Similarity Using A Semantic Network Approach
No ratings yet
Measure Term Similarity Using A Semantic Network Approach
5 pages
Evaluating of Efficacy Semantic Similarity Methods
No ratings yet
Evaluating of Efficacy Semantic Similarity Methods
8 pages
Rajeev Mishra 20 SCSE1180087
No ratings yet
Rajeev Mishra 20 SCSE1180087
29 pages
Generative AI (1)
No ratings yet
Generative AI (1)
16 pages
Chapter06_IN_01_Eng
No ratings yet
Chapter06_IN_01_Eng
25 pages
AP for NLP-LO1
No ratings yet
AP for NLP-LO1
61 pages
Text Similarity Using Siamese Networks and Transformers
No ratings yet
Text Similarity Using Siamese Networks and Transformers
10 pages
NLP_Module 2
No ratings yet
NLP_Module 2
54 pages
Semantic Textual Similarity With Siamese Neural Networks: Tharindu Ranasinghe, Constantin or Asan and Ruslan Mitkov
No ratings yet
Semantic Textual Similarity With Siamese Neural Networks: Tharindu Ranasinghe, Constantin or Asan and Ruslan Mitkov
8 pages
UNIT_5_DL
No ratings yet
UNIT_5_DL
11 pages
4 Word Representation
No ratings yet
4 Word Representation
41 pages
Short Text Similarity Calculation Based On Jaccard and Semantic Mixture
No ratings yet
Short Text Similarity Calculation Based On Jaccard and Semantic Mixture
9 pages
alshammari-2023-ijca-922667
No ratings yet
alshammari-2023-ijca-922667
4 pages
NLP 2
No ratings yet
NLP 2
8 pages
Building A Simple Chatbot From Scratch in Python1
No ratings yet
Building A Simple Chatbot From Scratch in Python1
8 pages
NLP FinAL (1)
No ratings yet
NLP FinAL (1)
27 pages
Christopher Manning Lecture 1: Introduction and Word Vectors
No ratings yet
Christopher Manning Lecture 1: Introduction and Word Vectors
42 pages
Project
No ratings yet
Project
11 pages
Generative AI 2
No ratings yet
Generative AI 2
24 pages
Semantic Text Analysis
No ratings yet
Semantic Text Analysis
6 pages
AP for NLP-Word 2 Vec
No ratings yet
AP for NLP-Word 2 Vec
33 pages
DL Unit-IV
No ratings yet
DL Unit-IV
20 pages
NLP Notes and Related Questions
No ratings yet
NLP Notes and Related Questions
7 pages
2021.sustainlp-1.9
No ratings yet
2021.sustainlp-1.9
5 pages
NLP-proj
No ratings yet
NLP-proj
13 pages
Motivation Video: Mitsuku Vs Cleverbot - AI (Artificial Intelligence)
No ratings yet
Motivation Video: Mitsuku Vs Cleverbot - AI (Artificial Intelligence)
45 pages
Expert Systems With Applications: David Sánchez, Montserrat Batet, David Isern, Aida Valls
No ratings yet
Expert Systems With Applications: David Sánchez, Montserrat Batet, David Isern, Aida Valls
11 pages
Semantic Net(AI Practical 5) Print
No ratings yet
Semantic Net(AI Practical 5) Print
2 pages
Linux, Apache, MySQL, PHP Performance End to End
From Everand
Linux, Apache, MySQL, PHP Performance End to End
Colin McKinnon
5/5 (1)
Python: Best Practices to Programming Code with Python: Python Computer Programming, #2
From Everand
Python: Best Practices to Programming Code with Python: Python Computer Programming, #2
Charlie Masterson
No ratings yet
21st Century Literature From The Philippines and The World
100% (2)
21st Century Literature From The Philippines and The World
3 pages
EF4e_preint_filetest_1_mód
No ratings yet
EF4e_preint_filetest_1_mód
6 pages
One Page Guide Igcse Syllabus Website
No ratings yet
One Page Guide Igcse Syllabus Website
1 page
Exam - Unit 2: Full Name: .Date: March 19, 2022. Vocabulary
No ratings yet
Exam - Unit 2: Full Name: .Date: March 19, 2022. Vocabulary
4 pages
Word Formation Processes
No ratings yet
Word Formation Processes
5 pages
Rizki Annisaa Bab II
No ratings yet
Rizki Annisaa Bab II
14 pages
Ajay- CV
No ratings yet
Ajay- CV
4 pages
BIO316 Answers 5
No ratings yet
BIO316 Answers 5
5 pages
Japanese Language Transformed
No ratings yet
Japanese Language Transformed
7 pages
Thesis Final Kajal Pourjalil
No ratings yet
Thesis Final Kajal Pourjalil
58 pages
Documents
No ratings yet
Documents
1 page
Eapp DLL q1 Mod For Teaching Purposes
No ratings yet
Eapp DLL q1 Mod For Teaching Purposes
10 pages
Assignment 4 Louran
No ratings yet
Assignment 4 Louran
11 pages
Belgium
No ratings yet
Belgium
4 pages
English HL P1 GR 12 Exemplar 2014 Memo
No ratings yet
English HL P1 GR 12 Exemplar 2014 Memo
8 pages
Bost 23-05
No ratings yet
Bost 23-05
2 pages
A Written or Verbal Request Inviting Someone To Go Somewhere or To Do Something
No ratings yet
A Written or Verbal Request Inviting Someone To Go Somewhere or To Do Something
2 pages
1 Holidays and Travel
No ratings yet
1 Holidays and Travel
4 pages
Peran Dan Tugas Receptionist Pada Pt. Serim Indonesia: Disadur Oleh: Dra. Nani Nuraini Sarah Msi
No ratings yet
Peran Dan Tugas Receptionist Pada Pt. Serim Indonesia: Disadur Oleh: Dra. Nani Nuraini Sarah Msi
19 pages
English Grade 8 (Descriptive Text)
No ratings yet
English Grade 8 (Descriptive Text)
2 pages
Actividad 1 Inglés 5
No ratings yet
Actividad 1 Inglés 5
10 pages
12 Verb Tenses
No ratings yet
12 Verb Tenses
3 pages
CC Bachman
No ratings yet
CC Bachman
38 pages
Quarter 4, First Summative Test in All Subjects (With Tos and Answer Key
No ratings yet
Quarter 4, First Summative Test in All Subjects (With Tos and Answer Key
45 pages
Dokumen - Tips - Virginia Evans Neil Osullivan Students Book Teachers Book Workbook Students
No ratings yet
Dokumen - Tips - Virginia Evans Neil Osullivan Students Book Teachers Book Workbook Students
13 pages
Roget s Super Thesaurus 3rd Edition Marc Mccutcheon - Read the ebook now with the complete version and no limits
100% (1)
Roget s Super Thesaurus 3rd Edition Marc Mccutcheon - Read the ebook now with the complete version and no limits
53 pages
4am Test4
No ratings yet
4am Test4
49 pages
LS English 9 Unit 4 Test Answers Editable
No ratings yet
LS English 9 Unit 4 Test Answers Editable
3 pages
Uas 7
No ratings yet
Uas 7
6 pages
RAZ-B 007 Bonkers Likes To Bark
No ratings yet
RAZ-B 007 Bonkers Likes To Bark
10 pages

A Soft Introduction To NLP - Semantic Similarity Calculations Using Python - Medium

Uploaded by

A Soft Introduction To NLP - Semantic Similarity Calculations Using Python - Medium

Uploaded by

17/08/2023, 12:54 A Soft Introduction to NLP / Semantic Similarity Calculations Using Python | Medium

Semantic Similarity Calculations Using NLP

Listen Share More

Follow along with the code, available here.

This measure of similarity can be attributed to different aspects of the subjects

First, we’ll need to install Rust:

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh

brew install cmake

pyenv install 3.11.1

from sentence_similarity import sentence_similarity

def compare_sentences(sentence_1=str, sentence_2=str, model_name=str, embedding

model = sentence_similarity(model_name=model_name, embedding_type=embedding

sentence_1 = "rivers woods and hills"

print(compare_sentences(sentence_1=sentence_1, sentence_2=sentence_2, model_nam

Now using model: 'sentence-transformers/all-MiniLM-L6-v2'

Now using model: 'sentence-transformers/all-mpnet-base-v2'

Now using model: 'sentence-transformers/paraphrase-MiniLM-L12-v2'

Now using model: 'sentence-transformers/multi-qa-MiniLM-L6-cos-v1'

and shrubs': 0.613

Now using model: 'sentence-transformers/nli-mpnet-base-v2'

NLP Semantic Similarity Python Machine Learning Artificial Intelligence

Written by Tanner Overcash

More from Tanner Overcash

Python For Spatial Analysis

7 min read · Feb 4

Introduction to Remote Sensing: Part One

9 min read · Mar 29

See all from Tanner Overcash

Recommended from Medium

NLP SIMILARITY: Use pretrained word embeddings for semantic

5 min read · Mar 1

Ruben Winastwan in Towards Data Science

Semantic Textual Similarity with BERT

· 11 min read · Feb 15

Natural Language Processing

Predictive Modeling w/ Python

Practical Guides to Machine Learning

Optimizing Similarity Search with OpenAI’s Word Embeddings for

4 min read · Mar 1

Fine-tuning Llama 2 for news category prediction: A step-by-step

14 min read · Aug 7

Introduction to Word and Sentence Embedding

8 min read · Feb 25

How to Build a Question-Answering App with LangChain and OpenAI

2 min read · May 3

See more recommendations

You might also like