Spam Detection With Machine Learning

This document discusses building a spam detection system using machine learning with Python. It explains that spam detection identifies spam emails and messages by analyzing text to filter out unimportant notifications. It then walks through importing libraries, loading a spam dataset, splitting the data into training and test sets, using a CountVectorizer and MultinomialNB classifier to train a model to detect spam messages. The model is tested on a user-input message and correctly predicts it as spam.

Uploaded by

WT O

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

136 views2 pages

Spam Detection With Machine Learning

Uploaded by

WT O

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Spam Detection with Machine Learning

Detecting spam alerts in emails and messages is one of the main applications that every big tech
company tries to improve for its customers. Apple’s official messaging app and Google’s Gmail are
great examples of such applications where spam detection works well to protect users from spam
alerts. So, if you are looking to build a spam detection system, this article is for you. In this article, I
will walk you through the task of Spam Detection with Machine Learning using Python.

Spam Detection

Whenever you submit details about your email or contact number on any platform, it has become
easy for those platforms to market their products by advertising them by sending emails or by
sending messages directly to your contact number. This results in lots of spam alerts and
notifications in your inbox. This is where the task of spam detection comes in.

Spam detection means detecting spam messages or emails by understanding text content so that
you can only receive notifications about messages or emails that are very important to you. If spam
messages are found, they are automatically transferred to a spam folder and you are never notified
of such alerts. This helps to improve the user experience, as many spam alerts can bother many
users.

Spam Detection using Python

Hope you now understand what spam detection is, now let’s see how to train a machine learning
model for detecting spam alerts using Python. I’ll start this task by importing the necessary Python
libraries and the dataset you need for this task:
import pandas as pd
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
data =
pd.read_csv("https://raw.githubusercontent.com/amankharwal/SMS-Spam-
Detection/master/spam.csv", encoding= 'latin-1')
data.head()

From this dataset, class and message are the only features we need to train a machine learning
model for spam detection, so let’s select these two columns as the new dataset:
data = data[["class", "message"]]

Now let’s split this dataset into training and test sets and train the model to detect spam messages:
x = np.array(data["message"])
y = np.array(data["class"])
cv = CountVectorizer()
X = cv.fit_transform(x) # Fit the Data
X_train, X_test, y_train, y_test = train_test_split(X, y,
test_size=0.33, random_state=42)
clf = MultinomialNB()
clf.fit(X_train,y_train)

Now let’s test this model by taking a user input as a message to detect whether it is spam or not:
sample = input('Enter a message:')
data = cv.transform([sample]).toarray()
print(clf.predict(data))

Enter a message:You won $40 cash price

['spam']

Summary

So this is how you can train a machine learning model for the task of detecting whether an email or a
message is spam or not. A Spam detector detects spam messages or emails by understanding text
content so that you can only receive notifications about messages or emails that are very important
to you. I hope you liked this article on the task of detecting spam alerts with machine learning using
Python. Feel free to ask your valuable questions in the comments section below.

AD0-E560 Adobe Marketo Engage Architect Master Dumps
No ratings yet
AD0-E560 Adobe Marketo Engage Architect Master Dumps
6 pages
Fake News Detection On Social Media Using Machine Learning Report
100% (1)
Fake News Detection On Social Media Using Machine Learning Report
27 pages
How To Hack Instagram Accounts Best Working Methods
71% (7)
How To Hack Instagram Accounts Best Working Methods
4 pages
Datawarehouse To Data Lakehouse
100% (1)
Datawarehouse To Data Lakehouse
48 pages
Obtl-Art Appriciation
No ratings yet
Obtl-Art Appriciation
4 pages
SE - Assignment 10
100% (1)
SE - Assignment 10
5 pages
Module 2 - The Framework and Process of Business Analytics
100% (1)
Module 2 - The Framework and Process of Business Analytics
9 pages
Digital Media Marketing Using Trend Analysis On Social Media Seminar Presentation
100% (1)
Digital Media Marketing Using Trend Analysis On Social Media Seminar Presentation
16 pages
Design and Implementation of Medicine Reminder Box and Health Checker
100% (1)
Design and Implementation of Medicine Reminder Box and Health Checker
6 pages
Coceptual Model - DevSecOps
No ratings yet
Coceptual Model - DevSecOps
28 pages
Attacks Concepts and Techniques
100% (1)
Attacks Concepts and Techniques
49 pages
Interim Project - Sentiment Analysis of Movie
No ratings yet
Interim Project - Sentiment Analysis of Movie
101 pages
Complete Final Sem Report PDF
No ratings yet
Complete Final Sem Report PDF
79 pages
Anomaly Detection With Machine Learning
No ratings yet
Anomaly Detection With Machine Learning
12 pages
IAS
No ratings yet
IAS
11 pages
The Price Prediction For Used Cars Using Multiple Linear Regression Model
No ratings yet
The Price Prediction For Used Cars Using Multiple Linear Regression Model
6 pages
E-Mail Spam Detection Using Machine Learning and Deep Learning
No ratings yet
E-Mail Spam Detection Using Machine Learning and Deep Learning
7 pages
JARVIS
No ratings yet
JARVIS
6 pages
Evolutionary Model (SDLC)
No ratings yet
Evolutionary Model (SDLC)
9 pages
Final Report
100% (1)
Final Report
20 pages
Gluttony - Fake Shopping Websites
No ratings yet
Gluttony - Fake Shopping Websites
8 pages
Ibm Websphere Datapower Soa Appliances Resources: Ozair Sheikh
No ratings yet
Ibm Websphere Datapower Soa Appliances Resources: Ozair Sheikh
9 pages
Airline Ticket Reservation System
50% (2)
Airline Ticket Reservation System
3 pages
Classification of Flower Species Final
No ratings yet
Classification of Flower Species Final
32 pages
Fruit Disease Detection Using Color, Texture Analysis: A Project Report
No ratings yet
Fruit Disease Detection Using Color, Texture Analysis: A Project Report
10 pages
Think Speak Iot Document
No ratings yet
Think Speak Iot Document
11 pages
Detection of Phishing WebsitesUsing Random Forest and XGBOOST
No ratings yet
Detection of Phishing WebsitesUsing Random Forest and XGBOOST
14 pages
Fake News Detection Using Machine Learning
No ratings yet
Fake News Detection Using Machine Learning
8 pages
Generating Fake News Detection Model Using A Two-Stage Evolutionary Approach 7th Aug 2023 Published
No ratings yet
Generating Fake News Detection Model Using A Two-Stage Evolutionary Approach 7th Aug 2023 Published
19 pages
Tensorflow Object Detection Api Tutorial PDF
No ratings yet
Tensorflow Object Detection Api Tutorial PDF
41 pages
Machine Learning in Traffic Classification of SDN - Final Project Report
No ratings yet
Machine Learning in Traffic Classification of SDN - Final Project Report
11 pages
Project Report
100% (1)
Project Report
60 pages
Music Player Document
No ratings yet
Music Player Document
8 pages
Path Visualizer: Gaurav Rana 01 Abhishek Kumar Singh 17 Adarsh Singh 18 Mentor-Mrs. Huda Khan
No ratings yet
Path Visualizer: Gaurav Rana 01 Abhishek Kumar Singh 17 Adarsh Singh 18 Mentor-Mrs. Huda Khan
18 pages
(KAVYA R SHETTY)
No ratings yet
(KAVYA R SHETTY)
21 pages
Blockchain Based Certificate Validation
No ratings yet
Blockchain Based Certificate Validation
7 pages
Real Time Currency Converter Ijariie13241
No ratings yet
Real Time Currency Converter Ijariie13241
5 pages
Stroke Prediction Project Report
No ratings yet
Stroke Prediction Project Report
7 pages
Building A Python Package in Minutes - Analytics Vidhya - Medium
No ratings yet
Building A Python Package in Minutes - Analytics Vidhya - Medium
23 pages
You Tube Transcript Summarizer - YOUTUBE TRANSCRIPT SUMMARISER MINI PROJECT REPORT Submitted by - Studocu PDF
No ratings yet
You Tube Transcript Summarizer - YOUTUBE TRANSCRIPT SUMMARISER MINI PROJECT REPORT Submitted by - Studocu PDF
1 page
Avocet Workflow Tech
No ratings yet
Avocet Workflow Tech
2 pages
Malicious Url Detection Based On Machine Learning
No ratings yet
Malicious Url Detection Based On Machine Learning
52 pages
Deep Audio Classification
No ratings yet
Deep Audio Classification
10 pages
Location Tracker Device Project Flow and Quotation
No ratings yet
Location Tracker Device Project Flow and Quotation
8 pages
File Sharing and Data Duplication Removal in Cloud Using File Checksum
No ratings yet
File Sharing and Data Duplication Removal in Cloud Using File Checksum
3 pages
SMS Spam Detection Using Machine Learning
No ratings yet
SMS Spam Detection Using Machine Learning
9 pages
Deserialization Attacks Explanation
No ratings yet
Deserialization Attacks Explanation
57 pages
Major Project Report (1702111025)
No ratings yet
Major Project Report (1702111025)
38 pages
Title: Personality Prediction System Problem Statement:: Literature Review
No ratings yet
Title: Personality Prediction System Problem Statement:: Literature Review
5 pages
Militant and Weapon Detection Final Report
No ratings yet
Militant and Weapon Detection Final Report
63 pages
Raspberry Pi
No ratings yet
Raspberry Pi
16 pages
Sentiment Analysis Report
No ratings yet
Sentiment Analysis Report
4 pages
Speech To Text Conversion
No ratings yet
Speech To Text Conversion
34 pages
Object Detection Tutorial
No ratings yet
Object Detection Tutorial
9 pages
Text Summarizer
No ratings yet
Text Summarizer
9 pages
Malware Detection
No ratings yet
Malware Detection
17 pages
167 Shreyas PDF
No ratings yet
167 Shreyas PDF
29 pages
Accident Detection System A Deep Learning Approach To Detect Accidents
No ratings yet
Accident Detection System A Deep Learning Approach To Detect Accidents
4 pages
Syllabus Gov1008 2014
No ratings yet
Syllabus Gov1008 2014
5 pages
Department of Electronics 2020-2021: Prof. Shilpa Achaliya
No ratings yet
Department of Electronics 2020-2021: Prof. Shilpa Achaliya
15 pages
CODRA-Brochure Panorama E2 - en
No ratings yet
CODRA-Brochure Panorama E2 - en
7 pages
Whatsapp Chat Analyser
No ratings yet
Whatsapp Chat Analyser
11 pages
Image Recognition and Its Language Translation Using OCR
No ratings yet
Image Recognition and Its Language Translation Using OCR
8 pages
Weather Prediction Using CPT+ Algorithm: Proposed Scheme
No ratings yet
Weather Prediction Using CPT+ Algorithm: Proposed Scheme
12 pages
How To Make Jarvis Iron Man Computer
No ratings yet
How To Make Jarvis Iron Man Computer
6 pages
Sign Language Recognition Using Deep Learning
No ratings yet
Sign Language Recognition Using Deep Learning
6 pages
Shajal Ahamed
No ratings yet
Shajal Ahamed
2 pages
Ass 3
No ratings yet
Ass 3
2 pages
Emotion Detection
No ratings yet
Emotion Detection
17 pages
Project
No ratings yet
Project
43 pages
Fake Profile Detection
100% (1)
Fake Profile Detection
69 pages
Drug Recommender System Using Machine Learning For Sentiment Analysis
No ratings yet
Drug Recommender System Using Machine Learning For Sentiment Analysis
4 pages
Port Forwarding Na Telekom Huawei HG530 - Bombastic85 PDF
No ratings yet
Port Forwarding Na Telekom Huawei HG530 - Bombastic85 PDF
11 pages
End of SF95D-08
No ratings yet
End of SF95D-08
9 pages
SMS Spam Detection and Classification Using NLP Thesis
No ratings yet
SMS Spam Detection and Classification Using NLP Thesis
14 pages
YouTube Transcript Summarizer
No ratings yet
YouTube Transcript Summarizer
62 pages
Thomas Jung
No ratings yet
Thomas Jung
4 pages
PCI Compliance: Understand and Implement Effective PCI Data Security Standard Compliance
No ratings yet
PCI Compliance: Understand and Implement Effective PCI Data Security Standard Compliance
3 pages
Three Level Architecture of DBMS
No ratings yet
Three Level Architecture of DBMS
7 pages
Spam Mail Detection Using Machine Learning
No ratings yet
Spam Mail Detection Using Machine Learning
14 pages
Fake News Detection
No ratings yet
Fake News Detection
18 pages
7.analysis and Detection of Malware in Android Applications Using Machine Learning
No ratings yet
7.analysis and Detection of Malware in Android Applications Using Machine Learning
55 pages
Wherescape Red Data Sheet PDF
No ratings yet
Wherescape Red Data Sheet PDF
2 pages
Ahmad Car Theft Reporting System Complete-1
No ratings yet
Ahmad Car Theft Reporting System Complete-1
47 pages
Status and Future of Manufacturing Execution Systems: Emrah Arica, Daryl Powell
No ratings yet
Status and Future of Manufacturing Execution Systems: Emrah Arica, Daryl Powell
6 pages
GovInfohub A Dynamic Government Scheme Chatbot For Informed Engagement and Accessibility
No ratings yet
GovInfohub A Dynamic Government Scheme Chatbot For Informed Engagement and Accessibility
6 pages
IBM Maximo Asset Management V7.6 Infrastructure and Implementation
No ratings yet
IBM Maximo Asset Management V7.6 Infrastructure and Implementation
18 pages
Speech Recognition Full Report
No ratings yet
Speech Recognition Full Report
11 pages
Resume 1735289601450 265733187
No ratings yet
Resume 1735289601450 265733187
3 pages
AWS Question Bank 2024-25 5th Sem
No ratings yet
AWS Question Bank 2024-25 5th Sem
5 pages

Spam Detection With Machine Learning

Uploaded by

Spam Detection With Machine Learning

Uploaded by

Spam Detection with Machine Learning

Spam Detection using Python

Enter a message:You won $40 cash price

You might also like