AI Report

The document discusses artificial intelligence (AI) and its capability to mimic human cognitive functions, with a focus on an AI application for email spam filtering. It details the implementation of a spam filter using Python and various machine learning techniques, including data preprocessing, model training, and evaluation through accuracy, confusion matrix, and ROC curve. The report concludes with a function to classify emails as spam or ham based on the trained model.

Uploaded by

omarhamdy4927

Introduction

Artificial intelligence (AI) is the technology that allows computers and other
devices to mimic human learning, comprehension, problem-solving,
decision-making, creativity, and autonomy. AI-enabled apps and gadgets can
see and recognize objects, understand and respond to human language,
pick up new knowledge and skills, give consumers and specialists
detailed advice, and act on their own.

Application
This report explains one application of AI: the email spam filter. A spam
filter is useful in large organizations such as universities and big
companies, where it saves the time and effort otherwise spent removing
unwanted emails.

Simulation
Our work was carried out in Google Colab using Python. The code is as
follows:
# Step 1: Import necessary libraries
!pip install scikit-learn pandas

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.metrics import classification_report, accuracy_score

# Step 2: Load the dataset
# Replace the path below with your actual file path in Colab
csv_path = '/content/spam_ham_dataset1.csv'  # Update this path as needed
data = pd.read_csv(csv_path)

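# Step 3: Preprocess the data
X = data['text']
y = data['label_num']  # 1 = spam, 0 = ham (as used in classify_email below)

# Split the dataset into training and testing sets first, so that the
# TF-IDF vocabulary and weights are learned from the training emails only
X_train_text, X_test_text, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# Convert text to TF-IDF features
vectorizer = TfidfVectorizer(stop_words='english', max_features=5000)
X_train = vectorizer.fit_transform(X_train_text)
X_test = vectorizer.transform(X_test_text)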

# Step 4: Train the machine learning model
model = MultinomialNB()
model.fit(X_train, y_train)

# Step 5: Evaluate the model
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
report = classification_report(y_test, y_pred)

print(f"\nAccuracy: {accuracy * 100:.2f}%")
print("\nClassification Report:\n")
print(report)

import matplotlib.pyplot as plt
from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay, roc_curve, auc

# Step 6: Confusion Matrix
conf_matrix = confusion_matrix(y_test, y_pred)
disp = ConfusionMatrixDisplay(conf_matrix, display_labels=["Ham", "Spam"])
disp.plot(cmap='Blues')
plt.title("Confusion Matrix")
plt.show()

# Step 7: ROC Curve
y_prob = model.predict_proba(X_test)[:, 1]  # Get probabilities for the positive class
fpr, tpr, thresholds = roc_curve(y_test, y_prob)
roc_auc = auc(fpr, tpr)

plt.figure()
plt.plot(fpr, tpr, color='blue', label=f"ROC curve (AUC = {roc_auc:.2f})")
plt.plot([0, 1], [0, 1], color='red', linestyle='--')  # Diagonal line
plt.xlabel("False Positive Rate")
plt.ylabel("True Positive Rate")
plt.title("Receiver Operating Characteristic (ROC) Curve")
plt.legend(loc="lower right")
plt.grid()
plt.show()

# Step 8: Visualizing Precision and Recall
from sklearn.metrics import precision_recall_curve

precision, recall, _ = precision_recall_curve(y_test, y_prob)

plt.figure()
plt.plot(recall, precision, color='green', label="Precision-Recall curve")
plt.xlabel("Recall")
plt.ylabel("Precision")
plt.title("Precision-Recall Curve")
plt.legend(loc="lower left")
plt.grid()
plt.show()

# Function to classify a single email as spam or ham
def classify_email(email_text):
    # Transform the email text into TF-IDF features
    email_tfidf = vectorizer.transform([email_text])
    # Predict using the trained model
    prediction = model.predict(email_tfidf)
    return "Spam" if prediction[0] == 1 else "Ham"

# Example usage
example_email = ("Subject: Meeting Reminder\nHi team, just a reminder about the "
                 "meeting scheduled for tomorrow at 10 AM in the conference room.")
classification = classify_email(example_email)
classification  # Shown as the cell output in Colab
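
The listing above keeps the vectorizer and the classifier as two separate objects that classify_email has to apply in the right order. As a minimal sketch, assuming scikit-learn's make_pipeline is an acceptable alternative, the two steps can be bundled into a single estimator. The toy emails and the names spam_pipeline and classify_email_with_pipeline are hypothetical and are not part of the report's dataset or code.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical toy data, for illustration only (1 = spam, 0 = ham)
toy_emails = [
    "win a free prize now, click here",
    "free money offer, claim your prize",
    "meeting agenda for tomorrow morning",
    "please review the attached project report",
]
toy_labels = [1, 1, 0, 0]

# One object that vectorizes the text and then classifies it
spam_pipeline = make_pipeline(
    TfidfVectorizer(stop_words='english'),
    MultinomialNB(),
)
spam_pipeline.fit(toy_emails, toy_labels)

def classify_email_with_pipeline(email_text):
    # The pipeline applies the fitted TF-IDF transform before predicting
    prediction = spam_pipeline.predict([email_text])
    return "Spam" if prediction[0] == 1 else "Ham"

print(classify_email_with_pipeline("claim your free prize"))
print(classify_email_with_pipeline("project meeting report"))
```

With a pipeline, raw text goes in and a label comes out, so the vectorizer cannot accidentally be refitted or skipped at prediction time.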

Results
1. Accuracy
2. Confusion Matrix
3. ROC Curve
4. Precision-Recall Curve
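
The four results listed above all come from the same test-set predictions. As a rough illustration using made-up labels and scores (these are not the report's results), the sketch below shows how accuracy, precision, and recall follow from the four confusion-matrix counts, and how the area under the ROC curve is obtained from predicted probabilities. The 0.5 decision threshold is an assumption chosen for the example.

```python
from sklearn.metrics import confusion_matrix, roc_auc_score

# Hypothetical labels and spam probabilities, for illustration only
# (1 = spam, 0 = ham)
y_true = [0, 0, 0, 0, 1, 1, 1, 1]
y_score = [0.1, 0.2, 0.6, 0.3, 0.8, 0.4, 0.9, 0.7]

# Assumed decision threshold of 0.5
y_hat = [1 if s >= 0.5 else 0 for s in y_score]

# For binary labels, ravel() returns the counts in the order tn, fp, fn, tp
tn, fp, fn, tp = confusion_matrix(y_true, y_hat).ravel()

accuracy = (tp + tn) / (tp + tn + fp + fn)   # share of all emails classified correctly
precision = tp / (tp + fp)                   # share of flagged emails that are really spam
recall = tp / (tp + fn)                      # share of spam emails that were caught
toy_auc = roc_auc_score(y_true, y_score)     # threshold-independent ranking quality

print(f"tn={tn} fp={fp} fn={fn} tp={tp}")
print(f"accuracy={accuracy:.2f} precision={precision:.2f} recall={recall:.2f}")
print(f"ROC AUC={toy_auc:.4f}")
```

For a spam filter, a false positive (a legitimate email marked as spam) is usually the costlier error, so precision on the spam class is worth reading alongside overall accuracy.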
