
Department: Computer Science
Session: Spring 2024    Course Instructor: Shoukat Ali
Subject: Machine Learning    Course Code: __________    Max. Marks: 5
Class/Sec.: 8-C    Submission Date: 06/15/24    Time Duration: () From: ____ to ______

Student Name: Muhammad Zohaib    ID: CSC-20F-132

Assignment 02
Apply the following machine learning classifiers/algorithms to the PIMA Indians Diabetes Database to predict whether the patients in the dataset have diabetes or not.

Moreover, perform a comparative study of the mentioned algorithms.

1. Logistic regression
2. Decision tree
3. Random forest
4. Naive Bayes
5. KNN
6. SVM

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score, classification_report
from sklearn.preprocessing import StandardScaler, RobustScaler
from sklearn.pipeline import Pipeline

# Load the dataset
data = pd.read_csv('diabetes.csv')
X = data.drop('Outcome', axis=1)
y = data['Outcome']

# Split data into 80% training and 20% test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
                                                    random_state=42)

# Define a scaling + classifier pipeline for each model
pipelines = {
    'Logistic Regression': Pipeline([
        ('scaler', RobustScaler()),
        ('logreg', LogisticRegression(max_iter=1000, solver='liblinear'))]),
    'Decision Tree': Pipeline([
        ('scaler', StandardScaler()),
        ('tree', DecisionTreeClassifier())]),
    'Random Forest': Pipeline([
        ('scaler', StandardScaler()),
        ('forest', RandomForestClassifier())]),
    'Naive Bayes': Pipeline([
        ('scaler', StandardScaler()),
        ('nb', GaussianNB())]),
    'KNN': Pipeline([
        ('scaler', StandardScaler()),
        ('knn', KNeighborsClassifier())]),
    'SVM': Pipeline([
        ('scaler', StandardScaler()),
        ('svm', SVC())])
}

# Train and evaluate each model
for name, pipeline in pipelines.items():
    pipeline.fit(X_train, y_train)
    y_pred = pipeline.predict(X_test)
    print(f'\n{name}:')
    print(f'Accuracy: {accuracy_score(y_test, y_pred):.4f}')  # accuracy to 4 decimal places
    print('Classification Report:\n', classification_report(y_test, y_pred))
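One caveat worth noting about this dataset: in the standard PIMA diabetes CSV, a value of 0 in columns such as Glucose, BloodPressure, SkinThickness, Insulin and BMI marks a missing measurement rather than a real reading. Below is a minimal sketch of median imputation that could be run before the train/test split; the column names assume the common Kaggle version of diabetes.csv.

import numpy as np

# These columns use 0 as a placeholder for missing measurements (assumed Kaggle column names)
zero_as_missing = ['Glucose', 'BloodPressure', 'SkinThickness', 'Insulin', 'BMI']
data[zero_as_missing] = data[zero_as_missing].replace(0, np.nan)

# Fill each missing entry with the column median
data[zero_as_missing] = data[zero_as_missing].fillna(data[zero_as_missing].median())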

Comparative Study:

Logistic Regression
- Winning qualities: Simple, interpretable, handles linearly separable data, efficient with large datasets, benefits from feature scaling/outlier handling.
- Areas for improvement: Assumes linear relationships, less accurate with complex decision boundaries.
- Performance on Pima Indians: 75-80%

Decision Tree
- Winning qualities: Interpretable, visualizes decision rules, handles non-linear relationships, useful for feature selection.
- Areas for improvement: Prone to overfitting, sensitive to small data changes, may not generalize well.
- Performance on Pima Indians: 70-75%

Random Forest
- Winning qualities: Powerful, accurate, less prone to overfitting, handles non-linear relationships, works well with standardized features.
- Areas for improvement: Less interpretable, computationally demanding.
- Performance on Pima Indians: 75-82%

Naive Bayes
- Winning qualities: Simple, fast, handles high-dimensional data, good for categorical features.
- Areas for improvement: Assumes feature independence, sensitive to data distribution.
- Performance on Pima Indians: 70-75%

K-Nearest Neighbors
- Winning qualities: Non-parametric, simple, can learn complex decision boundaries, works well with standardized features.
- Areas for improvement: Computationally expensive for large datasets, requires careful tuning of k, sensitive to irrelevant features.
- Performance on Pima Indians: 72-78%

Support Vector Machine
- Winning qualities: Effective in high-dimensional spaces, flexible kernel choice, works well with standardized features.
- Areas for improvement: Sensitive to hyperparameters, less interpretable, computationally demanding with large datasets.
- Performance on Pima Indians: 75-82%
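The accuracy ranges above come from a single 80/20 split and can shift with the random seed. As an optional extension beyond the assignment code, 5-fold cross-validation over the same pipelines gives a more stable comparison, and a small grid search illustrates the "careful tuning of k" noted for KNN; the grid values here are illustrative assumptions, not part of the assignment.

from sklearn.model_selection import cross_val_score, GridSearchCV

# 5-fold cross-validated accuracy for every pipeline defined above
for name, pipeline in pipelines.items():
    scores = cross_val_score(pipeline, X, y, cv=5, scoring='accuracy')
    print(f'{name}: mean accuracy {scores.mean():.4f} (+/- {scores.std():.4f})')

# Illustrative tuning of k for KNN; 'knn' matches the step name in the KNN pipeline
knn_grid = GridSearchCV(pipelines['KNN'],
                        param_grid={'knn__n_neighbors': [3, 5, 7, 9, 11, 15]},
                        cv=5, scoring='accuracy')
knn_grid.fit(X_train, y_train)
print('Best k:', knn_grid.best_params_, '| CV accuracy:', round(knn_grid.best_score_, 4))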
_____________________________________________________________________________________
BEST OF LUCK
