0% found this document useful (0 votes)

49 views

Pattern Recognition Assignment: Hari Narayan N.U B110490EE EEE A Batch

This document contains the results of several machine learning algorithms applied to a haberman survival dataset: 1. A Naive Bayes classifier was trained on 153 instances with 4 attributes, achieving 77.78% accuracy on the training set and 74.50% on test data. 2. k-Nearest Neighbors with k=7 achieved 78.43% accuracy on 153 test instances. 3. Simple k-Means clustering with k=4 grouped 306 instances into 4 clusters based on the attributes. 70.91% of instances were incorrectly clustered.

Uploaded by

HARINARAYANNU

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

49 views

Pattern Recognition Assignment: Hari Narayan N.U B110490EE EEE A Batch

Uploaded by

HARINARAYANNU

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

PATTERN RECOGNITION ASSIGNMENT

Submitted by

HARI NARAYAN N.U

B110490EE
EEE A batch
TRAINING DATA
=== Run information ===

Scheme:weka.classifiers.bayes.NaiveBayes
Relation:

haberman-weka.filters.unsupervised.instance.RemovePercentage-P50.0-V

Instances: 153
Attributes: 4
Age_of_patient_at_time_of_operation
Patients_year_of_operation
Number_of_positive_axillary_nodes_detected
Survival_status
Test mode:evaluate on training data

=== Classifier model (full training set) ===

Naive Bayes Classifier

Class
Attribute

(0.74) (0.26)
=============================================================
Age_of_patient_at_time_of_operation

mean

43.0259 45.3968

std. dev.

6.1537 4.7468

weight sum
precision

114

1.0476 1.0476

Patients_year_of_operation
58

12.0

6.0

12.0

7.0

17.0

2.0

13.0

1.0

10.0

4.0

15.0

5.0

11.0

7.0

10.0

4.0

10.0

5.0

9.0

4.0

3.0

1.0

4.0

5.0

[total]

126.0 51.0

Number_of_positive_axillary_nodes_detected
mean

2.7161 7.3333

std. dev.

4.8927 10.2232

weight sum
precision

114

2.3636 2.3636

Time taken to build model: 0 seconds

=== Evaluation on training set ===

=== Summary ===

Correctly Classified Instances

119

77.7778 %

22.2222 %

Incorrectly Classified Instances

Kappa statistic

0.2817

Mean absolute error

0.2806

Root mean squared error

0.403

Relative absolute error

73.5784 %

Root relative squared error

92.4707 %

Total Number of Instances

153

=== Detailed Accuracy By Class ===

TP Rate FP Rate Precision Recall F-Measure ROC Area Class

0.947

0.718

0.794

0.947

0.864

0.791 1

0.282

0.053

0.647

0.282

0.393

0.791 2

Weighted Avg. 0.778

0.548

=== Confusion Matrix ===

a b <-- classified as
108 6 | a = 1
28 11 | b = 2

0.757

0.778

0.744

0.791

TEST DATA

=== Run information ===

Scheme:weka.classifiers.bayes.NaiveBayes
Relation:

haberman-weka.filters.unsupervised.instance.RemovePercentage-P50.0-V

Instances: 153
Attributes: 4
Age_of_patient_at_time_of_operation
Patients_year_of_operation
Number_of_positive_axillary_nodes_detected
Survival_status
Test mode:user supplied test set: size unknown (reading incrementally)

=== Classifier model (full training set) ===

Naive Bayes Classifier

Class
Attribute

(0.74) (0.26)
=============================================================
Age_of_patient_at_time_of_operation
mean

43.0259 45.3968

std. dev.

6.1537 4.7468

weight sum

114

precision

1.0476 1.0476

Patients_year_of_operation
58

12.0

6.0

12.0

7.0

17.0

2.0

13.0

1.0

10.0

4.0

15.0

5.0

11.0

7.0

10.0

4.0

10.0

5.0

9.0

4.0

3.0

1.0

4.0

5.0

[total]

126.0 51.0

Number_of_positive_axillary_nodes_detected
mean

2.7161 7.3333

std. dev.

4.8927 10.2232

weight sum
precision

114

2.3636 2.3636

Time taken to build model: 0 seconds

=== Evaluation on test set ===

=== Summary ===

Correctly Classified Instances

114

74.5098 %

25.4902 %

Incorrectly Classified Instances

Kappa statistic

0.2148

Mean absolute error

0.306

Root mean squared error

0.4831

Relative absolute error

78.2813 %

Root relative squared error

108.1765 %

Total Number of Instances

153

=== Detailed Accuracy By Class ===

TP Rate FP Rate Precision Recall F-Measure ROC Area Class

0.937

0.762

0.765

0.937

0.842

0.591 1

0.238

0.063

0.588

0.238

0.339

0.591 2

Weighted Avg. 0.745

0.57

=== Confusion Matrix ===

a b <-- classified as
104 7 | a = 1
32 10 | b = 2

0.716

0.745

0.704

0.591

NEAREST NEIGHBOUR CLASSIFICATION

=== Run information ===

Scheme:weka.classifiers.lazy.IBk -K 7 -W 0 -A "weka.core.neighboursearch.LinearNNSearch -A
\"weka.core.EuclideanDistance -R first-last\""
Relation:

haberman

Instances: 306
Attributes: 4
Age_of_patient_at_time_of_operation
Patients_year_of_operation
Number_of_positive_axillary_nodes_detected
Survival_status
Test mode:user supplied test set: size unknown (reading incrementally)

=== Classifier model (full training set) ===

IB1 instance-based classifier

using 7 nearest neighbour(s) for classification

Time taken to build model: 0 seconds

=== Evaluation on test set ===

=== Summary ===

Correctly Classified Instances

Incorrectly Classified Instances
Kappa statistic

120

78.4314 %

21.5686 %

0.3589

Mean absolute error

0.2999

Root mean squared error

0.3919

Relative absolute error

75.9937 %

Root relative squared error

87.808 %

Total Number of Instances

153

=== Detailed Accuracy By Class ===

TP Rate FP Rate Precision Recall F-Measure ROC Area Class

0.946

0.643

0.795

0.946

0.864

0.801 1

0.357

0.054

0.714

0.357

0.476

0.801 2

Weighted Avg. 0.784

0.481

=== Confusion Matrix ===

a b <-- classified as
105 6 | a = 1
27 15 | b = 2

0.773

0.784

0.758

0.801

K MEAN CLUSTERING

=== Run information ===

Scheme:weka.clusterers.SimpleKMeans -N 4 -A "weka.core.EuclideanDistance -R first-last" -I 500 -S 10

Relation:

haberman

Instances: 306
Attributes: 4
Age_of_patient_at_time_of_operation
Patients_year_of_operation
Number_of_positive_axillary_nodes_detected
Ignored:
Survival_status
Test mode:Classes to clusters evaluation on training data
=== Model and evaluation on training set ===

kMeans
======

Number of iterations: 6
Within cluster sum of squared errors: 197.29360453534517
Missing values globally replaced with mean/mode

Cluster centroids:
Cluster#
Attribute

Full Data
(306)

(52)

0
(89)

2
(87)

3
(78)

=====================================================================================
==============
Age_of_patient_at_time_of_operation
Patients_year_of_operation

52.4575 56.3462
58

Number_of_positive_axillary_nodes_detected

Clustered Instances

52 ( 17%)

89 ( 29%)

87 ( 28%)

78 ( 25%)

Class attribute: Survival_status

Classes to Clusters:

0 1 2 3 <-- assigned to cluster

31 67 67 60 | 1
21 22 20 18 | 2

Cluster 0 <-- No class

4.0261 10.1731

Time taken to build model (full training data) : 0.02 seconds

=== Model and evaluation on training set ===

59.618 43.8506 51.2949

64
2.3034

4.1494

1.7564

Cluster 1 <-- 2
Cluster 2 <-- 1
Cluster 3 <-- No class

Incorrectly clustered instances : 217.0

70.915 %

Sample Clusters

2.PROBLEM 1

2.PROBLEM 2

WPS-Tube To Tube Sheet (SS-SS)
0% (2)
WPS-Tube To Tube Sheet (SS-SS)
2 pages
Nilay Debnath CSE 06607735
No ratings yet
Nilay Debnath CSE 06607735
22 pages
ML program 7 ,8,9 and10
No ratings yet
ML program 7 ,8,9 and10
12 pages
Weka
No ratings yet
Weka
9 pages
Weka
No ratings yet
Weka
22 pages
Lab6-Data Mining
No ratings yet
Lab6-Data Mining
7 pages
Sas P8
No ratings yet
Sas P8
11 pages
A320210048_Tugas 3 Statistic C
No ratings yet
A320210048_Tugas 3 Statistic C
4 pages
KNN - Ipynb - Colaboratory
No ratings yet
KNN - Ipynb - Colaboratory
3 pages
Tables Perf
No ratings yet
Tables Perf
3 pages
WEKA Assignment I
No ratings yet
WEKA Assignment I
2 pages
NguyenCongSang ITITIU20292 Lab2
No ratings yet
NguyenCongSang ITITIU20292 Lab2
13 pages
Student - Linear Regression Example - Colaboratory
No ratings yet
Student - Linear Regression Example - Colaboratory
6 pages
Assignment 03
No ratings yet
Assignment 03
6 pages
Introduction to Neural Networks
No ratings yet
Introduction to Neural Networks
4 pages
Stata Ouputs On Randomized Complete Block Design
No ratings yet
Stata Ouputs On Randomized Complete Block Design
8 pages
Garishav Basra 102103129 2CO5
No ratings yet
Garishav Basra 102103129 2CO5
8 pages
Maghda Zakiyah Muthi'Ah - Colab
No ratings yet
Maghda Zakiyah Muthi'Ah - Colab
4 pages
ICFAI University, Dehradun: Student Name Iud No Ibs No
No ratings yet
ICFAI University, Dehradun: Student Name Iud No Ibs No
9 pages
ML Practical1
No ratings yet
ML Practical1
4 pages
Week 4 Naive Bayes Classifier
No ratings yet
Week 4 Naive Bayes Classifier
2 pages
Failmezger-Using-the-Dilatometer-Test-to-Make-Accurate-Settlement-Predictions
No ratings yet
Failmezger-Using-the-Dilatometer-Test-to-Make-Accurate-Settlement-Predictions
55 pages
Final Code-30 Bus Gauss Siedel
No ratings yet
Final Code-30 Bus Gauss Siedel
6 pages
Jawab: 1. Run Information
No ratings yet
Jawab: 1. Run Information
6 pages
labpg3.ipynb - Colab
No ratings yet
labpg3.ipynb - Colab
2 pages
KNN - Jupyter Notebook (1)
No ratings yet
KNN - Jupyter Notebook (1)
7 pages
Autometed Identification of Ocular toxoplasmosis in fundoscopic images utilizing deep learning Model
No ratings yet
Autometed Identification of Ocular toxoplasmosis in fundoscopic images utilizing deep learning Model
14 pages
EXP - 7- Prasham Doshi - 22bec097
No ratings yet
EXP - 7- Prasham Doshi - 22bec097
7 pages
Advanced Statistics Problems (New) 1
No ratings yet
Advanced Statistics Problems (New) 1
5 pages
Lab2 ITDSIU21030 Nguyen Duy Phuc
No ratings yet
Lab2 ITDSIU21030 Nguyen Duy Phuc
6 pages
Machine Learnin1
100% (1)
Machine Learnin1
41 pages
K FOLD
No ratings yet
K FOLD
6 pages
WINSEM2024-25_CSE3008_ELA_AP2024254001161_2025-02-13_Reference-Material-I (1)
No ratings yet
WINSEM2024-25_CSE3008_ELA_AP2024254001161_2025-02-13_Reference-Material-I (1)
2 pages
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
No ratings yet
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
20 pages
Reasons For Adopting Stochiastic Operation Method: (Diferencial Evolution)
No ratings yet
Reasons For Adopting Stochiastic Operation Method: (Diferencial Evolution)
11 pages
Shailesh020902@gmail - Com 1
No ratings yet
Shailesh020902@gmail - Com 1
1 page
Name: Le Ho Thao Nguyen Student ID: 20194224
No ratings yet
Name: Le Ho Thao Nguyen Student ID: 20194224
9 pages
Tugas Besar ASTL Lanjut&Softwarwe STL
No ratings yet
Tugas Besar ASTL Lanjut&Softwarwe STL
8 pages
GLA3 AJPODD2024 Nimish Shandilya
No ratings yet
GLA3 AJPODD2024 Nimish Shandilya
6 pages
Lab 07 NR
No ratings yet
Lab 07 NR
6 pages
output5
No ratings yet
output5
10 pages
Description of The Data: Ratio Desorbe D
No ratings yet
Description of The Data: Ratio Desorbe D
5 pages
Genetic Algorithm Based Feature Selection For Medical Diagnosing Using Artificial Neural Network
No ratings yet
Genetic Algorithm Based Feature Selection For Medical Diagnosing Using Artificial Neural Network
54 pages
eBay Auction Case Solution
No ratings yet
eBay Auction Case Solution
9 pages
Vertopal.com Heart Failure Prediction With Detailed Headings
No ratings yet
Vertopal.com Heart Failure Prediction With Detailed Headings
12 pages
Supervised Learning With Scikit-Learn: Preprocessing Data
No ratings yet
Supervised Learning With Scikit-Learn: Preprocessing Data
32 pages
031 Kokila's Problem
No ratings yet
031 Kokila's Problem
10 pages
Customer Churn Analysis
No ratings yet
Customer Churn Analysis
10 pages
HHHHH
No ratings yet
HHHHH
8 pages
Camera Ready
No ratings yet
Camera Ready
5 pages
Statistics and Data Analysis for Nursing Research (2nd Edition ) 2nd Editionpdf download
100% (2)
Statistics and Data Analysis for Nursing Research (2nd Edition ) 2nd Editionpdf download
31 pages
Practical 7
No ratings yet
Practical 7
4 pages
output
No ratings yet
output
2 pages
Feature Select
No ratings yet
Feature Select
19 pages
Lab 06 Guass
No ratings yet
Lab 06 Guass
7 pages
Soledad enero
No ratings yet
Soledad enero
12 pages
PCA File
No ratings yet
PCA File
7 pages
Correlation: Import As Import As Import As Import As From Import From Import Import Matplotlib Import
No ratings yet
Correlation: Import As Import As Import As Import As From Import From Import Import Matplotlib Import
1 page
Win Loss Scenarios
No ratings yet
Win Loss Scenarios
18 pages
Cu Esta
No ratings yet
Cu Esta
2 pages
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Relationship of Sine Theta and Acceleration
No ratings yet
Relationship of Sine Theta and Acceleration
3 pages
DM LCD24064 468 Datasheet
No ratings yet
DM LCD24064 468 Datasheet
10 pages
Discussion Paper
No ratings yet
Discussion Paper
5 pages
Answers of Adbms
No ratings yet
Answers of Adbms
48 pages
Laboratory Exercise #1
No ratings yet
Laboratory Exercise #1
10 pages
DC Microgrid Protection Thesis
100% (3)
DC Microgrid Protection Thesis
6 pages
Meinstrich 1978
No ratings yet
Meinstrich 1978
15 pages
Fluid Mechanics Applied To Medicine Cardiac Flow Visualization Techniques
No ratings yet
Fluid Mechanics Applied To Medicine Cardiac Flow Visualization Techniques
97 pages
Auger Electron Spectros
No ratings yet
Auger Electron Spectros
8 pages
Standard Costing & Variance Analysis
No ratings yet
Standard Costing & Variance Analysis
43 pages
Rheem Air Handler RHSL
No ratings yet
Rheem Air Handler RHSL
20 pages
Grade 6 Planner Circle
100% (1)
Grade 6 Planner Circle
24 pages
Trouble Shooting Flow Chart
No ratings yet
Trouble Shooting Flow Chart
55 pages
Aircraft Construction, Repair and Modification: Mock-Exam
No ratings yet
Aircraft Construction, Repair and Modification: Mock-Exam
5 pages
Com - Upgadata.up7723 Logcat
No ratings yet
Com - Upgadata.up7723 Logcat
775 pages
808D Commissioning Manual 0713 en en-US
No ratings yet
808D Commissioning Manual 0713 en en-US
172 pages
Brkipm-3017 2
No ratings yet
Brkipm-3017 2
112 pages
Pre Formulation
0% (1)
Pre Formulation
53 pages
Wireless Energy Meter Reading Using RF Technology
100% (1)
Wireless Energy Meter Reading Using RF Technology
26 pages
AN818 Rev-20 PDF
No ratings yet
AN818 Rev-20 PDF
5 pages
Banking Record Management System Project
No ratings yet
Banking Record Management System Project
34 pages
2022 May Refresher V5 SOLUTION
No ratings yet
2022 May Refresher V5 SOLUTION
3 pages
Math 7 DLL
No ratings yet
Math 7 DLL
8 pages
Principles of Ee 1 Laboratory
No ratings yet
Principles of Ee 1 Laboratory
22 pages
The Geological Time Scale-2015
No ratings yet
The Geological Time Scale-2015
9 pages
McAlpine, Kenneth Et Al. - 'Making Music With Algorithms - A Case-Study System'
100% (1)
McAlpine, Kenneth Et Al. - 'Making Music With Algorithms - A Case-Study System'
12 pages
Betelgeuse Essay
No ratings yet
Betelgeuse Essay
4 pages

Pattern Recognition Assignment: Hari Narayan N.U B110490EE EEE A Batch

Uploaded by

Pattern Recognition Assignment: Hari Narayan N.U B110490EE EEE A Batch

Uploaded by

PATTERN RECOGNITION ASSIGNMENT

HARI NARAYAN N.U

=== Classifier model (full training set) ===

Naive Bayes Classifier

Time taken to build model: 0 seconds

=== Evaluation on training set ===

Correctly Classified Instances

Incorrectly Classified Instances

Mean absolute error

Root mean squared error

Relative absolute error

Root relative squared error

Total Number of Instances

=== Detailed Accuracy By Class ===

TP Rate FP Rate Precision Recall F-Measure ROC Area Class

Weighted Avg. 0.778

=== Confusion Matrix ===

=== Run information ===

=== Classifier model (full training set) ===

Naive Bayes Classifier

Time taken to build model: 0 seconds

=== Evaluation on test set ===

Correctly Classified Instances

Incorrectly Classified Instances

Mean absolute error

Root mean squared error

Relative absolute error

Root relative squared error

Total Number of Instances

=== Detailed Accuracy By Class ===

TP Rate FP Rate Precision Recall F-Measure ROC Area Class

Weighted Avg. 0.745

=== Confusion Matrix ===

NEAREST NEIGHBOUR CLASSIFICATION

=== Run information ===

=== Classifier model (full training set) ===

IB1 instance-based classifier

Time taken to build model: 0 seconds

=== Evaluation on test set ===

Correctly Classified Instances

Mean absolute error

Root mean squared error

Relative absolute error

Root relative squared error

Total Number of Instances

=== Detailed Accuracy By Class ===

TP Rate FP Rate Precision Recall F-Measure ROC Area Class

Weighted Avg. 0.784

=== Confusion Matrix ===

=== Run information ===

Scheme:weka.clusterers.SimpleKMeans -N 4 -A "weka.core.EuclideanDistance -R first-last" -I 500 -S 10

Class attribute: Survival_status

0 1 2 3 <-- assigned to cluster

Cluster 0 <-- No class

Time taken to build model (full training data) : 0.02 seconds

=== Model and evaluation on training set ===

59.618 43.8506 51.2949

Incorrectly clustered instances : 217.0

You might also like