Pattern Recognition Assignment: Hari Narayan N.U B110490EE EEE A Batch
Pattern Recognition Assignment: Hari Narayan N.U B110490EE EEE A Batch
Submitted by
B110490EE
EEE A batch
TRAINING DATA
=== Run information ===
Scheme:weka.classifiers.bayes.NaiveBayes
Relation:
haberman-weka.filters.unsupervised.instance.RemovePercentage-P50.0-V
Instances: 153
Attributes: 4
Age_of_patient_at_time_of_operation
Patients_year_of_operation
Number_of_positive_axillary_nodes_detected
Survival_status
Test mode:evaluate on training data
Class
Attribute
(0.74) (0.26)
=============================================================
Age_of_patient_at_time_of_operation
mean
43.0259 45.3968
std. dev.
6.1537 4.7468
weight sum
precision
114
39
1.0476 1.0476
Patients_year_of_operation
58
12.0
6.0
59
12.0
7.0
60
17.0
2.0
61
13.0
1.0
62
10.0
4.0
63
15.0
5.0
64
11.0
7.0
65
10.0
4.0
66
10.0
5.0
67
9.0
4.0
68
3.0
1.0
69
4.0
5.0
[total]
126.0 51.0
Number_of_positive_axillary_nodes_detected
mean
2.7161 7.3333
std. dev.
4.8927 10.2232
weight sum
precision
114
39
2.3636 2.3636
119
77.7778 %
34
22.2222 %
0.2817
0.2806
0.403
73.5784 %
92.4707 %
153
0.718
0.794
0.947
0.864
0.791 1
0.282
0.053
0.647
0.282
0.393
0.791 2
0.548
a b <-- classified as
108 6 | a = 1
28 11 | b = 2
0.757
0.778
0.744
0.791
TEST DATA
Scheme:weka.classifiers.bayes.NaiveBayes
Relation:
haberman-weka.filters.unsupervised.instance.RemovePercentage-P50.0-V
Instances: 153
Attributes: 4
Age_of_patient_at_time_of_operation
Patients_year_of_operation
Number_of_positive_axillary_nodes_detected
Survival_status
Test mode:user supplied test set: size unknown (reading incrementally)
Class
Attribute
(0.74) (0.26)
=============================================================
Age_of_patient_at_time_of_operation
mean
43.0259 45.3968
std. dev.
6.1537 4.7468
weight sum
114
39
precision
1.0476 1.0476
Patients_year_of_operation
58
12.0
6.0
59
12.0
7.0
60
17.0
2.0
61
13.0
1.0
62
10.0
4.0
63
15.0
5.0
64
11.0
7.0
65
10.0
4.0
66
10.0
5.0
67
9.0
4.0
68
3.0
1.0
69
4.0
5.0
[total]
126.0 51.0
Number_of_positive_axillary_nodes_detected
mean
2.7161 7.3333
std. dev.
4.8927 10.2232
weight sum
precision
114
2.3636 2.3636
39
114
74.5098 %
39
25.4902 %
0.2148
0.306
0.4831
78.2813 %
108.1765 %
153
0.762
0.765
0.937
0.842
0.591 1
0.238
0.063
0.588
0.238
0.339
0.591 2
0.57
a b <-- classified as
104 7 | a = 1
32 10 | b = 2
0.716
0.745
0.704
0.591
Scheme:weka.classifiers.lazy.IBk -K 7 -W 0 -A "weka.core.neighboursearch.LinearNNSearch -A
\"weka.core.EuclideanDistance -R first-last\""
Relation:
haberman
Instances: 306
Attributes: 4
Age_of_patient_at_time_of_operation
Patients_year_of_operation
Number_of_positive_axillary_nodes_detected
Survival_status
Test mode:user supplied test set: size unknown (reading incrementally)
120
78.4314 %
33
21.5686 %
0.3589
0.2999
0.3919
75.9937 %
87.808 %
153
0.643
0.795
0.946
0.864
0.801 1
0.357
0.054
0.714
0.357
0.476
0.801 2
0.481
a b <-- classified as
105 6 | a = 1
27 15 | b = 2
0.773
0.784
0.758
0.801
K MEAN CLUSTERING
haberman
Instances: 306
Attributes: 4
Age_of_patient_at_time_of_operation
Patients_year_of_operation
Number_of_positive_axillary_nodes_detected
Ignored:
Survival_status
Test mode:Classes to clusters evaluation on training data
=== Model and evaluation on training set ===
kMeans
======
Number of iterations: 6
Within cluster sum of squared errors: 197.29360453534517
Missing values globally replaced with mean/mode
Cluster centroids:
Cluster#
Attribute
Full Data
(306)
(52)
0
(89)
2
(87)
3
(78)
=====================================================================================
==============
Age_of_patient_at_time_of_operation
Patients_year_of_operation
52.4575 56.3462
58
Number_of_positive_axillary_nodes_detected
67
58
Clustered Instances
52 ( 17%)
89 ( 29%)
87 ( 28%)
78 ( 25%)
63
4.0261 10.1731
4.1494
1.7564
Cluster 1 <-- 2
Cluster 2 <-- 1
Cluster 3 <-- No class
70.915 %
Sample Clusters
2.PROBLEM 1
2.PROBLEM 2