A Comparative Study of Various Algorithms To Detect Clustering in Spatial Data
1. Introduction
3. Methodology
4. Result analysis
6. References
• Cluster analysis is the process of partitioning a set of data objects (or observations)
into clusters, such that objects in a cluster are similar to one another, yet dissimilar to
objects in other clusters.
• Different clustering methods may generate different clusterings on the same data set.
The partitioning is performed by the clustering algorithm. Hence, clustering is useful for
discovering previously unknown groups within the data.
• It is an important part of spatial data mining since it provides certain insights into the
distribution of data and characteristics of spatial clusters.
• Spatial data, also known as geospatial data or geographic information, is the data or
information that identifies the geographic location of features and boundaries on
earth, such as natural or constructed features, oceans, and more. Spatial data is
usually stored as coordinates and topology and is data that can be mapped.
Step I: Data Preprocessing
• We are considering the crime data on female rapes from the year 2013 to perform
clustering analysis.
• We are considering each district as an object, but the raw data we are working with
must have the same number of objects and the same object names in order to map
to the shapefile.
• We are going to divide the total number of cases by the total female population and
multiply by 10,000 to standardize the rates, as sketched below.
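A minimal pandas sketch of this standardization step, assuming hypothetical file and column names (district, total_cases, female_population); the actual 2013 data set will differ:

```python
import pandas as pd

# Hypothetical inputs: one row per district in each file.
crime = pd.read_csv("rape_cases_2013.csv")       # district, total_cases
pop = pd.read_csv("female_population.csv")       # district, female_population

# District names must match exactly so the result can later be joined to the shapefile.
df = crime.merge(pop, on="district", how="inner")

# Standardize: cases per 10,000 women in each district.
df["rate_per_10k"] = df["total_cases"] / df["female_population"] * 10000
```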
Jaccard Index:
It is a measure of similarity between two sets of data, J(X,Y) = |X ∩ Y| / |X ∪ Y|, with a
range from 0% to 100%. The higher the percentage, the more similar the two populations.
Jaccard Distance:
It is a measure of how dissimilar two sets are. It is the complement of the Jaccard index
and can be found by subtracting the Jaccard index from 100%:
D(X,Y) = 1 – J(X,Y)
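A small Python sketch of the two measures; the district names in the example are made up purely for illustration:

```python
def jaccard_index(a, b):
    """Jaccard similarity of two sets, expressed as a percentage."""
    a, b = set(a), set(b)
    if not a and not b:
        return 100.0
    return 100.0 * len(a & b) / len(a | b)

def jaccard_distance(a, b):
    """Jaccard distance: the complement of the Jaccard index."""
    return 100.0 - jaccard_index(a, b)

# Example: districts placed in a cluster by two different algorithms.
x = {"Pune", "Nagpur", "Nashik", "Thane"}
y = {"Pune", "Nagpur", "Aurangabad"}
print(jaccard_index(x, y))     # 2 common / 5 total = 40.0
print(jaccard_distance(x, y))  # 60.0
```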
Rand Index:
It is a measure of the similarity between two data clusterings. Given a set S of elements
and two clusterings X and Y of S, define:
• a, the number of pairs of elements in S that are in the same subset in X and in the
same subset in Y.
• b, the number of pairs of elements in S that are in different subsets in X and in
different subsets in Y.
• c, the number of pairs of elements in S that are in the same subset in X and in
different subsets in Y.
• d, the number of pairs of elements in S that are in different subsets in X and in
the same subset in Y.
a + b can be considered the number of agreements between X and Y, and c + d the
number of disagreements between X and Y. The Rand index is then
R = (a + b) / (a + b + c + d)
Since the denominator is the total number of pairs, the Rand index represents the
frequency of agreements over the total number of pairs, or the probability that X
and Y will agree on a randomly chosen pair.
Similarly, one can also view the Rand index as the percentage of correct decisions
made by the algorithm. It can be computed using the following formula:
R = (TP + TN) / (TP + TN + FP + FN)
where TP is the number of true positives, TN is the number of true negatives, FP is the
number of false positives, and FN is the number of false negatives.
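The pair-counting definition above translates directly into code. Below is a straightforward O(n²) sketch; if scikit-learn is available, a library routine such as sklearn.metrics.rand_score could be used instead:

```python
from itertools import combinations

def rand_index(labels_x, labels_y):
    """Rand index of two clusterings given as per-object label lists."""
    tp = tn = fp = fn = 0
    for i, j in combinations(range(len(labels_x)), 2):
        same_x = labels_x[i] == labels_x[j]
        same_y = labels_y[i] == labels_y[j]
        if same_x and same_y:
            tp += 1          # pair together in both clusterings (agreement)
        elif not same_x and not same_y:
            tn += 1          # pair apart in both clusterings (agreement)
        elif same_x and not same_y:
            fp += 1          # together in X, apart in Y (disagreement)
        else:
            fn += 1          # apart in X, together in Y (disagreement)
    return (tp + tn) / (tp + tn + fp + fn)

# Two clusterings of five objects:
print(rand_index([0, 0, 1, 1, 2], [0, 0, 1, 2, 2]))  # 0.8
```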
3. Divisive: This is a top-down approach: all observations start in one cluster, and splits are
performed recursively as one moves down the hierarchy.
4. The AGNES algorithm works by merging the data one by one on the basis of the nearest
distance among all the pairwise distances between the data points; after each merge, the
distances between the resulting clusters are recalculated (a minimal sketch using SciPy
follows).
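A minimal sketch of agglomerative (AGNES-style) clustering using SciPy's hierarchical clustering routines; the input rates are invented for illustration, and average linkage is only one possible choice of distance update:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Hypothetical input: one standardized rate per district (as a column vector).
rates = np.array([[1.2], [1.4], [0.3], [0.35], [2.8], [2.6]])

# Agglomerative clustering: every point starts as its own cluster and the two
# nearest clusters are merged repeatedly, recomputing distances after each merge.
Z = linkage(rates, method="average")   # 'single' or 'complete' linkage also possible

# Cut the resulting tree into a fixed number of clusters.
labels = fcluster(Z, t=3, criterion="maxclust")
print(labels)                          # three groups: low, mid and high rates
```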
1. Partition the data space into a grid and count the number of points that lie inside each
cell of the partition (a minimal sketch of this step appears after the list).
2. Identify the subspaces that contain clusters using the Apriori principle
3. Identify clusters
a. Determine dense units in all subspaces of interest
b. Determine connected dense units in all subspaces of interest.
4. Generate minimal description for the clusters
a. Determine maximal regions that cover a cluster of connected dense units for each
cluster
b. Determine the minimal cover for each cluster
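Below is a minimal sketch of the grid-partitioning and dense-cell identification idea behind steps 1 and 3a for 2-D point data; it is not a full implementation of the grid-based algorithm, since the Apriori-based subspace search and the minimal-description steps are omitted, and the bin count and density threshold are arbitrary choices:

```python
import numpy as np

def dense_cells(points, n_bins=10, threshold=5):
    """Partition 2-D space into an n_bins x n_bins grid and return the cells
    that hold more than `threshold` points (the dense cells)."""
    mins, maxs = points.min(axis=0), points.max(axis=0)
    # Map every point to the index of the grid cell it falls into.
    cells = np.floor((points - mins) / (maxs - mins + 1e-12) * n_bins).astype(int)
    cells = np.minimum(cells, n_bins - 1)          # keep boundary points in range
    ids, counts = np.unique(cells, axis=0, return_counts=True)
    return ids[counts > threshold]

# Hypothetical spatial points (e.g. longitude/latitude of incidents).
pts = np.random.RandomState(0).rand(500, 2)
print(dense_cells(pts, n_bins=10, threshold=8))
```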
Moran's I is computed as
I = (N / Σi Σj Wij) × [Σi Σj Wij (Xi − X̄)(Xj − X̄)] / [Σi (Xi − X̄)²]
where
N is the number of cases,
Xi is the value of a variable at a particular location,
Xj is the value of the same variable at another location (where i ≠ j),
X̄ is the mean of the variable, and
Wij is a weight applied to the comparison between location i and location j; here
Wij = 1/dij, the inverse of the distance between locations i and j.
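A minimal NumPy sketch of global Moran's I with the inverse-distance weights defined above; the coordinates and rates in the example are invented for illustration:

```python
import numpy as np

def morans_i(x, coords):
    """Global Moran's I with inverse-distance weights W_ij = 1/d_ij."""
    x = np.asarray(x, dtype=float)
    coords = np.asarray(coords, dtype=float)
    n = len(x)
    d = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    w = np.zeros((n, n))
    off = ~np.eye(n, dtype=bool)
    w[off] = 1.0 / d[off]                  # W_ij = 1/d_ij, with W_ii = 0
    z = x - x.mean()                       # deviations from the mean
    num = (w * np.outer(z, z)).sum()
    return (n / w.sum()) * num / (z ** 2).sum()

# Hypothetical district centroids and standardized rates.
coords = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [5.0, 5.0]]
rates = [1.2, 1.3, 1.1, 0.2]
print(morans_i(rates, coords))             # positive value -> spatial clustering
```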
2  K-medoids  23  68.3
3  AGNES      29  69.8
The AGNES algorithm fixes the membership of a data object once it has been allocated to a
cluster.
To increase the efficiency of clustering, grid-based clustering methods approximate the
dense regions of the clustering space by quantizing it into a finite number of cells and
identifying cells that contain more than a threshold number of points as dense. A grid-based
approach is usually more efficient than a density-based approach.
The problem with LISA is that it requires an event frequency associated with each data
point, so it is not suitable for point data where each crime is reported individually and the
count at every data point is therefore one. From the research papers we reviewed, we
concluded that fuzzy clustering is the best choice when dealing with point data; it also
shows decent results on aggregated data.
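As a rough illustration of the fuzzy approach on point data, below is a minimal NumPy sketch of fuzzy c-means, not tied to any particular library; the number of clusters, the fuzzifier m, and the random points are all illustrative choices:

```python
import numpy as np

def fuzzy_cmeans(X, c=3, m=2.0, n_iter=100, seed=0):
    """Minimal fuzzy c-means: returns cluster centres and the membership matrix u,
    where u[i, k] is the degree to which point i belongs to cluster k."""
    rng = np.random.RandomState(seed)
    u = rng.dirichlet(np.ones(c), size=len(X))         # random fuzzy memberships
    for _ in range(n_iter):
        um = u ** m
        centres = um.T @ X / um.sum(axis=0)[:, None]    # membership-weighted means
        d = np.linalg.norm(X[:, None, :] - centres[None, :, :], axis=-1) + 1e-12
        u = 1.0 / (d ** (2 / (m - 1)))                  # update memberships
        u /= u.sum(axis=1, keepdims=True)               # normalise each row to 1
    return centres, u

# Hypothetical individual crime locations (point data, one row per incident).
pts = np.random.RandomState(1).rand(200, 2)
centres, u = fuzzy_cmeans(pts, c=3)
hard_labels = u.argmax(axis=1)                          # hard assignment if needed
```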