Improvised Method of FAST Clustering Based Feature Selection Technique Algorithm For High Dimensional Data
ABSTRACT
High dimensional data consists of thousands of attributes or features and is now common in scientific and research applications. Because so many features are present in the data, we need to select those that are non-redundant and most relevant in order to reduce dimensionality and runtime and to improve the accuracy of the results. In this paper we propose the FAST feature subset selection algorithm together with an improved version of it, and we evaluate the efficiency and accuracy of the results through an empirical study. The presented clustering-based feature subset selection algorithm for high dimensional data involves (i) removing irrelevant features, (ii) constructing a minimum spanning tree from the relevant ones, and (iii) partitioning the MST and selecting representative features. In the proposed algorithm, a cluster consists of features; each cluster is treated as a single feature, so dimensionality is greatly reduced. The proposed system implements the FAST algorithm using the Dice coefficient measure to remove irrelevant and redundant features.
Keywords: FAST, Feature Subset Selection, Graph Based Clustering, Minimum Spanning Tree.
1. INTRODUCTION
With the goal of choosing a subset of good features with respect to the target classes, feature subset selection is a proper way of reducing dimensionality, removing irrelevant data, increasing learning accuracy, and improving result comprehensibility [01][02]. There are basically four methods for feature selection, i.e., wrapper, filter, embedded, and hybrid. Among the filter feature selection methods, the application of cluster analysis has been demonstrated to be more effective than traditional feature selection algorithms [04]. The filter approach uses intrinsic properties of the data for feature selection. It is an unsupervised feature selection approach, performing the selection without using induction algorithms, as shown in the figure. This method is used for the transformation of the variable space, which is required so that all the features can be collated and computed before dimension reduction is achieved [05][06]. In particular, we adopt minimum spanning tree based clustering algorithms, because they do not assume that data points are clustered around centers or separated by a regular geometric curve, and they have been used extensively in practice [02]. In cluster analysis, graph-theoretic methods have been well studied and used in many applications. The features selected from the generated clusters are those that remove the redundant and irrelevant attributes [03][13]; this method is used for selecting the interesting features from the clusters. Clustering is an unsupervised learning problem, which tries to group a set of points into clusters such that points in the same cluster are more similar to each other than to points in different clusters, under a particular similarity metric [11][12]. Feature subset selection can be viewed as the process of identifying and removing as many irrelevant and redundant features as possible. This is because (1) irrelevant features do not contribute to predictive accuracy, and (2) redundant features do not help in building a better predictor, since they mostly provide information that is already present in other feature(s) [07][09][10].
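As a small illustration of the filter idea described above, the following Python sketch removes features whose relevance to the target class falls below a threshold. The mutual information score and the threshold value are illustrative assumptions only; they are not the measures adopted later in this paper.

import numpy as np
from sklearn.feature_selection import mutual_info_classif

def remove_irrelevant(X, y, threshold=0.01):
    """Keep only features whose relevance to the target exceeds a threshold.

    Mutual information is used here purely as an illustrative relevance
    score; the threshold value is an arbitrary assumption.
    """
    relevance = mutual_info_classif(X, y)      # relevance of each feature to the class
    keep = np.where(relevance > threshold)[0]  # indices of features deemed relevant
    return X[:, keep], keep

# Example on a toy data set with 100 samples and 50 features
X = np.random.rand(100, 50)
y = np.random.randint(0, 2, size=100)
X_relevant, kept = remove_irrelevant(X, y)
print(len(kept), "features kept out of", X.shape[1])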
Using this method we can obtain high-quality feature attributes. In this paper we focus on using the best similarity measure to calculate the relevance between features, and we compare the results with traditional feature selection algorithms such as CFS and FAST. The goal of this paper is to use the best algorithm, i.e., the improved FAST, for feature subset selection so that effective accuracy is obtained [08].
2. LITERATURE SURVEY
In [02], Qinbao Song et al. proposed a new FAST algorithm that achieves higher accuracy and lower time complexity than traditional feature selection algorithms such as FCBF, Relief, CFS, FOCUS-SF, and Consist, and they also compared the classification accuracy obtained with prominent classifiers. Graph-theoretic clustering and an MST based approach are used to ensure the efficiency of the feature selection process. Classifiers play a vital role in feature selection, since the accuracy of the selected features is measured using classifiers [06]. The following classifiers are utilized to classify the data sets [2], [3]. Naive Bayes works under Bayes' theorem, is based on a probabilistic approach, and still offers first-rate classification output. C4.5 is the successor of ID3 [4] and supports the decision tree induction method; gain ratio, Gini index, and information gain are the measures used for attribute selection. The simplest algorithm is IB1 (instance based) [5], which performs classification based on distance vectors. RIPPER [6] is a rule based technique that builds a set of rules for classifying the data sets. The classifier is one of the evaluation parameters for measuring the accuracy of the process.
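To make the role of these classifiers concrete, the sketch below cross-validates a candidate feature subset with scikit-learn stand-ins for the classifiers named above. This is an assumed evaluation harness, not the exact experimental setup of [02]: a decision tree is used in place of C4.5, 1-nearest-neighbour in place of IB1, and RIPPER has no direct scikit-learn equivalent.

from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier

def evaluate_subset(X_selected, y):
    """Report 10-fold cross-validated accuracy of a feature subset for the
    classifiers used as evaluation parameters in the survey."""
    classifiers = {
        "Naive Bayes": GaussianNB(),
        "C4.5-like decision tree": DecisionTreeClassifier(),
        "IB1 (1-NN)": KNeighborsClassifier(n_neighbors=1),
    }
    for name, clf in classifiers.items():
        scores = cross_val_score(clf, X_selected, y, cv=10)
        print(f"{name}: mean accuracy {scores.mean():.3f}")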
Author: Kononenko
Description: Relief is ineffective at removing redundant features, as two predictive but highly correlated features are likely both to be highly weighted. Relief was extended to work with noisy and incomplete data sets and to deal with multi-class problems [2].

Author: Yu L. and Liu H.
Description: FCBF is a fast filter method which can identify relevant features as well as redundancy among relevant features without pairwise correlation analysis [01].

Author: Fleuret F.
Description: CMIM iteratively picks features which maximize their mutual information with the class to predict, conditionally on the response of any feature already picked [14].

Author: Krier C. and Francois D.
Description: This paper presented a methodology combining hierarchical constrained clustering of spectral variables and selection of clusters by mutual information [15].

Author: Qinbao Song et al.
Description: The FAST algorithm works in two steps. In the first step, features are divided into clusters by using graph-theoretic clustering methods. In the second step, the most representative feature that is strongly related to the target classes is selected from each cluster to form the final subset of features. Features in different clusters are relatively independent, so the clustering-based strategy of FAST has a high probability of producing a subset of useful and independent features [02].
3. PROPOSED SYSTEM
In the proposed system, i.e., the improved FAST, we use the relevance between features and the relevance between each feature and the target concept. We use the Dice coefficient to calculate the relevance between features, and we extract the best representative features from each cluster using the relevance between features and the relevance between each feature and the target class. Many existing feature selection techniques aim at reducing unnecessary features to shrink the dataset, but some of them fail to remove redundant features after removing the irrelevant ones. The proposed system focuses on removing both irrelevant and redundant features. The features are first divided into clusters, and from each cluster the features that are most representative of the target class are selected. The system uses the minimum spanning tree (MST) method, with which we propose a fast clustering-based feature selection algorithm (FAST). The proposed system is an implementation of the FAST algorithm using the Dice coefficient measure to remove irrelevant and redundant features.
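The paper does not spell out how the Dice coefficient is applied to feature vectors, so the sketch below assumes features have first been binarized (for example, by thresholding each feature at its median). The formula itself is the standard Dice coefficient 2|A ∩ B| / (|A| + |B|).

import numpy as np

def dice_coefficient(a, b):
    """Dice coefficient between two binary feature vectors: 2|A ∩ B| / (|A| + |B|).
    Returns a value in [0, 1], with 1 meaning identical supports."""
    a = np.asarray(a, dtype=bool)
    b = np.asarray(b, dtype=bool)
    denom = a.sum() + b.sum()
    if denom == 0:
        return 0.0
    return 2.0 * np.logical_and(a, b).sum() / denom

# Illustrative pair of binarized features (value above its median -> 1)
f1 = np.array([1, 0, 1, 1, 0, 1])
f2 = np.array([1, 0, 0, 1, 0, 1])
print(dice_coefficient(f1, f2))   # 2*3 / (4+3) ≈ 0.857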
Feature selection is also useful as part of the data analysis process, as it shows which features are important for prediction and how these features are related. Clustering is mainly used to group the data that are similar to the user's search. Data that are irrelevant can easily be eliminated, and redundant features inside the datasets are also removed, so that the clustering finally produces the selected data. The clustering uses an MST for selecting the related data and, finally, the relevant data.
A minimum spanning tree (MST), or minimum weight spanning tree, is a spanning tree with weight less than or equal to the weight of every other spanning tree. More generally, any undirected graph (not necessarily connected) has a minimum spanning forest, which is a union of minimum spanning trees for its connected components. High dimensional clustering is the cluster analysis of data with anywhere from a few dozen to many thousands of dimensions. Such high-dimensional data spaces are often encountered in areas such as medicine, where DNA microarray technology can produce a large number of measurements at once, and the clustering of text documents, where, if a word frequency vector is used, the number of dimensions equals the size of the dictionary.
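The following sketch ties these pieces together: it builds a feature graph, computes its minimum spanning tree with SciPy, cuts long edges to obtain feature clusters, and keeps one representative feature per cluster. The dissimilarity measure (1 - |Pearson correlation|), the edge threshold, and the representative criterion are all illustrative assumptions rather than the exact definitions of the proposed improved FAST.

import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree, connected_components

def fast_like_selection(X, y, edge_threshold=0.7):
    """Sketch of an MST-based, FAST-style selection:
    (i) build a feature graph weighted by dissimilarity,
    (ii) compute its minimum spanning tree,
    (iii) cut long MST edges to form feature clusters, and
    (iv) keep the feature most correlated with the target from each cluster."""
    n_features = X.shape[1]
    corr = np.abs(np.corrcoef(X, rowvar=False))   # feature-to-feature similarity
    dissimilarity = 1.0 - corr                    # MST edge weights (assumed measure)
    mst = minimum_spanning_tree(dissimilarity).toarray()
    mst[mst > edge_threshold] = 0                 # partition the MST: drop weak links
    n_clusters, labels = connected_components(mst, directed=False)

    # Relevance of each feature to the target (assumed: absolute correlation with y)
    relevance = np.abs([np.corrcoef(X[:, j], y)[0, 1] for j in range(n_features)])
    selected = [int(np.argmax(np.where(labels == c, relevance, -1)))
                for c in range(n_clusters)]       # one representative per cluster
    return selected

The returned indices identify one representative feature per cluster and can then be fed to the classifier evaluation harness sketched earlier to estimate accuracy.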
4. RESULTS
The performance of CFS, FAST, and the improved FAST algorithm was compared on four data sets in terms of the number of selected features, the runtime, and the classification accuracy, as summarised in the following tables.

Number of selected features
Algorithm        Dataset 1   Dataset 2   Dataset 3   Dataset 4
CFS              50          40          28          41
FAST             29          30          8           11
Improved FAST    19          25          6           10
Runtime (ms)
Algorithm        Dataset 1   Dataset 2   Dataset 3   Dataset 4
CFS              17382       13236       40486       44486
FAST             4416        3456        2418        2918
Improved FAST    3775        2800        1846        2246
Classification accuracy (%)
Algorithm        Dataset 1   Dataset 2   Dataset 3   Dataset 4
CFS              99.14       92.34       94.71       98.71
FAST             99.75       94.45       96.65       99.65
Improved FAST    99.81       99.10       98.69       99.69
5. CONCLUSION
We have presented an improved clustering-based feature subset selection algorithm for high dimensional data. The algorithm involves (i) deleting irrelevant features, (ii) developing a minimum spanning tree from the relevant ones, and (iii) dividing the MST and selecting representative features. In the proposed algorithm, a cluster consists of features; each cluster is treated as a single feature, so dimensionality is greatly reduced. The performance of the proposed algorithm is analysed against feature selection algorithms such as CFS and FAST on different datasets. The proposed algorithm obtained the best proportion of selected features, the best runtime, and the best classification accuracy for Naive Bayes, C4.5, and RIPPER, and the second best classification accuracy for IB1. FAST is the best algorithm among the available algorithms for all kinds of datasets, and its efficiency can be increased further by using different similarity measures such as the Dice coefficient.
REFERENCES
[1] Avinash Godase and Poonam Gupta, A Survey on Clustering Based Feature Selection Technique Algorithm for High Dimensional Data, International Journal of Emerging Trends & Technology in Computer Science, Volume 4, Issue 1, January-February 2015, ISSN 2278-6856.
[2] Qinbao Song et al., A Fast Clustering-Based Feature Subset Selection Algorithm for High Dimensional Data, IEEE Transactions on Knowledge and Data Engineering, Vol. 25, No. 1, 2013.
[3] Kira K. and Rendell L.A., The feature selection problem: Traditional methods and a new algorithm, In Proceedings of the Tenth National Conference on Artificial Intelligence, pp. 129-134, 1992.
[4] Yu L. and Liu H., Feature selection for high-dimensional data: a fast correlation-based filter solution, In Proceedings of the 20th International Conference on Machine Learning, 20(2), pp. 856-863, 2003.
[5] Butterworth R., Piatetsky-Shapiro G. and Simovici D.A., On Feature Selection through Clustering, In Proceedings of the Fifth IEEE International Conference on Data Mining, pp. 581-584, 2005.
[6] Yu L. and Liu H., Efficient feature selection via analysis of relevance and redundancy, Journal of Machine Learning Research, 5, pp. 1205-1224, 2004.
[7] Van Dijk G. and Van Hulle M.M., Speeding Up the Wrapper Feature Subset Selection in Regression by Mutual Information Relevance and Redundancy Analysis, International Conference on Artificial Neural Networks, 2006.
[8] Krier C., Francois D., Rossi F. and Verleysen M., Feature clustering and mutual information for the selection of variables in spectral data, In Proceedings of the European Symposium on Artificial Neural Networks Advances in Computational Intelligence and Learning, pp. 157-162, 2007.
[9] Zheng Zhao and Huan Liu, Searching for Interacting Features, In Proceedings of IJCAI-07, 2007.
[10] Soucy P. and Mineau G.W., A simple feature selection method for text classification, In Proceedings of IJCAI-01, Seattle, WA, pp. 897-903, 2001.
[11] Kohavi R. and John G.H., Wrappers for feature subset selection, Artificial Intelligence, 97(1-2), pp. 273-324, 1997.
[12] Chanda P., Cho Y., Zhang A. and Ramanathan M., Mining of Attribute Interactions Using Information Theoretic Metrics, In Proceedings of the IEEE International Conference on Data Mining Workshops, pp. 350-355, 2009.
[13] Forman G., An extensive empirical study of feature selection metrics for text classification, Journal of Machine Learning Research, 3, pp. 1289-1305, 2003.
[14] Fleuret F., Fast binary feature selection with conditional mutual information, Journal of Machine Learning Research, 5, pp. 1531-1555, 2004.
[15] Krier C., Francois D., Rossi F. and Verleysen M., Feature Clustering and Mutual Information for the Selection of Variables in Spectral Data, Proc. European Symposium on Artificial Neural Networks Advances in Computational Intelligence and Learning, pp. 157-162, 2007.