0% found this document useful (0 votes)

204 views16 pages

Data Mining With Clustering AND Classification

This document discusses data mining techniques including clustering and classification. Clustering is an unsupervised learning technique that organizes data into groups of similar objects. Major clustering methods include distance-based, hierarchical, and partitioning. Classification is a supervised learning technique that predicts categorical class labels. It involves constructing a model from a training set and using it to classify new data. Major classification techniques discussed include decision trees, Bayesian classification, and association rule mining.

Uploaded by

Amanjyot Singh Oberoi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

204 views16 pages

Data Mining With Clustering AND Classification

Uploaded by

Amanjyot Singh Oberoi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 16

DATA MINING WITH

CLUSTERING
AND
CLASSIFICATION
DATA MINING
Data Mining is the process of discovering new
correlations, patterns, and trends by digging into
(mining) large amounts of data stored in warehouses,
using artificial intelligence, statistical and
mathematical techniques.
It is currently used in a wide range of profiling
practices, such as marketing ,fraud detection, and
scientific discovery.
From a managerial perspective:

Analyzing trends
Wealth generation

Security

Strategic decision making

MODELS OF DATA MINING
Predictive Model: Predictive models can be used to
forecast explicit values, based on patterns determined
from known results. For example, from a database of
customers who have already responded to a particular
offer, a model can be built that predicts which prospects
are likeliest to respond to the same offer.

Predictive data mining is further categorized into:

Classification
Regression
CONT…
Descriptive Model: Descriptive models describe
patterns in existing data, and are generally used to
create meaningful subgroups such as demographic
clusters. They are generally used to create meaningful
subgroups.

Descriptive data mining is further classified into

Clustering
Association
Sequential analysis.
CLUSTERING
• Clustering can be considered the most important
unsupervised learning technique; so, as every other
problem of this kind, it deals with finding a structure
in a collection of unlabeled data.

• Clustering is “the process of organizing objects into

groups whose members are similar in some way”.

• A cluster is therefore a collection of objects which

are “similar” between them and are “dissimilar” to
the objects belonging to other clusters.
CONT…
Where to use clustering?
Data mining
Information retrieval
text mining
Web analysis
marketing
medical diagnostic
Major clustering methods
Distance-based
Hierarchical
Partitioning
Probabilistic
CLASSIFICATION
predicts categorical class labels
classifies data (constructs a model) based on the
training set and the values (class labels) in a classifying
attribute and uses it in classifying new data
Classification—A Two-Step Process
Model construction: describing a set of predetermined classes
 Each tuple is assumed to belong to a predefined class, as determined
by the class label attribute (supervised learning)
 The set of tuples used for model construction: training set
 The model is represented as classification rules, decision trees, or
mathematical formulae
Model usage: for classifying previously unseen objects
 Estimate accuracy of the model using a test set
 The known label of test sample is compared with the classified
result from the model
 Accuracy rate is the percentage of test set samples that are correctly
classified by the model
 Test set is independent of training set, otherwise over-fitting will
occur
Classification Process: Model
Construction
Classification
Algorithms
Training
Data

NAME RANK YEARS TENURED Classifier

(Model)
Mike Assistant Prof 3 no
Mary Assistant Prof 7 yes
Bill Professor 2 yes
Jim Associate Prof 7 yes
IF rank = ‘professor’
Dave Assistant Prof 6 no OR years > 6
Anne Associate Prof 3 no THEN tenured = ‘yes’
Classification Process: Model
usage in Prediction

Classifier

Testing
Data Unseen Data

(Jeff, Professor, 4)

NAME RANK YEARS TENURED

Tom Assistant Prof 2 no Tenured?
Merlisa Associate Prof 7 no
George Professor 5 yes
Joseph Assistant Prof 7 yes
Classification Techniques

Classification by Decision Tree

Bayesian Classification
Classification by Backpropogation
Classification based on Association Rule Mining
Classification vs Clustering
Supervised learning (classification)
Supervision: The training data (observations,
measurements, etc.) are accompanied by labels
indicating the class of the observations
New data is classified based on the training set

Unsupervised learning (clustering)

The class labels of training data is unknown
Given a set of measurements, observations, etc. the
aim is to establish the existence of classes or clusters in
the data

Esolutions Manual - Powered by Cognero
No ratings yet
Esolutions Manual - Powered by Cognero
23 pages
Chapter 1 Shafriz
No ratings yet
Chapter 1 Shafriz
27 pages
OBGYN and Infertility Handbook For Clinicians
100% (3)
OBGYN and Infertility Handbook For Clinicians
237 pages
Lecture 3.1.1
No ratings yet
Lecture 3.1.1
17 pages
DWM Unit 3 Final Notes
No ratings yet
DWM Unit 3 Final Notes
47 pages
Data Warehouse and Mining Notes
No ratings yet
Data Warehouse and Mining Notes
12 pages
Lecture 3 Data Mining
No ratings yet
Lecture 3 Data Mining
36 pages
Unit Iii Classification
No ratings yet
Unit Iii Classification
57 pages
4 Datamining
No ratings yet
4 Datamining
90 pages
DM Chapter 4
No ratings yet
DM Chapter 4
47 pages
Classification (Part II)
No ratings yet
Classification (Part II)
162 pages
DAMI 011114a
No ratings yet
DAMI 011114a
48 pages
BI-Unit-3-Part-1-PPT.ppt
No ratings yet
BI-Unit-3-Part-1-PPT.ppt
51 pages
classification basic concept.data mining
No ratings yet
classification basic concept.data mining
20 pages
Lect 1
No ratings yet
Lect 1
38 pages
Discovering Knowledge in Data: Lecture Review of
No ratings yet
Discovering Knowledge in Data: Lecture Review of
20 pages
Classification in Data Mining
No ratings yet
Classification in Data Mining
60 pages
Classify Clustering
No ratings yet
Classify Clustering
31 pages
CT075!3!2-DTM-Topic 8 - Introduction To Data Mining
No ratings yet
CT075!3!2-DTM-Topic 8 - Introduction To Data Mining
32 pages
Chapter-V CLASSIFICATION & CLUSTERING
No ratings yet
Chapter-V CLASSIFICATION & CLUSTERING
153 pages
Classvac
No ratings yet
Classvac
14 pages
Data Mining Technique Using Weka Tool
No ratings yet
Data Mining Technique Using Weka Tool
21 pages
4 - Data Analytics Using DM and ML Algorithms - 1
No ratings yet
4 - Data Analytics Using DM and ML Algorithms - 1
71 pages
Wk. 1. Introduction [08.10.2020]
No ratings yet
Wk. 1. Introduction [08.10.2020]
30 pages
Classification in Data Mining
No ratings yet
Classification in Data Mining
14 pages
Classification in Data Mining 12
No ratings yet
Classification in Data Mining 12
7 pages
3 DM Classification (2)
No ratings yet
3 DM Classification (2)
62 pages
Data Mining
No ratings yet
Data Mining
30 pages
Module 3_classification
No ratings yet
Module 3_classification
9 pages
Chapter 4 Classification
No ratings yet
Chapter 4 Classification
78 pages
An Introduction To Data Mining: Prof. S. Sudarshan CSE Dept, IIT Bombay
No ratings yet
An Introduction To Data Mining: Prof. S. Sudarshan CSE Dept, IIT Bombay
47 pages
Data Mining Implementation
No ratings yet
Data Mining Implementation
9 pages
Slides Courtesy: Ling Chen [email protected]
No ratings yet
Slides Courtesy: Ling Chen [email protected]
42 pages
Data Mining: Prof Jyotiranjan Hota
No ratings yet
Data Mining: Prof Jyotiranjan Hota
17 pages
1.1 Project Overview: Data Mining
No ratings yet
1.1 Project Overview: Data Mining
74 pages
Data Mining Course Overview
No ratings yet
Data Mining Course Overview
38 pages
Data Mining
No ratings yet
Data Mining
33 pages
Understanding Data Mining
No ratings yet
Understanding Data Mining
21 pages
DSA Presentation Group 6
No ratings yet
DSA Presentation Group 6
34 pages
DATA MINING JNTUH CSE R18
No ratings yet
DATA MINING JNTUH CSE R18
20 pages
Classification
No ratings yet
Classification
50 pages
3 Data Mining
No ratings yet
3 Data Mining
58 pages
Clustering Agglo Devisive DBSCAN
No ratings yet
Clustering Agglo Devisive DBSCAN
78 pages
Evaluation_of_Student_Academic_Performan
No ratings yet
Evaluation_of_Student_Academic_Performan
7 pages
3 DM
No ratings yet
3 DM
36 pages
Article 6
No ratings yet
Article 6
6 pages
CEC453 Machine Learning
No ratings yet
CEC453 Machine Learning
168 pages
3 DM Classification
No ratings yet
3 DM Classification
55 pages
Data Mining: Concepts and Techniques: - Chapter 6
No ratings yet
Data Mining: Concepts and Techniques: - Chapter 6
115 pages
A Thorough Investigation On The Clustering and Classification Techniques in Various Applications
No ratings yet
A Thorough Investigation On The Clustering and Classification Techniques in Various Applications
4 pages
Classification Unit3
No ratings yet
Classification Unit3
15 pages
DM Unit-3
No ratings yet
DM Unit-3
46 pages
Classification and Prediction
No ratings yet
Classification and Prediction
126 pages
Data Mining
No ratings yet
Data Mining
23 pages
NI
No ratings yet
NI
10 pages
L1 Intro
No ratings yet
L1 Intro
32 pages
Bia Unit-3 Part-2
No ratings yet
Bia Unit-3 Part-2
43 pages
overview_basics
No ratings yet
overview_basics
16 pages
Data Mining: July 18, 2019 1
No ratings yet
Data Mining: July 18, 2019 1
41 pages
Lecture Notes 1.1 & 1.2
No ratings yet
Lecture Notes 1.1 & 1.2
8 pages
Basic Concept of Classification (Data Mining)
No ratings yet
Basic Concept of Classification (Data Mining)
11 pages
Core Concepts in Statistical Learning
From Everand
Core Concepts in Statistical Learning
Tushar Gulati
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
119.+Susyam+Widiantho(733-740)
No ratings yet
119.+Susyam+Widiantho(733-740)
8 pages
Cambridge Primary End of Series Report May 2021 - tcm142-627850
No ratings yet
Cambridge Primary End of Series Report May 2021 - tcm142-627850
80 pages
Final-Pre-and-early-Harappan-sites-2
No ratings yet
Final-Pre-and-early-Harappan-sites-2
22 pages
Edelman Dissertation
No ratings yet
Edelman Dissertation
215 pages
Interior Photography Portfolio - (Low Quality)
No ratings yet
Interior Photography Portfolio - (Low Quality)
19 pages
Dasheen Bush Guide
No ratings yet
Dasheen Bush Guide
7 pages
David Cooper - The Language of Madness.-Pelican Books (1980)
No ratings yet
David Cooper - The Language of Madness.-Pelican Books (1980)
177 pages
Sentence Structure
No ratings yet
Sentence Structure
9 pages
Irony Worksheet 4
No ratings yet
Irony Worksheet 4
2 pages
Caring For Your Skin After A Skin Graft
No ratings yet
Caring For Your Skin After A Skin Graft
8 pages
Read The Story and Answer The Questions Below
No ratings yet
Read The Story and Answer The Questions Below
1 page
The Philippine Constitutions
No ratings yet
The Philippine Constitutions
2 pages
Few Liners Juris - Persons and Family Relations
No ratings yet
Few Liners Juris - Persons and Family Relations
2 pages
Synthetic Aperture Radar: Presented By: LT CDR Abhinaw Kumar Guide: Prof Kushal Tuckley
100% (1)
Synthetic Aperture Radar: Presented By: LT CDR Abhinaw Kumar Guide: Prof Kushal Tuckley
26 pages
Scenario From Below v2
No ratings yet
Scenario From Below v2
4 pages
Self-Assembly of Block Copolymers: Chemical Society Reviews July 2012
No ratings yet
Self-Assembly of Block Copolymers: Chemical Society Reviews July 2012
19 pages
University of Minnesota Press Cultural Critique
No ratings yet
University of Minnesota Press Cultural Critique
35 pages
Aiot PDF
No ratings yet
Aiot PDF
16 pages
Alphamet
No ratings yet
Alphamet
3 pages
Abdul Bari-Front Pages
No ratings yet
Abdul Bari-Front Pages
6 pages
Library Science and Information Science-1
No ratings yet
Library Science and Information Science-1
20 pages
Titanic (1997) YIFY - Download Movie TORRENT - YTS
No ratings yet
Titanic (1997) YIFY - Download Movie TORRENT - YTS
1 page
Romarico J. Mendoza, Petitioner, vs. People
No ratings yet
Romarico J. Mendoza, Petitioner, vs. People
9 pages
Instant download Populist authoritarianism : Chinese political culture and regime sustainability 1st Edition Wenfang Tang pdf all chapter
100% (2)
Instant download Populist authoritarianism : Chinese political culture and regime sustainability 1st Edition Wenfang Tang pdf all chapter
55 pages
NWS Residual Stress Measurement in Dissimilar Welding Joint
No ratings yet
NWS Residual Stress Measurement in Dissimilar Welding Joint
30 pages
Phenotypic Diversity Among The Virginia Breeding Lines of Groundnut (#1152984) - 2571852
No ratings yet
Phenotypic Diversity Among The Virginia Breeding Lines of Groundnut (#1152984) - 2571852
10 pages
Introduction To Random Graphs
100% (1)
Introduction To Random Graphs
583 pages

Data Mining With Clustering AND Classification

Uploaded by

Data Mining With Clustering AND Classification

Uploaded by

DATA MINING WITH

Strategic decision making

Predictive data mining is further categorized into:

Descriptive data mining is further classified into

• Clustering is “the process of organizing objects into

• A cluster is therefore a collection of objects which

NAME RANK YEARS TENURED Classifier

NAME RANK YEARS TENURED

Classification by Decision Tree

Unsupervised learning (clustering)

You might also like