unsupervised-learning

The document discusses the differences between supervised and unsupervised learning, emphasizing that unsupervised learning, particularly clustering, identifies intrinsic structures in data without predefined target attributes. It outlines various clustering techniques, their applications in real-life scenarios such as marketing and document organization, and highlights the advantages and disadvantages of unsupervised learning. Clustering is noted as a widely used data mining technique across multiple fields, with various algorithms and types available for implementation.

Uploaded by

Renee Winters

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views13 pages

unsupervised-learning

Uploaded by

Renee Winters

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 13

Unsupervised

Learning
Supervised learning vs.
unsupervised learning
 Supervised learning: discover patterns in the
data that relate data attributes with a target (class)
attribute.
 These patterns are then utilized to predict the

values of the target attribute in future data

instances.
 Unsupervised learning: The data have no
target attribute.
 We want to explore the data to find some intrinsic

structures in them.

2
Clustering
 Clustering is a technique for finding similarity groups
in data, called clusters.
 Clustering is often called an unsupervised learning
task as no class values denoting an a priori grouping
of the data instances are given, which is the case in
supervised learning.
 Due to historical reasons, clustering is often
considered synonymous with unsupervised learning.
 In fact, association rule mining is also

unsupervised.

3
An illustration
 The data set has three natural groups of data points,
i.e., 3 natural clusters.

4
What is clustering for?
 Let us see some real-life examples
 Example 1: groups people of similar sizes
together to make “small”, “medium” and
“large” T-Shirts.
 Tailor-made for each person: too expensive
 One-size-fits-all: does not fit all.
 Example 2: In marketing, segment
customers according to their similarities
 To do targeted marketing.

5
What is clustering for?
(cont…)
 Example 3: Given a collection of text documents,
we want to organize them according to their content
similarities,
 To produce a topic hierarchy

 In fact, clustering is one of the most utilized

data mining techniques.
 It has a long history, and used in almost every
field, e.g., medicine, psychology, botany,
sociology, biology, archeology, marketing,
insurance, libraries, etc.
 In recent years, due to the rapid increase of online
documents, text clustering becomes important.

6
Why Unsupervised Learning?
 Unsupervised machine learning finds all kind of
unknown patterns in data.
 Unsupervised methods help you to find features
which can be useful for categorization.
 It is taken place in real time, so all the input data
to be analyzed and labeled in the presence of
learners.
 It is easier to get unlabeled data from a
computer than labeled data, which needs
manual intervention.

7
Aspects of clustering
 A clustering algorithm
 Partition clustering
 Hierarchical clustering
 A distance (similarity, or dissimilarity) function
 Clustering quality
 Inter-clusters distance  maximized
 Intra-clusters distance  minimized
 The quality of a clustering result depends on
the algorithm, the distance function, and the
application.

8
Clustering Types
 There are different types of clustering you
can utilize:
 Exclusive (partitioning) : In this clustering method,
Data are grouped in such a way that one data can
belong to one cluster only.
 Example: K-means

 Agglomerative: In this clustering technique,

every data is a cluster. The iterative unions
between the two nearest clusters reduce the
number of clusters.
 Example: Hierarchical clustering
9
Clustering Types(Contd..)

 Probabilistic: This technique uses probability

distribution to create the clusters.

 Overlapping: In this technique, fuzzy sets is used

to cluster data. Each point may belong to two or
more clusters with separate degrees of
membership.
Here, data will be associated with an appropriate
membership value.

10
Algorithm

 Apriori algorithm
 K-mean
 Agglomerative Clustering
 DBSCAN
 SVM
 Density based Cluster

11
Applications
 Clustering automatically split the dataset into groups
base on their similarities
 Anomaly detection can discover unusual data points
in your dataset. It is useful for finding fraudulent
transactions
 Association mining identifies sets of items which
often occur together in your dataset
 Latent variable models are widely used for data
preprocessing. Like reducing the number of features
in a dataset or decomposing the dataset into
multiple components

12
Disadvantages
 You cannot get precise information regarding data
sorting, and the output as data used in unsupervised
learning is labeled and not known
 Less accuracy of the results is because the input
data is not known and not labeled by people in
advance. This means that the machine requires to
do this itself.
 The spectral classes do not always correspond to
informational classes.
 The user needs to spend time interpreting and label
the classes which follow that classification.

Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
Unit-4
No ratings yet
Unit-4
53 pages
Unit 2 Unsupervised Learning
No ratings yet
Unit 2 Unsupervised Learning
86 pages
Un-Supervised Machine Learning
No ratings yet
Un-Supervised Machine Learning
9 pages
unsupervised-learning
No ratings yet
unsupervised-learning
18 pages
UNIT-4
No ratings yet
UNIT-4
62 pages
Module 6.1
No ratings yet
Module 6.1
42 pages
Unsupervised - Learning Final
No ratings yet
Unsupervised - Learning Final
20 pages
ML Ch-3 Unsupervised Learning
100% (1)
ML Ch-3 Unsupervised Learning
31 pages
Clustering Part-A
No ratings yet
Clustering Part-A
41 pages
Unit 3 Supervised Learning
No ratings yet
Unit 3 Supervised Learning
89 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
15 pages
U5 unsupervised learning
No ratings yet
U5 unsupervised learning
15 pages
Unit III 1
No ratings yet
Unit III 1
22 pages
2nd Unit NN Final Class Notes (1)
No ratings yet
2nd Unit NN Final Class Notes (1)
50 pages
ML-UNSUPERVISED
No ratings yet
ML-UNSUPERVISED
35 pages
R20 machine learning unit 4
No ratings yet
R20 machine learning unit 4
49 pages
NI
No ratings yet
NI
10 pages
AI - W8L15
No ratings yet
AI - W8L15
44 pages
DSA Presentation Group 6
No ratings yet
DSA Presentation Group 6
34 pages
WINSEM2023-24 BEEE410L TH VL2023240502246 2024-03-22 Reference-Material-I
No ratings yet
WINSEM2023-24 BEEE410L TH VL2023240502246 2024-03-22 Reference-Material-I
95 pages
1
No ratings yet
1
59 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
20 pages
unit4
No ratings yet
unit4
96 pages
2nd Unit NN Final Class Notes
No ratings yet
2nd Unit NN Final Class Notes
51 pages
Machine Learning - Part -1
No ratings yet
Machine Learning - Part -1
17 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
21 pages
Module 6 - Un-Supervised Learning Algorithms
No ratings yet
Module 6 - Un-Supervised Learning Algorithms
31 pages
Group I Discrete Mathematics
No ratings yet
Group I Discrete Mathematics
4 pages
Lecture 3 Types of Machine Learning
No ratings yet
Lecture 3 Types of Machine Learning
40 pages
FAM_Unit5
No ratings yet
FAM_Unit5
47 pages
ARTIFICIAL INTELLIGENCE LEC 5
No ratings yet
ARTIFICIAL INTELLIGENCE LEC 5
20 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
14 pages
01 Introduction Clustering
No ratings yet
01 Introduction Clustering
11 pages
ML Unit-2 - RTU
No ratings yet
ML Unit-2 - RTU
33 pages
Unsupervised Machine Learning
No ratings yet
Unsupervised Machine Learning
16 pages
Unit-5 Clustering (March 16, 24)
No ratings yet
Unit-5 Clustering (March 16, 24)
25 pages
m Learning
No ratings yet
m Learning
11 pages
Unsupervised Lec
No ratings yet
Unsupervised Lec
12 pages
CLUSTERING
No ratings yet
CLUSTERING
20 pages
D3IT Clustering April 2023
No ratings yet
D3IT Clustering April 2023
70 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
9 pages
ML Unit 2 Notes
No ratings yet
ML Unit 2 Notes
14 pages
Unsupervised Machine Learning
No ratings yet
Unsupervised Machine Learning
63 pages
Unsupervised learning
No ratings yet
Unsupervised learning
10 pages
1694601073-Unit 3.1 Unsupervised Learning CU 2.0
No ratings yet
1694601073-Unit 3.1 Unsupervised Learning CU 2.0
35 pages
Unsupervised Machine Learning
No ratings yet
Unsupervised Machine Learning
10 pages
Week 9. Unsupervised Learning
No ratings yet
Week 9. Unsupervised Learning
32 pages
UnSupervised Learning
No ratings yet
UnSupervised Learning
3 pages
Introduction-to-Unsupervised-Machine-Learning
No ratings yet
Introduction-to-Unsupervised-Machine-Learning
9 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
6 pages
UnSupervised ML
No ratings yet
UnSupervised ML
17 pages
9 Som
No ratings yet
9 Som
32 pages
Day 3 - Content
No ratings yet
Day 3 - Content
50 pages
Unit 3 unsupervised learning algorith
No ratings yet
Unit 3 unsupervised learning algorith
15 pages
Unit 1
No ratings yet
Unit 1
52 pages
Unsupervised learning - overview
No ratings yet
Unsupervised learning - overview
6 pages
Lecture Unsupervised (17!04!2024).Pptx
No ratings yet
Lecture Unsupervised (17!04!2024).Pptx
61 pages
Machine Learning File
No ratings yet
Machine Learning File
7 pages
ML-Lecture-2-3-Types
No ratings yet
ML-Lecture-2-3-Types
27 pages
Language barriers 05-1
No ratings yet
Language barriers 05-1
12 pages
Security Challenges on the Web
No ratings yet
Security Challenges on the Web
16 pages
RESEARCH PAPER OUTLET
No ratings yet
RESEARCH PAPER OUTLET
8 pages
Leaf Disease Detection
No ratings yet
Leaf Disease Detection
15 pages
Comparative Study Ppt
No ratings yet
Comparative Study Ppt
10 pages
Computer Vision Part 2
No ratings yet
Computer Vision Part 2
5 pages
Brief Review Chat GPT
No ratings yet
Brief Review Chat GPT
9 pages
Age and Gender Prediction From Face Images Using Attentional Convolutional Network
No ratings yet
Age and Gender Prediction From Face Images Using Attentional Convolutional Network
6 pages
Ashish Patel Data Scientist
No ratings yet
Ashish Patel Data Scientist
4 pages
Instant Download Imbalanced Classification with Python Choose Better Metrics Balance Skewed Classes and Apply Cost Sensitive Learning 1st Edition Jason Brownlee PDF All Chapters
100% (3)
Instant Download Imbalanced Classification with Python Choose Better Metrics Balance Skewed Classes and Apply Cost Sensitive Learning 1st Edition Jason Brownlee PDF All Chapters
40 pages
Google AI Courses 2024 (Free Certifications)
No ratings yet
Google AI Courses 2024 (Free Certifications)
2 pages
Clevered AI Wizard Level 3
No ratings yet
Clevered AI Wizard Level 3
17 pages
Ai Term2 Practice - Paper - Answer Key
No ratings yet
Ai Term2 Practice - Paper - Answer Key
7 pages
mod3
No ratings yet
mod3
101 pages
01.query-By-Example On-Device Keyword Spotting
No ratings yet
01.query-By-Example On-Device Keyword Spotting
7 pages
Smart Logistics Warehouse Moving-Object Tracking Based on
No ratings yet
Smart Logistics Warehouse Moving-Object Tracking Based on
18 pages
Stanford Graph Learning Finance
No ratings yet
Stanford Graph Learning Finance
24 pages
Unit-5-2 Feedback Neural Networks-updated
No ratings yet
Unit-5-2 Feedback Neural Networks-updated
66 pages
Fraud Detection in Python Chapter4
No ratings yet
Fraud Detection in Python Chapter4
33 pages
SIGNLANGUAGE PPT
100% (1)
SIGNLANGUAGE PPT
15 pages
Chapter 1 - Machine Learning Fundamentals
No ratings yet
Chapter 1 - Machine Learning Fundamentals
52 pages
Neural Network Module 2 Notes
100% (1)
Neural Network Module 2 Notes
72 pages
Introduction To Deep Learning: Internet of Things Group
No ratings yet
Introduction To Deep Learning: Internet of Things Group
50 pages
Class 1 C
No ratings yet
Class 1 C
14 pages
Artificial Intelligence PDF
No ratings yet
Artificial Intelligence PDF
7 pages
Natural Language Processing With Improved Deep Lea
No ratings yet
Natural Language Processing With Improved Deep Lea
8 pages
CV 110121 Introduction
No ratings yet
CV 110121 Introduction
27 pages
DIP Final
No ratings yet
DIP Final
3 pages
Visual Question Generation in Bengali
No ratings yet
Visual Question Generation in Bengali
10 pages
Assignment On Crossword 2
No ratings yet
Assignment On Crossword 2
2 pages
ML PR-3
No ratings yet
ML PR-3
9 pages
AI 2025 SYLLABUS
No ratings yet
AI 2025 SYLLABUS
4 pages
2023 IEEE TNNLS A Survey On Evolutionary Neural Architecture Search
No ratings yet
2023 IEEE TNNLS A Survey On Evolutionary Neural Architecture Search
21 pages
Show and Tell: A Neural Image Caption Generator
No ratings yet
Show and Tell: A Neural Image Caption Generator
9 pages
Lecture 08 On Neural Networks 1
No ratings yet
Lecture 08 On Neural Networks 1
15 pages