Chandigarh School of Business, Jhanjeri
Department of Computer Application
Program Name: BCA
Course Code: UGCA1950
Course Name: Machine Learning
Prepared by: Dr. Gurwinder Singh
Outlines
• PTU Syllabus of Unit-I
• CO Introduction
• Topic Overview
• Brief description of what the presentation will cover
• Importance or relevance of the topic
• Key objectives or learning outcomes
• Summary
• References
PTU Syllabus of Unit-I
Clustering: What is Clustering & its Use Cases, K-means Clustering, How does the K-means
algorithm work, C-means Clustering, Hierarchical Clustering, How Hierarchical
Clustering works.
CO Introduction
CO NUMBER: CO3
TOPIC: Design solutions for basic problems using machine learning algorithms
LEVEL: PO(1,2,3,4,9) & PSO(1)
Topic Overview
K-means Clustering
K-means Clustering is a popular unsupervised machine learning algorithm used for partitioning data into distinct groups or
clusters.
It aims to group data points in such a way that points within the same cluster are more similar to each other than to those in
other clusters.
How It Works:
Define the Number of Clusters (K):
The user specifies the desired number of clusters (K).
Random Initialization:
K initial "centeroids" (cluster centers) are randomly placed in the data space.
Assignment Step:
Each data point is assigned to the nearest centeroid based on a distance metric (e.g., Euclidean distance).
Update Step:
The centeroid of each cluster is recalculated as the mean of all points assigned to it.
Iterative Process:
Steps 3 and 4 are repeated until centeroids stabilize or a predefined stopping condition is met.
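These five steps map directly onto scikit-learn's KMeans (a minimal sketch; the blob data below is made up for illustration):

    from sklearn.cluster import KMeans
    from sklearn.datasets import make_blobs

    # Toy data: 300 points scattered around 3 centers (hypothetical example data).
    X, _ = make_blobs(n_samples=300, centers=3, random_state=42)

    # Step 1: choose K. Steps 2-5 (initialize, assign, update, iterate)
    # all happen inside fit().
    km = KMeans(n_clusters=3, random_state=42).fit(X)
    print(km.labels_[:10])        # cluster assignments of the first 10 points
    print(km.cluster_centers_)    # final centroids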
Brief description of what the presentation will cover
What is Clustering & Its Use Cases
Introduction to clustering and its applications.
K-means Clustering
Explanation of the K-means algorithm and its working process.
C-means Clustering
Overview of C-means clustering.
Hierarchical Clustering
Description of hierarchical clustering and its working mechanism.
Clustering Algorithms
• Flat algorithms
– Usually start with a random (partial) partitioning
– Refine it iteratively
• K-means clustering
• (Model based clustering)
• Hierarchical algorithms
– Bottom-up, agglomerative
– (Top-down, divisive)
K-Means
• Assumes documents are real-valued vectors.
• Clusters based on centroids (aka the center of gravity or mean) of the points in a cluster c:
    μ(c) = (1/|c|) Σ_{x ∈ c} x
• Reassignment of instances to clusters is based on distance to the current cluster
centroids.
– (Or one can equivalently phrase it in terms of similarities)
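As a quick illustration of this formula (a minimal NumPy sketch with made-up points), the centroid is just the coordinate-wise mean of the cluster's members:

    import numpy as np

    # mu(c) = (1/|c|) * sum of x over x in c
    cluster = np.array([[1.0, 2.0],
                        [3.0, 4.0],
                        [5.0, 0.0]])
    mu = cluster.mean(axis=0)   # same as cluster.sum(axis=0) / len(cluster)
    print(mu)                   # [3. 2.]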
K-Means Algorithm
Select K random docs {s1, s2, …, sK} as seeds.
Until clustering converges (or other stopping criterion):
    For each doc di:
        Assign di to the cluster cj such that dist(xi, sj) is minimal.
    (Next, update the seeds to the centroid of each cluster)
    For each cluster cj:
        sj = μ(cj)
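A from-scratch sketch of this algorithm (assuming NumPy, with X an N-by-M array of document vectors; the function name and defaults are my own):

    import numpy as np

    def kmeans(X, k, max_iter=100, seed=0):
        rng = np.random.default_rng(seed)
        # Select K random docs as the initial seeds.
        seeds = X[rng.choice(len(X), size=k, replace=False)]
        for _ in range(max_iter):
            # Assignment: each doc goes to the cluster with the nearest seed.
            dists = np.linalg.norm(X[:, None, :] - seeds[None, :, :], axis=2)
            labels = dists.argmin(axis=1)
            # Update: each seed moves to the centroid mu(c_j) of its cluster.
            new_seeds = np.array([X[labels == j].mean(axis=0)
                                  if np.any(labels == j) else seeds[j]
                                  for j in range(k)])
            if np.allclose(new_seeds, seeds):   # converged: centroids stable
                break
            seeds = new_seeds
        return labels, seeds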
K Means Example (K = 2)
[Figure: a worked K-means run with K = 2 — pick seeds, reassign clusters, compute centroids, then repeat the reassign/recompute steps until the clustering stabilizes.]
Termination conditions
• Several possibilities, e.g.,
– A fixed number of iterations.
– Doc partition unchanged.
– Centroid positions don’t change.
Does this mean that the docs in a cluster are unchanged?
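In scikit-learn's KMeans, for instance, the first and third conditions correspond to the max_iter and tol parameters (a sketch on made-up data):

    from sklearn.cluster import KMeans
    from sklearn.datasets import make_blobs

    X, _ = make_blobs(n_samples=300, centers=3, random_state=0)  # toy data

    # max_iter caps the iteration count (fixed-number-of-iterations stop);
    # tol stops early once centroid movement falls below a threshold.
    km = KMeans(n_clusters=3, max_iter=300, tol=1e-4, random_state=0).fit(X)
    print(km.n_iter_)   # iterations actually performed before stopping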
Convergence
• Why should the K-means algorithm ever reach a fixed point?
– A state in which clusters don’t change.
• K-means is a special case of a general procedure known as the Expectation
Maximization (EM) algorithm.
– EM is known to converge.
– The number of iterations could be large.
– But in practice it usually isn’t.
Convergence of K-Means
• Recomputation monotonically decreases each Gk (the sum of squared distances within cluster k), since, with mk the number of members in cluster k:
– Σi (di – a)² reaches its minimum where
– Σi –2(di – a) = 0, i.e.
– Σi di = Σi a = mk a, hence
– a = (1/mk) Σi di = ck, the centroid of cluster k.
• K-means typically converges quickly
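The derivation can be checked numerically (a small sketch with random points): the sum of squared distances is never smaller at a perturbed center than at the mean:

    import numpy as np

    rng = np.random.default_rng(0)
    points = rng.normal(size=(100, 2))      # a toy cluster of 2-D points
    centroid = points.mean(axis=0)          # the candidate minimizer a = c_k

    def sse(a):
        # Sum of squared distances from every point to candidate center a.
        return np.sum((points - a) ** 2)

    for _ in range(5):
        candidate = centroid + rng.normal(scale=0.5, size=2)
        assert sse(centroid) <= sse(candidate)   # the mean is always at least as good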
Time Complexity
• Computing distance between two docs is O(M) where M is the dimensionality of the
vectors.
• Reassigning clusters: O(KN) distance computations, or O(KNM).
• Computing centroids: Each doc gets added once to some centroid: O(NM).
• Assume these two steps are each done once for I iterations: O(IKNM).
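For example (illustrative numbers only): with K = 10 clusters, N = 100,000 docs, M = 1,000 dimensions, and I = 25 iterations, reassignment alone costs on the order of IKNM = 25 × 10 × 100,000 × 1,000 = 2.5 × 10¹⁰ elementary operations, while the O(INM) centroid updates cost roughly a tenth of that.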
Seed Choice
• Results can vary based on random seed selection.
• Some seeds can result in a poor convergence rate, or convergence to sub-optimal clusterings.
– Select good seeds using a heuristic (e.g., the doc least similar to any existing mean).
– Try out multiple starting points.
– Initialize with the results of another method.
[Figure: example showing sensitivity to seeds. If you start with B and E as centroids you converge to {A,B,C} and {D,E,F}; if you start with D and F you converge to {A,B,D,E} and {C,F}.]
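The first two remedies are one-liners in scikit-learn (a sketch on toy data; k-means++ is a seeding heuristic that spreads the initial centroids apart, and n_init keeps the best of several random restarts):

    from sklearn.cluster import KMeans
    from sklearn.datasets import make_blobs

    X, _ = make_blobs(n_samples=200, centers=2, random_state=1)  # toy data

    # init="k-means++" picks well-spread seeds; n_init=10 runs the algorithm
    # from 10 different seeds and keeps the lowest-SSE clustering.
    km = KMeans(n_clusters=2, init="k-means++", n_init=10, random_state=0).fit(X)
    print(km.inertia_)   # SSE (inertia) of the best of the 10 runs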
Key objectives or learning outcomes
Understand the Concept of K-means Clustering
Learn the definition and purpose of K-means clustering as a data segmentation tool.
Explore the Applications of K-means Clustering
Identify real-world scenarios where K-means clustering is commonly applied, such as customer
segmentation, market research, and image segmentation.
Learn How the K-means Algorithm Works
Gain a step-by-step understanding of the K-means clustering process, including initialization,
assignment, and updating steps.
Understand the Role of Distance Metrics
Understand how distance measures (e.g., Euclidean distance) are used to assign data points to clusters.
Appreciate the Strengths and Limitations of K-means
Recognize the advantages of K-means, such as simplicity and efficiency, and its limitations, such as
sensitivity to outliers and reliance on the predefined number of clusters.
Apply K-means in Practice
Gain hands-on experience applying the K-means algorithm to practical datasets.
THANK YOU