Supervised vs unsupervised machine learning algorithms

- Harsh Agarwal

Introduction
• Machine learning (ML) is a branch of Artificial Intelligence (AI) and
computer science that focuses on using data and algorithms to
enable AI to imitate the way that humans learn, gradually improving
its accuracy.
• Supervised learning is a category of machine learning that uses
labeled datasets to train algorithms to predict outcomes and
recognize patterns.
• Unsupervised learning is a category of machine learning that finds
patterns in unlabeled data, without human-provided target outputs.

Supervised Learning
• In supervised learning, the machine is trained on a set of
labeled data, which means that the input data is paired with the
desired output. The machine then learns to predict the output
for new input data. Supervised learning is often used for tasks
such as classification, regression, and object detection.
• Some of the key characteristics include: labeled datasets,
input-output mapping, training and testing phases, and evaluation
metrics.
• One of the common algorithms in supervised learning is the
Support Vector Machine (SVM).
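
To make the train/test workflow concrete, here is a minimal sketch of a linear SVM trained by sub-gradient descent on the hinge loss. It is an illustration, not a production method: the toy data, the hyperparameters (`lam`, `lr`, `epochs`), and the function names are all assumptions, and the two features are assumed to be already centered around zero. In practice a library implementation would normally be used instead.

```python
# Toy labeled dataset (hypothetical): each input is (x1, x2), each label is +1 or -1.
# Assumption: features are already centered, so the boundary passes near the origin.
train_X = [(-2.0, -1.5), (-1.5, -2.5), (-2.5, -2.0), (2.0, 1.5), (1.5, 2.5), (2.5, 2.0)]
train_y = [-1, -1, -1, 1, 1, 1]
test_X = [(-1.8, -2.2), (2.2, 1.7)]  # held-out inputs for the testing phase
test_y = [-1, 1]


def train_linear_svm(X, y, lam=0.01, lr=0.01, epochs=200):
    """Fit weights w and bias b by sub-gradient descent on the regularized hinge loss."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        for (x1, x2), label in zip(X, y):
            margin = label * (w[0] * x1 + w[1] * x2 + b)
            # The regularization term shrinks w slightly on every step.
            w[0] -= lr * lam * w[0]
            w[1] -= lr * lam * w[1]
            if margin < 1:
                # The point is misclassified or inside the margin: move the boundary.
                w[0] += lr * label * x1
                w[1] += lr * label * x2
                b += lr * label
    return w, b


def predict(w, b, x):
    """Classify by which side of the learned boundary the input falls on."""
    return 1 if w[0] * x[0] + w[1] * x[1] + b >= 0 else -1


# Training phase uses the labeled pairs; testing phase scores unseen inputs.
w, b = train_linear_svm(train_X, train_y)
predictions = [predict(w, b, x) for x in test_X]
accuracy = sum(p == t for p, t in zip(predictions, test_y)) / len(test_y)
```

The sketch touches each characteristic listed above: labeled pairs, a learned input-output mapping, separate training and testing data, and accuracy as the evaluation metric.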

Unsupervised Learning
• In unsupervised learning, the machine is trained on a set of
unlabeled data, which means that the input data is not
paired with the desired output. The machine then learns to
find patterns and relationships in the data.
• Key characteristics include unlabeled data and the lack of a
clear evaluation metric. Typical tasks include clustering,
dimensionality reduction, anomaly detection, and association
rule learning.
• One of the common algorithms for unsupervised learning is
k-means clustering.
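
As a rough illustration of k-means, the sketch below alternates between assigning points to their nearest centroid and moving each centroid to the mean of its members (Lloyd's algorithm). The data points, the fixed iteration count, and the deterministic seeding rule are assumptions made for the example; real implementations usually pick starting centroids randomly or with k-means++.

```python
import math

# Unlabeled toy data (hypothetical): 2-D points with no target output attached.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (8.5, 9.0), (9.0, 8.0)]


def kmeans(data, k, iterations=10):
    """Lloyd's algorithm: alternate between assigning points and moving centroids."""
    # Assumption: seed with k evenly spaced points so the run is deterministic.
    step = len(data) // k
    centroids = [data[i * step] for i in range(k)]
    assignments = [0] * len(data)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in data
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(data, assignments) if a == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return centroids, assignments


centroids, assignments = kmeans(points, k=2)
```

No labels are supplied at any point; the two groups come only from distances between the points. Choosing `k` is left to the user, which is one reason evaluation is harder here than in supervised learning.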

Comparison
Supervised Learning | Unsupervised Learning
Learns from labeled data to predict an output | Discovers patterns or structure in unlabeled data
Requires labeled data | Works with unlabeled data
Predicts outcomes for new data (classification or regression) | Finds hidden patterns, groupings, or structures in data
Predictive: output values or categories | Descriptive: clusters, reduced dimensions, or relationships

Real-World Examples
• Supervised Learning:
Spam email detection
Predicting house prices
Speech recognition
• Unsupervised Learning:
Anomaly detection
Market basket analysis
Image compression

Student Performance Prediction
• Supervised learning is the better choice when labeled data is
available and the goal is to make specific, actionable predictions,
such as identifying students who might fail or excel in a course.
• Predicting grades or final scores based on historical academic
data.
• Classifying students into performance categories (e.g., "high
achievers," "average," "at-risk") using past performance and
demographic factors.
• Regression tasks like forecasting future scores based on
attendance, engagement, or socioeconomic data.
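
The regression and classification tasks above could be sketched as follows: a one-feature least-squares fit that forecasts a final score from attendance, then a threshold rule that maps the forecast to a performance category. The records, the thresholds, and the function names are hypothetical and chosen only for illustration; a real model would use more features and validated cut-offs.

```python
# Hypothetical historical records: (attendance %, final score).
history = [(60, 52), (70, 61), (80, 70), (90, 78), (100, 89)]


def fit_line(pairs):
    """Ordinary least squares for one feature: score = slope * attendance + intercept."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    var_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    slope = cov_xy / var_x
    return slope, mean_y - slope * mean_x


slope, intercept = fit_line(history)


def predict_score(attendance):
    """Forecast a final score for a new student from attendance alone."""
    return slope * attendance + intercept


def performance_category(score, fail_below=60, excel_from=80):
    # Thresholds are illustrative assumptions, not values from the slides.
    if score < fail_below:
        return "at-risk"
    return "high achiever" if score >= excel_from else "average"


category_low = performance_category(predict_score(65))
category_high = performance_category(predict_score(95))
```

The labeled history (known final scores) is what makes this supervised: the fit is only possible because past outcomes were recorded.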

• Unsupervised Learning excels in exploratory
phases, identifying hidden patterns, and providing
insights when labeled data is unavailable. It is
particularly useful for segmenting students based
on learning behaviors or needs.
• Grouping students into clusters based on learning
behaviors, participation levels, or resource usage.
• Identifying hidden patterns in engagement data that
correlate with performance.
• Dimensionality reduction to simplify large datasets
while retaining essential patterns.
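
For the dimensionality-reduction point, one possible sketch is a principal component analysis (PCA) that compresses several engagement measures into a single score per student. It finds the first principal component by power iteration on the covariance matrix. The feature names and values are hypothetical, and the features are left unscaled for brevity; in practice they would usually be standardized first.

```python
import math

# Hypothetical engagement data, one row per student:
# (logins per week, hours on platform, forum posts). No grades or labels are used.
engagement = [
    (2, 1.0, 0),
    (4, 2.5, 1),
    (6, 3.0, 3),
    (8, 5.0, 4),
    (10, 6.5, 6),
]


def first_principal_component(rows, iterations=100):
    """Return the top principal direction and each row's 1-D projection onto it."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix of the centered features.
    cov = [
        [sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
        for i in range(d)
    ]
    # Power iteration: repeated multiplication converges to the dominant eigenvector.
    v = [1.0] * d
    for _ in range(iterations):
        v = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
    scores = [sum(c[j] * v[j] for j in range(d)) for c in centered]
    return v, scores


direction, engagement_scores = first_principal_component(engagement)
```

Each student is now described by one number instead of three, and that number could feed a clustering step to group students by engagement level.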

Challenges and Limitations
Supervised Learning:
Dependence on labeled data
Overfitting/underfitting
Scalability
Handling imbalanced data
Bias in data
Unsupervised Learning:
No labeled baseline (ground truth) to evaluate against
Interpretability
Choosing the right algorithm
Outliers
Dimensionality challenges
