
UNIT – III: Supervised Learning – II

K-NN classifier – Logistic regression – Perceptron – Single layer & Multi-layer – Support Vector Machines – Linear & Non-linear – Metrics & Error Correction.

The K-Nearest Neighbour (KNN) Classifier is a simple and effective machine learning algorithm used primarily for classification tasks. It operates on the idea that similar data points tend to have the same label.

Key Concepts of KNN:

1. Similarity Based on Distance:
○ The KNN algorithm computes the distance between the input data point (the point you want to classify) and all the other points in the dataset.
○ It often uses Euclidean distance to measure how "close" data points are to each other.
2. K Nearest Neighbours:
○ After calculating the distance between the new point and all the
points in the dataset, it identifies the K closest points (where
K is a number chosen by the user).
○ The most common class among these K neighbors is then
assigned to the new data point.
3. Lazy Learning:
○ KNN is considered a lazy learner because it doesn’t
actually learn a model during the training phase. Instead, it
memorizes the entire dataset and only makes calculations
when it needs to classify a new data point.
Choosing the Value of K:

● Square Root Rule: A common way to select K is to set it equal to the square root of
the number of data points in the training set.
● Testing Multiple K Values: You can try different K values and choose the one
that works best on validation data.
● Weighted Voting: In cases where some neighbors are much closer than others, you
can give them more weight in the voting process.
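To make the distance-and-vote procedure concrete, here is a minimal from-scratch sketch of the plain majority-vote version (NumPy assumed; the `knn_predict` helper and the toy two-cluster data are invented for illustration):

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=3):
    # Euclidean distance from the new point to every training point
    distances = np.linalg.norm(X_train - x_new, axis=1)
    # Indices of the K closest training points
    nearest = np.argsort(distances)[:k]
    # Majority vote among the K neighbours decides the class
    return Counter(y_train[nearest]).most_common(1)[0][0]

# Toy data: two well-separated clusters
X_train = np.array([[1.0, 1.0], [1.5, 2.0], [2.0, 1.5],   # class 0
                    [6.0, 6.0], [6.5, 7.0], [7.0, 6.5]])  # class 1
y_train = np.array([0, 0, 0, 1, 1, 1])

print(knn_predict(X_train, y_train, np.array([6.2, 6.1]), k=3))  # -> 1
```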
Problems:
1. https://www.youtube.com/watch?v=HZT0lxD5h6k&list=PLoE_l1sqGKLRXcbobCaczeu5y4sk6yKrL&index=2
2. https://www.youtube.com/watch?v=YndCelYQfgs&list=PLoE_l1sqGKLRXcbobCaczeu5y4sk6yKrL&index=5
3. https://www.youtube.com/watch?v=T0YkfWssHjk&list=PLoE_l1sqGKLRXcbobCaczeu5y4sk6yKrL&index=6
4. https://www.youtube.com/watch?v=WgSGdukowNA&list=PLoE_l1sqGKLRXcbobCaczeu5y4sk6yKrL&index=8
5. https://www.youtube.com/watch?v=kCNpLbCUo7g&list=PLoE_l1sqGKLRXcbobCaczeu5y4sk6yKrL&index=8

Logistic Regression is a supervised machine learning algorithm used for classification tasks, where the output is a categorical dependent variable. Unlike linear regression, which predicts continuous values, logistic regression predicts probabilities for a binary outcome (such as Yes/No or 0/1). The predicted probabilities are mapped between 0 and 1 using a sigmoid function.

Key Features of Logistic Regression:

● Binary Classification: The output is categorical (e.g., Yes/No, 0/1).
● Probabilistic Output: Logistic regression does not directly predict 0 or 1; it predicts the probability that the outcome is 1.
● Logistic Function (Sigmoid Function): This function maps predicted values to probabilities between 0 and 1, creating an "S-shaped" curve.

Sigmoid function: σ(z) = 1 / (1 + e^(−z))

Types of Logistic Regression:

1. Binomial: The dependent variable has two possible types, e.g., 0 or 1.
2. Multinomial: More than two unordered categories, e.g., "cat", "dog", "rabbit".
3. Ordinal: More than two ordered categories, e.g., "low", "medium", "high".

Assumptions:

● The dependent variable must be binary (for binary logistic regression).
● Independent variables should not be highly correlated (low multicollinearity).
● Large sample sizes are preferred.
Logistic Regression Equation:

Logistic regression builds on the linear regression equation, mapping the linear output to a probability with the sigmoid function. The basic logistic regression equation is:

p = σ(w·x + b) = 1 / (1 + e^(−(w·x + b)))

where p is the predicted probability that the outcome is 1, x is the feature vector, w the weights, and b the bias (intercept).
Example Problems
1.https://www.youtube.com/watch?v=2C8IqOLO1os
2.https://www.youtube.com/watch?v=UCOm-LFKX9E

A Perceptron is the simplest type of artificial neural network, serving as the foundation for
more complex neural networks. It is a supervised learning algorithm used for binary
classification, which means it classifies input data into one of two categories (e.g., 0 or 1,
Yes or No).
A perceptron is a linear binary classifier: it makes its prediction with a linear predictor function, i.e., a combination of a weight vector and the feature vector.

● The Perceptron is a foundational algorithm in Machine Learning and Artificial Neural Networks.
● Invented by Frank Rosenblatt in 1957, it was designed for binary classification tasks.
● It is a linear, supervised learning algorithm used for classifying data into two categories.

Key Concepts

● Binary Classifier:
A function that determines whether an input belongs to one of two classes.
Perceptrons create a linear decision boundary for classification tasks.
● Components:
1. Input Layer: Accepts numerical input data.
2. Weights and Bias:
■ Weights: Measure the strength of the connection between input and output.
■ Bias: Allows shifting of the decision boundary.
3. Net Sum: Weighted sum of inputs and bias (∑ wi·xi + b).
4. Activation Function: Determines the output (e.g., Step, Sign, Sigmoid functions).
How It Works

1. Step 1: Compute the weighted sum: Net Sum = ∑ wi·xi + b
2. Step 2: Apply the activation function: Y = f(Net Sum)
○ Outputs binary values (e.g., 0 or 1, or −1 and 1).
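As a quick illustration of these two steps, here is a sketch of a single forward pass with a step activation (the weights and bias are hypothetical values, chosen so that the unit happens to implement an AND gate):

```python
import numpy as np

def perceptron_output(x, w, b):
    # Step 1: weighted sum of inputs plus bias
    net = np.dot(w, x) + b
    # Step 2: step activation -> binary output
    return 1 if net >= 0 else 0

# Hypothetical weights for a 2-input perceptron
w, b = np.array([0.5, 0.5]), -0.7
print(perceptron_output(np.array([1, 0]), w, b))  # -> 0
print(perceptron_output(np.array([1, 1]), w, b))  # -> 1
```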

Key Features of a Perceptron:

1. Inputs: Takes multiple input features (e.g., x1, x2, x3, …).
2. Weights: Each input is associated with a weight that adjusts its influence on the output.
3. Summation: The perceptron computes a weighted sum of the inputs.
4. Activation Function: The sum is passed through an activation function (usually a step function) that outputs 0 or 1 based on a threshold.

Learning Process:

● Training: The perceptron adjusts its weights using the perceptron learning rule until it correctly classifies all training examples (convergence is guaranteed only when the data is linearly separable).
● Output: It predicts 0 or 1 depending on whether the weighted sum of inputs is
below or above the threshold.

Limitations:

● Perceptrons can only solve linearly separable problems (e.g., classifying data that
can be separated by a straight line).
● They struggle with more complex tasks, which is why multi-layer perceptrons
(MLPs) were developed to solve nonlinear problems.
Applications: Basic binary classification tasks such as spam detection or image recognition
(for simple patterns).

Single-Layer Perceptron (SLP)

A Single-Layer Perceptron is the simplest type of neural network. It is composed of a single layer
of neurons (or nodes) that process inputs and produce outputs. It’s used for solving binary
classification problems when the data is linearly separable.

Structure

1. Inputs (x1, x2, ..., xn): Features of the data.
2. Weights (w1, w2, ..., wn): Values associated with each input, adjusting their importance.
3. Bias (b): A constant added to the weighted sum to shift the decision boundary.
4. Weighted Sum: z = ∑ wi·xi + b
5. Activation Function: a step function that outputs y = 1 if z ≥ 0, and y = 0 otherwise.
How It Works

1. Forward Pass:
○ Takes inputs, computes the weighted sum, and passes it through the step activation
function to produce an output (0 or 1).
2. Training:
○ Uses the Perceptron Learning Rule: wi ← wi + η(y − ŷ)·xi, and b ← b + η(y − ŷ)
○ Where η is the learning rate, y the true label, and ŷ the predicted output.
○ Weights are updated iteratively until all training examples are correctly classified.
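A minimal sketch of this training loop, assuming NumPy; the AND-gate data is chosen because it is linearly separable, so the rule converges:

```python
import numpy as np

def train_perceptron(X, y, lr=0.1, epochs=20):
    """Perceptron learning rule: w <- w + lr * (y - y_hat) * x."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        for xi, target in zip(X, y):
            y_hat = 1 if np.dot(w, xi) + b >= 0 else 0
            error = target - y_hat
            w += lr * error * xi   # update weights
            b += lr * error       # update bias
    return w, b

# AND gate: linearly separable, so the perceptron converges
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 0, 0, 1])
w, b = train_perceptron(X, y)
print([1 if np.dot(w, xi) + b >= 0 else 0 for xi in X])  # [0, 0, 0, 1]
```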

Applications of Single-Layer Perceptron

1. Implementing simple logical gates like AND, OR, and NOT.
2. Solving binary classification problems where data is linearly separable.

Limitations

1. Linearity:
○ Can only classify data that is linearly separable (e.g., NOT, AND, OR).
○ Cannot handle non-linear problems like XOR.
2. Fixed Activation:
○ Uses a step function, which is not differentiable, limiting flexibility.
Multi-Layer Perceptron (MLP)
https://www.youtube.com/watch?v=tTjcakAuHPI&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=94

A Multi-Layer Perceptron (MLP) is a type of neural network that consists of an input layer, one or
more hidden layers, and an output layer. It is capable of solving non-linear problems, making it
more powerful than the single-layer perceptron.

Structure of MLP

1. Input Layer:
○ Accepts the input features (x1,x2,...,xn​) from the dataset.
2. Hidden Layers:
○ One or more layers of neurons between the input and output layers.
○ Each neuron applies weights, biases, and a non-linear activation function (e.g., Sigmoid,
ReLU).
○ These layers enable the MLP to learn complex relationships.
3. Output Layer:
○ Produces the final output (e.g., class probabilities or regression values).

How It Works

1. Forward Propagation:

● Data flows through the network:
○ Compute the weighted sum of inputs (z = ∑ wi·xi + b) for each neuron.
○ Apply a non-linear activation function (e.g., Sigmoid, ReLU) to introduce non-linearity.
● The output of one layer becomes the input for the next layer.

2. Backward Propagation:
● Error (difference between predicted and actual output) is calculated using a loss function (e.g.,
Mean Squared Error, Cross-Entropy Loss).
● The error is propagated backward through the network using the chain rule to compute
gradients.
● Weights are updated using gradient descent or its variants (e.g., SGD, Adam):

w ← w − η · (∂L/∂w)

Where η is the learning rate and L is the loss.
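The following is a compact sketch of forward and backward propagation in NumPy, training a one-hidden-layer MLP on XOR with a squared-error loss (the layer sizes, learning rate, and epoch count are illustrative choices, not prescribed values):

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)  # XOR targets

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One hidden layer with 4 sigmoid neurons, one sigmoid output neuron
W1 = rng.normal(size=(2, 4)); b1 = np.zeros((1, 4))
W2 = rng.normal(size=(4, 1)); b2 = np.zeros((1, 1))
lr = 0.5

for _ in range(10000):
    # Forward propagation: each layer's output feeds the next layer
    h = sigmoid(X @ W1 + b1)
    y_hat = sigmoid(h @ W2 + b2)
    # Backward propagation: chain rule gives each layer's error signal
    d_out = (y_hat - y) * y_hat * (1 - y_hat)
    d_hid = (d_out @ W2.T) * h * (1 - h)
    # Gradient-descent updates: w <- w - lr * dL/dw
    W2 -= lr * h.T @ d_out; b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * X.T @ d_hid; b1 -= lr * d_hid.sum(axis=0)

print(np.round(y_hat.ravel(), 2))  # should approach [0, 1, 1, 0]
```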

1. https://www.youtube.com/watch?v=tUoUdOdTkRw&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=88
2. https://www.youtube.com/watch?v=n2L1J5JYgUk&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=89
3. https://www.youtube.com/watch?v=AWhboi1aTxI&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=90

Activation Functions in MLP

1. Sigmoid: Outputs values between 0 and 1, good for probabilities: σ(x) = 1 / (1 + e^(−x))
2. ReLU (Rectified Linear Unit): Outputs max(0, x); helps avoid vanishing gradients.
3. Tanh: Outputs values between −1 and 1, useful for symmetric outputs.
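A small sketch of the three functions side by side (NumPy assumed):

```python
import numpy as np

def sigmoid(x):
    return 1 / (1 + np.exp(-x))   # output in (0, 1)

def relu(x):
    return np.maximum(0, x)       # output in [0, inf)

def tanh(x):
    return np.tanh(x)             # output in (-1, 1)

z = np.array([-2.0, 0.0, 2.0])
print(sigmoid(z))  # approx [0.12 0.5  0.88]
print(relu(z))     # [0. 0. 2.]
print(tanh(z))     # approx [-0.96  0.    0.96]
```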

Key Features of MLP


1. Non-Linearity:
○ Hidden layers and non-linear activation functions allow MLPs to model complex
relationships.
2. Universal Approximation:
○ An MLP with enough hidden neurons can approximate any continuous function to arbitrary accuracy (the universal approximation theorem).
3. Deep Learning:
○ Forms the basis of modern deep neural networks when extended with many hidden
layers.

Applications of MLP

1. Classification:
○ Handwritten digit recognition (e.g., MNIST dataset).
○ Sentiment analysis (positive/negative classification).
2. Regression:
○ Predicting housing prices.
3. Clustering and Feature Extraction:
○ Learning representations of data for tasks like anomaly detection.
4. Complex Tasks:
○ Image recognition, speech recognition, and natural language processing (with deep
extensions).

MLP Example: Solving XOR Problem

● XOR is not linearly separable, but MLP solves it using two hidden neurons:
1. Hidden neurons learn to split the input space into non-linear regions.
2. The output layer combines these regions to form the XOR logic.
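A quick check of this claim with scikit-learn (assumed installed; the hidden-layer size and solver are illustrative choices, and a small tanh layer is enough in practice):

```python
from sklearn.neural_network import MLPClassifier

X = [[0, 0], [0, 1], [1, 0], [1, 1]]
y = [0, 1, 1, 0]  # XOR: not linearly separable

mlp = MLPClassifier(hidden_layer_sizes=(4,), activation='tanh',
                    solver='lbfgs', max_iter=2000, random_state=1)
mlp.fit(X, y)
print(mlp.predict(X))  # expected: [0 1 1 0]
```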

Advantages

1. Can handle non-linear problems.
2. Flexible and can adapt to a wide range of tasks.
3. Enables deeper networks for more complex tasks.

Limitations

1. Computationally expensive for large datasets.
2. Susceptible to overfitting without proper regularization.
3. Requires a lot of labeled data for training.

Perceptron:

● https://www.youtube.com/watch?v=ItkSCYzSD34&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=91
● https://www.youtube.com/watch?v=du5fyS44DR8&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=76
● https://www.youtube.com/watch?v=zmIzNBMsQYQ&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=77
● https://www.youtube.com/watch?v=uxmNDNb0u9A&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=78
● https://www.youtube.com/watch?v=dM_8Y41EgsY&list=PL4gu8xQu0_5JBO1FKRO5p20wc8DprlOgn&index=7
Support Vector Machines (SVM) is a powerful supervised machine learning algorithm used for
classification and regression tasks, though it is more commonly applied to classification
problems. SVM works by finding a hyperplane that best divides a dataset into different classes.

The goal of SVM is to find a decision boundary (or hyperplane) that maximizes the margin
between data points of different classes. The margin is the distance between the hyperplane and
the closest data points from either class, known as support vectors. The idea is that the wider
the margin, the better the generalization ability of the model.

● Linear SVM: If the data is linearly separable, a straight line (in 2D), a plane (in 3D),
or a hyperplane (in higher dimensions) can separate the two classes.
● Non-linear SVM: If the data is not linearly separable, SVM can still perform well by
transforming the data into a higher-dimensional space where a linear hyperplane can
be used for separation (through the kernel trick).

Margin: The margin is the distance between the decision boundary (hyperplane) and the closest data points from either class. SVM aims to maximize this margin to improve the model's generalization ability. A larger margin reduces the risk of overfitting.

Support Vectors: Support vectors are the data points that are closest to the decision boundary (hyperplane). These points are critical in defining the position and orientation of the hyperplane. They are the most important data points for constructing the model.

Hyperplane: A hyperplane is a decision boundary that separates data points of different classes. In 2D it is a line; in 3D, a plane; and in higher dimensions, a hyperplane.
Linear SVM

When the data is linearly separable, SVM tries to find the hyperplane that best separates the two classes. The hyperplane can be represented as:

w·x + b = 0

Where:

● w is the weight vector (normal to the hyperplane).
● x is the feature vector.
● b is the bias term (it shifts the hyperplane away from the origin; the hyperplane's distance from the origin is |b| / ||w||).

Objective:

● Maximize the margin, which is defined as the distance from the hyperplane to the nearest data points (support vectors).
● The distance of a data point xi from the hyperplane is given by: |w·xi + b| / ||w||
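A minimal sketch with scikit-learn's SVC, recovering w, b, the support vectors, and each point's distance from the hyperplane (the toy data is invented for illustration):

```python
import numpy as np
from sklearn.svm import SVC

# Toy linearly separable data
X = np.array([[1, 2], [2, 3], [3, 3], [6, 5], [7, 7], [8, 6]])
y = np.array([0, 0, 0, 1, 1, 1])

clf = SVC(kernel='linear', C=1.0).fit(X, y)
w, b = clf.coef_[0], clf.intercept_[0]
print("hyperplane: w·x + b = 0 with w =", w, "b =", b)
print("support vectors:\n", clf.support_vectors_)
# Distance of each point from the hyperplane: |w·x + b| / ||w||
print(np.abs(X @ w + b) / np.linalg.norm(w))
```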
Non-linear SVM and Kernel Trick

In many real-world problems, the data is not linearly separable. In these cases, SVM uses the
kernel trick to map the data into a higher-dimensional feature space where it becomes possible
to find a linear hyperplane that separates the classes.

Using the kernel trick, the SVM's decision boundary in the higher-dimensional space is a
hyperplane, which is equivalent to a non-linear decision boundary in the original input space.
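The effect is easy to see on data that no straight line can separate; a sketch assuming scikit-learn, where make_circles generates synthetic concentric-circle data:

```python
from sklearn.datasets import make_circles
from sklearn.svm import SVC

# Concentric circles: impossible to separate with a straight line
X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

linear = SVC(kernel='linear').fit(X, y)
rbf = SVC(kernel='rbf', gamma='scale').fit(X, y)

print("linear kernel accuracy:", linear.score(X, y))  # poor, roughly 0.5
print("RBF kernel accuracy:  ", rbf.score(X, y))      # near 1.0
```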

SVM for Multi-class Classification

SVM is inherently a binary classifier, but it can be extended to multi-class classification problems
using strategies like:

● One-vs-One (OvO): Create a classifier for every pair of classes. If there are k
classes, then the total number of classifiers is k(k−1)/2
● One-vs-Rest (OvR): Create a classifier for each class, where each classifier
distinguishes one class from all other classes.
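A short sketch contrasting the two strategies on the three-class Iris dataset (scikit-learn assumed; its SVC uses OvO internally, while OneVsRestClassifier builds the OvR scheme explicitly):

```python
from sklearn.datasets import load_iris
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)  # k = 3 classes

# SVC trains One-vs-One internally: k(k-1)/2 = 3 pairwise classifiers
ovo = SVC(kernel='linear').fit(X, y)

# One-vs-Rest wraps one classifier per class: k = 3 classifiers
ovr = OneVsRestClassifier(SVC(kernel='linear')).fit(X, y)

print(ovo.score(X, y), ovr.score(X, y))
```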
Advantages of SVM

● Effective in high-dimensional spaces: SVM works well in spaces with more features than samples.
● Memory efficient: SVM uses only a subset of training points (the support vectors) in the decision function.
● Robust to overfitting: Especially in high-dimensional spaces, given an appropriately chosen regularization parameter.

Disadvantages of SVM

● Computationally intensive: SVM can be slow for large datasets because of the
quadratic optimization problem it involves.
● Choice of kernel and hyperparameters: The performance of SVM is highly
dependent on the correct selection of kernel, C, and other hyperparameters.

Applications:

● Text Classification: Used in spam detection, sentiment analysis, and document categorization.
● Image Classification & Object Recognition: Applied in face recognition,
handwriting recognition, and medical image analysis (e.g., tumor detection).
● Biological & Biomedical: Used in gene expression data analysis, cancer detection,
and protein classification.
● Finance: Applied in credit scoring, fraud detection, and stock market predictions.
● Speech Recognition: Used for classifying speech data in voice recognition systems.
● Anomaly Detection: Detects outliers or abnormal behavior in various fields like
network security and manufacturing.

Soft Margin SVM: A hard-margin SVM requires every training point to lie on the correct side of the margin, which is impossible for noisy or overlapping data. A soft-margin SVM introduces slack variables that allow some points to violate the margin; the regularization parameter C controls the trade-off between a wide margin and few violations (a large C penalizes violations heavily, while a small C tolerates them in exchange for a wider margin).

Playlist (for problems related to SVM):
https://www.youtube.com/playlist?list=PL4gu8xQu0_5JdEqxU_FcY2x2aPte_BzF5
Metrics for SVM

An SVM classifier is evaluated with the standard classification metrics: accuracy, precision, recall, F1-score, and the confusion matrix. On imbalanced datasets, precision, recall, and F1 are more informative than accuracy alone.

Error Correction and Hyperparameter Tuning

SVM models can be sensitive to hyperparameters, and improper selection can lead to poor
performance. Key hyperparameters include:

● C: Regularization parameter that controls the trade-off between margin size and
classification error.
● Kernel: Type of kernel to use (e.g., linear, RBF, polynomial).
● Gamma (for RBF kernel): A parameter that controls the influence of a single training
example on the decision boundary.

Techniques like grid search or random search can be used to tune these hyperparameters and
find the optimal configuration. Cross-validation is also essential to evaluate the model's
generalization ability.
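As a sketch of this workflow with scikit-learn (the dataset choice and parameter-grid values are illustrative, not recommended defaults):

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)

param_grid = {
    'C': [0.1, 1, 10, 100],           # margin-vs-error trade-off
    'gamma': ['scale', 0.01, 0.001],  # RBF kernel width
    'kernel': ['rbf', 'linear'],
}
# 5-fold cross-validation over every combination in the grid
search = GridSearchCV(SVC(), param_grid, cv=5)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```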
