
Lecture 2.3

• Error Analysis
• Train/Test Split, validation set
• Confusion Matrix
• Accuracy, Precision, Recall, F-measure, ROC curve

Dr. Mainak Biswas


Train/Test Split in Machine Learning
• Train-test split is a machine learning technique that divides a dataset into two subsets: a training set and a testing set
• It's a model validation process that helps assess how well a machine learning model will perform on new data
• Typical Split Ratios (a minimal code sketch follows this list)
– 80% for training and 20% for testing
– 70% for training and 30% for testing
– 90% for training and 10% for testing (for large datasets)
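A minimal sketch of an 80/20 split using scikit-learn's train_test_split; the Iris dataset, the variable names, and the fixed random seed are illustrative assumptions, not part of the lecture:

```python
# 80/20 train/test split with scikit-learn (illustrative example).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)   # any feature matrix X and label vector y work here

X_train, X_test, y_train, y_test = train_test_split(
    X, y,
    test_size=0.2,      # hold out 20% of the samples for testing
    random_state=42,    # fixed seed so the split is reproducible
    stratify=y          # keep class proportions the same in both subsets
)
print(X_train.shape, X_test.shape)  # (120, 4) (30, 4)
```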
Validation Set
• The validation set is an additional subset of the dataset used to tune the model's hyper-parameters and evaluate its performance during training
• It acts as an intermediary between the training set and the test set
• Purpose of a Validation Set
– Hyper-parameter Tuning
– Early Stopping
– Model Selection
• Train/Validation/Test Split (a code sketch of a three-way split follows this list)
– Training Set: Used to train the model
– Validation Set: Used to tune hyper-parameters and evaluate the model during training
– Test Set: Used to assess the final performance on unseen data
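A minimal sketch of a three-way split obtained with two successive calls to train_test_split; the 70/15/15 ratios, the dataset, and the variable names are illustrative assumptions, not part of the lecture:

```python
# 70/15/15 train/validation/test split via two successive splits (illustrative).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)

# First hold out 15% of the data as the final test set.
X_rest, X_test, y_rest, y_test = train_test_split(
    X, y, test_size=0.15, random_state=42, stratify=y
)
# Then carve the validation set out of the remainder
# (0.15 / 0.85 of the remaining samples is ~15% of the original data).
X_train, X_val, y_train, y_val = train_test_split(
    X_rest, y_rest, test_size=0.15 / 0.85, random_state=42, stratify=y_rest
)
print(len(X_train), len(X_val), len(X_test))  # roughly a 70/15/15 split of the 150 samples
```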
Confusion Matrix



$\text{Accuracy} = \dfrac{TP + TN}{TP + TN + FP + FN}$

Recall measures the proportion of correctly predicted positive observations out of all actual positives.

$\text{True positive rate (TPR), recall, sensitivity (SEN)} = \dfrac{TP}{TP + FN} = \dfrac{TP}{P}$

Precision measures the proportion of correctly predicted positive observations out of all predicted positives.

$\text{Precision} = \dfrac{TP}{TP + FP}$

The F-score (or F1-score) is a metric that combines precision and recall into a single score, providing a balance between the two. It is especially useful when the data is imbalanced.

$F\text{-Measure} = \dfrac{2 \cdot \text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}$

False Positive Rate (FPR) is a measure used in binary classification to quantify how often a model incorrectly predicts a positive outcome for a negative instance.

$\text{False positive rate (type-I error)} = \dfrac{FP}{FP + TN}$
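A minimal sketch computing these metrics directly from the TP/TN/FP/FN counts; the helper name compute_metrics is an illustrative assumption, and the counts are the ones used in the worked example later in the lecture:

```python
# Compute accuracy, recall, precision, F-measure and FPR from raw counts.
def compute_metrics(tp, tn, fp, fn):
    accuracy  = (tp + tn) / (tp + tn + fp + fn)
    recall    = tp / (tp + fn)                        # TPR / sensitivity
    precision = tp / (tp + fp)
    f_measure = 2 * precision * recall / (precision + recall)
    fpr       = fp / (fp + tn)                        # type-I error rate
    return accuracy, recall, precision, f_measure, fpr

# Counts from the confusion-matrix example on a later slide: TP=8, TN=5, FP=5, FN=2
print(compute_metrics(tp=8, tn=5, fp=5, fn=2))
# (0.65, 0.8, 0.615..., 0.695..., 0.5)
```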
ROC Curve
• An ROC (Receiver Operating Characteristic) plot is a graphical representation used to evaluate the performance of a binary classification model
• It illustrates the trade-off between the True Positive Rate (TPR) and the False Positive Rate (FPR) at various threshold settings for a classifier; the two rates are defined below
$\text{True positive rate (TPR), recall, sensitivity (SEN)} = \dfrac{TP}{TP + FN} = \dfrac{TP}{P}$

$\text{False positive rate (type-I error)} = \dfrac{FP}{FP + TN}$
Components of ROC Curve
• X-axis: False Positive Rate (FPR)
• Y-axis: True Positive Rate (TPR)
• Curve: Plots TPR against FPR for various threshold values
• Diagonal Line: Represents a random classifier (no predictive power)
– The area under this line is 0.5
• Area Under the Curve (AUC): The AUC score measures the overall performance of the model (see the code sketch after this list)
– An AUC of 1.0 indicates a perfect classifier, while 0.5 indicates a model with no discriminative ability
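A minimal sketch of tracing an ROC curve and computing its AUC with scikit-learn; the logistic-regression model, the synthetic dataset, and the variable names are illustrative assumptions, not part of the lecture:

```python
# ROC curve and AUC for a binary classifier's predicted probabilities (illustrative).
import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_curve, roc_auc_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

model = LogisticRegression().fit(X_train, y_train)
scores = model.predict_proba(X_test)[:, 1]          # probability of the positive class

fpr, tpr, thresholds = roc_curve(y_test, scores)    # TPR/FPR at each threshold
auc = roc_auc_score(y_test, scores)

plt.plot(fpr, tpr, label=f"ROC curve (AUC = {auc:.2f})")
plt.plot([0, 1], [0, 1], linestyle="--", label="Random classifier (AUC = 0.5)")
plt.xlabel("False Positive Rate")
plt.ylabel("True Positive Rate")
plt.legend()
plt.show()
```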
Confusion Matrix Generation
The 20 samples, listed as (Predicted, True) pairs:

(1,1) (1,1) (1,0) (1,1) (1,0) (1,0) (1,1) (1,1) (0,0) (0,0)
(0,1) (1,1) (1,0) (0,0) (0,0) (1,1) (1,1) (1,0) (0,1) (0,0)

Resulting confusion matrix:

                            | Actually Positive (P) (1) | Actually Negative (N) (0)
Predicted Positive (PP) (1) | 8 (TP)                    | 5 (FP)
Predicted Negative (PN) (0) | 2 (FN)                    | 5 (TN)

Metrics computed from this matrix:

$\text{False positive rate (type-I error)} = \dfrac{FP}{FP + TN} = \dfrac{5}{10} = 0.5$

$\text{Accuracy} = \dfrac{8 + 5}{8 + 5 + 5 + 2} = 0.65$

$\text{Recall, sensitivity (SEN)} = \dfrac{TP}{TP + FN} = \dfrac{8}{8 + 2} = 0.8$

$\text{Precision} = \dfrac{TP}{TP + FP} = \dfrac{8}{13} = 0.62$

$F\text{-Measure} = \dfrac{2 \times 0.62 \times 0.8}{0.62 + 0.8} = 0.70$
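A minimal sketch reproducing this slide's confusion matrix and metrics with scikit-learn, using the 20 (Predicted, True) pairs listed above:

```python
# Verify the worked example with scikit-learn's metric functions.
from sklearn.metrics import confusion_matrix, accuracy_score, precision_score, recall_score, f1_score

y_pred = [1,1,1,1,1,1,1,1,0,0,0,1,1,0,0,1,1,1,0,0]
y_true = [1,1,0,1,0,0,1,1,0,0,1,1,0,0,0,1,1,0,1,0]

# confusion_matrix orders rows/columns by label [0, 1], so the flattened
# matrix unpacks as tn, fp, fn, tp.
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tp, fp, fn, tn)                  # 8 5 2 5
print(accuracy_score(y_true, y_pred))  # 0.65
print(recall_score(y_true, y_pred))    # 0.8
print(precision_score(y_true, y_pred)) # 0.615...
print(f1_score(y_true, y_pred))        # 0.695...
```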
ROC Generation
The same 20 (Predicted, True) samples as on the previous slide:

(1,1) (1,1) (1,0) (1,1) (1,0) (1,0) (1,1) (1,1) (0,0) (0,0)
(0,1) (1,1) (1,0) (0,0) (0,0) (1,1) (1,1) (1,0) (0,1) (0,0)
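A minimal sketch of generating the ROC points for this data with scikit-learn. Note that the slide lists only hard 0/1 predictions rather than continuous scores, so the curve collapses to a single operating point (TPR = 0.8, FPR = 0.5); with probability scores the same call would trace the full curve:

```python
# ROC points and AUC from the slide's hard 0/1 predictions (illustrative).
from sklearn.metrics import roc_curve, roc_auc_score

y_pred = [1,1,1,1,1,1,1,1,0,0,0,1,1,0,0,1,1,1,0,0]
y_true = [1,1,0,1,0,0,1,1,0,0,1,1,0,0,0,1,1,0,1,0]

fpr, tpr, thresholds = roc_curve(y_true, y_pred)
print(list(zip(fpr, tpr)))            # [(0.0, 0.0), (0.5, 0.8), (1.0, 1.0)]
print(roc_auc_score(y_true, y_pred))  # 0.65
```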
