Ensemble Classification
AI42001
Different classifiers, different results
f1(X) = Y1
f2(X) = Y2
...
fK(X) = YK
• How to choose a single answer??
Simple Voting
• Classification: mode of all predictions!
• Regression: mean of all predictions!
• Example: six classifiers vote on the label of one test example
  classifier:  f1  f2  f3  f4  f5  f6
  label:        1   1   2   1   3   2
• Final prediction: Y = 1, the mode of the six votes (label 1 is predicted three times); a small sketch of this voting rule follows
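A minimal sketch of simple voting in NumPy, assuming the predictions of the K classifiers have already been collected into an array (the function and variable names are illustrative, not from the slides):

```python
import numpy as np

def vote_classification(preds):
    """preds: (K, n) array of integer labels from K classifiers for n examples.
    Returns the per-example mode (majority vote)."""
    # np.bincount counts the votes for each label; argmax picks the most frequent
    return np.array([np.bincount(col).argmax() for col in preds.T])

def vote_regression(preds):
    """preds: (K, n) array of real-valued predictions. Returns the per-example mean."""
    return preds.mean(axis=0)

# The example above: six classifiers predict labels 1, 1, 2, 1, 3, 2 for one example
votes = np.array([[1], [1], [2], [1], [3], [2]])
print(vote_classification(votes))   # -> [1], the mode of the six votes
```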
How to have many classifiers?
• Same training set, different classifier functions
• Split the dataset into multiple parts; train a classifier on each part
• Select different subsets of features; train a classifier for each subset
[Figure: examples of random feature subsets drawn from features X1–X10, with a separate classifier trained on each subset]
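To make the first strategy concrete (same training set, different classifier functions), here is a sketch using scikit-learn; the library, the toy data, and the choice of three model types are assumptions for illustration only:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))               # toy feature matrix
y = (X[:, 0] + X[:, 1] > 0).astype(int)     # toy binary labels

# Same training set, three different classifier functions
models = [
    DecisionTreeClassifier(max_depth=3).fit(X, y),
    LogisticRegression().fit(X, y),
    KNeighborsClassifier(n_neighbors=5).fit(X, y),
]

# Collect the K predictions and take their mode (simple voting)
preds = np.stack([m.predict(X) for m in models])                        # shape (K, n)
ensemble_pred = np.array([np.bincount(col).argmax() for col in preds.T])
```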
Bootstrap Aggregation
• Given a training set of size N:
• Choose N examples from it with replacement
• i-th draw: each training example may be chosen with probability 1/N
• Same example may be chosen multiple times!
• Each bootstrap “version” still contains N training examples, but some examples appear more than once and others (about 37% on average) are left out entirely
[Figure: an original training set X(1)–X(5) alongside several bootstrap samples drawn from it with replacement; within each sample some examples repeat and others are absent]
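A minimal sketch of the bootstrap-sampling step in NumPy; the `train` argument is a placeholder for any base-classifier training routine and is not from the slides:

```python
import numpy as np

def bagging_fit(X, y, train, K=10, seed=0):
    """Train K classifiers, each on a bootstrap sample of the N training examples."""
    rng = np.random.default_rng(seed)
    N = len(X)
    models = []
    for _ in range(K):
        # N draws with replacement: each draw picks any example with probability 1/N,
        # so some examples repeat and others are left out of this "version"
        idx = rng.integers(0, N, size=N)
        models.append(train(X[idx], y[idx]))
    return models
```

With scikit-learn, `train` could be something like `lambda X, y: DecisionTreeClassifier().fit(X, y)`, which is close to how random forests build their trees (random forests additionally subsample the features considered at each split).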
BAGGing of features
• Instead of choosing subsets of training examples
• We can choose subsets of the features!
• Each “version” has all the training examples, but only some of the
features!
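A sketch of feature bagging under the same assumptions: every “version” keeps all N examples but only a random subset of m columns (m is an illustrative choice, and `train` is again a placeholder):

```python
import numpy as np

def feature_bagging_fit(X, y, train, K=10, m=3, seed=0):
    """Train K classifiers, each on all examples but a random subset of m features."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    models = []
    for _ in range(K):
        cols = rng.choice(d, size=m, replace=False)    # pick m distinct feature indices
        models.append((cols, train(X[:, cols], y)))    # remember which columns were used
    return models

def feature_bagging_predict(models, X):
    # Each classifier votes using only its own feature subset
    preds = np.stack([clf.predict(X[:, cols]) for cols, clf in models])
    return np.array([np.bincount(col).argmax() for col in preds.T])
```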
• Some classifiers can be trained directly on weighted examples, which boosting (below) relies on:
• Weighted K-NN: take a weighted vote of each class label among the K nearest neighbors
• Weighted Decision Tree: at each node, use the “weighted relative frequencies” of each class label
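For instance, a weighted K-NN vote can be written directly in NumPy; the example-weight vector `w` plays the role of the per-example weights that boosting maintains (all names here are illustrative):

```python
import numpy as np

def weighted_knn_predict(X_train, y_train, w, x, K=5):
    """Predict the label of a single query x by a weighted vote among its K nearest neighbors."""
    dist = np.linalg.norm(X_train - x, axis=1)   # Euclidean distance to every training example
    nn = np.argsort(dist)[:K]                    # indices of the K nearest neighbors
    # Sum the example weights per class label instead of counting one vote each
    scores = np.bincount(y_train[nn], weights=w[nn])
    return scores.argmax()
```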
Boosting
• 1) Take a “weak” classifier (just better than random)
• 2) Compute the error of the model on each training example
• 3) Identify the examples on which it makes mistakes
• 4) Increase the “weights” of such examples and retrain classifier
• 5) Add the updated classifier to the ensemble
• 6) Go back to step 2 and repeat
AdaBoost Algorithm
• Initialize weights of each training sample wi = 1/N
• For iteration t = 1 to max_iter
• Learn a classifier ht on training examples according to weights w
• Calculate the weighted error: et = ∑i wi · I(ht(xi) ≠ yi)
• Set the weight of ht: at = 0.5 · log((1 − et) / et)
• Update the weight of each training example
wi ← wi · exp(−at) if ht(xi) = yi (correct classification: decrease weight)
wi ← wi · exp(at) if ht(xi) ≠ yi (wrong classification: increase weight)
• Normalize the weights so that they add up to 1
• End
• Output: all the classifiers “h” along with their weights “a”
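A compact sketch of the loop above, using scikit-learn decision stumps as the weak classifier (a common but assumed choice) and the slide's weight-update rule:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def adaboost_fit(X, y, max_iter=50):
    """Returns a list of (weak classifier h, weight a) pairs."""
    N = len(X)
    w = np.full(N, 1.0 / N)                     # initialize weights wi = 1/N
    ensemble = []
    for t in range(max_iter):
        # Weak learner trained on the weighted examples
        h = DecisionTreeClassifier(max_depth=1).fit(X, y, sample_weight=w)
        wrong = h.predict(X) != y
        e = np.sum(w * wrong)                   # weighted error et
        if e <= 0 or e >= 0.5:                  # perfect, or no better than random: stop
            if e <= 0:
                ensemble.append((h, 1.0))
            break
        a = 0.5 * np.log((1 - e) / e)           # weight of this classifier
        # Decrease the weights of correct examples, increase the weights of mistakes
        w = w * np.exp(np.where(wrong, a, -a))
        w = w / w.sum()                         # normalize so the weights add up to 1
        ensemble.append((h, a))
    return ensemble

def adaboost_predict(ensemble, X, classes=(0, 1)):
    # Weighted vote of the weak classifiers
    scores = np.zeros((len(X), len(classes)))
    for h, a in ensemble:
        pred = h.predict(X)
        for k, c in enumerate(classes):
            scores[:, k] += a * (pred == c)
    return np.asarray(classes)[scores.argmax(axis=1)]
```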
Mixture of Experts
• Different classifiers may be more effective in different parts of feature
space
• Weights of classifiers should be dependent on features
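A toy sketch of this idea: a gating function maps the input features to a set of classifier weights, so different experts dominate in different regions of feature space. The linear gate parameters here are illustrative placeholders; in a real mixture of experts they are learned jointly with the experts:

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def moe_predict(experts, gate_W, gate_b, X):
    """experts: list of K fitted classifiers exposing predict_proba;
    gate_W (d, K) and gate_b (K,): a linear gating function over the d features."""
    gates = softmax(X @ gate_W + gate_b)                               # (n, K): weight per expert
    probs = np.stack([e.predict_proba(X) for e in experts], axis=1)    # (n, K, C): class probabilities
    mixed = (gates[:, :, None] * probs).sum(axis=1)                    # feature-dependent weighted average
    return mixed.argmax(axis=1)                                        # predicted class index
```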
Summary
• Complex learning problems can often be solved by effectively combining multiple weak models via ensemble learning methods
• Simple ones: Voting/averaging or stacking
• Bagging: Random Forests
• Boosting: AdaBoost
• Mixture of Experts
• These models help reduce variance or overfitting, and may have
computational benefits over more complex classification algorithms
One-vs-one Classification
• Handle multi-class problems by training one binary classifier for every pair of classes; the final label is chosen by voting over all pairwise classifiers
One-vs-all Classification
• Handle multi-class problems by training one binary classifier per class (that class vs. all the rest); the final label is the class whose classifier is most confident