Decision Tree, Random Forest & Naive Bayes

I. Decision Tree

Decision Tree
✔ Decision Trees are widely used algorithms for supervised machine learning.
✔ A Decision Tree consists of a series of sequential decisions, or decision nodes, on some
data set's features.
✔ The resulting flow-like structure is navigated via conditional control statements, or if-then
rules, which split each decision node into two or more subnodes.
✔ Leaf nodes, also known as terminal nodes, represent prediction outputs for the model.

Figure 1. Decision Tree Structure Illustration


Entropy
Entropy measures the amount of information in some variable or event. It can be used to identify regions consisting of a large number of similar (pure) or dissimilar (impure) elements.

The entropy can be used to quantify the impurity of a collection of labeled data points: a node containing multiple classes is impure, whereas a node containing only one class is pure.
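
Written out, for a node whose samples fall into C classes with proportions p_i (the fraction of the node's samples belonging to class i), the entropy in bits is:

H = -\sum_{i=1}^{C} p_i \log_2 p_i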

Entropy Properties
✔ Entropy is zero when all data points in a node belong to a single class (a pure node).
✔ Entropy is maximal (1 bit for a two-class problem) when the classes are evenly mixed (maximally impure).

Information Gain
✔ Information gain measures the amount of information we gain by making a split.
✔ The idea is to subtract from the entropy of our data before the split the entropy of each
possible partition thereafter.
✔ Then select the split that yields the largest reduction in entropy, or equivalently, the largest
increase in information, as written out below.
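
Using the entropy defined above and weighting each child node by the fraction of the N parent samples it receives (N_child / N), the gain of a split is:

\Delta IG = H(\text{parent}) - \sum_{\text{children}} \frac{N_{\text{child}}}{N}\, H(\text{child})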

ID3 Algorithm
✔ The core algorithm that builds the tree by maximizing information gain is called ID3.
✔ It is a recursive procedure that starts from the root node of the tree and iterates top-down
over all non-leaf branches in a greedy manner.
✔ At each depth it calculates the difference in entropy, i.e. the information gain ΔIG defined above.

ID3 Algorithm Steps


1. Calculate the entropy associated with every feature of the data set.
2. Partition the data set into subsets using different features and cutoff values. For each,
compute the information gain ΔIG as the difference in entropy before and after the split,
using the formula above. For the total entropy of all child nodes after the split, use the
weighted average, taking into account N_child, i.e. how many of the N samples end up on
each child branch.
3. Identify the partition that leads to the maximum information gain. Create a decision node
on that feature and split value.
4. When no further splits can be done on a subset, create a leaf node and label it with the
most common class of the data points within it if doing classification, or with the average
value if doing regression.
5. Recurse on all subsets. Recursion stops if after a split all elements in a child node are of the
same type. Additional stopping conditions may be imposed, such as requiring a minimum
number of samples per leaf to continue splitting, or finishing when the trained tree has
reached a given maximum depth.
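
As an illustration of steps 1 to 3 above, here is a minimal Python sketch (illustrative only, with hypothetical labels) that computes the entropy of a labeled set and the information gain of a candidate split:

import numpy as np
from collections import Counter

def entropy(labels):
    # Shannon entropy (in bits) of a list of class labels
    counts = np.array(list(Counter(labels).values()), dtype=float)
    p = counts / counts.sum()
    return float(-np.sum(p * np.log2(p)))

def information_gain(parent_labels, child_label_groups):
    # Entropy before the split minus the weighted average entropy of the children
    n = len(parent_labels)
    children = sum(len(g) / n * entropy(g) for g in child_label_groups)
    return entropy(parent_labels) - children

# Toy example: 10 samples (6 "yes", 4 "no") split into two child nodes
parent = ["yes"] * 6 + ["no"] * 4
left   = ["yes"] * 5 + ["no"] * 1
right  = ["yes"] * 1 + ["no"] * 3
print(information_gain(parent, [left, right]))   # the split with the largest value is chosen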

The Problem of Perturbations

✔ Decision Trees can be extremely sensitive to small perturbations in the data: a minor change
in the training examples can result in a drastic change in the structure of the tree.
✔ Example: small random Gaussian perturbations on just 5% of the training examples can create
a set of completely different Decision Trees.
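
One way to try this yourself, as a minimal sketch assuming the built-in Iris dataset (how much the tree structure changes will vary with the noise and the data):

import numpy as np
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_iris(return_X_y=True)
rng = np.random.default_rng(0)

# Perturb about 5% of the training examples with small Gaussian noise and refit
idx = rng.choice(len(X), size=int(0.05 * len(X)), replace=False)
X_perturbed = X.copy()
X_perturbed[idx] += rng.normal(scale=0.5, size=X_perturbed[idx].shape)

for data in (X, X_perturbed):
    tree = DecisionTreeClassifier(random_state=0).fit(data, y)
    print(export_text(tree, max_depth=2))   # compare the top splits of the two trees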

II. Random Forest

Random Forest
Condorcet's Jury Theorem says that if each person is more than 50% correct, then adding more
people to vote increases the probability that the majority is correct.
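
A quick numerical illustration of the theorem, assuming each voter is independently correct with probability 0.6:

from math import comb

def majority_correct(p, n):
    # Probability that more than half of n independent voters (each correct with
    # probability p) are correct; assumes an odd n so there are no ties
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range((n // 2) + 1, n + 1))

for n in (1, 11, 101):
    print(n, round(majority_correct(0.6, n), 3))   # grows toward 1 as n increases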

Figure 2. Ensemble Principle Illustration

Ensemble learning
✔ Ensemble learning creates a stronger model by aggregating the predictions of multiple
weak models, such as decision trees.

Bagging
✔ Bagging, or bootstrap aggregation, is a technique for reducing the variance of an estimated
prediction function.
✔ For classification, a committee of trees each cast a vote for the predicted class.

Bagging Method
✔ One way to produce multiple models that are different is to train each model using a
different training set.
✔ The Bagging (Bootstrap Aggregating) method randomly draws a fixed number of samples
from the training set with replacement (this means that a data point can be drawn more
than once).
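
A minimal NumPy sketch of drawing one bootstrap sample (scikit-learn's ensemble classes do this internally), using toy data:

import numpy as np

rng = np.random.default_rng(0)
X = np.arange(10).reshape(-1, 1)        # 10 toy samples, one feature
y = np.array([0, 1] * 5)                # toy labels

# Draw N indices with replacement: some points appear more than once, some not at all
idx = rng.integers(0, len(X), size=len(X))
X_boot, y_boot = X[idx], y[idx]
print(idx)                              # repeated indices show sampling with replacement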

Random forest classifier


✔ The random forest classifier is an extension of bagging that uses de-correlated trees.

Figure 3. Random Forest Classifier Illustration

Variance in Composition
✔ The inventor of the random forest model, Leo Breiman, says in his paper: "[o]ur results
indicate that better (lower generalization error) random forests have lower correlation
between classifiers and higher strength."

✔ The high variance of the decision tree model can help keep the correlation among trees
low. Bagging and random feature selection (considering only a random subset of features at
each split) are the key innovations that keep this correlation low, as sketched below.
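
A minimal scikit-learn sketch of these two ideas, bootstrap sampling plus a random subset of features at each split, on assumed synthetic data:

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Assumed synthetic data, purely for illustration
X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# bootstrap=True resamples the training set for every tree (bagging);
# max_features="sqrt" considers only a random subset of features at each split (de-correlation)
forest = RandomForestClassifier(n_estimators=100, bootstrap=True,
                                max_features="sqrt", random_state=0)
forest.fit(X, y)
print(forest.score(X, y))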

III. Naive Bayes
Naive Bayes
✔ The Naïve Bayes algorithm is a supervised learning algorithm based on Bayes' theorem
and used for solving classification problems.
✔ Naïve: It is called naïve because it assumes that the occurrence of a certain feature is
independent of the occurrence of other features. For example, if a fruit is identified on the
basis of color, shape, and taste, then a red, spherical, and sweet fruit is recognized as an
apple. Hence each feature individually contributes to identifying it as an apple, without
depending on the others.
✔ Bayes: It is called Bayes because it depends on the principle of Bayes' Theorem.

Bayes' Theorem
✔ Bayes' theorem is also known as Bayes' Rule or Bayes' Law; it is used to determine the
probability of a hypothesis given prior knowledge, and it depends on conditional probability.
✔ The formula for Bayes' theorem is given as:
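
P(A|B) = \frac{P(B|A)\, P(A)}{P(B)}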

Where,
⮚ P(A|B) is Posterior probability: Probability of hypothesis A given the observed event B.
⮚ P(B|A) is Likelihood probability: Probability of the evidence B given that hypothesis A
is true.
⮚ P(A) is Prior Probability: Probability of hypothesis before observing the evidence.
⮚ P(B) is Marginal Probability: Probability of Evidence.

Steps:
1. Convert the given dataset into frequency tables.
2. Generate a likelihood table by finding the probabilities of the given features.
3. Use Bayes' theorem to calculate the posterior probability (see the sketch below).
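
A small pandas sketch of these three steps on hypothetical toy data (only an Outlook feature is used, purely for illustration):

import pandas as pd

# Hypothetical toy data: 10 days of weather and whether a match was played
df = pd.DataFrame({
    "Outlook": ["Sunny", "Sunny", "Overcast", "Rainy", "Rainy",
                "Overcast", "Sunny", "Rainy", "Overcast", "Sunny"],
    "Play":    ["No", "No", "Yes", "Yes", "No",
                "Yes", "Yes", "Yes", "Yes", "No"],
})

# Step 1: frequency table of Outlook vs Play
freq = pd.crosstab(df["Outlook"], df["Play"])

# Step 2: likelihoods P(Outlook | Play) and priors P(Play)
likelihood = freq / freq.sum(axis=0)
prior = df["Play"].value_counts(normalize=True)

# Step 3: posterior P(Play = Yes | Outlook = Sunny) via Bayes' theorem
evidence = (df["Outlook"] == "Sunny").mean()            # P(Sunny)
posterior = likelihood.loc["Sunny", "Yes"] * prior["Yes"] / evidence
print(posterior)                                        # 0.25 for this toy data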

Advantages of Naïve Bayes Classifier


✔ Naïve Bayes is one of the fastest and easiest ML algorithms for predicting the class of a dataset.
✔ It can be used for binary as well as multi-class classification.
✔ It performs well in multi-class predictions compared to many other algorithms.
✔ It is a popular choice for text classification problems.

Disadvantages of Naïve Bayes Classifier


✔ Naïve Bayes assumes that all features are independent or unrelated, so it cannot learn the
relationship between features.

Applications of Naïve Bayes Classifier


✔ It is used for Credit Scoring.
✔ It is used in medical data classification.
✔ It can be used in real-time predictions because Naïve Bayes Classifier is an eager learner.
✔ It is used in Text classification such as Spam filtering and Sentiment analysis.

IV. Experiment
- Paste the following code into Google Colab, then run the trials.

Decision Tree
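
A minimal sketch of a Decision Tree trial, assuming scikit-learn's built-in Iris dataset:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.metrics import accuracy_score
import matplotlib.pyplot as plt

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

tree = DecisionTreeClassifier(criterion="entropy", max_depth=3, random_state=0)
tree.fit(X_train, y_train)
print("Accuracy:", accuracy_score(y_test, tree.predict(X_test)))

plot_tree(tree, filled=True)    # visualize the decision nodes and leaves
plt.show()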

Random Forest For Classifying Digits
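
A minimal sketch, assuming scikit-learn's built-in digits dataset (8x8 grayscale images of handwritten digits 0 to 9):

from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report

digits = load_digits()
X_train, X_test, y_train, y_test = train_test_split(
    digits.data, digits.target, test_size=0.25, random_state=0)

model = RandomForestClassifier(n_estimators=200, random_state=0)
model.fit(X_train, y_train)
print(classification_report(y_test, model.predict(X_test)))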

Random Forest example 2
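
A second sketch, here shown as a Random Forest regressor on the petrol_consumption data from reference 2 (the target column name is assumed from that repository; adjust if it differs):

import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_error

# Assumes the CSV from reference 2 with a 'Petrol_Consumption' target column
df = pd.read_csv("petrol_consumption.csv")
X = df.drop("Petrol_Consumption", axis=1)
y = df["Petrol_Consumption"]

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
reg = RandomForestRegressor(n_estimators=200, random_state=0)
reg.fit(X_train, y_train)
print("MAE:", mean_absolute_error(y_test, reg.predict(X_test)))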

Naive Bayes (Social Network Ads Example)
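
A minimal sketch, assuming the Social_Network_Ads.csv file from reference 3 with Age, EstimatedSalary, and Purchased columns:

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import confusion_matrix, accuracy_score

# Assumes Social_Network_Ads.csv (reference 3); adjust column names to the actual file
df = pd.read_csv("Social_Network_Ads.csv")
X = df[["Age", "EstimatedSalary"]].values
y = df["Purchased"].values

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

nb = GaussianNB()
nb.fit(X_train, y_train)
y_pred = nb.predict(X_test)
print(confusion_matrix(y_test, y_pred))
print("Accuracy:", accuracy_score(y_test, y_pred))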

Naive Bayes (Play or No)
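
A minimal sketch of the classic "play or not" example with hypothetical categorical weather data and scikit-learn's CategoricalNB:

from sklearn.preprocessing import OrdinalEncoder, LabelEncoder
from sklearn.naive_bayes import CategoricalNB

# Hypothetical toy data: weather and temperature vs. whether to play
weather = ["Sunny", "Sunny", "Overcast", "Rainy", "Rainy", "Rainy", "Overcast",
           "Sunny", "Sunny", "Rainy", "Sunny", "Overcast", "Overcast", "Rainy"]
temp    = ["Hot", "Hot", "Hot", "Mild", "Cool", "Cool", "Cool",
           "Mild", "Cool", "Mild", "Mild", "Mild", "Hot", "Mild"]
play    = ["No", "No", "Yes", "Yes", "Yes", "No", "Yes",
           "No", "Yes", "Yes", "Yes", "Yes", "Yes", "No"]

enc = OrdinalEncoder()
X = enc.fit_transform(list(zip(weather, temp)))   # encode categories as integers
le = LabelEncoder()
y = le.fit_transform(play)

model = CategoricalNB()
model.fit(X, y)

pred = model.predict(enc.transform([["Overcast", "Mild"]]))
print(le.inverse_transform(pred))                 # decoded class label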

Naive Bayes (weather)
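
A minimal sketch, assuming the weather.csv file from reference 1 contains numeric features and a categorical RainTomorrow target (adjust the column name to the actual file):

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

# Assumes weather.csv from reference 1 with a categorical 'RainTomorrow' column
df = pd.read_csv("weather.csv").dropna()
X = df.select_dtypes(include="number")          # keep only numeric features
y = df["RainTomorrow"]

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)
nb = GaussianNB()
nb.fit(X_train, y_train)
print("Accuracy:", accuracy_score(y_test, nb.predict(X_test)))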

V. Reference
1. https://www.kaggle.com/datasets/zaraavagyan/weathercsv
2. https://github.com/likarajo/petrol_consumption
3. https://www.kaggle.com/datasets/rakeshrau/social-network-ads?resource=download
4. https://realpython.com/logistic-regression-python/
5. https://www.datacamp.com/community/tutorials/understanding-logistic-regression-python
6. https://www.analyticsvidhya.com/blog/2021/05/machine-learning-with-python-logistic-regression/
7. https://www.analyticsvidhya.com/blog/2021/04/beginners-guide-to-logistic-regression-using-python/
8. https://mlu-explain.github.io/decision-tree/
9. https://mlu-explain.github.io/random-forest/
