
Faculty of Engineering & Technology
Subject-Name: Machine Learning Laboratory
Subject-Code: 303105354
B.Tech – 3rd Year – 6th Sem
Name: Yash Chudgar
Enrollment No: 2203031050150

Practical: 6
Aim: Write a program to demonstrate the working of the decision tree-based
ID3 algorithm.
The ID3 algorithm is a popular algorithm used to create a Decision Tree by selecting the
attribute that maximizes the information gain at each node. This program will:
1. Build a simple dataset.
2. Implement the ID3 algorithm to construct the decision tree.
3. Print the tree structure.
Explanation:
1. Entropy Calculation:
○ Measures the uncertainty in the dataset.
2. Information Gain:
○ Measures the reduction in entropy after splitting the dataset on an attribute.
3. ID3 Algorithm:
○ Recursively selects the best attribute to split the data and builds the tree.
The ID3 (Iterative Dichotomiser 3) algorithm is a popular method used to build Decision Trees.
It splits the data based on the attribute that provides the highest Information Gain at each
step, and continues until the data is fully classified or no further splits can be made.

Steps:
1. Dataset:

Outlook   Temperature  Humidity  Windy  PlayTennis
Sunny     Hot          High      False  No
Sunny     Hot          High      True   No
Overcast  Hot          High      False  Yes
Rain      Mild         High      False  Yes
Rain      Cool         Normal    False  Yes
Rain      Cool         Normal    True   No
Overcast  Cool         Normal    True   Yes
Sunny     Mild         High      False  No
Sunny     Cool         Normal    False  Yes
Rain      Mild         Normal    False  Yes
Sunny     Mild         Normal    True   Yes
Overcast  Mild         High      True   Yes
Overcast  Hot          Normal    False  Yes
Rain      Mild         High      True   No

2. Entropy:
Entropy is a measure of uncertainty or impurity in the dataset. It tells us how mixed or pure
a dataset is with respect to its labels. The formula for entropy is:

Entropy(S) = -Σ p_i log2(p_i)

Where:
● p_i is the probability of label i.
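For example, the PlayTennis column in the dataset above has 9 Yes and 5 No labels, so its
entropy is about 0.940. A quick hand check in Python:

import math

# 9 "Yes" and 5 "No" labels out of 14 rows
p_yes, p_no = 9 / 14, 5 / 14
print(-(p_yes * math.log2(p_yes) + p_no * math.log2(p_no)))  # ≈ 0.940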

3. Information Gain:
Information Gain is the reduction in entropy after a dataset is split on an attribute. It is given
by:

Gain(S, A) = Entropy(S) - Σ (|S_v| / |S|) * Entropy(S_v)

where the sum runs over each value v of attribute A, and S_v is the subset of S for which A
takes the value v.
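For example, splitting the dataset above on Outlook gives Sunny (2 Yes / 3 No), Overcast
(4 Yes / 0 No), and Rain (3 Yes / 2 No). A quick hand check, using a throwaway helper h for
two-class entropy:

import math

def h(p, n):
    # Two-class entropy of a subset with p positive and n negative examples
    total = p + n
    return -sum(x / total * math.log2(x / total) for x in (p, n) if x > 0)

gain_outlook = h(9, 5) - (5/14 * h(2, 3) + 4/14 * h(4, 0) + 5/14 * h(3, 2))
print(round(gain_outlook, 3))  # 0.247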


4. Choosing the Best Attribute to Split:


The attribute with the highest information gain is chosen as the best attribute to split the
data.
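For this dataset, running information_gain from the Code section below over every attribute
confirms that Outlook scores highest and so becomes the root (a quick check, assuming df,
entropy, and information_gain are already defined as in the Code section):

for attr in df.columns[:-1]:
    print(attr, round(information_gain(df, attr), 3))
# Outlook 0.247, Temperature 0.029, Humidity 0.152, Windy 0.048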

5. Building the Decision Tree (ID3 Algorithm):


The ID3 algorithm builds the decision tree recursively:
1. Select the best attribute to split the data.
2. Create a node for the attribute.
3. Split the data into subsets based on the values of the attribute.
4. Repeat the process for each subset.

6. Displaying the Decision Tree:


The tree is printed in a readable format using a recursive function.

Summary of Steps:
1. Calculate Entropy to measure uncertainty.
2. Calculate Information Gain for each attribute to find the best split.
3. Split the Dataset on the attribute with the highest gain.
4. Build the Tree recursively until all data is classified or no further splits can be made.

Code:
import math
import pandas as pd

def entropy(data):
    # Entropy of the label column (last column): -sum(p_i * log2(p_i))
    labels = data.iloc[:, -1]
    label_counts = labels.value_counts()
    probabilities = label_counts / len(labels)
    return -sum(probabilities * probabilities.apply(lambda x: math.log2(x) if x > 0 else 0))

def information_gain(data, split_attribute):
    # Gain = entropy before the split minus the weighted entropy of the subsets after it
    total_entropy = entropy(data)
    values = data[split_attribute].unique()
    weighted_entropy = 0
    for value in values:
        subset = data[data[split_attribute] == value]
        weighted_entropy += (len(subset) / len(data)) * entropy(subset)
    return total_entropy - weighted_entropy

def best_attribute_to_split(data):
    # Score every attribute (all columns except the label) and keep the best one
    attributes = data.columns[:-1]
    gains = {attribute: information_gain(data, attribute) for attribute in attributes}
    return max(gains, key=gains.get)

def id3(data, tree=None):
    # Build the tree recursively as nested dicts: {attribute: {value: subtree or label}}
    best_attr = best_attribute_to_split(data)
    if tree is None:
        tree = {}
    tree[best_attr] = {}
    for value in data[best_attr].unique():
        subset = data[data[best_attr] == value]
        if entropy(subset) == 0:
            # Pure subset: store its class label as a leaf
            tree[best_attr][value] = subset.iloc[0, -1]
        else:
            # Mixed subset: recurse on the remaining attributes
            tree[best_attr][value] = id3(subset.drop(columns=[best_attr]))
    return tree

def print_tree(tree, indent=""):
    # Recursively print the nested-dict tree, indenting one level per depth
    if not isinstance(tree, dict):
        print(indent + "Label:", tree)
        return
    for attribute, sub_tree in tree.items():
        print(indent + attribute)
        for value, branch in sub_tree.items():
            print(indent + f" {value}:")
            print_tree(branch, indent + " ")

data = {
    'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rain', 'Rain', 'Rain', 'Overcast',
                'Sunny', 'Sunny', 'Rain', 'Sunny', 'Overcast', 'Overcast', 'Rain'],
    'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool',
                    'Mild', 'Cool', 'Mild', 'Mild', 'Mild', 'Hot', 'Mild'],
    'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal',
                 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal', 'High'],
    'Windy': [False, True, False, False, False, True, True,
              False, False, False, True, True, False, True],
    'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes',
                   'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No']
}

df = pd.DataFrame(data)
decision_tree = id3(df)
print("Decision Tree:")
print_tree(decision_tree)
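Once the tree is built, it can also be used to classify new samples. The helper below is an
illustrative sketch (classify is not part of the lab script); it walks the nested-dict tree
returned by id3:

def classify(tree, sample):
    # Leaf reached: the node itself is the class label
    if not isinstance(tree, dict):
        return tree
    attribute = next(iter(tree))                   # attribute tested at this node
    branch = tree[attribute].get(sample[attribute])
    if branch is None:
        return None                                # attribute value unseen in training
    return classify(branch, sample)

sample = {'Outlook': 'Sunny', 'Temperature': 'Cool', 'Humidity': 'High', 'Windy': True}
print(classify(decision_tree, sample))             # expected: No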
Output:
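(The output screenshot is not reproduced here. Tracing the code on this dataset by hand, the
script is expected to print the classic play-tennis tree, with Outlook at the root and Humidity
and Windy as the second-level splits:)

Decision Tree:
Outlook
 Sunny:
 Humidity
  High:
  Label: No
  Normal:
  Label: Yes
 Overcast:
 Label: Yes
 Rain:
 Windy
  False:
  Label: Yes
  True:
  Label: No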
