C4.5 Decision Tree Step-by-Step Calculations
Step 1: Entropy Calculation
The dataset contains 6 'Pass' results and 4 'Fail' results.
Total examples: 10
Entropy formula:
H(S) = -(p_+ log2(p_+)) - (p_- log2(p_-))
p_+ = 6/10 = 0.6 (Pass), p_- = 4/10 = 0.4 (Fail)
H(S) = -(0.6 log2(0.6)) - (0.4 log2(0.4))
H(S) = 0.442 + 0.529 = 0.971
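This value can be checked numerically. Below is a minimal Python sketch (the entropy helper is illustrative, not part of the original worked example) that computes base-2 entropy from raw class counts:

```python
import math

def entropy(counts):
    """Base-2 (Shannon) entropy of a class distribution given as raw counts."""
    total = sum(counts)
    # Skip zero counts: the limit of p*log2(p) as p -> 0 is 0.
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

print(round(entropy([6, 4]), 3))  # 0.971
```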
Step 2: Information Gain Calculations
1. Assessment (Good, Average, Poor)
- Good: 6 examples (5 Pass, 1 Fail)
- Average: 3 examples (1 Pass, 2 Fail)
- Poor: 1 example (0 Pass, 1 Fail)
Entropy for 'Good' subset:
H(Good) = -(5/6 log2(5/6)) - (1/6 log2(1/6)) = 0.650
Entropy for 'Average' subset:
H(Average) = -(1/3 log2(1/3)) - (2/3 log2(2/3)) = 0.918
Entropy for 'Poor' subset:
H(Poor) = 0 (since all are Fail)
Weighted Entropy for 'Assessment':
H(Assessment) = (6/10) * 0.650 + (3/10) * 0.918 + (1/10) * 0
H(Assessment) = 0.665
Information Gain for 'Assessment':
IG(Assessment) = 0.971 - 0.665 = 0.306
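The weighted entropy and gain can be verified with a small helper built on the entropy function above; it is a hypothetical sketch, with each subset passed as a (Pass, Fail) count pair, and it is reused for the remaining attributes below:

```python
def information_gain(parent_counts, subsets):
    """IG = H(parent) - sum over subsets of |subset|/|parent| * H(subset)."""
    total = sum(parent_counts)
    weighted = sum(sum(s) / total * entropy(s) for s in subsets)
    return entropy(parent_counts) - weighted

# Assessment: Good (5 Pass, 1 Fail), Average (1, 2), Poor (0, 1)
print(round(information_gain([6, 4], [(5, 1), (1, 2), (0, 1)]), 3))
# 0.305 -- the 0.306 above comes from subtracting rounded intermediates
```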
2. Assignment (Yes, No)
- Yes: 6 examples (5 Pass, 1 Fail)
- No: 4 examples (1 Pass, 3 Fail)
Entropy for 'Yes' subset:
H(Yes) = -(5/6 log2(5/6)) - (1/6 log2(1/6)) = 0.650
Entropy for 'No' subset:
H(No) = -(1/4 log2(1/4)) - (3/4 log2(3/4)) = 0.811
Weighted Entropy for 'Assignment':
H(Assignment) = (6/10) * 0.650 + (4/10) * 0.811
H(Assignment) = 0.714
Information Gain for 'Assignment':
IG(Assignment) = 0.971 - 0.714 = 0.257
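The same hypothetical helper confirms this split:

```python
# Assignment: Yes (5 Pass, 1 Fail), No (1 Pass, 3 Fail)
print(round(information_gain([6, 4], [(5, 1), (1, 3)]), 3))
# 0.256 -- the 0.257 above reflects rounded intermediates
```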
3. Project (Yes, No)
- Yes: 5 examples (4 Pass, 1 Fail)
- No: 5 examples (2 Pass, 3 Fail)
Entropy for 'Yes' subset:
H(Yes) = -(4/5 log2(4/5)) - (1/5 log2(1/5)) = 0.722
Entropy for 'No' subset:
H(No) = -(2/5 log2(2/5)) - (3/5 log2(3/5)) = 0.971
Weighted Entropy for 'Project':
H(Project) = (5/10) * 0.722 + (5/10) * 0.971
H(Project) = 0.846
Information Gain for 'Project':
IG(Project) = 0.971 - 0.846 = 0.125
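And for 'Project':

```python
# Project: Yes (4 Pass, 1 Fail), No (2 Pass, 3 Fail)
print(round(information_gain([6, 4], [(4, 1), (2, 3)]), 3))  # 0.125
```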
4. Seminar (Good, Poor, Fair)
- Good: 4 examples (4 Pass, 0 Fail)
- Poor: 3 examples (1 Pass, 2 Fail)
- Fair: 3 examples (1 Pass, 2 Fail)
Entropy for 'Good' subset:
H(Good) = 0 (since all are Pass)
Entropy for 'Poor' subset:
H(Poor) = -(1/3 log2(1/3)) - (2/3 log2(2/3)) = 0.918
Entropy for 'Fair' subset:
H(Fair) = -(1/3 log2(1/3)) - (2/3 log2(2/3)) = 0.918
Weighted Entropy for 'Seminar':
H(Seminar) = (4/10) * 0 + (3/10) * 0.918 + (3/10) * 0.918
H(Seminar) = 0.551
Information Gain for 'Seminar':
IG(Seminar) = 0.971 - 0.551 = 0.420
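And for 'Seminar':

```python
# Seminar: Good (4 Pass, 0 Fail), Poor (1, 2), Fair (1, 2)
print(round(information_gain([6, 4], [(4, 0), (1, 2), (1, 2)]), 3))  # 0.42
```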
Step 3: Choose the Best Attribute
The attribute with the highest information gain is 'Seminar', with IG = 0.420, so 'Seminar' is
chosen as the root node.
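Putting the four results together, the root choice can be reproduced with the same sketch:

```python
# Gain for each attribute, computed at full precision:
gains = {
    "Assessment": information_gain([6, 4], [(5, 1), (1, 2), (0, 1)]),
    "Assignment": information_gain([6, 4], [(5, 1), (1, 3)]),
    "Project":    information_gain([6, 4], [(4, 1), (2, 3)]),
    "Seminar":    information_gain([6, 4], [(4, 0), (1, 2), (1, 2)]),
}
print(max(gains, key=gains.get))  # Seminar
```

Because every example in the 'Good' branch is a Pass, that branch becomes a leaf immediately; the 'Poor' and 'Fair' branches are split further by repeating the same entropy and gain calculation on their three-example subsets.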