Decision Tree
exploring and preparing the data
Data preparation – creating random training and test datasets
• Usually, when data has been sorted in a random order, we can simply divide the dataset into two portions by taking the first 90 percent of records for training and the remaining 10 percent for testing.
• In contrast, the credit dataset is not randomly ordered, making the prior approach
unwise.
• Suppose that the bank had sorted the data by the loan amount, with the largest
loans at the end of the file.
• If we used the first 90 percent for training and the remaining 10 percent for testing,
we would be training a model on only the small loans and testing the model on the
big loans. Obviously, this could be problematic.
• We'll solve this problem by using a random sample of the credit data for training.
• A random sample is simply a process that selects a subset of records at random.
• In R, the sample() function is used to perform random sampling.
• However, before putting it into action, a common practice is to set a seed value, which causes the randomization process to follow a sequence that can be replicated later if desired, as sketched below.
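A minimal sketch of this sampling step, assuming the data frame is named credit, contains 1,000 rows, and that 100 records (10 percent) are held out for testing; the seed value 123 is arbitrary:

# make the randomization reproducible
set.seed(123)

# draw 900 row indices at random from the 1,000 records
train_sample <- sample(1000, 900)

# split into training and test sets
credit_train <- credit[train_sample, ]
credit_test  <- credit[-train_sample, ]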
training a model on the data
• We will use the C5.0 algorithm in the
C50 package to train our decision tree
model.
• For the first iteration of our credit
approval model, we'll use the default
C5.0 configuration, as shown in the
following code.
• The 17th column in credit_train is the class variable, default, so we need to exclude it from the training data frame but supply it as the target factor vector for classification.
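A sketch of the training call, assuming the class variable is the factor default stored in the 17th column of credit_train (both names are taken from the slides above):

library(C50)

# train the tree on all predictors, excluding column 17 (the class variable),
# and supply that column separately as the target factor vector
credit_model <- C5.0(credit_train[-17], credit_train$default)

# display the tree's decisions
summary(credit_model)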
The first few decisions in the resulting tree can be read in plain language as:
1. If the checking account balance is unknown or greater than 200 DM, then classify as "not likely to default."
2. Otherwise, if the checking account balance is less than zero DM or between one and 200 DM,
3. and the credit history is perfect or very good, then classify as "likely to default."
evaluating model performance
• credit_pred <- predict(credit_model, credit_test)
• This creates a vector of predicted class values, which we can compare
to the actual class values using the CrossTable() function in the
gmodels package.
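A sketch of this comparison, assuming the actual class values are stored in credit_test$default:

library(gmodels)

# cross-tabulate actual versus predicted values;
# the extra proportions are suppressed to keep the table readable
CrossTable(credit_test$default, credit_pred,
           prop.chisq = FALSE, prop.c = FALSE, prop.r = FALSE,
           dnn = c("actual default", "predicted default"))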
Results
Out of the 100 test loan application
records, our model correctly predicted that
59 did not default and 14 did default,
resulting in an accuracy of 73 percent and
an error rate of 27 percent.
Also note that the model only correctly
predicted 14 of the 33 actual loan defaults
in the test data, or 42 percent.
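The same figures can be recovered directly from the prediction vector; this is a sketch, assuming credit_test$default holds the actual outcomes and the positive class is labelled "yes":

# overall accuracy and error rate: (59 + 14) / 100 = 0.73, so error = 0.27
accuracy <- mean(credit_pred == credit_test$default)
error_rate <- 1 - accuracy

# of the 33 actual defaults, 14 were correctly predicted: 14 / 33, or about 42 percent
defaults_caught <- sum(credit_pred == "yes" & credit_test$default == "yes")
actual_defaults <- sum(credit_test$default == "yes")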