Assignment 4

The assignment focuses on predicting the prices of used Toyota Corollas using a dataset from 2004, requiring students to perform multiple linear regression and assess model accuracy through metrics like mean absolute percentage error and root mean squared error. Additionally, it involves clustering analysis using the Framingham Heart Study dataset, where students will prepare data, determine the optimal number of clusters, and evaluate clustering quality. The tasks are to be completed using R Markdown in RStudio Cloud/Blackboard.

Uploaded by

Sai Sharan Burugu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views1 page

Assignment 4

Uploaded by

Sai Sharan Burugu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

MBA 739 – Advanced Analytics

Week 4 Classification Numeric Prediction, and Clustering

Assignment

Predicting Prices of Used Cars. The file ToyotaCorolla.csv contains data on used cars (Toyota
Corolla) on sale during late summer of 2004 in the Netherlands. It has 1436 records containing
details on 38 attributes, including Price, Age, Kilometers, HP, and other specifications. The goal is to
predict the price of a used Toyota Corolla based on its specifications.
• Split the data into training (60%) and validation (40%) datasets. Use the seed 739 to ensure consistent
output.
• Run a multiple linear regression with the outcome variable Price and predictor variables Age_08_04,
KM, Fuel_Type, HP, Automatic, Doors, Quarterly_Tax, Mfr_Guarantee, Guarantee_Period, Airco,
Automatic_airco, CD_Player, Powered_Windows, Sport_Model, and Tow_Bar.
• What factors which appear to have no predictive power in assessing price (p > 0.05)?
• Run the predictive model using the above variables. Assess the accuracy of the model in predicting
prices. What is the model’s mean absolute percentage error and root mean squared error? Interpret what
these values mean in real dollars and in plain language.
• Remove the factors which previously appeared to have no predictive power based on statistical
significance. Run the prediction and assess the accuracy of the model in predicting prices. What is the
model’s mean absolute percentage error and root mean squared error? Interpret what these values mean
in real dollars and in plain language. Is the model improved?

Framingham Heart Study – Clustering

When it launched in 1948, the original goal of the Framingham Heart Study (FHS) was to identify
common factors or characteristics that contribute to cardiovascular disease. Over the years, the FHS
has become a successful multigenerational study that analyzes family patterns of cardiovascular and
other diseases. We will use a small subset of that dataset for cluster analysis.

• To prepare the data. Remove any NA values. Similarly, remove the TenYearCHD outcome variable and
normalize the remaining data to 1.
• Using a seed of 10, produce an initial kmeans() cluster with three clusters. Then graph the clusters and
answer the following:
o Determine the optimal number of clusters using a silhouette plot. Produce the plot. What is the
optimal number?
o Replicate the clustering process with the optimal number of clusters. Assess the cluster plot.
Does this appear to meaningfully improve the quality of the clustering?

Use the R Markdown file available in RStudio Cloud/Blackboard to complete the homework.

Car Price Prediction Model in R
No ratings yet
Car Price Prediction Model in R
5 pages
33 Submission
No ratings yet
33 Submission
8 pages
Finalised FBA CIA 3
No ratings yet
Finalised FBA CIA 3
16 pages
Machine Learning-Based Models For Accurate Car Pri
No ratings yet
Machine Learning-Based Models For Accurate Car Pri
6 pages
Predictive Analytics for Car Pricing
No ratings yet
Predictive Analytics for Car Pricing
8 pages
Financial Data
No ratings yet
Financial Data
8 pages
Problem: # Partition
No ratings yet
Problem: # Partition
5 pages
Final DAProject
No ratings yet
Final DAProject
48 pages
Car Price Prediction
No ratings yet
Car Price Prediction
12 pages
Report
No ratings yet
Report
6 pages
Report
No ratings yet
Report
7 pages
Identifying The Most Influential Attributes For Predicting Vehicle Prices Using Extremely Randomized Trees Regression
No ratings yet
Identifying The Most Influential Attributes For Predicting Vehicle Prices Using Extremely Randomized Trees Regression
7 pages
Machine Learning Projects Report Puranjay
No ratings yet
Machine Learning Projects Report Puranjay
27 pages
Data Analysis Report
No ratings yet
Data Analysis Report
67 pages
Toyota Car Price Prediction Analysis
No ratings yet
Toyota Car Price Prediction Analysis
13 pages
Assignment 1
No ratings yet
Assignment 1
11 pages
Team AN
No ratings yet
Team AN
23 pages
Model Lab
No ratings yet
Model Lab
6 pages
1st Review
No ratings yet
1st Review
9 pages
Model Evalution
No ratings yet
Model Evalution
6 pages
Mini
No ratings yet
Mini
16 pages
Ajay and Saurabh
No ratings yet
Ajay and Saurabh
16 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
Report Car Price Prediction
No ratings yet
Report Car Price Prediction
8 pages
Optimizing Bank Marketing with ML
No ratings yet
Optimizing Bank Marketing with ML
56 pages
78 - Used Car Price Prediction Using Machine Learning
100% (1)
78 - Used Car Price Prediction Using Machine Learning
5 pages
Capstone Project
No ratings yet
Capstone Project
24 pages
Car Resale Price Prediction Analysis
No ratings yet
Car Resale Price Prediction Analysis
32 pages
Class Participation
No ratings yet
Class Participation
9 pages
openSAP Sac5 Week 4 Unit 7 PREDKEYINT Exercise
No ratings yet
openSAP Sac5 Week 4 Unit 7 PREDKEYINT Exercise
18 pages
Methadology 400 Words
No ratings yet
Methadology 400 Words
2 pages
DSPY Lab Project (Formatted) 2
No ratings yet
DSPY Lab Project (Formatted) 2
14 pages
Market Segmentation Practical
No ratings yet
Market Segmentation Practical
20 pages
A10421291S3
No ratings yet
A10421291S3
8 pages
Plag
No ratings yet
Plag
3 pages
Car Price Prediction
No ratings yet
Car Price Prediction
18 pages
Udacity Business Analyst Project 8
No ratings yet
Udacity Business Analyst Project 8
19 pages
ML Case Study
No ratings yet
ML Case Study
11 pages
Bulldozer Price Prediction Model
No ratings yet
Bulldozer Price Prediction Model
19 pages
Research Paper
No ratings yet
Research Paper
3 pages
Tristan8 Paper 115
No ratings yet
Tristan8 Paper 115
4 pages
DM Assignment
No ratings yet
DM Assignment
17 pages
Marketing Analytics for Bajaj Allianz
No ratings yet
Marketing Analytics for Bajaj Allianz
30 pages
Assignment 1
No ratings yet
Assignment 1
6 pages
Decision Tree
No ratings yet
Decision Tree
5 pages
Car Price Pre
No ratings yet
Car Price Pre
12 pages
Report
No ratings yet
Report
47 pages
Car Price Prediction Insights
No ratings yet
Car Price Prediction Insights
18 pages
Capstone Project: Banking Data Analysis
No ratings yet
Capstone Project: Banking Data Analysis
54 pages
Sample Paper 6
No ratings yet
Sample Paper 6
10 pages
Project Soft
No ratings yet
Project Soft
28 pages
7708 - MBA PredAnanBigDataNov21
No ratings yet
7708 - MBA PredAnanBigDataNov21
11 pages
Predictive Modeling for Gem Stones Pricing
100% (3)
Predictive Modeling for Gem Stones Pricing
14 pages
MS5107 Boston Housing, Corolla NUIG
No ratings yet
MS5107 Boston Housing, Corolla NUIG
6 pages
Yeniyeniduzelcek
No ratings yet
Yeniyeniduzelcek
37 pages
ARIMA for Stock Price Prediction
No ratings yet
ARIMA for Stock Price Prediction
1 page
Ai and Machine Learning For Predicting
No ratings yet
Ai and Machine Learning For Predicting
9 pages
Package ISLR': R Topics Documented
100% (1)
Package ISLR': R Topics Documented
15 pages
Optimization of Bus Body Structure
No ratings yet
Optimization of Bus Body Structure
6 pages
OU Relay for Life Success 2016
No ratings yet
OU Relay for Life Success 2016
6 pages
Geo SCADA Expert Design Guidelines
No ratings yet
Geo SCADA Expert Design Guidelines
27 pages
MDFA Legacy
No ratings yet
MDFA Legacy
278 pages
SAP GUI 7.50 Installation for Mac OS
No ratings yet
SAP GUI 7.50 Installation for Mac OS
10 pages
Telesales Manager - Delhi-NCR
No ratings yet
Telesales Manager - Delhi-NCR
9 pages
Taktis Kau2044 Manual
No ratings yet
Taktis Kau2044 Manual
16 pages
Mostly Metrics the State of the Agentic Financial Stack 1
No ratings yet
Mostly Metrics the State of the Agentic Financial Stack 1
61 pages
Business Mathematics I Course Syllabus
No ratings yet
Business Mathematics I Course Syllabus
6 pages
Business Functions in ERP Systems
No ratings yet
Business Functions in ERP Systems
5 pages
Grp3 BEED2A Act2
No ratings yet
Grp3 BEED2A Act2
15 pages
Blockchain Solutions for Logistics Challenges
No ratings yet
Blockchain Solutions for Logistics Challenges
8 pages
Digital Voltmeter
No ratings yet
Digital Voltmeter
5 pages
HVAC Water Treatment Specifications
No ratings yet
HVAC Water Treatment Specifications
6 pages
Eureka Forbes Customer Preference Report
No ratings yet
Eureka Forbes Customer Preference Report
5 pages
Practical Training For Fitters and Fabricators in Tank - Repair
No ratings yet
Practical Training For Fitters and Fabricators in Tank - Repair
9 pages
Taken Actions by The Company: To Address/resolve The PROBLEMS A) Jollibee Temporarily Closes 72 Stores
No ratings yet
Taken Actions by The Company: To Address/resolve The PROBLEMS A) Jollibee Temporarily Closes 72 Stores
3 pages
Cassidy Bates
No ratings yet
Cassidy Bates
1 page
Form - Mutual Recognition Statutory Declaration
No ratings yet
Form - Mutual Recognition Statutory Declaration
2 pages
Process of Recognition Under Startup Odisha
No ratings yet
Process of Recognition Under Startup Odisha
1 page
Chapter 1
No ratings yet
Chapter 1
17 pages
Exemple de Dissertation en Droit Du Travail
100% (2)
Exemple de Dissertation en Droit Du Travail
4 pages
Delaware
0% (1)
Delaware
4 pages
Garis Panduan Log in Sistem Smpweb Bagi Pelajar Baharu: Guidelines For Login To Smpweb For New Students
No ratings yet
Garis Panduan Log in Sistem Smpweb Bagi Pelajar Baharu: Guidelines For Login To Smpweb For New Students
4 pages
Economix Agitator Data Sheet
No ratings yet
Economix Agitator Data Sheet
2 pages
Essential Brick Masonry Terminology
100% (2)
Essential Brick Masonry Terminology
39 pages
Case Study - Seven Eleven Japan
No ratings yet
Case Study - Seven Eleven Japan
3 pages
The Future of BRICS and Global Order
No ratings yet
The Future of BRICS and Global Order
14 pages
CCAR Part 22 Carriage of Dangerous Goods 30-06-20-24-8-21
No ratings yet
CCAR Part 22 Carriage of Dangerous Goods 30-06-20-24-8-21
35 pages
Baramulla to Boniyar Tender Details
No ratings yet
Baramulla to Boniyar Tender Details
70 pages

Assignment 4

Uploaded by

Assignment 4

Uploaded by

MBA 739 – Advanced Analytics

Week 4 Classification Numeric Prediction, and Clustering

Framingham Heart Study – Clustering

You might also like