0% found this document useful (0 votes)

63 views5 pages

MBSD

The document describes a grading rubric for a machine learning course term project. It outlines four criteria being assessed: meeting specifications, code organization, report formatting, and individual/team performance. Each criterion is scored on a scale of 1 to 4. The final score is a weighted calculation combining the criteria scores. It then summarizes the preprocessing steps applied to a student grades dataset, including handling missing values, encoding categorical data, and feature selection. Two machine learning algorithms - linear regression and KNN regression - were used to predict final-year GPA based on 1st, 1st-2nd, and 1st-3rd year grades. Models using additional years of grades achieved higher accuracy, with the 3

Uploaded by

IQRA IMRAN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views5 pages

MBSD

Uploaded by

IQRA IMRAN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

DEPARTMENT OF COMPUTER & INFORMATION SYSTEMS ENGINEERING

BACHELORS IN COMPUTER SYSTEMS ENGINEERING

Course Code: CS-324
Course Title: Machine Learning
Complex Engineering Problem
TE Batch 2019, Spring Semester 2022
Grading Rubric
TERM PROJECT
Group Members:
Student No. Name Roll No.
S1 Kaenaat Samad CS-19001
S2 Iqra Imran CS-19009
S3 Neha Fatima CS-19010

Marks Obtained
CRITERIA AND SCALES
S1 S2 S3
Criterion 1: Does the application meet the desired specifications and produce the desired outputs?
(CPA-1, CPA-2, CPA-3) [8 marks]
1 2 3 4
The application does not The application partially The application meets the The application meets all
meet the desired meets the desired desired specifications but the desired specifications
specifications and is specifications and is is producing incorrect or and is producing correct
producing incorrect producing incorrect or partially correct outputs. outputs.
outputs. partially correct outputs.
Criterion 2: How well is the code organization? [2 marks]
1 2 3 4
The code is poorly The code is readable only Some part of the code is The code is well
organized and very to someone who knows well organized, while organized and very easy
difficult to read. what it is supposed to be some part is difficult to to follow.
doing. follow.
Criterion 3: Does the report adhere to the given format and requirements? [6 marks]
1 2 3 4
The report does not The report contains the The report contains all the The report contains all the
contain the required required information only required information but required information and
information and is partially but is formatted is formatted poorly. completely adheres to the
formatted poorly. well. given format.
Criterion 4: How does the student performed individually and as a team member?
(CPA-1, CPA-2, CPA-3) [4 marks]
1 2 3 4
The student worked on the The student worked on the The student worked on the
The student did not work assigned task, and assigned task, and assigned task, and
on the assigned task. accomplished goals accomplished goals accomplished goals
partially. satisfactorily. beyond expectations.
Final Score = (Criterial_1_score x 2) + (Criteria_2_score / 2) + (Criteria_3_score x (3/2)) + (Criteria_4_score)
= ______________________

Page 1 of 5
DATA PREPROCESSING STEPS:
For the given dataset of grades achieved by different students, we have done the following
preprocessing on data:
First of all, we have displayed the dataset to analyze the full picture. Then we checked all the columns
provided in the dataset. We found out the dimensions of the dataset by using shape method.
The data was inspected to see if there were any null values present in any column. We applied a
method which returned the number of null values in each column. To deal with missing values, we had
to apply a solution that could manage all the null values efficiently without rendering any significant
impact on data accuracy. Hence, we found out the mode of all the values in each column and replaced
the null values in each column with its mode.
Next, we looked for categorical data in the dataset and created a function for making key value pair of
categorical columns with respect to features. Then we searched for unique values in each column. The
number of unique values was found to be 13. We assigned the respective numeric value (GPA) for each
value (grade) in the column as follows:
𝑨+ = 4.0, A = 4.0, 𝑨− = 3.7, 𝑩+ = 3.3, B = 3.0, 𝑩− = 2.7, 𝑪+ = 2.3, C = 2, 𝑪− = 1.7, 𝑫+ = 1.3, D = 1.0, F = 0.0,
WU = 0.0
Furthermore, the seat number was removed from the dataset in order to attain a simplified data for
making prediction. We implemented a function that could efficiently retrieve all the courses taught in
different years. If we provide it the parameter ‘(1)’, it would only retrieve the courses of First year.
Similarly, if we provide ‘(1,2)’, it would retrieve the courses of First year combined with Second year
and for parameters ‘(1,2,3)’, the courses of First year combined with both Second and Third year are
regained. The courses are the features and the CGPA is set as the target of the model.

MODEL AND ALGORITHM:

We have applied two machine learning algorithms to predict the final CGPA of a student at the end of
fourth year with the help of CGPAs of the courses obtained at the end of 1st, 2nd and 3rd years.

MODELS USED:
Model 1: predict final CGPA based on GPs of first year only.
Model 2: predict final CGPA based on GPs of first two years.
Model 2: predict final CGPA based on GPs of first three years.

ALGORITHMS USED:
We have implemented Linear Regression Model and KNN Regressor Model for predicting final CGPAs.

Page 2 of 5
1. Linear Regression:
Linear regression is the most commonly used model for predictive analysis of continuous data. It
attempts to model the relationship between two variables by fitting a linear equation to observed
data. One variable is considered to be an explanatory variable, and the other is considered to be a
dependent variable.
First of all, we split the data into 70% train data and 30% test data, and then applied Linear Regression
model on data.
For Model 1:
The accuracy of train data for first year was found to be 84.23%. Secondly, we verified if there are any
NaN values in our test data and forwarded with its prediction. The mean squared error was calculated
as 6% and the accuracy was 81%.
For Model 2:
The 2nd model was handled in the same way as the 1st. The accuracy of the train data was acquired as
90%. As for the test data, its mean squared error was computed as 3% and the accuracy was 92%.
For Model 3:
The accuracy of the train data was found to be 92.568%. The test data for this model consisted of the
minimum mean squared error, that is, 1% and the accuracy was 97% which is the best among all three
models.

2. KNN Regressor:
KNN regressor is a non-parametric model that, in an intuitive manner, approximates the association
between independent variables and the continuous outcome by averaging the observations in the
same neighborhood.
For Model 1:
We calculated the accuracy of train data first which came out as 84.64%. As far as test data is
concerned, the prediction was made for it and its mean squared error was observed as 6.99% and the
accuracy was 79%. We also provided the solution for handling the case for a single input by the user.
To achieve this, the test data was first reshaped into 1 column and as many rows as suggested by
NumPy and then sent on for prediction.
For Model 2:
The 2nd model was managed in the similar way. The accuracy of the train data was obtained as 87.7%.
Meanwhile, the mean squared error and accuracy of test data were gained as 3% and 91%
respectively.

Page 3 of 5
For Model 3:
The accuracy of the train data for this model was acquired as 89.3%. The test data was discovered to
have the accuracy of 95% with mean squared error of 1.5%.

GRAPHICAL COMPARISON OF MODELS:

FOR LINEAR REGRESSION (BASED ON PREDICTION OF TEST DATA):

COMPARISON OF THE ACCURACY PERCENTAGE

OF THREE MODELS FOR LINEAR REGRESSION
100

85
97
80 92

75 81

70
Model 1 Model 2 Model 3

Series 1 Column1 Column2

FOR KNN REGRESSOR (BASED ON PREDICTION OF TEST DATA):

COMPARISON OF THE ACCURACY PERCENTAGE

OF THREE MODELS FOR KNN REGRESSOR
100
90
80
70
60
50 95
91
40 79
30
20
10
0
Model 1 Model 2 Model 3

Series 1 Series 2 Series 3

Page 4 of 5
PERFORMANCE OF MACHINE LEARNING SYSTEMS:
LINEAR REGRESSION:
The accuracy achieved by implementing linear regression is quite good in all three cases.

The accuracy of the first model lies in the range of 80% to 90% whereas the accuracies for model 2 and 3 are
significantly above 90% for both train and test data. Moreover, For model 1, the difference in the accuracies of
train and test data is 3.23%. For model 2, the difference is 2% and for model 3, it is 5.568% which is pretty much
acceptable.

Based on all these values, we can form a statement that our model is a good fit.

KNN REGRESSOR:
KNN regressor also succeeded in providing high accuracies. In this case, the model 1 possessed a difference of
5.64% in the accuracies of train and test data. But for model 2, the difference between the two got improved
and turned out to be 3.3%. The model 3 had the difference of 5.7% in the accuracies which is also not bad at all.
The accuracies of all the models for both train and test data are mostly above 80% which is considered to be
excellent.

Hence, we can say that our model is a really good fit according to the analysis of all the accuracies.

Here, the dataset provided to us comprised of the record of 571 students which is not a very large number. One
reason for obtaining high accuracies could be the size of the dataset as well as the simplification of data. But
irrespective of the reason, the correctness of predictions is quite impressive.

Page 5 of 5

Vijaya ML
88% (8)
Vijaya ML
26 pages
Predictive Modelling ALOK KUMAR
100% (1)
Predictive Modelling ALOK KUMAR
25 pages
ML (AutoRecovered)
No ratings yet
ML (AutoRecovered)
4 pages
Student Performance Prediction: Mukul Gharpure, Pushpak Chaudhari, Yash Bhole, Sagar Borkar, Aashutosh Awasthi
No ratings yet
Student Performance Prediction: Mukul Gharpure, Pushpak Chaudhari, Yash Bhole, Sagar Borkar, Aashutosh Awasthi
7 pages
Report WT
No ratings yet
Report WT
24 pages
12058-Article Text-21417-1-10-20220201
No ratings yet
12058-Article Text-21417-1-10-20220201
7 pages
Machine Learning Based Student AcademicPerformance Prediction
No ratings yet
Machine Learning Based Student AcademicPerformance Prediction
6 pages
DataAnalytics Lab Manual (1)
No ratings yet
DataAnalytics Lab Manual (1)
35 pages
Student Performance Analysis Using Machine Learning: Yamnampet, Hyderabad.
No ratings yet
Student Performance Analysis Using Machine Learning: Yamnampet, Hyderabad.
8 pages
Graduate Admission Prediction - Data Analytics
No ratings yet
Graduate Admission Prediction - Data Analytics
32 pages
Machine Learning Project: Sneha Sharma PGPDSBA Mar'21 Group 2
100% (4)
Machine Learning Project: Sneha Sharma PGPDSBA Mar'21 Group 2
36 pages
Irjet V10i395
No ratings yet
Irjet V10i395
4 pages
Regression PDF
No ratings yet
Regression PDF
10 pages
2017 - StudentCGPA PDF
No ratings yet
2017 - StudentCGPA PDF
7 pages
CE802 Report
No ratings yet
CE802 Report
7 pages
Class X a Project File
No ratings yet
Class X a Project File
10 pages
ML LAB
No ratings yet
ML LAB
23 pages
ML Tutorial 2
No ratings yet
ML Tutorial 2
2 pages
Academic Analytics Using Machine Learning
No ratings yet
Academic Analytics Using Machine Learning
26 pages
Predictive Modelling Sweta Kumari
No ratings yet
Predictive Modelling Sweta Kumari
35 pages
Project Report
100% (3)
Project Report
36 pages
Project - Machine Learning-Business Report: By: K Ravi Kumar PGP-Data Science and Business Analytics (PGPDSBA.O.MAR23.A)
No ratings yet
Project - Machine Learning-Business Report: By: K Ravi Kumar PGP-Data Science and Business Analytics (PGPDSBA.O.MAR23.A)
38 pages
ML Question Bank
No ratings yet
ML Question Bank
7 pages
Prediction of Student Performance Using Linear Regression
No ratings yet
Prediction of Student Performance Using Linear Regression
5 pages
About The Dataset - Car Evaluation Dataset (UCI Machine Learning Repository
No ratings yet
About The Dataset - Car Evaluation Dataset (UCI Machine Learning Repository
5 pages
Machine Leaning
No ratings yet
Machine Leaning
29 pages
0.extracted Pages 20MCA201 From 2020 MCA S3 S4
No ratings yet
0.extracted Pages 20MCA201 From 2020 MCA S3 S4
18 pages
Machine Learning Glob(22241a1237)
No ratings yet
Machine Learning Glob(22241a1237)
16 pages
Data Science and ML-KTU
No ratings yet
Data Science and ML-KTU
11 pages
Question Bank1
No ratings yet
Question Bank1
9 pages
Exam With Solutions
No ratings yet
Exam With Solutions
7 pages
Machine Learning Business Report - Compress (AutoRecovered)
100% (3)
Machine Learning Business Report - Compress (AutoRecovered)
69 pages
Student Performance Prediction
No ratings yet
Student Performance Prediction
4 pages
Học viện ngân hàng Banking Academy of Vietnam International School of Business
No ratings yet
Học viện ngân hàng Banking Academy of Vietnam International School of Business
9 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
2 pages
LuckyMiniProject[01]
No ratings yet
LuckyMiniProject[01]
32 pages
PAMLSET2.docx (1)
No ratings yet
PAMLSET2.docx (1)
4 pages
A Comparison of Regression Models For Prediction of Graduate Admissions
No ratings yet
A Comparison of Regression Models For Prediction of Graduate Admissions
5 pages
Student Performance Analysis and Prediction
No ratings yet
Student Performance Analysis and Prediction
19 pages
18d2d550ad9b71c9315f45c680d8629283cd
No ratings yet
18d2d550ad9b71c9315f45c680d8629283cd
6 pages
ML Project Shivani Pandey
100% (2)
ML Project Shivani Pandey
49 pages
Em Semester Project
No ratings yet
Em Semester Project
21 pages
PAMLSET1new.docx (1)
No ratings yet
PAMLSET1new.docx (1)
4 pages
CE802 Pilot
No ratings yet
CE802 Pilot
2 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
11 pages
ML assignment-2 pdf
No ratings yet
ML assignment-2 pdf
4 pages
Practise Questions
No ratings yet
Practise Questions
26 pages
Personalized Learning PPt
No ratings yet
Personalized Learning PPt
13 pages
First Project
No ratings yet
First Project
34 pages
Asiign2 Aaryan Ai
No ratings yet
Asiign2 Aaryan Ai
11 pages
Winter Report
No ratings yet
Winter Report
82 pages
1 To 10 DSBDA Case Study
No ratings yet
1 To 10 DSBDA Case Study
17 pages
Prediction of Students Performance With Learning Coefficients Using Regression Based Machine Learning Models
No ratings yet
Prediction of Students Performance With Learning Coefficients Using Regression Based Machine Learning Models
11 pages
Data Science Homework
No ratings yet
Data Science Homework
13 pages
Machine Learning Business Report PDF
No ratings yet
Machine Learning Business Report PDF
54 pages
ML Lab Manual TE 2021-22
No ratings yet
ML Lab Manual TE 2021-22
43 pages
turover prediction
No ratings yet
turover prediction
52 pages
Student Score Prediction System Based On Studies: Jay Patel D20DIT084, Nishchal Thakkar D20DIT088
No ratings yet
Student Score Prediction System Based On Studies: Jay Patel D20DIT084, Nishchal Thakkar D20DIT088
7 pages
[1122]AI_Assignment2
No ratings yet
[1122]AI_Assignment2
2 pages
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
ES 211 - Technopreneurship
No ratings yet
ES 211 - Technopreneurship
7 pages
On Sociology of Literature
No ratings yet
On Sociology of Literature
89 pages
Dynamic Structure or Enduring Activity
No ratings yet
Dynamic Structure or Enduring Activity
326 pages
Optimisation 1
No ratings yet
Optimisation 1
17 pages
2023 She Task Sheet
No ratings yet
2023 She Task Sheet
4 pages
Forces and Motion Assessment
No ratings yet
Forces and Motion Assessment
3 pages
Agriculture Handout
No ratings yet
Agriculture Handout
6 pages
CSC 490
No ratings yet
CSC 490
5 pages
5 6059728628751532816 PDF
No ratings yet
5 6059728628751532816 PDF
52 pages
Individual Reflection Sofia Damia Binti Mohd Shuhaimi (2224822)
No ratings yet
Individual Reflection Sofia Damia Binti Mohd Shuhaimi (2224822)
2 pages
14PHDRM PDF
No ratings yet
14PHDRM PDF
1 page
LaTeX+pdf+Kaleidoscope + (Beta+by+AI)
No ratings yet
LaTeX+pdf+Kaleidoscope + (Beta+by+AI)
8 pages
SS Commercial Books 2024 2025
No ratings yet
SS Commercial Books 2024 2025
2 pages
Burt - Natural Science and The Philosophy of Nature (Philos Review May 1892)
No ratings yet
Burt - Natural Science and The Philosophy of Nature (Philos Review May 1892)
9 pages
Tseamcet - 2021 - Last Ranks - Final Phase
No ratings yet
Tseamcet - 2021 - Last Ranks - Final Phase
58 pages
Mow VC Huiskes RBasic Orthopaedic Biomechanics and PDF
No ratings yet
Mow VC Huiskes RBasic Orthopaedic Biomechanics and PDF
1 page
ISSN: 2320-7493 (Online) 2320-8449 (Print)
No ratings yet
ISSN: 2320-7493 (Online) 2320-8449 (Print)
2 pages
RESEARCH METHODOLOGY
No ratings yet
RESEARCH METHODOLOGY
3 pages
AaCape-physics-unit-1-p2-mark-schemes-2022-2007 Pdfcoffee - Com - Pdf-Free
No ratings yet
AaCape-physics-unit-1-p2-mark-schemes-2022-2007 Pdfcoffee - Com - Pdf-Free
2 pages
2024 NSC Draft Timetable May Examination
No ratings yet
2024 NSC Draft Timetable May Examination
2 pages
Magic Beads Lab Write Up Example
No ratings yet
Magic Beads Lab Write Up Example
2 pages
Arabic Science Fiction Ian Campbell download
100% (3)
Arabic Science Fiction Ian Campbell download
49 pages
Research Methodology For Business
No ratings yet
Research Methodology For Business
3 pages
Lecture 4 EDU 6205-Special Methods For Teachcing TE Education - 020920
No ratings yet
Lecture 4 EDU 6205-Special Methods For Teachcing TE Education - 020920
17 pages
Effectiveness, Relevance, and Sustainability of Extension Projects The Case of Bukidnon State University College of Education
No ratings yet
Effectiveness, Relevance, and Sustainability of Extension Projects The Case of Bukidnon State University College of Education
10 pages
Socially Useful Productive Work and Community Service
100% (2)
Socially Useful Productive Work and Community Service
9 pages
Transcendental Realism
No ratings yet
Transcendental Realism
18 pages
Lesson06 PDF
No ratings yet
Lesson06 PDF
4 pages
3 - Clinical Assessment, Diagnosis, and Research Methods
100% (1)
3 - Clinical Assessment, Diagnosis, and Research Methods
12 pages
Local Media5326781183002441368
No ratings yet
Local Media5326781183002441368
6 pages

MBSD

Uploaded by

MBSD

Uploaded by

DEPARTMENT OF COMPUTER & INFORMATION SYSTEMS ENGINEERING

BACHELORS IN COMPUTER SYSTEMS ENGINEERING

MODEL AND ALGORITHM:

GRAPHICAL COMPARISON OF MODELS:

COMPARISON OF THE ACCURACY PERCENTAGE

Series 1 Column1 Column2

FOR KNN REGRESSOR (BASED ON PREDICTION OF TEST DATA):

COMPARISON OF THE ACCURACY PERCENTAGE

Series 1 Series 2 Series 3

You might also like