
Department of Electrical Engineering

Faculty Member: LE Munadi Sial Date:


Semester: Group:

CS471 Machine Learning


Lab 7: Linear Regression II – Train-Validation Split and
Regularization

Name:                                Reg. No:

Assessment rubric (5 marks each):
PLO4 - CLO4: Viva / Quiz / Lab Performance
PLO4 - CLO4: Analysis of Data in Lab Report
PLO5 - CLO5: Modern Tool Usage
PLO8 - CLO6: Ethics
PLO9 - CLO7: Individual and Team Work



Introduction

This laboratory exercise extends the Python implementation of linear
regression performed in the previous lab. Linear regression is a basic supervised
learning technique in which parameters are trained on a dataset to fit a model
that best approximates that dataset. The problem with simple linear
regression is that the trained model can overfit the dataset, in which case
regularization must be used to prevent overfitting. This lab focuses on
integrating regularization into the gradient descent algorithm.

Objectives

The following are the main objectives of this lab:

• Extract and prepare the training and cross-validation datasets
• Use feature scaling to ensure uniformity among the feature columns
• Implement the cost function on both the training and cross-validation datasets
• Implement the gradient descent algorithm
• Plot the training and cross-validation losses
• Use L2 regularization to counter overfitting

Lab Conduct

• Respect faculty and peers through speech and actions.
• The lab faculty will be available to assist the students. In case some aspect
  of the lab experiment is not understood, the students are advised to seek
  help from the faculty.
• In the tasks, there are commented lines such as #YOUR CODE STARTS
  HERE# where you have to provide the code. You must put the
  code/screenshot/plot between the #START and #END parts of these
  commented lines. Do NOT remove the commented lines.
• Use the tab key to provide the indentation in Python.
• When you provide the code in the report, keep the font size at 12.

Theory



Linear Regression is a very basic supervised learning technique. To calculate
the loss in each training example, the difference between a hypothesis and the
label (y) is calculated. The hypothesis is a linear equation of the features (x) in
the dataset with the coefficients acting as the weight parameters. These weight
parameters are initialized to random values at the start but are then trained
over time to learn the model. The cost function is used to calculate the error
between the predicted ŷ and the actual y.
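
In the notation used in the tasks below, the hypothesis for the i-th example and the (unregularized) cost over m examples can be written as:

h(x^{(i)}) = b + w_1 x_1^{(i)} + w_2 x_2^{(i)} + \dots + w_n x_n^{(i)}

J(w, b) = \frac{1}{2m} \sum_{i=1}^{m} \bigl( h(x^{(i)}) - y^{(i)} \bigr)^2

The regularized version of this cost, and the corresponding gradient updates, are given in Lab Tasks 2 and 3.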

A major problem in training is that the learned weights may fit the model only
to the data it is given. This means that the model will not generalize to
examples outside the dataset, which is referred to as "overfitting".
Such overfitting makes the machine learning implementation impractical
for real-life applications, where data has high variation. To prevent overfitting
of the model, a modification to the cost function and gradient descent is
implemented. This modification is called regularization and is itself controlled
by a hyperparameter (lambda).

A brief summary of the relevant keywords and functions in Python is provided
below:

print()   - output text on the console
input()   - get input from the user on the console
range()   - create a sequence of numbers
len()     - gives the number of items in an object (e.g. characters in a string)
if        - contains code that executes depending on a logical condition
else      - connects with if and elif, executes when their conditions are not met
elif      - equivalent to "else if"
while     - loops code as long as a condition is true
for       - loops code through a sequence of items in an iterable object
break     - exit a loop immediately
continue  - jump to the next iteration of a loop
def       - used to define a function

Lab Task 1 - Dataset Preparation, Feature Scaling ______________________



You have been provided with a dataset containing several feature columns. You
will need to select any 3 of the feature columns to make your own dataset. The
"Sale Price" is the label column that your model will predict. The dataset
examples are to be divided into 2 separate portions: training and cross-
validation datasets (choose a split ratio between 80-20 and 70-30). Save the prepared
datasets as CSV files. Next, load the datasets into your Python program and
store them as NumPy arrays (Xtrain, ytrain, Xval, yval). Next, use feature scaling to
rescale the feature columns of both datasets so that their values range from 0 to
1. Finally, print both of the datasets (you need to show any 5 rows of each
dataset).
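
A minimal sketch of one possible approach is shown below for reference only; the file name (house_prices.csv), the feature column names, and the 80-20 split are assumptions that should be adapted to the dataset you were actually given.

import pandas as pd

# Load the full dataset (file name and column names below are assumed examples)
df = pd.read_csv("house_prices.csv")
features = ["LotArea", "GrLivArea", "OverallQual"]   # any 3 feature columns
data = df[features + ["SalePrice"]]

# Split the examples into training and cross-validation portions (80-20 here)
split = int(0.8 * len(data))
train_df, val_df = data.iloc[:split], data.iloc[split:]
train_df.to_csv("train.csv", index=False)
val_df.to_csv("val.csv", index=False)

# Load the prepared datasets back and store them as NumPy arrays
Xtrain = train_df[features].to_numpy(dtype=float)
ytrain = train_df["SalePrice"].to_numpy(dtype=float)
Xval = val_df[features].to_numpy(dtype=float)
yval = val_df["SalePrice"].to_numpy(dtype=float)

# Min-max feature scaling to the range [0, 1], using the training-set statistics
xmin, xmax = Xtrain.min(axis=0), Xtrain.max(axis=0)
Xtrain = (Xtrain - xmin) / (xmax - xmin)
Xval = (Xval - xmin) / (xmax - xmin)

# Show any 5 rows of each dataset
print(Xtrain[:5], ytrain[:5])
print(Xval[:5], yval[:5])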

### TASK 1 CODE STARTS HERE ###

### TASK 1 CODE ENDS HERE ###

### TASK 1 SCREENSHOT STARTS HERE ###

### TASK 1 SCREENSHOT ENDS HERE ###

Lab Task 2 - Cost Function with Regularization __________________________


For linear regression, you will implement the following hypothesis:

h(x) = b + w1x1 + w2x2 + w3x3 + …

The wj represent the weights and b the bias, while xj represents the jth feature.
The linear hypothesis h(x) is to be calculated for each training example, and its
difference from the label y of that training example represents the loss. In this
task, you will write a cost function that calculates the overall loss across a set of
examples. This cost function will be used to calculate the losses in both the
training and cross-validation phases of the program.

cost_function(X, y, lambd)



The X and y are the features and labels of either the training or the cross-
validation dataset, so the same function can be reused for both the training
examples and the cross-validation examples. The lambd argument is the
regularization parameter (note that lambda is a reserved keyword in Python).
The function will calculate the losses and return the overall cost value. The cost
function is given by:
J(w, b) = \frac{1}{2m} \sum_{i=1}^{m} \bigl( h(x^{(i)}) - y^{(i)} \bigr)^2 + \frac{\lambda}{2m} \sum_{j=1}^{n} w_j^2

The m is the number of examples in the dataset and n is the total number of
features (or non-bias weights) in the hypothesis. Write the code for the cost
function and implement it for your training and cross-validation datasets to
print out the cost. Provide the code and all relevant screenshots of the final
output.
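
One possible structure for this function is sketched below, for reference only. It assumes the weight vector w and bias b are module-level NumPy variables, since they are not part of the signature given above; passing them as extra arguments would work equally well. Note that the cross-validation cost is usually computed with lambd = 0.

import numpy as np

def cost_function(X, y, lambd):
    # Regularized mean squared error over the m examples in X, y.
    # w (weight vector) and b (bias) are assumed to be defined globally.
    m = len(y)
    h = X @ w + b                         # hypothesis for every example
    squared_loss = np.sum((h - y) ** 2)   # sum of squared errors
    penalty = lambd * np.sum(w ** 2)      # L2 penalty on the non-bias weights
    return (squared_loss + penalty) / (2 * m)

# Example usage, assuming Xtrain, ytrain, Xval, yval from Task 1:
# w = np.zeros(Xtrain.shape[1]); b = 0.0
# print(cost_function(Xtrain, ytrain, lambd=1.0))   # training cost
# print(cost_function(Xval, yval, lambd=0.0))       # cross-validation cost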

### TASK 2 CODE STARTS HERE ###

### TASK 2 CODE ENDS HERE ###

### TASK 2 SCREENSHOT STARTS HERE ###

### TASK 2 SCREENSHOT ENDS HERE ###

Lab Task 3 – Gradient Descent with Regularization _____________________


In this task, you will write a function that uses gradient descent to update the
weight parameters:

gradient_descent(X, y, alpha, lambd)



The alpha is the learning rate (hyperparameter 1) and lambd is the
regularization parameter (hyperparameter 2). The gradient descent algorithm
is given as follows:
dw_j = \frac{\partial J}{\partial w_j} = \frac{1}{m} \sum_{i=1}^{m} \bigl( h(x^{(i)}) - y^{(i)} \bigr)\, x_j^{(i)} + \frac{\lambda}{m} w_j

db = \frac{\partial J}{\partial b} = \frac{1}{m} \sum_{i=1}^{m} \bigl( h(x^{(i)}) - y^{(i)} \bigr)

w_j := w_j - \alpha \frac{\partial J}{\partial w_j}

b := b - \alpha \frac{\partial J}{\partial b}

For the submission, you will need to run the gradient descent algorithm once to
update the weights. You will need to print the weights, training cost and
validation cost both before and after the weight update. Provide the code and
all relevant screenshots of the final output.
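
A sketch of one way to write this function is given below, again assuming that w and b live at module level so the given signature can be kept; the hyperparameter values in the usage comments are arbitrary examples.

import numpy as np

def gradient_descent(X, y, alpha, lambd):
    # One regularized gradient descent update of the global parameters w and b.
    global w, b
    m = len(y)
    error = (X @ w + b) - y                        # h(x) - y for every example
    dw = (X.T @ error) / m + (lambd / m) * w       # gradient w.r.t. each w_j
    db = np.sum(error) / m                         # gradient w.r.t. the bias
    w = w - alpha * dw
    b = b - alpha * db

# Example usage: weights and costs before and after a single update
# print(w, b, cost_function(Xtrain, ytrain, 1.0), cost_function(Xval, yval, 0.0))
# gradient_descent(Xtrain, ytrain, alpha=0.1, lambd=1.0)
# print(w, b, cost_function(Xtrain, ytrain, 1.0), cost_function(Xval, yval, 0.0))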

### TASK 3 CODE STARTS HERE ###

### TASK 3 CODE ENDS HERE ###

### TASK 3 SCREENSHOT STARTS HERE ###

### TASK 3 SCREENSHOT ENDS HERE ###

Lab Task 4 – Training and Validation Program _________________________



In this task, you will use the functions from the previous two tasks to write a
“main” function that performs the actual training and validation. Use the cost
function and gradient descent function on the training examples to determine
the training loss and update the weights respectively. Then, use the cost
function on the cross-validation examples to determine the cross-validation
loss. This single iteration over the entire dataset (both training and cross-
validation) marks the completion of one epoch. You will need to perform the
training and cross-validation over several epochs (the epoch number is another
hyperparameter that must be chosen). Ensure that at the end of each epoch, the
training and cross-validation losses are stored for plotting purposes. When the
final epoch is performed, note down the trained parameters (weights and bias)
and make a plot of the training and cross-validation losses (y-axis) over the
epochs (x-axis). Ensure that both of the losses appear on the same graph. You
only need to show a single plot for this task. Provide the code (excluding
function definitions) and all relevant screenshots of the final output.
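
A sketch of how such a main loop could look is shown below; the hyperparameter values are placeholders only, and it relies on the cost_function and gradient_descent sketches above (with w and b as module-level variables).

import numpy as np
import matplotlib.pyplot as plt

alpha, lambd, epochs = 0.1, 1.0, 500    # example hyperparameter values only

w = np.zeros(Xtrain.shape[1])           # initial weights (random values also work)
b = 0.0
train_costs, val_costs = [], []

for epoch in range(epochs):
    gradient_descent(Xtrain, ytrain, alpha, lambd)            # update w and b
    train_costs.append(cost_function(Xtrain, ytrain, lambd))  # training loss
    val_costs.append(cost_function(Xval, yval, 0.0))          # cross-validation loss

print("Trained weights:", w, "bias:", b)

# Plot both losses against the epoch number on the same graph
plt.plot(train_costs, label="training loss")
plt.plot(val_costs, label="cross-validation loss")
plt.xlabel("epoch")
plt.ylabel("cost")
plt.title(f"alpha={alpha}, lambda={lambd}")
plt.legend()
plt.show()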

### TASK 4 CODE STARTS HERE ###

### TASK 4 CODE ENDS HERE ###

### TASK 4 SCREENSHOT STARTS HERE ###

### TASK 4 SCREENSHOT ENDS HERE ###

Lab Task 5 – Tuning Alpha and Lambda ____________________________________


In this task, you will use your linear regression code from the previous task.
Tune the alpha and lambda hyperparameters at different values to get several
plots. You need to get at least 6 plots. Mention the alpha and lambda values in
the plot titles. Ensure all axes are labeled appropriately.
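
One way to generate the required plots is to wrap the Task 4 loop in a small helper and call it over a grid of values, as sketched below; run_training is a hypothetical helper introduced here for illustration, and the grid values are arbitrary.

import numpy as np
import matplotlib.pyplot as plt

def run_training(alpha, lambd, epochs=500):
    # Hypothetical helper: re-initialize the global parameters and rerun the
    # Task 4 loop, returning the per-epoch training and cross-validation costs.
    global w, b
    w, b = np.zeros(Xtrain.shape[1]), 0.0
    train_costs, val_costs = [], []
    for _ in range(epochs):
        gradient_descent(Xtrain, ytrain, alpha, lambd)
        train_costs.append(cost_function(Xtrain, ytrain, lambd))
        val_costs.append(cost_function(Xval, yval, 0.0))
    return train_costs, val_costs

# Example grid: 3 learning rates x 2 regularization strengths = 6 plots
for alpha in (0.01, 0.1, 0.3):
    for lambd in (0.0, 10.0):
        tc, vc = run_training(alpha, lambd)
        plt.figure()
        plt.plot(tc, label="training loss")
        plt.plot(vc, label="cross-validation loss")
        plt.xlabel("epoch")
        plt.ylabel("cost")
        plt.title(f"alpha={alpha}, lambda={lambd}")
        plt.legend()
plt.show()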

### TASK 5 PLOTS START HERE ###



### TASK 5 PLOTS END HERE ###

