EXP-4 DMusingPYTHON

Uploaded by

manimellaankammarao

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views

EXP-4 DMusingPYTHON

Uploaded by

manimellaankammarao

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

EXPERIMENT-4

Aim: Build a model using linear regression algorithm on any dataset.

What is Simple Linear Regression?

In statistics, simple linear regression is a linear regression model with a single

explanatory variable. In simple linear regression, we predict scores on one
variable based on results on another. The criteria variable Y is the variable we
are predicting. Predictor variable X is the variable using which we are making
our predictions. The prediction approach is known as simple regression as there
is only one predictor variable,

As a result, a linear function that predicts the values of the dependent variable as
a function of the independent variable is discovered for two-dimensional sample
points with one independent variable and one dependent variable.

The below graph explains the relation between Salary and Years of Experience

Equation : y = mx + c

This is the simple linear regression equation where c is the constant and m is
the slope and describes the relationship between x (independent
variable) and y (dependent variable). The coefficient can be positive or negative
and is the degree of change in the dependent variable for every 1 unit of change
in the independent variable.
β0 (y-intercept) and β1 (slope) are the coefficients whose values represent the
accuracy of predicted values with the actual values.
Implement Simple Linear Regression in Python
In this example, we will use the salary data concerning the experience of
employees. In this dataset, we have two columns YearsExperience and Salary
Step 1: Import the required python packages
We need Pandas for data manipulation, NumPy for mathematical calculations,
and MatplotLib, and Seaborn for visualizations. Sklearn libraries are used for
machine learning operations
# Import libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from pandas.core.common import random_state
from sklearn.linear_model import LinearRegression
Step 2: Load the dataset
Download the dataset from here and upload it to your notebook and read it into
the pandas dataframe.
# Get dataset
df_sal = pd.read_csv(r"C:\Users\ayyap\OneDrive\Desktop\DMDW#LAB\
Salary_Data.csv")
df_sal.head()
Step 3: Data analysis
Now that we have our data ready, let's analyze and understand its trend in detail.
To do that we can first describe the data below –
# Describe data
df_sal.describe()

Here, we can see Salary ranges from 37731 to 122391 and a median of65237.We
can also find how the data is distributed visually using Seaborn Histplot
# Data distribution
plt.title('Salary Distribution Plot')
sns.Histplot(df_sal['Salary'])
plt.show()

A Histplot or distribution plot shows the variation in the data distribution.

It represents the data by combining a line with a histogram.
Then we check the relationship between Salary and Experience –
# Relationship between Salary and Experience
plt.scatter(df_sal['YearsExperience'], df_sal['Salary'], color = 'lightcoral')
plt.title('Salary vs Experience')
plt.xlabel('Years of Experience')
plt.ylabel('Salary')
plt.box(False)
plt.show()

It is clearly visible now, our data varies linearly. That means, that an individual
receives more Salary as they gain Experience.
Step 4: Split the dataset into dependent/independent variables
Experience (X) is the independent variable
Salary (y) is dependent on experience
# Splitting variables
X = df_sal.iloc[:, :1] # independent
y = df_sal.iloc[:, 1:] # dependent
Step 4: Split data into Train/Test sets
Further, split your data into training (80%) and test (20%) sets
using train_test_split

# Splitting dataset into test/train

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.2, random_state
= 0)

Step 5: Train the regression model

Pass the X_train and y_train data into the regressor model by regressor.fit to
train the model with our training data.

# Regressor model
regressor = LinearRegression()
regressor.fit(X_train, y_train)
Step 6: Predict the result
Here comes the interesting part, when we are all set and ready to predict any
value of y (Salary) dependent on X (Experience) with the trained model
using regressor.predict
# Prediction result
y_pred_test = regressor.predict(X_test) # predicted value of y_test
y_pred_train = regressor.predict(X_train) # predicted value of y_train
Step 7: Plot the training and test results
Its time to test our predicted results by plotting graphs
Plot training set data vs predictions
First we plot the result of training sets (X_train, y_train) with X_train and
predicted value of y_train (regressor.predict(X_train))
# Prediction on training set
plt.scatter(X_train, y_train, color = 'lightcoral')
plt.plot(X_train, y_pred_train, color = 'firebrick')
plt.title('Salary vs Experience (Training Set)')
plt.xlabel('Years of Experience')
plt.ylabel('Salary')
plt.legend(['X_train/Pred(y_test)', 'X_train/y_train'], title = 'Sal/Exp', loc='best',
facecolor='white')
plt.box(False)
plt.show()
Plot test set data vs predictions
Secondly, we plot the result of test sets (X_test, y_test) with X_train and
predicted value of y_train (regressor.predict(X_train))

# Prediction on test set

plt.scatter(X_test, y_test, color = 'lightcoral')
plt.plot(X_train, y_pred_train, color = 'firebrick')
plt.title('Salary vs Experience (Test Set)')
plt.xlabel('Years of Experience')
plt.ylabel('Salary')
plt.legend(['X_train/Pred(y_test)', 'X_train/y_train'], title = 'Sal/Exp', loc='best',
facecolor='white')
plt.box(False)
plt.show()
We can see, in both plots, the regressor line covers train and test data.Also, you
can plot results with the predicted value of y_test (regressor.predict(X_test)) but
the regression line would remain the same at it is generated from the unique
equation of linear regression with the same training data.
If you remember from the beginning of this article, we discussed the linear
equation y = mx + c, we can also get
the c (yintercept) and m (slope/coefficient) from the regressor model.
# Regressor coefficients and intercept
print(f'Coefficient: {regressor.coef_}')
print(f'Intercept: {regressor.intercept_}')
Output:

Math Biostatistics Boot Camp 1
100% (1)
Math Biostatistics Boot Camp 1
3 pages
Linear Regression2
No ratings yet
Linear Regression2
9 pages
Regression Dataset Example
No ratings yet
Regression Dataset Example
14 pages
Machine Learning 2
No ratings yet
Machine Learning 2
45 pages
Experiment No.8
No ratings yet
Experiment No.8
5 pages
2.1 ML (Implementation of Simple Linear Regression in Python)
No ratings yet
2.1 ML (Implementation of Simple Linear Regression in Python)
8 pages
Simple Linear Regression in Machine Learning
No ratings yet
Simple Linear Regression in Machine Learning
7 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
4 pages
DS_P6_yash
No ratings yet
DS_P6_yash
8 pages
Exp 1
No ratings yet
Exp 1
6 pages
Task1
No ratings yet
Task1
5 pages
Regression
No ratings yet
Regression
16 pages
Lecture-2 Unit 2
No ratings yet
Lecture-2 Unit 2
56 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
30 pages
Practical # 10
No ratings yet
Practical # 10
5 pages
Data Science Chapitre 2
No ratings yet
Data Science Chapitre 2
98 pages
EXP 2 ML
No ratings yet
EXP 2 ML
4 pages
Unit5 - Linear Regression
No ratings yet
Unit5 - Linear Regression
4 pages
Praktikum 1 Jupiter Machine Learning
No ratings yet
Praktikum 1 Jupiter Machine Learning
1 page
Linear Regression
No ratings yet
Linear Regression
20 pages
Simple - Linear - Regression - Ipynb - Colaboratory
No ratings yet
Simple - Linear - Regression - Ipynb - Colaboratory
2 pages
Simple Linear Regression Lab II
No ratings yet
Simple Linear Regression Lab II
5 pages
Simple Linear Regression Code
No ratings yet
Simple Linear Regression Code
3 pages
Data Science Chapitre 2
No ratings yet
Data Science Chapitre 2
132 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
2 pages
ML_recordjp
No ratings yet
ML_recordjp
35 pages
Solution To Task 1
No ratings yet
Solution To Task 1
2 pages
2.3 ML (Implementation of Polynomial Regression Using Python)
No ratings yet
2.3 ML (Implementation of Polynomial Regression Using Python)
9 pages
python 1
No ratings yet
python 1
3 pages
CSL0777 L15
No ratings yet
CSL0777 L15
24 pages
Regression Demo
No ratings yet
Regression Demo
8 pages
ML Experiment No 1 Linear Regression Analysis
No ratings yet
ML Experiment No 1 Linear Regression Analysis
3 pages
Linear - Regression - Ipynb - Colaboratory
No ratings yet
Linear - Regression - Ipynb - Colaboratory
4 pages
Salary_Prediction
No ratings yet
Salary_Prediction
9 pages
Machine Learning Hands-On
100% (1)
Machine Learning Hands-On
18 pages
DMV Unit 3 PPT_RSK_250419_125620 jfhuehiwhu
No ratings yet
DMV Unit 3 PPT_RSK_250419_125620 jfhuehiwhu
89 pages
ML manoj
No ratings yet
ML manoj
51 pages
lab mannual of ML
No ratings yet
lab mannual of ML
43 pages
Unit 2 Regression Analysis
No ratings yet
Unit 2 Regression Analysis
16 pages
ml_6_7_8 (1)
No ratings yet
ml_6_7_8 (1)
10 pages
Experiment 1
No ratings yet
Experiment 1
17 pages
ssrn-3526707
No ratings yet
ssrn-3526707
5 pages
Question 1 B
No ratings yet
Question 1 B
6 pages
211423205047-Exp1c
No ratings yet
211423205047-Exp1c
6 pages
3. Machine Learning
No ratings yet
3. Machine Learning
158 pages
Simple Linear Regression - Assign4
No ratings yet
Simple Linear Regression - Assign4
8 pages
Lab Experiment 4 - AI
No ratings yet
Lab Experiment 4 - AI
7 pages
Task8
No ratings yet
Task8
2 pages
11Soln
No ratings yet
11Soln
3 pages
Lecture-5---Polynomial-Regression-imran-07032025-114203am
No ratings yet
Lecture-5---Polynomial-Regression-imran-07032025-114203am
39 pages
ML Combined
No ratings yet
ML Combined
254 pages
ML 1-11
No ratings yet
ML 1-11
27 pages
AI Lec 3
No ratings yet
AI Lec 3
36 pages
Lab 11,12 - Copy
No ratings yet
Lab 11,12 - Copy
7 pages
Salary Prediction LinearRegression
100% (1)
Salary Prediction LinearRegression
7 pages
Agniva
No ratings yet
Agniva
16 pages
Assignment No.4 - (20-Ele-68)
No ratings yet
Assignment No.4 - (20-Ele-68)
17 pages
UnivariateRegression Summary
No ratings yet
UnivariateRegression Summary
36 pages
Code Book
No ratings yet
Code Book
20 pages
Top Numerical Methods With Matlab For Beginners!
From Everand
Top Numerical Methods With Matlab For Beginners!
Andrei Besedin
No ratings yet
Process Performance Models: Statistical, Probabilistic & Simulation
From Everand
Process Performance Models: Statistical, Probabilistic & Simulation
Vishnuvarthanan Moorthy
No ratings yet
Long-Term-Effects-and-Legacy
No ratings yet
Long-Term-Effects-and-Legacy
1 page
Transportation-Revolution
No ratings yet
Transportation-Revolution
1 page
Social-and-Economic-Changes
No ratings yet
Social-and-Economic-Changes
1 page
The-Factory-System
No ratings yet
The-Factory-System
1 page
180722f0-68eb-4231-80d6-00c37312aabb
No ratings yet
180722f0-68eb-4231-80d6-00c37312aabb
1 page
The-Industrial-Revolution-Transforming-Society
No ratings yet
The-Industrial-Revolution-Transforming-Society
8 pages
Verbal Ability (2)
No ratings yet
Verbal Ability (2)
4 pages
Verbal Ability (4)
No ratings yet
Verbal Ability (4)
4 pages
Data1 Excel
No ratings yet
Data1 Excel
1 page
Beginners Guide to Python Programming
No ratings yet
Beginners Guide to Python Programming
1 page
Print 1
No ratings yet
Print 1
79 pages
Unit 1 1
No ratings yet
Unit 1 1
9 pages
Print 1
No ratings yet
Print 1
79 pages
OODA UNIT - 1 New
No ratings yet
OODA UNIT - 1 New
12 pages
(Ebook) Markov Models & Optimization by M.H.A. Davis ISBN 9780203748039, 9780412314100, 9781351433495, 0203748034, 041231410X, 1351433490 instant download
100% (1)
(Ebook) Markov Models & Optimization by M.H.A. Davis ISBN 9780203748039, 9780412314100, 9781351433495, 0203748034, 041231410X, 1351433490 instant download
59 pages
Numerical Optimization in Matlab
No ratings yet
Numerical Optimization in Matlab
25 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
76 pages
Math Exe Form 3 Chapter 2
No ratings yet
Math Exe Form 3 Chapter 2
3 pages
Datamites Ai Expert Brochure (1)
No ratings yet
Datamites Ai Expert Brochure (1)
10 pages
DAA PBL
No ratings yet
DAA PBL
10 pages
Chapter 2 Adaline
No ratings yet
Chapter 2 Adaline
71 pages
Entropy: The Von Neumann Entropy For Mixed States
No ratings yet
Entropy: The Von Neumann Entropy For Mixed States
9 pages
Learning Objectives: Simple Linear Regression
No ratings yet
Learning Objectives: Simple Linear Regression
6 pages
Cluster Analysis in R TML
No ratings yet
Cluster Analysis in R TML
5 pages
Get Hands on Machine Learning with Scikit Learn Keras and TensorFlow 2 / Paperback Edition Aurélien Géron free all chapters
100% (1)
Get Hands on Machine Learning with Scikit Learn Keras and TensorFlow 2 / Paperback Edition Aurélien Géron free all chapters
67 pages
Wind Loads
No ratings yet
Wind Loads
1 page
A1 - E1-1-to-E1-6 Database System
No ratings yet
A1 - E1-1-to-E1-6 Database System
7 pages
Time Series and Spectral Analysis Part V. Spectral Analysis: Sonia - Gouveia@ua - PT
No ratings yet
Time Series and Spectral Analysis Part V. Spectral Analysis: Sonia - Gouveia@ua - PT
20 pages
Signals and Systems
No ratings yet
Signals and Systems
42 pages
Controller Design Using Root Locus: 14.1 PD Control
No ratings yet
Controller Design Using Root Locus: 14.1 PD Control
11 pages
Aktu Tafl 2022 23 Kcs 402 Question Paper Solution
No ratings yet
Aktu Tafl 2022 23 Kcs 402 Question Paper Solution
18 pages
ECE1657PDFNotesGTEGT 2016
No ratings yet
ECE1657PDFNotesGTEGT 2016
188 pages
NPTEL Introduction To Machine Learning Assignment 10 Answers
100% (1)
NPTEL Introduction To Machine Learning Assignment 10 Answers
7 pages
Evolution of AI - Final
No ratings yet
Evolution of AI - Final
14 pages
Computer Graphics Lab Manual
No ratings yet
Computer Graphics Lab Manual
7 pages
c6d - Channel Coding Part 1
No ratings yet
c6d - Channel Coding Part 1
78 pages
eigenvalueBarycentric
No ratings yet
eigenvalueBarycentric
29 pages
Track
No ratings yet
Track
49 pages
Data Level Fusion For Multi Biometric System Using Face and Finger
No ratings yet
Data Level Fusion For Multi Biometric System Using Face and Finger
5 pages
2006ACC Elevator
No ratings yet
2006ACC Elevator
6 pages
Indian Institute of Technology Bombay September 29, 2023 EE782 Advanced Topics in Machine Learning Assignment 2: Metric Learning and Generative AI
No ratings yet
Indian Institute of Technology Bombay September 29, 2023 EE782 Advanced Topics in Machine Learning Assignment 2: Metric Learning and Generative AI
1 page
Eet305 Signals and Systems, December 2022 - 2
No ratings yet
Eet305 Signals and Systems, December 2022 - 2
4 pages
Deep Learning Techniques For Cyber Security Intrusion Detection: A Detailed Analysis
No ratings yet
Deep Learning Techniques For Cyber Security Intrusion Detection: A Detailed Analysis
11 pages

EXP-4 DMusingPYTHON

Uploaded by

EXP-4 DMusingPYTHON

Uploaded by

EXPERIMENT-4

Aim: Build a model using linear regression algorithm on any dataset.

In statistics, simple linear regression is a linear regression model with a single

A Histplot or distribution plot shows the variation in the data distribution.

# Splitting dataset into test/train

Step 5: Train the regression model

# Prediction on test set

You might also like