Practical1c.ipynb - Colab

The document outlines a data processing workflow using Python libraries, including pandas and scikit-learn, to manipulate a sample dataset with categorical and numerical variables. It demonstrates label encoding, min-max scaling, standard scaling, and binarization of the data. Finally, the processed dataset is saved as a CSV file named 'processed_data.csv'.

Uploaded by

Tania Jamdar


11/30/24, 3:34 PM Practical1c.ipynb - Colab

# Import required libraries

import pandas as pd
import numpy as np
from sklearn.preprocessing import LabelEncoder, MinMaxScaler, StandardScaler, Binarizer

# Create a sample dataset

data = pd.DataFrame({
    'Category': ['A', 'B', 'C', 'A', 'B', 'C'],  # Categorical variable
    'Age': [23, 45, 31, 22, 35, 30],  # Numerical variable
    'Income': [50000, 60000, 70000, 80000, 90000, 100000],  # Numerical variable
    'Has_Car': ['Yes', 'No', 'Yes', 'No', 'Yes', 'No']  # Binary categorical variable
})
# Display the dataset
print("Sample Dataset:")
print(data)

Sample Dataset:
   Category  Age  Income  Has_Car
0         A   23   50000      Yes
1         B   45   60000       No
2         C   31   70000      Yes
3         A   22   80000       No
4         B   35   90000      Yes
5         C   30  100000       No

# Label Encoding for 'Category' column

label_encoder = LabelEncoder()
data['Category_Encoded'] = label_encoder.fit_transform(data['Category'])
# Label Encoding for binary column 'Has_Car'
data['Has_Car_Encoded'] = label_encoder.fit_transform(data['Has_Car'])
print("\nAfter Label Encoding:")
print(data)

After Label Encoding:
   Category  Age  Income  Has_Car  Category_Encoded  Has_Car_Encoded
0         A   23   50000      Yes                 0                1
1         B   45   60000       No                 1                0
2         C   31   70000      Yes                 2                1
3         A   22   80000       No                 0                0
4         B   35   90000      Yes                 1                1
5         C   30  100000       No                 2                0
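
LabelEncoder assigns integers in the sorted order of the labels, which is why A, B, C become 0, 1, 2 and No, Yes become 0, 1 above. Because the cell refits the same encoder on 'Has_Car', its fitted mapping afterwards describes only that column. The following is a minimal sketch, assuming one encoder per column is acceptable; the `encoders` dictionary and the `df` frame are hypothetical names used for illustration and are not part of the notebook.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Same two categorical columns as the sample dataset
df = pd.DataFrame({
    'Category': ['A', 'B', 'C', 'A', 'B', 'C'],
    'Has_Car': ['Yes', 'No', 'Yes', 'No', 'Yes', 'No'],
})

# Hypothetical helper: one encoder per column, so each fitted mapping survives
encoders = {}
for col in ['Category', 'Has_Car']:
    encoders[col] = LabelEncoder()
    df[col + '_Encoded'] = encoders[col].fit_transform(df[col])

# classes_ holds the labels in sorted order; the position is the encoded value
category_classes = list(encoders['Category'].classes_)
has_car_classes = list(encoders['Has_Car'].classes_)

# inverse_transform maps the integers back to the original labels
restored = list(encoders['Category'].inverse_transform(df['Category_Encoded']))

print(category_classes, has_car_classes)
print(restored)
```

Keeping the encoders separate matters mainly when the integer codes need to be decoded later, for example when reporting results in terms of the original labels.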

# Min-Max Scaling for 'Income'

min_max_scaler = MinMaxScaler()
data['Income_MinMax'] = min_max_scaler.fit_transform(data[['Income']])
# Standard Scaling for 'Age'
standard_scaler = StandardScaler()
data['Age_Standardized'] = standard_scaler.fit_transform(data[['Age']])
print("\nAfter Scaling:")
print(data)

After Scaling:
   Category  Age  Income  Has_Car  Category_Encoded  Has_Car_Encoded  \
0         A   23   50000      Yes                 0                1
1         B   45   60000       No                 1                0
2         C   31   70000      Yes                 2                1
3         A   22   80000       No                 0                0
4         B   35   90000      Yes                 1                1
5         C   30  100000       No                 2                0

   Income_MinMax  Age_Standardized
0            0.0         -1.035676
1            0.2          1.812434
2            0.4          0.000000
3            0.6         -1.165136
4            0.8          0.517838
5            1.0         -0.129460
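
The scaled columns can be checked by hand. Min-max scaling computes (x - min) / (max - min), and standardization computes (x - mean) / std, where StandardScaler uses the population standard deviation (ddof=0). The sketch below recomputes both columns with NumPy and compares them with the scikit-learn results; the variable names are illustrative and do not appear in the notebook.

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

income = np.array([50000, 60000, 70000, 80000, 90000, 100000], dtype=float)
age = np.array([23, 45, 31, 22, 35, 30], dtype=float)

# Min-max scaling: (x - min) / (max - min), mapping the column onto [0, 1]
income_minmax = (income - income.min()) / (income.max() - income.min())

# Standardization: (x - mean) / std with the population std (ddof=0)
age_standardized = (age - age.mean()) / age.std(ddof=0)

# The scikit-learn scalers expect a 2D array, hence reshape(-1, 1)
sk_minmax = MinMaxScaler().fit_transform(income.reshape(-1, 1)).ravel()
sk_standard = StandardScaler().fit_transform(age.reshape(-1, 1)).ravel()

print(np.allclose(income_minmax, sk_minmax))
print(np.allclose(age_standardized, sk_standard))
```

Note that pandas' own `Series.std()` defaults to the sample standard deviation (ddof=1), so a hand check done with pandas defaults would give slightly different values from the Age_Standardized column.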

# Binarization for 'Income' with a threshold of 75,000

binarizer = Binarizer(threshold=75000)
data['Income_Binary'] = binarizer.fit_transform(data[['Income']])
print("\nAfter Binarization:")
print(data)

After Binarization:
   Category  Age  Income  Has_Car  Category_Encoded  Has_Car_Encoded  \
0         A   23   50000      Yes                 0                1
1         B   45   60000       No                 1                0
2         C   31   70000      Yes                 2                1
3         A   22   80000       No                 0                0
4         B   35   90000      Yes                 1                1
5         C   30  100000       No                 2                0

   Income_MinMax  Age_Standardized  Income_Binary
0            0.0         -1.035676              0
1            0.2          1.812434              0
2            0.4          0.000000              0
3            0.6         -1.165136              1
4            0.8          0.517838              1
5            1.0         -0.129460              1
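
Binarizer maps values strictly greater than the threshold to 1 and all other values to 0, so an income of exactly 75,000 would be encoded as 0. A minimal sketch of the same rule written as a plain pandas comparison follows; the `income` frame and the boundary check are illustrative additions, not part of the notebook.

```python
import pandas as pd
from sklearn.preprocessing import Binarizer

income = pd.DataFrame({'Income': [50000, 60000, 70000, 80000, 90000, 100000]})

# scikit-learn: 1 where value > threshold, otherwise 0
sk_binary = Binarizer(threshold=75000).fit_transform(income[['Income']]).ravel()

# Equivalent pandas expression using a strict comparison
pd_binary = (income['Income'] > 75000).astype(int).to_numpy()

# Boundary case: a value equal to the threshold is mapped to 0
boundary = Binarizer(threshold=75000).fit_transform([[75000]])[0, 0]

print(sk_binary, pd_binary, boundary)
```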

# Save the processed dataset

data.to_csv('processed_data.csv', index=False)
print("\nProcessed dataset saved as 'processed_data.csv'")

Processed dataset saved as 'processed_data.csv'
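
Passing index=False keeps the row index out of the file, so reading the CSV back yields the same columns without an extra unnamed index column. The sketch below illustrates this round trip on a small stand-in frame; the temporary directory and the `df` contents are assumptions chosen so the example does not overwrite the notebook's own output file.

```python
import os
import tempfile

import pandas as pd

# Small stand-in for the processed dataset (illustrative values only)
df = pd.DataFrame({
    'Category': ['A', 'B'],
    'Income_MinMax': [0.0, 0.2],
    'Income_Binary': [0, 1],
})

# Write to a temporary directory rather than the working directory
path = os.path.join(tempfile.mkdtemp(), 'processed_data.csv')
df.to_csv(path, index=False)

# Read the file back and compare its structure with the original frame
reloaded = pd.read_csv(path)
same_columns = list(reloaded.columns) == list(df.columns)
same_shape = reloaded.shape == df.shape

print(same_columns, same_shape)
```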

https://colab.research.google.com/drive/1vzCv7xFKj-Mru4D-MXvHU496haU-bL0I#scrollTo=8V8mxZ5Uhops&printMode=true 2/2
