0% found this document useful (0 votes)
16 views4 pages

Assignment No. 1

Uploaded by

Rishu Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views4 pages

Assignment No. 1

Uploaded by

Rishu Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Name : Rishu Singh

Class : TE-C/T213050
---------------------------------------------------------------------------------------------------------------

PRACTICAL NO: 01
Problem Statement :
Data Wrangling, I

Perform the following operations using Python on any open source dataset (e.g., data.csv)

1. Import all the required Python Libraries.

2. Locate an open source data from the web (e.g., https://www.kaggle.com). Provide a clear
description of the data and its source (i.e., URL of the web site).

3. Load the Dataset into pandas dataframe.

4. Data Preprocessing: check for missing values in the data using pandas isnull(), describe() function
to get some initial statistics. Provide variable descriptions. Types of variables etc. Check the
dimensions of the data frame.

5. Data Formatting and Data Normalization: Summarize the types of variables by checking the data
types (i.e., character, numeric, integer, factor, and logical) of the variables in the data set. If variables
are not in the correct data type, apply proper type conversions.

6. Turn categorical variables into quantitative variables in Python. In addition to the codes and
outputs, explain every operation that you do in the above steps and explain everything that you do
to import/read/scrape the data set.

You might also like