Assignment No. 1
Class : TE-C/T213050
---------------------------------------------------------------------------------------------------------------
PRACTICAL NO: 01
Problem Statement :
Data Wrangling I
Perform the following operations using Python on any open-source dataset (e.g., data.csv):
2. Locate an open-source dataset on the web (e.g., https://www.kaggle.com). Provide a clear
description of the data and its source (i.e., the URL of the web site).
4. Data Preprocessing: Check for missing values in the data using the pandas isnull() function, and
use the describe() function to get some initial statistics. Provide variable descriptions, including
the type of each variable. Check the dimensions of the data frame.
5. Data Formatting and Data Normalization: Summarize the types of variables by checking the data
types (i.e., character, numeric, integer, factor, and logical) of the variables in the data set. If variables
are not in the correct data type, apply proper type conversions.
6. Turn categorical variables into quantitative variables in Python. In addition to the code and
outputs, explain every operation that you perform in the above steps, and explain everything that
you do to import/read/scrape the data set.
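
The operations in steps 4 to 6 can be sketched as follows. This is a minimal illustration, not a model answer: the inline CSV text, its column names (name, age, gender, score, passed) and the helper load_data are hypothetical stand-ins for whichever dataset is actually chosen, and the conversions shown are only one reasonable choice for this toy data.

```python
import io
import pandas as pd

# Hypothetical stand-in for data.csv; columns and values are invented for illustration.
CSV_TEXT = """name,age,gender,score,passed
Asha,21,F,78.5,yes
Ravi,,M,64.0,no
Meera,23,F,,yes
John,22,M,81.0,yes
"""


def load_data(source):
    """Read CSV data from a file path or file-like object into a DataFrame."""
    return pd.read_csv(source)


# In practice this would be load_data("data.csv") for a downloaded file.
df = load_data(io.StringIO(CSV_TEXT))

# Step 4: dimensions, missing values, and initial statistics.
n_rows, n_cols = df.shape                  # (rows, columns) of the data frame
missing_per_column = df.isnull().sum()     # count of missing values per column
summary = df.describe()                    # statistics for the numeric columns
print("Shape:", (n_rows, n_cols))
print(missing_per_column)
print(summary)

# Step 5: inspect the data types, then convert where they are not appropriate.
print(df.dtypes)
# "age" contains a missing value, so pandas reads it as float64;
# the nullable Int64 type keeps it an integer while preserving the gap.
df["age"] = df["age"].astype("Int64")
# "passed" holds yes/no text; map it to a logical (boolean) column.
df["passed"] = df["passed"].map({"yes": True, "no": False}).astype("bool")
# "gender" is a categorical (factor-like) variable.
df["gender"] = df["gender"].astype("category")

# Step 6: turn the categorical variable into quantitative variables.
# Option A: integer codes, one code per category (categories sorted alphabetically).
df["gender_code"] = df["gender"].cat.codes
# Option B: one-hot (dummy) columns, one column per category.
gender_dummies = pd.get_dummies(df["gender"], prefix="gender")
print(df)
print(gender_dummies)
```

Integer codes imply an ordering between categories, so one-hot columns are usually the safer choice for nominal variables such as gender; codes suit ordinal variables better.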