ml file
Objective
This exercise demonstrates basic data manipulation with Python's pandas library:
inspecting, exporting, filtering, and grouping a DataFrame.
Dataset Overview
The Raw Housing Prices dataset provides detailed information about housing sales. This
dataset is useful for understanding pricing trends, property characteristics, and market
behaviors.
Data Description
- Date House was Sold: The date when the house was sold.
- Sale Price: The price at which the house was sold.
- Zipcode: The ZIP (postal) code of the property's location.
- Bedrooms: The number of bedrooms in the house.
- Bathrooms: The number of bathrooms in the house.
- Living Area (sqft): The living space size in square feet.
- Lot Area (sqft): The size of the lot in square feet.
- Floors: The number of floors in the house.
- Waterfront View: Whether the house has a view of the waterfront.
- Condition: The overall condition of the property.
Purpose
- Price Trend Analysis: Identifying pricing trends over time and across locations.
- Property Segmentation: Analyzing features that affect property prices.
- Location Insights: Understanding how location impacts housing prices.
- Market Behavior: Evaluating market behaviors to assist in real estate decision-making.
# Import Libraries
import pandas as pd
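The cell that loads the data into `df` is not present in this export, so the calls that follow have nothing to operate on. A minimal sketch of loading, assuming the data is a CSV whose headers match the Data Description; the in-memory `sample_csv` and its rows are invented for illustration and are not taken from the real dataset:

```python
import io
import pandas as pd

# Illustrative rows only: the values are made up, and the column names are
# assumed to match the Data Description above.
sample_csv = io.StringIO(
    "Date House was Sold,Sale Price,Zipcode,Bedrooms,Bathrooms,"
    "Living Area (sqft),Lot Area (sqft),Floors,Waterfront View,Condition\n"
    "2017-10-14,221900,98178,3,1.0,1180,5650,1,No,Fair\n"
    "2017-12-14,538000,98125,3,2.25,2570,7242,2,No,Fair\n"
    "2016-02-15,180000,98028,2,1.0,770,10000,1,No,Fair\n"
)

# With the real file, pass its path to read_csv instead of sample_csv.
# parse_dates converts the sale date column from text to datetime64.
df = pd.read_csv(sample_csv, parse_dates=["Date House was Sold"])
```

Parsing the sale date at load time is a choice made here so that later time-based analysis can work with real datetimes rather than strings.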
# Column names, dtypes, and non-null counts (assumes the housing data is already loaded into df)
df.info()
Q 1.4) Generating Descriptive Statistics
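# Count, mean, std, min, quartiles, and max for each numeric column
df.describe()
# Tuple of (number of rows, number of columns)
df.shape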
Q 2) Exporting Data
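No code for this step survives in the export. One possible sketch uses `DataFrame.to_csv`; the small DataFrame, the temporary directory, and the file name `housing_export.csv` are all assumptions made for illustration:

```python
import os
import tempfile
import pandas as pd

# Illustrative stand-in for the housing DataFrame (made-up values).
df = pd.DataFrame({
    "Sale Price": [221900, 538000, 180000],
    "Zipcode": [98178, 98125, 98028],
    "Bedrooms": [3, 3, 2],
})

# Hypothetical output location; any writable path works.
out_path = os.path.join(tempfile.mkdtemp(), "housing_export.csv")

# index=False leaves the row index out of the file, so reading it back
# does not produce an extra unnamed column.
df.to_csv(out_path, index=False)

# Read the file back to confirm the export round-trips.
round_trip = pd.read_csv(out_path)
```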
Q 3) Filtering Data
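The filtering criteria for this question are not given in the export. As a hedged sketch, rows can be selected with boolean masks; the thresholds, the `Yes`/`No` encoding of Waterfront View, and the sample values below are assumptions:

```python
import pandas as pd

# Illustrative stand-in for the housing DataFrame (made-up values).
df = pd.DataFrame({
    "Sale Price": [221900, 538000, 180000, 604000],
    "Bedrooms": [3, 3, 2, 4],
    "Waterfront View": ["No", "No", "No", "Yes"],
})

# Combine conditions with & (and) or | (or); each condition needs its own
# parentheses because & binds more tightly than the comparisons.
mask = (df["Bedrooms"] >= 3) & (df["Sale Price"] < 600000)
affordable_family = df.loc[mask]

# Single-condition filter on a text column.
waterfront = df.loc[df["Waterfront View"] == "Yes"]
```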
Q 4) Grouping Data
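The grouping key and aggregation for this question are not given in the export. A minimal sketch, assuming the goal is the average sale price per Zipcode (in line with the Location Insights purpose above); the sample values are invented:

```python
import pandas as pd

# Illustrative stand-in for the housing DataFrame (made-up values).
df = pd.DataFrame({
    "Zipcode": [98178, 98125, 98178, 98125],
    "Sale Price": [200000, 500000, 300000, 700000],
    "Bedrooms": [3, 3, 2, 4],
})

# Group rows by Zipcode, then compute the mean sale price and the number
# of sales in each group, with the most expensive area first.
price_by_zip = (
    df.groupby("Zipcode")["Sale Price"]
      .agg(["mean", "count"])
      .sort_values("mean", ascending=False)
)
```

Including `count` next to `mean` shows how many sales each average rests on, which matters when some zipcodes have very few rows.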