0% found this document useful (0 votes)

3 views

DS3.1

The document outlines a lab practical focused on Python data types and libraries such as NumPy, Matplotlib, and Pandas for data manipulation and visualization. Students will perform tasks including creating arrays, generating charts, and analyzing datasets to gain hands-on experience. The conclusion emphasizes the importance of these libraries in efficiently handling and interpreting large datasets.

Uploaded by

Armankhan Pathan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

DS3.1

Uploaded by

Armankhan Pathan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Enrolment No.

: 210430116083

Experiment No: 3

Date:

AIM: Study of Basics of Python data types, NumPy, Matplotlib, Pandas.

Relevant CO: CO1, CO2

Objective:
The objective of this lab practical is to gain hands-on experience with NumPy, Matplotlib, and
Pandas libraries to manipulate and visualize data. Through this practical, students will learn
how to use different functions of these libraries to perform various data analysis tasks.

Materials Used:
- Python programming environment
- NumPy library
- Matplotlib library
- Pandas library
- Dataset file (provided by faculty)
//Example of dataset file like sales_Data.csv
o Date: Date of sale
o Product: Name of the product sold
o Units Sold: Number of units sold
o Revenue: Total revenue generated from the sale
o Region: Geographic region where the sale took place
o Salesperson: Name of the salesperson who made the sale

Procedures:

Part 1: NumPy
1. Import the NumPy library into Python.
2. Create a NumPy array with the following specifications:
a. Dimensions: 5x5
b. Data type: integer
c. Values: random integers between 1 and 100
3. Reshape the array into a 1x25 array and calculate the mean, median, variance, and standard
deviation using NumPy functions.
4. Generate a random integer array of length 10 and find the percentile, decile, and quartile
values using NumPy functions.

Part 2: Matplotlib
1. Import the Matplotlib library into Python.
2. Create a simple bar chart using the following data:
a. X-axis values: ['A', 'B', 'C', 'D']
b. Y-axis values: [10, 20, 30, 40]
3. Customize the plot by adding a title, axis labels, and changing the color and style of the bars.
4. Create a pie chart using the following data:
a. Labels: ['Red', 'Blue', 'Green', 'Yellow']
b. Values: [20, 30, 10, 40]
5. Customize the pie chart by adding a title, changing the colors of the slices, and adding a

15
Enrolment No.: 210430116083

legend.

Part 3: Pandas
1. Import the Pandas library into Python.
2. Load the "sales_data.csv" file into a Pandas data frame.
3. Calculate the following statistics for the Units Sold and Revenue columns:
a. Mean
b. Median
c. Variance
d. Standard deviation
4. Group the data frame by Product and calculate the mean, median, variance, and standard
deviation of Units Sold and Revenue for each product using Pandas functions.
5. Create a line chart to visualize the trend of Units Sold and Revenue over time for each
product.

Interpretation/Program/code:
Part 1:
import numpy as np

np.random.seed(0)

arr = np.random.randint(1, 101, size=(5, 5), dtype=int)

print(arr)

reshaped_arr = arr.reshape(1, 25)

mean = np.mean(reshaped_arr)
median = np.median(reshaped_arr)
variance = np.var(reshaped_arr)
std_dev = np.std(reshaped_arr)

print("Mean:", mean)
print("Median:", median)
print("Variance:", variance)
print("Standard Deviation:", std_dev)

percentiles = np.percentile(arr, [10, 25, 50, 75, 90])

deciles = np.percentile(arr, [10, 20, 30, 40, 50, 60, 70, 80, 90])

16
Enrolment No.: 210430116083

quartiles = np.percentile(arr, [25, 50, 75])

print("Percentiles:", percentiles)
print("Deciles:", deciles)
print("Quartiles:", quartiles)

Part 2:
import matplotlib.pyplot as mt\
x_values = ['A', 'B', 'C', 'D']
y_values = [10, 20, 30, 40]
mt.bar(x_values,y_values)
mt.xlabel('X-axis')
mt.ylabel('Y-axis')
mt.title('Simple Bar Chart')
mt.show()

mt.bar(x_values, y_values, color='skyblue', edgecolor='black', linestyle='--')

mt.show()

17
Enrolment No.: 210430116083

labels = ['Red', 'Blue', 'Green', 'Yellow']

values = [20, 30, 10, 40]
mt.pie(values,labels=labels)
mt.title('Pie Chart')
mt.show()

colors = ['red', 'blue', 'green', 'yellow']

mt.pie(values, labels=labels, colors=colors)
mt.legend()
mt.show()

18
Enrolment No.: 210430116083

Part 3:
import pandas as pd
df=pd.read_csv(r'sales_data.csv')
mean_units_sold = df['Order_Quantity'].mean()
median_units_sold = df['Order_Quantity'].median()
variance_units_sold = df['Order_Quantity'].var()
std_dev_units_sold = df['Order_Quantity'].std()

mean_revenue = df['Revenue'].mean()
median_revenue = df['Revenue'].median()
variance_revenue = df['Revenue'].var()
std_dev_revenue = df['Revenue'].std()

print("Units Sold:")
print("Mean:", mean_units_sold)
print("Median:", median_units_sold)
print("Variance:", variance_units_sold)
print("Standard Deviation:", std_dev_units_sold)

print("\nRevenue:") print("Mean:",
mean_revenue) print("Median:",
median_revenue)
print("Variance:", variance_revenue)
print("Standard Deviation:", std_dev_revenue)

19
Enrolment No.: 210430116083

grouped_df = df.groupby('Product').agg({'Order_Quantity': ['mean', 'median', 'var', 'std'],

'Revenue': ['mean', 'median', 'var', 'std']})

print(grouped_df)

20
Enrolment No.: 210430116083

df['Date'] = pd.to_datetime(df['Date'])

groupe_df = df.groupby(['Product', 'Date']).sum().reset_index()

mt.plot(groupe_df[grouped_df['Product']=='Hitch Rack - 4-Bike']['Date'],

groupe_df[groupe_df['Product'] == 'Hitch Rack - 4-Bike']['Order_Quantity'], label='Product
A')
mt.plot(groupe_df[grouped_df['Product'] == 'Sport-100 Helmet, Black']['Date'],
groupe_df[groupe_df['Product'] == 'Sport-100 Helmet, Black']['Order_Quantity'],
label='Product B')
mt.plot(groupe_df[grouped_df['Product'] == 'Long-Sleeve Logo Jersey, L']['Date'],
groupe_df[groupe_df['Product'] == 'Long-Sleeve Logo Jersey, L']['Order_Quantity'],
label='Product C')
mt.xlabel('Date')
mt.ylabel('Units Sold')
mt.legend()
mt.show()

21
Enrolment No.: 210430116083

Conclusion:
In conclusion, this lab practical provided hands-on experience with NumPy, Matplotlib, and
Pandas libraries in Python for data manipulation and visualization. These libraries have wide-
ranging applications in various fields, enabling researchers and analysts to gain insights from
large datasets quickly and efficiently. Through exercises such as calculating statistical
measures and visualizing data using charts, we explored the functionality and flexibility of
these powerful data analysis tools. Overall, gaining proficiency in these libraries equips
individuals to tackle complex data analysis challenges and contribute to their respective fields
of study or industries.

Quiz:
1. What is the difference between a list and a tuple in Python?
2. How can you use NumPy to generate an array of random numbers?

Suggested References:-
1. Dinesh Kumar, Business Analytics, Wiley India Business alytics: The Science
2. V.K. Jain, Data Science & Analytics, Khanna Book Publishing, New Delhi of Dat
3. Data Science For Dummies by Lillian Pierson , Jake Porway
Rubrics wise marks obtained

Understanding of Analysis of Capability of Documentation

Problem the Problem writing program Total

02 02 05 01 10

2021-Jack W Baker-Seismic Hazard and Risk Analysis
No ratings yet
2021-Jack W Baker-Seismic Hazard and Risk Analysis
595 pages
N. David Mermin - Space and Time in Special Relativity-McGraw-Hill, Inc. (1968) PDF
100% (2)
N. David Mermin - Space and Time in Special Relativity-McGraw-Hill, Inc. (1968) PDF
264 pages
Bba-Fundamentals of Business Mathematics Questions
100% (1)
Bba-Fundamentals of Business Mathematics Questions
45 pages
EXP1-siddhant gupta (23_SE_148)
No ratings yet
EXP1-siddhant gupta (23_SE_148)
17 pages
EDA LAB ASSIGNMENT2
No ratings yet
EDA LAB ASSIGNMENT2
10 pages
PDS_Exp_10_to_12
No ratings yet
PDS_Exp_10_to_12
8 pages
DV Lab Manual 2022-23
No ratings yet
DV Lab Manual 2022-23
10 pages
fdsa lab manual final
No ratings yet
fdsa lab manual final
70 pages
Machinelearning Prac
No ratings yet
Machinelearning Prac
17 pages
BDA File
No ratings yet
BDA File
26 pages
Data Science
No ratings yet
Data Science
18 pages
Class X - A.I. - Practical Lab Manual - VVA 2024-25
No ratings yet
Class X - A.I. - Practical Lab Manual - VVA 2024-25
50 pages
Unit 5 PythonPackages(Matplotlib)
No ratings yet
Unit 5 PythonPackages(Matplotlib)
24 pages
ML3_Data_Analysis
No ratings yet
ML3_Data_Analysis
80 pages
End semester Answer key format-fods
No ratings yet
End semester Answer key format-fods
8 pages
Fundamentals of Data Science Lab Manual
No ratings yet
Fundamentals of Data Science Lab Manual
34 pages
FODS_LAB_MANUAL
No ratings yet
FODS_LAB_MANUAL
26 pages
AI Lab 06 Lab Tasks
No ratings yet
AI Lab 06 Lab Tasks
11 pages
prac2
No ratings yet
prac2
11 pages
Answers 1
No ratings yet
Answers 1
17 pages
12 Ip Practical List With Solution Complete
No ratings yet
12 Ip Practical List With Solution Complete
5 pages
Data Science Algorithmen Master - 02 Data Handling
No ratings yet
Data Science Algorithmen Master - 02 Data Handling
76 pages
Guides
No ratings yet
Guides
23 pages
ML(sudhanshu)
No ratings yet
ML(sudhanshu)
24 pages
final dev record
No ratings yet
final dev record
49 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
Data science and analtics Laboratory
No ratings yet
Data science and analtics Laboratory
21 pages
Study Material For XII Computer Science On: Data Visualization Using Pyplot
No ratings yet
Study Material For XII Computer Science On: Data Visualization Using Pyplot
22 pages
DATA_ANALYTICS_LAB_MANUAL_FINAL1[1]
No ratings yet
DATA_ANALYTICS_LAB_MANUAL_FINAL1[1]
32 pages
KJD ML File
No ratings yet
KJD ML File
45 pages
Data Visualization - 1 by Matplot Lib
No ratings yet
Data Visualization - 1 by Matplot Lib
19 pages
Data Visualization - Matplotlib PDF
100% (1)
Data Visualization - Matplotlib PDF
15 pages
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
No ratings yet
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
28 pages
Assignment 4 On Visualization On Graph With Solution
No ratings yet
Assignment 4 On Visualization On Graph With Solution
14 pages
FDA Lab Manual Final
No ratings yet
FDA Lab Manual Final
42 pages
BTech_5_CSE_Data_Analytics_Using_Python_Unit_5_Notes
No ratings yet
BTech_5_CSE_Data_Analytics_Using_Python_Unit_5_Notes
9 pages
Visualisation All
0% (1)
Visualisation All
70 pages
exp1
No ratings yet
exp1
5 pages
Data Visualisation
No ratings yet
Data Visualisation
5 pages
AI Lab 06 Lab Tasks
No ratings yet
AI Lab 06 Lab Tasks
6 pages
ML Lab File
No ratings yet
ML Lab File
43 pages
Data Visualization
No ratings yet
Data Visualization
48 pages
Python Unit IV
No ratings yet
Python Unit IV
12 pages
EX-02-Data manipulation pandas matplot
No ratings yet
EX-02-Data manipulation pandas matplot
9 pages
EDAP LAB
No ratings yet
EDAP LAB
47 pages
ML Lab Manual
No ratings yet
ML Lab Manual
28 pages
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
No ratings yet
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
13 pages
FDS Final Manual
No ratings yet
FDS Final Manual
41 pages
Data Visualization Python Tutorial
No ratings yet
Data Visualization Python Tutorial
9 pages
BIDA practical print
No ratings yet
BIDA practical print
56 pages
ML 1-11
No ratings yet
ML 1-11
27 pages
prac2
No ratings yet
prac2
11 pages
Numpy and Pandas
No ratings yet
Numpy and Pandas
11 pages
Data Visualizations in Python With Matplotlib: Sidita Duli, PHD
No ratings yet
Data Visualizations in Python With Matplotlib: Sidita Duli, PHD
6 pages
PR final file
No ratings yet
PR final file
49 pages
2,3. Introduction Pandas & Matplotlib - Copy
No ratings yet
2,3. Introduction Pandas & Matplotlib - Copy
32 pages
DEV Lab Record - Merged
No ratings yet
DEV Lab Record - Merged
28 pages
Unit 4 python
No ratings yet
Unit 4 python
12 pages
Data Visualization in Python
No ratings yet
Data Visualization in Python
11 pages
Report
No ratings yet
Report
18 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
36 pages
Data Science with R: Beginner to Expert
From Everand
Data Science with R: Beginner to Expert
Narayana Nemani
No ratings yet
Maths Revision Questions
No ratings yet
Maths Revision Questions
11 pages
Units and Conversions
No ratings yet
Units and Conversions
2 pages
Principles of Economics: Twelfth Edition
No ratings yet
Principles of Economics: Twelfth Edition
35 pages
Guozhen Wu - Tsinghua University Press - Vibrational Spectroscopy-De Gruyter (2019)
No ratings yet
Guozhen Wu - Tsinghua University Press - Vibrational Spectroscopy-De Gruyter (2019)
206 pages
Immediate download Pathfinder CDS Combined Defence Services Entrance Examination 2020-21 Edition Arihant Experts ebooks 2025
100% (1)
Immediate download Pathfinder CDS Combined Defence Services Entrance Examination 2020-21 Edition Arihant Experts ebooks 2025
84 pages
Global Positioning System (GPS) Antenna Calibration at The National Geodetic Survey
No ratings yet
Global Positioning System (GPS) Antenna Calibration at The National Geodetic Survey
12 pages
Top of Form
No ratings yet
Top of Form
13 pages
Lesson 1 - Steps in Hypothesis Testing
No ratings yet
Lesson 1 - Steps in Hypothesis Testing
22 pages
Ariel Li Basic Geometry
No ratings yet
Ariel Li Basic Geometry
5 pages
First Quarter Periodic Test in Mathematics 8 Table of Specification
No ratings yet
First Quarter Periodic Test in Mathematics 8 Table of Specification
1 page
JNTU Mechanical Engineering (R09) Syllabus Book
100% (1)
JNTU Mechanical Engineering (R09) Syllabus Book
147 pages
NCERT Solutions Class9 Motion-And-Rest PDF
No ratings yet
NCERT Solutions Class9 Motion-And-Rest PDF
19 pages
Quarter 1 - Module 2 Textual Aids
No ratings yet
Quarter 1 - Module 2 Textual Aids
38 pages
413409G 2MAM20 AT2 Surge and Logistics Investigation
No ratings yet
413409G 2MAM20 AT2 Surge and Logistics Investigation
12 pages
Ansa For Automatic Meshing
No ratings yet
Ansa For Automatic Meshing
6 pages
Module 4 Circuit Theorems
No ratings yet
Module 4 Circuit Theorems
24 pages
Machine Learning (R17A0534) Lecture Notes: B.Tech Iv Year - I Sem (R17) (2020-21)
No ratings yet
Machine Learning (R17A0534) Lecture Notes: B.Tech Iv Year - I Sem (R17) (2020-21)
5 pages
RC&RL
No ratings yet
RC&RL
25 pages
3rd Unpackingmath
No ratings yet
3rd Unpackingmath
25 pages
MATHS
No ratings yet
MATHS
4 pages
13 Sacred Geometry Forms
100% (3)
13 Sacred Geometry Forms
7 pages
Energy-based Homogenization Method for Lattice Structures With Generalized Periodicity
No ratings yet
Energy-based Homogenization Method for Lattice Structures With Generalized Periodicity
20 pages
Property Definition
No ratings yet
Property Definition
2 pages
Cubic Spline Tutorial v3
No ratings yet
Cubic Spline Tutorial v3
10 pages
Gibbs & Apell Equations
No ratings yet
Gibbs & Apell Equations
9 pages
Electric Current - DPP 04 - Abhimanyu 2.0 (Telugu)
No ratings yet
Electric Current - DPP 04 - Abhimanyu 2.0 (Telugu)
3 pages
Theory of Electromagnetic Fields: Part II: Standing Waves
No ratings yet
Theory of Electromagnetic Fields: Part II: Standing Waves
73 pages

DS3.1

Uploaded by

DS3.1

Uploaded by

Enrolment No.

AIM: Study of Basics of Python data types, NumPy, Matplotlib, Pandas.

Relevant CO: CO1, CO2

arr = np.random.randint(1, 101, size=(5, 5), dtype=int)

reshaped_arr = arr.reshape(1, 25)

percentiles = np.percentile(arr, [10, 25, 50, 75, 90])

quartiles = np.percentile(arr, [25, 50, 75])

mt.bar(x_values, y_values, color='skyblue', edgecolor='black', linestyle='--')

labels = ['Red', 'Blue', 'Green', 'Yellow']

colors = ['red', 'blue', 'green', 'yellow']

grouped_df = df.groupby('Product').agg({'Order_Quantity': ['mean', 'median', 'var', 'std'],

groupe_df = df.groupby(['Product', 'Date']).sum().reset_index()

mt.plot(groupe_df[grouped_df['Product']=='Hitch Rack - 4-Bike']['Date'],

Understanding of Analysis of Capability of Documentation

You might also like