Vanshika Goyal Gec Practicals

The document outlines a series of practical exercises for a Data Visualization using Python course. It includes tasks involving the numpy and pandas libraries for statistical analysis, data manipulation, and visualization techniques. The exercises cover various topics such as computing statistics, creating and reshaping arrays, handling missing values, and performing data merges and visualizations with the Iris dataset.

Uploaded by

vanshikagoyal726

GEC - PRACTICALS

NAME : VANSHIKA GOYAL

ROLL NO. : 24504061
COURSE : BCOM [HONS.]
SUBJECT : DATA VISUALISATION USING PYTHON
1. Write programs in Python using the NumPy library to do the following:

a. Compute the mean, standard deviation, and variance of a two-dimensional random integer
array along the second axis.
In[1] import numpy as np
# 3 x 4 array of random integers drawn from [2, 20)
array1=np.random.randint(2,20,size=(3,4))
print(array1)
# axis=1 is the second axis: one statistic per row, taken across its columns
print('mean of random array along second axis:',np.mean(array1,axis=1))
print('standard deviation of random array along second axis:',np.std(array1,axis=1))
print('variance of random array along second axis:',np.var(array1,axis=1))

Out[1] [[15  6  7 16]
 [ 6 14 10  6]
 [14  7  5  2]]
mean of random array along second axis: [11. 9. 7.]
standard deviation of random array along second axis: [4.52769257 3.31662479 4.41588043]
variance of random array along second axis: [20.5 11. 19.5]
b. Create a 2-dimensional array of m x n integer elements, print the shape, type
and data type of the array, and then reshape it into an n x m array, where n and m are user
inputs given at run time.
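No solution is recorded for this task. The following is a minimal sketch of one way to approach it; the function name `build_and_reshape` and the sample sizes 3 and 4 are illustrative assumptions, and the array is filled with consecutive integers simply to make the reshape easy to follow.

```python
import numpy as np

def build_and_reshape(m, n):
    """Create an m x n integer array, report its properties, reshape to n x m."""
    arr = np.arange(m * n).reshape(m, n)   # integers 0 .. m*n-1
    print("array:\n", arr)
    print("shape:", arr.shape)
    print("type:", type(arr))
    print("data type:", arr.dtype)
    reshaped = arr.reshape(n, m)           # same elements, n rows and m columns
    print("reshaped array:\n", reshaped)
    return arr, reshaped

# At run time the sizes would be read from the user, for example:
#   m = int(input("rows m: "))
#   n = int(input("columns n: "))
# Fixed sample values are used here (an assumption) so the sketch runs unattended.
arr, reshaped = build_and_reshape(3, 4)
```

Note that `reshape` keeps the elements in row-major order, so the result is not the transpose of the original array.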
c. Test whether the elements of a given 1D array are zero, non-zero or NaN. Record the
indices of these elements in separate arrays.
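No solution is recorded for this task. A possible sketch follows; the sample array is made up for illustration, and treating NaN as its own category (rather than as non-zero) is an assumption about what the task intends.

```python
import numpy as np

# Sample data (hypothetical): a float array, since NaN cannot be stored in an integer array
arr = np.array([0, 5, np.nan, 0, 7, np.nan, 3])

is_nan = np.isnan(arr)
zero_idx = np.where(arr == 0)[0]                  # positions holding 0
nan_idx = np.where(is_nan)[0]                     # positions holding NaN
# NaN != 0 evaluates to True, so NaN positions are excluded explicitly
nonzero_idx = np.where((arr != 0) & ~is_nan)[0]

print("zero elements at:", zero_idx)
print("non-zero elements at:", nonzero_idx)
print("NaN elements at:", nan_idx)
```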
d. Create three random arrays of the same size: Array1, Array2, Array3. Subtract Array2 from
Array3 and store the result in Array4. Create another array, Array5, having two times the values in Array1.
Find the covariance and correlation of Array1 with Array4 and Array5 respectively.
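No solution is recorded for this task. One way it might be done is sketched below; the array size of 10, the value range and the fixed seed are assumptions chosen only to make the run repeatable.

```python
import numpy as np

rng = np.random.default_rng(0)            # fixed seed (assumption) for repeatable output
array1 = rng.integers(1, 50, size=10)
array2 = rng.integers(1, 50, size=10)
array3 = rng.integers(1, 50, size=10)

array4 = array3 - array2                  # element-wise difference
array5 = 2 * array1                       # twice the values of array1

# np.cov and np.corrcoef return 2 x 2 matrices; the off-diagonal entry
# is the statistic between the two inputs.
cov_1_4 = np.cov(array1, array4)[0, 1]
corr_1_5 = np.corrcoef(array1, array5)[0, 1]

print("covariance of array1 with array4:", cov_1_4)
print("correlation of array1 with array5:", corr_1_5)
```

Because Array5 is an exact positive multiple of Array1, their correlation comes out as 1 up to floating-point rounding.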
e. Create 2 random arrays of the same size 10: array1, array2. Find the sum of the first half
of both the arrays and the product of the second half of both the arrays.
In[1] import numpy as np
arr1=np.random.random(size=10)
arr2=np.random.random(size=10)
print(arr1)
print(arr2)
arr3=arr1[0:5]+arr2[0:5]
arr4=arr1[5:10]*arr2[5:10]
print(arr3)
print(arr4)

Out[1] [0.63618375 0.86171874 0.56897631 0.37959409 0.34805725 0.91758604
 0.17253892 0.77094538 0.95741841 0.95282946]
[0.60511907 0.51368738 0.73009941 0.87229216 0.11689907 0.57703957
0.81210364 0.70982319 0.33538714 0.4209075 ]
[1.24130282 1.37540612 1.29907573 1.25188625 0.46495632]
[0.52948345 0.14011949 0.54723491 0.32110582 0.40105307]
2. Do the following using Pandas series:

a. Create a series with 5 elements. Display the series sorted on index and also sorted on values separately.
In[1] import pandas as pd
s1=pd.Series([9,5,0,8,6],index=['a','b','c','d','e'])
x=s1.sort_index()
y=s1.sort_values()
print(x)
print(y)
Out[1] a 9
b 5
c 0
d 8
e 6
dtype: int64
c 0
b 5
e 6
d 8
a 9
dtype: int64
b. Create a series with N elements with some duplicate values. Find the minimum and
maximum ranks assigned to the values using the 'first' and 'max' methods.
In[1] import pandas as pd
s2=pd.Series([8,5,4,3,1,2],index=['a','b','c','d','e','f'])
x=s2.rank(method='first')
y=s2.rank(method='max')
print(x)
print(y)

Out[1] a 6.0
b 5.0
c 4.0
d 3.0
e 1.0
f 2.0
dtype: float64
a 6.0
b 5.0
c 4.0
d 3.0
e 1.0
f 2.0
dtype: float64
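The series above contains no repeated values, so both methods return identical ranks. The sketch below uses a made-up series with duplicates (an assumption, not the author's data) to show where `'first'` and `'max'` differ.

```python
import pandas as pd

# Hypothetical series with two pairs of tied values (5, 5 and 3, 3)
s = pd.Series([8, 5, 5, 3, 1, 3], index=['a', 'b', 'c', 'd', 'e', 'f'])

# 'first': ties are ranked in the order they appear in the series
first_rank = s.rank(method='first')
# 'max': every member of a tie gets the highest rank in that group
max_rank = s.rank(method='max')

print(pd.DataFrame({'value': s, 'first': first_rank, 'max': max_rank}))
```

With `'first'` the two 5s receive ranks 4 and 5, while with `'max'` both receive 5.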
c. Display the index value of the minimum and maximum elements of a series.
In[] import pandas as pd
s=pd.Series([123,564,181,345,65,4567,41,5])
print(s)
print("index value of the maximum element is:",s.idxmax())
print("index value of the minimum element is:",s.idxmin())

Out[] 0 123
1 564
2 181
3 345
4 65
5 4567
6 41
7 5
dtype: int64
index value of the maximum element is: 5
index value of the minimum element is: 7
3. Create a data frame having at least 3 columns and 50 rows to store numerical data generated
using a random function. Replace 10% of the values with null values whose index positions are
generated using a random function.
a. Identify and count missing values in a data frame.
b. Drop the column having more than 5 null values.
c. Identify the row label having maximum of the sum of all values in a row and drop that row.
d. Sort the data on the basis of the first column.
e. Remove all the duplicates from the first column.
f. Find the correlation between first and second column and covariance between second and
third column.
g. Discretize the second column and create 5 bins.
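No solution is recorded for this task. The steps might be sketched as follows; the column names `A`, `B`, `C`, the value range and the fixed seed are assumptions. Each step is applied to the original frame rather than chained, so that dropping a column in step b does not remove a column needed later.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)   # fixed seed (assumption) for repeatable output
df = pd.DataFrame(rng.integers(1, 100, size=(50, 3)).astype(float),
                  columns=["A", "B", "C"])

# Replace 10% of the 150 cells with NaN at randomly drawn, distinct positions.
n_null = int(0.10 * df.size)
flat_pos = rng.choice(df.size, size=n_null, replace=False)
rows, cols = np.unravel_index(flat_pos, df.shape)
for r, c in zip(rows, cols):
    df.iat[int(r), int(c)] = np.nan

# a. Identify and count missing values.
null_counts = df.isnull().sum()
total_nulls = int(null_counts.sum())
print("nulls per column:\n", null_counts, "\ntotal:", total_nulls)

# b. Drop every column holding more than 5 nulls.
df_b = df.loc[:, null_counts <= 5]

# c. Find the row label with the largest row sum (NaN skipped) and drop that row.
row_sums = df.sum(axis=1)
max_label = row_sums.idxmax()
df_c = df.drop(index=max_label)

# d. Sort on the first column (NaN values are placed last by default).
df_d = df.sort_values(by="A")

# e. Remove duplicates in the first column, keeping the first occurrence.
df_e = df.drop_duplicates(subset="A")

# f. Correlation of columns 1 and 2, covariance of columns 2 and 3.
corr_ab = df["A"].corr(df["B"])
cov_bc = df["B"].cov(df["C"])
print("corr(A, B):", corr_ab, " cov(B, C):", cov_bc)

# g. Discretize the second column into 5 equal-width bins.
bins = pd.cut(df["B"], bins=5)
print(bins.value_counts().sort_index())
```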
4. Consider 2 Excel files having attendance of two workshops. Each file has 3 fields 'name', 'date',
'duration' (in minutes), where names are unique within a file. Note that the duration may take one of the
three values (30, 40, 50) only. Import the data into two data frames and do the following:

a. Perform merging of the two data frames to find the names of the students who
attended both workshops.
b. Find the names of all students who have attended a single workshop only.
c. Merge two data frames row wise and find the total number of records in the data frame.
d. Merge two data frames row wise and use two columns viz. names and dates as multi-row
indexes. Generate descriptive statistics for the hierarchical data frame.
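No solution is recorded for this task. A possible sketch is given below; the two frames are built in memory with invented names and dates as stand-ins for the Excel sheets. With real files they would be loaded with `pd.read_excel`, using whatever file names apply.

```python
import pandas as pd

# Hypothetical stand-ins for the two attendance sheets (all values are invented).
# With real files: w1 = pd.read_excel(<path to first workbook>), and likewise for w2.
w1 = pd.DataFrame({"name": ["Asha", "Bina", "Chirag", "Dev"],
                   "date": ["2024-01-10"] * 4,
                   "duration": [30, 40, 50, 30]})
w2 = pd.DataFrame({"name": ["Bina", "Dev", "Esha"],
                   "date": ["2024-02-15"] * 3,
                   "duration": [40, 50, 30]})

# a. Inner merge on name keeps only students present in both sheets.
both = pd.merge(w1, w2, on="name", how="inner", suffixes=("_w1", "_w2"))
print("attended both:", both["name"].tolist())

# b. Outer merge with an indicator column; rows not marked 'both'
#    belong to students who attended exactly one workshop.
outer = pd.merge(w1, w2, on="name", how="outer",
                 suffixes=("_w1", "_w2"), indicator=True)
single = outer.loc[outer["_merge"] != "both", "name"]
print("attended one only:", single.tolist())

# c. Row-wise merge (concatenation) and total record count.
stacked = pd.concat([w1, w2], ignore_index=True)
total_records = len(stacked)
print("total records:", total_records)

# d. Row-wise merge with (name, date) as a hierarchical row index,
#    followed by descriptive statistics of the remaining numeric column.
hier = stacked.set_index(["name", "date"]).sort_index()
stats = hier.describe()
print(stats)
```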
5. Using the Iris data, plot the following with proper legend and axis labels (download the Iris data from
https://archive.ics.uci.edu/ml/datasets/iris or import it from sklearn datasets):
a. Plot bar chart to show the frequency of each class label in the Data.
b. Draw a scatter plot for petal width vs. sepal width and fit a regression line.
c. Plot density distribution for feature petal length.
d. Use a pair plot to show pairwise bivariate distribution in the Iris Dataset.
e. Draw heatmap for the four numeric attributes.
g. Compute correlation coefficients between each pair of features and plot heatmap.
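No solution is recorded for this task. The sketch below loads the data from scikit-learn and uses only matplotlib, NumPy and pandas. Two substitutions are assumptions of this sketch: the pair plot is drawn with `pandas.plotting.scatter_matrix` rather than a seaborn pair plot, and a single correlation heatmap stands in for both heatmap items. The density plot relies on SciPy being installed.

```python
import matplotlib
matplotlib.use("Agg")   # non-interactive backend; drop this line inside a notebook
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
from sklearn.datasets import load_iris

iris = load_iris()
features = list(iris.feature_names)
df = pd.DataFrame(iris.data, columns=features)
df["species"] = iris.target_names[iris.target]

fig, axes = plt.subplots(2, 2, figsize=(12, 10))

# a. Bar chart of class frequencies.
counts = df["species"].value_counts()
axes[0, 0].bar(list(counts.index), counts.values, label="samples per class")
axes[0, 0].set_xlabel("Class label")
axes[0, 0].set_ylabel("Frequency")
axes[0, 0].legend()

# b. Scatter of petal width against sepal width with a least-squares line.
x = df["sepal width (cm)"].to_numpy()
y = df["petal width (cm)"].to_numpy()
slope, intercept = np.polyfit(x, y, 1)          # degree-1 fit: y = slope*x + intercept
xs = np.linspace(x.min(), x.max(), 100)
axes[0, 1].scatter(x, y, label="observations")
axes[0, 1].plot(xs, slope * xs + intercept, color="red", label="regression line")
axes[0, 1].set_xlabel("Sepal width (cm)")
axes[0, 1].set_ylabel("Petal width (cm)")
axes[0, 1].legend()

# c. Kernel density estimate of petal length.
df["petal length (cm)"].plot.kde(ax=axes[1, 0], label="petal length")
axes[1, 0].set_xlabel("Petal length (cm)")
axes[1, 0].set_ylabel("Density")
axes[1, 0].legend()

# e/g. Pairwise correlation coefficients shown as a heatmap.
corr = df[features].corr()
im = axes[1, 1].imshow(corr.to_numpy(), cmap="coolwarm", vmin=-1, vmax=1)
axes[1, 1].set_xticks(range(len(features)))
axes[1, 1].set_xticklabels(features, rotation=45, ha="right")
axes[1, 1].set_yticks(range(len(features)))
axes[1, 1].set_yticklabels(features)
fig.colorbar(im, ax=axes[1, 1], label="Pearson correlation")
fig.tight_layout()

# d. Pairwise scatter plots with density curves on the diagonal (separate figure).
pair_axes = pd.plotting.scatter_matrix(df[features], figsize=(8, 8), diagonal="kde")

# Call plt.show() or fig.savefig(...) to view or store the figures.
```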
6. Consider the following data frame containing a family name, gender of the family member
and his/her monthly income in each record.
NAME     GENDER    MONTHLY INCOME (RS.)
Shah     Male       11400.00
Vats     Male       65000.00
Vats     Female     43150.00
Kumar    Female     69500.00
Vats     Female    155000.00
Kumar    Male      103000.00
Shah     Male       55000.00
Shah     Female    112400.00
Kumar    Female     81030.00
Vats     Male       71900.00

Write a program in python using Pandas to perform the following:

a. Calculate and display family wise gross monthly income.

b. Calculate and display the member with highest monthly income.

c. Calculate and display monthly income of all members with income greater than Rs. 60000.00.

d. Calculate and display the average monthly income of female members.
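No solution is recorded for this task. A short sketch using the table above follows; the column names `Name`, `Gender` and `MonthlyIncome` are naming assumptions made for this example.

```python
import pandas as pd

# Data frame built from the table given in the task
df = pd.DataFrame({
    "Name": ["Shah", "Vats", "Vats", "Kumar", "Vats",
             "Kumar", "Shah", "Shah", "Kumar", "Vats"],
    "Gender": ["Male", "Male", "Female", "Female", "Female",
               "Male", "Male", "Female", "Female", "Male"],
    "MonthlyIncome": [11400.00, 65000.00, 43150.00, 69500.00, 155000.00,
                      103000.00, 55000.00, 112400.00, 81030.00, 71900.00],
})

# a. Family-wise gross monthly income: sum of income per family name.
family_income = df.groupby("Name")["MonthlyIncome"].sum()
print(family_income)

# b. Member with the highest monthly income (row at the index of the maximum).
top_member = df.loc[df["MonthlyIncome"].idxmax()]
print(top_member)

# c. Members whose monthly income exceeds Rs. 60000.00.
above_60k = df[df["MonthlyIncome"] > 60000]
print(above_60k)

# d. Average monthly income of female members.
female_avg = df.loc[df["Gender"] == "Female", "MonthlyIncome"].mean()
print("average income of female members:", female_avg)
```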
