0% found this document useful (0 votes)

30 views8 pages

Data Science Exam Answer Key 2024

The document outlines the examination details for the course 'Foundations of Data Science' at Jai Shriram Engineering College, including an answer key for various questions related to data science concepts. It covers topics such as project charters, data warehousing, correlation coefficients, and data visualization techniques using Python libraries like NumPy and Matplotlib. Additionally, it includes tasks for analyzing sales data using pandas and creating different types of plots.

Uploaded by

Dhanasekar Sethupathi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views8 pages

Data Science Exam Answer Key 2024

Uploaded by

Dhanasekar Sethupathi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Q.

P Code: 100114
JAI SHRIRAM ENGINEERING COLLEGE
An Autonomous Institution
B.E / B.Tech Degree Examinations Nov/ Dec – 2024

Course Code: CS3352 Course Name: Foundations of Data Science

Semester:3rd semester Max Marks: 100

Answer key
Part – A
10 x 2 = 20
1. Question: Identify the importance of project charter.
Answer:

A project charter authorizes a project, defines objectives, scope, and stakeholder roles,
ensuring alignment and clarity. It acts as a reference document throughout the project
lifecycle.

2. Question: Define Data Warehousing.

Answer:
Data warehousing involves collecting and storing data from multiple sources in a
centralized repository. It facilitates efficient querying, reporting, and decision-making.

3. Question: Given the following data set: 5,7,8,10,12,14,15,18,20.Calculate the

interquartile range.
Answer:

4. Question: Apply the formula to convert Z score to original score

Answer:
5. Question: Define z scores.
Answer:

6. Question: Identify the properties of correlation coefficient.

Answer:

7. Question: Name some NumPy Array attributes.

Answer:

8. Question: Write a comment to create two-dimensional array?

Answer:

# Create a 2D NumPy array using np.array()

import numpy as np
# Create a 2D array with 3 rows and 4 columns
array_2d = np.array([[1, 2, 3, 4], [5, 6, 7, 8],[9, 10, 11, 12]])
print(array_2d)
9. Question: How can you set different colors for bar plot?
Answer:
Use the color parameter in plt.bar().
plt.bar(x, y, color=['red', 'blue', 'green'])

10. Question: State the purpose of histogram

Answer:
 To visualize the distribution of numerical data.
 Shows the frequency of data within specific intervals (bins).
 Helps identify patterns, such as skewness or modality.

Scheme of Evaluation
Part – B
5 X 13 = 65
15. b) .

import matplotlib.pyplot as plt

import numpy as np
# Generate sample data for three groups
np.random.seed(42)
# Group 1
weights_group1 = np.random.uniform(56, 64, 20)
heights_group1 = np.random.uniform(120, 180, 20)
# Group 2
weights_group2 = np.random.uniform(60, 68, 20)
heights_group2 = np.random.uniform(140, 200, 20)
# Group 3
weights_group3 = np.random.uniform(66, 72, 20)
heights_group3 = np.random.uniform(160, 240, 20)
# Plotting the scatter plot
plt.figure(figsize=(8, 6))
# Group 1
plt.scatter(weights_group1, heights_group1, label='Group 1', color='blue', alpha=0.7)
# Group 2
plt.scatter(weights_group2, heights_group2, label='Group 2', color='green', alpha=0.7)
# Group 3
plt.scatter(weights_group3, heights_group3, label='Group 3', color='red', alpha=0.7)
# Adding labels, title, and legend
plt.title("Group wise Weight vs Height scatter plot")
plt.xlabel("weight")
plt.ylabel("height")
plt.legend()
plt.grid(True)
# Show plot
plt.show()

Part – C
1 X 15 = 15
16. a) You have been provided with a CSV file named "sales_data.csv" that contains
sales data for acompany. The file has the following columns: "Date", "Product",
"Quantity", and" Revenue". Your task is to load the data into a pandas Data Frame
and perform the following analysis.
Each 3 marks
i. Calculate the total revenue generated by the company.
ii. Find the product that generated the highest revenue.
iii. Calculate the average quantity sold per day.
iv. Group the data by month and calculate the total revenue for each month.
v. Plot a line graph showing the monthly revenue over time.
Answer

Python program
import pandas as pd
import matplotlib.pyplot as plt

# Load the CSV file into a DataFrame

df = pd.read_csv("sales_data.csv")

# Ensure 'Date' column is in datetime format

df['Date'] = pd.to_datetime(df['Date'])

# i. Calculate the total revenue generated by the company

total_revenue = df['Revenue'].sum()
print(f"Total Revenue: {total_revenue}")

# ii. Find the product that generated the highest revenue

highest_revenue_product = df.groupby('Product')['Revenue'].sum().idxmax()
print(f"Product with highest revenue: {highest_revenue_product}")

# iii. Calculate the average quantity sold per day

avg_quantity_per_day = df.groupby('Date')['Quantity'].sum().mean()
print(f"Average quantity sold per day: {avg_quantity_per_day}")

# iv. Group the data by month and calculate the total revenue for each month
df['Month'] = df['Date'].dt.to_period('M') # Group by month
monthly_revenue = df.groupby('Month')['Revenue'].sum()

# v. Plot a line graph showing the monthly revenue over time

plt.figure(figsize=(10, 6))
monthly_revenue.plot(kind='line', marker='o')
plt.title('Monthly Revenue Over Time')
plt.xlabel('Month')
plt.ylabel('Total Revenue')
plt.grid(True)
plt.show()

OR
b) Develop an example for contour plot,histogram,3D plotting and line plot for
Matplotlib.
Answer

import matplotlib.pyplot as plt

from mpl_toolkits.mplot3d import Axes3D
import numpy as np

# Prepare a grid and data for Contour Plot and 3D Plot

x = np.linspace(-5, 5, 50)
y = np.linspace(-5, 5, 50)
X, Y = np.meshgrid(x, y)
Z = np.sin(np.sqrt(X**2 + Y**2))

# Random data for Histogram

data = np.random.randn(1000)

# Data for Line Plot

x_line = np.linspace(0, 10, 100)
y_line = np.sin(x_line)

# Create a figure with 4 subplots

fig = plt.figure(figsize=(14, 10))

# 1. Contour Plot
ax1 = fig.add_subplot(2, 2, 1)
contour = ax1.contour(X, Y, Z, levels=10, cmap='viridis')
fig.colorbar(contour, ax=ax1)
ax1.set_title('Contour Plot')
ax1.set_xlabel('X-axis')
ax1.set_ylabel('Y-axis')

# 2. Histogram
ax2 = fig.add_subplot(2, 2, 2)
ax2.hist(data, bins=30, color='blue', alpha=0.7, edgecolor='black')
ax2.set_title('Histogram')
ax2.set_xlabel('Data')
ax2.set_ylabel('Frequency')
# 3. 3D Plot
ax3 = fig.add_subplot(2, 2, 3, projection='3d')
ax3.plot_surface(X, Y, Z, cmap='viridis', edgecolor='none')
ax3.set_title('3D Surface Plot')
ax3.set_xlabel('X-axis')
ax3.set_ylabel('Y-axis')
ax3.set_zlabel('Z-axis')

# 4. Line Plot
ax4 = fig.add_subplot(2, 2, 4)
ax4.plot(x_line, y_line, label='sin(x)', color='red', linewidth=2)
ax4.set_title('Line Plot')
ax4.set_xlabel('X-axis')
ax4.set_ylabel('Y-axis')
ax4.legend()
ax4.grid(True)

# Adjust layout and show the plots

plt.tight_layout()
plt.show()
Course In-Charge HoD

PP DWDM 4 5
No ratings yet
PP DWDM 4 5
26 pages
Data Visualization with Matplotlib
No ratings yet
Data Visualization with Matplotlib
50 pages
FDS Lab 1 Manuel .1..1new
No ratings yet
FDS Lab 1 Manuel .1..1new
38 pages
VIP Question Bank For DPV For Theory Exam
No ratings yet
VIP Question Bank For DPV For Theory Exam
6 pages
Working with Pandas and NumPy in Python
No ratings yet
Working with Pandas and NumPy in Python
34 pages
23bet10114 Naman Gupta Assignment-1
No ratings yet
23bet10114 Naman Gupta Assignment-1
17 pages
Fds QB
No ratings yet
Fds QB
6 pages
Matplotlib: Essential 2D Plotting Guide
No ratings yet
Matplotlib: Essential 2D Plotting Guide
10 pages
Data Visualization
No ratings yet
Data Visualization
10 pages
Types of Data Plots and Visualization
No ratings yet
Types of Data Plots and Visualization
17 pages
Exp 2 SDK Ok
No ratings yet
Exp 2 SDK Ok
18 pages
Key Concepts in Informatics Practices
No ratings yet
Key Concepts in Informatics Practices
5 pages
Prac 2
No ratings yet
Prac 2
11 pages
Graphs Using Matplotlib
No ratings yet
Graphs Using Matplotlib
23 pages
Machine Learning: Data Preparation Guide
No ratings yet
Machine Learning: Data Preparation Guide
30 pages
EXP1-siddhant Gupta (23 - SE - 148)
No ratings yet
EXP1-siddhant Gupta (23 - SE - 148)
17 pages
Prac 2
No ratings yet
Prac 2
11 pages
DS3 1
No ratings yet
DS3 1
8 pages
AD3411
No ratings yet
AD3411
28 pages
EX-02-Data Manipulation Pandas Matplot
No ratings yet
EX-02-Data Manipulation Pandas Matplot
9 pages
AI & Data Science Lab Record
No ratings yet
AI & Data Science Lab Record
28 pages
ML3 Data Analysis
No ratings yet
ML3 Data Analysis
80 pages
Understanding Regression and Data Visualization
No ratings yet
Understanding Regression and Data Visualization
3 pages
Data Handling in Data Science
No ratings yet
Data Handling in Data Science
76 pages
Vanshika Goyal Gec Practicals
No ratings yet
Vanshika Goyal Gec Practicals
31 pages
Gec Practicals
No ratings yet
Gec Practicals
31 pages
CS-3361-Data-science-lab Manual
No ratings yet
CS-3361-Data-science-lab Manual
36 pages
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
No ratings yet
Content From Jose Portilla's Udemy Course Learning Python For Data Analysis and Visualization Notes by Michael Brothers, Available On
13 pages
Data Science Exam Answer Key 2024
No ratings yet
Data Science Exam Answer Key 2024
50 pages
Eda Lab Assignment2
No ratings yet
Eda Lab Assignment2
10 pages
Data Science Concepts and Techniques
No ratings yet
Data Science Concepts and Techniques
53 pages
Machine Learning Project Roadmap
No ratings yet
Machine Learning Project Roadmap
4 pages
NumPy and Pandas Data Manipulation Guide
No ratings yet
NumPy and Pandas Data Manipulation Guide
11 pages
Dsa Lab Record (Ai&Ds)
No ratings yet
Dsa Lab Record (Ai&Ds)
34 pages
Introduction to Pandas and NumPy
No ratings yet
Introduction to Pandas and NumPy
11 pages
TD5Numpy Pandas and Matplotlib
No ratings yet
TD5Numpy Pandas and Matplotlib
5 pages
Data Analysis Lab: Python & Visualization
No ratings yet
Data Analysis Lab: Python & Visualization
11 pages
Lab Record Dev
No ratings yet
Lab Record Dev
20 pages
Comprehensive Data Visualization With Matplotlib and Seaborn
No ratings yet
Comprehensive Data Visualization With Matplotlib and Seaborn
40 pages
Data Analysis
No ratings yet
Data Analysis
20 pages
Chapter - 4
No ratings yet
Chapter - 4
4 pages
Python Basics for Data Science
No ratings yet
Python Basics for Data Science
30 pages
Grade 10 AI Practicals DATA SCIENCE-Solution
100% (1)
Grade 10 AI Practicals DATA SCIENCE-Solution
6 pages
Ad3301 Apr May 2024 Answer Key
No ratings yet
Ad3301 Apr May 2024 Answer Key
31 pages
Exp 12 and 15
No ratings yet
Exp 12 and 15
4 pages
EDA Exp 2 Outout
No ratings yet
EDA Exp 2 Outout
7 pages
DSF Lab
No ratings yet
DSF Lab
14 pages
Dev Record Aids
No ratings yet
Dev Record Aids
24 pages
Data Science Exam: Python & Visualization
No ratings yet
Data Science Exam: Python & Visualization
3 pages
Matplotlib Implementation in Python
No ratings yet
Matplotlib Implementation in Python
20 pages
EDA Code Snippets for Pandas & NumPy
No ratings yet
EDA Code Snippets for Pandas & NumPy
17 pages
Unit 5
No ratings yet
Unit 5
25 pages
ML (Sudhanshu)
No ratings yet
ML (Sudhanshu)
24 pages
Fds SLOT 2
No ratings yet
Fds SLOT 2
12 pages
Mvda - Question Bank
No ratings yet
Mvda - Question Bank
14 pages
Data Analysis Practical
No ratings yet
Data Analysis Practical
13 pages
Fundamentals of Data Science Students
No ratings yet
Fundamentals of Data Science Students
52 pages
Data Visualization Techniques Guide
No ratings yet
Data Visualization Techniques Guide
48 pages
EX - NO4 - Study Point
No ratings yet
EX - NO4 - Study Point
37 pages
C Program Output Examples and Explanations
No ratings yet
C Program Output Examples and Explanations
7 pages
C Pointers for Programming Students
No ratings yet
C Pointers for Programming Students
8 pages
CS6503 Theory of Computation Question Paper Nov Dec 2017
No ratings yet
CS6503 Theory of Computation Question Paper Nov Dec 2017
3 pages
Procedure For Hydro Testing
No ratings yet
Procedure For Hydro Testing
3 pages
Equilibrium of Force Systems Explained
No ratings yet
Equilibrium of Force Systems Explained
9 pages
All-Terrain Carrier for Extreme Conditions
No ratings yet
All-Terrain Carrier for Extreme Conditions
8 pages
English 3 Q2 - FINAL
No ratings yet
English 3 Q2 - FINAL
5 pages
The Next Normal The Future of Fashion
No ratings yet
The Next Normal The Future of Fashion
17 pages
Dominar 400 Spare Parts Catalogue
No ratings yet
Dominar 400 Spare Parts Catalogue
83 pages
Seismic Upgrade of Existing Buildings With Fluid Viscous Dampers: Design Methodologies and Case Study
No ratings yet
Seismic Upgrade of Existing Buildings With Fluid Viscous Dampers: Design Methodologies and Case Study
12 pages
60F 70B 90A Manual Yamaha
No ratings yet
60F 70B 90A Manual Yamaha
215 pages
Biotechnology Basics for Students
No ratings yet
Biotechnology Basics for Students
2 pages
1460 Load Indication System Manual
No ratings yet
1460 Load Indication System Manual
26 pages
Model Exam Dpco
No ratings yet
Model Exam Dpco
2 pages
Tooth Wear
No ratings yet
Tooth Wear
17 pages
Plumbing Tools and Their Uses
No ratings yet
Plumbing Tools and Their Uses
4 pages
Water Crisis
No ratings yet
Water Crisis
6 pages
Pure Mathematics P1 Exam Paper
No ratings yet
Pure Mathematics P1 Exam Paper
32 pages
MA3151 Matrices and Calculus Reg 2021 Jan 2022.
No ratings yet
MA3151 Matrices and Calculus Reg 2021 Jan 2022.
5 pages
3D Printing Safety Guide
No ratings yet
3D Printing Safety Guide
9 pages
Aircraft Component Reliability Issues
No ratings yet
Aircraft Component Reliability Issues
2 pages
Lesson 6 Interpretation of Batch Reaction Data
No ratings yet
Lesson 6 Interpretation of Batch Reaction Data
39 pages
Boeco Catalog 1 en
No ratings yet
Boeco Catalog 1 en
162 pages
Ed Super Imp
No ratings yet
Ed Super Imp
2 pages
Brusilica K. Jung
No ratings yet
Brusilica K. Jung
6 pages
Homework Assignment #1: Boost Converter Simulation
50% (18)
Homework Assignment #1: Boost Converter Simulation
2 pages
Djibouti To Ethiopia TC 20250618
No ratings yet
Djibouti To Ethiopia TC 20250618
16 pages
Philips Lamp Lumens Data
100% (1)
Philips Lamp Lumens Data
26 pages
AHS 4 Rolls Hydraulic Plate Bending Machine Operation and Maintenance Manual
100% (5)
AHS 4 Rolls Hydraulic Plate Bending Machine Operation and Maintenance Manual
90 pages
Article Beginning Defoggers
100% (1)
Article Beginning Defoggers
23 pages
Coral Catalogue
No ratings yet
Coral Catalogue
13 pages
Electromagnetic Induction Basics
No ratings yet
Electromagnetic Induction Basics
33 pages
Power Pilates Class Format
67% (3)
Power Pilates Class Format
2 pages

Data Science Exam Answer Key 2024

Uploaded by

Data Science Exam Answer Key 2024

Uploaded by

Q.

Course Code: CS3352 Course Name: Foundations of Data Science

2. Question: Define Data Warehousing.

3. Question: Given the following data set: 5,7,8,10,12,14,15,18,20.Calculate the

4. Question: Apply the formula to convert Z score to original score

6. Question: Identify the properties of correlation coefficient.

7. Question: Name some NumPy Array attributes.

8. Question: Write a comment to create two-dimensional array?

# Create a 2D NumPy array using np.array()

10. Question: State the purpose of histogram

import matplotlib.pyplot as plt

# Load the CSV file into a DataFrame

# Ensure 'Date' column is in datetime format

# i. Calculate the total revenue generated by the company

# ii. Find the product that generated the highest revenue

# iii. Calculate the average quantity sold per day

# v. Plot a line graph showing the monthly revenue over time

import matplotlib.pyplot as plt

# Prepare a grid and data for Contour Plot and 3D Plot

# Random data for Histogram

# Data for Line Plot

# Create a figure with 4 subplots

# Adjust layout and show the plots

You might also like