106 Data Science
DATA SCIENCE
Time allowed : 2 hours Maximum Marks : 50
General Instructions :
(i) Please read the instructions carefully.
(ii) This question paper consists of 21 questions in two sections : Section A
and Section B.
(iii) Section A has Objective Type Questions, whereas Section B contains
Subjective Type Questions.
(iv) Out of the given (5 + 16 =) 21 questions, the candidate has to answer
(5 + 10 =) 15 questions in the allotted (maximum) time of 2 hours.
(v) All questions of a particular section must be attempted in the correct
order.
SECTION A
(Objective Type Questions) (24 marks)
(iii) Managing ___________ is about making a plan to be able to cope
effectively with daily pressures and to strike a balance between
life, work, relationships, relaxation, and fun.
(A) Weight
(B) Stress
(C) Beliefs
(D) Interests
(v) What is the keyboard shortcut for copying selected text or files in
most operating systems ?
(A) Ctrl + X (B) Ctrl + Z
(C) Ctrl + V (D) Ctrl + C
(i) In data analysis, why should you create subsets of your data ?
(A) To increase the size of your dataset
(B) To make the data easier to analyse
(C) To add extra information in your dataset
(D) To add noise to the data
(A)
(B)
(C)
(D)
p_id c_id
1 3
2 1
3 3
1
3
(ii)
income data includes a few extremely high values (outliers) due to
the presence of some wealthy individuals. In this scenario, which
form of central tendency would be more appropriate to describe the
(v) If you have a dataset with 100 data points, what data point defines
the first quartile (Q1) ?
(A) The 25th percentile
(B) The 50th percentile
(C) The 75th percentile
(D) The 100th percentile
(ii) You have a dataset of ages for a group of 5 people. The ages are as
follows: 25, 32, 29, 35 and 27. What is the median age for this
dataset ?
(A) 27 (B) 28
(C) 29 (D) 30
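For reference, a minimal Python sketch of how the median of an odd-sized list of values can be found: sort the values and take the middle one. The variable names are illustrative assumptions and not part of the question paper.

```python
from statistics import median

ages = [25, 32, 29, 35, 27]  # the five ages listed in the question

# median() sorts internally; sorted() is shown only to make the middle value visible
ordered = sorted(ages)
middle_value = median(ages)

print("Sorted ages:", ordered)
print("Median age:", middle_value)
```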
(iii) You have a bag with two red marbles and two green marbles. If
you randomly select one marble from the bag without looking,
what is the probability that it will be a red marble ?
(A) 0·50 (B) 0·25
(C) 1·0 (D) 0·75
(vi) Assertion (A) : Privacy does not always mean confidentiality of data.
Reason (R) : Private data may need to be audited based on the
relevant requirements.
(A) Both (A) and (R) are correct and (R) is the correct
explanation of (A).
(B) Both (A) and (R) are correct, but (R) is not the correct
explanation of (A).
(C) (A) is correct, but (R) is incorrect.
(D) (R) is correct, but (A) is incorrect.
SECTION B
(Subjective Type Questions) (26 marks)
Answer any 3 out of the given 5 questions on Employability Skills. Answer each
question in 20 – 30 words. 3 × 2 = 6
8. Explain any two key strategies or initiatives that can contribute towards
achieving Sustainable Development Goals (SDGs).
10. What does the acronym SMART stand for in the context of goal setting ?
Why is goal setting an essential factor in your personal life ?
11. Discuss any two ways of protecting confidential data that is stored in
digital form.
17. What is a two-way frequency table ? Explain its features with a suitable
example.
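For reference, a two-way frequency table counts how often each combination of two categorical variables occurs, usually with row totals, column totals and a grand total. The sketch below is only an illustration: the survey responses and the names `responses`, `table`, `row_totals` and `col_totals` are hypothetical and not taken from the question paper.

```python
from collections import Counter

# Hypothetical survey responses as (grade, preferred activity) pairs.
# These values are made up purely for illustration.
responses = [
    ("Grade 11", "Sports"), ("Grade 11", "Music"), ("Grade 11", "Sports"),
    ("Grade 12", "Music"), ("Grade 12", "Music"), ("Grade 12", "Sports"),
]

counts = Counter(responses)                   # joint frequency of each pair
rows = sorted({grade for grade, _ in responses})
cols = sorted({activity for _, activity in responses})

# Joint frequencies arranged as rows (grade) by columns (activity)
table = {r: {c: counts[(r, c)] for c in cols} for r in rows}

# Marginal frequencies: row totals, column totals and the grand total
row_totals = {r: sum(table[r].values()) for r in rows}
col_totals = {c: sum(table[r][c] for r in rows) for c in cols}
grand_total = sum(row_totals.values())

for r in rows:
    print(r, table[r], "row total:", row_totals[r])
print("Column totals:", col_totals, "grand total:", grand_total)
```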
18. Explain the following components of the Statistical Problem Solving
Process:
(a) Analyse the data
(b) Interpret the results
19. Explain Central Limit Theorem. Give any two real world scenarios in
which it is used.
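For reference, the Central Limit Theorem can be illustrated with a small simulation: means of repeated samples from a skewed population cluster around the population mean, with a spread close to the population standard deviation divided by the square root of the sample size. This is a sketch under assumptions; the exponential population, the sample size and the number of samples are illustrative choices, not part of the question paper.

```python
import random
from math import sqrt
from statistics import mean, stdev

random.seed(0)  # fixed seed so the run is repeatable

SAMPLE_SIZE = 50     # observations per sample (assumed value)
NUM_SAMPLES = 2000   # number of repeated samples (assumed value)

# The population is exponential with rate 1, which is clearly skewed;
# its population mean and standard deviation are both 1.
sample_means = [
    mean([random.expovariate(1.0) for _ in range(SAMPLE_SIZE)])
    for _ in range(NUM_SAMPLES)
]

mean_of_means = mean(sample_means)
spread_of_means = stdev(sample_means)
expected_spread = 1.0 / sqrt(SAMPLE_SIZE)  # sigma / sqrt(n)

print("Mean of sample means:", round(mean_of_means, 3))
print("Spread of sample means:", round(spread_of_means, 3))
print("Spread predicted by the theorem:", round(expected_spread, 3))
```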
20. What is z-score in data science ? Write the formula to calculate z-score.
Also explain the importance of z-score in data science.
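For reference, the z-score of a value x is z = (x - mean) / standard deviation, which expresses how many standard deviations x lies above or below the mean. The sketch below is a minimal illustration that assumes the population standard deviation is wanted; the function name `z_scores` and the marks are hypothetical.

```python
from statistics import mean, pstdev

def z_scores(values):
    """Return the z-score of every value, using the population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(x - mu) / sigma for x in values]

marks = [40, 50, 60, 70, 80]  # hypothetical marks, made up for illustration
scores = z_scores(marks)

for mark, z in zip(marks, scores):
    print(mark, round(z, 3))
```

A value equal to the mean has a z-score of 0, and values far from the mean (commonly beyond about 3 in absolute value) are often examined as possible outliers.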
21. You are a data analyst working for a healthcare organisation. Your team
is responsible for analysing patient data to improve healthcare services
and outcomes. Ethical considerations are crucial in handling this
sensitive information. Recently, you received a dataset containing patient
records, including medical history, personal information, and treatment
details.