Data Analytics 2022

The document discusses a data analytics exam containing questions on hypothesis testing, predictive analytics, data preprocessing, regression analysis and their applications in the built environment sector. The exam has three sections with a mix of theoretical and practical questions testing concepts of central tendency, hypothesis testing, predictive modeling and interpreting regression results.

Uploaded by

nikhil das

[No. of Printed Pages - 5]

REAL243                          Enrol. No. ..............

[BMCF]

END SEMESTER EXAMINATION: APRIL-MAY 2022

DATA ANALYTICS FOR BUILT ENVIRONMENT SECTOR

Time: 3 Hrs. Maximum Marks : 50

Note: Attempt questions from all sections as directed.


Use of a simple calculator and statistical tables is
allowed.

SECTION - A (20 Marks)


Attempt any four questions out of five.

Each question carries 05 marks.

1. Explain the Type I and Type II errors that may occur
during hypothesis testing.

2. What is predictive analytics, and what are its
applications in the built environment?

3. Jiffy Markets compares prices charged for identical
items in all of its food stores. Here are the prices
charged by each store for a pound of bread last week:

(In USD) 1.08, 0.98, 1.09, 1.24, 1.33,
         1.14, 1.55, 1.22, 1.05, 1.08

(a) Calculate the median price per pound. (2)
(b) Calculate the mean price per pound. (2)
(c) Which value is the better measure of the central
tendency of these data? (1)

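Parts (a) and (b) of Question 3 can be checked with a few lines of Python. This is a minimal sketch, not a model answer: it takes the ten prices as 1.08, 0.98, 1.09, 1.24, 1.33, 1.14, 1.55, 1.22, 1.05 and 1.08, where 1.14, 1.55 and the final 1.08 are our reading of poorly printed entries (an assumption).

```python
from statistics import mean, median

# Prices in USD per pound of bread, one per store.
# ASSUMPTION: 1.14, 1.55 and the final 1.08 are readings of smudged entries.
prices = [1.08, 0.98, 1.09, 1.24, 1.33, 1.14, 1.55, 1.22, 1.05, 1.08]

# With an even number of observations, the median is the average
# of the two middle values of the sorted list.
median_price = median(prices)
mean_price = mean(prices)

print(f"median = {median_price:.3f} USD")
print(f"mean   = {mean_price:.3f} USD")
```

Under these assumed values the median is about 1.115 and the mean about 1.176. The single high price of 1.55 pulls the mean upward, which is the kind of observation part (c) asks for.
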
4. How does a boxplot help in understanding the data
better?

5. Explain the term data preprocessing. How does it
support data analysis?

SECTION - B (16 Marks)
Attempt any two questions out of three.
Each question carries 08 marks.

6. With the growing popularity of online retailing, brick
and mortar stores are keenly watching their sales.
Walmart determines that customers spend an average
of $130 per trip. Target, another retail chain, would
like to test if its customers spend the same or more.
They take a survey of 25 shoppers and obtain a sample
mean of $135.25.
Assuming that spending follows a normal distribution,
the population standard deviation is $10.50.

(a) Specify the null and alternate hypotheses to be
used to test if average spending at Target is more
than $130. (2)

(b) Calculate the value of the test statistic and state
the decision rule used to accept or reject the null
hypothesis. (6)
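
Because the population standard deviation is given, Question 6 is a one-sample z-test with a right-tailed alternative (H0: mean spending is at most $130; H1: mean spending is more than $130). The sketch below computes the test statistic and a decision rule. The paper does not state a significance level, so the 5% level used here is an assumption.

```python
from math import sqrt
from statistics import NormalDist

mu0 = 130.0     # hypothesised mean spend per trip (USD)
x_bar = 135.25  # sample mean from the survey (USD)
sigma = 10.50   # known population standard deviation (USD)
n = 25          # number of shoppers surveyed
alpha = 0.05    # ASSUMPTION: significance level is not given in the question

# z = (sample mean - hypothesised mean) / standard error of the mean
z = (x_bar - mu0) / (sigma / sqrt(n))

# Right-tailed test: reject H0 when z exceeds the upper-alpha critical value.
z_crit = NormalDist().inv_cdf(1 - alpha)
p_value = 1 - NormalDist().cdf(z)
reject_h0 = z > z_crit

print(f"z = {z:.2f}, critical value = {z_crit:.3f}, p-value = {p_value:.4f}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

The standard error is 10.50 / 5 = 2.10, so z = 5.25 / 2.10 = 2.5, which exceeds the 5% one-tailed critical value of about 1.645.
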

7. An I.Q. test was administered to 10 men before and
after they were trained. The results are given below:

I.Q. Before Training  167 124 157 155 163 154 156 168 133 143
I.Q. After Training   170 138 158 158 156 167 168 172 142 138

Test whether there is any change in I.Q. after the
training programme. (8)
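
Since the same 10 men are measured twice, Question 7 can be treated as a paired t-test on the after-minus-before differences. The sketch below is one way to carry it out. The two-tailed 5% significance level is an assumption (the question states none), and the critical value 2.262 for 9 degrees of freedom is taken from a standard t table rather than computed.

```python
from math import sqrt
from statistics import mean, stdev

before = [167, 124, 157, 155, 163, 154, 156, 168, 133, 143]
after = [170, 138, 158, 158, 156, 167, 168, 172, 142, 138]

# Paired design: work with the per-person differences.
diffs = [a - b for a, b in zip(after, before)]
n = len(diffs)

d_bar = mean(diffs)   # mean difference
s_d = stdev(diffs)    # sample standard deviation (n - 1 in the denominator)
t_stat = d_bar / (s_d / sqrt(n))

# ASSUMPTION: two-tailed test at the 5% level; t(0.025, df=9) from a t table.
t_crit = 2.262
reject_h0 = abs(t_stat) > t_crit

print(f"mean difference = {d_bar:.2f}, s_d = {s_d:.3f}, t = {t_stat:.3f}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

With these data the mean difference is 4.7 and t is roughly 2.04, which falls short of 2.262, so at the assumed 5% level the data do not show a significant change in I.Q.
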

8. Calci is a pharma company selling calcium supplements
from hyper stores. The sales analyst uses independent
variables of population and income in various cities
to model and forecast the sales of calcium
supplements.

          Coefficient   Standard Error   T-Statistic

             10.35          4.02            2.57
              8.47          2.71            3.21
              7.62          6.63            1.15

(a) Write down the modelled equation which can be
used to predict the sales of calcium supplements. (2)

(b) Interpret the coefficients of the independent
variables. Are these coefficients significant at a 5%
significance level? (4)

(c) Predict the sales in a city with a population of 1.5
million and an average income of $44,000. (2)

SECTION - C (14 Marks)
(Compulsory)

9. The overhead costs incurred during the production
process are estimated by the cost accountants on the
basis of the level of production. At Standard Car
Company they have collected information on overhead
expenses and units produced at different plants, and
want to estimate a regression equation to predict future
overhead. The table below highlights the data collected.

Overhead Expense
(Rs. Million)     191 170 272 155 280 173 234 116 153 178

Production Units   40  42  53  35  56  39  48  30  37  40

(a) What are the dependent and independent variables
in the dataset? (2)

(b) Develop a regression equation for the cost
accountants. (8)

(c) Predict what the overhead expenses will be if units
produced are 100. (2)

(d) What are the assumptions you consider while
developing a linear regression model? (2)
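
For Question 9, a simple least-squares line with overhead expense as the dependent variable and production units as the independent variable can be fitted by hand or with the short sketch below. It assumes the eighth overhead figure reads 116 (our reading of a poorly printed entry) and uses the textbook formulas slope = Sxy / Sxx and intercept = y_bar - slope * x_bar.

```python
from statistics import mean

# Independent variable: production units at each plant.
units = [40, 42, 53, 35, 56, 39, 48, 30, 37, 40]
# Dependent variable: overhead expense (Rs. million).
# ASSUMPTION: the eighth value is read as 116.
overhead = [191, 170, 272, 155, 280, 173, 234, 116, 153, 178]

x_bar = mean(units)
y_bar = mean(overhead)

# Sums of cross-products and squares about the means.
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(units, overhead))
s_xx = sum((x - x_bar) ** 2 for x in units)

slope = s_xy / s_xx
intercept = y_bar - slope * x_bar

# Part (c): predicted overhead when 100 units are produced.
predicted_100 = intercept + slope * 100

print(f"overhead = {intercept:.3f} + {slope:.4f} * units")
print(f"predicted overhead at 100 units = {predicted_100:.2f}")
```

Under these assumptions the fitted line is approximately overhead = -80.44 + 6.49 * units, giving about 568.7 at 100 units. Note that 100 units lies well outside the observed range of 30 to 56, so this prediction is an extrapolation and should be read with caution.
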
