Week-7_GA_Solution_1
Week-7_GA_Solution_1
1. Suppose X1 , X2 , . . . , Xn are n iid random variables with mean µ and variance σ 2 = 16.
Using Chebyshev’s inequality, find the minimum value of n such that
Answer: 160
Solution:
σ2
E[X̄] = µ, Var[X̄] =
n
Note that X̄ = X1 + X2 + . . . + Xn
By Chebyshev’s inequality, we have
σ2
P X −µ <δ ≥1− 2
nδ
And given
Solution :
Since, the empirical distribution is the discrete distribution with PMF
#(Xi = t)
p(t) =
n
#(Xi = t) denotes the number of times t occurs in the samples.
Therefore,
3
p(0) = = 0.3
10
And
7
p(1) = = 0.7
10
Hence, option (B) is correct.
(b) Compute the sample mean. Enter the answer correct to one decimal place.
Answer: 0.7
Solution:
X 1 + X2 + . . . + X 1 0
X̄ =
n
1+0+1+0+1+1+0+1+1+1
=
10
7
=
10
= 0.7
3. Let X1 , X2 , X3 are three independent and identically distributed random variables with
mean µ and variance σ 2 . Given below are 3 different formulations of sample mean.
(Observe that E[A] = E[B] = E[C]).
X1 + X 2 + X3
A=
3
Solution:
Let X1 , X2 , X3 ∼ i.i.d. X, where E[X] = µ, Var(X) = σ 2
X1 + X2 + X3
Var(A) = Var
3
1
= (Var[X1 ] + Var[X2 ] + Var[X3 ])
9
1 σ2
= (3σ 2 ) =
9 3
σ2
Var[X] =
n
52
=
25
=1
Course: Statistics for Data Science - II Page 4 of 11
X
5. A fair die is rolled 100 times. Let denote the number of times six is obtained. Find
100
X 1
a bound for the probability that differs from by less than 0.1 using the weak law
100 6
of large numbers.
5
(a) at least
36
31
(b) at least
36
5
(c) at most
36
31
(d) at most
36
Solution:
X denotes the number of times six is obtained on rolling a fair die 100 times. Let
X1 , X2 , . . . , X100 be 100 i.i.d. samples such that
(
1 if six appears on rolling a fair die
Xi =
0 otherwise
1 5
E[Xi ] = µ = and Var(Xi ) = σ 2 =
6 36
Notice that X = X1 + X2 + X3 + · · · + X100 .
X 1
To find: Bound on P − < 0.1 . By the weak law of large numbers, we have
100 6
σ2
P (|X − µ| < δ) ≥ 1 − 2
nδ
5
X 1 36
⇒P − < 0.1 ≥ 1 −
100 6 100 × 0.01
X 1 5 31
⇒P − < 0.1 ≥ 1 − =
100 6 36 36
Solution:
σ2 4
E[X] = µ = 20 and Var[X] = = = 0.8.
n 5
" n
#
1 X
E[S 2 ] = E (Xi − X)2
n i=1
" n #
1 X
= E (Xi − X)2
n
" i=1
n
#
1 X 2
= E (Xi2 + X − 2Xi X)
n
" i=1
n
#
1 X 2
= E Xi2 + nX − 2nXX
n
" i=1
n
#
1 X 2 2
= E Xi2 + nX − nX
n
" i=1
n
#
1 X
= E Xi2
n
" ni=1 #
1 X 2
= E[Xi2 ] − nE[X ]
n i=1
" n 2 #
1 X 2 σ
= (σ + µ2 ) − n + µ2
n i=1 n
1
(nσ 2 + nµ2 ) − (σ 2 + nµ2 )
=
n
(n − 1)σ 2
=
n
4
Here, n = 5, therefore, E[S 2 ] = × 4 = 3.2.
5
7. Suppose Xi ∼ Normal 0, i12 , where i = 1, 2, . . . , 9 and X1 , X2 , . . . , X9 are independent
of each other. Let Y be a random variable defined as Y = 9i=1 iXi . Find the variance
P
of Y .
Solution
Course: Statistics for Data Science - II Page 6 of 11
9
!
X
Var(Y ) = Var iXi
i=1
= Var(X1 + 2X2 + 3X3 + · · · + 9X9 )
= Var(X1 ) + Var(2X2 ) + · · · + Var(9X9 )
= Var(X1 ) + 4Var(X2 ) + · · · + 81Var(X9 )
1 1 1
= 2 + 4 2 + · · · + 81 2
1 2 9
=9
Solution:
P ∼ Uniform[0, 12]
0 + 12 (12 − 0)2 144
E[P ] = µ = = 6, Var(P ) = σ 2 = = = 12
2 12 12
σ2
P (|X − µ| < δ) ≥ 1 −
nδ 2
12 73
P |X − µ| < 3 ≥ 1 − = ≈ 0.9733
50 × 9 75
9. Suppose a random sample is used to estimate the proportion of voters in a city. If the
sample proportion is roughly 0.45, what sample size is necessary so that the standard
deviation of the sample proportion is 0.02?
Solution
Let the random variable X represent that the selected candidate is a voter.
Let Xi be defined as
(
1, if the selected candidate is a voter
Xi =
0, otherwise
Define an event A as A : X = 1.
It is given that P (A) = 0.45.
P (A)(1 − P (A))
We know that Var(S(A)) = .
n
Course: Statistics for Data Science - II Page 7 of 11
r
p(1 − p)
= 0.02
r n
(0.45)(0.55)
= 0.02 =⇒ n = 618.75 ≈ 619
n
10. The average life (in years) of an electronic watch follows an exponential distribution with
1
parameter . Find the lower bound on the probability that the mean life of a random
2
sample of 50 such watches falls between 1 and 3 years. Enter your answer correct to two
decimals.
Hint: Use the weak law of large numbers.
Solution
Let the random variable X represent the life of an electronic watch.
It is given that X ∼ Exp(1/2) and 50 such samples are taken.
E[X] = µ = 2, Var(X) = σ 2 = 4
To find: a lower bound on P (1 < X < 3).
By the weak law of large numbers, we have
σ2
P (|X − µ| < δ) ≥ 1 −
nδ 2
4
P |X − 2| < 1 ≥ 1 −
50 × 1
23
=
25
= 0.92
11. A university evaluates the final scores of students based on coursework and a final project.
The variance of the coursework scores is 15, and the variance of the final project scores
is 30. Coursework contributes 70% to the final evaluation, while the project contributes
30%. Assuming the scores of coursework and project are uncorrelated, what is the
variance of the final evaluation scores? Enter the answer correct to two decimal places.
Answer : 10.05 ; Range : 10.02 to 10.08
Solution
To compute the variance of the final evaluation scores, we use the formula for the variance
of a linear combination of uncorrelated random variables:
where X and Y are independent random variables, and a and b are the weights of X
and Y , respectively.
Course: Statistics for Data Science - II Page 8 of 11
In this case:
12. In a large population of students, an unknown proportion p prefers Chips over Cookies.
A survey was conducted among 200 students and 120 preferred Chips. Assume the 200
samples are i.i.d. Bernoulli (p).
Based on the given information, answer the following questions:
[(i).]What is the sample mean? Enter the answer correct to one decimal place.
Answer : 0.6 To determine the sample mean, we use the formula for the sample
mean:
Number of successes
X̄ =
Total number of samples
Given:
(b) 0.36
(c) 0.0018
(d) 0.0012
Answer : d
The variance of the sample mean is given by the formula:
p(1 − p)
Var(X̄) = ,
n
where:
0.6(1 − 0.6)
Var(X̄) = .
200
Simplify the expression:
0.6 · 0.4
Var(X̄) = .
200
0.24
Var(X̄) = .
200
Var(X̄) = 0.0012.
Thus, the variance of the sample mean is:
0.0012 .
3. If 30 more students join the survey and they all prefer Cookies, what will be the
new sample mean and variance of the sample mean?
[(a)]new sample mean = 0.24, new variance = 0.0012 new sample mean = 0.6,
new variance = 0.0012 new sample mean = 0.6522, new variance = 0.0009 new
sample mean = 0.5217, new variance = 0.00108
Answer : d
Course: Statistics for Data Science - II Page 10 of 11
If 30 more students join and they all prefer Cookies, the updated values are:
p = 0.52174,
n = 230.
0.249998
Var(X̄) = .
230
0.00108 .