Statistics And Probability Exam Quiz!
1.
For Pearson’s correlation, if X increases Y increases, and when X
decreases Y you don’t know. Pearson’s r should be close to which of
the below values?
o A.
R=-1
o B.
R=1
o C.
R=0
o D.
I can't Know
Correct Answer
C. R=0
Explanation
Pearson's correlation coefficient (r) measures the strength and direction of the
linear relationship between two variables. In this case, if X increases and Y
increases, it suggests a positive linear relationship. However, when X decreases
and Y, we don't have enough information to determine the relationship. Therefore,
Pearson's r should be close to 0, indicating a weak or no linear relationship
between the variables.
Rate this question:
2
0
2.
Suppose all salaries in a company are normally distributed, with a
mean of $70,000 and a standard deviation of $10,000. If all salaries
are doubled. What is the new mean and standard deviation?
o A.
Mean = 140,000 and std = $20,000
o B.
Mean = 70,000 and std = $10,000
o C.
Mean = 140,000 and std = $10,000
o D.
Mean = 120,000 and std = $15,000
Correct Answer
A. Mean = 140,000 and std = $20,000
Explanation
If all salaries in a company are doubled, it will affect both the mean and the
standard deviation. Let's calculate the new mean and standard deviation:
Original Mean (μ) = $70,000
Original Standard Deviation (σ) = $10,000
When all salaries are doubled, the new mean (μ') will be:
New Mean (μ') = 2 * Original Mean
New Mean (μ') = 2 * $70,000 = $140,000
The new standard deviation (σ') will also be affected. When salaries are multiplied
by a constant (in this case, 2), the standard deviation is also multiplied by that
constant. So:
New Standard Deviation (σ') = 2 * Original Standard Deviation
New Standard Deviation (σ') = 2 * $10,000 = $20,000
So, after doubling all the salaries, the new mean is $140,000, and the new
standard deviation is $20,000.
Rate this question:
0
1
3.
Consider a ball that is kicked by a mean of 10 feet in the right
direction and with a standard deviation of 1 foot, it is then kicked
back in the opposite direction towards where it was started by 5
feet but with a standard deviation of 0.5. What are the mean and
standard deviation of this new Gaussian distribution of the
distance?
o A.
Mean = 5, std= 0.5
o B.
Mean = 10, std= 1.5
o C.
Mean = 15, std= 1.5
o D.
Mean = 5, std= 1.118
Correct Answer
D. Mean = 5, std= 1.118
Explanation
The mean of the new Gaussian distribution is 5 because the ball is kicked back 5
feet towards where it was started. The standard deviation of the new Gaussian
distribution is 1.118 because when the ball is kicked back, the standard deviation
is added to the original standard deviation. Therefore, the standard deviation
becomes the square root of (1^2 + 0.5^2) = 1.118.
Rate this question:
1
0
4.
Covariance indicates the strength of the linear
relationship between variables.
o A.
True
o B.
False
Correct Answer
B. False
Explanation
The explanation for the given answer, False, is that covariance measures the
extent to which two variables vary together, but it does not indicate the strength
of the linear relationship between them. Covariance can be positive, negative, or
zero, indicating the direction of the relationship, but it does not provide
information about the strength or magnitude of the relationship. To measure the
strength of the linear relationship between variables, one should use the
correlation coefficient.
Rate this question:
5.
Correlation measures both the strength and direction of the non-
linear relationship between two variables.
o A.
True
o B.
False
Correct Answer
B. False
Explanation
Correlation measures the strength and direction of the linear relationship between
two variables, not the non-linear relationship.
Rate this question:
6.
IQ is distributed with a mean of 100 and a variance of 225. What is
the standard score for IQ of 130?
o A.
o B.
0.13
o C.
30
o D.
15
Correct Answer
A. 2
Explanation
The standard score, also known as the z-score, measures how many standard
deviations an individual's IQ score is above or below the mean. To calculate the z-
score, we subtract the mean from the IQ score and divide it by the standard
deviation. In this case, the standard deviation is the square root of the variance,
which is 15. Therefore, the z-score for an IQ of 130 would be (130-100)/15 = 2.
Rate this question:
2
0
7.
Is the below relationship Linear and Exact?
o A.
True
o B.
False
Correct Answer
B. False
Explanation
The given question is asking whether the relationship is linear and exact. The
answer is False. This means that the relationship is either non-linear or it is not
exact. In a linear relationship, there is a constant rate of change between the
variables, while an exact relationship means that there is no error or uncertainty
in the relationship. Therefore, if the answer is False, it indicates that the
relationship is either non-linear or there is some degree of error or uncertainty
present.
Rate this question:
1
0
8.
Given a Summer/Winter classification problem: Winter is 165 days
and Summer is 200 days. The temperature is uniformly distributed
between 5 - 25 degrees in Winter and 22 - 24 in Summer. What is
the classification of the day that temperature is 23 degrees?
o A.
Summer
o B.
Winter
Correct Answer
A. Summer
Explanation
To determine the classification of a day with a temperature of 23 degrees, we
need to consider the temperature ranges for both Winter and Summer.
In Winter, the temperature ranges from 5 to 25 degrees.
In Summer, the temperature ranges from 22 to 24 degrees.
Since 23 degrees falls within the range of 22 to 24 degrees, the temperature of 23
degrees is within the Summer temperature range. Therefore, the classification of
the day with a temperature of 23 degrees is "Summer."
Rate this question:
0
1
9.
You are given a revolver with six slots. There are two adjacent
bullets. You have to shoot twice and are given the chance to rotate
the cylinder randomly in-between. How do you maximize your
chance of survival?
o A.
Rotate cylinder
o B.
Do not rotate cylinder
o C.
It doesn't matter, as survival chance is the same in both cases
o D.
The question doesn't give all information required to answer the question
Correct Answer
B. Do not rotate cylinder
Explanation
By not rotating the cylinder, you ensure that the position of the bullets remains
the same. This means that when you shoot the first time, you have a 1 in 6
chance of hitting a bullet. However, since the second shot is also required, the
probability of hitting a bullet on the second shot is also 1 in 6. By not rotating the
cylinder, you maintain this probability throughout both shots, giving you the
maximum chance of survival.
Rate this question:
10.
Set S consists of the numbers 4, 10, 12, 7, 19, 10, 5, and x. For what
value of x will the mode, the median, and the mean all be equal?
o A.
13
o B.
17
o C.
21
o D.
42
Correct Answer
A. 13
Explanation
To find the value of x that will make the mode, median, and mean all equal, we
need to first find the mode, median, and mean of the given set S. The mode is the
number that appears most frequently in the set, which is 10. The median is the
middle number when the set is arranged in ascending order, which is also 10. The
mean is the average of all the numbers in the set, which can be found by
summing all the numbers and dividing by the total count. Since the sum of the
given numbers is 77, and there are 9 numbers in total, the mean is 77/9 = 8.56
(rounded to two decimal places). Therefore, the value of x that will make the
mode, median, and mean all equal is 13, as it will make the mean equal to 10.
Rate this question:
1
0
11.
If the variance of a dataset is 50 and all data points are increased
by 100% then what will be the variance?
o A.
50
o B.
100
o C.
200
o D.
25
Correct Answer
C. 200
Explanation
When all data points in a dataset are increased by 100%, it means that each data
point is doubled. This results in a new dataset with values that are twice as large
as the original dataset. Since variance is a measure of how spread out the data
points are from the mean, doubling all the values will also double the spread.
Therefore, the new variance will be 200, which is twice the original variance of 50.
Rate this question:
1
0
12.
If you have a dataset with n observations and mean m. What will be
the new mean if you add 5 to each data point?
o A.
o B.
M+5
o C.
o D.
None of the above
Correct Answer
B. M + 5
Explanation
Adding 5 to each data point will increase the value of each observation by 5.
Since the mean is calculated by summing up all the observations and dividing by
the number of observations, adding 5 to each observation will increase the sum of
all the observations by 5 multiplied by the number of observations. Dividing this
new sum by the number of observations will give us the new mean, which is m +
5.
Rate this question:
13.
Given the following distribution Which of the following statements is
true?
o A.
Mean < Median < Mode
o B.
Median < Mode < Mean
o C.
Mode < Median < Mean
o D.
Mode < Mean < Median
Correct Answer
C. Mode < Median < Mean
Explanation
The mode is the value that appears most frequently in a distribution. The median
is the middle value when the data is arranged in ascending or descending order.
The mean is the average of all the values in the distribution. In this case, the
mode is less than the median and the median is less than the mean. Therefore,
the correct statement is "Mode < Median < Mean."
Rate this question:
14.
What is the number of observations in a dataset with variance 5 if
the sum of squared distances from the mean is 20?
o A.
o B.
o C.
20
o D.
100
Correct Answer
A. 4
Explanation
The sum of squared distances from the mean is a measure of the variance of a
dataset. In this case, the variance is given as 5 and the sum of squared distances
is given as 20. The formula for variance is the sum of squared distances divided
by the number of observations. So, if we let the number of observations be x, we
can set up the equation 20/x = 5. Solving for x, we find that x = 4. Therefore, the
number of observations in the dataset is 4.
Rate this question:
0
1
15.
Rank the below correlation coefficient from lowest to highest
coefficient.
o A.
B>A>C>D
o B.
B>C>D>A
o C.
B>C>A>D
o D.
A>B>C>D
Correct Answer
A. B > A > C > D
Explanation
The given answer is B > A > C > D. This means that the correlation coefficient for
B is the highest, followed by A, then C, and finally D. The ranking is based on the
strength of the correlation between the variables being compared. B has the
strongest correlation, A has a weaker correlation than B but stronger than C, and
C has a weaker correlation than A but stronger than D. D has the lowest
correlation coefficient among all the options.
Rate this question:
0
1
16.
Given that we have a probability of rain = 0.2 on a given day. What
is the probability of having rain at least 2 days during the week?
o A.
0.36
o B.
0.40
o C.
0.42
o D.
0.53
Correct Answer
C. 0.42
Explanation
The probability of having rain at least 2 days during the week can be calculated
by finding the probability of having rain on exactly 2 days, 3 days, 4 days, 5 days,
and 6 days, and then adding them together. Since the probability of rain on any
given day is 0.2, the probability of having rain on exactly 2 days is (0.2)^(2) *
(0.8)^(5), the probability of having rain on exactly 3 days is (0.2)^(3) * (0.8)^(4),
and so on. After calculating these probabilities and adding them together, the
result is 0.42.
Rate this question:
1
0
17.
Which statistical measurement is affected by outliers the most?
o A.
Range
o B.
Mean
o C.
Mode
o D.
Median
Correct Answer
A. Range
Explanation
The range is the statistical measurement that is affected the most by outliers.
Outliers are extreme values that are significantly different from the other data
points. Since the range is the difference between the maximum and minimum
values in a dataset, the presence of outliers can greatly impact the range. If there
are outliers, they can significantly increase or decrease the range, making it a less
reliable measure of the spread of the data.
Rate this question:
18.
You have n numbers that must sum to 10. How many degrees of
freedom are there?
o A.
o B.
N+1
o C.
N-1
o D.
N/2
Correct Answer
C. N - 1
Explanation
When you have n numbers that must sum to 10, there is only one constraint - the
sum of the numbers must be 10. This means that you have n-1 degrees of
freedom, as you can freely choose the values of n-1 numbers and the value of the
nth number will be determined by the constraint of the sum.
Rate this question:
2
0
19.
Subtracting two Gaussian Distributions results in:
o A.
Mean is subtracted and standard deviation is added
o B.
Mean is subtracted and variance is added
o C.
Mean is subtracted and standard deviation is subtracted
o D.
Mean is subtracted and variance is subtracted
Correct Answer
B. Mean is subtracted and variance is added
Explanation
When subtracting two Gaussian distributions, the mean of the resulting
distribution is obtained by subtracting the mean of the second distribution from
the mean of the first distribution. This is because the mean represents the central
tendency of the data. On the other hand, the variance of the resulting distribution
is obtained by adding the variances of the two distributions being subtracted. This
is because when subtracting random variables, the variances add up. Therefore,
the correct answer is that the mean is subtracted and the variance is added.
Rate this question:
20.
Given a bag of marbles with 8 red marbles, 4 blue marbles, and 5
green marbles. Removing marbles one at a time from the bag, what
is the likelihood of removing 4 marbles without removing a green
marble?
o A.
50.2%
o B.
38.5%
o C.
70.6%
o D.
20.8%
Correct Answer
D. 20.8%
Explanation
The likelihood of removing 4 marbles without removing a green marble can be
calculated by considering the total number of marbles and the number of green
marbles. There are a total of 8 + 4 + 5 = 17 marbles in the bag. To remove 4
marbles without removing a green marble, we can only choose from the 8 red
marbles and 4 blue marbles. So, the probability can be calculated as (8 + 4) / 17 *
(7 + 3) / 16 * (6 + 2) / 15 * (5 + 1) / 14 = 20.8%.
Rate this question:
2
1