Statistical Hypothesis Testing Guide
Statistical Hypothesis Testing Guide
Number of degrees of freedom is defined as the number of values in a set which may be assigned
arbitrarily.
Ex:- Let 𝑥 + 𝑦 + 𝑧 = 6. If we assign any values to two variables (say) 𝑥, 𝑦 then 𝑧 can be determined.
Therefore, number of degrees of freedom for this equation is 2.
𝑥−𝜇
Test Statistic: 𝑡 =
𝑠⁄√𝑛−1
Problem: A sample of 26 bulbs gives a mean life of 990 hours with a standard deviation of 20 hours. The
manufacturer claims that the mean life of bulbs is 1000 hours. Is the sample not upto the standard?
Alternative Hypothesis, 𝑯𝟏 : The sample is not upto the standard i.e. 𝜇 < 1000
Problem: The mean life time of a sample of 25 fluorescent bulbs produced by company is computed to be
1570 hours with a standard deviation of 120 hours. The company claims that the average life of the bulbs
produced by company is 1600 hours. Using level of significance 5%, Is the claim acceptable?
Null Hypothesis, 𝑯𝟎 : The average life of the bulbs is 1600 hours i.e. 𝜇 = 1600
K.KIRAN KUMAR 1
Alternative Hypothesis, 𝑯𝟏 : The average life of the bulbs is not 1600 hours i.e. 𝜇 ≠ 1600
Conclusion: Here |𝑡| = 1.22 < 2.06 i.e. |𝑡| < 𝑡α⁄2
Problem: A certain injection administered to each of 12 patients resulted in the following increases of
Can it be concluded that the injection will be in general, accompanied by an increase in BP.
∑(𝑥𝑖 −𝑥)2
Sample standard deviation, 𝑠 = √ 𝑛−1
116.5168
=√ = 3.2546
11
𝑥−𝜇 2.58 − 0
Test Statistic: 𝑡 = = = 2.7461
𝑠⁄√𝑛 3.2546 ⁄ √12
K.KIRAN KUMAR 2
Since, |𝑡| > 𝑡𝛼 null hypothesis, 𝐻0 is rejected.
i.e. we can conclude that the injection will be, in general accompained by an increase in BP.
70, 67, 62, 68, 61, 68, 70, 64,64, 66. Is it reasonable that the average height is greater than 64 inches?
∑(𝑥𝑖 −𝑥)2
Sample standard deviation, 𝑠 = √ 𝑛−1
(70 − 66)2 + (67 − 66)2 + (62 − 66)2 + 2(68 − 66)2 + (61 − 66)2
√ +(70 − 66)2 + 2(64 − 66)2 + (66 − 66)2
=
10 − 1
90
=√ = √10 = 3.16
9
𝑥−𝜇 66 − 64
Test Statistic: 𝑡 = = = 2.0014
𝑠⁄√𝑛 3.16 ⁄ √10
i.e. we can conclude that the average height is greater than 64 inches .
Problem: Prices of shares of a company on the different days in a month were found to be :
66, 65, 69, 70, 69, 71, 70, 63,63, 64, 68. Discuss whether the mean price of the shares in the
month is 65?
K.KIRAN KUMAR 3
Test of Significance for difference of Two Means
Problem: Two independent samples of 8 & 7 items respectively by the following data.
Sample 1 11 11 13 11 15 9 12 14
Sample 2 9 11 10 13 9 8 10 -
∑ 𝑦𝑖 9 + 11 + 10 + 13 + 9 + 8 + 10 70
𝑦= = = = 10
𝑛 7 7
2
∑(𝑦𝑗 − 𝑦) = 1 + 1 + 0 + 9 + 1 + 4 + 0 = 16
2
∑(𝑥𝑖 −𝑥)2 + ∑(𝑦𝑗 −𝑦)
Sample standard deviation, 𝑠 = √ 𝑛1 + 𝑛2 − 2
26 + 16 42
=√ = √ = √3.2308 = 1.7974 ≅ 1.8
13 13
𝑥−𝑦 12 − 10 2
Test Statistic: 𝑡 = 1 1
= = = 2.1469
𝑠√𝑛 +𝑛 1 1 15
(1.8)√ + (1.8)√
1 2 8 7 56
K.KIRAN KUMAR 4
Conclusion: Here |𝑡| = 2.1469 < 2.16 i.e. |𝑡| < 𝑡𝛼/2
Problem: To examine the hypothesis that the husbands are more than intelligent than the wives, an
investigation took a sample of 10 couples and administered them a test which measures the IQ. The
Test the hypothesis with a reasonable test at the level of significance 0.05
2
Also, ∑(𝑥𝑖 − 𝑥)2 = 1606 , ∑(𝑦𝑗 − 𝑦) = 1679.6
2
∑(𝑥𝑖 −𝑥)2 + ∑(𝑦𝑗 −𝑦)
Sample standard deviation, 𝑠 = √ 𝑛1 + 𝑛2 − 2
Null Hypothesis, 𝑯𝟎 : There is no significant difference between the IQ’s of husbands and wives.
i.e. 𝑥 = 𝑦
Alternative Hypothesis, 𝑯𝟏 : Husbands are more intelligent than the wives. i.e. 𝑥 > 𝑦
K.KIRAN KUMAR 5
Since, |𝑡| < 𝑡𝛼 null hypothesis, 𝐻0 is accepted.
i.e. There is no significant difference between the IQ’s of husbands and wives.
Problem: The following data relate to the marks obtained by 11 students in two tests, one held at
beginning of a year and the other at the end of the year after intensive coaching.
Test 1 19 23 16 24 17 18 20 18 21 19 20
Test 2 17 24 20 24 20 22 20 20 18 22 19
Solution: Here the given data relate to the marks obtained in two tests by the same set of students.
2
∑(𝑑𝑖 −𝑑)
Sample standard deviation, 𝑠𝑑 = √ 𝑛−1
9 + 0 + 9 + 1 + 4 + 9 + 1 + 1 + 16 + 4 + 4
=√
10
58
=√ = √5.8 = 2.41
10
𝑑 −1 √11
Test Statistic: 𝑡 = = =− = −1.3762
𝑠𝑑 ⁄√𝑛 2.41⁄√11 2.41
K.KIRAN KUMAR 6
i.e. The students haven’t been benefited by coaching.
Problem: Memory capacity of 10 students was tested before and after training. State whether training
was effective or not from the following data:
Before 12 14 11 8 7 10 3 0 5 6
After 15 16 10 7 5 12 10 2 3 8
F-test
∑(𝑥𝑖 − 𝑥)2
𝑆12 =
𝑆12 𝑛1 − 1
Test Statistic: 𝐹 = ∋ 𝑆12 > 𝑆22 where
𝑆22 2
∑(𝑦𝑗 − 𝑦)
𝑆22 =
𝑛2 − 1
Solution: To test whether the samples came from the same population, we have to test
1. The equality of variances using F-test
2. The equality of means using t-test
2
∑(𝑥𝑖 −𝑥)2 90 ∑(𝑦𝑗 −𝑦) 108
𝑆12 = = = 10 𝑆22 = = = 9.8181 ≅ 9.82
𝑛1 − 1 9 𝑛2 −1 11
F - test
Null Hypothesis, 𝑯𝟎 : There is no significant difference between the variances. i.e. 𝜎21 = 𝜎22
Alternative Hypothesis, 𝑯𝟏 : There is significant difference between the variances. i.e. 𝜎21 ≠ 𝜎22
K.KIRAN KUMAR 7
𝑆12 10
Test Statistic: 𝐹 = = = 1.018
𝑆22 9.82
t - test
2
∑(𝑥𝑖 −𝑥)2 + ∑(𝑦𝑗 −𝑦)
Sample standard deviation, 𝑠 = √ 𝑛1 + 𝑛2 − 2
90 + 108 198
=√ =√ = √9.9 = 3.15
20 20
𝑥−𝑦 15 − 14 1
Test Statistic: 𝑡 = 1 1
= = = 0.74
𝑠√𝑛 +𝑛 1 1 22
(3.15)√ + (3.15)√
1 2 10 12 120
Conclusion: Here |𝑡| = 0.74 < 2.09 i.e. |𝑡| < 𝑡𝛼/2
Problem: The nicotine contents in milligrams in two samples of tobacco were found to be as follows:
Sample A 24 27 26 21 25 -
Sample B 27 30 28 31 22 36
Can it be said that two samples came from same normal population?
Solution: To test whether the samples came from the same population, we have to test
1. The equality of variances using F-test
2. The equality of means using t-test
K.KIRAN KUMAR 8
∑ 𝑥𝑖 24 + 27 + 26 + 21 + 25 123
First sample mean, 𝑥 = = = = 24.6
𝑛 5 5
∑ 𝑦𝑖 27 + 30 + 28 + 31 + 22 + 36 123
Second sample mean 𝑦 = = = = 29
𝑛 6 6
2
Also, ∑(𝑥𝑖 − 𝑥)2 = 21.2 ∑(𝑦𝑗 − 𝑦) = 108
2
∑(𝑥𝑖 −𝑥)2 21.2 ∑(𝑦𝑗 −𝑦) 108
𝑆12 = = = 5.3 𝑆22 = = = 21.6
𝑛1 − 1 4 𝑛2 −1 5
F - test
Null Hypothesis, 𝑯𝟎 : There is no significant difference between the variances. i.e. 𝜎21 = 𝜎22
Alternative Hypothesis, 𝑯𝟏 : There is significant difference between the variances. i.e. 𝜎21 ≠ 𝜎22
𝑆22 21.6
Test Statistic: 𝐹 = = = 4.0755
𝑆12 5.3
t - test
2
∑(𝑥𝑖 −𝑥)2 + ∑(𝑦𝑗 −𝑦)
Sample standard deviation, 𝑠 = √ 𝑛1 + 𝑛2 − 2
K.KIRAN KUMAR 9
Conclusion: Here |𝑡| = 1.9198 < 2.26 i.e. |𝑡| < 𝑡𝛼/2
Problem: The measurements of the output of two units have given the following results. Assuming
that both samples have been obtained from the normal population at 5% level of significance, test
2
Also, ∑(𝑥𝑖 − 𝑥)2 = 13.488 ∑(𝑦𝑗 − 𝑦) = 1.84
2
∑(𝑥𝑖 −𝑥)2 13.488 ∑(𝑦𝑗 −𝑦) 1.84
𝑆12 = = = 3.372 𝑆22 = = = 0.46
𝑛1 − 1 4 𝑛2 −1 4
Null Hypothesis, 𝑯𝟎 : There is no significant difference between the variances. i.e. 𝜎21 = 𝜎22
Alternative Hypothesis, 𝑯𝟏 : There is significant difference between the variances. i.e. 𝜎21 ≠ 𝜎22
𝑆12 3.372
Test Statistic: 𝐹 = = = 7.33
𝑆22 0.46
Problem: Pumpkins were grown under two experimental conditions. Two random samples of 11 and 9
pumpkins whose sample standard deviations of their weights are 0.8 and 0.5 respectively. Assuming that
the weight distributions are normal, test hypothesis that the true variances are equal.
K.KIRAN KUMAR 10
Solution: Given that, First sample size, 𝑛1 = 11 Second sample size, 𝑛2 = 9
First sample standard deviation, 𝑠1 = 0.8 Second sample standard deviation, 𝑠2 = 0.5
𝑛 𝑠2 11(0.8)2 𝑛 𝑠2 9(0.5)2
𝑆12 = 𝑛 1 −1 1 = = 0.704 𝑆22 = 𝑛 2−1
2
= = 0.28125 ≅ 0.2813
1 10 2 8
Null Hypothesis, 𝑯𝟎 : There is no significant difference between the variances. i.e. 𝜎21 = 𝜎22
Alternative Hypothesis, 𝑯𝟏 : There is significant difference between the variances. i.e. 𝜎21 ≠ 𝜎22
𝑆12 0.704
Test Statistic: 𝐹 = = = 2.5027
𝑆22 0.2813
Problem: The following random samples are measurements of the heat-producing capacity of specimens
of coal from two mines. Whether the two populations have the same variance?
2
∑(𝑦𝑗 − 𝑦) = 100 + 2500 + 1600 + 40000 + 400 + 10000 = 54600
2
∑(𝑥𝑖 −𝑥)2 63000 ∑(𝑦𝑗 −𝑦) 54600
𝑆12 = = = 15750 𝑆22 = = = 10920
𝑛1 − 1 4 𝑛2 −1 5
Null Hypothesis, 𝑯𝟎 : There is no significant difference between the variances. i.e. 𝜎21 = 𝜎22
Alternative Hypothesis, 𝑯𝟏 : There is significant difference between the variances. i.e. 𝜎21 ≠ 𝜎22
K.KIRAN KUMAR 11
Level of significance, 𝜶: α = 5 % = 0.05
𝑆12 15750
Test Statistic: 𝐹 = = = 1.4423 ≅ 1.44
𝑆22 10920
K.KIRAN KUMAR 12