Block 3
Block 3
STATISTICAL
Indira Gandhi
National Open University INFERENCE
School of Sciences
Block
3
TESTING OF HYPOTHESIS
UNIT 9
Concepts of Testing of Hypothesis 5
UNIT 10
Large Sample Tests 25
UNIT 11
Small Sample Tests 63
UNIT 12
Chi-Square and F-Tests 91
Curriculum and Course Design Committee
Prof. K. R. Srivathasan Prof. Rahul Roy
Pro-Vice Chancellor Math. and Stat. Unit
IGNOU, New Delhi Indian Statistical Institute, New Delhi
Block Production
Mr. Sunil Kumar, AR (P),School of Sciences, IGNOU
CRC prepared by Mr. Prabhat Kumar Sangal, School of Sciences, IGNOU
July, 2013
© Indira Gandhi National Open University, 2013
ISBN-978-81-266-
All rights reserved. No part of this work may be reproduced in any form, by mimeograph or any other
means, without permission in writing from the Indira Gandhi National Open University.
Further information on the Indira Gandhi National Open University may be obtained from University’s
Office at Maidan Garhi, New Delhi-110068 or visit University’s website http://www.ignou.ac.in
Printed and published on behalf of the Indira Gandhi National Open University, New Delhi by the
Director, School of Sciences.
Printed at:
TESTING OF HYPOTHESIS
In previous block of this course, we have discussed one part of statistical
inference, that is, estimation and we have learnt how we estimate the unknown
population parameter(s) by using point estimation and interval estimation.
In this block we will focus on the second part of statistical inference which is
known as testing of hypothesis.
In this block, you will study the basic concept of testing of hypothesis and
different kinds of well-known tests. At the end of this block you will be aware
of the idea, procedure and applications of the hypothesis testing. This block
contains four units.
Unit 9: Concepts of Testing of Hypothesis
This unit explains the fundamental aspect relating to testing of hypothesis with
examples. It describes basic concepts and methodologies as hypothesis,
critical region, types of errors, level of significance, general procedure of
testing a hypothesis, concept of p-value, relation between confidence interval
and testing of hypothesis.
Unit 10: Large Sample Tests
This unit explores the procedure of testing the hypothesis based on one sample
and two samples when sample size is large and taken from normal or non-
normal population(s). In this unit, you will learn about Z-test for testing of
hypothesis about mean, difference of two means, proportion, difference of two
proportions, variance and equality of two variances.
Unit 11: Small Sample Tests
This unit is devoted to test a hypothesis based on one sample and two samples
when sample sizes are small. In this unit, you will learn about t-test for testing
of hypothesis about mean and difference of two means. This unit also explores
the idea for testing the hypothesis for equality of two means when samples are
dependent and testing of hypothesis about population correlation coefficient.
Unit 12: Chi-square and F Tests
The last unit of this block describes the chi-square test for population variance
and F-test for equality of two population variances.
Notations and Symbols
X1, X2, …, Xn : Random sample of size n
H0 : Null hypothesis
H1 or HA : Alternative hypothesis
: Critical (rejection) region
: Non-rejection region
α : Size of critical region or type-I error or level of significance
: Type-II error
1‒ : Power of the test
z : Critical value of Z-test at α level of significance
9.1 INTRODUCTION
In previous block of this course, we have discussed one part of statistical
inference, that is, estimation and we have learnt how we estimate the unknown
population parameter(s) by using point estimation and interval estimation. In
this block, we will focus on the second part of statistical inference which is
known as testing of hypothesis.
In our day-to-day life, we see different commercials advertisements in
television, newspapers, magazines, etc. such as
(i) The refrigerator of certain brand saves up to 20% electric bill,
(ii) The motorcycle of certain brand gives 60 km/liter mileage,
(iii) A detergent of certain brand produces the cleanest wash,
(iv) Ninety nine out of hundred dentists recommend brand A toothpaste for
their patients to save the teeth against cavity, etc.
Now, the question may arise in our mind “can such types of claims be verified
statistically?” Fortunately, in many cases the answer is “yes”.
The technique of testing such type of claims or statements or assumptions is
known as testing of hypothesis. The truth or falsity of a claim or statement is
never known unless we examine the entire population. But practically it is not
possible in mostly situations so we take a random sample from the population
under study and use the information contained in this sample to take the
decision whether a claim is true or false.
This unit is divided into 11 sections. Section 9.1 is introductory in nature. In
Section 9.2, we defined the hypothesis. The concept and role of critical region
in testing of hypothesis is described in Section 9.3. In Section 9.4, we explored
the types of errors in testing of hypothesis whereas level of significance is
explored in Section 9.5. In Section 9.6, we explored the types of tests in testing
5
Testing of Hypothesis of hypothesis. The general procedure of testing a hypothesis is discussed in
Section 9.7. In Section 9.8, the concept of p-value in decision making about the
null hypothesis is discussed whereas the relation between confidence interval
and testing of hypothesis is discussed in Section 9.9. Unit ends by providing
summary of what we have discussed in this unit in Section 9.10 and solution of
exercises in Section 9.11.
Objectives
After reading this unit, you should be able to:
define a hypothesis;
formulate the null and alternative hypotheses;
explain what we mean by type-I and type-II errors;
explore the concept of critical region and level of significance;
define one-tailed and two-tailed tests;
describe the general procedure of testing a hypothesis;
concept of p-value; and
test a hypothesis by using confidence interval.
Before coming to the procedure of testing of hypothesis, we will discuss the
basis terms used in this procedure one by one in subsequent sections.
9.2 HYPOTHESIS
As we have discussed in previous section that in our day-to-day life, we see
different commercials advertisements in television, newspapers, magazines,
etc. and if someone may be interested to test such type of claims or statement
then we come across the problem of testing of hypothesis. For example,
(i) a customer of motorcycle wants to test whether the claim of motorcycle
of certain brand gives the average mileage 60 km/liter is true or false,
(ii) the businessman of banana wants to test whether the average weight of a
banana of Kerala is more than 200 gm,
(iii) a doctor wants to test whether new medicine is really more effective for
controlling blood pressure than old medicine,
(iv) an economist wants to test whether the variability in incomes differ in
two populations,
(v) a psychologist wants to test whether the proportion of literates between
two groups of people is same, etc.
In all the cases discussed above, the decision maker is interested in making
inference about the population parameter(s). However, he/she is not interested
in estimating the value of parameter(s) but he/she is interested in testing a
claim or statement or assumption about the value of population parameter(s).
Such claim or statement is postulated in terms of hypothesis.
In statistics, a hypothesis is a statement or a claim or an assumption about the
value of a population parameter (e.g., mean, median, variance, proportion,
etc.).
Similarly, in case of two or more populations a hypothesis is comparative
statement or a claim or an assumption about the values of population
parameters. (e.g., means of two populations are equal, variance of one
population is greater than other, etc.). The plural of hypothesis is hypotheses.
6
In hypothesis testing problems first of all we should being identifying the claim Concepts of Testing of
or statement or assumption or hypothesis to be tested and write it in the words. Hypothesis
Once the claim has been identified then we write it in symbolical form if
possible. As in the above examples,
(i) Customer of motorcycle may write the claim or postulate the hypothesis
“the motorcycle of certain brand gives the average mileage 60 km/liter.”
Here, we are concerning the average mileage of the motorcycle so let µ
represents the average mileage then our hypothesis becomes µ = 60 km /
liter.
(ii) Similarly, the businessman of banana may write the statement or
postulate the hypothesis “the average weight of a banana of Kerala is
greater than 200 gm.” So our hypothesis becomes µ > 200 gm.
(iii) Doctor may write the claim or postulate the hypothesis “ the new
medicine is really more effective for controlling blood pressure than old
medicine.” Here, we are concerning the average effect of the medicines
so let µ1 and µ2 represent the average effect of new and old medicines
respectively on controlling blood pressure then our hypothesis becomes
µ1 > µ2.
(iv) Economist may write the statement or postulate the hypothesis “ the
variability in incomes differ in two populations.” Here, we are concerning
the variability in income so let 12 and 22 represent the variability in
incomes in two populations respectively then our hypothesis becomes
12 22 .
(v) Psychologist may write the statement or postulate the hypothesis “the
proportion of literates between two groups of people is same.” Here, we
are concerning the proportion of literates so let P1 and P2 represent the
proportions of literates of two groups of people respectively then our
hypothesis becomes P1 = P2 or P1 –P2 = 0.
The hypothesis is classified according to its nature and usage as we will discuss
in subsequent subsections.
9.2.1 Simple and Composite Hypotheses
In general sense, if a hypothesis specifies only one value or exact value of the
population parameter then it is known as simple hypothesis. And if a
A hypothesis which
hypothesis specifies not just one value but a range of values that the population completely specifies
parameter may assume is called a composite hypothesis. parameter(s) of a
As in the above examples, the hypothesis postulated in (i) µ = 60 km/liter is theoretical population
(probability distribution)
simple hypothesis because it gives a single value of parameter (µ = 60) is called a simple
whereas the hypothesis postulated in (ii) µ > 200 gm is composite hypothesis hypothesis otherwise
because it does not specify the exact average value of weight of a banana. It called composite
may be 260, 350, 400 gm or any other. hypothesis.
Similarly, (iii) µ1 > µ2 or µ1 −µ2 > 0 and (iv) 12 22 or 12 22 0 are not
simple hypotheses because they specify more than one value as µ1 −µ2 = 4,
µ1 −µ2 = 7, 12 22 2, 12 22 5 , etc. and (v) P1 = P2 or P1 –P2 = 0 is simple
hypothesis because it gives a single value of parameter as P1 –P2 = 0.
9.2.2 Null and Alternative Hypotheses
As we have discussed in last page that in hypothesis testing problems first of
all we identify the claim or statement to be tested and write it in symbolical
7
Testing of Hypothesis form. After that we write the complement or opposite of the claim or statement
in symbolical form. In our example of motorcycle, the claim is µ = 60 km/liter
then its complement is µ ≠ 60 km/liter. In (ii) the claim is µ > 200 gm then its
complement is µ ≤ 200 gm. If the claim is µ < 200 gm then its complement is
µ ≥ 200 gm. The claim and its complement are formed in such a way that they
cover all possibility of the value of population parameter.
Once the claim and its compliment have been established then we decide of
We state the null and these two which is the null hypothesis and which is the alternative hypothesis.
alternative The thump rule is that the statement containing equality is the null hypothesis.
hypotheses in such a
way that they cover
That is, the hypothesis which contains symbols or or is taken as null
all possibility of the hypothesis and the hypothesis which does not contain equality i.e. contains
value of population or or is taken as alternative hypothesis. The null hypothesis is denoted
parameter.
by H0 and alternative hypothesis is denoted by H1 or HA.
In our example of motorcycle, the claim is µ = 60 km/liter and its complement
is µ ≠ 60 km/liter. Since claim µ = 60 km/liter contains equality sign so we take
it as a null hypothesis and complement µ ≠ 60 km/liter as an alternative
hypothesis, that is,
H0 : µ = 60 km/liter and H1: µ ≠ 60 km/liter
In our second example of banana, the claim is µ > 200 gm and its complement
is µ ≤ 200 gm. Since complement µ ≤ 200 gm contains equality sign so we take
complement as a null hypothesis and claim µ > 200 gm as an alternative
hypothesis, that is,
H0 : µ ≤ 200 gm and H1: µ > 200 gm
Formally these hypotheses are defined as
The hypothesis which we wish to test is called as the null hypothesis.
According to Prof. R.A. Fisher,
“A null hypothesis is a hypothesis which is tested for possible rejection under
the assumption that it is true.”
The hypothesis which complements to the null hypothesis is called alternative
hypothesis.
Note 1: Some authors use equality sign in null hypothesis instead of ≥ and ≤
signs.
The alternative hypothesis has two types:
(i) Two-sided (tailed) alternative hypothesis
(ii) One-sided (tailed) alternative hypothesis
If the alternative hypothesis gives the alternate of null hypothesis in both
directions (less than and greater than) of the value of parameter specified in
null hypothesis then it is known as two-sided alternative hypothesis and if it
gives an alternate only in one direction( less than or greater than) only then it is
known as one-sided alternative hypothesis. For example, if our alternative
hypothesis is H1 : θ ≠ 60 then it is a two-sided alternative hypothesis because its
means that the value of parameter θ is greater than or less than 60. Similarly, if
H1 : θ > 60 then it is a right-sided alternative hypothesis because its means that
the value of parameter θ is greater than 60 and if H1: θ < 60 then it is a
left-sided alternative hypothesis because its means that the value of parameter θ
is less than 60.
In testing procedure, we assume that the null hypothesis is true until there is
sufficient evidence to prove that it is false. Generally, the hypothesis is tested
8
with the help of a sample so evidence in testing of hypothesis comes from a Concepts of Testing of
sample. If there is enough sample evidence to suggest that the null hypothesis Hypothesis
is false then we reject the null hypothesis and support the alternative
hypothesis. If the sample fails to provide us sufficient evidence against the null
hypothesis we are not saying that the null hypothesis is true because here, we
take the decision on the basis of a random sample which is a small part of the
population. To say that null hypothesis is true we must study all observations
of the population under study. For example, if someone wants to test that the
person of India has two hands then to prove that this is true we must check all
the persons of India whereas to prove that it is false we require a person he /
she has one hand or no hand. So we can only say that there is not enough
evidence against the null hypothesis.
Note 2: When we assume that null hypothesis is true then we are actually
assuming that the population parameter is equal to the value in the claim. In our
example of motorcycle, we assume that µ = 60 km/liter whether the null
hypothesis is µ = 60 km/liter or µ ≤ 60 km/liter or µ ≥ 60 km/liter.
Now, you can try the following exercises.
E1) A company manufactures car tyres. Company claims that the average life
of its tyres is 50000 miles. To test the claim of the company, formulate
the null and alternative hypotheses.
E2) Write the null and alternative hypotheses in case (iii), (iv) and (v) of our
example given in Section 9.2.
E3) A businessman of orange formulates different hypotheses about the
average weight of the orange which are given below:
(i) H0: = 100 (ii) H1 : >100 (iii) H0 : ≤ 100 (iv) H1: ≠ 100
(v) H1: > 150 (vi) H0: = 130 (vii) H1: 0
Categorize the above cases into simple and composite hypotheses.
After describing the hypothesis and its types our next point in the testing of
hypothesis is critical region which will be described in next section.
range of T10 is 0 T10 1000. Now, we divide the whole space ( 0-1000) into
two regions as no-distinction awarded region (less than 750) and distinction
awarded region (greater than or equal to 750) as shown in Fig. 9.1. Here, 750 is
the critical value which separates the no-distinction and distinction awarded
regions.
Tn
Fig. 9.1: Non-rejection and critical regions for distinction award
On the basis of scores in all the papers of the selected student, we calculate the
10
value of the statistic T10 Xi . And calculated value may fall in distinction
i 1
award region or not, depending upon the observed value of test statistic.
For making a decision to reject or do not reject H0, we use test statistic
10
T10 X
i 1
i (sum of scores of 10 papers). If calculated value of test statistic T10
lies in no-distinction awarded region (critical region), that is, T10 < 750 then we
reject H0 and if calculated value of test statistic T10 lies in distinction awarded
region (non-rejection region), that is, T10 750 then we do not reject H0. It is a
basic structure of the procedure of testing of hypothesis which needs two
regions like:
(i) Region of rejection of null hypothesis H0
(ii) Region of non-rejection of null hypothesis H0
The point of discussion in this test procedure is “how to fix the cut off value
750”? What is the justification for this value? The distinction award region
may be like T10 800 or at T10 850 or at T10 900. So, there must be a
scientific justification for the cut-off value 750. In a statistical test procedure it
is obtained by using the probability distribution of the test statistic.
The region of rejection is called critical region. It has a pre-fixed area generally
denoted by , corresponding to a cut-off value in a probability distribution of
test statistic.
The rejection (critical) region lies in one-tail or two-tails on the probability
curve of sampling distribution of the test statistic its depends upon the
alternative hypothesis. Therefore, three cases arise: Concepts of Testing of
Hypothesis
Case I: If the alternative hypothesis is right-sided such as H1 : θ > θ0 or
H1 : θ1 > θ2 then the entire critical or rejection region of size α lies on
right tail of the probability curve of sampling distribution of the test
statistic as shown in Fig. 9.2.
Critical value is a
value or values that
separate the region of
rejection from the non-
rejection region.
Fig. 9.2
Case II: If the alternative hypothesis is left-sided such as H1: θ < θ0 or
H1 : θ1 < θ2 then the entire critical or rejection region of size α lies on
left tail of the probability curve of sampling distribution of the test
statistic as shown in Fig. 9.3.
Fig. 9.3
Case III: If the alternative hypothesis is two sided such as H1 : θ ≠ θ0 or
H1 : θ1 ≠ θ2 then critical or rejection regions of size α/2 lies on both
tails of the probability curve of sampling distribution of the test
statistic as shown in Fig. 9.4.
Fig. 9.4
Now, you can try the following exercise.
E4) If H0: θ = 60 and H1: θ ≠ 60 then critical region lies in one-tail or two-
tails.
Testing of Hypothesis
9.4 TYPE-I AND TYPE-II ERRORS
In Section 9.3, we have discussed a rule that if the value of test statistic falls in
rejection (critical) region then we reject the null hypothesis and if it falls in the
non-rejection region then we do not reject the null hypothesis. A test statistic is
calculated on the basis of observed sample observations. But a sample is a
small part of the population about which decision is to be taken. A random
sample may or may not be a good representative of the population.
A faulty sample misleads the inference (or conclusion) relating to the null
hypothesis. For example, an engineer infers that a packet of screws is sub-
standard when actually it is not. It is an error caused due to poor or
inappropriate (faulty) sample. Similarly, a packet of screws may infer good
when actually it is sub-standard. So we can commit two kinds of errors while
testing a hypothesis which are summarised in Table 9.1 which is given below:
Table 9.1: Type of Errors
Decision H0 True H1 True
Reject H0 Type-I Error Correct Decision
Do not reject H0 Correct Decision Type-II Error
Let us take a situation where a patient suffering from high fever reaches to a
doctor. And suppose the doctor formulates the null and alternative hypotheses
as
H0 : The patient is a malaria patient
H1 : The patient is not a malaria patient
Then following cases arise:
Case I: Suppose that the hypothesis H0 is really true, that is, patient actually
a malaria patient and after observation, pathological and clinical
examination, the doctor rejects H0, that is, he / she declares him / her
a non-malaria-patient. It is not a correct decision and he / she
commits an error in decision known as type-I error.
Case II: Suppose that the hypothesis H0 is actually false, that is, patient
actually a non-malaria patient and after observation, the doctor
rejects H0, that is, he / she declares him / her a non-malaria-patient. It
is a correct decision.
Case III: Suppose that the hypothesis H0 is really true, that is, patient actually
a malaria patient and after observation, the doctor does not reject H0,
that is, he / she declares him / her a malaria-patient. It is a correct
decision.
Case IV: Suppose that the hypothesis H0 is actually false, that is, patient
actually a non-malaria patient and after observation, the doctor does
not reject H0, that is, he / she declares him / her a malaria-patient. It
is not a correct decision and he / she commits an error in decision
known as type-II error.
Thus, we formally define type-I and type-II errors as below:
Type-I Error:
The decision relating to rejection of null hypothesis H0 when it is true is called
type-I error. The probability of committing the type-I error is called size of test,
denoted by and is given by
= P [Reject H0 when H0 is true] = P [Reject H0 / H0 is true]
12
We reject the null hypothesis if random sample / test statistic falls in rejection Concepts of Testing of
region, therefore, Hypothesis
α = P [ X / H0 ]
where X = (X1, X2,…,Xn) is a random sample and ω is the rejection region and
1- = 1-P[Reject H0 / H0 is true]
= P[Do not reject H0 / H0 is true] = P[Correct decision]
The (1-) is the probability of correct decision and it correlates to the concept
of 100(1-)% confidence interval used in estimation.
Type-II Error:
The decision relating to non-rejection of null hypothesis H0 when it is false
(i.e. H1 is true) is called type-II error. The probability of committing type-II
error is generally denoted by and is given by
= P[Do not reject H0 when H0 is false]
= P[Do not reject H0 when H1 is true]
= P[Do not reject H0 / H1 is true]
= P[ X ω / H1 ] where, is the non-rejection region.
and
1- = 1-P[Do not reject H0 / H1 is true]
= P[Reject H0 / H1 is true] = P[Correct decision]
The (1-) is the probability of correct decision and also known as “power of
the test”. Since it indicates the ability or power of the test to recognize
correctly that the null hypothesis is false, therefore, we wish a test that yields a
large power.
We say that a statistical test is ideal if it minimizes the probability of both types
of errors and maximizes the probability of correct decision. But for fix sample
size, and are so interrelated that the decrement in one results into the
increment in other. So minimization of both probabilities of type-I and type-II
errors simultaneously for fixed sample size is not possible without increasing
sample size. Also both types of errors will be at zero level (i.e. no error in
decision) if size of the sample is equal to the population size. But it involves
huge cost if population size is large. And it is not possible in all situations such
as testing of blood.
Depending on the problem in hand, we have to choose the type of error which
has to minimize. For this, we have to look at a situation, suppose there is a
decision making problem and there is a rule that if we make type-I error, we
lose10 rupees and if we make type-II error we lose 1000 rupees. In this case,
we try to eliminate the type-II error, since it is more expensive.
In another situation, suppose the Delhi police arrests a person whom they
suspect is a murderer. Now, policemen have to test hypothesis:
H0: Arrested person is innocent (not murderer)
H1: Arrested person is a murderer
The type-I error is
= P [Reject H0 when it is true]
That is, suspected person who is actually an innocent will be sent to jail when
H0 rejects, although H0 being a true.
Testing of Hypothesis The type-II error is
= P [Do not reject H0 when H1 is true]
That is, when arrested person truly a murderer but released by the police. Now,
we see that in this case type-I error is more serious than type-II error because a
murderer may be arrested / punished later on but sending jail to an innocent
person is serious.
Consider another situation, suppose we want to test the null hypothesis
H0 : p 0.5 against H1 : p 0.5 on the basis of tossing a coin once, where p is
the probability of getting a head in a single toss (trial). And we reject the null
hypothesis if a head appears and do not reject otherwise. The type-I error, that
is, the probability of Reject H0 when it is true can be calculated easily(as shown
in Example 1) but the computation of type-II error is not possible because there
are infinitely many alternatives for p such as p = 0.6, p = 0.1, etc.
Generally, strong control on α is necessary. It should be kept as low as
possible. In test procedure, we prefix it at very low level like = 0.05 ( 5%) or
0.01 (1%) .
Now, it is time to do some examples relating to α and .
Example 1: It is desired to test a hypothesis H 0 :p p0 1/ 2 against the
alternative hypothesis H1 :p p1 1/ 4 on the basis of tossing a coin once,
where p is the probability of “getting a head” in a single toss (trial) and
agreeing to reject H0 if a head appears and accept H0 otherwise. Find the value
of and .
Solution: In such type of problems, first of all we search for critical region.
Here, we have critical region = {head}
Therefore, the probability of type-I error can be obtained as
= P[Reject H0 when H0 is true]
P[X / H 0 ]= P[Head appears / H 0 ]
1 H0 is trueso we take value
P Head appears 1
p
2 2 of parameter pgiven in H 0
Also,
= P[Do not reject H0 when H1 is true]
P X / H1 P [Tail appears / H1 ]
Obtain type-I and type-II errors when critical region is X 0.4. Also obtain
power function of the test.
14
Solution: Here, we have critical (rejection) and non-rejection regions as Concepts of Testing of
Hypothesis
X : X 0.4 and X : X 0.4
We have to test the null hypothesis
H 0 : θ 1 against H1 : θ 2
The size of type-I error is given by
P X / H 0 P X 0.4 / 1
f x, dx
P X a f x, dx
… (1)
0.4 1 a
1
Now, by using f x , ; 0 x , we get from equation (1)
1 1
1
dx dx x 0.4 1 0.4 0.6
0.4 1 0.4
Similarly, the size of type-II error is given by
P X / H1 P X 0.4 / 2
0.4 1 0.4
1 1 0.4 1
dx dx x 0 0.4 0 0.2
0 2 0 2 2 2
16
Let us do one example based on type of tests. Concepts of Testing of
Hypothesis
Example 3: A company has replaced its original technology of producing
electric bulbs by CFL technology. The company manager wants to compare the
average life of bulbs manufactured by original technology and new technology
CFL. Write appropriate null and alternate hypotheses. Also say about one tailed
and two tailed tests.
Solution: Suppose the average lives of original and CFL technology bulbs are
denoted by 1 and 2 respectively.
If company manager is interested just to know whether any significant
difference exists in average-life time of two types of bulbs then null and
alternative hypotheses will be:
H0 : µ1 = µ2 [average lives of two types of bulbs are same]
H1 : µ1 µ2 [average lives of two types of bulbs are different]
Since alternative hypothesis is two-tailed therefore corresponding test will be
two-tailed.
If company manager is interested just to know whether average life of CFL is
greater than original technology bulbs then our null and alternative hypotheses
will be
H0 : µ1 ≥ µ2
17
Testing of Hypothesis assumed value θ0 of parameter θ. So we can take the null and
alternative hypotheses as
H 0 : 0 and H1 : 0 for two-tailed test
H 0 : 0 and H1 : 0
or for one-tailed test
H 0 : 0 and H1 : 0
In case of comparing same parameter of two populations of interest,
say, 1 and 2, then our null and alternative hypotheses would be
H0 : 1 2 and H1 : 1 2 for two-tailed test
H 0 : 1 2 and H1 : 1 2
or for one-tailed test
H 0 : 1 2 and H1 : 1 2
Step II: After setting the null and alternative hypotheses, we establish a
criteria for rejection or non-rejection of null hypothesis, that is,
decide the level of significance (), at which we want to test our
hypothesis. Generally, it is taken as 5% or 1% (α = 0.05 or 0.01).
Step III: The third step is to choose an appropriate test statistic under H0 for
testing the null hypothesis as given below:
Statistic Value of the parameter under H 0
Test statistic
Standard error of statistic
After that, specify the sampling distribution of the test statistic
preferably in the standard form like Z (standard normal), 2, t, F or
any other well-known in literature.
Step IV: Calculate the value of the test statistic described in Step III on the
basis of observed sample observations.
Step V: Obtain the critical (or cut-off) value(s) in the sampling distribution
of the test statistic and construct rejection (critical) region of size .
Generally, critical values for various levels of significance are
putted in the form of a table for various standard sampling
distributions of test statistic such as Z-table, 2-table, t-table, etc.
Step VI: After that, compare the calculated value of test statistic obtained
from Step IV, with the critical value(s) obtained in Step V and
locates the position of the calculated test statistic, that is, it lies in
rejection region or non-rejection region.
Step VII: In testing of hypothesis ultimately we have to reach at a conclusion.
It is done as explained below:
(i) If calculated value of test statistic lies in rejection region at
level of significance then we reject null hypothesis. It means
that the sample data provide us sufficient evidence against the
null hypothesis and there is a significant difference between
hypothesized value and observed value of the parameter.
(ii) If calculated value of test statistic lies in non-rejection region at
level of significance then we do not reject null hypothesis. Its
means that the sample data fails to provide us sufficient
evidence against the null hypothesis and the difference between
hypothesized value and observed value of the parameter due to
fluctuation of sample.
18
Note 3: Nowadays the decision about the null hypothesis is taken with the help Concepts of Testing of
of p-value. The concept of p-value is very important, because computer Hypothesis
packages and statistical software such as SPSS, SAS, STATA, MINITAB,
EXCEL, etc. all provide p-value. So, Section 9.8 is devoted to explain the
concept of p-value.
Now, with the help of an example we explain the above procedure.
Example 4: Suppose, it is found that average weight of a potato was 50 gm
and standard deviation was 5.1 gm nearly 5 years ago. We want to test that due
to advancement in agricultural technology, the average weight of a potato has
been increased. To test this, a random sample of 50 potatoes is taken and
calculate the sample mean (X) as 52gm. Describe the procedure to carry out
this test.
Solution: Here, we are given that
Specified value of population mean = 0 = 50 gm,
Population standard deviation = σ = 5.1 gm,
Sample size = n = 50,
Sample mean = X = 52 gm
To carry out the above test, we have to follow up the following steps:
Step I: First of all, we setup null and alternative hypotheses. Here, we want
to test that the average weight of potato is increased. So our claim is
“average weight of potato has increased” i.e. µ > 50 and its
complement is µ ≤ 50. Since complement contains equality sign so
we can take the complement as the null hypothesis and claim as the
alternative hypothesis, that is,
H0 : µ ≤ 50 gm and H1: µ > 50 gm [Here, θ = µ]
Since the alternative hypothesis is right-tailed, so our test is right-
tailed.
Step II: After setting the null and alternative hypotheses, we fix level of
significance α. Suppose, α = 0.01 (= 1 % level).
Step III: Define a test statistic to test the null hypothesis as
Statistic Value of the parameter under H 0
Test staistic
Standard error of statistic
X 50
T
σ/ n
Since sample size is large (n = 50 > 30) so by central limit theorem
the sampling distribution of test statistic approximately follows
standard normal distribution (as explained in Unit 1 of this course),
i.e. T ~ N(0,1)
Step IV: Calculate the value of test statistic on the basis of sample
observations as
52 50 2
T 2.78
5.1/ 50 0.72
Step V: Now, we find the critical value. The critical value or cut-off value for
standard normal distribution is given in Table I (Z-table) in the
Appendix at the end of Block 1 of this course. So from this table, the
critical value for right-tailed test at = 0.01 is zα = 2.33.
19
Testing of Hypothesis Step IV: Now, to take the decision about the null hypothesis, we compare the
calculated value of test statistic with the critical value.
Since calculated value of test statistic (= 2.78) is greater than critical
value (= 2.33), that means calculated value of test statistic lies in
rejection region at 1% level of significance as shown in Fig. 9.5. So
we reject null hypothesis and support the alternative hypothesis. Since
alternative hypothesis is our claim, so we support the claim.
Thus, we conclude that sample does not provide us sufficient
evidence against the claim so we may assume that the average weight
of potato has increased.
Fig. 9.5 Now, you can try the following exercise.
E9) What is the first step in testing of hypothesis?
Procedure of taking the decision about the null hypothesis on the basis of
p-value:
To take the decision about the null hypothesis based on p-value, the p-value is
compared with level of significance (α) and if p-value is equal or less than
then we reject the null hypothesis and if the p-value is greater than we do not
reject the null hypothesis.
The p-value for various tests can be obtained with the help of the tables given
in the Appendix of the Block 1 of this course. But unless we are dealing with
the standard normal distribution, the exact p-value is not obtained with the
tables as mentioned above. But if we test our hypothesis with the help of
computer packages or softwares such as SPSS, SAS, MINITAB, STATA,
EXCEL, etc. then these types of computer packages or softwares present the p-
value as part of the output for each hypothesis testing procedure. Therefore, in
this block we will also describe the procedure to take the decision about the
null hypothesis on the basis of critical value as well as p-value concepts.
9.10 SUMMARY
In this unit, we have covered the following points:
1. Statistical hypothesis, null hypothesis, alternative hypothesis, simple &
composite hypotheses.
2. Type-I and Type-II errors.
3. Critical region.
4. One-tailed and two-tailed tests.
5. General procedure of testing a hypothesis.
6. Level of significance.
7. Concept of p-value.
8. Relation between confidence interval and testing of hypothesis.
H1 : 12 22
23
Testing of Hypothesis (v) Here, psychologist wants to test whether the proportion of literates
between two groups of people is same so
Claim: P1 = P2 and complement: P1 ≠ P2
Since claim contains equality sign so we take claim as the null
hypothesis and complement as the alternative hypothesis i.e.
H0 : P1 = P2
H1 : P1 ≠ P2
E3) Here, (i) and (vi) represent the simple hypotheses because these
hypotheses tell us the exact values of parameter average weight of
orange as = 100 and = 130.
The rest (ii), (iii), (iv), (v) and (vii) represent the composite hypotheses
because these hypotheses do not tell us the exact values of parameter .
E4) Since alternative hypothesis H1 : θ ≠ 60 is two tailed so critical region
lies in two-tails.
E5) Let A and B denote the number of white balls and black balls in the urn
respectively. Further, let X be the number of white balls drawn among
the two balls from the urn then we can take the null and alternative
hypotheses as
H0 : A = 4 & B = 2 and H1: A = 2 & B = 4
The critical region is given by
w X : X 2 X : X 0,1
Thus,
= P [Reject H0 when H0 is true]
P X w / H0 PX 0 / H0 PX 1/ H0
4
C0 2 C0 4
C1 2 C1 1 1 4 2 1 8
6
6
C2 C2 15 15 15 15
9 3
α
15 5
Similarly,
= P [Do not reject H0 when H1 is true]
2
C 2 4 C0 1 1 1
P X w / H1 PX 2 / H1 6
C2 15 15
E6) Since level of significance is the probability of type-I error so in this
case level of significance is 0.05 or 5%.
E7) Here, the alternative hypothesis is two-tailed therefore, the test will be
two-tailed test.
E8) Whether the test of testing a hypothesis is one-tailed or two-tailed
depends on the alternative hypothesis. So correct option is (ii).
E9) First step in testing of hypothesis is to setup null and alternative
hypotheses.
24
UNIT 10 LARGE SAMPLE TESTS
Structure
10.1 Introduction
Objectives
10.2 Procedure of Testing of Hypothesis for Large Samples
10.3 Testing of Hypothesis for Population Mean Using Z-Test
10.4 Testing of Hypothesis for Difference of Two Population Means Using
Z-Test
10.5 Testing of Hypothesis for Population Proportion Using Z-Test
10.6 Testing of Hypothesis for Difference of Two Population Proportions
Using Z-Test
10.7 Testing of Hypothesis for Population Variance Using Z-Test
10.8 Testing of Hypothesis for Two Population Variances Using Z-Test
10.9 Summary
10.10 Solutions /Answers
10.1 INTRODUCTION
In previous unit, we have defined basic terms used in testing of hypothesis.
After providing you necessary material required for any test, we can move
towards discussing particular tests one by one. But before doing that let us tell
you the strategy we are adopting here.
First we categories the tests under two heads:
Large sample tests
Small sample tests
After that, their unit wise distribution is done. In this unit, we will discuss large
sample tests whereas in Units 11 and 12 we will discuss small sample tests.
The tests which are described in these units are known as “parametric tests”.
Sometimes in our studies in the fields of economics, psychology, medical, etc.
we take a sample of objects / units / participants / patients, etc. such as 70, 500,
1000, 10,000, etc. This situation comes under the category of large samples.
As a thumb rule, a sample of size n is treated as a large sample only if it
contains more than 30 units (or observations, n > 30). And we know that, for
large sample (n > 30), one statistical fact is that almost all sampling
distributions of the statistic(s) are closely approximated by the normal
distribution. Therefore, the test statistic, which is a function of sample
observations based on n > 30, could be assumed follow the normal distribution
approximately (or exactly).
But story does not end here. There are some other issues which need to be
taken care off. Some of these issues have been highlighted by making different
cases in each test as you will see when go through Sections 10.3 to 10.8 of this
unit.
This unit is divided into ten sections. Section 10.1 is introductory in nature.
General procedure of testing of hypothesis for large samples is described in
25
Testing of Hypothesis Section 10.2. In Section 10.3, testing of hypothesis for population mean is
discussed whereas in Section 10.4, testing of hypothesis for difference of two
population means with examples is described. Similarly, in Sections 10.5 and
10.6, testing of hypothesis for population proportion and difference of two
population proportions are explained respectively. Testing of hypothesis for
population variance and two population variances are described in Sections
10.7 and 10.8 respectively. Unit ends by providing summary of what we have
discussed in this unit in Section 10.9 and solution of exercises in Section 10.10.
Objectives
After studying this unit, you should be able to:
judge for a given situation whether we should go for large sample test or
not;
Applying the Z-test for testing the hypothesis about the population mean
and difference of two population means;
Applying the Z-test for testing the hypothesis about the population
proportion and difference of two population proportions; and
Applying the Z-test for testing the hypothesis about the population variance
and two population variances.
Step II: After setting the null and alternative hypotheses, we have to choose
level of significance. Generally, it is taken as 5% or 1% (α = 0.05 or
0.01). And accordingly rejection and non-rejection regions will be
decided.
Step III: Third step is to determine an appropriate test statistic, say, Z in case
of large samples. Suppose Tn is the sample statistic such as sample
mean, sample proportion, sample variance, etc. for the parameter
then for testing the null hypothesis, test statistic is given by
we know that SE of a statistic is
Tn E(Tn ) Tn E(Tn ) theSD of the sampling distribution
Z
SE(Tn ) Var(Tn ) of that statistic
SE(Tn ) SD(Tn ) Var(Tn )
In this case, the rejection (critical) region falls under the right tail of
the probability curve of the sampling distribution of test statistic Z.
Fig. 10.1
Suppose z is the critical value at level of significance so entire
region greater than or equal to z is the rejection region and less than
z is the non-rejection region as shown in Fig. 10.1.
Testing of Hypothesis If z (calculated value ) ≥ z (tabulated value), that means the
calculated value of test statistic Z lies in the rejection region, then we
reject the null hypothesis H0 at level of significance. Therefore, we
conclude that sample data provides us sufficient evidence against the
null hypothesis and there is a significant difference between
hypothesized or specified value and observed value of the parameter.
If z < z , that means the calculated value of test statistic Z lies in non-
rejection region, then we do not reject the null hypothesis H0 at
level of significance. Therefore, we conclude that the sample data
fails to provide us sufficient evidence against the null hypothesis and
the difference between hypothesized value and observed value of the
parameter due to fluctuation of sample.
so the population parameter θ may be 0.
Case II: When H 0 : 0 and H 1 : 0 (left-tailed test)
In this case, the rejection (critical) region falls under the left tail of the
probability curve of the sampling distribution of test statistic Z.
Suppose -z is the critical value at level of significance then entire
region less than or equal to -z is the rejection region and greater
than -z is the non-rejection region as shown in Fig. 10.2.
Fig. 10.2 If z ≤-z, that means the calculated value of test statistic Z lies in the
rejection region, then we reject the null hypothesis H0 at level of
significance.
If z >-z, that means the calculated value of test statistic Z lies in the
non-rejection region, then we do not reject the null hypothesis H0 at
level of significance.
In case of two-tailed test: When H 0 : 0 and H 1 : 0
In this case, the rejection region falls under both tails of the
probability curve of sampling distribution of the test statistic Z. Half
the area (α) i.e. α/2 will lies under left tail and other half under the
right tail. Suppose zα / 2 and zα / 2 are the two critical values at the
left-tailed and right-tailed respectively. Therefore, entire region less
than or equal to z / 2 and greater than or equal to zα / 2 are the
rejection regions and between zα / 2 and zα / 2 is the non-rejection
Fig. 10.3
region as shown in Fig. 10.3.
These p-values for Z-test can be obtained with the help of Table-I (Z-table)
given in the Appendix at the end of Block 1 of this course (which gives the
probability [0 Z z] for different value of z) as discussed in Unit 14 of
MST-003.
For example, if test is right-tailed and calculated value of test statistic Z is 1.23
then
p-value = P Z z P Z 1.23 0.5 P 0 Z 1.23
σ
SE X Var X … (2)
n
Now, follow the same procedure as we have discussed in previous section, that
is, first of all we have to setup null and alternative hypotheses. Since here we
want to test the hypothesis about the population mean so we can take the null
and alternative hypotheses as
30
Here, and 0 0 Large Sample Tests
H0 : 0 and H1 : 0 for two-tailed test if we compareit with
general procedure.
H0 : 0 and H1 : 0
or for one-tailed test
H0 : 0 and H1 : 0
When we assume that
For testing the null hypothesis, the test statistic Z is given by the null hypothesis is
true then we are actually
X E X assuming that the
Z
SE X population parameter is
equal to the value in the
X µ0 null hypothesis. For
Z Using equations (1) and (2) and example, we assume that
σ/ n under H 0 we assume that µ µ 0 . µ = 60 whether the null
hypothesis is µ = 60 or
The sampling distribution of the test statistic depends upon σ2 that it is known µ ≤ 60 or µ ≥ 60.
or unknown. Therefore, two cases arise:
Case I: When σ2 is known
In this case, the test statistic follows the normal distribution with
mean 0 and variance unity when the sample size is the large as the
population under study is normal or non-normal. If the sample size is
small then test statistic Z follows the normal distribution only when
population under study is normal. Thus,
X µ0
Z ~ N 0, 1
σ/ n
Case II: When σ2 is unknown
In this case, we estimate σ2 by the value of sample variance (S2)
where,
1 n 2
S2
n 1 i1
Xi X
X 0
Then become test statistic follows the t-distribution with
S/ n
(n−1) df as the sample size is large or small provided the population
under study follows normal as we have discussed in Unit 2 of this
course. But when population under study is not normal and sample
size is large then this test statistic approximately follows normal
distribution with mean 0 and variance unity, that is,
X 0
Z ~ N 0,1
S/ n
After that, we calculate the value of test statistic as may be the case (σ2 is
known or unknown) and compare it with the critical value given in Table 10.1
at prefixed level of significance α. Take the decision about the null hypothesis
as described in the previous section.
From above discussion of testing of hypothesis about population mean, we note
following point:
(i) When σ2 is known then we apply the Z-test as the population under study
is normal or non-normal for the large sample. But when sample size is
31
Testing of Hypothesis small then we apply the Z-test only when population under study is
normal.
(ii) When σ2 is unknown then we apply the t-test only when the population
under study is normal as sample size is large or small. But when the
assumption of normality is not fulfilled and sample size is large then we
can apply the Z-test.
(iii) When sample is small and σ2 is known or unknown and the form of the
population is not known then we apply the non-parametric test as we will
be discussed in Block 4 of this course.
Following examples will help you to understand the procedure more clearly.
Example 1: A light bulb company claims that the 100-watt light bulb it sells
has an average life of 1200 hours with a standard deviation of 100 hours. For
testing the claim 50 new bulbs were selected randomly and allowed to burn
out. The average lifetime of these bulbs was found to be 1180 hours. Is the
company’s claim is true at 5% level of significance?
Solution: Here, we are given that
Specified value of population mean = 0 = 1200 hours,
Population standard deviation = σ = 100 hours,
Sample size = n = 50
Sample mean = X = 1180 hours.
In this example, the population parameter being tested is population mean i.e.
average life of a bulb (µ) and we want to test the company’s claim that average
life of a bulb is 1200 hours. So our claim is = 1200 and its complement is
≠ 1200. Since claim contains the equality sign so we can take the claim as the
null hypothesis and complement as the alternative hypothesis. So
H 0 : 0 1200 average life of a bulb is 1200 hours
H 1 : 1200 average life of a bulb is not1200 hours
Also the alternative hypothesis is two-tailed so the test is two-tailed test.
Here, we want to test the hypothesis regarding mean when population SD
(variance) is known and sample size n = 50(> 30) is large. So we will go for
Z-test.
Thus, for testing the null hypothesis the test statistic is given by
X 0
Z
/ n
1180 1200 20
1.41
100 / 50 14.14
The critical (tabulated) values for two-tailed test at 5% level of significance are
± zα/2 = ± z0.025 = ± 1.96.
Fig. 10.4 Since calculated value of test statistic Z ( = –1.41) is greater than critical value
(= − 1.96) and less than the critical value (= 1.96), that means it lies in non-
rejection region as shown in Fig. 10.4, so we do not reject the null hypothesis.
Since the null hypothesis is the claim so we support the claim at 5% level of
significance.
Decision according to p-value:
The test is two-tailed, therefore,
p-value = 2P Z z 2P Z 1.41 Large Sample Tests
Since p-value (= 0.1586) is greater than α (= 0.05) so we do not reject the null
hypothesis at 5% level of significance.
Decision according to confidence interval:
Here, test is two-tailed, therefore, we contract two-sided confidence interval for
population mean.
Since population standard deviation is known, therefore, we can use
(1−α) 100 % confidence interval for population mean when population
variance is known which is given by
X z / 2 n , X z / 2 n
100 100
1180 1.96 50 ,1180 1.96 50
or 1180 27.71,1180 27.71
or 1152.29, 1207.71
Since 95% confidence interval for average life of a bulb contains the value of
the parameter specified by the null hypothesis, that is, 0 1200 so we do
not reject the null hypothesis.
Thus, we conclude that sample does not provide us sufficient evidence against
the claim so we may assume that the company’s claim that the average life of a
bulb is 1200 hours is true.
Note 2: Here, we note that the decisions about null hypothesis based on three
approaches (critical value or classical, p-value and confidence interval) are
same. The learners are advised to make the decision about the claim or
statement by using only one of the three approaches in the examination. Here,
we used all these approaches only to give you an idea how they can be used in
a given problem. Those learners who will opt biostatistics specialisation will
see and realize the importance of confidence interval approach in Unit 16 of
MSTE-004.
Example 2: A manufacturer of ball point pens claims that a certain pen
manufactured by him has a mean writing-life at least 460 A-4 size pages. A
purchasing agent selects a sample of 100 pens and put them on the test. The
mean writing-life of the sample found 453 A-4 size pages with standard
deviation 25 A-4 size pages. Should the purchasing agent reject the
manufacturer’s claim at 1% level of significance?
Solution: Here, we are given that
Specified value of population mean = 0 = 460,
33
Testing of Hypothesis Sample size = n = 100,
Sample mean = X = 453,
Sample standard deviation = S = 25
Here, we want to test the manufacturer’s claim that the mean writing-life (µ) of
pen is at least 460 A-4 size pages. So our claim is ≥ 460 and its complement
is < 460. Since claim contains the equality sign so we can take the claim as
the null hypothesis and the complement as the alternative hypothesis. So
H 0 : 0 460 and H 1 : 460
Since p-value (= 0.0026) is less than α (= 0.01) so we reject the null hypothesis
at 1% level of significance.
Therefore, we conclude that the sample provide us sufficient evidence against
the claim so the purchasing agent rejects the manufacturer’s claim at 1% level
of significance.
Now, you can try the following exercises.
E4) A sample of 900 bolts has a mean length 3.4 cm. Is the sample regarded
to be taken from a large population of bolts with mean length 3.25 cm
and standard deviation 2.61 cm at 5% level of significance?
E5) A big company uses thousands of CFL lights every year. The brand that
the company has been using in the past has average life of 1200 hours. A
new brand is offered to the company at a price lower than they are paying
for the old brand. Consequently, a sample of 100 CFL light of new brand Large Sample Tests
is tested which yields an average life of 1220 hours with standard
deviation 90 hours. Should the company accept the new brand at 5% level
of significance?
and
12 22
Var X Y Var X Var Y
n1 n 2
But we know that standard error = Variance
12 22
SE X Y Var X Y … (4)
n1 n 2
Now, follow the same procedure as we have discussed in Section 10.2, that is,
first of all we have to setup null and alternative hypotheses. Here, we want to
test the hypothesis about the difference of two population means so we can take
the null hypothesis as
Here, 1 1 and 2 2
H0 : 1 2 (no difference in means) if we compareit with
general procedure.
35
Testing of Hypothesis or H 0 : 1 2 0 (difference in two means is 0)
and the alternative hypothesis as
H1 : 1 2 for two-tailed test
H0 : 1 2 and H1 : 1 2
or for one-tailed test
H0 : 1 2 and H1 : 1 2
For testing the null hypothesis, the test statistic Z is given by
Z
X Y E X Y
SE X Y
X Y µ1 µ 2
or Z using equations (3) and (4)
σ12 σ 22
n1 n 2
Since under null hypothesis we assume that µ1 = µ2, therefore, we have
XY
Z
σ12 σ 22
n1 n 2
Now, the sampling distribution of the test statistic depends upon 12 and 22
that both are known or unknown. Therefore, four cases arise:
Case I: When 12 & 22 are known and 12 22 2
In this case, the test statistic follows normal distribution with mean
0 and variance unity when the sample sizes are large as both the
populations under study are normal or non-normal. But when
sample sizes are small then test statistic Z follows normal
distribution only when populations under study are normal, that is,
XY
Z ~ N 0,1
1 1
σ
n1 n2
Case II: When 12 & 22 are known and 12 22
In this case, the test statistic also follows the normal distribution as
described in case I, that is,
XY
Z ~ N(0, 1)
σ12 σ 22
n1 n 2
Case III: When 12 & 22 are unknown and 12 22 2
In this case, 12 & 22 are estimated by the values of the sample
variances S12 &S22 respectively and the exact distribution of test
statistic is difficult to derive. But when sample sizes n1 and n2 are
large (> 30) then central limit theorem, the test statistic
approximately normally distributed with mean 0 and variance unity,
that is,
XY
Z ~ N(0, 1)
S12 S22
n1 n 2
After that, we calculate the value of test statistic and compare it with the
critical value given in Table 10.1 at prefixed level of significance α. Take the
decision about the null hypothesis as described in Section10.2.
From above discussion of testing of hypothesis about population mean, we note
following point:
(i) When 12 & 22 are known then we apply the Z-test as both the population
under study are normal or non-normal for the large sample. But when
sample sizes are small then we apply the Z-test only when populations
under study are normal.
(ii) When 12 & 22 are unknown then we apply the t-test only when the
populations under study are normal as sample sizes are large or small.
But when the assumption of normality is not fulfilled and sample sizes
are large then we can apply the Z-test.
(iii) When samples are small and 12 & 22 are known or unknown and the
form of the population is not known then we apply the non-parametric
test as we will be discussed in Block 4 of this course.
Let us do some examples based on above test.
Example 3: In two samples of women from Punjab and Tamilnadu, the mean
height of 1000 and 2000 women are 67.6 and 68.0 inches respectively. If
population standard deviation of Punjab and Tamilnadu are same and equal to
5.5 inches then, can the mean heights of Punjab and Tamilnadu women be
regarded as same at 1% level of significance?
37
Testing of Hypothesis Solution: We are given
n1 = 1000, n2 = 2000, X 67.6, Y 68.0 and σ1 σ2 σ 5.5
Here, we wish to test that the mean height of Punjab and Tamilnadu women is
same. If 1 and 2 denote the mean heights of Punjab and Tamilnadu women
respectively then our claim is 1 = 2 and its complement is 1 ≠ 2. Since the
claim contains the equality sign so we can take the claim as the null hypothesis
and complement as the alternative hypothesis. Thus,
H 0 : 1 2 and H1 : 1 2
Since the alternative hypothesis is two-tailed so the test is two-tailed test.
Here, we want to test the hypothesis regarding two population means. The
standard deviations of both populations are known and sample sizes are large,
so we should go for Z-test.
So, for testing the null hypothesis, the test statistic Z is given by
XY
Z
σ12 σ 22
n1 n 2
67.6 68.0 0.4
2 2
5.5 5.5 1 1
1000 2000 5.5
1000 2000
0.4
1.88
5.5 0.0387
The critical (tabulated) values for two-tailed test at 1% level of significance are
± zα/2 = ± z0.005 = ± 2.58.
Since calculated value of Z ( = −1.88) is greater than the critical value
(= − 2.58) and less than the critical value (= 2.58), that means it lies in non-
rejection region as shown in Fig. 10.6, so we do not reject the null hypothesis
i.e. we fail to reject the claim.
Decision according to p-value:
Fig. 10.6
The test is two-tailed, therefore,
p-value = 2P Z z 2P Z 1.88
Since p-value (= 0.0602) is greater than ( 0.01) so we do not reject the null
hypothesis at1% level of significance.
Thus, we conclude that the samples do not provide us sufficient evidence
against the claim so we may assume that the average height of women of
Punjab and Tamilnadu is same.
Example 4: A university conducts both face to face and distance mode classes
for a particular course indented both to be identical. A sample of 50 students of
face to face mode yields examination results mean and SD respectively as:
X 80.4, S1 12.8
and other sample of 100 distance-mode students yields mean and SD of their Large Sample Tests
examination results in the same course respectively as:
Y 74.3, S2 20.5
Are both educational methods statistically equal at 5% level?
Solution: Here, we are given that
n1 50, X 80.4, S1 12.8;
p-value = 2P Z z 2P Z 2.23
Since p-value (= 0.0258) is less than ( 0.05) so we reject the null hypothesis
at 5% level of significance.
Thus, we conclude that samples provide us sufficient evidence against the
claim so both methods of education, i.e. face-to-face and distance-mode, are
not statistically equal.
Testing of Hypothesis Now, you can try the following exercises.
E6) Two brands of electric bulbs are quoted at the same price. A buyer was
tested a random sample of 200 bulbs of each brand and found the
following information:
Mean Life (hrs.) SD(hrs.)
Brand A 1300 41
Brand B 1280 46
Is there significant difference in the mean duration of their lives of two
brands of electric bulbs at 1% level of significance?
E7) Two research laboratories have identically produced drugs that provide
relief to BP patients. The first drug was tested on a group of 50 BP
patients and produced an average 8.3 hours of relief with a standard
deviation of 1.2 hours. The second drug was tested on 100 patients,
producing an average of 8.0 hours of relief with a standard deviation of
1.5 hours. Does the first drug provide a significant longer period of relief
at a significant level of 5%?
Case I: When sample size is not sufficiently large i.e. either of the conditions
np > 5 or nq > 5 does not meet, then we use exact binomial test. But exact
binomial test is beyond the scope of this course.
Case II: When sample size is sufficiently large, such that np > 5 and nq > 5
then by central limit theorem, the sampling distribution of sample proportion p
is approximately normally distributed with mean and variance as
PQ
E(p) = P and Var(p) = … (5)
n
But we know that standard error = Variance
PQ
SE (p) … (6)
n
Now, follow the same procedure as we have discussed in Section 10.2, first of
all we setup null and alternative hypotheses. Since here we want to test the
hypothesis about specified value P0 of the population proportion so we can take
the null and alternative hypotheses as
Here, P and P0
H 0 : P P0 and H1 : P P0 for two-tailed test if we compare it with
0
general procedure.
H 0 : P P0 and H1 : P P0
or for one-tailed test
H 0 : P P0 and H1 : P P0
For testing the null hypothesis, the test statistic Z is given by
p E p
Z
SE p
p P0
Z ~ N 0, 1 under H0 using equations(5) and (6)
P0 Q 0
n
After that, we calculate the value of test statistic and compare it with the
critical value(s) given in Table 10.1 at prefixed level of significance α. Take
the decision about the null hypothesis as described in Section 10.2.
Let us do some examples of testing of hypothesis about population proportion.
Example 5: A machine produces a large number of items out of which 25%
are found to be defective. To check this, company manager takes a random
sample of 100 items and found 35 items defective. Is there an evidence of more
deterioration of quality at 5% level of significance?
Solution: The company manager wants to check that his machine produces
25% defective items. Here, attribute under study is defectiveness. And we
define our success and failure as getting a defective or non defective item.
Let P = Population proportion of defectives items = 0.25(= P0 )
p = Observed proportion of defectives items in the sample = 35/100 = 0.35
Here, we want to test that machine produces more defective items, that is, the
proportion of defective items (P) greater than 0.25. So our claim is P > 0.25
41
Testing of Hypothesis and its complement is P ≤ 0.25. Since complement contains the equality sign so
we can take the complement as the null hypothesis and the claim as the
alternative hypothesis. So
H 0 : P P0 0.25 and H 1 : P 0.25
p-value = P Z z P Z 2.31
0.0104
Since p-value (= 0.0104) is less than ( 0.05) so we reject the null
hypothesis at 5% level of significance.
Thus, we conclude that the sample fails to provide us sufficient evidence
against the claim so we may assume that deterioration in quality exists at 5%
level of significance.
Example 6: A die is thrown 9000 times and draw of 2 or 5 is observed 3100
times. Can we regard that die is unbiased at 5% level of significance.
Solution: Let getting a 2 or 5 be our success, and getting a number other than 2
or 5 be a failure then in usual notions, we have
n = 9000, X = number of successes = 3100, p = 3100/9000 = 0.3444
Here, we want to test that the die is unbiased and we know that if die is Large Sample Tests
unbiased then proportion or probability of getting 2 or 5 is
P = Probability of getting a 2 or 5
= Probability of getting 2 + Probability of getting 5
1 1 1
0.3333
6 6 3
So our claim is P = 0.3333 and its complement is P ≠ 0.3333. Since the claim
contains the equality sign so we can take the claim as the null hypothesis and
complement as the alternative hypothesis. Thus,
H0 : P P0 0.3333 and H1 :P 0.3333
44
PQ PQ Large Sample Tests
p1 ~ N P1 , 1 1 and p 2 ~ N P2 , 2 2
n1 n2
That is,
PQ P Q
p1 p 2 ~ N P1 P2 , 1 1 2 2
n1 n2
Now, follow the same procedure as we have discussed in Section 10.2, first of
all we have to setup null and alternative hypotheses. Here, we want to test the
hypothesis about the difference of two population proportions so we can take
the null hypothesis as
Here, 1 P1 and
H 0 : P1 P2 (no difference in proportions) 2 P2 if we compare
it with general
procedure.
H 0 : P1 P2 and H1 : P1 P2
or for one-tailed test
H 0 : P1 P2 and H1 : P1 P2
p1 p2 P1 P2
or Z using equations (7) and (8)
P1Q1 P2Q2
n1 n2
Since under null hypothesis we assume that P1 = P2 = P, therefore, we have
45
Testing of Hypothesis p1 p 2
Z
1 1
PQ
n1 n 2
where, Q = 1-P.
Generally, P is unknown then it is estimated by the value of pooled proportion
P̂, where
n p n 2 p 2 X1 X 2 ˆ 1 Pˆ
Pˆ 1 1 and Q
n1 n 2 n1 n 2
After that, we calculate the value of test statistic and compare it with the
critical value(s) given in Table 10.1 at prefixed level of significance α. Take
the decision about the null hypothesis as described in Section 10.2.
Now, it is time for doing some examples for testing of hypothesis about the
difference of two population proportions.
Example 7: In a random sample of 100 persons from town A, 60 are found to
be high consumers of wheat. In another sample of 80 persons from town B, 40
are found to be high consumers of wheat. Do these data reveal a significant
difference between the proportions of high wheat consumers in town A and
town B ( at α = 0.05 )?
Solution: Here, attribute under study is high consuming of wheat. And we
define our success and failure as getting a person of high consumer of wheat
and not high consumer of wheat respectively.
We are given that
n1 = total number of persons in the sample of town A = 100
n2 = total number of persons in the sample of town B = 80
X1 = number of persons of high consumer of wheat in town A = 60
X2 = number of persons of high consumer of wheat in town B = 40
The sample proportion of high wheat consumers in town A is
X1 60
p1 0.60
n1 100
and the sample proportion of wheat consumers in town B is
X 2 40
p2 0.50
n 2 80
Here, we want to test that the proportion of high consumers of wheat in two
towns, say, P1 and P2, is not same. So our claim is P1 ≠ P2 and its complement
is P1 = P2. Since the complement contains the equality sign, so we can take the
complement as the null hypothesis and the claim as the alternative hypothesis.
Thus,
H 0 : P1 P2 P and H 1 : P1 P2
46
n1p1 100 0.60 60 5, n1q1 100 0.40 40 5 Large Sample Tests
n 2 p2 80 0.50 40 5, n 2q 2 80 0.50 40 5
We see that condition of normality meets, so we can go for Z-test.
The estimate of the combined proportion (P) of high wheat consumers in two
towns is given by
n1p1 n 2p2 X1 X 2 60 40 5
P̂
n1 n 2 n1 n 2 100 80 9
ˆ 1 Pˆ 1 5 4
Q
9 9
For testing the null hypothesis, the test statistic Z is given by
p1 p 2
Z
ˆ 1 1
P̂Q
n1 n 2
0.60 0.50 0.10
1.34
5 4 1 1 0.0745
9 9 100 80
The critical values for two-tailed test at 5% level of significance are ± zα/2
= ± z0.025 = ±1.96.
Since calculated value of Z (=1.34) is less than the critical value (= 1.96) and
greater than critical value (= −1.96), that means calculated value of Z lies in
non-rejection region, so we do not reject the null hypothesis and reject the
alternative hypothesis i.e. we reject the claim.
Decision according to p-value:
Since the test is two-tailed, therefore
p-value = 2 P Z z 2P Z 1.34
Since p-value (= 0.1802) is greater than ( 0.05) so we do not reject the null
hypothesis at 5% level of significance.
Thus, we conclude that the samples provide us the sufficient evidence against
the claim so we may assume that the proportion of high consumers of wheat in
two towns A and B is same.
Example 8: A machine produced 60 defective articles in a batch of 400. After
overhauling it produced 30 defective in a batch of 300. Has the machine
improved due to overhauling? (Take = 0.01).
Solution: Here, the machine produced articles and attribute under study is
defectiveness. And we define our success and failure as getting a defective or
non defective article. Therefore, we are given that
X1 = number of defective articles produced by the machine before overhauling
= 60
X2 = number of defective articles produced by the machine after overhauling
= 30
47
Testing of Hypothesis and n1 400, n 2 300,
Let p1 = Observed proportion of defective articles in the sample before the
overhauling
X1 60
0.15
n1 400
and p2 = Observed proportion of defective articles in the sample after the
overhauling
X2 30
0.10
n 2 300
Here, we want to test that machine improved due to overhauling that means the
proportion of defective articles is less after overhauling. If P1 and P2 denote the
proportion defectives before and after the overhauling the machine so our claim
is P1 > P2 and its complement P1 ≤ P2. Since the complement contains the
equality sign so we can take the complement as the null hypothesis and claim
as the alternative hypothesis. Thus,
H 0 : P1 P2 and H1 : P1 P2
Since the alternative hypothesis is right-tailed so the test is right-tailed test.
Since P is unknown, so the pooled estimate of proportion is given by
X1 X 2 60 30 90 9 ˆ 1 Pˆ 1 9 61 .
P̂ and Q
n1 n 2 400 300 700 70 70 70
Before proceeding further, first we have to check whether the condition of
normality meets or not.
n1p1 400 0.15 60 5, n1q1 400 0.85 340 5
n 2 p2 300 0.10 30 5, n 2q 2 300 0.90 270 5
We see that condition of normality meets, so we can go for Z-test.
For testing the null hypothesis, the statistic is given by
p1 p 2
Z
ˆ 1 1
P̂Q
n1 n 2
0.15 0.10 0.05
1.95
9 61 1 1 0.0256
70 70 400 300
The critical value for right-tailed test at 1% level of significance is
zα = z0.01 = 2.33.
Since calculated value of Z (= 1.95) is less than the critical value (= 2.33) that
means calculated value of Z lies in non-rejection region, so we do not reject the
null hypothesis and reject the alternative hypothesis i.e. we reject the claim at
1% level of significance.
Decision according to p-value:
Since the test is right-tailed, therefore,
p-value = P Z z P Z 1.95
48
0.5 P 0 Z 1.95 0.5 0.4744 0.0256 Large Sample Tests
Since p-value (= 0.0256) is greater than ( 0.01) so we do not reject the null
hypothesis at1% level of significance.
Thus, we conclude that the samples provide us sufficient evidence against the
claim so the machine has not been improved after overhauling.
Now, you can try the following exercises.
E10) The proportions of literates between groups of people of two districts A
and B are tested. Out of the 100 persons selected at random from each
district, 50 from district A and 40 from district B are found literates. Test
whether the proportion of literate persons in two districts A and B is
same at 1% level of significance?
E11) In a large population 30% of a random sample of 1200 persons had blue-
eyes and 20% of a random sample of 900 persons had the same blue-
eyes in another population. Test the proportion of blue-eyes persons is
same in two populations at 5% level of significance.
2 2
SE S2 Var S2 σ … (10)
n
The general procedure of this test is explained in the next page.
49
Testing of Hypothesis As we are doing so far in all tests, first Step in hypothesis testing problems is to
setup null and alternative hypotheses. Here, we want to test the hypothesis
specified value 20 of the population variance 2 so we can take our null and
alternative hypotheses as
and
214 224
Var S12 S22 Var S12 Var S22
n1 n2
51
Testing of Hypothesis But we know that standard error = Variance
2 14 2 42
SE S12 S22 Var S12 S22 … (12)
n1 n2
Now, follow the same procedure as we have discussed in Section 10.2, that is,
first of all we have to setup null and alternative hypothesis. Here, we want to
test the hypothesis about the two population variances, so we can take our null
and alternative hypotheses as
H 0 : 12 22 2 and H1 : 12 22 for two-tailed test
H 0 : 12 22 and H1 : 12 22
or for one-tailed test
H 0 : 12 22 and H1 : 12 22
or
S12 S22 σ12 σ 22
Z using equations (11) and (12)
2σ14 2σ 42
n1 n2
Since under null hypothesis we assume that σ12 σ22 σ2 , therefore, we have
S12 S22
Z ~ N 0,1
1 1
σ2 2
n1 n 2
Generally, population variances σ12 and σ 22 are unknown, so we estimate them
by their corresponding sample variances S12 and S 22 as
ˆ 12 S12 and ˆ 22 S 22
Thus, the test statistic Z is given by
S12 S22
Z ~ N 0,1
2S14 2S42
n n
1 2
After that, we calculate the value of test statistic as may be the case and
compare it with the critical value given in Table 10.1 at prefixed level of
significance α. Take the decision about the null hypothesis as described in
Section 10.2.
Note 3: When populations under study are normal then for testing the
hypothesis about equality of population variances we use F- test which will be
discussed in Unit 12 of this course. Whereas when the form of the populations
under study is not known and sample sizes are large then we apply Z-test as
discussed above.
Now, it is time to do an example based on above test.
52
Example 10: A comparative study of variation in weights (in pound) of Army- Large Sample Tests
soldiers and Navy- sailors was made. The sample variance of the weight of 120
soldiers was 60 pound2 and the sample variance of the weight of 160 sailors
was 70 pound2. Test whether the soldiers and sailors have equal variation in
their weights. Use 5% level of significance.
We want to test that the Army-soldiers and Navy-sailors have equal variation
in their weights. If 12 and 22 denote the variances in the weight of Army-
soldiers and Navy-sailors so our claim is 12 22 and its complement is
12 22 . Since the claim contains the equality sign so we can take the claim as
the null hypothesis and complement as the alternative hypothesis. Thus,
Since population variances are unknown so for testing the null hypothesis, the
test statistic Z is given by
S12 S22
Z
2S14 2S24
n n
1 2
60 70
2 2
2 60 2 70
120 160
10 10
0.91
60.0 61.25 11.01
53
Testing of Hypothesis E 13) Two sources of raw materials of bulbs are under consideration by a bulb
manufacturing company. Both sources seem to have similar
characteristics but the company is not sure about their respective
uniformity. A sample of 52 lots from source A yields variance 25 and a
sample of 40 lots from source B yields variance of 12. Test whether the
variance of source A significantly differs to the variances of source B at
= 0.05?
We now end this unit by giving a summary of what we have covered in it.
10.9 SUMMARY
In this unit we have covered the following points:
1. How to judge a given situation whether we should go for large sample test
or not.
2. Applying the Z-test for testing the hypothesis about the population mean
and difference of two population means.
3. Applying the Z-test for testing the hypothesis about the population
proportion and difference of two population proportions.
4. Applying the Z-test for testing the hypothesis about the population variance
and two population variances.
p-value = 2P Z z 2P Z 2.42
n 2 200, Y 1280, S2 46
Here, we want to test that there is significant difference in the mean
duration of their lives of two brands of electric bulbs. If 1 and 2
denote the mean lives of two brands of electric bulbs respectively then
our claim is 1 ≠ 2 and its complement is 1 = 2. Since the
complement contains the equality sign so we can take the complement
as the null hypothesis and the claim as the alternative hypothesis. Thus,
H 0 : 1 2 and H1 : 1 2
Since the alternative hypothesis is two-tailed so the test is two-tailed
test.
We want to test the null hypothesis regarding equality of two
population means. The standard deviations of both populations are
unknown so we should go for t-test if population of difference is known
to be normal. But it is not the case. Since sample sizes are large (n1, and
n2 > 30) so we go for Z-test.
So for testing the null hypothesis, the test statistic Z is given by
XY
Z
S12 S22
n1 n 2
1300 1280 20 20
4.59
41
2
46
2
8.41 10.58 4.36
200 200
The critical (tabulated) values for two-tailed test at 1% level of
significance are ± zα/2 = ± z0.005 = ± 2.58.
Since calculated value of test statistic Z (= 4.59) is greater than the
critical values (= ± 2.58), that means it lies in rejection region, so we
reject the null hypothesis an support the alternative hypothesis i.e.
support the claim at 1% level of significance.
Thus, we conclude that samples do not provide us sufficient evidence
against the claim so there is significant difference in the mean duration
of their lives of two brands of electric bulbs.
E7) Given that
n1 50, X 8.3, S1 1.2;
58
Since the alternative hypothesis is right-tailed so the test is right-tailed Large Sample Tests
test.
Before proceeding further, first we have to check whether the condition
of normality meets or not.
np 200 0.9 180 5
7 6
13
13
2.80
2 2 36 0.129 4.64
6
120
The critical values for two-tailed test at 5% level of significance are
± zα/2 = ± z0.025 = ±1.96.
Since calculated value of Z (= 2.8) is greater than critical values
(= ±1.96), that means it lies in rejection region, so we reject the null
hypothesis i.e. we reject our claim at 5% level of significance.
61
Testing of Hypothesis Thus, we conclude that sample provides us sufficient evidence against
the claim so standard deviation of the life of bulbs of the lot is not 6.0
hours.
E13) Here, we are given that
n1 52, S12 25
n2 40, S22 12
Here, we want to test that variance of source A significantly differs to
the variances of source B. If 12 and 22 denote the variances in the raw
materials of sources A and B respectively so our claim is 12 22 and
its complement is 12 22 . Since complement contains the equality
sign so we can take the complement as the null hypothesis and the
claim as the alternative hypothesis. Thus,
H0 : 12 22 and H1 : 12 22
Since the alternative hypothesis is two-tailed so the test is two-tailed
test.
Here, the distributions of populations under study are not known and
sample sizes are large (n1 52 30, n 2 40 30) so we can go for Z-
test.
Since population variances are unknown so for testing the null
hypothesis, the test statistic Z is given by
S12 S22
Z
2S14 2S24
n n
1 2
25 12 13
2.36
2
225 212
2 5.5
52 40
The critical values for two-tailed test at 5% level of significance are
± zα/2 = ± z0.025 = ±1.96.
Since calculated value of Z (= 2.36) is greater than critical values (= ±1.96),
that means it lies in rejection region, so we reject the null hypothesis and
support the alternative hypothesis i.e. we support our claim at 5% level of
significance.
Thus, we conclude that samples fail to provide us sufficient evidence against
the claim so variance of source A significantly differs to the variance of source
B.
62
UNIT 11 SMALL SAMPLE TESTS
Structure
11.1 Introduction
Objectives
11.2 General Procedure of t-Test for Testing a Hypothesis
11.3 Testing of hypothesis for Population Mean Using t-Test
11.4 Testing of Hypothesis for Difference of Two Population Means Using
t-Test
11.5 Paired t-Test
11.6 Testing of Hypothesis for Population Correlation Coefficient Using
t-Test
11.7 Summary
11.8 Solutions /Answers
11.1 INTRODUCTION
In previous unit, we have discussed the testing of hypothesis for large samples
in details. Recall that throughout the unit, we were making an assumption that
“if sample size is sufficiently large then test statistic follows approximately
standard normal distribution”. Also recall two points highlighted in this course,
i.e.
Cost of our study increases as sample size increases.
Sometime nature of the units in the population under study is such that they
destroyed under investigation.
If there are limited recourses in terms of money then first point listed above
force us not to go for large sample size when items /units under study are very
costly such as airplane, computer, etc. Second point listed above give an alarm
for not to go for large sample if population units are destroyed under
investigation.
So, we need an alternative technique which is used to test the hypothesis based
on small sample(s). Small sample tests do this job for us. But in return they
demand one basic assumption that population under study should be normal as
you will see when you go through the unit. t, 2 and F -tests are some
commonly used small sample tests.
In this unit, we will discuss t-test in details which is based on the t-distribution
described in Unit 3 of this course. And 2 and F-tests will be discussed in next
unit which are based on χ2 and F-distributions described in Unit 3 and Unit 4 of
this course respectively.
This unit is divided into eight sections. Section 11.1 is described the need of
small sample tests. The general procedure of t-test for testing a hypothesis is
described in Section 11.2. In Section 11.3, we discuss testing of hypothesis for
population mean using t-test. Testing of hypothesis for difference of two
population means when samples are independent is described in Section 11.4
whereas in Section 11.5, the paired t-test for difference of two population
means when samples are dependent(paired) is discussed. In Section 11.6
testing of hypothesis for population correlation coefficient is explained. Unit
63
Testing of Hypothesis ends by providing summary of what we have discussed in this unit in Section
11.7 and solution of exercises in Section 11.8.
Before moving further a humble suggestion to you that please revise what you
have learned in previous two units. The concepts discussed there will help you
a lot to better understand the concepts discussed in this unit.
Objectives
After studying this unit, you should be able to:
realize the importance of small sample tests;
know the procedure of t-test for testing a hypothesis;
describe testing of hypothesis for population mean for using t-test;
explain the testing of hypothesis for difference of two population means
when samples are independent using t-test;
describe the procedure for paired t-test for testing of hypothesis for
difference of two population means when samples are dependent or paired;
and
explain the testing of hypothesis for population correlation coefficient using
t-test.
Step II: After setting the null and alternative hypotheses our next step is to
decide a criteria for rejection or non-rejection of null hypothesis i.e.
64
decide the level of significance , at which we want to test our null Small Sample Tests
hypothesis. We generally take = 5 % or 1%.
Step III: The third step is to determine an appropriate test statistic, say, t for
testing the null hypothesis. Suppose Tn is the sample statistic (may be
sample mean, sample correlation coefficient, etc. depending upon )
for the parameter then test-statistic t is given by
Tn E(Tn )
t
SE Tn
In this case, the rejection (critical) region falls under the right tail of
the probability curve of the sampling distribution of test statistic t.
Suppose t(ν), is the critical value at level of significance then entire
region greater than or equal to t(ν), is the rejection region and less
than t(ν), is the non-rejection region as shown in Fig. 11.1.
If tcal ≥ t(ν,), that means calculated value of test statistic t lies in the Fig. 11.1
rejection (critical) region, then we reject the null hypothesis H0 at
level of significance. Therefore, we conclude that sample data
provides us sufficient evidence against the null hypothesis and there is
a significant difference between hypothesized value and observed
value of the parameter.
If tcal < t(ν,), that means calculated value of test statistic t lies in non-
rejection region, then we do not reject the null hypothesis H0 at
level of significance. Therefore, we conclude that the sample data
fails to provide us sufficient evidence against the null hypothesis and
the difference between hypothesized value and observed value of the
parameter due to fluctuation of sample.
Testing of Hypothesis Case II: When H 0 : 0 and H 1 : 0 (left-tailed test)
In this case, the rejection (critical) region falls under the left tail of the
probability curve of the sampling distribution of test statistic t.
Suppose - t(ν), is the critical value at level of significance then
entire region less than or equal to - t(ν), is the rejection region and
greater than - t(ν), is the non-rejection region as shown in Fig. 11.2.
If tcal ≤ − t(ν),, that means calculated value of test statistic t lies in the
rejection (critical) region, then we reject the null hypothesis H0 at
level of significance.
If tcal >- t(ν),, that means calculated value of test statistic t lies in the
Fig. 11.2 non-rejection region, then we do not reject the null hypothesis H0 at
level of significance.
In case of two-tailed test:
That is, when H 0 : 0 and H 1 : 0
In this case, the rejection region falls under both tails of the
probability curve of sampling distribution of the test statistic t. Half
the area (α) i.e. α/2 will lies under left tail and other half under the
right tail. Suppose t ( ), / 2 and t ( ), / 2 are the two critical values at
the left- tailed and right-tailed respectively. Therefore, entire region
less than or equal to t ( ), / 2 and greater than or equal to t ( ), / 2 are
Fig. 11.3
the rejection regions and between t ( ), / 2 and t ( ), / 2 is the non-
rejection region as shown in Fig. 11.3.
If tcal ≥ t(ν),/2, or tcal ≤ -t(ν),/2, that means calculated value of test
statistic t lies in the rejection(critical) region, then we reject the null
hypothesis H0 at level of significance.
And if -t(ν),/2 < tcal < t(ν),/2, that means calculated value of test
statistic t lies in the non-rejection region, then we do not reject the
null hypothesis H0 at level of significance.
Procedure of taking the decision about the null hypothesis on the basis of
p-value:
To take the decision about the null hypothesis on the basis of p-value, the p-
value is compared with given level of significance (α). And if p-value is less
than or equal to α then we reject the null hypothesis and if p-value is greater
than α then we do not reject the null hypothesis at α level of significance.
Since the distribution of test statistic t follows t-distribution with ν df and we
also know that t-distribution is symmetrical about t = 0 line therefore, if tcal
represents calculated value of test statistic t then p-value can be defined as:
For one-tailed test:
For H1: θ > θ0 (right-tailed test)
p-value = P[t tcal]
For H1: θ < θ0 (left-tailed test)
p-value = P t t cal
For two-tailed test: For H1: 0
p-value = 2P t t cal Small Sample Tests
These p-values for t-test can be obtained with the help of Table-II (t-table)
given in the Appendix at the end of Block 1 of this course. But this table gives
the t-values corresponding to the standard values of α such as 0.10, 0.05,
0.025, 0.01 and 0.005 only, therefore, the exact p-values are not obtained with
the help of this table and we can approximate the p-value for this test.
For example, if test is right-tailed and calculated (observed) value of test
statistic t is 2.94 with 9 df then p-value is obtained as:
Since calculated value of test statistic t is based on the 9 df therefore, we use
row for 9 df in the t-table and move across this row to find the values in which
calculated t-value falls. Since calculated t-value falls between 2.821 and 3.250,
which are corresponding to the values of one-tailed area α = 0.01 and 0.005
respectively, therefore, p-value will lie between 0.005 and 0.01, that is,
0.005 p-value 0.01
If in the above example, the test is two-tailed then the two values 0.01 and
0.005 would be doubled for p-value, that is,
0.005 2 0.01 p-value 0.02 2 0.01
Note 1: With the help of computer packages and softwares such as SPSS, SAS,
MINITAB, EXCEL, etc. we can find the exact p-values for t-test.
Now, you can try the following exercise.
E1) If test is two-tailed and calculated value of test statistic t is 2.42 with 15
df then find the p-value for t-test.
H 0 : 0 and H1 : 0
or for one-tailed test
H 0 : 0 and H1 : 0
For testing the null hypothesis, the test statistic t is given by
X 0
t ~ t n 1 under H 0
S/ n
1 n 1 n 2
where, X
n i1
X i is the sample mean and S 2
n 1 i1
X i X is the
sample variance.
For computational simplicity, we may use the following formulae for X, S2 :
2
1 1 d
n 1
X a d and S
2 2
d
n n
where, d = (X − a ), ‘a’ being the assumed arbitrary value.
Here, the test statistic t follows t-distribution with (n − 1) degrees of freedom as
we discussed in Unit 3 of this course.
After substituting values of X, S and n , we get calculated value of test
statistic t. Then we look for critical (or cut-off or tabulated) value(s) of test
statistic t from the t-table. On comparing calculated value and critical value(s),
we take the decision about the null hypothesis as discussed in previous section.
Let us do some examples of testing of hypothesis about population mean using
t-test.
Example 1: A manufacturer claims that a special type of projector bulb has an
average life 160 hours. To check this claim an investigator takes a sample of 20
such bulbs, puts on the test, and obtains an average life 167 hours with standard
deviation 16 hours. Assuming that the life time of such bulbs follows normal
distribution, does the investigator accept the manufacturer’s claim at 5% level
of significance?
Solution: Here, we are given that
0 160, n 20, X 167 and S 16
Here, we want to test the manufacturer claims that a special type of projector
bulb has an average life (µ) 160 hours. So claim is µ = 160 and its complement
is µ ≠ 160. Since the claim contains the equality sign so we can take the claim
as the null hypothesis and complement as the alternative hypothesis. Thus,
68
H 0 : 0 160 and H 1 : 160 Small Sample Tests
Since calculated value of test statistic t (= 1.96) is greater than the critical
value (= − 2.093) and is less than critical value (= 2.093), that means calculated
value of test statistic lies in non-rejection region as shown in Fig. 11.4. So we
do not reject the null hypothesis i.e. we support the manufacture’s claim at 5%
level of significance.
Fig. 11.4
Decision according to p-value:
Since calculated value of test statistic t is based on 19 df therefore, we use row
for 19 df in the t-table and move across this row to find the values in which
calculated t-value falls. Since calculated t-value falls between 1.729 and 2.093
corresponding to one-tailed area α = 0.05 and 0.025 respectively therefore p-
value lies between 0.025 and 0.05, that is,
0.025 p-value 0.05
Since test is two-tailed so
2 0.025 0.05 p-value 0.10 0.05 2
Since p-value is greater than ( 0.05) so we do not reject the null
hypothesis at 5% level of significance.
Thus, we conclude that sample fails to provide us sufficient evidence against
the null hypothesis so we may assume that the manufacture’s claim is true so
the investigator may accept the manufacturer’s claim at 5% level of
significance.
Example 2: The mean share price of companies of Pharma sector is Rs.70. The
share prices of all companies were changed time to time. After a month, a
sample of 10 Pharma companies was taken and their share prices were noted as
below:
70, 76, 75, 69, 70, 72, 68, 65, 75, 72
Assuming that the distribution of share prices follows normal distribution, test
whether mean share price is still the same at 1% level of significance?
Testing of Hypothesis Solution: Here, we wish to test that the mean share price (µ) of companies of
Pharma sector is still Rs.70 besides all changes. So our claim is µ = 70 and its
complement is µ ≠ 70. Since the claim contains the equality sign so we can
take the claim as the null hypothesis and complement as the alternative
hypothesis. Thus,
H 0 : 0 70 mean share price of companies is still Rs. 70
H1 : 0 70 mean share price of companies is not still Rs. 70
Since the alternative hypothesis is two-tailed so the test is two-tailed test.
Here, we want to test the hypothesis regarding population mean when
population SD is unknown. Also sample size is small n = 10(n < 30) and
population under study is normal, so we can go for t-test for testing the
hypothesis about population mean.
For testing the null hypothesis, the test statistic t is given by
X µ0
t ~ t ( n 1) … (1)
S/ n
Calculation for X and S:
S. No. Sample value Deviation d2
(X) d = (X-a), a = 70
1 70 0 0
2 76 6 36
3 75 5 25
4 69 -1 1
5 70 0 0
6 72 2 4
7 68 -2 4
8 65 -5 25
9 75 5 25
10 72 2 4
Total 12 124
2
S
1 2
d
d
2
n 1 n
1 12 2 1 144
124 124 12.18
10 1 10 9 10
Since calculated value of test statistic t (= 1.09) is less than the critical value
(= 3.250) and greater than the critical value (= ‒3.250), that means calculated
value of t lies in non-rejection region as shown in Fig. 11.5. So we do not
reject the null hypothesis i.e. we support the claim at 1% level of significance.
Decision according to p-value:
Since calculated value of test statistic t is based on 9 df therefore, we use row
for 9 df in the t-table and move across this row to find the values in which Fig. 11.5
calculated t-value falls. Since all values in this row are greater than calculated
t-value 1.09 and the smallest value is 1.383 corresponding to one-tailed area
α = 0.10 therefore p-value is greater than 0.10, that is,
p-value 0.10
Since test is two-tailed so
p-value 2 0.10 0.20
Since p-value (= 0.20) is greater than ( 0.01) so we do not reject the null
hypothesis at 1% level of significance.
Thus, we conclude that the sample fails to provide us sufficient evidence
against the claim so may assume that the mean share price is still Rs. 70.
Now, you can try the following exercises.
E2) A tyre manufacturer claims that the average life of a particular category
of his tyre is 18000 km when used under normal driving conditions. A
random sample of 16 tyres was tested. The mean and SD of life of the
tyres in the sample were 20000 km and 6000 km respectively.
Assuming that the life of the tyres is normally distributed, test the claim
of the manufacturer at 1% level of significance using appropriate test.
E3) It is known that the average weight of cadets of a centre follows normal
distribution. Weights of 10 randomly selected cadets from the same
centre are as given below:
48, 50, 62, 75, 80, 60, 70, 56, 52, 77
Can we say that average weight of all cadets of the centre from which
the above sample was taken is equal to 60 kg at 5% level of
significance?
and
n n
1 1
2 1 2
2
S12
(n1 1) i 1
X i X , S 2
2
(n 2 1) i1
Yi Y
1 n1 2
n2
2
Xi X Yi Y
2
S
p
n1 n 2 2 i 1 i 1
For computational simplicity, use the following formulae for X, Y and S 2p :
1 1
Xa
n1 d1 , Y b
n2 d 2 and
2 2
1 d1 d 2
n1 n 2 2 1 d2
2 2 2
S
p d
n1 n2
where, d1= (X − a ) and d2 = (Y − b), ‘a’ and ‘b’ are the assumed arbitrary
values.
Now, follow the same procedure as we have discussed in Section 11.2, that is,
first of all we have to setup null and alternative hypotheses. Here, we want to
test the hypothesis about the difference of two population means so we can take
the null hypothesis as
Here, 1 1 and 2 2
H 0 : 1 2 (no difference in means) if we compare it with
general procedure.
H0 : 1 2 and H1 : 1 2
or for one-tailed test
H0 : 1 2 and H1 : 1 2
For testing the null hypothesis, the test statistic t is given by
XY
t ~ t (n 1 n 2 2) under H0
1 1
Sp
n1 n 2
73
Testing of Hypothesis Calculation for X, Y and S p :
Diet A Diet B
X d1 = (X−a) d1 2 Y d2 = (Y−b) d2 2
a =12 b = 16
12 0 0 14 −2 4
8 −4 16 13 −3 9
14 2 4 12 −4 16
16 4 16 15 −1 1
13 1 1 16 0 0
12 0 0 14 −2 4
8 −4 16 18 2 4
14 2 4 17 1 1
10 −2 4 21 5 25
9 −3 9 15 −1 1
Total −4 70 −5 65
1 5
Y b d 2 16 15.5
n2 10
2 2
1 d1 d2
n1 n 2 2 1 d2
2 2 2
S
p d
n1 n2
1
70
42 65 52
10 10 2 10 10
1
68.4 62.5 7.27
18
Sp 7.27 2.70
Since calculated value of test statistic t (= −3.21) is less than critical values
(± 2.101) that means calculated value of test statistic t lies in rejection region,
so we reject the null hypothesis and support the alternative hypothesis i.e.
support our claim at 5% level of significance.
74
Thus, we conclude that samples do not provide us sufficient evidence against Small Sample Tests
the claim so diets A and B differ significantly in terms of gain in weights of
pigs.
Example 4: The means of two random samples of sizes 10 and 8 drawn from
two normal populations are 210.40 and 208.92 respectively. The sum of
squares of the deviations from their means is 26.94 and 24.50 respectively.
Assuming that the populations are normal with equal variances, can samples be
considered to have been drawn from normal populations having equal mean.
Solution: In usual notations, we are given that
n1 10, n 2 8, X 210.40, Y 208.92,
2 2
X X 26.94, Y Y 24.50
Therefore,
1
S 2p X X 2 Y Y 2
n1 n 2 2
1
26.94 24.50 1 51.44 3.215
10 8 2 16
Sp 3.215 1.79
We wish to test that both the samples are drawn from normal populations
having the same means. If 1 and 2 denote the means of both normal
populations respectively then our claim is 1= 2 and its complement is
1 ≠ 2. Since the claim contains the equality sign so we can take the claim as
the null hypothesis and complement as the alternative hypothesis. Thus,
H 0 : 1 2 [mean of both populations is equal]
H1 : 1 2 [mean of both populations is not equal]
Since the alternative hypothesis is two-tailed so the test is two-tailed test.
Since it is given that two populations are normal with equal and unknown
variances and other assumptions of t-test for testing a hypothesis about
difference of two population means also meet. So we can go for this test.
For testing the null hypothesis, the test statistic t is given by
XY
t ~ t (n 1 n 2 2) under H0
1 1
Sp
n1 n 2
210.40 208.92 1.48 1.48
1.76
1 1 1.79 0.47 0.84
1.79
10 8
The critical values of test statistic t for two-tailed test corresponding
(n1 + n2 -2) = 16 df at 5% level of significance are
t ( n1 n 2 2 ), α / 2 t (16 ), 0.025 2.12.
Since calculated value of test statistic t (= 1.76) is less than the critical value
(= 2.12) and greater than the critical value (= ‒2.12), that means calculated
value of test statistic t lies in non-rejection region so we do not reject the null
hypothesis i.e. we support the claim.
75
Testing of Hypothesis Thus, we conclude that samples fail to provide us sufficient evidence against
the claim so we may assume that both samples are taken from normal
populations having equal means.
Now, you can try the following exercises.
E4) Two different types of drugs A and B were tried on some patients for
increasing their weights. Six persons were given drug A and other 7
persons were given drug B. The gain in weights (in ponds) is given
below:
Drug A 5 8 7 10 9 6 −
Drug B 9 10 15 12 14 8 12
76
Assumptions Small Sample Tests
77
Testing of Hypothesis Assuming that the memories of the children before and after the practice
session follow normal distributions, is the memory practice session improve the
performance of children?
Solution: Here, we want to test that memory practice session improve the
performance of children. If 1 and 2 denote the mean digit repetition before
and after the practice so our claim is 1 < 2 and its complement is 1 ≥ 2.
Since complement contains the equality sign so we can take the complement as
the null hypothesis and the claim as the alternative hypothesis. Thus,
H 0 : 1 2 and H1 : 1 2
1 14 2 1
32 15.67 1.42
11 12 11
SD 1.42 1.19
78
Substituting these values in equation (3), we have Small Sample Tests
1.17 1.17
t 3.44
1.19 / 12 0.34
The critical value of test statistic t for left-tailed test corresponding (n-1) = 11
df at 5% level of significance is t ( n 1), α t (11), 0.05 1.796.
Since calculated value of test statistic t (= −3.44) is less than the critical value
(=−1.796), that means calculated value of t lies in rejection region, so we reject
the null hypothesis and support the alternative hypothesis i.e. support the claim
at 5% level of significance.
Thus, we conclude that samples fail to provide us sufficient evidence against
the claim so we may assume that memory practice session improves the
performance of children.
Example 6: Ten students were given a test in Statistics and after one month’s
coaching they were again given a test of the similar nature and the increase in
their marks in the second test over the first are shown below:
Roll No. 1 2 3 4 5 6 7 8 9 10
Increase in Marks 6 −2 8 −4 10 2 5 −4 6 0
1 27 2 1
301 228.1 25.34
10 1 10 9
SD 25.34 5.03
Substituting these values in equation (4), we have
2.7 2.7
t 1.70
5.03/ 10 1.59
The critical value of test statistic t for right-tailed test corresponding (n-1) = 9
df at 1% level of significance is t (n 1), α t (9), 0.01 2.821.
Since calculated value of test statistic t (= 1.70) is less than the critical value
(= 2.821), that means calculated value of test statistic t lies in non-rejection
region, so we do not reject null hypothesis and reject the alternative hypothesis
i.e. we reject our claim at 1% level of significance.
Thus, we conclude that sample provides us sufficient evidence against the
claim so students are not gained knowledge from the coaching.
Now, you can try the following exercises.
E6) To verify whether the programme “Post Graduate Diploma in Applied
Statistics (PGDAST)” improved performance of the graduate students
in Statistics, a similar test was given to10 participants both before and
after the programme. The original marks out of 100 (before course)
recorded in an alphabetical order of the participants are 42, 46, 50, 36,
44, 60, 62, 43, 70 and 53. After the course the marks in the same order
are 45, 46, 60, 42, 60, 72, 63, 43, 80 and 65. Assuming that marks of
the students before and after the course follow normal distribution. Test
whether the programme PGDAST has improved the performance of the
graduate students in Statistics at 5% level of significance?
E7) A drug is given to 8 patients and the increments in their blood pressure
are recorded to be 4, 0, 7, −2, 0, −3, 2, 0. Assume that increment in their
blood pressure follows normal distribution. Is it reasonable to believe
that the drug has no effect on the change of blood pressure at 5% level
of significance?
80
between −1 and +1, where −1 representing a perfect negative correlation, 0 Small Sample Tests
representing no correlation, and +1 representing a perfect positive correlation.
Sometime, the sample data indicate for non-zero correlation but in population
they are uncorrelated ( = 0).
For example, price of tomato in Delhi (X) and in London (Y) are not correlated
in population ( = 0). But paired sample data of 20 days of prices of tomato at
both places may show correlation coefficient (r) ≠ 0. In general, in sample data
r ≠ 0 does not ensure in population ≠ 0 holds.
In this section, we will know how we test the hypothesis that population
correlation coefficient is zero.
Assumptions
This test works under following assumptions:
(i) The characteristic under study follows normal distribution in both the
populations. In other words, both populations from which random
samples are drawn should be normal with respect to the characteristic of
interest.
(ii) Samples observations are random.
Let us consider a random sample (X1, Y1), (X2, Y2), …, (Xn, Yn) of size n taken
from a bivariate normal population. Let and r be the correlation coefficients
of population and sample data respectively.
Here, we wish to test the hypothesis about population correlation coefficient
(), that is, linear correlation between two variables X and Y in the population,
so we can take the null hypothesis as
H0 : 0 and H1 : 0 for two-tailed test Here, and 0 0 if
we compare it with general
procedure given inSection11.2.
H 0 : 0 and H1 : 0
or for one-tailed test
H 0 : 0 and H1 : 0
For testing the null hypothesis, the test statistic t is given by
r n2
t ~ t n 2
1 r2
which follows t-distribution with n – 2 degrees of freedom.
After substituting values of r and n, we find out calculated value of test statistic
t. Then we look for critical (or cut-off or tabulated) value(s) of test statistic t
from the t-table. On comparing calculated value and critical value(s), we take
the decision about the null hypothesis as discussed in Section 11.2.
Let us do some examples of testing of hypothesis that population correlation
coefficient is zero.
Example 7: A random sample of 18 pairs of observations from a normal
population gave a correlation coefficient of 0.7. Test whether the population
correlation coefficient is zero at 5% level of significance.
Solution: Given that
n = 18, r = 0.7
Here, we wish to test that population correlation coefficient () is zero so our
claim is = 0 and its complement is ≠ 0. Since the claim contains the
81
Testing of Hypothesis equality sign so we can take the claim as the null hypothesis and complement
as the alternative hypothesis. Thus,
H 0 : 0 and H1 : 0
The critical value of test statistic t for two-tailed test corresponding (n-2) = 16
df at 5% level of significance are t ( n 2 ), α / 2 t (16 ), 0.025 2.120.
Since calculated value of test statistic t (= 3.94) is greater than the critical
values (= ± 2.120), that means calculated value of test statistic t lies in rejection
region, so we reject the null hypothesis. i.e. we reject the claim at 5% level of
significance.
Thus, we conclude that sample provides us sufficient evidence against the
claim so there exists a relationship between two variables.
Example 8: A random sample of 15 married couples was taken from a
population consisting of married couples between the ages of 30 and 40. The
correlation coefficient between the IQs of husbands and wives was found to be
0.68. Assuming that the IQs of husbands and wives follow normal distributions
then test that IQs of husbands and wives in the population are positively
correlated at1% level of significance.
Solution: Given that
n = 15, r = 0.68
Here, we wish to test that IQs of husbands and wives in the population are
positively correlated. If denote the correlation coefficient between IQs of
husbands and wives in the population then the claim is > 0 and its
complement is ≤ 0. Since complement contains the equality sign so we can
take the complement as the null hypothesis and the claim as the alternative
hypothesis. Thus,
H 0 : 0 and H1 : 0
Since the alternative hypothesis is right-tailed so the test is right-tailed test.
Here, we want to test the hypothesis regarding population correlation
coefficient is zero and the populations under study follow normal distributions,
so we can go for t-test.
For testing the null hypothesis, the test statistic t is given by
r n2
t
1 r2
82
0.68 15 2 0.68 3.61 Small Sample Tests
3.36
1 0.68
2 0.73
The critical value of test statistic t for right-tailed test corresponding (n-2) = 13
df at 1% level of significance is t ( n 2 ), α t (13 ), 0.01 2.650.
Since calculated value of test statistic t (= 3.36) is greater than the critical value
(= 2.650), that means calculated value of test statistic t lies in rejection region,
so we reject the null hypothesis and support the alternative hypothesis i.e. we
support our claim at 1% level of significance.
Thus, we conclude that sample fail to provide us sufficient evidence against the
claim so we may assume that the correlation between IQs of husbands and
wives in the population is positive.
In the same way, you can try the following exercise.
E8) Twenty families were selected randomly from a colony to determine
that correlation exists between family income and the amount of money
spent per family member on food each month. The sample correlation
coefficient was computed as r = 0.40. Assuming that the family income
and the amount of money spent per family member on food each month
follow normal distributions then test that there is a positive linear
relationship between the family income and the amounts of money
spent per family member on food each month in colony at 1% level of
significance.
We now end this unit by giving a summary of what we have covered in it.
11.7 SUMMARY
In this unit, we have discussed the following points:
1. Need of small sample tests.
2. Procedure of testing a hypothesis for t-test.
3. Testing of hypothesis for population mean using t-test.
4. Testing of hypothesis for difference of two population means when samples
are independent using t-test.
5. The procedure of paired t-test for testing of hypothesis for difference of two
population means when samples are dependent or paired.
6. Testing of hypothesis for population correlation coefficient using t-test.
84
56 −7 49 Small Sample Tests
52 −11 121
77 14 196
X 630 X X
2
1252
1
1252 139.11
10 1
S 139.11 11.79
Putting the values in equation (5), we have
63 60
t
11.79 / 10
3
0.80
3.73
The critical values of test statistic t for two-tailed test corresponding
(n-1) = 9 df at 5% level of significance are ± t(9) ,0.025 = ± 2.262.
Since calculated value of test statistic t (= 0.80) is less than the critical
value (= 2.262) and greater than the critical value (= −2.262), that
means calculated value of test statistic t lies in non-rejection region so
we do not reject H0 i.e. we support the claim at 5% level of
significance.
Thus, we conclude that sample fails to provide sufficient evidence
against the claim so we may assume that the average weight of all the
cadets of given centre is 60 kg.
E4) Here, we want to test that there is no difference between drugs A and B
with regard to their mean weight increment. If 1 and 2 denote the
mean weight increment due to drug A and drug B respectively then our
claim is µ1 = 2 and its complement is µ1 ≠ 2. Since the claim contains
the equality sign so we can take the claim as the null hypothesis and
complement as the alternative hypothesis. Thus,
H 0 : 1 2 effect of both drugs is same
H 1 : 1 2 effect of both drugs is not same
Since the alternative hypothesis is two-tailed so the test is two-tailed
test.
Since it is given that increments in the weight due to both drugs follow
normal distributions with equal and unknown variances and other
assumptions of t-test for testing a hypothesis about difference of two
population means also meet. So we can go for this test.
For testing the null hypothesis, the test statistic t is given by
85
Testing of Hypothesis XY
t ~ t ( n 1 n 22) under H0 … (6)
1 1
Sp
n1 n 2
X d1 = (X-a) d1 2 Y d2 = (Y-b) d2 2
a=8 b = 12
5 ‒3 9 9 ‒3 9
8 0 0 10 ‒2 4
7 ‒1 1 15 3 9
10 2 4 12 0 0
9 1 1 14 2 4
6 ‒2 4 8 ‒4 16
12 0 0
d 1 3 d 2
1 19 d 2 4 d22 42
From above calculation, we have
1 1
n1 1
Xa d 8 3 7.5,
6
1 1
Y b d 2 12 4 11.43
n2 7
2 2
1 d1 d2
n1 n 2 2 1 d2
2 2 2
S
p d
n1 n2
2 2
1 3 4
19 42
6 7 2 6 7
1
17.5 39.71 5.20
11
Sp 5.20 2.28
Putting these values in equation (6), we have
7.5 11.43 3.93 3.93
t 3.07
1 1 2.28 0.56 1.28
2.28
6 7
The critical values of test statistic t for two-tailed test corresponding
(n1 + n2 -2) = 11 df at 5% level of significance are ± t(11) ,0.025 = ± 2.201.
Since calculated value of test statistic t (= −3.07) is less than the critical
values (= ± 2.201) that means calculated value of test statistic t lies in
rejection region, so we reject the null hypothesis i.e. we reject the claim
at 5% level of significance.
86
Thus, we conclude that samples provide us sufficient evidence against Small Sample Tests
the claim so drugs A and B differ significantly. Any one of them is
better than other.
E5) Here, we are given that
n1 13, X 4.6, S1 0.5,
87
Testing of Hypothesis Thus, we conclude that samples fail to provide us sufficient evidence
against the claim so we may assume that there is significant
improvement in wheat production due to fertilizer.
E6) Here, we want to test whether the programme PGDAST has improved
the performance of the graduate students in Statistics. If 1 and 2
denote the average marks before and after the programmed so our claim
is 1 < 2 and its complement is 1 ≥ 2. Since complement contains the
equality sign so we can take the complement as the null hypothesis and
the claim as the alternative hypothesis. Thus,
H0 : 1 2 and H1 : 1 2
Since the alternative hypothesis is left-tailed so the test is left-tailed
test.
It is a situation of before and after. Also, the marks of the students
before and after the programme PGDAST follow normal distributions.
So, population of differences will also be normal. Also all the
assumptions of paired t-test meet. So we can go for paired t-test.
For testing the null hypothesis, the test statistic t is given by
D
t ~ t n 1 under H 0 … (7)
SD / n
Calculation for D and S D :
1 42 45 −3 9
2 46 46 0 0
3 50 60 −10 100
4 36 42 −6 36
5 44 60 −16 256
6 60 72 −12 144
7 62 63 −1 1
8 43 43 0 0
9 70 80 −10 100
10 53 65 −12 144
2
D 70 D 790
1
790
702 1
300 33 . 33
9 10 9
SD 33.33 5.77
88
Putting the values in equation (7), we have Small Sample Tests
7.0
t 3.83
5.77 / 10
The critical value of test statistic t for left-tailed test corresponding
(n-1) = 9 df at 5% level of significance is − t(9), 0.05 = −1.833.
Since calculated value of test statistic t (= −3.83) is less than the critical
(tabulated) value (= −1.833), that means calculated value of test statistic
t lies in rejection region, so we reject the null hypothesis and support
the alternative hypothesis i.e. we support our claim at 5% level of
significance.
Thus, we conclude that samples fail to provide us sufficient evidence
against the claim so we may assume that the participants have
significant improvement after the programme “Post Graduate Diploma
in Applied Statistics (PGDAST)”.
E7) Here, we want to test that the drug has no effect on change in blood
pressure. If D denotes the average increment in the blood pressure
before drug then our claim is D = 0 and its complement is D ≠ 0.
Since the claim contains the equality sign so we can take the claim as
the null hypothesis and complement as the alternative hypothesis. Thus,
H 0 : D 1 2 0 the drug has no effect
89
Testing of Hypothesis 1 1
t 0.87
3.25 / 8 1.15
The critical value of test statistic t for two-tailed test corresponding
(n-1) = 7 df at 5% level of significance are ± t(7), 0.025 = ± 2.365.
Since calculated value of test statistic t (= 0.87) is less than the critical
value (= 2.365) and greater than the critical value (=−2.365) that means
calculated value of test statistic t lies in non-rejection region, so we do
not reject the null hypothesis i.e. we support the claim at 5% level of
significance.
Thus, we conclude that samples fail to provide us sufficient evidence
against the claim so we may assume that the drug has no effect on the
change of blood pressure of patients.
E8) We are given that
n = 20, r = 0.40
and we wish to test that there is a positive linear relationship between
the family income and the amounts of money spent per family member
on food each month in colony. If denote the correlation coefficient
between the family income and the amounts of money spent per family
member then the claim is > 0 and its complement is ≤ 0. Since
complement contains the equality sign so we can take the complement
as the null hypothesis and the claim as the alternative hypothesis. Thus,
H0 : 0 and H0 : 0
Since the alternative hypothesis is right-tailed so the test is right-tailed
test.
For testing the null hypothesis, the test statistic t is given by
r n2
t
1 r2
0.40 20 2 0.40 4.24
1.84
1 0.40
2 0.92
Since calculated value of test statistic t (= 1.84) is less than the critical
value (= 2.552), that means calculated value of test statistic t lies in non-
rejection region, so we do not reject the null hypothesis and reject the
alternative hypothesis i.e. we reject our claim at 1% level of significance.
Thus, we conclude that sample provide us sufficient evidence against the
claim so there is no positive linear correlation between the family income
and the amounts of money spent per family member on food each month
in colony.
90
UNIT 12 CHI-SQUARE AND F-TESTS
Structure
12.1 Introduction
Objectives
12.2 Testing of Hypothesis for Population Variance Using χ2-Test
12.3 Testing of Hypothesis for Two Population Variances Using F-Test
12.4 Summary
12.5 Solutions / Answers
12.1 INTRODUCTION
Recall from the previous unit when we test the hypothesis about the difference
of means of two populations, t-test needs an assumption of equality of
variances of two populations under study. Other than this there are situations
where we want to test the hypothesis about the variances of two populations.
For example, an economist may want to test whether the variability in incomes
differ in two population. In such situations, we use F-test when the populations
under study follow the normal distributions.
Similarly, there are many other situations where we need to test the hypothesis
about the hypothetical or specified value of the variance of the population
under study. For example, the manager of the electric bulbs company would
probably be interested whether or not the variability in the life of bulbs is
within acceptable limits, the product controller of a milk company may be
interested in the variance of the amount of fat in the whole milk processed by
the company is no more than the specified level. In such situations, we use
χ2-test when the population under study follows the normal distribution.
This unit is divided into five sections. Section 12.1 is described the need of χ2
and F-tests. χ2-test for testing the hypothesis about the population variance is
discussed in Section 12.2. And F-test for equality of variances of two
populations is discussed in Section 12.3. Unit ends by providing summary of
what we have discussed in this unit in Section 12.4 and solution of exercises in
Section 12.5.
Objectives
After studying this unit, you should be able to:
χ 2
X i X
(n 1)S 2
~ χ 2 n 1 under H 0 ... (1)
σ 02 σ 02
1 2
where, S2
n 1
X X
Here, the test statistic χ2 follows chi square distribution with (n − 1) degrees of
freedom as we have discussed in Unit 3 of this course.
After substituting values of n, S and 02 , we get calculated value of test
statistic. Let 2cal be the calculated value of test statistic χ2.
Obtain the critical value(s) or cut-off value(s) in the sampling distribution of
the test statistic χ2 and construct rejection (critical) region of size . The critical
value of the test statistic χ2 for various df and different level of significance
are given in Table III of the Appendix given at the end of the Block 1of this
course.
After doing all the calculation discussed above, we have to take the decision
about rejection or non-rejection of the null hypothesis. The procedure of taking
the decision about the null hypothesis is explained is the next page:
92
For one-tailed test: Chi-square and F-Tests
χ 2
X i X
~ χ 2 n 1 under H 0 ... (2)
2
σ 0
2
Calculation for Xi X :
X X X X X
2
96
From above calculation, we have Chi-square and F-Tests
1 1
n
X X 18 1.5
12
2
Putting the values of X X and σ 02 in equation (2), we have
2
χ 2
X i X0.16
10
2
σ 0 0.016
The critical value of test statistic χ2 for left-tailed test corresponding
(n-1) = 11 df at 5% level of significance is (2n 1),(1 ) 2(11),0.95 = 4.57.
Since calculated value of test statistic (= 10) is greater than the critical value
(= 4.57), that means calculated value of test statistic lies in non-rejection region
as shown in Fig. 12.5 so we do not reject the null hypothesis and reject the
alternative hypothesis i.e. we reject the claim at 5% level of significance.
Thus, we conclude that sample provide sufficient evidence against the claim so
the variance in the measurement of the instrument is not less than 0.016.
In the same way, you can try the following exercises. Fig. 12.5
E1) An ambulance agency claims that the standard deviation in the length of
serving times is less than 15 minutes. Investigator suspects that this
claim is wrong and takes a random sample of 20 serving times which
has a standard deviation of 17 minutes. Assume that the service time of
the ambulance follows normal distribution. Test at = 0.01, is there
enough evidence to reject the agency’s claim?
E2) A cigarette manufacturer claims that the variance of nicotine content of
its cigarettes is 0.62. Nicotine content is measured in milligrams and is
normally distributed. A sample of 25 cigarettes has a variance of 0.65.
Test the manufacturer’s claim at 5% level of significance.
Since calculated F-value (= 2.65) falls between 2.35 and 2.79, corresponding
to one-tailed area α = 0.05 and 0.025 respectively therefore p-value lies
between 0.0025 and 0.05, that is,
0.025 p-value 0.05
If in the above example the test is two-tailed then the two values 0.025 and
0.05 would be doubled for p-value, that is,
0.05 p-value 0.10
Note 3: With the help of computer packages and software such as SPSS, SAS,
MINITAB, EXCEL, etc. we can find the exact p-value for F-test.
Let us do some examples based on this test.
100
Example 3: The following data relate to the number of items produced in a Chi-square and F-Tests
shift by two workers A and B for some days:
A 26 37 40 35 30 30 40 26 30 35 45
B 19 22 24 27 24 18 20 19 25
Assuming that the parent populations are normal, can it be inferred that B is
more stable (or consistent) worker compared to A?
Solution: Here, we want to test that worker B is more stable than worker A.
As we know that stability of data is related to variance of the data. Smaller
value of the variance implies data that it is more stable. Therefore, to compare
stability of two workers, it is enough to compare their variances. If 12 and 22
denote the variances of worker A and worker B respectively then our claim is
12 22 and its complement is 12 22 . Since complement contains the
equality sign so we can take the complement as the null hypothesis and the
claim as the alternative hypothesis. Thus,
H0 : 12 22
1 n1 2 1 n2 2
2
where, S1
n1 1 i1
Xi X and S2
2
n2 1 i1
Yi Y
Produced by
X X Produced by
Y Y
A X 34 B Y 22
( Variable X) (Variable Y)
26 −8 64 19 −3 9
37 3 9 22 0 0
40 6 36 24 2 4
35 1 1 27 5 25
30 −4 16 24 2 4
30 −4 16 18 −4 16
40 6 36 20 −2 4
26 −8 64 19 −3 9
30 −4 16 25 3 9
35 1 1
45 11 121
Total = 374 0 380 198 0 80
101
Testing of Hypothesis
Therefore, we have
1 1
X X 374 34
n1 11
and
1 1
Y Y 198 22
n2 9
Thus,
1 2 1
S12
n1 1
X X 380 38
10
1 2 1
S22
n2 1
Y Y 80 10
8
Putting the value of S12 and S 22 in equation (4), we have
38
F 3.8
10
The critical (tabulated) value of test statistic F for right-tailed test
corresponding (n1-1, n2 -1) = (10, 8) df at 1% level of significance is
F( n 1, n 1), F(10,8), 0.01 = 5.81.
1 2
Since calculated value of test statistic (= 3.8) is less than the critical value
Fig. 12.9 (= 5.81), that means calculated value of test statistic lies in rejection region as
shown in Fig. 12.9, so we do not reject the null hypothesis and reject the
alternative hypothesis i.e. we reject the claim at 1% level of significance.
Thus, we conclude that samples provide us sufficient evidence against the
claim so worker B is not more stable (or consistent) worker compared to A.
Example 4: Two random samples drawn from two normal populations gave
the following results:
Test whether both samples are from the same normal populations?
Solution: Since we have to test whether both the samples are from same
normal population, therefore, we will test two hypotheses separately:
(i) Two population means are equal, i.e. H 0 : µ1 µ 2
1 2 1
S22
n2 1
Y Y
11 1
36 3.60
First we want to test that the variances of both normal populations are equal so
our claim is 12 22 and its complement is 12 22 . Thus, we can take the null
and alternative hypotheses as
H0 : 12 22 and H1 : 12 22
Since the alternative hypothesis is two-tailed so the test is two-tailed test.
For testing this, the test statistic F is given by
S12
F ~ F n1 1,n 2 1
S22
3.20
0.88
3.65
The critical (tabulated) value of test statistic F for two-tailed test corresponding
(n1-1, n2 -1) = (8, 11) df at 5% level of significance are F( n1 1, n 2 1), / 2
1 1 1
F(8,11),0.025 3.66 and F( n 1, n 1),(1 / 2) 0.24.
1 2
F(n2 1,n1 1), / 2 F(11,8),0.025 4.24
Since calculated value of test statistic (= 0.88) is less than the critical value
(= 3.66) and greater than the critical value (= 0.24), that means calculated value
of test statistic F lies in non-rejection region so we do not reject the null
hypothesis i.e. we support the claim.
Thus, we conclude that both samples may be taken from normal populations
having equal variances.
Now, we test that the means of two normal populations are equal so our claim
is 1 2 and its complement is 1 2. Thus, we can take the null and
alternative hypotheses as
H0 : 1 2 and H1 : 1 2
The test statistic is given by
XY
t … (5)
1 1
Sp
n1 n 2
1 X X 2 Y Y 2
where, S2p
n1 n 2 2
1
26 32 1 58 3.22
9 11 2 18
103
Testing of Hypothesis
Sp 3.22 1.79
59 60 1 1
t 1.23
1 1 1.79 0.45 0.81
1.79
9 11
The critical values of test statistic t for (n1 + n2 -2) = 18 df at 5% level of
significance for two-tailed test are ± t(18) ,0.025 = ± 2.101.
Since calculated value of test statistic t (= −1.23) is less than the critical value
(= 2.101) and greater than the critical value (= ‒2.101), that means calculated
value of test statistic t lies in non-rejection region so we do not reject the null
hypothesis i.e. we support the claim.
Thus, we conclude that both samples may be taken from normal populations
having equal means.
Hence, overall we conclude that both samples may come from the same normal
populations.
Now, you can try the following exercises.
E3) Two sources of raw materials are under consideration by a bulb
manufacturing company. Both sources seem to have similar
characteristics but the company is not sure about their respective
uniformity. A sample of 12 lots from source A yields a variance of 125
and a sample of 10 lots from source B yields a variance of 112. Is it
likely that the variance of source A significantly differs to the variance
of source B at significance level = 0.01?
E4) A laptop computer maker uses battery packs of two brands, A and B.
While both brands have the same average battery life between charges
(LBC), the computer maker seems to receive more complaints about
shorter LBC than expected for battery packs of brand A. The computer
maker suspects that this could be caused by higher variance in LBC for
brand A. To check that, ten new battery packs from each brand are
selected, installed on the same models of laptops, and the laptops are
allowed to run until the battery packs are completely discharged. The
following are the observed LBCs in hours:
Brand A 3.2 3.7 3.1 3.3 2.5 2.2 3.2 3.1 3.2 4.3
Brand B 3.4 3.6 3.0 3.2 3.2 3.2 3.0 3.1 3.2 3.2
12.4 SUMMARY
In this unit, we have discussed the following points:
1. Testing of hypothesis for population variance using χ2-test.
2. Testing of hypothesis for two population variances using F-test.
104
Chi-square and F-Tests
12.5 SOLUTIONS / ANSWERS
E1) Here, we are given that
σ0 15, n 20, S 17
Here, we want to test the agency’s claim that the standard deviation ()
of the length of serving times is less than 15 minutes. So our claim is
< 15 and its complement is ≥ 15. Since complement contains the
equality sign so we can take complement as null hypothesis and claim
as the alternative hypothesis. Thus,
H 0 : 0 15
105
Testing of Hypothesis Here, we want to test the hypothesis about the population variance and
sample size is small n = 25(< 30). Also we are given that the nicotine
content of its cigarettes follows normal distribution so we can go for χ2
test for population variance.
The test statistic is given by
n 1 S2
χ2 ~ χ 2 n 1
σ2
24 0.65
25.16
0.62
The critical (tabulated) values of test statistic χ2 for two-tailed test
corresponding (n-1) = 24 df at 5% level of significance are
2( n 1), / 2 (224),0.025 39.36 and 2( n 1), (1 / 2) (224), 0.975 =12.40.
Since calculated value of test statistic (= 25.16) is less than the critical
value (= 39.36) and greater than the critical value (= 12.40), that means
calculated value of test statistic lies in non-rejection region, so we do
not reject the null hypothesis i.e. we support the claim at 5% level of
significance.
Thus, we conclude that sample fails to provide sufficient evidence
against the claim so we may assume that manufacturer’s claim that the
variance of the nicotine content of the cigarettes is 0.62 milligram is
true.
E3) Here, we are given that
n1 12, S12 125, n 2 10, S22 112
Here, we want to test that variance of source A significantly differs to
the variances of source B. If 12 and 22 denote the variances in the raw
materials of sources A and B respectively so our claim is 12 22 and
its complement is 12 22 . Since complement contains the equality sign
so we can take the complement as the null hypothesis and the claim as
the alternative hypothesis. Thus,
H0 : 12 22 and H1 : 12 22
Since the alternative hypothesis is two-tailed so the test is two-tailed
test.
Here, we want to test the hypothesis about two population variances
and sample sizes n1 = 12(< 30) and n2 = 10 (< 30) are small. Also
populations under study are normal and both samples are independent
so we can go for F-test for two population variances.
For testing this, the test statistic is given by
S12
F ~ F n1 1,n 2 1
S22
125
1.11
112
The critical (tabulated) value of test statistic F for two-tailed test
corresponding (n1-1, n2 -1) = (11, 9) df at 5% level of significance are
F( n1 1, n 2 1), / 2 F(11,9),0.025 3.91 and
106
1 1 1 Chi-square and F-Tests
F( n1 1, n 2 1),(1 / 2) 0.28.
F(n2 1,n1 1), / 2 F(9, 11),0.025 3.59
Since calculated value of test statistic (= 1.11) is less than the critical
value (= 3.91) and greater than the critical value (= 0.28), that means
calculated value of test statistic lies in non-rejection region, so we do
not reject the null hypothesis and reject the alternative hypothesis i.e.
we reject the claim at 5% level of significance.
Thus, we conclude that samples provide us sufficient evidence against
the claim so we may assume that the variances of source A and B is
differ.
E4) Here, we want to test that the LBCs of brand A have a larger variance
than those of brand B. If 12 and 22 denote the variances in the LBCs
of brands A and B respectively so our claim is 12 22 and its
complement is 12 22 . Since complement contains the equality sign
so we can take the complement as the null hypothesis and the claim as
the alternative hypothesis. Thus,
H0 : 12 22 and H1 : 12 22
Since the alternative hypothesis is right-tailed so the test is right-tailed
test.
Here, we want to test the hypothesis about two population variances
and sample sizes n1 = 10(< 30) and n2 = 10 (< 30) are small. Also
populations under study are normal and both samples are independent
so we can go for F-test for two population variances.
For testing the null hypothesis, test statistic is given by
S12
F ~ F n1 1, n 2 1 … (6)
S22
2
1 2 1 X
where, S
n1 1
2
1 X X n 1 X n and
2
1 1
2
1 2 1 Y .
S22
n2 1
Y Y n 1
Y 2
n2
2
2 2
Calculation for S1 and S2 :
LBCs of Brand A X2 LBCs of Brand A Y2
(X) (Y)
3.7 13.69 3.6 12.96
3.2 10.24 3.2 10.24
3.3 10.89 3.2 10.24
3.1 9.61 3.0 9.00
2.5 6.25 3.0 9.00
2.2 4.84 3.2 10.24
3.1 9.61 3.2 10.24
3.2 10.24 3.1 9.61
4.3 18.49 3.2 10.24
3.2 10.24 3.1 9.61
Total = 3.18 104.1 31.8 101.38
107
Testing of Hypothesis
From the calculation, we have
1 1
X
n1 X 31.8 3.18,
10
1 1
Y
n2 Y 31.8 3.18
10
Thus,
2
1 X 1 31.8 2
n1 1
2 2
S X 104.10
1
n1 9 10
1
2.98 0.33
9
2
1 Y 1 101.38 31.8 2
n2 1
S22 Y 2
n 2 9 10
1
0.26 0.03
9
Putting the values of S12 and S22 in equation (6), we have
0.33
F 1.1
0.03
The critical (tabulated) value of test statistic F for right-tailed test
corresponding (n1-1, n2 -1) = (9, 9) df at 1% level of significance is
F( n1 1, n2 1), F(9,9),0.01 = 5.35.
Since calculated value of test statistic (= 1.1) is less than the critical
value (= 5.35), that means calculated value of test statistic lies in non-
rejection region so we do not reject the null hypothesis and reject the
alternative hypothesis i.e. we reject the claim at 1% level of
significance.
Thus, we conclude that samples provide us sufficient evidence against
the claim so variance in LBCs of brand A is not greater than variance in
LBCs of brand B.
108