Statistics
Statistics
Do not look at the answer and try to work backwards. This would
defeat the purpose of doing the problem. Remember the purpose
of doing an assignment problem is not simply to get the answer
(it is only evidence that you solved it correctly) but to develop
your ability to think. Try to introduce twists and turns in given
problem to create similar problems.
CONTENT
INTRODUCTION …1
Variable …1
Frequency Distribution …1
Exercise 1 …4
Measure of Location …4
Types of Averages …4
Exercise 2 …7
Weighted Means …8
Exercise 3 …9
Median …9
Mode …11
Exercise 4 …13
Measure of Dispersion …13
Means Deviation …14
Standard Deviation …15
Exercise 5 …19
Answers to Exercises …21
Solved Problems …22
Subjective …22
Chapter Practice Problems …25
Subjective …25
Objective …25
Assignments …27
Section-I …27
Section-II …29
A variable (or variate) which is not capable of assuming all values in a given range is called a
discrete variable.
A variable which is capable of assuming all the numerical values in a given range is called a
continuous variable.
Example
S.N. Individual (item) Characteristic Type of characteristic
1 A student Height in cms Continuous variable
Weight in kgs Continuous variable
Colour of skin Attribute (qualitative)
Age Continuous variable
Sex Attribute (qualitative)
Mother tongue Attribute (qualitative)
Marks in English Discrete variable
2 A bolt Diameter in cms Continuous variable
Defective or not Attribute (qualitative)
3 A family Number of members Discrete variable
Monthly income in rupees Discrete variable
Frequency Distribution:
Let the data regarding the weights (in kgs) of 20 students of a class be given as
50 48 54 49 60 54 61 55 48 49
55 60 50 48 57 62 49 50 52 54
This is called the raw data. This is also called an individual series. We note that some of the weights
(values of the quantitative variable) are repeated. If there are 3 students having weight 50 kg, then we say
IITJEE-2223-MATHEMATICS-STATISTICS
2
the frequency of 50 is 3. Therefore, the number of times the value of the item is repeated is called the
frequency of that value. The table containing the weights and the corresponding frequencies is given as
Tally bars are used to count the number of times the values of the variable has occurred. Also denote
that the value is repeated 5 times. The table containing the values and its frequencies is called a
frequency distribution. The variable is denoted by x and the frequency by f. In the order of magnitude, the
frequency distribution is written as follows;
We denote the total number of students, that is the total frequency by n i.e. n = f. Also we denote
different values of the variables x as xi and different frequencies by f i.
Let the data be classified according to different classes of values of the variable. This is an important tool
in condensing a large data. In the above example, the classes may be defined as;
45 and under 50, 50 and under 55, 55 and under 60 etc.
We denote these classes by 45 49, 50 54, 55 59 etc. Usually the length of the class is taken as
same. With the length of the class as 5, the above frequency distribution can be displayed as
In the above frequency table 45 49, 50 54 are called class intervals. 45 49 is one of the class
intervals in which 45 is lower class limit and 49 is upper class limit.
The classes are written in two forms.
IITJEE-2223-MATHEMATICS-STATISTICS
3
Relative Frequency:
The frequency of any class in a frequency table is the number of units of observations for which the
values of the variable belong to that class. Sometimes it is useful to express the frequency as a fraction of
total frequency, usually as a percentage. This fraction expressed as a percentage is called relative
frequency of the class.
The relative frequency gives useful information about the data, particularly when the calss frequencies
are large and total frequency is very large.
class frequency 100
Relative frequency = % .
Total frequency
Cumulative Frequency of a value (or class of values) is obtained by adding all the frequencies of all
values (or classes of values) less than or equal to that under consideration. Cumulative frequency is an
important concept and is useful is determining the measures of location.
Solution: From the data we find the number of plants with heights
between 30 & 40 cms = 52 48 = 4
between 40 & 50 cms = 48 30 = 18
between 80 & 90 cms = 5 2 = 3
above 90 cms = 2.
Hence, frequency distribution is
IITJEE-2223-MATHEMATICS-STATISTICS
4
EXERCISE 1
1. The following data gives the number of children in 30 families in a village
2, 3, 0, 1, 2, 4, 3, 0, 1, 2, 1, 3, 0, 2, 2, 3, 1, 1, 1, 0, 2, 4, 1, 2, 3, 2, 1, 2, 2, 1.
Represent the data in the form of a frequency distribution.
2. Following are the ages of 360 patients getting medical treatment in a hospital on a day:
Age (in years) 10 20 20 30 30 40 40 50 50 60 60 70
No. of patients 90 50 60 80 50 30
Construct the cumulative frequency distribution table.
Measure of Location:
One of the most important objectives of statistical analysis is to get one single value that describes the
characteristics of entire mass of unwieldy data. Such a value is called central value or the average. It is a
single value, which represents a group of values.
Types of Averages
(a) Mean
(i) Arithmetic Mean (ii) Weighted arithmetic mean
(iii) Geometric Mean (iv) Weighted Geometric Mean
(v) Harmonic Mean (vi) Weighted Harmonic Mean
(b) Median
(c) Mode
1 1 n
xi
x
n
x1 x 2 ... xn = xi
n i 1
i 1
n
.
(b). If the numbers xi, i = 1, 2, …, n are very large, then we can shift the origin to a point so that the
new numbers yi are smaller compared to xi. So it is easier to compute y . This method is based on the fact
IITJEE-2223-MATHEMATICS-STATISTICS
5
that if each observation of the data be changed by an amount a, then its mean is also changed by the
same amount a. This can be easily proved as below:
Let x1, x2, …, xn be the observations. Then new observations are defined by
y1 = x1 + a, y2 = x2 + a, …, yn = xn + a
The mean y of new data is
1 1 1
y x1 a x2 a ... xn a x1 x2 .... xn a a ... a nx na = x a .
n n n
Thus, when the observations xi, i = 1, 2, …, n are very large then the arithmetic mean is calculated as
follows:
A.M. A
di where d= xi A, i = 1, 2, …, n.
n
A is the assumed mean.
y
yifi or x a xi afi .
fi fi
Hence, x a
difi
fi
where di = xi a, a is assumed mean.
Hence, x
xifi 1 a f h d f = a h difi .
fi fi i i i f
i
Thus x a h i i
df
f
i
where a = assumed mean, h = length of class interval, fi = frequency of each variable
x a
di = i .
h
IITJEE-2223-MATHEMATICS-STATISTICS
6
Illustration 2: A group of 10 items has mean 6. If the mean of 4 of these items is 7.5, then find the
mean of remaining items.
1 1 1
Illustration 3: If the values 1, , , ...., occur at frequencies 1, 2, 3, 4, 5,…., n in a distribution, then
2 3 n
find the mean.
1 1 1 1 1
1 1 2 3 4 5 ...... n
Solution: Mean 2 3 4 5 n
1 2 3 ........ n
2 1 1 1 ....... 1 2
= .
n n 1 n 1
Solution: Let a = 3000 be the shift in the salaries. Then we have the following distribution
Salary No. of teachers di = xi3000 f idi
xi fi
1800 5 1200 6000
2200 6 800 4800
2800 7 200 1400
3200 5 200 1000
4500 3 1500 4500
6000 4 3000 12000
f i = 30 f idi = 5300
Illustration 5: A factory manufactures nuts and bolts of various sizes. The measurement of inner
diameters of 1000 nuts gave the following frequency table:
Diameter (mm) 43 45 46 48 49 51 52 54 55 57
No. of nuts 175 236 200 196 193
Determine the mean inner diameter per nut.
Solution: The given table is in inclusive form of frequency distribution. We first convert it into an
exclusive form and write it in the form.
Diameter (mm) 42.5 45.5 45.5 48.5 48.5 51.5 51.5 54.5 54.5 57.5
No. of nuts 175 236 200 196 193
IITJEE-2223-MATHEMATICS-STATISTICS
7
x a
fidi h = 50 4 3 49.998 mm.
fi 1000
EXERCISE 2
1. In a factory, the workers each of age 20 years or more are grouped as follows
2. A college sends the results of their entrance examination by post. The following distribution of
amount spent and the number of letters dispatched was given:
IITJEE-2223-MATHEMATICS-STATISTICS
8
Illustration 6: A school runs in two shifts. The morning shift is from 7:30 am to 12:30 pm. The afternoon
shift is from 1:00 pm to 6:00 pm. There are a total of 60 teachers working in either of the
shifts and their average salary is Rs 3000/. There are no teachers working in both the
shifts. The average salary of 36 teachers working in the morning shift is Rs 3200/. Find
the average in the afternoon shift.
Solution: We have:
Total teachers = 60
No. of teachers in morning shift = n1 = 36
No. of teachers in the afternoon shift = n2 = 60 36 = 24
Average salary of all teachers = x = 3000
Average salary of teachers in the morning shift = x1 = 3200
Let average salary of teachers in the afternoon shift be x2 .
n1x1 n2 x2 36 3200 24 x 2
We have x or 3000 =
n1 n2 36 24
1
x2 180000 115200 2700 .
24
Geometric Mean
If x1, x2, …, xn are n values of a variable x, none of them being zero, then the geometric mean G is
defined as G = (x1x2x3 …. xn)1/n.
1/N
G = x1f1 x 2f2 .... xnfn or G = antilog i 1 , where N = fi .
N i 1
Harmonic Mean
The harmonic mean of n items x1, x2, x3,…, xn is defined as:
n
Harmonic Mean =
1 1 1 1
x1 x 2 x 3 xn
Harmonic Mean = 1 2 3
f f f fn
fi .
f1 f2 f3 fn 1
x1 x2 x3
xn
fi
xi
IITJEE-2223-MATHEMATICS-STATISTICS
9
3
Solution: H.M. of 4, 8, 16 = 6.85.
1 1 1
4 8 16
EXERCISE 3
1. A boy goes to school from his home at a speed of x km/ hr and comes back at a speed of y km/ h
then find the average speed of the boy.
2. If the values 2, 8, 16, 128, 512 are given, then find the geometric mean.
Median
Median is defined as the middle most or the central value of the variables in a set of observations, when
the observations are arranged either in ascending or in descending order of their magnitudes. It divides
the arranged series in two equal parts. Median is a position average, whereas, the arithmetic mean is the
calculated average. When a series consists of an even number of terms, median is the arithmetic mean of
the two central items. It is generally denoted by M.
n1 n1
In this case th value is the median i.e. M th term.
2 2
IITJEE-2223-MATHEMATICS-STATISTICS
10
In case the data is given in the form of a frequency table with class-interval, etc., we prepare the
th
n
cumulative frequency table and determine the median class i.e. the class in which the
2
observation lies and the following formula is used to calculate the Median:
n
C
M=L+2 i , where
f
L = lower limit of the class in which the median lies
n = total number of frequencies, i.e., n = f.
f = frequency of the class in which the median lies
C = cumulative frequency of the class preceding the median class
i = width of the class-interval of the class in which the median lies.
Illustration 9: The marks obtained by 9 students in a class are 70, 47, 52, 66, 73, 61, 55, 59, 68. Find
the median marks of the students.
X 8 5 6 10 9 4 7
F 6 4 5 8 9 6 4
Solution: We note that the values of x are not given in ascending order. Hence, we first arrange the
values of x in ascending order and then form the cumulative frequency table. We have
the following table
x f Cumulative frequency
4 6 6
5 4 10
6 5 15
7 4 19
8 6 25
9 9 34
10 8 42
IITJEE-2223-MATHEMATICS-STATISTICS
11
Illustration 11: The wage distribution for the workers in a certain factory is given below:
Mode
Mode is defined as that value in a series which occurs most frequently. In a frequency distribution mode
is that variate which has maximum frequency. This measure is used when it is important to know which
values occurs most frequently.
Sometimes it so happens that the above formula fails to give the mode. In this case, the modal value lies
in a class other than the one containing maximum frequency. In such cases we take the help of the
following formula:
IITJEE-2223-MATHEMATICS-STATISTICS
12
f2
Mode = L i , where L, f 1, f 2, i have usual meanings.
f1 f2
Symmetrical Distribution:
A distribution in which mean, median and mode coincide is called symmetrical distribution.
A = M = M0
IITJEE-2223-MATHEMATICS-STATISTICS
13
Asymmetrical distribution:
In this distribution, variations do not have symmetry. If the distribution is moderately asymmetrical then
mean, median and mode are connected by the formula
Mode = 3 Median 2 Mean.
Illustration 14: In a moderately skewed distribution the values of mean and median are 5 and 6
respectively. Find the value of mode in such a situation.
EXERCISE 4
1. Find the median of the data 13, 14, 16, 18, 20, 22.
3. If the mode of a data is 18 and the mean is 24, then find median.
Measure of Dispersion
We have defined an average (mean, median) as a measure of central tendency. However, it does not
show as to how the variates are scattered about the central value. It is possible that two distributions may
have the same average (or the average may be very close) but they may differ widely in the scatter of
their values.
For example consider the scores of two cricket players in six innings. Let the scores of the two players
be as follows:
Player A 80 5 6 0 90 20
Player B 35 31 25 38 42 30
IITJEE-2223-MATHEMATICS-STATISTICS
14
Total score of both the players is 201. Therefore arithmetic mean of two players is same but A shows
large variation and B does not show much variation from inning to inning. So, players B is more
consistent. The scores of B are bunched together while the scores of A are scattered. This property is
called dispersion.
Dispersion is defined as scatter or spread of the observed valued of a quantitative variable from a central
value.
Normally, the following measures of dispersion are used:
(a) Range
(b) Mean Deviation
(c) Standard Deviation
(a) Range:
It is the simplest form of measuring the variation. The range of a set of values is the difference between
the largest and the smallest values in the set.
For example range of the values 2, 4, 10, 20, 15, 21, 16, 3 is 21 2 = 19.
Range gives very limited information. It tells the difference between the extreme values but nothing about
the variations between other values
xi 15 20 25 30 35 40 45 50
fi 6 4 7 6 5 9 5 8
IITJEE-2223-MATHEMATICS-STATISTICS
15
Solution: We form the cumulative frequency table as
xi fi Cumulative di f idi
frequency =|xiM|
15 6 6 20 120
20 4 10 15 60
25 7 17 10 70
30 6 23 5 30
35 5 28 0 0
40 9 37 5 45
45 5 42 10 50
50 8 50 15 120
495
N N
We have N = 50 = 25 + 1 = 26.
2 2
Hence, median =
1
2
value of 25th term value of 26th term
1
= 35 35 35 .
2
Mean deviation from median = fidi =
1 99
495 10 9.9 .
N 50
In case of frequency distribution when the values of the variable are given in terms of classes, then their
mid-values are taken as the values xi of the variable.
Median is used in calculating the mean deviation, because of the property that the sum of absolute values
of the deviations of the observed values from median is always the least.
This indicates that the amount of dispersion of the observed values about the median is minimum.
Illustration 16: The scores of a cricketer in 7 innings are given as follows: 67, 56, 38, 45, 52, 58, 69.
Find the mean deviation from median.
Standard Deviation:
Standard deviation of a given set of observations is defined as the positive square root of the average of
squared deviations of all observations taken from their arithmetic mean. It is generally denoted by Greek
alphabet or s.
Variance
The square of the standard deviation is called variance and is denoted by 2.
In computing the mean deviation, the signs of the deviations are ignored. Thus it is inconvient for further
mathematical treatment. So, standard deviation and variance are used which are more convenient for
further mathematical treatment and are based on all the values of the data.
IITJEE-2223-MATHEMATICS-STATISTICS
16
di
2 2 2
=
1
di2 =
di2 di =
di2 di
n n n n n n
where di = xi A,
A = assumed mean,
n = total number of observation.
(b) For grouped data
If a variate x takes values x 1, x2, …, xn with respective frequencies f i, f 2, …, f n then standard deviation is
given by
n n
fi xi x fi xi
2
i 1 i1
= n
where x n
.
fi fi
i 1 i1
If class intervals are given, then mid values of class intervals give the values of variate x.
IITJEE-2223-MATHEMATICS-STATISTICS
17
But when the mean has a fractional value, then the following formula is applied to calculate standard
deviation
2
n n
fidi2 fi di
= i 1n i1n
fi fi
i 1 i1
where di = xi A, A assumed mean.
Coefficient of Variation:
For comparing two or more series for variability, we calculate the coefficient of standard deviation and the
coefficient of variation.
The coefficient of standard deviation is defined as: coefficient of standard deviation = .
x
The coefficient of variation is defined as: coefficient of variation = 100 .
x
Coefficient of variation gives us a measure of scattering (dispersion). Scattering is less if the coefficient of
variation is small.
where xi, i = 1, 2, …, n are the first n natural numbers.
n n 1
Now, xi = 1 + 2 + 3 + … + n =
2
2 2 2 2 n n 1 2n 1
xi = 1 + 2 + … + n =
6
11 n2 n 1
2
Thus, V = n n 1 2n 1
n 6 4n
n n 1 n 1 2n 2 n2 1 .
= 4 2n 1 6 n 1 =
24n 24 12
IITJEE-2223-MATHEMATICS-STATISTICS
18
n2 1
Solution: Standard deviation of first n natural numbers is . For n = 7,
12
72 1
the value = 4 2.
12
Illustration 19: Find the mean and standard deviation for the following data
Solution: We have a grouped data. The distance between two successive mid-values of the
classes is 5, that is h = 5. We choose
x a xi 42.5
a = 42.5 and ui = i .
h 5
Class Mid value fi ui f iui ui2 f iui2
(xi)
2530 27.5 30 3 90 9 270
3035 32.5 23 2 46 4 92
3540 37.5 20 1 20 1 20
4045 42.5 14 0 0 0 0
4550 47.5 10 1 10 1 10
5055 52.5 3 2 6 4 12
100 140 404
x a h i i 42.5 5
fu 140
f 35.5
i 100
fiui
2
h2 25 140 2 = 52.
V= =
2
f u 2
= 404
fi i i fi 100 100
S.d. = = 52 7.21.
Illustration 20: The scores of 25 students in an intelligence test are given below: 75, 56, 50, 62, 68, 62,
56, 78, 80, 75, 50, 62, 72, 78, 68, 67, 80, 75, 50, 68, 80, 68, 62, 56, 68.
Find the mean and standard deviation of the data.
IITJEE-2223-MATHEMATICS-STATISTICS
19
57444
and S.d. = = 9.59.
25
Illustration 21: Find the variance of 2, 4, 6, 8, 10.
n2 1 52 1
Solution: Variance of 1, 2, 3, 4, 5, is 2
12 12
( variance of first n natural number is (n2 –1)/12, when each item is doubled (i.e. 2, 4, 5,
8, 10) variance is multiplied by 22 = 4. Required variance = 4 2 = 8)
Illustration 22: The standard deviations of two samples of sizes 50 and 100 are 8 and 7 respectively.
Find the standard deviation of the combined sample.
n112 n222 1
Solution: S.D. of combined sample 50 64 100 49 7.35
n1 n2 150
EXERCISE 5
1. Which of the following is not a measure of dispersion?
(A) mean (B) variance
(C) mean deviation (D) range
IITJEE-2223-MATHEMATICS-STATISTICS
20
5. The batting scores of two cricket players A and B in 10 innings are as follows:
Batsman A 15 17 19 27 30 36 40 90 95 110
Batsman B 10 16 21 28 37 41 36 80 82 85
Find which of the player is more consistent.
6. The weights of 9 men are 76, 74.5, 61, 64, 69, 67.5, 71, 73, 74. Find the variance and standard
deviation of the weights.
IITJEE-2223-MATHEMATICS-STATISTICS
21
ANSWERS TO EXERCISES
Exercise 1
1.
No. of children No. of families
0 4
1 9
2 10
3 5
4 2
2.
Age No, of patients Cumulative frequency
10 20 90 90
20 30 50 140
30 40 60 200
40 50 80 280
50 60 50 330
60 70 30 360
3.
Age frequency
30 40 12
40 50 18
50 60 17
60 70 13
70 80 15
There are 28 students getting more than 60 marks.
Exercise 2
1. 38.3 years 2. 8.43 rupees 3. 61.2
x 10
4. 2x 3 5.
Exercise 3
2xy
1. km/h 2. 224/5 3. 6.54
xy
Exercise 4
1. 17 2. 155 3. 22
4. median = 68.33, mode = 76.33 5. 46
Exercise 5
3. mean = 50, standard deviation = 58
4. 5
5. x A 47.9 , A 34.18 , c.v. for A = 71.35%,
x B 45.6 , B 27.06 , c.v. for B = 59.34%.
B is consistent.
6. = 4.79, variance = 22.94.
IITJEE-2223-MATHEMATICS-STATISTICS
22
SOLVED PROBLEMS
SUBJECTIVE
Problem 1: Co-efficient of variation of two series are 75% and 90% and their standard deviations
15 and 18. Find their mean.
15
Solution: Co-efficient of variance = 100 for first series 75 = 100 x = 20
x x
18
and for second series 90 = 100 x = 20.
x
Thus both the series have same mean i.e. 20.
Problem 2: Find the mean and standard deviation of the binomial co-efficients of the expansion of (1
+ x)n. Find the value for n = 4.
C0 C1 C2 ..... Cn 2n
Solution: Mean = .
n1 n 1
2
Ci Ci
2
2 = .
n n
We know that .
= co-efficient of xn in (1 + x)2n
2
2
2n
Cn 2n (2n) ! 22n
= = .
n n 1 n ! 2 n (n 1)2
Now if n = 4,
8 ! 28 363
= .
4 ! 4 ! 4 25 50
ax b
Problem 3: The S.D of the variate x is . Find the S.D of the variable ; a, b, c are constant.
c
ax b a b
Solution: Let y = y = x y = Ax + B
c c c
a b
where A = and y Ax B and hence
c c
y – y = Ax + B –(A x + B) = A (x – x ) (y – y )2 = A2(x – x )2
(y – y )2 = A2 (x – x )2 ny2 = A2 (nx2) y = |A|x.
a
Hence S.D is multiplied by |A| = .
c
IITJEE-2223-MATHEMATICS-STATISTICS
23
n 2 n
Solution: (1 + x) = C0 + C1x + C2x + ... + Cnx
n 2 3 n+1
Multiplying with x, we get x (1 + x) = C0x + C1x + C2x + ... + Cnx
Differentiating w. r. t. x, we have
nx (1 + x)n1 + (1 + x)n = C0 + 2C1x + 3C2x2 + ..... + (n + 1)Cnxn.
n1 n
Putting x = 1, this gives n (2 ) + 2 = C0 + 2C1 +3 C2 + ...... + (n + 1) Cn
C 2C1 3C2 .... (n 1)Cn 2n1(n 2)
so that, A.M. = 0 = .
(n 1) (n 1)
Problem 5: The number of observations in a group is 40. If the average of first 10 is 4.5 and that of
the remaining 30 is 3.5, then find the average of the whole group?
x1 x 2 ........... x10
Solution: 4.5
10
x x ............x 40
and 11 12 3.5
30
x x 2 ... x 40 4.5 10 3.5 30 150 15
1 .
40 10 30 40 4
Problem 6: Find the mean of the values 0, 1, 2, …………n having corresponding weight nC0, nC1,
n
C2,……………,nCn respectively?
7 4 10 9 15 12 7 9 7 80
Solution: Mean x 8.9 .
9 9
Mode z = item with maximum frequency = 7.
Arranging the data in ascending order, we get
4, 7, 7, 9, 9, 10, 12, 15.
n 1 9 1
Hence median, M = th item = 5 th item = 9.
2 2
IITJEE-2223-MATHEMATICS-STATISTICS
24
Problem 8: For a set of 100 observations, taking assumed mean as 4, the sum of the deviations is
– 11 cm, and the sum of the squares of these deviations is 275 cm 2. Find the coefficient
of variation.
Solution:
2 2
d 2 d 257 11
and = 1 .6 .
n n 100 100
1.6
Coefficient of Variation = 100 100 = 41.13% .
x 3.89
Problem 9: If a variable takes the discrete value + 4, –7/2, –5/2, –3, –2, + 1/2,
–1/2, + 5, ( > 0), then find the median.
Solution: Arranging the data, we have –7/2, –3, –5/2, –2, –1/2, + 1/2, + 4, +5
Median is 1/2(4th observation + 5th observation) = 1/2( –2 + –1/2) = –5/4.
Problem 10: A student obtained 75%, 80% and 85% in three subjects. If the marks of another subject
are added, then find the minimum average marks?
IITJEE-2223-MATHEMATICS-STATISTICS
25
4. (a) If each observation of a raw data whose variance is 2 is multiplied by k then find the
variance of new set.
(b) The median and standard deviation of a distribution are 20 & 4 respectively. If each item is
increased by 2 then find the new median & standard deviation.
5. (a) If coefficient of variation of a series is 50. Its S.D. is 21.2. Then find its arithmetic mean ?
(b) The mean of two samples of sizes 200 and 300 were found to be 25, 10 respectively. Their
standard deviations were 3 and 4 respectively. Find the variance of combined sample of size
500.
OBJECTIVE
6. If standard deviations for two variables X and Y are 3 and 4 respectively and their covariance is 8,
then correlation coefficient between them is
2 8
(A) (B)
3 3 2
9 2
(C) (D)
8 2 9
7. The arithmetic mean of a set of observation is X . If each observation is divided by and then is
increased by 10, then the mean of the new series is
X X 10
(A) (B)
X 10
(C) (D) X 10
2n
8. Median of C0 , 2nC1, 2nC2 , 2nC3 ......, 2nCn (where n is even) is
2n 2n
(A) Cn (B) Cn1
2 2
2n
(C) Cn1 (D) none of these
2
9. The median of a set of 9 distinct observations is 20.5. If each of the largest 4 observations of the
set is increased by 2, then the median of the new set
(A) is increased by 2 (B) is decreased by 2
(C) is two times the original median (D) remains the same as that of the original set
IITJEE-2223-MATHEMATICS-STATISTICS
26
10. The means and variance of n observations x1, x2, x3, …., xn are 5 and 0 respectively. If
n
x
i1
2
i 400 , then the value of n is equal to
(A) 80 (B) 25
(C) 20 (D) 16
18 18
11. If x1,x 2 ,..., x18 are observation such that (x
j1
j 8) 9 and (x
j1
j 8)2 45 , then the
46n
12. If the mean of n observations 12, 22, 32, …., n2 is then n is equal to
11
(A) 11 (B) 12
(C) 23 (D) 22
n n
13. The standard deviation of n observations x 1, x2, ….,xn is 2. If
i1
xi 20 and x
i1
2
i 100 , then n
is
(A) 10 or 20 (B) 5 or 10
(C) 5 or 20 (D) 5 or 15
14. If is the standard deviation of a random variable x, then the standard deviation of the random
variable ax + b, where a, b R is
(A) a + b (B) |a|
(C) |a| + b (D) a2
IITJEE-2223-MATHEMATICS-STATISTICS
27
ASSIGNMENTS
SECTIONI
1. The mean of n observations x1, x2, x3, …, x n is X . If (a b) is added to each of the observations,
show that the mean of the new set of observations is X (a b) .
2. The mean monthly salary of 10 members of a group is 1445, one more member whose monthly
salary is Rs. 1500 has joined in group. Find the mean monthly salary of 11 members of the group.
3. The sum of the deviations of a set of n values x 1, x2, x3, … , x n measured from 50 is 10 and the
sum of deviations of the values from 46 is 70. Find the values of n and the mean.
5. The mean of 200 items was 50. Later on it was discovered that the two items were misread 92
and 8 instead of 192 and 88. Find the correct mean.
6. Thirty children were asked about the number of hours they watched T.V. programs in the
previous week. The result were as follows:
1 6 2 3 5 12 5 8 4 8
10 3 4 12 2 8 15 1 17 6
3 2 8 5 9 6 8 7 14 12
(i) Make a grouped frequency distribution table for this data, taking class width 5 and one of the
class intervals as 5-10.
(ii) How many children watched television for 15 or more hours a week ?
7. The mean of marks scored by 100 students was found to be yes. Later on it was discovered that
a score of 53 was misread as 83. Find correct mean.
10. The following observations have been arranged in ascending order. If the median of data is 63.
Find the vale of x.
29, 32, 48, 50, x, x + 2, 72, 78, 84, 95.
11. The mean height of 29 male workers is 71 cms and 31 female workers is 48 cms. Find the
combined mean height of all 60 workers in the factory.
12. The price of a commodity is increased by 5% from 1997 to 1998, 8% from 1998 to 1999 and 53%
from 1999 to 2000. Find the average from the period 1997 to 2000.
13. The arithmetic mean of 4 observations was calculated as 22. It was later observed that one of the
observation was recorded a 14 instead of 40. Find the correct arithmetic mean.
IITJEE-2223-MATHEMATICS-STATISTICS
28
14. The weighted arithmetic mean of 10 observations was 36. However a particular observation was
recorded as 60 instead of 40. In what ratio should be the weights of correct and incorrect
readings so as to have no change in AM.
15. (a) The geometric mean of n items is G. If first term is kept same, second made twice, third
made thrice … and so on, find the new mean.
(b) If each item is made n times, then prove that mean also becomes n times.
16. Show that the mean deviation from the mean of the A.P a, a + d, a + 2d, ..., a + 2nd is
independent of the common difference of A.P..
18. The mean and standard deviation of one sample are respectively 54.8 and 8, the mean and
standard deviation of another sample are 50.3 and 7 respectively. The size of the first sample is
50 and that of the second is 100. Find the mean and standard deviation of the composite sample
(size 150) combining the above two samples.
19. The geometric mean of 5 observations was calculated as 11. If was later observed that one of the
observation was recorded as instead of 64. Find the correct geometric mean.
20. The mean annual salaries paid to 1000 employees of a company was Rs. 5000. The mean
annual salaries paid to male and female employees were Rs. 200 and Rs. 4200 respectively.
Determine the percentage of males and females employed by the company.
21. If a vehicle covers the distance along four sides of a square with four speeds x, 2x, 3x and 4x
m/sec respectively, then show that harmonic mean of speeds is better average than arithmetic
mean and hence find the average speed.
22. The mean and standard deviation of a set of 100 observations were worked out as 40 and 5
respectively. But by mistake a value 50 was taken in place of 40 for the observation. Recalculate
the correct mean and standard deviation.
23. Prove that the sum of squares of the deviations of a set of values is minimum when taken about
mean.
IITJEE-2223-MATHEMATICS-STATISTICS
29
SECTIONII
MULTI CHOICE SINGLE CORRECT
6. The mean deviation of the series a, a + d, a + 2d, ….., a + 2nd from its mean is
(A)
n 1 d (B)
nd
2n 1 2n 1
n(n 1)d (2n 1)d
(C) (D)
(2n 1) n(n 1)
7. A batsman scores runs in 10 innings as 38, 70, 48, 34, 42, 55, 63, 46, 54 and 44. the mean
deviation about mean is
(A) 8.6 (B) 6.4
(C) 10.6 (D) 7.6
IITJEE-2223-MATHEMATICS-STATISTICS
30
9. Let x1, x2, x3, ……,xn be the values taken by a variable X and y1, y2, ….. , yn be the values taken
by a variable Y2such that yi = axi + b, I = 1, 2, …., n. Then, 2
(A) Var (Y) = a Var(X) (B) var (X) = a var (Y)
(C) Var (Y) = Var (X) + b (D) None of these
ax b
10. If the standard deviation of a variable X is 6, then the standard deviation of variable is
c
a
(A) a (B)
c
a a b
(C) (D)
c c
11. If the standard deviation. of a set of observations is 8 and if each observation is divided by -2, the
standard deviation of the new set of observations will be
(A) -4 (B) -8
(C) 8 (D) 4
aX b
12. If two variants X and Y are connected by the relation Y = , where a, b, c are constants such
c
that ac < 0, then
a a
(A) y = X (B) y = X
c c
a
(C) y = X b (D) None of these
c
IITJEE-2223-MATHEMATICS-STATISTICS
31
x 170 . On observation that was 20 was found to be wrong and was replaced by the correct
value 30, then the corrected variance is
(A) 78.00 (B) 188.66
(C) 177.33 (D) 8.33
20. Mean of the numbers a, b, 8, 5, 10 is 6 and the variance is 6.80, then the possible values of a and
b respectively are
(A) 5, 2 (B) 1, 6
(C) 3, 4 (D) 0, 7
21. The mean and standard deviation of the marks of 200 candidates were found to be 40 and 15
respectively. Later it was discovered that a score of 40 was wrongly used as 50. The correct
mean and standard deviation respectively are
(A) 14.98, 39.95 (B) 39.95, 14.98
(C) 39.95, 224.5 (D) none of these
COMPREHENSION TYPE
Consider the observations x1 = 1, x2 = 2, x3 = 3,----------------- x100 = 100,
x101 = 101, x102 = 102, x103 = 103, x104 = 104
IITJEE-2223-MATHEMATICS-STATISTICS
32
ANSWERS TO ASSIGNMENTS
SECTION - I
2. 1450 3. n = 26, mean = 49.5
5. 50.9 6. (ii) Two children
7. 39.7 8. n = 8, x = 10.75
9. 61.71 10. 62
k 1
13. 19.5 15.(a) X +
2
16.
n 1 n 19. X 12 =51.67 ; 12 = 7.6
2n 1
20. 22 21. 80% and 20%
22. X =39.9 ; = 4.9
SECTIONII
MULTI CHOICE SINGLE CORRECT
1. D 2. A 3. C 4. D
5. A 6. C 7. A 8. C
9. A 10. C 11. D 12. B
13. D 14. B 15. D 16. A
17. B 18. C 19. A 20. C
21. B
IITJEE-2223-MATHEMATICS-STATISTICS