1st Sem Stats Book (BHALOTIA) (PDF - Io)
1st Sem Stats Book (BHALOTIA) (PDF - Io)
2. Formulae 02 – 04
4. Chapter 2:Tabulation 09 – 11
8. Chapter 6:Mean 41 – 56
1st Semester
Module II: Statistics (GE: 1.1 Chg)
Internal Assessment: 10 marks
Semester-end Examinations: 40 marks
Total: 50 marks
Unit 1: Fundamentals:
Definition of Statistics, Scope and limitation of Statistics, Attribute and variable, Primary and secondary data,
Method of data collection, Tabulation of data, Graphs and charts, Frequency distribution, Diagrammatic
presentation of frequency distribution. [8Marks]
Unit 5: Interpolation:
Finite differences, Polynomial function, Newton’s forward and backward interpolation formula, Lagrange’s
interpolation formula. [8Marks]
Formulae:
Measures of Central Tendency
Mean:
1. without Frequency: A.M. x = ( ) ∑n x
With Frequency: A.M. x = ( ) ∑Nfx where, N = ∑ f
2. Step deviation method: x = A +
∑d x I where, d =
x− A
N I
3. Combined A.M. x = () n1 x1 + n 2 x 2
n1 + n 2
=
4. ∑
(
f x−x =0)
β
n
5. Without Frequency::HM (H) =
I I I
+ + ... +
x1 x 2 xn
6. For two positive numbers: A.M. ≥ G.M . ≥ H .M .
7. For two positive numbers: A.M. x H.M. = (G.M )
2
-------------------------------------------------------------------------------------------------------------------
Median:
=
N +1
For ungrouped data: Median = The value of th item
2
N
− c. f .
For grouped data: Median = L + 2 ×I
f
Where, L = lower boundary of the median class.
N = total frequency.
c.f = cumulative frequency (less than type) preceding the median class.
I = class width of the median class.
-------------------------------------------------------------------------------------------------------------------
Mode:
For a frequency distribution having equal width and only one modal class
f m − f1
Mode = L + ×I
2 f m − f1 − f 2
Where, L = lower class boundary of the modal class
fm = frequency of the modal class
f1 = frequency of the class preceding the modal class
f2 = frequency of the class next to the modal class
I = common width
If y = a + bx, then
A.M. of y = a+b (A.M. of x)
Median of y = a +b (Median of x)
Mode of y = a + b (Mode of x)
=
-------------------------------------------------------------------------------------------------------------------
Measures of Dispersion
β
Quartile deviation
Q3 − Q1 Q − Q.1
Quartile deviation (Q.D.) = , Coefficient of Q.D. = 3 × 100
2 Q3 + Q.1
-------------------------------------------------------------------------------------------------------------------
Mean Deviation:
1
Without Frequency: Mean deviation (M.D.) = ∑ f x − A
=
n
1
With Frequency : (M.D.) = ∑ f x − A where, A = mean or median
N
M .D.
Coefficient of M.D. = × 100
A
-------------------------------------------------------------------------------------------------------------------
S.D: [Using Mean]
Without Frequency: : (σ ) =
∑x 2
−
∑x
; With Frequency: (σ ) =
∑ fx 2 ∑ fx
−
n n N N
-------------------------------------------------------------------------------------------------------------------
Step-deviation method :
Without Frequency: : (σ ) =
∑d 2
∑d
− where d = ( x − A)
n n
2
With Frequency: (σ ) =
∑ fd ∑ fd
−
2
× I where d = x − A
N
N I
S .D.
Coefficient of variation (C.V.) = × 100
Mean
-------------------------------------------------------------------------------------------------------------------
=
Combined S.D & Variance:
β
n1(s.d 12 + d 2 )+n (s.d 2+ d 2 )
Combined S.D. (σ ) = 1 2 2 2
n1 + n2
n1 x1 + n2 x 2
Where, d1 = x1 − x, d 2 = x 2 − x and x=
n1 + n2
8. If y = a + bx, then σ y = b σ x
=
2. Limitation of Statistics
(a) Statistics does not study individuals:
(b) Statistics is not suitable for studying qualitative data:
(c) Statistical laws are not exact:
(d) Statistics is liable to be misused:
3. Primary data
Primary data:-
Data that has been generated by the researcher himself/herself, surveys, interviews, experiments, specially
designed for understanding and solving the research problem at hand.
Methods of collecting Primary data:-
The methods of collecting primary data in a statistical investigation are as follows.
(a) Direct personal inquiry
(b) Information from local agents and correspondents
(c) Questionnaires
(d) Interviews, surveys, and fieldwork
(e) Internet communications on email, blogs, listservs, and newsgroups
4. Secondary data
Secondary Data:-
Using existing data generated by large government Institutions, healthcare facilities etc. as part of
organizational record keeping. The data is then extracted from data files.
Sources of Secondary Data
(a) Books, Newspapers, Websites
(b) Government Records
(c) Published/Semi-published Sources & Unpublished Personal Sources
BASIS FOR
PRIMARY DATA SECONDARY DATA
COMPARISON
Meaning Primary data refers to the first hand Secondary data means data collected by
data gathered by the researcher someone else earlier.
MCQ of 2 Marks
1. Which of the following is the source of Secondary Data?
(a) Interview method
(b) Mailed questionnaire method;
(c) Observation method
(d) International sources & Government Sources
2. Which of the following is the source of Primary Data?
(a) Interview method
(b) International sources & Government sources
(c) Private and quasi-government sources
(d) Unpublished sources of various research institutes,
3. When data are arranged at regular interval of time, the classification is called:
(a) Qualitative
(b) Quantitative
(c) Chronological
(d) Geographical
4. Classification of data according to location or areas is called:
(a) Qualitative classification
(b) Quantitative classification
(c) Geographical classification
(d) Chronological classification
5. Classification of data by attributes is called:
(a) Quantitative classification
(b) Chronological classification
(c) Qualitative classification
(d) Geographical classification
6. The frequency distribution according to individual variate values is called:
(a) Discrete frequency distribution
(b) Cumulative frequency distribution
(c) Percentage frequency distribution
(d) Continuous frequency distribution
7. Qualitative character of statistics is known as [2020]
(A) variable (B) attribute (C) data (D) information.
8. Statistics comes from the Latin word__________
(a) Status
(b) Statista
(c) Statistik
(d) None of these
9. Which of the following is an example of attribute
(a) Age of students
(b) Ranks of students
(c) Merit of students
(d) All of these
10. Drinking habit of a person is
(a) An attribute
(b) A variable
(c) A discrete variable
(d) A continuous variable
- 8 – Admission going on for Regular/Crash course for B.com. Call
Bhalotia Classes (9883034569/8820696761): 1st Semester (Statistics)
CHAPTER 2: Tabulation
Tabulation:
Tabulation of Data
The process of placing classified data into tabular form is known as tabulation. A table is a symmetric
arrangement of statistical data in rows and columns. Rows are horizontal arrangements whereas columns
are vertical arrangements. It may be simple, double or complex depending upon the type of classification.
----Box Head----
Foot Notes…
Source Notes…
(1) The Title
The title is the main heading written in capitals shown at the top of the table. It must explain the contents
of the table and throw light on the table.
1. Draw a Table
In 2020, out of total 1,250 workers in a factory, 775 were members of a trade union. The number of
women workers employed was 125, out of which 70 did not belong to any trade union.
In 2021, the number of union workers was 1,390 of which 825 were men. The number of non-union
workers was 270, among which 110 were women. Present the given information in a suitable tabular
form.
Solution:
Distribution of Workers in a Factory
2020 2021
Category
Male Female Total Male Female Total
Union
720 55 775 825 565 1390
Non-
405 70 475 160 110 270
Union
Total 1125 125 1250 985 675 1660
2. Draw a Table
In a sample study about coffee habit in two towns, the following information was received:
(i) Town A: Females were 40%; Total coffee drinkers were 45% and Males non-coffee drinkers were
20%.
(ii) Town B: Males were 55%; Males non-coffee drinkers were 30% and Females coffee drinkers were
15%. Present the data in a tabular form.
Solution:
Percentage of Coffee Drinkers and Non-Coffee Drinkers in City A and City B
City A City B
Attribute Male Female Total Male Female Total
(%) (%) (%) (%) (%) (%)
Coffee drinkers 40 5 45 25 15 40
Non-coffee drinkers 20 35 55 30 30 60
Total 60 40 100 55 45 100
3. Draw a table
In 2020, out of a total of 3600 workers in a factory 2050 were members of trade union. The number of
women workers employed were 1200 of which 650 did not belong to any union.
In 2021, the number of workers in the union was 2600 of which 1800 were men. The number of non-
union workers was 1900 of which 1200 were women. Present the information in a suitable table.
4. Draw a table
Out of total number of 750 people, who applied for job in Reliance Industries, 150 are females. Out of
total applicants, 420 are graduates and other are post graduate. The number of male applicants who are
post graduate are 250. Tabulate the given information.
MCQ of 2 Marks
1. The arrangement of data in rows and columns is called:
(a) Classification
(b) Tabulation
(c) Frequency distribution
(d) Cumulative frequency distribution
3. The_______ is the main heading written in capitals shown at the top of the table.
(a) Title
(b) Caption
(c) Head-notes
(d) Stubs
4. The vertical heading and subheading of the column are called columns _______
(a) Stubs
(b) Title
(c) Caption
(d) Head-notes
5. The headings of the rows given in the first column of a statistical table are called
(a) Stubs
(b) Title
(c) Caption
(d) Head-notes
6. These appear immediately below the body of the table providing additional explanation.
a) Prefatory Notes
b) Foot Notes
c) Source Notes
d) Stub
7. A statement given below the title and enclosed in brackets usually describes the units of
measurement and is called the prefatory notes.
a) Prefatory Notes
b) Foot Notes
c) Source Notes
d) Stub
8. The ________are given at the end of the table indicating the source the information has been taken
from. It includes the information about compiling agency, publication, etc.
a) Source Notes
b) Stub
CHAPTER 3: Frequency
Distribution
1. Frequency Distribution
A frequency distribution is a convenient way of presenting a large mass of data in tabular form by
grouping the data. These are two types of frequency distributes, viz discrete and continuous
Class Frequency:-
The number of observations that fall in a particular class is called the class frequency and is denoted by
the letter ‘f’.
Class Limits:-
The smallest and largest values that define a given class interval are referred to as its lower class limit &
upper class limit respectively.
Types of Classes:
(1)Inclusive Method (Non-overlapping): Class may be grouped as 0-9, 10-19, 20-29, ….., where the
class interval 0-9 include all values from 0 to 9 (both inclusive).
(2)Exclusive Method (Overlapping): The class interval 0-10 includes all values which are greater than
or equal to 0 but less than 10, the class interval 10-20 includes all values which are greater than or equal
to 10 but less than 20, and so on. Thus if a student has secured 10 marks, he will be included in the class
10-20.
Class Boundaries:-
In Overlapping;Class limits = Class boundaries
In Non- Overlapping;
Lower class boundary = lower class limit – d/2
Upper class boundary = upper class limit + d/2
Here d is the gap between the upper class limit of one class and the lower class limit of next class.
Class Width:-
The difference between the upper and lower class boundaries of a class interval is called the class width
or Length of the class interval. Class width = Upper class boundary - Lower class boundary.
Frequency Density :
Class Frequency
Frequency Density =
Class Width
Range +1
Minimum Class Width =
Number of classes
100 schools decided to plant 100 tree saplings in their gardens on world
environment day. Represent the given data in the form of frequency distribution
and find the number of schools that are able to plant 50% of the plants or more?
95, 67, 28, 32, 65, 65, 69, 33, 98, 96, 76, 42, 32, 38, 42, 40, 40, 69, 95, 92, 75, 83, 76,
83, 85, 62, 37, 65, 63, 42, 89, 65, 73, 81, 49, 52, 64, 76, 83, 92, 93, 68, 52, 79, 81, 83,
59, 82, 75, 82, 86, 90, 44, 62, 31, 36, 38, 42, 39, 83, 87, 56, 58, 23, 35, 76, 83, 85, 30,
68, 69, 83, 86, 43, 45, 39, 83, 75, 66, 83, 92, 75, 89, 66, 91, 27, 88, 89, 93, 42, 53, 69,
90, 55, 66, 49, 52, 83, 34, 36
Solution:
To include all the observations in groups, we will create various groups of equal
intervals. These intervals are called class intervals. In the frequency distribution, the
number of plants survived is showing the class intervals, tally marks are showing
frequency, and the number of schools is the frequency in numbers.
In the table 1, the gap between any two consecutive classes is 1 and therefore ½ = 0.5 is subtracted from
lower class limit & 0.5 is added in upper class limit.
MCQ of 2 Marks
1. The largest value is 60 and the smallest value is 40 and number of classes desired is 5 then the
length of the class interval is:
a) 4
F
b) 20
c) 12
d) 8
2. Cumulative frequency is of [2018]
(a) One type
(b) Two type
(c) Three type
(d) Four type
3. In a frequency distribution, which of the following is sometimes referred to as the class mark?
(a) relative frequency
(b) cumulative frequency
(c) class mid-point
(d) frequency
4. The monthly wages (in Rs.) of 20 workers are given below:
130,62,145,95,116,100,103,71,76,151,142,110,98,85,80,122,132,118,125 & 95
Prepare a frequency distribution with 5 classes of equal widths starting from Rs. 62 and calculate the
respective class frequencies.
=
bp
a) 4,3,6,4,3
b) 3,4,6,4,3
pp
c) 4,6,3,4,3
d) 3,4,4,6,3
5. The monthly salaries of 20 employees are as follows (in ₹).
i
130, 62, 145, 95, 116, 100, 103, 71, 76, 152, 142, 110, 98, 85, 122, 132, 118, 125, 95, 99
What will be the class width of a frequency distribution table with 5 classes of equal intervals.
=
(a) 19
(b) 18
f
(c) 20
(d) 17
il
6. From the following table find the number of workers and express the same in percentage
having wages between ₹ 22 and ₹ 58.
e
Wages (₹): 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80
No. of Workers: 20 45 85 160 70 55 35 30
(a) 342, 68.4 %
(b) 340, 68 %
(c) 350, 70 %
(d) None of the above
7. From the following table find the number of workers and express the same in percentage
having wages more than ₹ 44
Wages (₹): 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80
No. of Workers: 20 45 85 160 70 55 35 30
(a) 162, 32.4 %
(b) 160, 22%
(c) 150, 30%
Cereals 3940
Oilseeds 1165
Pulses 464
Cotton 249
Others 822
Draw a simple Bar chart.
Solution:
1974 100 30 10
1975 120 40 15
1976 130 45 25
1977 150 50 25
Solution:
The Multiple bar diagram for the above data is given below.
Saving (Rs.) 50 30
Solution:
1974 100 30 10
1975 120 40 15
1976 130 45 25
1977 150 50 25
Solution:
The Component bar diagram for the above data is given below.
9. Ogives:
For a set of observations, we know how to construct a frequency distribution. In some cases we may
Require the number of observations less than a given value or more than a given value. This is obtained
by a accumulating (adding) the frequencies up to (or above) the give value. This accumulated frequency is
Called cumulative frequency.
Draw the Ogives for the following data.
Class interval Frequency
20-30 4
30-40 6
40-50 13
50-60 25
60-70 32
70-80 19
80-90 8
90-100 3
Solution:
Class limit Less than ogive More than Ogive
20 0 110
30 4 106
40 10 100
50 23 87
60 48 62
70 80 30
80 99 11
90 107 3
100 110 0
OGIVE
10. Ogive
Question:
Draw ogives for the following data
0-10 4 4 29
10-20 7 11 25
20-30 6 17 18
30-40 10 27 12
40-50 2 29 2
Solution:
11. Histogram:
A Histogram as a bar chart or graph showing the frequency of occurrence of each value of the variable
being analysed. In histogram, data are plotted as a series of rectangles. Class intervals are shown on the
‘X-axis’ and the frequencies on the ‘Y-axis’.
The height of each rectangle represents the frequency of the class interval. Each rectangle is formed with
the other so as to give a continuous picture. Such a graph is called staircase or block diagram.
Example 4:****
Draw a histogram for the following data.
Daily Wages Number of Workers
0-50 8
50-100 16
100-150 27
150-200 19
200-250 10
250-300 6
Solutions:
Note:
Mode can be calculated with the help of Histogram.
12. Histogram
Question:
Draw a histogram for the following data
2.5-3.5 4
3.5-4.5 6
4.5-5.5 10
5.5-6.5 26
6.5-7.5 24
7.5-8.5 15
8.5-9.5 10
9.5-10.5 5
Solution:
2.5-3.5 3 4
3.5-4.5 4 6
4.5-5.5 5 10
5.5-6.5 6 26
6.5-7.5 7 24
7.5-8.5 8 15
8.5-9.5 9 10
9.5-10.5 10 5
Solution:
Frequency Polygon
Pie-chart
17. Pie-chart
Question:
Given the population of 1991 of four southern states of India. Construct a pie diagram forthe following data.
State Population
Karnataka 448
Kerala 290
Total 1957
Solution:
Using the formula
Actual value
Angle (in Degree) = x 360o
Total of actual value
State Angle
Karnataka 82.410
Kerala 53.350
Pie-chart
F
₹)
Food 36,000 50 % 180o
Rent 9,000 12.5% 45 o
Cloth 27,000 37.5% 135 o
Total 72,000 100 360 o
Amount
Percentage (%) = 𝑥𝑥 100
Total Amount
36,000
% of Food = 𝑥𝑥 100 = 50 %
72,000
9,000
% of Rent = 𝑥𝑥 100 = 12.5 %
72,000
=
bp
27,000
pp
Amount
Angle (in Degree) = x 360o
Total Amount
f
36,000
Angle of Food = 𝑥𝑥 360 = 180o
72,000
il
9,000
Angle of Rent = 𝑥𝑥 360 = 45 o
72,000
e
27,000
Angle of Cloth = 𝑥𝑥 360 = 135o
72,000
Angle Given
Amount = Total Amount x
360
180
Amount of food = 72,000 x = 36,000
360
45
Amount of Rent = 72,000 x = 9,000
360
135
Amount of Cloth = 72,000 x = 27,000
360
Angle Given
Percentage (%) = x 100
360
180
% of food = 𝑥𝑥 100 = 50 %
360
45
% of Rent = 𝑥𝑥 100 = 12.5%
360
135
% of cloth = 𝑥𝑥 100 = 37.5%
360
If the total yield is 8100 tons, how much will be the yield of rice (In ton) and by how much percentage the
yield of Ragi will exceed the yield of rice?
a) 900 and 140%
b) 800 and 120%
c) 700 and 150%
d) 900 and 150%
Solutions:
40
Yield of Rice = 8100 x = 900 tonnes
360
% of Ragi−% of Rice
percentage the yield of Ragi will exceed the yield of rice =
% of Rice
x 100
100 % − 40%
= x 100
40 %
60%
= x 100
40 %
= 150%
Answer (d) 900 & 150 %
What is the difference between the number of Literate Males and Literate Females, If total number is
2,50,000 ?
a) 3500
b) 4500
c) 5000
d) 6000
Solutions:
% of Literate Males = 35 %,
% of Literate Females = 33 %
Difference = 35 % - 33 % = 2 % of total
= 2% of 2,50,000
= 5,000
Hindi 60o;
Mathematics 90o,
History 72o,
Science 75o,
English 63o
Solutions:
Amount
Angle (in Degree) = x 360o
Total Amount
108
= x 360o
540
= 720
Hence The subject in which the student scored 108 marks is History
(i) If the total amount that was spent in the year 2000 was 50000, the amount that was spent
on food was
(a) ₹ 225000
(b) ₹ 22500
(c) ₹ 24000
(d) ₹ 21400
(ii) If the total amount that was spent in the year2000 was Rs. 48000, the amount that was
spent on housing and clothing combined was
(a) ₹ 9600
(b) ₹ 8000
(c) ₹ 10000
(d) ₹ 12000
(iii) The ratio of the amount spent on teaching and education to clothing and food was
(a) 3:2
(b) 2:3
(c) 1:5
(d) 1:2
(iv) According to the graph, the maximum amount that was spent was on which item (if the
expenditure that is given to be Rs. 58256)?
(a) Food
(b) Housing
(c) Clothing
(d) Others
(v) If the amount spent during the year 2000 was Rs. 62000, the amount spent on clothing
during the year was
(a) ₹ 1500
(b) ₹ 15000
(c) ₹ 3100
(d) ₹ 3200
Mean
The mean is the arithmetic average, and it is probably the measure of central tendency that you are most
familiar. Calculating the mean is very simple. You just add up all of the values and divide by the number
of observations in your dataset.
The calculation of the mean incorporates all values in the data. If you change any value, the mean
changes. However, the mean doesn’t always locate the centre of the data accurately.
Median
The median is the middle value. It is the value that splits the dataset in half. To find the median, order
your data from smallest to largest, and then find the data point that has an equal amount of values above it
and below it.
Mode
The mode is the value that occurs the most frequently in your data set. On a bar chart, the mode is the
highest bar. If the data have multiple values that are tied for occurring the most frequently, you have a
multimodal distribution. If no value repeats, the data do not have a mode.
Quartiles
The quartiles of a data set divide the data into four equal parts, with one-fourth of the data values in each
part. The second quartile position is the median of the data set, which divides the data set in half. We
need 3 quartiles to divide data in 4 parts: Q1, Q2, and Q3,
Deciles:
Deciles and percentiles are usually applied to large data sets. Deciles divide a data set into ten equal parts.
We need 9 Deciles to divide data in 10 parts: D1, D2,…………. and Q9,
Percentiles
Percentiles divide a data set into 100 equal parts. We need 99 percentiles to divide data in 100 parts: P1,
P2,…………. and P99,
CHAPTER 6: Mean
Arithmetic Mean:-
The arithmetic mean or simply the mean is the most popular and commonly used measure of central
tendency. It is what we commonly call the average.
Definition: The arithmetic mean of a set of observations is equal to the sum of all the observations
divided by the total number of observations.
∑d
A.M. = A +
𝑁𝑁
∑fX
A.M. =
𝑁𝑁
Where N = ∑f = total frequency.
Where x is the mid-value and i is the common size (or width) of the class intervals.
d = (x-A)/ i
Property 2. If each observation of a data is increased or decreased by a constant k, then the arithmetic
mean of the new data also gets increased or decreased by k. That is, if y = x+k
Property 3. If each observation of a data is multiplied by a constant k, then the arithmetic mean of the
data so obtained also gets multiplied by it.
Property 4. If all the observations in the given data set have a value say ‘m’, then their AM, GM & HM
are also ‘m’
Combined Mean:-
����+ 𝑁𝑁2𝑋𝑋2
𝑁𝑁1𝑋𝑋1 ���� +⋯
Combined Mean or 𝑋𝑋� =
𝑁𝑁1+𝑁𝑁2+⋯
Geometric Mean:
G.M. = n
�𝑿𝑿𝟏𝟏 . 𝑿𝑿𝟐𝟐 . 𝑿𝑿𝟑𝟑 … … … . . 𝑿𝑿𝒏𝒏
Harmonic Mean:
𝑵𝑵
H.M. = 𝟏𝟏
∑
𝑿𝑿
∑ 𝑥𝑥 5+3+7+8+4+9 36
Mean = 𝑥𝑥 = 𝑁𝑁
= 6
= 6
=6
X f fx
10 1 10
11 2 22
12 4 48
13 2 26
14 1 14
∑f=10 ∑fX=120
∑ 𝑓𝑓𝑓𝑓 120
Mean = 𝑥𝑥 = ∑ 𝑓𝑓
= 10
= 12
∑ 𝑓𝑓𝑓𝑓 4130
Mean = 𝑥𝑥 = ∑ 𝑓𝑓
= 80
= 51.525
Marks f X fx
(Mid-value)
20-40 10 30 300
40-60 16 50 800
60-80 32 70 2240
80-100 28 90 2520
=
100-120 14 110 1540
∑f=100 ∑fX = 7400
∑ 𝑓𝑓𝑓𝑓 7400
Mean = 𝑥𝑥 = ∑ 𝑓𝑓
= 100
= 74
Solution:
=
∑ 𝑥𝑥
Mean = 𝑥𝑥 = 𝑁𝑁
9 + 𝑥𝑥 = 18, 𝑜𝑜𝑜𝑜, 𝑥𝑥 = 18 − 9 = 9.
∴ Mean (x ) = n 1 x1 + n2 x 2
n1 + n2
=
50 × 54.1 + 100 × 50.3
50 + 100
=
2705 + 5030
150
7735
= 150 = 51.57(𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎)
7. Geometric Mean
Calculate the Geometric Mean of 3, 6, 24 and 48.
Solution:
G.M. = (3 x 6 x 24 x 48 )1/4
= (3 x 3 x 2 x 3 x 23 x 3 x 24) 1/4
= (3 4 x2 8 ) 1 / 4
= 3 x 2 2 = 12
8. Geometric Mean
The G.M. of x, 9, 12 be 6, find the value of x.
Solution:
By definition, (x . 9 . 12) 1/3 = 6;
or, 108x = 63 = 216 or, x = 2
9. Geometric Mean
The G.M. of a, 4, 8 be 6, find the value of a
Solution:
Since the G.M. of three numbers a, 4, 8 is 6, hence,
(a x 4 x 8)1/3 = 6
or, 32a = 63 (cubing both sides)
Or, a = 216 / 32 = 27/4 = 6.75.
𝟒𝟒
H.M =
0.3333+0.1667+0.04167+0.02083
𝟒𝟒
H.M = = 7.11
0.5625
OR
𝟒𝟒
H.M. = 𝟏𝟏 𝟏𝟏 𝟏𝟏
+ + + 𝟏𝟏
𝟑𝟑 𝟔𝟔 𝟐𝟐𝟐𝟐 𝟒𝟒𝟒𝟒
𝟒𝟒
= 𝟏𝟏𝟏𝟏+𝟖𝟖+𝟐𝟐 +𝟏𝟏
𝟒𝟒𝟒𝟒
48
= 4x
27
= 7.11
24 𝟑𝟑
=
7 𝟒𝟒𝟒𝟒 + 𝟖𝟖 + 𝐚𝐚
𝟖𝟖𝒂𝒂
24 𝟐𝟐𝟐𝟐𝟐𝟐
=
7 5𝑎𝑎+8
48 a = 192
a=4
12. AM x HM = (GM)2
The A.M. & G.M between two numbers are 6.5 and 6 respectively; find the H.M.
Solution:
For two numbers we have, A.M. x H.M. = (G.M.)2
or, 6.5 x H. M. = 62 [A.M. = 6.5 and G.M. = 6 (given)]
Or, HM = 36/6.5 = 5.54 [Approx]
a+b = 2 x 6.5 = 13
b = 13-a …(1)
GM2 = ab = 6x 2 = 36 …(2)
a (13-a) = 36
a2–13a+36 = 0
a2–9a – 4a +36 = 0
(a – 9) (a – 4) = 0
𝟐𝟐 𝟐𝟐
Average speed = H.M = 𝟏𝟏 𝟏𝟏 = = 12.92/kms/hour
+ 0.1547
𝟏𝟏𝟏𝟏 𝟏𝟏𝟏𝟏
= (1520 - 36 + 56)
= 1540.
= 500.
Mean weight of 35 students and the teacher (45 + 0.5) kg = 45.5 kg.
Total weight of 35 students and the teacher = (45.5 × 36) kg = 1638 kg.
Or, a = 80
Hence, the percentage of men and women in the group is 80, 20 respectively
970+2𝑥𝑥
Or, 117 = 10
Solution:
Height (inch) No. of Person (f) Mid value (X) fx
60–62 15 61 915
63–65 a 64 64a
66–68 126 67 8442
=
69–71 81 70 5670
72–74 24 73 1752
∑f=246 + a ∑fX=16779 + 64a
∑ 𝑓𝑓𝑓𝑓
Mean = 𝑥𝑥 = ∑ 𝑓𝑓
16779+64𝑎𝑎
Or, 67.45 = 246+𝑎𝑎
Or, a = 54
Arithmetic Mean:
6. Find the A.M, G.M and H.M of 3, 6, 24, 48.
Geometric Mean:
10. Find the GM of the set of numbers 1,3,9,27.
a) 9
b) 3
c) 5.916
d) 5.196
11. Find the GM of 12, 24, 48
a) 16
b) 24
c) 36
d) 18
Harmonic Mean:
16. Find the H.M of 3, 6, 24, 48
a) 20.25
b) 12
c) 7.11
d) 16
17. Find the HM of 1,1/2,1/3,1/4,1/5.
a) 0.5
b) 0.33
c) 0.25
18. The harmonic mean of 2, a, 8 be 24/7, then the value of a is [2020]
(A) 9 (B) 4 (C) 6 (D) 8.
Average Speed:
23. A man travels from Agra to Gwalior at a speed of 30 kmph and returns at a speed of 60 kmph.
What is his average speed?
a) 45 kmph
b) 40 kmph
Correct Mean:
24. The A.M of 25 observations is 44; later on it was reported that two of the observations 34 and
46 were copied wrongly as 28 and 42. Find the actual A.M?
a) 46.4
b) 40.4
c) 42.4
d) 44.4
25. The mean of 20 observations is 45. On checking it was found that two observations were
wrongly recorded as 36 and 44. While the correct value were 38 and 46. What is the correct
mean.
a. 45.5
b. 45.2
c. 45.3
d. 45.4
Combined Mean:
30. The means of two samples of sizes 50 and 100 are 54 and 51 respectively. Find the mean of
sample of size 150 formed by combing the two samples.
a) 52
b) 53
c) 52.5
d) 53.5
31. Find the mean of 30 members, where mean of 10 of them is 12 and mean of rest 20 is 9. [2019]
(A) 9 (B) 10 (C) 11 (D) 12.
32. The mean age of combined group of men and women is 25 years. If the mean age of group of
men is 26 and that of group of women is 21, then percentage of men and women in the group
respectively is:
a) 60, 40
b) 80, 20
c) 30, 70
d) 50, 50
33. The mean of a group of 250 items was 15. If the mean of 100 of these items are 15. find the
mean of the remaining 150 items.
a) 15
b) 16
c) 17
d) 18
41. If the mean of the following distribution is 2.6, then the value of y is
Variable (x) 1 2 3 4 5
frequency 4 5 y 1 2
a) 13
b) 24
42. The A.M. calculated from the following frequency distribution is known to be 117 Lbs, find the
value of x.
Weight in Ibs 100 110 120 X + 25 140 Total
No. of Person 1 4 2 2 1 10
a) 90
b) 110
c) 100
d) None of these
43. If the average weight of the following frequency distribution is 118 lbs then find the value of x.
Weight (in Ibs) 95-105 105- 115 115-125 125- 135 135-145
Frequency 10 40 20 x 10
a) 20
b) 30
c) 40
d) 50
44. The A.M. of the following distribution is 67·45 inches. Then missing frequency is [2020]
45. Find the out the missing frequencies of the following data, given that A. M. is 67.45 inches.
Height (inches) 60-62 63-65 66-68 69-71 72-74 Total
No. of students 5 18 a b 8 100
a) 42, 27
b) 41, 28
c) 43, 26
d) 40, 29
Mode:
Calculation of Mode-Ungrouped Data:-
The mode of a set of observation is that value which appears most frequently or with the greatest
frequency.
Knowing any two values out of the three, we can compute the third from these relationships.
𝑁𝑁+1 7+1
Median =
2
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 2
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 4th term = 70
𝑁𝑁+1 6+1
Median =
2
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 2
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 3.5th term
𝑁𝑁+1 10 +1
Median =
2
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 2
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 5.5th term
𝑁𝑁 13
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 6.5 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 40 − 60 , Hence Median class = 40 − 60
2 2
N = total frequency = 13
6.5− 3
Median = 40 + × 20 = 40 + 17.5 = 57.5
4
𝑁𝑁 60
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 30𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 50.5 − 60.5, Hence Median class = 50.5 − 60.5
2 2
N = total frequency = 60
i = class width = upper class boundary - lower class boundary = 60.5 – 50.5 = 10
30− 20
Median = 50.5 + × 10 = 50.5 + 5 = 55.5
20
Computation of Median
Class boundaries f Cf
(less than)
Less than 10 10 10
10-20 20 30
20-30 30 60
30-40 50 110
40-50 40 150
50 & More 30 180
N = ∑f= 180
𝑵𝑵
−𝒄𝒄.𝒇𝒇
Median = L + 𝟐𝟐 𝒇𝒇
× 𝒊𝒊
𝑁𝑁 180
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 90𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 30 − 40, Hence Median class = 30 − 40
2 2
90− 60
Median = 30 + × 10 = 30 + 6 = 36
50
Solution:
Class 20-30 has the highest frequency, hence Modal class is 20-30.
f m − f1
Mode = L + x i
2
m f − f 1 − f 2
Here
l = lower class boundary of the modal class = 20
fm = frequency of the modal class = 20
f1 = frequency of the class preceding the modal class. = 8
f2 = frequency of the class succeeding the modal class. = 12
i = Class width of the modal class = 30 - 20 = 1
20 −8
Mode = 20 + �(2 𝑥𝑥 20)−8 −12� x 10
12
= 20 + �20� x 10 = 20 + 6 = 26
35+𝑎𝑎
− (3)
∴ 28 - 20 = 2
2
35+𝑎𝑎
∴8 x 2 = − 3
2
35+𝑎𝑎
∴ 16 + 3= ; 35 + a = 38; a = 38 – 35 = a
2
f m − f1
Mode = L + x i
2
m f − f 1 − f 2
28−22
Or, 66 = 60 + �(2×28)−22−𝑎𝑎� x 10
6
Or, 6 = �34−𝑎𝑎� x 10
Or, 34 - a=10
Or, a=24
20−𝑎𝑎
Or, 26 = 20 +�(2×20)−𝑎𝑎−12�x 10
20−𝑎𝑎
Or, 6 = �28−𝑎𝑎�x 10
Median:
𝑁𝑁
−𝑐𝑐𝑐𝑐
Median = 𝐿𝐿 + 2
𝑓𝑓
× 𝑖𝑖
43+𝑏𝑏
− (11)
∴ 27 - 20 = 2
2
43+𝑏𝑏
∴7 x 2 = − (11)
2
43+𝑏𝑏
∴ 14 + 11=
2
∴ 25 x 2 = 43 + 𝑏𝑏,
b = 50 – 43 = 7. Hence a = 8, b = 7
2. The median of 12, 48, 30, 112, 62, 94, 75, 98 is [2020]
(A) 69 (B) 68·5 (C) 68 (D) 67·5.
4. In a moderately symmetrical series, the arithmetic mean, median and mode are related as:
a) Mean - Mode = 3 (Mean - Median)
b) Mean - Median = 2 (Median - Mode)
c) Median - Mode =3 (Mean - Median)
d) Mode – Median = 2 (Mean – Median)
5. If median and mode of a distribution are 14 and 10 respectively, then mean is [2019]
a) 18
b) 13
c) 12
d) 16
6. In an asymmetrical distribution the mode and mean are respectively ₹ 12·30 and ₹ 18·48 The
median value is [2020]
a) ₹ 15·39
b) ₹ 16·42
c) ₹ 15·49
d) ₹ 16·30.
7. The Median of a set of 10 numbers is 20. If each number is first multiplied by 2 and then
increased by 5, then what is the mean of new numbers?
a) 40
b) 45
14. Construct the grouped frequency distribution from the following data and hence find median
& Mode from the following:
Marks Less than 10 Less than 20 Less than 30 Less than 40 Less than 50 Less than 60
No. of students 175 360 680 790 900 1000
[Ans. Median = 24.375, Mode = 23.91]
16. The median of the following frequency distribution is known to be 28. The value of a is: [2017]
Dividend (%) 10-20 20-30 30-40 40-50
No. of Companies 3 20 12 a
a) 7
b) 3
c) 4
d) 2
18. The median of the following frequency distribution are known to be 27. Find the values of a.
19. The mode of the following frequency distribution is 26 Find the values of a
Values 0-10 10-20 20-30 30-40 40-50
Frequency 3 a 20 12 7
a) 8
b) 6
c) 9
d) 7
Range:-
Range = L-S; Where L = largest value and S= smallest value.
In case of a grouped frequency distribution, range is defined as the difference between the upper class
boundary of the highest class and the lower class boundary of the smallest class.
Coefficient of Range:
𝐋𝐋−𝐒𝐒
Coefficient of range = x 100
𝐋𝐋+𝐒𝐒
1. What is the coefficient of range for the following? 80, 65, 90, 60, 75, 70, 72, 85
Solution
L = largest value = 90; S= smallest value = 60; Range = 90 – 60 = 30
𝟗𝟗𝟗𝟗−𝟔𝟔𝟔𝟔
Coefficient of range = x 100 = 20%
𝟗𝟗𝟗𝟗+𝟔𝟔𝟔𝟔
N = total frequency
3𝑁𝑁
−𝑐𝑐.𝑓𝑓
Q3 = L+ 4 𝑓𝑓
× 𝑖𝑖
N = total frequency
2𝑁𝑁 𝑁𝑁
−𝑐𝑐.𝑓𝑓 −𝑐𝑐.𝑓𝑓
Q2 = Median = L+ 4 𝑓𝑓
× 𝑖𝑖 = L+ 2 𝑓𝑓
× 𝑖𝑖
𝑁𝑁+1 7+1
Q1 =
4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 2nd term = 40
3 (𝑁𝑁+1) 3 (7+1)
Q3 =
4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 6th term = 90
Q3 − Q.1 90−40
Coefficient of Q.D. = × 100 = 90+40 x 100 = 23.08%
Q3 + Q.1
𝑁𝑁+1 6+1
Q1 =
4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 1.75th term
= 37.5
3 (𝑁𝑁+1) 3 (6+1)
Q3 =
4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 5.25th term
= 82.5
Q3 − Q.1 82.5−37.5
Coefficient of Q.D. = × 100 = 82.5+37.5 x 100 = 37.5%
Q3 + Q.1
- 71 – Admission going on for Regular/Crash course for B.com. Call
Bhalotia Classes (9883034569/8820696761): 1st Semester (Statistics)
𝑁𝑁+1 10+1
Q1 =
4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 2.75th term
= 20
3 (𝑁𝑁+1) 3 (10+1)
Q3 =
4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 4
𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 8.25th term
= 40
Q3 − Q.1 40−20
Coefficient of Q.D. = × 100 = 40+20 x 100 = 33.33 %
Q3 + Q.1
For the given data, we can form the required table with the cumulative frequency as –
Cumulative Frequency
Marks Frequency
(less than type)
Less than 10 10 10
10-20 20 30
20-30 30 60
30-40 50 110
40-50 40 150
Similarly, the third quartile will lie in the 40-50 marks range. Calculation –
3N
− c. f .
Q3= L + 4 ×I
f
Here, L = 40;
i = 10
c.f = 110;
f = 40;
n = 180
Thus, Q3 = 46.25
Now, using the values for Q1 and Q3, now we can calculate the Quartile Deviation and its
coefficient as follows –
Quartile Deviation = Semi-Inter Quartile Range
Q − Q1
Quartile deviation (Q.D.) = 3 ,
2
= (46.25–25)/2
=(21.25)/2
= 10.625
Coefficient of Quartile Deviation
Q − Q.1
Coefficient of Q.D. = 3 × 100
Q3 + Q.1
= {(46.25–25) / (46.25+25) } × 100
= 21.25 / 71.25 × 100
= 29.82 %
Solution:
Computation of Quartile deviation
Class interval f Cf
(less than type)
20-40 3 3
40-60 4 7
60-80 5 12
80-100 1 13
N = ∑f=13
𝑵𝑵
−𝒄𝒄.𝒇𝒇
Q1 = L + 𝟒𝟒
𝒇𝒇
× 𝒊𝒊
𝑁𝑁 13
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 3.25𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 40 − 60 , Hence Q1 class = 40 − 60
4 4
3.25−3
Q1 = 40 + × 20 = 40 + 1.25 = 41.25
4
𝟑𝟑𝟑𝟑
−𝒄𝒄.𝒇𝒇
Q3 = L + 𝟒𝟒
𝒇𝒇
× 𝒊𝒊
3𝑁𝑁 39
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 9.75 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 60 − 80 , Hence Q3 class = 60 − 80
4 4
9.75−7
Q3 = 60 + × 20 = 60 + 11 = 71
5
𝟕𝟕𝟕𝟕−𝟒𝟒𝟒𝟒.𝟐𝟐𝟐𝟐 𝟐𝟐𝟐𝟐.𝟕𝟕𝟕𝟕
Quartile deviation (Q.D.) = 𝟐𝟐
= 𝟐𝟐
= 14.875
Class boundaries f Cf
(less than)
30.5-40.5 8 8
40.5-50.5 12 20
50.5-60.5 20 40
60.5-70.5 15 55
70.5-80.5 5 60
N = ∑f=60
𝑵𝑵
−𝒄𝒄.𝒇𝒇
Q1 = L + 𝟒𝟒 𝒇𝒇
× 𝒊𝒊
𝑁𝑁 60
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 15𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 40.5 − 50.5 , Hence Q1 class = 40.5 − 50.5
4 4
i = class width = upper class boundary = lower class boundary = 50.5 – 40.5 = 10
15−8
Q1 = 40.5 + × 10 = 40.5 + 5.83 = 46.33
12
𝟑𝟑𝟑𝟑
−𝒄𝒄.𝒇𝒇
Q3 = L + 𝟒𝟒
𝒇𝒇
× 𝒊𝒊
3𝑁𝑁 3 𝑥𝑥 60
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 45𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 60.5 − 70.5 , Hence Q3 class = 60.5 − 70.5
4 4
45−40
Q3 = 60.5 + × 10 = 60.5 + 3.33 = 63.83
15
𝟔𝟔𝟔𝟔.𝟖𝟖𝟖𝟖−𝟒𝟒𝟒𝟒.𝟑𝟑𝟑𝟑 𝟏𝟏𝟏𝟏.𝟓𝟓𝟓𝟓
Quartile deviation (Q.D.) = 𝟐𝟐
= 𝟐𝟐
= 8.75
2. Which of the following divide a group of data into ten equal subgroups? [2017]
a) Percentiles
b) Median
c) Deciles
d) Quartiles
7. The third quartile of the numbers 22, 36, 14, 23, 40 is [2019]
(a) 36
(b) 30
(c) 38
(d) None of these.
8. The quartile deviation of 22, 17, 25, 20, 29, 27 and 35 will be [2018]
a) 5.5
b) 6
c) 8
d) 4.5
- 77 – Admission going on for Regular/Crash course for B.com. Call
Bhalotia Classes (9883034569/8820696761): 1st Semester (Statistics)
9. Find quartile deviation If Q1 = 10, Q2 = 12 and Q3, = 14
(a) 2
(b) 4
(c) 0
(d) 1
10. If the 1st Quartile is 142 and semi-inter quartile range is 18. What would be the 3rd quartile?
a) 151
b) 160
c) 178
d) None
11. Find the Quartile Deviation from the following income distribution of a group f workers:
Wages in Rs. 20 32 61 75 82 93
No. of workers 2 4 7 5 4 2
a) Rs. 25.50
b) Rs. 20.50
c) Rs. 22.50
d) Rs. 21.50
13. From the following data, find Quartile Deviation and its Coefficient.
x 10-15 15-20 20-25 25-30 30-35 35-40 40-45 45-50
f 8 10 12 15 10 7 8 5
a) 26.32
b) 28.32
c) 27.32
d) None of these
16. Calculate the 3rd decile, 45th percentile & 3rd Quartile from the following distribution:
80, 90, 70, 100, 80, 60, 50, 30
Step 2: Find the distance of each value from that mean and add them:
3.75
= 9
𝑥𝑥 100
= 41.67 %
d = x - 𝑥𝑥 = x – 6
∑ IdI 12
The mean deviation about mean =
𝑁𝑁
= 6
=2
d = x - 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 = x – 6
X d=x -6 IdI
5 -1 1
3 -3 3
7 1 1
8 2 2
4 -2 2
9 3 3
∑X=36 ∑ IdI =12
∑ IdI 12
The mean deviation about median =
𝑁𝑁
= 6
=2
X f fx d = x - 12 IdI f IdI
10 1 10 -2 2 2
11 2 22 -1 1 2
12 4 48 0 0 0
13 2 26 1 1 2
14 1 14 2 2 2
∑f=10 ∑fX=120 ∑ f IdI = 8
∑ 𝑓𝑓𝑓𝑓 120
Mean = 𝑥𝑥 = ∑ 𝑓𝑓
= 10
= 12
d = x - 𝑥𝑥 = x – 12
∑ f IdI 8
The mean deviation about mean =
∑ 𝑓𝑓
= 10
= 0.8
d = x - 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 = x – 12
X f d = x - 12 IdI f IdI
10 1 -2 2 2
11 2 -1 1 2
12 4 0 0 0
13 2 1 1 2
14 1 2 2 2
∑f=10 ∑ f IdI = 8
∑ f IdI 8
The mean deviation about median =
∑ 𝑓𝑓
= 10
= 0.8
d = x - 𝑥𝑥 = x – 51.625
∑ f IdI 921.5
The mean deviation about mean =
∑ 𝑓𝑓
= 80
= 11.52
d = x - 𝑥𝑥 = x – 74
∑ f IdI 1904
The mean deviation about mean =
∑ 𝑓𝑓
= 100
= 19.04
Solution:
Computation of mean deviation about median
𝑁𝑁
−𝑐𝑐.𝑓𝑓
Median = L+ 2 𝑓𝑓
× 𝑖𝑖
𝑁𝑁 10
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 5𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 40 − 60 , Hence Median class = 40 − 60
2 2
N = total frequency = 10
5−3
Median = 40 + 4
× 20 = 40 + 10 = 50
d = x - 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 = x – 50
∑ f IdI 140
The mean deviation about median =
∑ 𝑓𝑓
= 10
= 14
𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑 𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀
Coefficient of mean deviation about median =
𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀
𝑥𝑥 100
14
= 50
𝑥𝑥 100 = 28 %
Solution:
Computation of mean deviation about median
𝑁𝑁
−𝑐𝑐.𝑓𝑓
Median = L+ 2
𝑓𝑓
× 𝑖𝑖
𝑁𝑁 60
= 𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 = 30𝑡𝑡ℎ 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏𝑏 50.5 − 60.5 , Hence Median class = 50.5 - 60.5
2 2
N = total frequency = 60
i = class width = upper class boundary = lower class boundary = 60.5 – 50.5 = 10
30−20
Median = 50.5 + 20
× 10 = 40 + 5 = 55.5
d = x - 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 = x – 55.5
∑ f IdI 530
The mean deviation about median =
∑ 𝑓𝑓
= 60
= 8.83
𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑 𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀
Coefficient of mean deviation about median =
𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀
𝑥𝑥 100
8.83
= 55.5
𝑥𝑥 100 = 15.92 %
𝟐𝟐
S.D. or 𝝈𝝈 = �∑𝒙𝒙𝑵𝑵
Where x = X – A.M. and N= Total no. of observation
(ii)Assumed Mean Method:
𝟐𝟐 ∑𝒅𝒅
S.D. or 𝝈𝝈 = �∑𝒅𝒅𝑵𝑵 – ( ).𝟐𝟐
𝑵𝑵
Where d = x-A
Discrete Series & Continuous Series:-
If X1, X2, …, XN are the grouped data with frequencies f1, f2, …, fN then the standard deviation is given
by
(i)Direct method:
𝟐𝟐
S.D. or 𝝈𝝈 = �∑𝒇𝒇𝒙𝒙
𝑵𝑵
Variance:-
Definition: The variance of a given set of observation is defined as the square of its standard deviation
and is denoted by Thus Variance = S.D.2
where, d1 =𝑋𝑋�1 - 𝑋𝑋� d2= 𝑋𝑋�2 - 𝑋𝑋� where 𝑋𝑋� is the combined mean.
Coefficient of S.D. or C.V.:-
The standard deviation is an absolute measure of dispersion, depending upon the units of measurement. It
does not tell us much about the variability of a single set of data. The coefficient of standard deviation,
based on standard, is a relative measure of dispersion and is given by
𝐒𝐒.𝐃𝐃.
Coefficient of Standard Deviation or C.V.= x 100
𝐌𝐌𝐌𝐌𝐌𝐌𝐌𝐌
𝝈𝝈
= � x 100
𝑋𝑋
which expresses the standard deviation as a percentage of the mean. Since C.V. is a measure of relative
variation expressed as a percent, the coefficient of variation can be used to compare the variability of two
are or more sets of data even when the observations are expressed in different units of measurement.
A distribution for which the coefficient of variation is smaller is said to be less variable or more
consistent, more uniform, more stable or more homogeneous. On the other hand, the distribution for
which the coefficient of variation is greater is said to be more variable or less consistent, less uniform,
less stable or less homogeneous.
3. If all the numbers of the distribution are equal then the standard deviation will be 0.
a−b
Standard deviation =
2
5. Variance = (S.D)2
S.D
7. Coefficient of variation = x 100
Mean
∑ 𝑥𝑥 42
Mean = 𝑥𝑥 = 𝑁𝑁
= 6
=7
∑ 𝑥𝑥 2 ∑ 𝑥𝑥 2
364 42 2
S.D. = 𝜎𝜎𝑥𝑥 = � − � 𝑛𝑛 � = � 6 − � 6 � = √60.67 − 49 = 3.42
𝑛𝑛
𝑆𝑆.𝐷𝐷. 3.42
C. V. =𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 × 100 = 7
× 100 = 48.795 %
728 −180 2
=� −� � × 10
150 150
= √4.853 − 1.44 × 10
= 1.847 x 10 = 18.47
x = A+
∑ fd × i = 55.5 + − 180 × 10 = 55.5 − 12 = 43.5
N 150
𝑆𝑆.𝐷𝐷 18.47
∴ 𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶 𝑜𝑜𝑜𝑜 𝑣𝑣𝑣𝑣𝑣𝑣 𝑖𝑖 𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎 = 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 × 100 = 43.5 × 100 = 42.46 %
2
∑ 𝑓𝑓𝑥𝑥 2 ∑ 𝑓𝑓𝑓𝑓
∴ 𝑆𝑆. 𝐷𝐷. = � −� �
∑ 𝑓𝑓 ∑ 𝑓𝑓
2000 400 2
=� −� �
100 100
= √20 − 16 =2
∑ 𝑓𝑓𝑓𝑓 400
𝑥𝑥 = � ∑ 𝑓𝑓 � = �100� = 40
𝑆𝑆.𝐷𝐷 2
∴ 𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶 𝑜𝑜𝑜𝑜 𝑣𝑣𝑣𝑣𝑣𝑣 𝑖𝑖 𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎 = 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 × 100 = 40 × 100 = 5 %
S.D.= σ x =
∑u 2
∑u
− = 6500 0
2
− = 650 = 25.5
n n 100 10
𝑆𝑆.𝐷𝐷. 25.5
C. V. =𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 × 100 = 46
× 100 = 55.43 %
For batsman B:
Mean= y = ∑ =
y 500
= 50
n 10
∑ 𝑣𝑣 2 ∑ 𝑣𝑣 2 5968 0 2
S.D.=𝜎𝜎𝑦𝑦 = � − � 𝑛𝑛 � = � − �10� = √596.8 = 24.43
𝑛𝑛 10
S .D. 24.43
C.V.= × 100 = × 100 = 48.86%
Mean 50
Since the C.V. of the scores of the batsman B is less than that of A, the batsman B is more
consistent in scoring.
()
Mean x and S.D. (σ ) of combined set are given by
Obtain the mean and standard deviation of combining the two samples.
Solution
= ₹ 4729.166
= 63344.5
∴ 𝜎𝜎 = √63344.5 = 251.68
∑ 𝑥𝑥 2 ∑ 𝑥𝑥 2
S.D = 𝜎𝜎𝑥𝑥 = � − � 𝑛𝑛 �
𝑛𝑛
∑ 𝑥𝑥 2
5.1 = � 100 − (40)2
∑ 𝑥𝑥 2
26.01 = − 1600
100
∑ 𝑥𝑥 2
1626.01 = 100
∑ 𝑥𝑥 2 = 162601
∑ 𝑥𝑥 3990
Correct Mean = 𝑥𝑥 = 𝑁𝑁
= 100
= 39.90
∑ 𝑥𝑥 2 ∑ 𝑥𝑥 2
Correct S.D.=𝜎𝜎𝑥𝑥 = � − � 𝑛𝑛 �
𝑛𝑛
∑ 161701 3990 2
=� 100
− � 100 �
= √1617.01 − 1592.01
= √25
=5
Hence ∑ 𝑥𝑥 = 45 x 10 = 450
∑ 𝑥𝑥 2 ∑ 𝑥𝑥 2
S.D = 𝜎𝜎𝑥𝑥 = � − � 𝑛𝑛 �
𝑛𝑛
∑ 𝑥𝑥 2
5.8 = � − (45)2
10
∑ 𝑥𝑥 2
33.64 = − 2025
10
∑ 𝑥𝑥 2
2058.64 = 10
∑ 𝑥𝑥 2 = 20586.4
∑ 𝑥𝑥 508
New Mean = 𝑥𝑥 = 𝑁𝑁
= 11
= 46.18
∑ 𝑥𝑥 2 ∑ 𝑥𝑥 2
New S.D.=𝜎𝜎𝑥𝑥 = � − � 𝑛𝑛 �
𝑛𝑛
23950.4
=� 11
− (46.18)2
= √2177.31 − 2132.59
= √44.72 = 6.687
1 + 2 + 6 + 𝑎𝑎 + 𝑏𝑏
4.4 =
5
22 = 9 + a + b
a + b = 22 – 9 = 13……………………………………………………………………..(1)
∑ 𝑥𝑥 2 ∑ 𝑥𝑥 2
Variance = 𝑛𝑛
−� �
𝑛𝑛
∑ 𝑥𝑥 2
8.24 = − (4.4)2
5
∑ 𝑥𝑥 2
8.24 + 19.36 = 5
∑ 𝑥𝑥 2 = 138
12 + 22 + 62 + a2 + b2 = 138
a2 + b2 = 97
(a−b)2 + 132 = 2 x 97
⇒ (a−b) = 5……………………………………………………………………..(2)
𝑎𝑎 + 𝑏𝑏
15 =
2
a + b = 30……………………………………………………………………..(1)
𝑎𝑎− 𝑏𝑏
S.D of two numbers =
2
𝑎𝑎 − 𝑏𝑏
3=
2
a - b = 6 ……………………………………………………………………..(2)
Solution:
∑ 𝑥𝑥 2 ∑ 𝑥𝑥 2
S.D.=𝜎𝜎𝑥𝑥 = � − � 𝑛𝑛 �
𝑛𝑛
200 20 2
= � 10 − �10�
= √20 − 4
= √16 = 4
Variance = (S.D)2 = 42 = 16
Hence 𝝈𝝈y = 5 x 7 = 35
𝑯𝑯𝑯𝑯𝑯𝑯𝑯𝑯𝑯𝑯 𝟗𝟗 = 𝟑𝟑 . 𝜎𝜎x
9
Hence 𝝈𝝈x = =3
3
Solution:
6
60 = 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 × 100
6
Mean = 60 × 100 = 10
33. From the following data determine in which firm A or B, there is greater variability in individual
wages:
Firm A Firm B
Average hourly wages Rs. 52.50 Rs. 47.50
Variance 100 121
a) B
b) A
c) Same consistency
d) Cannot be determined with the given data.
2. Asymmetrical Distribution:-
A distribution which is not symmetrical is called a skew distribution and such a distribution could either
be positively skewed or negatively skewed.
(i) Positively Skewed Distribution: In the positively skewed distribution. The value of the mean is
maximum and that of mode least-the median lies in between the two.
(ii) Negatively Skewed Distribution: In the positively skewed distribution. The value of the mode is
maximum and that of mean least the median lies in between the two.
Kurtosis:-
Kurtosis refers to the convexity of the frequency distribution curve. In other words, it is cocerned with the
Flatness or peakedness of the frequency curve.
The degree of kurtosis of a distribution is measured relative to the peakedness of a curve.
It the curve of a distribution is more peaked than the normal curve, distribution is said to be leptokurtic.
If the distribution curve is more float than the normal curve. the distribution is said to be platykurtic. The
normal distribution is said to be mesokurtic.
Kurtosis is measured by the coefficient β2 which is defined by the formula
m4
β2 =
𝑚𝑚22
(i)If β2 > 3 then curve is Leptokurtic
(ii) β2= 3, then curve is Meso-Kurtic
(iii) ) β2<3 then curve is Platy-Kurtic
1. Karl-Pearson Method
Question:
For a moderately skewed distribution, mean = 172, median = 167 and S.D = 60. Find
coefficient of skewness.
Solution:
Coefficient of skewness = 3 (Mean - Median)/SD
Mean = 172; Median = 167; SD = 60
coefficient of skewness = 3 (172 - 167)/60 = 15/60 = 1/4 = 0.25
2. Karl-Pearson Method
Question:
For a moderately skewed distribution, Mode = 157, median = 167 and S.D = 60. Find
coefficient of skewness.
Solution:
coefficient of skewness = (Mean - Mode)/SD
as Mean is not given,
Mode = 3 Median – 2 Mean; 157 = (3 x 167) – 2 Mean
2 Mean = 501 – 157 = 344; Mean = 172
coefficient of skewness = (Mean - Mode)/SD = (172 – 157)/60 = 0.25
3. Karl-Pearson Method
Question:
The Karl Pearson's coefficient of skewness of a distribution is 0.64. It’s S.D. is 13 and the
mean is 59.2. Find the Mode & median.
Solution:
4. Karl-Pearson Method
Question:
The median, mode and coefficient of skewness for a certain distribution are
respectively 17.4, 15.3 and 0.35. Find The coefficient of variation.
Solution:
As Mean is not given,
Mode = 3 Median – 2 Mean;
15.3 = (3 x 17.4) – 2 Mean
2 Mean = 52.2 –15.3 = 344;
Mean = 18.45
Coefficient of skewness = (Mean - Mode)/S.D
0.35= (18.45 – 15.3)/S.D , Hence S.D = 9
coefficient of variation = S.D ÷ Mean = 9 ÷ 18.45 = 0.4878 = 48.78%
5. Karl-Pearson Method
Question:
For a group of 10 items, ∑x = 452, ∑x2 = 24270 and mode = 43.7. Find the Pearson's
measure of skewness.
Solution:
6. Karl-Pearson Method
Question:
The sum of 25 observations is 500 and sum of their squares is 11225. Find the
Pearson's measure of skewness.
Solution:
7. Karl-Pearson Method
Question:
For a moderately skewed distribution, mean = 20, median = 17 and coefficient of variation
is 20%. Find coefficient of skewness.
Solution:
8. Karl-Pearson Method
Question:
The Karl Pearson's coefficient of skewness of a distribution is 0.4. It’s S.D. is 8 and the
mean is 30. Find the Mode & median.
9. Karl-Pearson Method
Question:
The karl person’s coefficient of skewness of a distribution is 0.32. Its S.D. is 6.5 and
the mean is 29.6. Find the Mode.
Solution:
Karl person’s first measure of skewness:
𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀−𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 29.6−𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀
Coefficient of Skewness = , 𝑜𝑜𝑜𝑜 0.32 = ,
𝑆𝑆.𝐷𝐷. 6.5
− 15 150
Or, S.D. = = = 25.
− 0.6 6
10
0-10 10 5 10 -3 -30 90
10-20 40 15 50 -2 -80 160
20-30 20 25 70 -1 -20 20
30-40 0 35 70 0 0 0
40-50 10 45 80 +1 10 10
50-60 40 55 120 +2 80 160
60-70 16 65 136 +3 48 144
70-80 14 75 150 +4 56 244
∑ fd
'
∑
'2
fd
= 64 = 828
76+26 –(2×46) 10
= = 50 = 0.2.
76−26
Skewness = m3/(S.D)3
= - 6/(√15)3
= 0.103
Solution:
Computation of 1st four Moments about 5
X f d= x-5 fd f d 2 fd 3 fd 4
1 1 -4 -4 16 -64 256
2 6 -3 -18 54 -162 486
3 13 -2 -26 52 -104 208
4 25 -1 -25 25 -25 25
5 30 0 0 0 0 0
6 22 1 22 22 22 22
7 9 2 18 36 72 144
8 5 3 15 45 135 405
9 2 4 8 32 128 512
113 0 -10 282 2 2058
25. Kurtosis
The 2nd and 4th central moments of a distribution be 5 and 75 respectively, find the Kurtosis
of the distribution.
Solution:
2nd central moment (m2) = variance = 5
4th central moment (m4) = 75
Kurtosis is measured by the coefficient β2 which is defined by the formula
m4 75 75
β2 =
𝑚𝑚22
=
5 2 = 25
=3
As β2 = 3, hence the ditribution is meso -kurtic.
26. Kurtosis
The second and fourth central moments of a distributions are 6 and 27 respectively then find
the kurtosis of the distribution.
Solution:
2nd central moment (m2) = variance = 6
4th central moment (m4) = 27
Kurtosis is measured by the coefficient β2 which is defined by the formula
m4 27 27
β2 = = = = 0.75
𝑚𝑚22 62 36
As β2 < 3, hence the ditribution is Platy -kurtic.
27. Kurtosis
The S.D. of a symmetrical distribution is 3. If the distribution is meso-kurtic, the find the
value of 4th central moment.
Solution:
2nd central moment (m2) = variance = 32 = 9
4th central moment (m4) = ?
m4
Kurtosis is measured by the coefficient β2 which is defined by the formula β2 =
𝑚𝑚22
As the distribution is meso-kurtic, hence β2 =3
m4 m4 m4
3= 2 = 2 =
𝑚𝑚2 9 81
Hence m4 = 81 𝑥𝑥 3 = 243
9. The Karl Pearson's coefficient of skewness of a distribution is 0.32. Its S.D. is 6.5 and the mean is
29.6. Find the Mode.
10. Calculate Karl Pearson’s coefficient of skewness from the following data:
x 4-6 6-8 8 - 10 10 - 12 12 - 14 14 - 16 16 - 18 18 - 20
f 70 150 220 100 35 20 10 5
Bowley's Method
11. If Q1 = 52, Q2 = 152, Q3 = 262, then skewness is [2018 type]
(a) 0.28
(b) 0.48
(c) 0.68
(d) 0.88
12. Find the co-efficient of skewness if Q1 = 25, Q2 = 36 and Q3 = 45.
(a) - 0.1
(b) 0.2
(c) 0.1
(d) - 0.2
13. If Q1 = 26, Q3 = 76 and co-efficient of skewness = 0.2, find median. [2020 type]
(a) 46
(b) 52
(c) 56
(d) 51
14. For some symmetrical distribution, Q1 = 36, Q3 = 63, find median
(a) 49.5
(b) 99
(c) 50
(d) 49
15. In a symmetrical distribution, Q3 – Q1 = 20, median = 15. Q3 is equal to:
(a) 5
(b) 15
(c) 20
(d) 25
16. In a distribution the difference of 1st & 3rd quartile is 2.03, their sum is 72.67 and the median
is 36.18, find the skewness.
(a) - 0.15
(b) 0.15
(c) 0.20
(d) 0.25
20. Using Bowley's formula find the co-efficient of skewness of the following frequency distribution
Marks 0 -1 0 10 - 20 20 - 30 30 -40 40 - 50
Frequency 5 9 12 8 6
Moment's Measure
21. The lack of uniformity or symmetry is called:
(a) Skewness
(b) Dispersion
(c) Kurtosis
(d) Standard deviation
22. The moments about mean are called:
(a) Raw moments
(b) Central moments
(c) Moments about origin
23. The moments about origin are called:
(a) Moments about zero
(b) Raw moments
(c) Both (a) and (b)
24. Value of the 2nd Central Moment is [2020]
(a) Zero
(b) Mean
(c) Variance
(d) Median.
25. Value of the 1st Central Moment is [2020]
(a) Zero
(b) Mean
(c) Variance
(d) Median.
39. Find the first, second and third moments about 9 from the following data :
Value 2 4 6 8 10 12
Frequency 6 9 7 15 7 6
40. The first two central moments of the following distribution are : [2020]
Income (₹) 62 65 68 72 75
Frequency 5 18 42 27 8
(A) 0, 11·02 (B) 0, 10 (C) 1, 13·5 (D) 1, 11·02.
Kurtosis
41. The degree of peakedness or flatness of a unimodel distribution is called:
(a) Skewness
(b) Symmetry
(c) Dispersion
(d) Kurtosis
42. The second and fourth moments about mean are 4 and 48 respectively, then the distribution is:
(a) Leptokurtic
(b) Platykurtic
(c) Mesokurtic or normal
(d) Positively skewed
43. A distribution is said to be Platy-Kurtic when [2020]
(A) β > 3
(B) β2 = 3
(C) β2 < 3
(D) β2 = 0.
44. The second and fourth order central moments of a distribution are 5 and 41 respectively. The
co-efficient of kurtosis and the nature of kurtosis are: [2017]
(a) -1.36, lepto kurtic
(b) 1.64, meso kurtic
(c) 1.64, Platy kurtic
(d) -1.36, platy kurtic
45. The first four central moments of a distribution are 0, 16, -36 and 120. Comment on the
kurtosis of the distribution
(a) 0.469, the distribution is platy kurtic.
(b) 0.469, the distribution is Meso kurtic.
(c) 0.469, the distribution is Lepto kurtic.
(d) - 0.469, the distribution is platy kurtic.
46. The S.D. of a symmetrical distribution is 3. If the distribution is meso-kurtic, the value of 4th
central moment will be [2018]
(a) 81
(b) 243
(c) 9
(d) 27
47. The second and fourth central moments of a distributions are 6 and 27 respectively then the
kurtosis of the distribution will be [2018]
(a) 0.75
(b) 1.04
(c) 1.30
(d) 2.15
48. The 2nd and 4th central moments of a distribution be 5 and 75 respectively, find the Kurtosis
of the distribution.
2. What Is Extrapolation?
The process of computing the value of the function outside the given range is called extrapolation. The
term interpolation however, is taken to include extrapolation.
3. Interpolation Example
Here's an example that will illustrate the concept of interpolation. A gardener planted a tomato plant
and she measured and kept track of its growth every other day. This gardener is a curious person, and
she would like to estimate how tall her plant was on the fourth day.
Her table of observations looked like this:
Based on the chart, it's not too difficult to figure out that the plant was probably 6 mm tall on the
fourth day. This is because this disciplined tomato plant grew in a linear pattern; there was a linear
relationship between the number of days measured and the plant's height growth. Linear pattern means
the points created a straight line. We could even estimate by plotting the data on a graph. the
interpolation formula would come in handy.
Going back to the tomato plant example, the first set of values for day three are (3,4), the second set of
values for day five are (5,8), and the value for x is 4 since we want to find the height, y, on the fourth day.
After substituting these values into the formula, calculate the estimated height of the plant on the fourth
day.
5. Methods of interpolation
There are broadly two methods of interpolation:
i. Graphical Method, (ii) Algebraic Method
Here we shall discuss only the algebraic method of interpolation. Different formulae or methods used
in algebraic method are:
1) Newton’s Forward Formula
2) Newton’s Backward Formula
3) Lagrange’s Central Formula
4) Lagrange’s Inverse Interpolation Formula
v(v + 1) 2 v(v + 1) (v + 2 ) 3
y = yn + v∆yn−1 + .∆ yn−2 + ∆ y n −3
1.2 1.2.3
v(v + 1) (v + 2 )...(v + n − 1) n
+ ∆ y0 ,
1.2.3....n
x − xn
Where v = and h = x1 − x0 = x2 − x1 = ... = xn − xn −1
h
In this case also we can express y as a function of x.
1. Difference table
In particular, ∆y0 = y1 − y0, ∆y1 = y2 − y1, . . . , ∆yn−1 = yn − yn−1. These are called
first order differences. The differences of the first order differences are called
second order differences. The second order differences are denoted by ∆2y0, ∆2y1, .
2. Difference table
Below are given the values of a function f ( x) for certain values of x :
x 0 1 2 3 4
f (x ) : 1 0 5 22 57
Construct the table of differences.
Solution. DIFFERENCE TABLE
x y = f (x ) ∆y ∆2 y ∆3 y ∆4 y
0 1
−1
1 0 6
5 6
2 5 12 0
17 6
3 22 18
35
4 57
5. Lagrange’s formula
Using Lagrange’s formula and hence find f(3)
x: 0 1 2 5
f (x): 2 3 12 147
Solution:
Here x0 = 0, x1 = 1, x2 = 2, x3=5
and y0 = 2, y1 = 3, y2 = 12, y3=147.
X=3
6. Lagrange’s formula
Find the missing term in the following table using interpolation:
x: 0 1 2 3 4
y: 1 3 9 ... 81
Solution:
Since the given data is unevenly spaced, therefore we use Lagrange’s interpolation formula:
(𝑥𝑥−𝑥𝑥1 )(𝑥𝑥−𝑥𝑥2 ) (𝑥𝑥−𝑥𝑥3 ) (𝑥𝑥−𝑥𝑥0 )(𝑥𝑥−𝑥𝑥2 ) (𝑥𝑥−𝑥𝑥3 )
Y =(𝑥𝑥 y 0 + y1
0 −𝑥𝑥1 )(𝑥𝑥0 −𝑥𝑥2 ) (𝑥𝑥0 −𝑥𝑥3 ) ( 𝑥𝑥1− 𝑥𝑥0 )(𝑥𝑥1 −𝑥𝑥2 ) (𝑥𝑥1 −𝑥𝑥3 )
7. Lagrange’s formula
Using Lagrange’s interpolation formula find f (10) from the following table:
Solution:
Here the intervals are unequal. By Lagrange’s interpolation formula we have
8. Miscellaneous
Prove: E ≡ 1 + Δ
Solution::
From the definition of Δ we know that
Δ f (x) = f (x + h ) − f (x) and
E[ f (x )] = f (x + h)
where h is the interval of difference.
Δ f (x) = f (x + h ) − f (x)
Δ f (x) = Ef (x ) − f (x)
⇒ Δ f (x) = (E −1) f (x)
Δ ≡ E −1;
∴ E≡1+Δ
- 132 – Admission going on for Regular/Crash course for B.com. Call
Bhalotia Classes (9883034569/8820696761): 1st Semester (Statistics)
9. Miscellaneous
The values of a functions f(x) are given: f(o) = 1, f(1) + f(2) = 10, f(3) + f(4) + f(5)
= 65. Then the value of f(4) will be: [2017]
Solution:
Consider f(x)= ax2+bx+c
f(x)=ax2+bx+c (why quadratic equation Because, 3 equations are given and we can have 3
variables)
Given:
• f(0) =1⟹ a (0)+ b (0) + c=1⟹ c =1
• f(1) + f(2)=10
f(1) + f(2)=10
⟹ 5a + 3b= 8……………………………………………(1)
f(3) + f(4) + f(5)=65
f(3)= a(3)2 + b (3)+1=9a+3b+1
f(4)=a (4)2 + b(4) +1 = 16a+ 4b +1
f(5)=a(5)2 + b (5) +1=25a+5b+1
f(3) +f(4)+f(5)=65
⟹ 9a+3b+1+16a+4b+1+25a+5b+1=65
25a+6b=31……………………………………………………………………………….(2)
5∗(1)−(2)= 25a+15b−25a−6b=40−31⟹ 9b = 9, hence b = 1
From (1):5a+3(1)=8⟹a=1;
∴f(x)=ax2+bx+c =x2+x+1
F (4)=42+4+1=16 + 4 + 1=21; Ans: 21
9. Newton’s forward and backward formula can be used only when values of the arguments are
(B) equidistant (B) not equidistant
(C) constant (D) none of these. [2020]
11. Given
X 4 6
Y 11 15
When x = 5 the value of y will be [2018]
(a) 14
(b) 13
(c) 12
(d) 10
12. Given
X 4 6
Y 11 15
When y = 13 the value of x will be
(a) 5.5
(b) 5
(c) 5.5
(d) 5.8
𝟐𝟐
13. ∆ 𝒚𝒚𝟏𝟏 can be expressed as [2018]
(a) y2 + 2y1 + y0
(b) 𝐲𝐲𝟑𝟑 − 𝟐𝟐𝐲𝐲𝟐𝟐 + 𝐲𝐲𝟏𝟏
(c) y2 − 2y1 + y0
(d) y3 + 2y2 + y1
14. The values of a functions f(x) are given: f(o) = 1, f(1) + f(2) = 10, f(3) + f(4) + f(5) = 65. Then the
value of f(4) will be: [2017]
(a) 7
(b) 13
(c) 21
(d) 31
15. The (n+1)th order forward difference of the nth degree polynomial is – [2019]
(A) n! (B) (n+1)! (C) 0 (D) (n-1)!
x 2 4 6 8
y 6 9 20 39
Lagrange's Formula:
29. The values of y are 10, 20 & 30 respectively when the values of x are 1, 4 and 5. Find by
suitable method of interpolation the value of y when x = 7.
(a) 40
(b) 35
(c) 45
(d) 50
30. Find the missing term in the following table using interpolation:
x: 0 1 2 3 4
y: 1 3 9 ... 81
(a) 27
(b) 35
(c) 31
(d) 50
Inverse Interpolation:
31. If f (1) = 10, f (2) = 15 and f (5) = 42, then the value of x when f (x) = 25 is
(A) 2·5 (B) 3·4 (C) 4·0 (D) 4·5
Miscellaneous:
32. From the following information, [2020]
x 0 1 2 3 4
y 1 0 5 10 8
the value of ∆3y1 is equal to
(A) – 6 (B) –7 (C) – 8 (D) – 9.
2017
1st Semester statistics
Group-A
1. During the year 2012-13 the Income (in crores) of a railway company of different section is as
follows:
Section 2012 2013
Coaching 10 15
Goods 30 25
Other 5 2
The component bar chart for the above data is:
(a) Answer (a)
(b)
(d)
3. The largest value is 60 and the smallest value is 40 and number of classes desired is 5 then the
length of the class interval is:
a) 4
b) 20
c) 12
d) 8
4. The second and fourth order central moments of a distribution are 5 and 41 respectively. The co-
efficient of kurtosis and the nature of kurtosis are:
a) -1.36, lepo kurtic
b) 1.64, meso kurtic
c) 1.64, lepto kurtic
d) -1.36, platykurtic
5. The values of a functions f(x) are given: f(o) = 1, f(1) + f(2) = 10, f(3) + f(4) + f(5) = 65. Then the
value of f(4) will be:
a) 7
b) 13
c) 21
d) 31
6. If the second and third central moments of a distribution are 4 and 10, then the skewness of the
distribution is:
a) 2.5
b) 0.4
c) 5
d) 1.25
7. The first quartile and third quartile of the following members: 88, 72, 33, 29,70, 86, 54 are
respectively
a) 33, 86
b) 32, 84
c) 33, 76
d) 32, 86
10. From the following table the value of y can be determined for the value of x = 3.5 by using
x 0 1 2 3 4
y 2 10 28 56 94
Group-B
11. The median of the following frequency distribution is known to be 28. The value of a is:
Dividend (%) 10-20 20-30 30-40 40-50
No. of 3 20 12 a
Companies
a) 7
b) 3
c) 4
d) 2
12. The S.D. of the given data: 49, 63, 46, 59, 65, 52, 60, 54 is:
a) 6.36
b) 6.21
c) 6.29
d) 6.12
13. The median, mode and coefficient of skewness for a certain distribution are respectively 17.4,
15.3 and 0.35. The coefficient of variation is:
a) 50.12%
b) 45%
c) 40%
d) 48.78%
15. The following pie-chart shows the annual agricultural yield of Punjab state for the year 2016:
If the total yield is 8100 tons, how much will be the yield of rice (In ton) and by how much
percentage the yield of Ragi will exceed the yield of rice? [CHECK]
a) 900 and 140%
b) 800 and 120%
c) 700 and 150%
d) 900 and 150%
2018
1st Semester statistics
Group –A
1. ∆2 𝑦𝑦1 can be expressed as
a) 𝑦𝑦2 + 2𝑦𝑦1 + 𝑦𝑦0
b) 𝒚𝒚𝟑𝟑 − 𝟐𝟐𝒚𝒚𝟐𝟐 + 𝒚𝒚𝟏𝟏
c) 𝑦𝑦2 − 2𝑦𝑦1 + 𝑦𝑦0
d) 𝑦𝑦3 + 2𝑦𝑦2 + 𝑦𝑦1
2. Cumulative frequency is of
a) One type
b) Two type
c) Three type
d) Four type
5. The second and fourth central moments of a distributions are 6 and 27 respectively then the
kurtosis of the distribution will be
a) 0.75
b) 1.04
c) 1.30
d) 2.15
8. The S.D. of a symmetrical distribution is 3. If the distribution is meso-kurtic the value of central
moment will be
a) 81
b) 243
c) 9
d) 27
9. Given
X 4 6
Y 11 15
When x = 5 the value of y will be
a) 14
b) 13
c) 12
d) 10
10. The variance of a variable x is 100. Then the standard deviation of a variable y = 5x – 10 is
a) 90
b) 50
c) 500
d) 490
Group – B
1. Using the following table the value of y will be
X 0 1 2 3
Y 1 10 49 142
a) 𝟒𝟒𝟒𝟒𝟑𝟑 + 𝟑𝟑𝟑𝟑𝟐𝟐 + 𝟐𝟐𝟐𝟐 + 𝟏𝟏
b) 𝑥𝑥 3 + 5𝑥𝑥 2 + 3𝑥𝑥 + 4
c) 4𝑥𝑥 3 + 2𝑥𝑥 2 + 3𝑥𝑥 + 1
d) 2𝑥𝑥 3 + 3𝑥𝑥 2 + 𝑥𝑥 + 3
2. If the mean and standard deviation f two numbers are respectively 15 and 3 then the numbers are
a) 10,20
b) 14,16
c) 12,18
d) 11,19
a) 3500
b) 4500
c) 5000
d) 6000
5. The quartile deviation of 22, 17, 25, 20, 29, 27 and 35 will be
a) 5.5
b) 6
c) 8
d) 4.5
2019
1st Semester statistics
Group – A
1. The (n+1)th order forward difference of the nth degree polynomial is –
10. For a moderately skewed distribution, mean = 172, median = 167 and S.D. = 60, the co-efficient
skewness is –
(A) 0.35 (B) 0.25 (C) 0.26 (D) 0.47
Group – B
1. Find Mean Deviation about Mean:
x 10 11 12 13 14
Y 3 12 18 12 3
(A) 0.80 (B) 0.70 (C) 0.85 (D) 0.75
3. The following pie-chart shows the marks obtained by a student in an examination , who scored
540 in total.
Hindi 60o; Mathematics 90o, History 72o, Science 75o, English 63o
The subject in which the student scored 108 marks is –
(A) English (B) History (C) Mathematics (D) Science.
4. Using Newtown’s Backward Interpolation formula find Y when x = 8 from the following table:
x 3 5 7 9
y 2 18 42 74
(A) 57 (B) 59 (C) 91 (D) 93
5. Find the mean of 30 members, where mean of 10 of them is 12 and mean of rest 20 is 9.
(A) 9 (B) 10 (C) 11 (D) 12.
2020
1st Semester statistics
Group - A
Answer any five questions. 2×5
2. Qualitative character of statistics is known as
(A) variable (B) attribute (C) data (D) information.
6. The mean deviation about Median from the following data will be 46, 79, 26, 85, 39, 65, 99,
29, 56, 72
(A) 20·4 (B) 19·4 (C) 19·3 (D) 20·9.
9. Newton’s forward and backward formula can be used only when values of the arguments
are
(A) equidistant (B) not equidistant
(C) constant (D) none of these.
11. The A.M of two numbers is 5 and their G.M. is 4. The value of H.M. is
(A) 4·47 (B) 3·2 (C) 6·25 (D) 5
Group - B
Answer any ten questions. 3×10
12. The harmonic mean of 2, a, 8 be 24/ , then the value of a is
7
(A) 9 (B) 4 (C) 6 (D) 8.
13. The median of 12, 48, 30, 112, 62, 94, 75, 98 is
(A) 69 (B) 68·5 (C) 68 (D) 67·5.
x 0 1 2 3 4
y 1 0 5 10 8
the value of ∆3y1 is equal to
(A) – 6 (B) –7 (C) – 8 (D) – 9.
18. For a distribution the coefficient of skewness = 0·25, mean = 50 and mode = 42. Then coefficient
ofvariation of the distribution is
(A) 54% (B) 60% (C) 58% (D) 64%.
20. For a distribution Q1 = 36, Q3 = 76 and coefficient of skewness = 0·3. Then the median is
21. The A.M. of the following distribution is 67·45 inches. Then the missing frequency is
Height (inch) 60–62 63–65 66–68 69–71 72–74
Frequency 15 ? 126 81 24
(A) 25 (B) 45 (C) 50 (D) 54.
22. The mean of 5 observations is 4·4 and the variance is 8·24. If three of these five
observations are1, 2 and 6, then the other two are
(A) 5 and 8 (B) 4 and 9 (C) 4 and 7 (D) 5 and 9.
23. The first two central moments of the following distribution are :
Income (`) 62 65 68 72 75
Frequency 5 18 42 27 8
(A) 0, 11·02 (B) 0, 10 (C) 1, 13·5 (D) 1, 11·02.
x 2 4 6 8
y 6 9 20 39
25. If f (1) = 10, f (2) = 15 and f (5) = 42, then the value of x when f (x) = 25 is
(A) 2·5 (B) 3·4 (C) 4·0 (D) 4·5.