CHAPTER-01 (BASIC CONCEPT OF STATISTICS)
1.1 Statistics: Statistics is the practice or science of collecting and analyzing numerical data in large
quantities, especially for the purpose of inferring proportions in a whole from those in a representative sample.
Simply, statistics is a branch of mathematics dealing with the collection, analysis, interpretation, and
presentation of masses of numerical data. Generally, statistics are two types: Descriptive Statistics and
Inferential Statistics.
1.1.1 Descriptive Statistics: Statistics that are used to describe some aspect of the data, such as the typical
score or the variability of scores.
1.1.2 Inferential Statistics: Statistical procedures that are used to draw conclusions about the populations
under study based on the sample data from those populations.
1.1.3 Difference between Mathematics and Statistics:
Mathematics is an academic subject whereas statistics is a part of applied mathematics
Mathematics deals with numbers, patterns and their relationships whereas statistics is concerned with
systematic representation and analysis of data
Mathematics gives exact result but statistical measure gives approximation
Mathematics form the basis of our understanding of quantity and measurement whereas statistics
makes understanding of data easy and
1.2 Data: Information in raw or unorganized form (such as alphabets, numbers, or symbols) is known as data.
Data is limitless and present everywhere in the universe.
1.2.1 Primary Data: Data collected by the investigator himself/ herself for a specific purpose.
Examples: Data collected by a student for his/her thesis or research project.
1.2.2 Secondary Data: Data collected by someone else for some other purpose (but being utilized by the
investigator for another purpose).
Examples: Census data being used to analyze the impact of education on career choice and earning.
1.3 Information: Information is a data or a collection of data that is (1) accurate and timely, (2) specific and
organized for a purpose, (3) presented within a context that gives it meaning and relevance, and (4) can lead to
an increase in understanding and decrease in uncertainty. For example, if a manager is told his/her company's
net profit decreased in the past month, he/she may use this information as a reason to cut financial spending
for the next month
1.4 Variable: A quantitative or qualitative characteristic varying from the individual item to another is known
as a variable. Examples of variable are the phenomena like income, height, weight, birth rate, death rate etc.
The variables are of two types: (i) continuous variable and (ii) discontinuous variable.
1.4.1 Continuous Variable: The variable which is changeable is called continuous variable.
For example: income, age, production, birth rate etc.
1.4.2 Discontinuous Variable: The variable which cannot be expressed in every fraction value but is to be
shown integer number only is called discontinuous or discrete variable.
For example: the number of person in a family, the number of employers in a factory and so on.
1.5 Population: In sampling process we have to consider a complete set of objects or things, which are known
as population.
1.6 Sample: A finite part of population or a subject of a set of sampling units is selected by some process is
called sample.
1|Page
CHAPTER-02 (MEASURES OF CENTRAL TENDENCY)
2.1 Measures of central tendency: The tendency of the individual item of a statistical series to cluster around
the central value is called the central tendency. The central value gives us the good description of the entire
mass of data. The measure of central tendency gives us a value which is typical or representative of the entire
distribution. It is sometimes called the measure of location or a measure of representation.
2.2 Question: What are the measures of central tendency?
Answer: There are three important measures of central tendency:
(i) Mean
(ii) Median and
(iii) Mode
2.3 Mean: There are three types of mean:
(i) Arithmetic mean
(ii) Geometric mean and
(iii) Harmonic mean
2.3.1 Arithmetic mean: The arithmetic mean of a series of observation is equal to the sum of the observations
divided by their number. Let the variable 𝑥 takes the values 𝑥1 , 𝑥2 , 𝑥3 , ⋯ ⋯ ⋯ ⋯ , 𝑥𝑁 then the arithmetic mean
is defined by
𝑥1 + 𝑥2 + 𝑥3 + ⋯ ⋯ ⋯ ⋯ + 𝑥𝑁 ∑𝑛𝑖=1 𝑥𝑖
𝜇 𝑜𝑟 𝑥̅ = = ,
𝑁 𝑁
where 𝜇 is the sample mean.
For grouped data or frequency distribution, arithmetic mean is defined by
∑𝑁
𝑖=1 𝑓𝑖 𝑥𝑖
𝑥̅ = , where ∑𝑛𝑖=1 𝑓𝑖 = 𝑁
𝑁
𝑥𝑖 = Mid values of the class
𝑓𝑖 = Frequency of the class
𝑁 = Total number of frequency.
2.3.2 Geometric mean: Let the variable 𝑥 takes the values 𝑥1 , 𝑥2 , 𝑥3 , ⋯ ⋯ ⋯ ⋯ , 𝑥𝑁 then the geometric mean
is denoted by G.M. and is defined by
𝟏
𝑮. 𝑴. = (𝒙𝟏 ∙ 𝒙𝟐 ∙ 𝒙𝟑 ∙ ⋯ ⋯ ⋯ ⋯ ∙ 𝒙𝑵 )𝑵 .
For grouped data or frequency distribution, geometric mean is defined by
1
𝐺. 𝑀. = (𝑥1 𝑓1 ∙ 𝑥2 𝑓2 ∙ 𝑥3 𝑓3 ∙∙ ⋯ ⋯ ⋯ ⋯ ∙ 𝑥𝑁 𝑓𝑁 )𝑁 , where 𝑁 = ∑𝑛𝑖=1 𝑓𝑖 = 𝑁 total frequency.
2.3.3 Harmonic Mean: The geometric mean of a set of n positive values is the nth root of their
product. The harmonic mean of a set of values is the reciprocal of the arithmetic mean of the reciprocals. Let
the variable 𝑥 takes the values 𝑥1 , 𝑥2 , 𝑥3 , ⋯ ⋯ ⋯ ⋯ , 𝑥𝑛 then the harmonic mean is denoted by H.M. and is
defined by
𝑵 𝑁
𝐻. 𝑴. = = .
𝟏 𝟏 𝟏 𝟏 1
+ + + ⋯ ⋯ ⋯ + ∑𝑛𝑖=1
𝒙𝟏 𝒙𝟐 𝒙𝟑 𝒙𝑵 𝑥𝑖
For grouped data or frequency distribution, geometric mean is defined by
𝑁
𝑛
𝐻. 𝑀. = , 𝑤ℎ𝑒𝑟𝑒 𝑁 = ∑ 𝑓𝑖 .
𝑓𝑖
∑𝑁 𝑖=1
𝑖=1 𝑥𝑖
Relation-01: For Two Numbers 𝑎 and 𝑏, prove that, A.M G.M H .M
𝑎+𝑏
Proof: Suppose that 𝑎 and 𝑏 are two observations. Then, 𝐴. 𝑀 = 2
1
2 2𝑎𝑏
𝐺. 𝑀 = (𝑎𝑏)2 = √𝑎𝑏 = √𝑎√𝑏 and 𝐻. 𝑀 = 1 1 = 𝑎+𝑏
( + )
𝑎 𝑏
𝑎+𝑏 1 1 2 2
Now 𝐴. 𝑀 − 𝐺. 𝑀 = − √𝑎√𝑏 = (𝑎 + 𝑏 − 2√𝑎√𝑏) = ((√𝑎) + (√𝑏) − 2√𝑎√𝑏)
2 2 2
Page | 2
1
= (√𝑎 − √𝑏 )2 ≥ 0
2
Since square of any number is positive. Thus 𝐴. 𝑀 − 𝐺. 𝑀 ≥ 0 ⇒ 𝐴. 𝑀 ≥ 𝐺. 𝑀 … … … … … (1)
2𝑎𝑏
Again 𝐺. 𝑀 − 𝐻. 𝑀 = √𝑎√𝑏 − 𝑎+𝑏
1
= (√𝑎√𝑏(𝑎 + 𝑏) − 2𝑎𝑏)
𝑎+𝑏
√𝑎√𝑏
= (𝑎 + 𝑏 − 2√𝑎√𝑏)
𝑎+𝑏
√𝑎√𝑏 2 2
= ((√𝑎) + (√𝑏) − 2√𝑎√𝑏)
𝑎+𝑏
√𝑎√𝑏
= (√𝑎 − √𝑏 )2 ≥ 0
𝑎+𝑏
Since 𝑎, 𝑏 ≥ 0 and (√𝑎 − √𝑏 )2 ≥ 0. So, 𝐺𝑀 − 𝐻𝑀 ≥ 0 ⇒ 𝐺𝑀 ≥ 𝐻𝑀 … … … … … (2)
Thus from inequality (1) and (2), we get 𝐴𝑀 ≥ 𝐺𝑀 ≥ 𝐻𝑀. (Proved).
Relation-02: For Two numbers 𝑎 and 𝑏, prove that, A.M * H .M G.M
2
Proof: Suppose that 𝑎 and 𝑏 are two observations.
1
𝑎+𝑏 2 2𝑎𝑏
Then, 𝐴. 𝑀 = 2
, 𝐺. 𝑀 = (𝑎𝑏)2 = √𝑎𝑏 = √𝑎√𝑏 and 𝐻. 𝑀 = 1 1 = 𝑎+𝑏
( + )
𝑎 𝑏
𝑎+𝑏 2𝑎𝑏
Now, 𝐴. 𝑀 ∗ 𝐻. 𝑀 = 2
∗ 𝑎+𝑏 = 𝑎𝑏 = (𝐺. 𝑀)2 (Proved).
Problem-01: If 𝑥, 𝑦, 𝑧 are positive real numbers, such that 𝑥 + 𝑦 + 𝑧 = 𝑎,
1 1 1 9
prove that 𝑥 + 𝑦 + 𝑧> 𝑎 .
Solution: Given that 𝑥, 𝑦, 𝑧 are positive real numbers and 𝑥 + 𝑦 + 𝑧 = 𝑎
Since 𝐴. 𝑀. > 𝐻. 𝑀. Hence, using this we obtain,
𝑥+𝑦+𝑧 3
3
> 1 1 1
+ +
𝑥 𝑦 𝑧
1 1 1 9
⇒ + + >
𝑥 𝑦 𝑧 𝑥+𝑦+𝑧
1 1 1 9
⇒ 𝑥
+ 𝑦 + 𝑧 > 𝑎 (Proved).
Problem -02: Prove that (𝑎 + 𝑏 + 𝑐) (𝑎𝑏 + 𝑏𝑐 + 𝑐𝑎) > 9𝑎𝑏𝑐.
1 1
𝑎+𝑏+𝑐 𝑎𝑏+𝑏𝑐+𝑐𝑎
Solution: Using 𝐴. 𝑀 > 𝐺. 𝑀, we have 3
> (𝑎𝑏𝑐)3 and 3
> (𝑎𝑏. 𝑏𝑐. 𝑐𝑎)3
Multiplying these two results, we have
1 1
𝑎+𝑏+𝑐 𝑎𝑏+𝑏𝑐+𝑐𝑎
3
× 3
> (𝑎𝑏𝑐)3 × (𝑎𝑏. 𝑏𝑐. 𝑐𝑎)3
1
( 𝑎𝑏+𝑏𝑐+𝑐𝑎)∗( 𝑎+𝑏+𝑐)
Or, > (𝑎3 𝑏 3 𝑐 3 )3
9
or, ( 𝑎𝑏 + 𝑏𝑐 + 𝑐𝑎) ∗ ( 𝑎 + 𝑏 + 𝑐) > 9𝑎𝑏𝑐. (Proved)
Page | 3
2.4 Median:
If the values of a series are arranged in ascending or descending order of magnitude, then the middle most
value in this arrangement is the median of the series. When the number 𝑛 is odd then the middle most value of
𝑛+1
the ( 2 ) 𝑡ℎ observation in the arrangement will be the median. If 𝑛 is even then the arithmetic mean
𝑛 𝑛
of 2 𝑡ℎ and (2 + 1) 𝑡ℎ term in the arrangement will be the median.
For grouped data or frequency distribution the median is defined by,
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 𝑙 + ( − 𝐶)
𝑓 2
Where, 𝑙 =lower limit of the median class
ℎ =class width of the median class
𝑓 =frequency of the median class
𝑐 =cumulative frequency of the class preceding the median class
𝑁 =total number of frequency.
𝑦
Cumulative frequency
𝑛
2
Median Value
𝑥
Class
2.5 Mode:
The mode is the value of the variable of which occurs most frequently in the series of observations of the
variable and the mode of its distribution is that value of the variable for which the frequency is maximum.
Mode is the most fashionable value of the variable.
For grouped data or frequency distribution the mode is defined by,
ℎ(𝑓1 − 𝑓2 )
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 𝑙 +
2𝑓1 − 𝑓0 − 𝑓2
Where, 𝑙 = lower limit of the modal class
ℎ = class width of the modal class
𝑓1 = frequency of the modal class
𝑓2 = frequency of the class following the modal class
𝑓0 = frequency of the class preceding the modal class
𝑦
Frequency
Mode
𝑥
Class boundary
Page | 4
2.6 Question-01: Describe the merits and demerits of Arithmetic mean.
Answer:
Merits:
(i) It is rigidly defined by a mathematical formula.
(ii) If is easy to understand and easy to calculate.
(iii) It is based on all the observations.
(iv) It is amenable to algebraic treatment.
Demerits:
(i) It is affected very much by extreme values.
(ii) It cannot be calculated if the extreme class is open.
(iii) It cannot be used when we are dealing with qualitative characteristics which cannot be measured
quantitatively.
2.7 Question-02: Describe the merits and demerits of geometric mean.
Answer:
Merits:
(i) It is rigidly defined.
(ii) It is based on all the observations.
(iii) It is also used for further mathematical treatment.
Demerits:
(i) It is neither easy to calculate nor to understand.
(ii) It vanishes if any value is zero.
(iii) In case of odd numbers of negative value it cannot be computed at all.
2.8 Question-03: Describe the merits and demerits of Harmonic mean.
Answer:
Merits:
(i) It is rigidly defined.
(ii) It is based on all the observations.
(iii) It is also used for further mathematical treatment.
(iv) It is not affected by fluctuations of sampling.
Demerits:
(i) It is not easy to understand and is difficult to calculate.
(ii) It is impossible to calculate if the extreme classes of the frequency distribution are given.
2.9 Question-04: Describe the merits and demerits of median.
Answer:
Merits:
(i) It is rigidly defined.
(ii) It is easy to understand and easy to calculate.
(iii) It is not affected by the extreme values.
(iv) It cannot be calculated for distribution with open-end class.
Demerits:
(i) It is not amenable to algebraic treatment.
(ii) It is not based on all the observations.
(iii) It is affected by fluctuations of sampling.
2.10 Question-05: Describe the merits and demerits of mode.
Answer:
Merits:
(i) It is easy to understand and easy to calculate.
(ii) It is not affected by the extreme values.
(iii) It cannot be calculated for distribution with open-end class.
Demerits:
(i) It is not suitable for further mathematical treatment.
(ii) It is not based on all the observations.
(iii) It is affected by fluctuations of sampling.
Page | 5
2.11 Question-06: Which is the best measure of central tendency and why?
Answer: A comparison among the various measures of central tendency is made on the basis of the following
criteria:
(i) It should be rigidly defined.
(ii) It should be easy to understand and easy to calculate.
(iii) It should be least affected by fluctuations of sampling.
(iv) It should be amenable to mathematical treatment.
(v) It should be based on all the observations.
(vi) It should not be affected much by the extreme values.
Since the arithmetic mean satisfies most of the criteria given above, hence it is considered as the best of all
the measures of central tendency.
2.12 Question-07: Show that in a uni-model symmetrical distribution, mean=median=mode.
Proof: Let us consider the following uni-model symmetrical distribution:
𝑥: 1 2 3 4 5 6 7 8 9
𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 3 4 6 9 10 9 6 4 3
The arithmetic mean of the distribution is given by,
∑ 𝑓𝑖 𝑥𝑖
𝑥̅ =
∑ 𝑓𝑖
1 × 3 + 2 × 4 + 3 × 6 + 4 × 9 + 5 × 10 + 6 × 9 + 7 × 6 + 8 × 4 + 9 × 3
⇒ 𝑥̅ =
(3 + 4 + 6 + 9 + 10 + 9 + 6 + 4 + 3)
3 + 8 + 18 + 36 + 50 + 54 + 42 + 32 + 27
⇒ 𝑥̅ =
54
270
⇒ 𝑥̅ =
54
∴ 𝑥̅ = 5
Therefore, mean=5.
In this distribution, the highest frequency is 10.
Therefore mode of this distribution is 5.
Hence, mode=5.
Since the number of observation is odd, so the median is given by,
𝑛+1
𝑚𝑒𝑑𝑖𝑎𝑛 = , 𝑤ℎ𝑒𝑟𝑒 𝑛 = 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠,
2
9 + 1 10
⇒ 𝑚𝑒𝑑𝑖𝑎𝑛 = = =5
2 2
∴ 𝑚𝑒𝑑𝑖𝑎𝑛 = 5 .
Thus, we can conclude that, mean=median=mode=5. (Proved)
Example-01: The annual number of births to women by current age in Gopalganj district is given in the
following table:
Age in years Number of Births
14.5-19.5 677
19.5-24.5 1908
24.5-29.5 1737
29.5-34.5 1040
34.5-39.5 294
39.5-44.5 91
44.5-49.5 16
Now compute the mean age, median age and modal age.
Page | 6
Solution:
Age in Number of 𝒙 𝒇𝒙 Cumulative Range of
years Births (𝒇) number of cumulative number
Births (𝒄. 𝒇. ) of Births
14.5-19.5 677 17 11509 677 677 or less
19.5-24.5 1908 22 41976 2585 678-2585
24.5-29.5 1737 27 46899 4322 2586-4322
29.5-34.5 1040 32 33280 5362 4323-5362
34.5-39.5 294 37 10878 5656 5363-5656
39.5-44.5 91 42 3822 5747 5657-5747
44.5-49.5 16 47 752 5763 5748-5763
All Ages 𝑁 = 5763 ∑ 𝑓𝑥 = 149116
Now,
∑ 𝑓𝑥 149116
𝑀𝑒𝑎𝑛, 𝑥̅ = = = 25.87
𝑁 5763
Here, 𝑁 = 5763 . Therefore,
𝑁 5763
= = 2881.5 .
2 2
𝑁
From the table we observe that 2 falls in the range of cumulative births 2586-4322. Hence, the median class
is 24.5-29.5. This means that median value lies somewhere between 24.5 and 29.5.
We know that,
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 𝑙 + ( − 𝐶) ⋯ ⋯ ⋯ (1)
𝑓 2
Here,
𝑙 = 24.5, ℎ = 5, 𝑓 = 1737, 𝑁 = 5763 𝑎𝑛𝑑 𝐶 = 2585 .
Now from (1) we have,
5 5763
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 24.5 + ( − 2585)
1737 2
= 24.5 + 0.8535 = 25.35 .
Again we know that,
ℎ(𝑓1 − 𝑓2 )
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 𝑙 + ⋯ ⋯ ⋯ (2)
2𝑓1 − 𝑓0 − 𝑓2
Here, the highest frequency is 1908. Therefore the modal class is 19.5-24.5. Hence,
𝑙 = 19.5, ℎ = 5, 𝑓1 = 1908, 𝑓0 = 677 𝑎𝑛𝑑 𝑓2 = 1737 .
Now from (1) we have,
5(1908 − 1737)
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 19.5 +
2 × 1908 − 677 − 1737
855
= 19.5 + = 19.5 + 0.61 = 20.11
1402
Therefore, mean age is 25.87 years, median age is 25.35 years and modal age is 20.11 years. (Ans.)
Example-02: The annual number of women by current age in Dhaka district is given in the following table:
Age in years Number of Births
9.5-14.5 27
14.5-19.5 34
19.5-24.5 41
24.5-29.5 45
29.5-34.5 45
34.5-39.5 43
39.5-44.5 35
44.5-49.5 30
Now compute the mean age, median age and modal age.
Page | 7
Solution:
Age in Number of 𝒙 𝒇𝒙 Cumulative Range of cumulative
years women (𝒇) number of number of women
women (𝒄. 𝒇. )
9.5-14.5 27 12 324 27 27 or less
14.5-19.5 34 17 578 61 28-61
19.5-24.5 41 22 902 102 62-102
24.5-29.5 45 27 1215 147 103-147
29.5-34.5 45 32 1440 192 148-192
34.5-39.5 43 37 1591 235 193-235
39.5-44.5 35 42 1470 270 236-270
44.5-49.5 30 47 1410 300 271-300
All Ages 𝑁 = 300 ∑ 𝑓𝑥 = 8930
Now,
∑ 𝑓𝑥 8930
𝑀𝑒𝑎𝑛, 𝑥̅ = = = 29.77 .
𝑁 300
Here, 𝑁 = 300 . Therefore,
𝑁 300
= = 150 .
2 2
𝑁
From the table we observe that 2 falls in the range of cumulative births 148-192. Hence, the median class is
29.5-34.5. This means that median value lies somewhere between 29.5 and 34.5.
We know that,
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 𝑙 + ( − 𝐶) ⋯ ⋯ ⋯ (1)
𝑓 2
Here,
𝑙 = 29.5, ℎ = 5, 𝑓 = 45, 𝑁 = 300 𝑎𝑛𝑑 𝐶 = 147 .
Now from (1) we have,
5 300
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 29.5 + ( − 147)
45 2
= 29.5 + 0.33 = 29.83 .
Again we know that,
ℎ(𝑓1 − 𝑓2 )
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 𝑙 + ⋯ ⋯ ⋯ (2)
2𝑓1 − 𝑓0 − 𝑓2
Here, the highest frequency is 45. Therefore the mode occurs in two adjacent intervals: 24.5-29.5 and 29.5-
34.5. Thus, combining these two groups into one, the interval becomes 24.5-34.5 with 90 as class frequency
and 10 as width of the interval.
Therefore the modal class is 24.5-34.5.
Hence,
𝑙 = 24.5, ℎ = 10, 𝑓1 = 90, 𝑓0 = 41 𝑎𝑛𝑑 𝑓2 = 43 .
Now from (1) we have,
10(90 − 41)
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 24.5 +
2 × 90 − 41 − 43
490
= 24.5 +
96
= 24.5 + 5.1042 = 29.6042 = 29.60
Therefore, mean age is 29.77 years, median age is 29.83 years and modal age is 29.60 years. (Ans.)
Page | 8
Example-03: From the following table showing the daily wage distribution in a certain factory, determine
(i) the mean wage
(ii) the median wage and
(iii) the modal wage.
Daily wage Number of
(Taka) employees (𝒇)
120-140 6
140-160 14
160-180 20
180-200 30
200-220 40
220-240 36
240-260 17
260-280 7
280-300 6
300-320 4
Solution:
Daily wage Number of 𝒙 𝒇𝒙 Cumulative Range of cumulative
(Taka) employees number of number of employees
(𝒇) employees
(𝒄. 𝒇. )
120-140 6 130 780 6 6 or less
140-160 14 150 2100 20 7-20
160-180 20 170 3400 40 21-40
180-200 30 190 5700 70 41-70
200-220 40 210 8400 110 71-110
220-240 36 230 8280 146 111-146
240-260 17 250 4250 163 147-163
260-280 7 270 1890 170 164-170
280-300 6 290 1740 176 171-176
300-320 4 310 1240 180 177-180
All wages 𝑁 = 180 ∑ 𝑓𝑥 = 37780
Now,
∑ 𝑓𝑥 37780
𝑀𝑒𝑎𝑛, 𝑥̅ = = = 209.89 .
𝑁 180
Here, 𝑁 = 180 . Therefore,
𝑁 180
= = 90 .
2 2
𝑵
From the table we observe that 𝟐 falls in the range of cumulative employees 71-110. Hence, the median class
is 200-220. This means that median value lies somewhere between 200 and 220.
We know that,
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 𝑙 + ( − 𝐶) ⋯ ⋯ ⋯ (1)
𝑓 2
Here,
𝑙 = 200, ℎ = 20, 𝑓 = 40, 𝑁 = 180 𝑎𝑛𝑑 𝐶 = 70 .
Now from (1) we have,
20 180
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 200 + ( − 70)
40 2
= 200 + 10 = 210 .
Again we know that,
ℎ(𝑓1 − 𝑓2 )
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 𝑙 + ⋯ ⋯ ⋯ (2)
2𝑓1 − 𝑓0 − 𝑓2
Here, the highest frequency is 40. Therefore the modal class is 200-220.
Page | 9
Hence,
𝑙 = 200, ℎ = 20, 𝑓1 = 40, 𝑓0 = 30 𝑎𝑛𝑑 𝑓2 = 36 .
Now from (1) we have,
20(40 − 36)
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 200 +
2 × 40 − 30 − 36
80
= 200 +
14
= 200 + 5.71 = 205.71
Therefore, mean wage is 209.89 tk., median wage is 210 tk. and modal wage is 205.71 tk. (Ans.)
Example-04: The following table gives the distribution of daily income of 700 middle class families in Dhaka
City, determine
(i) the mean income
(ii) the median income and
(iii) the modal income.
Marks Number of families (𝒇)
50-75 65
75-100 80
100-125 95
125-150 135
150-175 170
175-200 65
200-225 50
225-250 40
Solution:
Marks Number of 𝒙 𝒇𝒙 Cumulative number Range of cumulative
families (𝒇) of families (𝒄. 𝒇. ) number of families
50-75 65 62.5 4062.5 65 65 or less
75-100 80 87.5 7000 145 66-145
100-125 95 112.5 10687.5 240 146-240
125-150 135 137.5 18562.5 375 241-375
150-175 170 162.5 27625 545 376-545
175-200 65 187.5 12187.5 610 546-610
200-225 50 212.5 10625 660 611-660
225-250 40 237.5 9500 700 661-700
All ∑ 𝑓𝑥
Marks 𝑁 = 700 = 100250
Now,
∑ 𝑓𝑥 100250
𝑀𝑒𝑎𝑛, 𝑥̅ = = = 143.21.
𝑁 700
Here, 𝑁 = 700. Therefore,
𝑁 700
= = 350 .
2 2
𝑁
From the table we observe that 2 falls in the range of cumulative families 241-375. Hence, the median class
is 125-150. This means that median value lies somewhere between 125 and 150.
We know that,
ℎ 𝑁
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 𝑙 + ( − 𝐶) ⋯ ⋯ ⋯ (1)
𝑓 2
Here,
𝑙 = 125, ℎ = 25, 𝑓 = 135, 𝑁 = 700 𝑎𝑛𝑑 𝐶 = 240 .
Page | 10
Now from (1) we have,
25 700
𝑀𝑒𝑑𝑖𝑎𝑛, 𝑀𝑒 = 125 + ( − 240)
135 2
= 125 + 20.37 = 145.37 .
Again we know that,
ℎ(𝑓1 − 𝑓2 )
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 𝑙 + ⋯ ⋯ ⋯ (2)
2𝑓1 − 𝑓0 − 𝑓2
Here, the highest frequency is 170. Therefore, the modal class is 150-175.
Hence,
𝑙 = 150, ℎ = 25, 𝑓1 = 170, 𝑓0 = 135 𝑎𝑛𝑑 𝑓2 = 65 .
Now from (1) we have,
25(170 − 65)
𝑀𝑜𝑑𝑒, 𝑀𝑜 = 150 +
2 × 170 − 135 − 65
2625
= 150 +
140
= 150 + 18.75 = 168.75 .
Therefore, mean income is 143.21, median income is 145.37 and modal income is 168.75. (Ans.)
Page | 11