Lesson-2 Descriptive-Statistics Lecture
Lesson-2 Descriptive-Statistics Lecture
(a) MEAN ()
A set of data has only one mean
Applied for interval and ratio data
All values in the data set are included
Affected by the extreme small or large values on a data set
MEAN for UNGROUPED DATA
(a) Sample Mean: (b) Population Mean: (c) Weighted Mean: (d) Mean for Simple Frequency
𝜮𝒙 𝜮𝒙 Distribution:
𝒙= 𝝁= 𝜮 𝒘𝒙
𝒏 𝑵 𝒙= 𝜮 𝒇𝒙
𝜮𝒘 𝒙=
𝒏
Example:
The daily rates of a sample of eight employees at JNM Inc. are ₧550,
₧420, ₧560, ₧500, ₧700, ₧670, ₧860, ₧480. Find the mean daily
rate of employee.
Solution:
𝒙=
∑ 𝒙 𝒙 𝟏+ 𝒙 𝟐 + 𝒙 𝟑 +⋯ + 𝒙 𝒏
¿
𝒏 𝒏
𝟓𝟓𝟎+𝟒𝟐𝟎+𝟓𝟔𝟎+𝟓𝟎𝟎+𝟕𝟎𝟎+𝟔𝟕𝟎+𝟖𝟔𝟎+𝟒𝟖𝟎
¿
𝟖
𝟒 ,𝟕𝟒𝟎
¿
𝟖
𝒙=𝟓𝟗𝟐.𝟓𝟎
Sample mean daily salary is ₧592.50
Example:
A student took 3 exams in Math. He finished the first exam in 45 minutes and
got a grade of 88; 60 minutes on the second exam and scored 92; and 90 minutes on the third
exam and got 85. What was the student’s mean score for the three exams?
Solution:
𝜮 𝒘𝒙 𝒘 𝟏 𝒙 𝟏+ 𝒘 𝟐 𝒙 𝟐+𝒘 𝟑 𝒙 𝟑
𝒙= ¿
𝜮𝒘 𝒘 𝟏+ 𝒘 𝟐+𝒘 𝟑
(𝟒𝟓)(𝟖𝟖)+(𝟔𝟎)(𝟗𝟐)+(𝟗𝟎)(𝟖𝟓)
𝒙=
𝟒𝟓+𝟔𝟎+𝟗𝟎
𝟏𝟕 ,𝟏𝟑𝟎
¿
𝟏𝟗𝟓
𝒙=𝟖𝟖
𝒙=𝟓 . 𝟔𝟓
𝜮 𝒇𝒙
(a)
8(65) = 520
𝑥=𝑐𝑙𝑎𝑠𝑠𝑚𝑎𝑟𝑘𝑠
is the midpoint of the lower limit and upper limit ADD
𝑛=𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒
=
𝑙𝑙+𝑢𝑙 62+68
𝑥= ¿ ¿ 65
2 2
𝜮 𝒇𝒙
69+75 𝒙=
¿ 2 𝒏
2
𝟓𝟕𝟓𝟖
𝒏 𝜮 𝒇𝒙
𝟔𝟓 𝒙=𝟖𝟖 . 𝟓
¿
76 +82
9 𝒙=
2
Mean for GROUPED DATA Example: Find the mean of the given data shown below.
(b) Class
[ ]
intervals f x d fd 𝜮 𝒇𝒅
𝒙=𝒙 𝟎+ 𝑪
where 62 – 68 8 65 -4 -32 𝒏
𝒏=𝒏𝒖𝒎𝒃𝒆𝒓 𝒐𝒇 𝒔𝒂𝒎𝒑𝒍𝒆𝒔
69 – 75
76 – 82
5
9
72
79
-3
-2
-15
-18
𝑥=93+
[ − 41
65 ](7)
Solution:
Step 1 ⇰ 420 , 480 , 500 , 550 , 560 , 670 , 700 , 860
Step 2 ⇰ Use: .5
Step 3 ⇰ 420 , 480 , 500 , 550 , 560 , 670 , 700 , 860
4th 5th
4.5th
~ 4 𝑡h +5 𝑡h 550+560
𝑥= ¿ =₧ 𝟓𝟓𝟓
2 2
The median is
Example: Find the median of the given data shown
MEDIAN for GROUPED DATA
[ ] [ ]
Class 𝒏
𝒏 intervals − <𝒄 𝒇
−<𝒄 𝒇 f <cf~ 𝟐
~ 𝟐 𝒙 = 𝑳𝑳 𝑹 + 𝑪
𝒙 =𝑳𝑳 𝑹 + 𝑪 𝒇
𝒇
⟹
[ ]
where ⟹ 62 – 68 8 8 32.5 − 22
~
69 – 75 5 13 𝑥=82.5+ 11
(
𝑓 = 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 76 – 82 9 22 ~
𝑥=82.5+ [ 6.7 ]
𝑛=𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑎𝑚𝑝𝑙𝑒𝑠 83 – 89 11 33 32.5~
¿𝑐 𝑓 =𝑙𝑒𝑠𝑠 𝑡h𝑎𝑛𝑐𝑢𝑚 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 90 – 96 12 45 𝒙=𝟖𝟗.𝟐
𝐶=𝑐𝑙𝑎𝑠𝑠 𝑠𝑖𝑧𝑒(𝑤𝑖𝑑𝑡h ) 97 – 103 10 55 the median is 89.2
𝒏 65
¿ =32.5 104 – 110 5 60
𝟐 2 Locate 32.5 in <cf to find out modal class
111 – 117 5 65
The modal class is 83 - 89
𝐿𝐿 𝑅= 𝐿𝑜𝑤𝑒𝑟 𝑟𝑒𝑎𝑙 𝐿𝑖𝑚𝑖𝑡 𝒏 65
𝑙𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 − 0.5
𝐿𝐿 𝑅=83 −0.5 ¿𝑐 𝑓 =𝑙𝑒𝑠𝑠 𝑡h𝑎𝑛𝑐𝑢𝑚 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑓 𝑖𝑠 𝑡h𝑒 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑓𝑟𝑜𝑚 𝑡h𝑒 𝑚𝑜𝑑𝑎𝑙𝑐𝑙𝑎𝑠𝑠
𝐿𝐿 𝑅=82.5 ¿ 𝑐𝑓 𝑖𝑠 𝑡h𝑒𝑛𝑢𝑚𝑏𝑒𝑟 𝑏𝑒𝑙𝑜𝑤 𝑡h𝑒<𝑐𝑓 𝑜𝑓 𝑡h𝑒𝑚𝑜𝑑𝑎𝑙 𝑐𝑙𝑎𝑠𝑠 𝑓 =11
¿ 𝑐𝑓 =22
(c) MODE ()
It is found by locating the most frequently occurring value
The easiest average to compute
There can be more than one mode or even no mode in any given data set
It is not affected by the extreme small or large values
It can be applied for nominal, ordinal, interval and ratio data
The value in a data set that appears most frequently
A data may not contain any mode if none of the values is most typical.
Unimodal With 1 mode Multimodal With more than 2 modes
Bimodal Þ With 2 modes No mode ÞWithout mode
ÞWith equal number of frequency
MODE for UNGROUPED DATA
Example:
In the given the set of scores: 2 , 5 , 6 , 66, 7 , 8 , 9 , 66
since it appears the most number of times.
MODE for UNGROUPED DATA
Example 1: The following data represent the total sales for Condominium units from a sample of
10 Real Estates for the month of August: 15, 17, 10, 12, 13, 10, 14, 10, 8, and 9. Find the mode.
Solution: The ordered array for these data is
8, 9, 10, 10, 10, 12, 13, 14, 15, 17
The mode is 10.
Example 2: An operations manager in charge of a company’s manufacturing keeps track of the
mber of manufactured 55” LED Television in a day. Compute for the following data that represents
e number of LED Television manufactured for the past three weeks: 20, 18, 19, 25, 20, 21, 20, 25, 30,
, 28, 29, 25, 25, 27, 26, 22, and 20. Find the mode of the given data set.
Solution: The ordered array for these data is
18, 19, 20, 20, 20, 20, 21, 22, 25, 25, 25, 25, 26, 27, 28, 29, 29, 30
There are two modes 20 and 25.
Example 3: Let the resulting scores be: 32, 37, 42, 41, 56, 59, 83, 65, 96 and 71, Determine the mode.
Answer:
There is no mode as every score has equal frequency.
MODE for GROUPED DATA Example:
𝒅𝟏
[ ]
Class
^
𝒙 =𝑳𝑳 𝑹+ 𝑪 intervals f
𝒅𝟏 + 𝒅𝟐
where
⇒ 62 – 68 8
69 – 75 5
76 – 82 9
83 – 89 11
𝒅𝟏
𝐶=𝑐𝑙𝑎𝑠𝑠 𝑠𝑖𝑧𝑒(𝑤𝑖𝑑𝑡h ) 90 – 96 12
97 – 103 10 𝒅𝟐
STEP 1: Find the highest number of frequency
to identify the modal class 104 – 110 5
modal class = 90 -96
111 – 117 5
STEP 2: above and below highest frequency 65
will be our deviation
STEP 3: Compute
^
𝒙 = 𝑳𝑳 𝑹+
[1 𝒅𝟏
𝒅𝟏 + 𝒅𝟐
𝑪
] ~
𝑥= 89.5 + [ 2.33 ]
Lower limit – 0.5
90 – 0.5 = 89.5
~
𝑥=89.5+ [ ]
1+2
(7) ~
𝒙 =𝟗𝟏 .𝟖
The mode is 91.8
MODE for GROUPED DATA
Step 5: Get the difference between the upper and lower score.
Step 6: Multiply the difference by the decimal part in the computed location.
Step 7: Add the product to the lower score.
Example: Given the data :
Find: (a) (b)
26 51 44 23 25 61 45
23 43 41 55 34 35 65
Solution: Arrange the data:
23 23 25 26 34 35 41 43 44 45 51 55 61 65
Find: (a)
𝑝 (𝑛+1)4 (14+1)
L ¿
𝑞
¿
10
¿ 6 𝑡h location
23 23 25 26 34 35
35 41 43 44 45 51 55 61 65
𝑇h𝑒𝑟𝑒𝑓𝑜𝑟𝑒 , 𝑫 𝟒=𝟑𝟓 6th
Find: (b)
9th
Arrange the data:
23 23 25 26 34 35 41 43 44 44 45 51 55 61 65
𝑝 (𝑛+1) 60 (14+1)
L ¿ ¿ ¿ 9 𝑡h location
𝑞 100
𝑇h𝑒𝑟𝑒𝑓𝑜𝑟𝑒 , 𝑷 𝟔𝟎 =𝟒𝟒
Find: (c) 11.25
Arrange the data:
23 23 25 26 34 35 41 43 44 45 5111th
55
12th
61 65
𝑝 (𝑛+1)3 (14+1)
L ¿ ¿
4
¿11.25 location
𝑞
Interpolation:
Step 5: Get the difference between the upper and lower score. ⇒12 𝑡h 𝑠𝑐𝑜𝑟𝑒− 11 𝑡h 𝑠𝑐𝑜𝑟𝑒=55− 51=4
4
Step 6: Multiply the difference by the decimal part in the computed ⇒ 4 ( .25 ) =𝟏
location.
Step 7: Add the product to the lower score. ⇒ 𝑄 3=11 𝑡h +𝟏
¿ 51+ 1
⇒ 𝑄 3=52
MEASURES OF POSITION FOR GROUPED DATA
[ ]
𝑝𝑛
𝑝𝑡h 𝑄𝑢𝑎𝑟𝑡𝑖𝑙𝑒 : 4
−≺ 𝑐 𝑓
𝑄𝑝 = 𝐿𝐿 𝑅 + 𝐶
𝑓
𝑝𝑡h 𝐷𝑒𝑐𝑖𝑙𝑒 :
[ ]
𝑝𝑛
−≺𝑐 𝑓
10
𝐷𝑝 = 𝐿𝐿 𝑅 + 𝐶
𝑓
𝑝𝑡h 𝑃𝑒𝑟𝑐𝑒𝑛𝑡𝑖𝑙𝑒 :
[ ]
𝑝𝑛
−≺ 𝑐 𝑓
100
𝑃 𝑝 = 𝐿𝐿 𝑅 + 𝐶
𝑓
Where
= lower real limit <cf = cumulative frequency
n = sample size C = class width
f = frequency p = desire position of quantile
q = quantile (4, 10,100)
Steps in solving Quantile Measures:
Class f <cf
Step 1: Prepare the frequency distribution table Intervals
Step 2: Record the cumulative frequencies 62 – 68 8 8
Step 3: Determine and identify the class where it belongs 69 – 75 5 13
Step 4: Determine the class size. 76 – 82 9
≺ 𝑐22
𝑓
Step 5: Plug in values to the formula. 83 – 89 11 33
Example: (a) Find 𝑚𝑜𝑑𝑎𝑙𝑐𝑙𝑎𝑠𝑠 ⇒ 90 – 96 12 𝑓 45
⇒32.5
𝑝𝑛 2(65) 𝐿𝐿 𝑅=83 −0.5=82.5 97 – 103 10 55
= =32.5 104 – 110 5
4 4quartile 60
[ ]
𝑝𝑛 111 – 117 5
−≺𝑐 𝑓 65
𝑄2 = 𝐿𝐿 𝑅 +
4
𝐶 n=65
𝑓
𝑄2 =82.5+
[ 32.5 −22
11
(7)
]
𝑄2 =82.5+6.681
𝑸 𝟐=𝟖𝟗. 𝟏𝟖
Example: (b) Find
[ ]
Class f <cf 𝑝𝑛
Intervals 10
−≺𝑐 𝑓 𝐿𝐿 𝑅=76 −0.5=75.5
𝐷3 = 𝐿𝐿 𝑅 + 𝐶
62 – 68 8 𝑓
8
69 – 75 5 ≺ 𝑐13
𝑓 𝑝𝑛 3(65)
76 – 82 9 = =19.5
22
83 – 89 𝑓11 ⇒ 19.5 10 10
decile
33
90 – 96
97 – 103
12
10
45
55
𝐷3 =75.5+
[
19.5 −13
9 ](7)
Class f <cf
[ ]
Intervals 𝑝𝑛
−≺𝑐 𝑓
62 – 68 8 100
8 𝑃 95 = 𝐿𝐿 𝑅 + 𝐶
𝑓
69 – 75 5
13 𝑝𝑛 95(65)
76 – 82 9 = =61.75
22 100 100percentile
83 – 89 11
33
90 – 96 12 𝐿𝐿 𝑅=111− 0.5=110.5
97 – 103 10 45
104 – 110
111 – 117
5 55
5 ≺ 𝑐60
𝑓
𝑃 95=110.5 +
[ 61.75 − 60
5 ](7)
𝑃 95=110.5 +2.45
n=65 65
𝑓 ⇒61.75 𝑷 𝟗𝟓 =𝟏𝟏𝟐 .𝟗𝟓
Measures of Variability
describes how far apart data points lie from each other and from the center of a distribution.
(a) RANGE - The difference of the highest value and the lowest value in the data set.
Example: The daily rates of a sample of eight employees at GMS Inc. are ₧550, ₧420, ₧560,
₧500, ₧700, ₧670, ₧860, ₧480. Find the range. Example: b
Solution: 420, 480, 500, 550, 560, 670, 700, 860
x | |
HV = ₧860 and LV = ₧420 =|
Range = HV – LV = 860 – 420 83 0.1429 = 0.1429
R = 440 79 3.8571
|
(b) MEAN DEVIATION 19.1429 86 3.1429 =|
𝑴𝑫= = 3.8 571
It is the average distance between the 𝟕 82 82.8571 0.8571
|
mean and the scores in the distribution. 𝑴𝑫=𝟐. 𝟕𝟑𝟒𝟕 78 4.8571 =|
𝒙=
𝟖𝟑+𝟕𝟗+𝟖𝟔+𝟖𝟐+𝟕𝟖+𝟖𝟖+𝟖𝟒
𝟕
=𝟖𝟐 .𝟖𝟓𝟕𝟏
𝒏=𝟕
∑|𝒙−𝒙|=¿¿ 19.1429
(c) VARIANCE
It is a mathematical expectation of the average squared deviations from the mean.
Individual value
Sample
𝝈 𝟐=∑ ¿¿¿
Variance
Sample mean
Example:
Sample Population
x |
83
79
0.1429
3.8571
2
⇒|𝟎. 𝟏𝟒𝟐𝟗| =0.0204
2
⇒|𝟑. 𝟖𝟓𝟕𝟏| =14.8772
∑ ¿¿
𝝈 𝟐=∑ ¿¿¿
86 3.1429 2
⇒|𝟑. 𝟏𝟒𝟐𝟗| =9.8778
82 82.8571 0.8571 2
⇒|𝟎. 𝟖𝟓𝟕𝟏| =0.7346
78 4.8571 2
⇒|𝟒 .𝟖𝟓𝟕𝟏| =23.5914 𝟐 𝟕𝟔 . 𝟖𝟓𝟐𝟕
𝝈 =
88 5.1429 2
⇒|𝟓. 𝟏𝟒𝟐𝟗| =26.4494 𝟕 −𝟏
𝟐 𝟕𝟔. 𝟖𝟓𝟐𝟕
84 1.1429
2
⇒|𝟏. 𝟏𝟒𝟐𝟗| =1.3062 𝝈 =
𝟔
𝝈 𝟐=𝟏𝟐 .𝟖𝟎𝟗𝟓
Example:
x
83
x2
6,889 ⇒=83 2=6,889
𝝈 =∑ 𝒙 −¿¿¿¿
𝟐 𝟐
( 𝟓𝟖𝟎 )𝟐
𝟒𝟖 , 𝟏𝟑𝟒 −
79 6,241 ⇒=792 =6,241
𝟐 𝟕
𝝈 =
𝟕−𝟏
86 7,396 ⇒=86 2=7,396
𝟐 𝟒𝟖,𝟏𝟑𝟒− 𝟒𝟖,𝟎𝟓𝟕.𝟏𝟒𝟐𝟗
82 82.8571 6,724 ⇒=82 2=6,724 𝝈=
𝟔
78 6,084 ⇒=78 2=6,084 8095
88 7,744 2
⇒=88 =7,744
580 48,134
∑𝒙 ∑ 𝒙 𝟐
(d) STANDARD DEVIATION VARIANCE
𝝈=√∑ ¿¿¿¿
Sample SD
It is half of the difference between (or ) )
Example:
𝝈=
√𝟕𝟔.𝟖𝟓𝟐𝟕
𝟕𝟔.𝟖𝟓𝟐𝟕
𝟕−𝟏
Example x
83
78 79 82 83 84 86 88
𝑸𝟑 −𝑸 𝟏
√
𝑸=
𝟐
79
𝝈= (or ) = 86
𝟔
𝝈 =√ 𝟏𝟐.𝟖𝟎𝟗𝟓
86 (or ) = 79
82 𝟖𝟔 − 𝟕𝟗
𝝈 =𝟑.𝟓𝟕𝟗𝟎 𝑸=
𝟐
=𝟑 . 𝟓𝟎
78
88
∑ ¿¿ 84