Business statistics_lecture note
Business statistics_lecture note
1 — INTRODUCTION TO STATISTICS
Solution
a Because the average of birr 3,400 is based on a subset of the population, it is
a sample statistic.
b Because the average SAT math score of 54 is based on the entire freshman
class, it is a population parameter.
c Because the percent, 34%, is based on a subset of the population, it is a
sample statistic.
For example, if subjects are classified according to gender (male or female), then
the variable gender is qualitative. Other examples of qualitative variables are
religious preference and geographic locations.
Qualitative variables take on values that are names or labels. The color of a ball
(e.g., red, green, blue) or the breed of a dog (e.g., collie, shepherd, terrier) would
be examples of qualitative or categorical variables.
Definition 1.5.2 Quantitative variables are variables that can be counted or
measured.
For example, the variable age is numerical, and people can be ranked in order
according to the value of their ages. Other examples of quantitative variables are
heights, weights, and body temperatures.
Quantitative variables can be further classified into two groups: discrete and
continuous.
Definition 1.5.3 Discrete variables assume values that can be counted. A dis-
crete variable takes always whole number values that are counted.
Examples of discrete variables are the number of children in a family, the number
of students in a classroom, and the number of calls received by a switchboard
operator each day for a month.
Definition 1.5.4 Continuous variables can assume an infinite number of values
between any two specific values. They are obtained by measuring. They often
include fractions and decimals.
Temperature, for example, is a continuous variable, since the variable can assume
an infinite number of values between any two given temperatures.
An inherent zero is a zero that implies “none.” For instance, the amount of money
you have in a savings account could be zero dollars. In this case, the zero represents
no money; it is an inherent zero. On the other hand, a temperature of 0oCdoes not
represent a condition in which no heat is present. The 0oCtemperature is simply
a position on the Celsius scale; it is not an inherent zero. To distinguish between
data at the interval level and at the ratio level, determine whether the expression
“twice as much” has any meaning in the context of the data. For instance, $2 is
twice as much as $1, so these data are at the ratio level. On the other hand, 2oC is
not twice as warm as 1oC, so these data are at the interval level.
Uses of Statistics
Today the field of statistics is recognized as a highly useful tool to making decision
process by managers of modern business, industry, frequently changing technology.
It has a lot of functions in every day activities. The following are some of the most
important uses of statistics.
• Statistics condenses and summarizes complex data. The original set of
data (raw data) is normally voluminous and disorganized unless it is summa-
rized and expressed in few numerical values.
• Statistics facilitates comparison of data. Measures obtained from different
set of data can be compared to draw conclusion about those sets. Statistical
values such as averages, percentages, ratios, etc., are the tools that can be
used for the purpose of comparing sets of data.
• Statistics helps in predicting future trends. Statistics is extremely useful
for analyzing the past and present data and predicting some future trends.
• Statistics influences the policies of government. Statistical study results
Limitations of statistics
Even though, statistics is widely used in various fields of natural and social sciences,
which closely related with human inhabitant, it has its own limitations as far as its
application is concerned. Some of these limitations are:-
• Statistics doesn’t deal with single (individual) values. Statistics deals only
with aggregate values. But in some cases single individual is highly important
to consider in some situations. Example, the sun, a deriver of bus, president,
etc.
• Statistics can’t deal with qualitative characteristics. It only deals with data
which can be quantified. Example, it does not deal with marital status
(married, single, divorced, widowed) but it deal with number of married,
number of single, number of divorced.
• Statistical conclusions are not universally true. Statistical conclusions are true
only under certain condition or true only on average. The conclusions drawn
from the analysis of the sample may, perhaps, differ from the conclusions
that would be drawn from the entire population. For this reason, statistics is
not an exact science.
Example 1.4 Assume that in your class there are 50 numbers of students.
Take there CGPA for all 50 students and analyse mean CGPA; that is
assumed 3.00. This value is on average, because all individual has not
CGPA 3.00. There is a student who has scored above 3.00 and below 3.00.
Example 1.5 Example, the report of head of the minister about Etio-
Somalia terrorist attack mission dismissed terrorists25% at first day, 50%
at second day, 75% at third day. However, we doubt about the mechanisms
how the mission is measured and quantified. This leads miss use of
statistical figures.
• Secondary data may help to clarify or redefine the definition of the problem
as part of the exploratory research process.
• Time saving
• Provides a larger database as compared to primary data
• Lack of availability
• Lack of relevance
• Inaccurate data
• Insufficient data
Example 2.1 Assume that a simple study is to be conducted to see the age
distribution of HIV/AIDS victim citizens. Clearly, the variable of study is age.
Data about the age of HIV/AIDS victim citizens may be obtained by making
direct interview with the victims. Note, in this specific case, the victim citizens
are primary sources. Moreover, the data to be collected from them are primary
data. Alternatively, one may use records of hospitals and other related agencies
to obtain age of the victim citizens without the need of tracing the victims
personally. Therefore, the records of the hospitals, in our case, are secondary
sources and the data copied from such records are secondary data.
type, etc.
Example 2.2 Thirty students were given a blood test to determine their blood
type. The data set is given as follows:
A B B AB O O O B AB B
B B O A O A O O O AB
AB O A B A O A B AB O
Construct a frequency, relative frequency and percent frequency distribution for
the above data
Solution:
Class Tally Frequency
A ///// / 6
B ///// /// 8
AB ///// 5
O ///// ///// / 11
the graph (usually the horizontal axis), we specify the labels that are used for the
classes (categories). A frequency, relative frequency, or percent frequency scale
can be used for the other axis of the chart(usually the vertical axis).Then, using a
bar of fixed width drawn above each class label, we extend the length of the bar
until we reach the frequency, relative frequency, or percent frequency of the class.
For categorical data, the bars should be separated to emphasize the fact that each
class is separate.
Definition 2.2.2 A bar graph represents the data by using vertical or horizontal
bars whose heights or lengths represent the frequencies of the data.
Example 2.3 The table shows the average money spent by first year college
students. Draw a horizontal and vertical bar graph for the data.
Electronics $ 728, Dorm decor $ 344,
Clothing $ 141, and shoes $ 72
Bar graphs can also be used to compare data for two or more groups. These types
Example 2.4 Consider the following data for the number (in millions) of never
married adults in the United States.
Definition 2.2.3 A pie graph is a circle that is divided into sections or wedges
according to the percentage of frequencies in each category of the distribution.
Example 2.5 This frequency distribution shows the number of pounds of each
snack food eaten during the Super Bowl. Construct a pie graph for the data.
To draw a pie chart, the angle of each slice can be calculated as follows:
f
Angle = × 3600
n
The angle of the first slice, for example, is
11.2 8.2
Potato chips = × 3600 = 1340 , Tortilla chips = × 3600 = 980
30 30
4.3 0 0 3.8
Pretzels = × 360 = 52 , Popcorn = × 360 = 460
0
30 30
2.5
Snack nuts = × 3600 = 300 ;
30
Total = 3600 Each frequency must also be converted to a percentage. i.e., % =
f
× 100
n
For example,
11.2 8.2
Potato chips × 100 = 37.3% Tortilla chips = × 100 = 27.3%
30 30
When the data are numerical instead of categorical, the range of data is small and
each class is only one unit, this distribution is called an ungrouped frequency
distribution. The major components of this type of frequency distributions are
class, tally, frequency, relative frequency and cumulative frequency.
A cumulative frequency distribution is a distribution that shows the number
of data values less than or equal to a specific value (usually an upper boundary).
Cumulative frequencies are used to show how many values are accumulated up
to and including a specific class. We have less than and more than cumulative
frequencies.
Example 2.6 The following data represent the number of days of sick leave
taken by each of 50 workers of a company over the last 6 weeks.
2 0 0 5 8 3 4 1 0 0
7 1 7 1 5 4 0 4 0 1
8 9 7 0 1 2 7 2 5 5
4 3 3 0 0 5 2 1 3 0
2 4 5 0 5 7 5 1 1 0
1. Construct ungrouped frequency distribution
2. How many workers had at least 1 day of sick leave?
3. How many workers had between 3 and 5 days of sick leave?
Solution:
1. Since this data set contains only a relatively small number (9) of distinct or
different values, it is convenient to represent it in a frequency table which
presents each distinct value along with its frequency of occurrence.
2. Since 12 of the 50 workers had no days of sick leave, the answer is 50 − 12 =
38.
3. The answer is the sum of the frequencies for values 3, 4 and 5 that is
4 + 5 + 8 = 17.
R The class limits should have the same decimal place value as the data,
but the class boundaries should have one additional place value and
end in a 5.
For example, if the values in the data set are whole numbers, such as 59, 68,
and 82, the limits for a class might be 58 – 64, and the boundaries are 57.5 –
64.5. Find the boundaries by subtracting 0.5 from 58 (the lower class limit)
and adding 0.5 to 64 (the upper class limit).
If the data are in tenths, such as 6.2, 7.8, and 12.6, the limits for a class
hypothetically might be 7.8–8.8, and the boundaries for that class would be
7.75 – 8.85. Find these values by subtracting 0.05 from 7.8 and adding 0.05
to 8.8.
• Class width for a class in a frequency distribution is the difference beteween
the lower (or upper) class limit of one class and the lower (or upper) class
limit of the next class.
• The class midpoint Xm is obtained by adding the lower and upper boundaries
and dividing by 2, or adding the lower and upper limits and dividing by 2:
lower boundary + upper boundary
Xm =
2
or
lower limit + upper limit
Xm =
2
using 7 classes.
112 100 127 120 134 118 105 110 109 112
110 118 117 116 118 122 114 114 105 109
107 112 114 115 118 117 118 122 106 110
116 108 110 121 113 120 119 111 104 111
120 113 120 117 105 110 118 112 114 114
Solution:
The cumulative frequency distribution for the data in this example is as follows:
Cumulative frequency
Less than 99.5 0
Less than 104.5 2
Less than 109.5 10
Less than 114.5 28
Less than 119.5 41
Less than 124.5 48
Less than 129.5 49
Less than 134.5 50
R (Remember that the lines for the frequency polygon begin and end on
the x axis while the lines for the ogive begin on the x axis.)
2.3.1 Histogram
Definition 2.3.1 The histogram is a graph that displays the data by using con-
tiguous vertical bars (unless the frequency of a class is 0) of various heights to
represent the frequencies of the classes.
Definition 2.3.2 The frequency polygon is a graph that displays the data by
using lines that connect points plotted for the frequencies at the midpoints of the
classes. The frequencies are represented by the heights of the points.
2.3.3 Ogive
Definition 2.3.3 The ogive is a graph that represents the cumulative frequencies
for the classes in a frequency distribution.
Cumulative frequency graphs are used to visually represent how many values are
below a certain upper class boundary.
Exercise 2.3 1. Use the data set, which represents the student-to-faculty
ratios for 20 public colleges.
13 15 15 8 16 20 28 19 18 15 21 23 30 17 10 16 15 16 20 15
(a) Construct a frequency distribution for the data set using five classes.
Include class limits, midpoints, boundaries, frequencies, relative
frequencies, and cumulative frequencies.
(b) Construct histogram , frequency polygon and ogive curve .
2. Using the histogram shown here, do the following.
(a) Construct a frequency distribution; include class limits, class fre-
quencies, midpoints, and cumulative frequencies.
(b) Construct a frequency polygon.
(c) Construct an ogive.
Sample mean
x1 + x2 + x3 + · · · + xn ∑ xi
x̄ = = (3.1)
n n
where n represents the total number of values in the sample.
population mean
x1 + x2 + x3 + · · · + xN ∑ xi
µ = = (3.2)
N N
where N represents the total number of values in the population.
Example 3.1 The monthly starting salaries for a sample of 12 Business school
graduates is shown. Find the mean.
Solution:
∑ xi x1 + x2 + x3 + · · · + x12
x̄ = =
n 12
3450 + 3550 + 3650 + · · · + 3480 42, 480
= = = 3540
12 12
Example 3.2 The data show the number of patients in a sample of six hospitals
who acquired an infection while hospitalized. Find the mean.
110 76 29 38 105 31
Solution:
∑ xi 110 + 76 + 29 + 38 + 105 + 31 389
x̄ = = = = 64.8
n 6 6
The mean of the number of hospital infections for the six hospitals is 64.8.
R
1. It is possible to calculate the combined (or pooled) arithmetic mean of
two or more than two sets of data of the same nature. Let x̄1 and x̄2 be
arithmetic means of two sets of data of the same nature of size n1 and
n2 respectively. Then their combined A.M. can be calculated as
n1 x̄1 + n2 x̄2
x̄12 =
n1 + n2
2. While compiling the data for calculating A.M., it is possible that
we may wrongly read and/or write certain number of observations.
In such a case, the correct value of A.M. can be calculated first by
subtracting the sum of observations wrongly recorded from ∑ xi (total
of all observations) and then adding the sum of the correct observations
to it. The result is then divided by the total number of observations.
Example 3.3 The mean salary paid to 1500 employees of an organization was
found to be Br12,500. Later on, after disbursement of salary, it was discovered
that the salary of two employees was wrongly entered as Br 15,760 and 9590.
Their correct salaries were Br 17,760 and 8590. Calculate correct mean.
Solution: Let xi , (i = 1, 2, ..., 1500) be the salary of ith employee. Then, we are
given that
1
x̄ = x = 12, 500
1500 ∑
∑ x = 12, 500 × 1500 = 1, 87, 50, 000
This gives the total salary disbursed to all 1500 employees. Now after adding
the correct salary figures of two employees and subtracting the wrong salary figures
posted against two employees, we have
∑ x = 1, 87, 50, 000+ (Sum of correct salaries figures) - (Sum of wrong salaries
figures)
= 1,87,50,000 + (17,760 + 8590) - (15,760 + 9590)
= 1,87,50,000 + 26,350 - 25,350 = Br1,87,51,000
1, 87, 51, 000
Thus, the correct mean salary is given by x̄ = = Br 12500.67
1500
Example 3.4 There are two units of an automobile company in two different
cities employing 760 and 800 persons, respectively. The arithmetic means of
monthly salaries paid to persons in these two units are $18,750 and $ 16,950
respectively. Find the combined arithmetic mean of salaries of the employees in
both the units.
In other words, G.M. of a set of n observations is the nth root of their product.
GM measure the rate of change of a variable over time.
Step 2 Find the midpoints of each class and place them in column C
Step 3 Multiply the frequency by the midpoint for each class, and place the product
in column D.
Step 4 Find the sum of column D
Step 5 Divide the sum obtained in column D by the sum of the frequencies obtained
in column B.
Example 3.6 The data represent the number of miles run during one week for
a sample of 20 runners. Find the mean.
Solution:
A B C D
Class Frequency Midpoint xm f .xm
5.5 – 10.5 1
10.5 – 15.5 2
15.5 – 20.5 3
20.5 – 25.5 5
25.5 – 30.5 4
30.5 – 35.5 3
35.5 – 40.5 2
n = 20
∑ f .xm 490
∴ x̄ = = = 24.5
n 20
For example, wheat, rice, and pulses may be more important when compared with
ciga- rettes, tea, and other luxury items.
Definition 3.1.1 The weighted mean of a variable X is multiplying each value
by its corresponding weight and dividing the sum of the products by the sum of
the weights.
∑ wX ∑ w1 X1 + w2 X2 + · · · + wn Xn
X̄ = =
∑w ∑ w1 + w2 + · · · + wn
where w1 + w2 + · · · + wn are the weights and X1 , X2 , . . . , Xn are the values.
Solution:
Exercise 3.1 1. You are taking a class in which your grade is determined
from five sources: 50% from your test mean, 15% from your midterm,
20% from your final exam, 10% from your computer lab work, and 5%
from your homework. Your scores are 86 (test mean), 96 (midterm),
82 (final exam), 98 (computer lab), and 100 (homework). What is the
weighted mean of your scores? The minimum average for an A is 90. Did
you get an A?
2. An investor is fond of investing in equity shares. During a period of falling
prices in the stock exchange, a stock is sold at $ 120 per share on one day,
$ 105 on the next and $ 90 on the third day. The investor has purchased
50 shares on the first day, 80 shares on the second day and 100 shares on
the third day. What average price per share did the investor pay?
Example 3.8 The number of police officers killed in the line of duty over the
last 11 years is shown. Find the median.
177 153 122 141 189 155 162 165 149 157 240
Solution: The median number of police officers killed for the 11-year period is
157.
Example 3.10 Based on the grouped data below, find the median:
Solution: Since 10 occurred 3 times—a frequency larger than any other num-
ber—the mode is 10
Example 3.12 The data show the number of licensed nuclear reactors in the
United States for a recent 15-year period. Find the mode.
104, 107, 109, 104, 109, 111, 104, 109, 112, 104, 109, 111, 104, 110, 109
Solution: Since the values 104 and 109 both occur 5 times, the modes are 104 and
109. The data set is said to be bimodal.
Example 3.13 The number of accidental deaths due to firearms for a six-year
period is shown. Find the mode.
649, 789, 642, 613, 610, 600
where :
i is the class width
∆1 is the difference between the frequency of class mode and the frequency of
the class before the class mode i.e., ∆1 = fcm − fcm−1
∆2 is the difference between the frequency of class mode and the frequency of
the class after the class mode ∆2 = fcm − fcm+1
Lm0 is the lower boundary of class mode
Example 3.14 Find the modal class for the frequency distribution of miles
that 20 runners ran in one week,
Class boundaries Frequency
5.5 – 10.5 1
10.5 – 15.5 2
15.5 – 20.5 3
20.5 – 25.5 5 ←− Modal class
25.5 – 30.5 4
30.5 – 35.5 3
35.5 – 40.5 2
R The mode is the only measure of central tendency that can be used in finding
the most typical case when the data are nominal or categorical.
Example 3.15 The number of bank failures for a recent five-year period is
shown. Find the midrange.
3, 30, 148, 157, 71
Solution: The lowest data value is 3, and the highest data value is 157.
3 + 157
MR = = 80
2
The midrange for the number of bank failures is 80.
dispersion indicates high uniformity of the items, while large dispersion indicates
less uniformity. Consider the following marks of two students.
Student 1 Student 2
68 85
75 90
65 80
67 25
70 65
Both have got a total of 345 and an average of 69 each. The fact is that the second
student has failed in one paper. When the averages alone are considered, the two
students are equal. But first student has less variation than second student. Less
variation is a desirable characteristic.
Definition 3.2.1 The range is the highest value minus the lowest value. The
symbol R is used for the range.
Example 3.16 A testing lab wishes to test two experimental brands of outdoor
paint to see how long each will last before fading. The testing lab makes 6
gallons of each paint to test. Since different chemical agents are added to each
group and only six cans are involved, these two groups constitute two small
populations. The results (in months) are shown. Find the range of each group.
Brand A 10 60 50 30 40 20
Brand B 35 45 30 35 40 25
R = 60 − 10 = 50 months
R = 45 − 25 = 20 months
Example 3.17 Find the range for the following frequency distribution; which
shows the distribution of the maximum loads supported by a certain number of
cables.
3.3.1 Percentiles
A percentile provides information about how the data are spread over the interval
from the smallest value to the largest value.
Definition 3.3.1 Percentiles divide the data set into 100 equal groups.
Example 3.18 A teacher gives a 20-point test to 10 students. The scores are
shown here. Find the percentile rank of a score of 12.
Example 3.19 Using the scores in Example 3.18, find the value corresponding
to the 25th percentile.
Solution: Arrange the data in order from lowest to highest: 2, 3, 5, 6, 8, 10, 12,
15, 18, 20
n.p 10 × 25
=⇒ c = = = 2.5
100 100
Since c is not a whole number, round it up to the next whole number; in this case,
c = 3. Start at the lowest value and count over to the third value, which is 5. Hence,
the value 5 corresponds to the 25th percentile.
Example 3.20 Using the scores in Example 3.18, find the value corresponding
to the 60th percentile.
Solution: Here c = 6
Since c is a whole number, use the value halfway between the c and c + 1 values
when counting up from the lowest value—in this case, the 6th and 7th values. Find
10 + 12
it by adding the two values and dividing by 2. = 11
2
Hence, 11 corresponds to the 60th percentile. Anyone scoring 11 would have done
better than 60% of the class.
Step 3 Find the median of the data values that fall below Q2 . This is the value for
Q1 .
Step 4 Find the median of the data values that fall above Q2 . This is the value for
Q3 .
Example 3.21 Find Q1 , Q2 and Q3 for the data set 15, 13, 6, 5, 12, 50, 22, 18.
IQR = Q3 − Q1
Example 3.22 Find the interquartile range for the data set in Example 3.21
Outliers
Definition 3.3.3 An outlier is an extremely high or an extremely low data value
when compared with the rest of the data values.
An outlier can strongly affect the mean and standard deviation of a variable. For
example, suppose a researcher mistakenly recorded an extremely high data value.
This value would then make the mean and standard deviation of the variable much
larger than they really were.
Procedure for Identifying Outliers
Step 1 Arrange the data in order from lowest to highest and find Q1 and Q3 .
Step 2 Find the interquartile range: IQR = Q3 − Q1 .
Step 3 Multiply the IQR by 1.5.
Step 4 Subtract the value obtained in step 3 from Q1 and add the value obtained in
step 3 to Q3 .
Step 5 Check the data set for any data value that is smaller than Q1 − 1.5(IQR)or
larger than Q3 + 1.5(IQR).
Solution: The data value 50 is extremely suspect. These are the steps in checking
for an outlier.
Check the data set for any data values that fall outside the interval from −7.5 to
36.5. The value 50 is outside this interval; hence, it can be considered an outlier.
∑(X − µ)2
σ2 = (3.6)
N
where X = individual value, µ = population mean , N = population size
The population standard deviation is the square root of the variance. The
symbol for the population standard deviation is σ .
∑X
µ=
N
Step 2 Find the deviation for each data value. X − µ
Step 3 Square each of the deviations. (X − µ)2
Step 4 Find the sum of the squares. ∑(X − µ)2
Step 5 Divide by N to get the variance.
Step 6 Take the square root of the variance to get the standard deviation.
Definition 3.3.5 Sample variance
∑(X − X̄)2
s2 = (3.8)
n−1
Sample standard deviation
s
∑(X − X̄)2
s= (3.9)
n−1
Example 3.24 Find the variance and standard deviation for brand B paint data
in Example 3.16. The months brand B lasted before fading were
A B C
X X −µ (X − µ)2
35 0 0
45 10 100
30 -5 25
35 0 0
40 5 25
25 -10 100
∑ = 250
Moybon W.@ ASTU 2022 Introduction to Statistics
50 Measures of central Tendency and Dispersion
∑ fi (Xmi − µ)2
σ2 =
N
∑ fi (Xmi − X̄)2
s2 =
n−1
Example 3.25 The following are the frequency distribution of the time in days
required to complete year-end audits:
Solution:
A B C D
Xm (class mid point) f .Xm Xm − x̄ f .(Xm − x̄)2
12 48 -7 196
17 136 -2 32
22 110 3 45
27 54 8 128
32 32 13 169
n = 20 ∑ f xm = 380 ∑ f .(Xm − x̄)2 = 570
∑ f .(Xm − x̄)2 570 √
s2 = = = 30 =⇒ s = 30 ≈ 5.5
n−1 19
Example 3.26 The mean of the number of sales of cars over a 3-month period
is 87, and the standard deviation is 5. The mean of the commissions is br. 5225,
and the standard deviation is br. 773. Compare the variations of the two.
Example 3.27 The following are the number of visit made by ten mothers to
the local doctor’s surgery.
8, 6, 5, 5, 7, 4, 5, 9, 7, 4
Example 3.28 Find mean deviation about mean, and median for the following
distributions.
Item Frequency
2–4 20
4–6 40
6–8 30
8 – 10 10
remains incomplete until we measure the degree to which these individual values
in the data set deviate from symmetry on both sides of the central value and
the direction in which these are distributed. This analysis is important due to
the fact that data sets may have the same mean and standard deviation but the
frequency curves may differ in their shape. A frequency distribution of the set of
values that is not ‘symmetrical (normal)’ is called asymmetrical or skewed. In a
skewed distribution, extreme values in a data set move towards one side or tail of a
distribution, thereby lengthening that tail. When extreme values move towards the
upper or right tail, the distribution is positively skewed. When such values move
towards the lower or left tail, the distribution is negatively skewed.
The z score represents the number of standard deviations that a data value falls
above or below the mean.
Example 3.29 A student scored 65 on a Maths for management test that had
a mean of 50 and a standard deviation of 10; she scored 30 on a Civic test with a
mean of 25 and a standard deviation of 5. Compare her relative positions on the
two tests.
Solution: First, find the z scores. For Maths for management the z score is
x − x̄ 65 − 50
z= = = 1.5
s 10
For Civic the z score is
x − x̄ 30 − 25
z= = = 1.0
s 5
Since the z score for Maths for management is larger, her relative position in the
mathematics class is higher than her relative position in the Civic class.
R Note that if the z score is positive, the score is above the mean. If the z score
is 0, the score is the same as the mean. And if the z score is negative, the
score is below the mean.
Example 3.30 Find the z score for each test, and state which is higher.
Solution: The score for test A is relatively higher than the score for test B.
3.3.6 Skewness
Skewness is the degree of asymmetry or departure from symmetry of a distribution.
A skewed frequency distribution is one that is not symmetrical.
Skewness is concerned with the shape of the curve not size.
Test of skewness
1. If Mean = Median = Mode, then there is no skewness in the distribution. In
other words, the curve of the frequency distribution would be symmetrical
or bell shaped.
2. If arthimetic mean is less than the values of the mode, the tail of a sym-
metrical distribution is on the left side, i.e., the distribution is negatively
skewed.
3. If arthimetic mean is greater than the values of the mode, the tail of a
symmetrical distribution is on the right side, i.e., the distribution is posetively
skewed.
Figure 3.1:
Measure of Skewness
The difference between the mean and mode gives as absolute measure of skewness.
If we divide this difference by standard deviation we obtain a relative measure of
skewness known as coefficient and denoted by SK. Karl Pearson coefficient of
Skewness
SK = Mean − Mode/S.D
SK = 3(Mean − Median)/S.D
SK = Q1 + Q3 − 2Median/Q3 − Q1
∑(xi − µ)r
µr =
n
Kurtosis
All the three curves are symmetrical about the mean and have same variation
(range). In order to identify a distribution completely we need one more measure
which Prof. Karl Pearson called ‘convexity of the curve’ or its ‘Kurtosis’. While
skewness helps us in identifying the right or left tails of the frequency curve,
kurtosis enables us to have an idea about the shape and nature of the hump (middle
part) of a frequency distribution. In other words, kurtosis is concerned with the
Figure 3.2:
4 — Introduction to Probability
Note that the probability of 1 represents something that is certain to happen, and
the probability of 0 represents something that cannot happen.
60 Introduction to Probability
The closer a probability is to 0, the more improbable it is the event will happen.
The closer the probability is to 1, the more sure we are it will happen.
Example 4.1 .
Experiment Outcomes
Tossing of a fair coin Head, tail
Rolling a die 1, 2, 3, 4, 5, 6
Selecting an item from a production lot defective (faulty), non-defective (good)
Introducing a new product Success, failure
Play a football game Win, lose, tie
Solution: S = {H, T }
Example 4.3 Find the sample space for the gender of the children if a
family has three children. Use B for boy and G for girl.
of number 8.
=⇒ A = {1, 3, 5} B = {2, 4, 6} C = 0/ or impossible
event
A favorable outcome is the outcome of interest. For instance, one can define
a favorable outcome in the flip of a coin as a tail.
11. Independent Events: are not affected by previous events. Two events A
and B are said to be independent events if the occurrence of event A has no
influence (bearing) on the occurrence of event B. For example, if two fair
coins are tossed, then the result of one toss is totally independent of the result
of the other toss. i.e., What it did in the past will not affect the current toss!
The probability that a head will be the outcome of any one toss will always
1
be , irrespective of whatever the outcome is of the other toss. Hence, these
2
two events are independent. On the other hand, consider drawing two cards
from a pack of 52 playing cards. The probability that the second card will be
an ace would depend up on whether the first card was an ace or not. Hence
these two events are not independent events.
Another example A bag contains balls of two different colours say yellow
and white. Two balls are drawn successively .First ball is drawn from a
bag and replaced after notes its colour. Let us assume that it is yellow and
denote this event by A. Another ball is drawn from the same bag and its
colour is noted let this event denoted by B. Clearly, the result of first draw
has no effect on the result of the second draw. Hence, the events A and B are
independent events.
12. Equally likely outcomes: In a certain experiment, if each outcome in the
sample space has the same chance to occur, then we say that the outcomes
are equally likely outcomes.
Addition rule
Suppose that a procedure designated by 1, can be performed in n1 ways. Assume
that second procedure designated by 2 can be performed in n2 ways. Suppose
further more that it is not possible both procedures 1 and 2 are performed to-
gether. The number of ways in which we can perform 1 or 2 procedures is n1 + n2
ways.
Example 4.5 A student can choose a computer project from one of three lists.
The three lists contain 16, 21, and 13 possible projects, respectively. No project
is on more than one list. How many possible projects are there to choose from?
Solution: The student can choose a project by selecting a project from the first list,
the second list, or the third list. Because no project is on more than one list, by the
sum rule there are 16 + 21 + 13 = 50 ways to choose a project.
Example 4.6 Suppose that we are planning a trip and are deciding between
bus and train transportation. If there are 3 bus routes and 2 train routes to go
from A to B, find the available routes for the trip. There are 3 + 2 = 5 possible
routes for someone to go from A to B.
Multiplication Rule
If one event can occur in m ways and a second event can occur in n ways, then the
number of ways the two events can occur in sequence ism × n. This rule can be
extended to any number of events occurring in sequence.
In words, the number of ways that events can occur in sequence is found by
multiplying the number of ways one event can occur by the number of ways the
other event(s) can occur.
Example 4.7 A coin is tossed and a die is rolled. Find the number of outcomes
for the sequence of events.
Solution: Since the coin can land either heads up or tails up and since the die
can land with any one of six numbers showing face up, there are 2 × 6 = 12
possibilities.
Example 4.8 There are four blood types, A, B, AB, and O. Blood can also be
Rh+ and Rh−. Finally, a blood donor can be classified as either male or female.
How many different ways can a donor have his or her blood labeled?
Solution: Since there are 4 possibilities for blood type, 2 possibilities for Rh factor,
and 2 possibilities for the gender of the donor, there are 4 × 2 × 2 = 16 different
classification categories.
Example 4.9 Assume that a license plate contains two letters followed by
three digits. How many different license plates can be printed?
Solution: Each letter can be printed in 26 ways, and each digit can be printed in 10
ways, so 26.26.10.10.10 = 676000 different plates can be printed.
Exercise 4.1 The access code for a car’s security system consists of four digits.
Each digit can be any number from 0 through 9. How many access codes are
possible when
1. each digit can be used only once and not repeated?
2. each digit can be repeated?
3. each digit can be repeated but the first digit cannot be 0 or 1?
Permutation Rule
A permutation is an arrangement of n objects in a specific order.
1. The number of permutations of n objects taken r at a time is given by
n!
n Pr = = n(n − 1)(n − 2) . . . (n − r + 1)
(n − r)!
4!
Solution: 4 P3 = = 24
(4 − 3)!
2. The number of permutations of n distinct objects taken all together is n! Or
In particular, the number of permutations of n objects taken n at a time is
n!
n Pn = = n!
(n − n)!
Example 4.11 In how many ways 4 people are lined up to get on a bus
(or to sit for photo graph)?
Solution: In 4! = 4 ∗ 3 ∗ 2 ∗ 1 = 24 ways.
3. The number of permutation of n objects taken all at a time, when n1 objects
are alike of one kind, n2 objects are alike of second kind, ..., nk objects are
alike of kth kind is given by:
n!
n1 !.n2 !. . . . .nk !
Example 4.12 Find the number of permutations of the letter for the
word "statistics".
Solution: There are 10 letters in the word "statistics" out of which there are
3s’s, 3t’s, 2i’s and 1a’s. So the number of permutiations of the letters of the
word statistics is:
10!
= 50, 400
3!3!2!1!
Exercise 4.2 An artist has created 20 original paintings, and she will
exhibit some of them in three galleries. Four paintings will be sent to
gallery A, four to gallery B, and three to gallery C. In how many ways can
this be done?
Example 4.13 RVU Registrar Office want to give identity number for
students by using 4 digits. The number should be considered by the fol-
lowing numbers only: {0, 1, 2, 3, 4, 5, and 6}. Hence, how many different
ID Numbers could be preferred by the Registrar
(a) Without repeating the number? (b) With repetition of numbers?
Solution: (a) The possible number of ID numbers given for students with
out repeating the digit is
7!
n Pr =7 P4 = = 840 (b) The possible number of
(7 − 4)!
ID numbers given for students with repeating the digit is
nr = 74 = 2401
Exercise 4.3 In how many different ways can a quiz be answered under each of
the following conditions?
1. The quiz consists of three multiple-choice questions with four choices for
each.
2. The quiz consists of three multiple-choice questions (with four choices
for each) and five true–false questions.
Combinations Rule
Combination is the selection of objects without regarding order of arrangement.
A combination of n different objects taken r at a time is a selection of r out of
n objects, with no attention given to the order of arrangement. The number of
n
combinations of n objects taken r at a time is denoted by the symbol or nCr is
r
given by
n n!
=
r r!(n − r)!
Example 4.14
The number of combinations of letter a, b, & c taken two at a
3 3!
time is nCr = = =3
2 2!(3 − 2)!
Exercise 4.4 Suppose in the box 3 red, 3 white and 5 black equal sized balls
are there. We want to draw 3 balls at a time. How many ways do we have from
each type?
Events on set
If A and B are two events then
• A ∪ B the happening of at least event A or B.
• A ∩ B the simultaneously happening of both events A and B.
• A0 or Ac A does not happen (complement of event A)
• Ac ∩ Bc neither A nor B happens
R
• Complementary event P(E c ) = 1 − P(E)
• Addition Law P(A or B) = P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
If A and B are mutually exclusive events, then P(A ∪ B) = P(A) + P(B)
• Multiplication law P(A∩B) = P(B)P(A\B) or P(A∩B) = P(A)P(B\A)
Example 4.16 Assume there are 2 blue candies, 5 red candies and 3 yellow
candy. Find the probability of the blue candies.
Solution: There are a total of 10 candies. Thus the probability of the blue candies
is:
n(B) 2
P(B) = = = 0.2
n(s) 10
Example 4.17 From a production run of 5000 light bulbs, 2% of which are
defective, 1 bulb is selected at random. What is the probability that the bulb is
defective? What is the probability that it is not defective?
4 13 1 16
P(E1 ∪ E2 ) = P(E1 ) + P(E2 ) − P(E1 ∩ E2 ) = + − =
52 52 52 52
Exercise 4.5 1. A fair die is rolling once. What is the probability of getting
(a) Number 4? (b) An odd number?
(c) An even number? (d) Number 8?
2. A store receives 3 red, 6 white, and 7 blue shirts. Two shirts are drawn at
random. Determine the probability that:
1. Both the shirts are white
2. Both the shirts are blue
3. One shirt is red and the other is white
4. One shirt is white and the other shirt is blue.
Example 4.19 If 1000 tosses of a coin result in 529 heads, the relative fre-
quency of heads is 529/1000 = 0.529. If another 1000 tosses results in 493
heads, the relative frequency in the total of 2000 tosses is
529 + 493
= 0.511
2000
According to the statistical definition, by counting in this manner we should
ultimately get closer and closer to a number that represents the probability of a
head in a single toss of the coin. From the results so far presented, this should be
0.5 to one significant figure.
Solution:
21 22 5
(a) P(O) = = 0.42 (b) P(A or B) = P(A) + P(B) = + = 0.44 +
50 50 50
0.1 = 0.54
5 2 7
(c) P( neither A nor O) = + = = 0.14
50 50 50
(Neither A nor O means that a person has either type B or type AB blood.)
2 24
(d) P(not AB) = 1 − P(AB) = 1 − = = 0.96
50 25
SUBJECTIVE CONCEPT OF PROBABILITY
If there is little or no experience or information on which to base a probability,
it may be arrived at subjectively. Essentially, this means an individual evaluates the
available opinions and information and then estimates or assigns the probability.
The likelihood (probability) of a particular event happening that is assigned by an
individual based on whatever information is available.
Example 4.21 For a given patient’s health and extent of injuries, a doctor may
feel that the patient has a 90% chance of a full recovery.
Or a business analyst may predict that the chance of the employees of a certain
company going on strike is 0.25.
Estimating the likelihood you will be married before the age of 30.
P(A ∩ B) P(A ∩ B)
P(A|B) = or P(B|A) =
P(B) P(A)
Solution: The probability of a person to be male given that it has a good perfor-
mance is
P(M ∩ G) 60/120 60
P(M|G) = = = = 0.71
P(G) 85/120 85
Ex. Find the probability of a person to be female given that it has a poor perfor-
mance.
Example 4.23 A jar contains black and white marbles. Two marbles are
chosen without replacement. The probability of selecting a black marble and a
white marble is 0.34, and the probability of selecting a black marble on the first
draw is 0.47. What is the probability of selecting white marble on the second
draw, given that the first marble drawn is black?
Exercise 4.7 1. The probability that it is Friday and that a student is absent
is 0.03. Since there are 5 schooldays in a week, the probability that it is
Friday is 0.2. What is the probability that a student is absent given that
today is Friday?
2. Suppose that an office has 100 calculating machines. Some of them use
electric power (E) while others are manual (M) and some machines are
well known (N) while others are used (U). The table below gives numbers
E M Total
N 40 30 70
U 20 10 30
Total 60 40 100
3. In a firm 20% of the employees have an accounting background, while 5%
of the employees are excutives and have an accounting backgrounds. If
an employee has accounting background, what is the probability taht the
employee is an excutive?
Solution: Let D denote the event that a household subscribes to the daily edition,
then
. P(D) = 0.84, and P(S|D) = 0.75. Thus,
P(S ∩ D) = P(D)P(S|D) = 0.84 × 0.75 = 0.63
Hence, 63% of the households subscribe to both the Sunday and daily editions.
Multiplication Law for independent Event: P(A ∩ B) = P(A)P(B)
Example 4.25 A coin is flipped and a die is rolled. Find the probability of
getting a head on the coin and a 4 on the die.
Solution: The sample space for the coin is H, T ; and for the die it is 1, 2, 3, 4, 5, 6.
1 1 1
P(H ∩ 4) = P(H)P(4) = . = = 0.083
2 6 12
Example 4.26 A box contains four black and six white balls. What is the
probability of getting two black balls in drawing one after the other under the
following conditions? a. The first ball drawn is replaced
b. The first ball drawn is not replaced
Exercise 4.8 A contractor is bidding for two projects with company A and
company B. The contractor estimates that the probability of obtaining the project
with company A is 0.45. He also fells that if he should get the project with
company A then there is a 0.90 probability that company B will also give him
the project. What are the contractor’s chances of getting both projects?
formula:
P(Ai ∩ B)
P(Ai |B) =
P(B)
where the posterior probability of events Ai given event B is the conditional prob-
ability P(Ai |B). Since events A1 , A2 , ..., An are mutually exclusive and collec-
tively exhaustive, the event B is bound to occur with either A1 , A2 , ..., An . That is,
B = (A1 ∩ B) ∪ (A2 ∩ B) ∪ ... ∪ (An ∩ B) where the posterior probability of Ai given
B is the conditonal probability P(Ai |B).
P(A|Y ).P(Y )
P(Y |A) =
P(X).P(A|X) + P(Y ).P(A|Y ) + P(Z).P(A|Z)
(0.07)(1/3)
= = 0.33
(1/3)(0.02) + (1/3)(0.07) + (1/3)(0.12)
P(C ∩ E) P(C)P(E|C)
P(C|E) = =
P(E) P(A)P(E|A) + P(B)P(E|B) + P(C)P(E|C)
Tabular Approach
A tabular approach is helpful in conducting the Bayes’ theorem calculations.
The computations shown there are done in the following steps.
Step 1. Prepare the following three columns:
Column 1—The mutually exclusive events A i for which posterior probabilities
are desired
Column 2—The prior probabilities P(Ai ) for the events
Column 3—The conditional probabilities P(B|Ai ) of the new information B
given each event.
Step 2. In column 4, compute the joint probabilities P(Ai ∩ B) for each event and
the new information B by using the multiplication law. These joint probabilities
are found by multiplying the prior probabilities in column 2 by the corresponding
conditional probabilities in column 3; that is,P(Ai ∩ B) = P(Ai )P(B|Ai ).
Step 3. Sum the joint probabilities in column 4. The sum is the probability of
the new information, P(B).
Step 4. In column 5, compute the posterior probabilities using the basic relation
ship of conditional probability.
P(Ai ∩ B)
P(Ai |B) =
P(B)
Note that the joint probabilities P(Ai ∩ B) are in column 4 and the probability
P(B) is the sum of column 4
Exercise 4.9 1. Suppose your firm has two suppliers of a particular part
used in the assembly of your product. You get 60% of the parts from
supplier A and the rest from supplier B. 2% of the parts from A and 5%
of the parts from B are defective. If you select a part at random and it is
defective, what is the probability that it came from A? B?
2. A consulting firm submitted a bid for a large research project. The firm’s
management initially felt they had a 50 - 50 chance of getting the project.
However, the agency to which the bid was submitted subsequently re-
quested additional information on the bid. Past experience indicates that
for 75% of the successful bids and 40% of the unsuccessful bids the agency
requested additional information.
(a) What is the prior probability of the bid being successful (that is, prior
to the request for additional information)?
(b) What is the conditional probability of a request for additional informa-
tion given that the bid will ultimately be successful?
(c) Compute the posterior probability that the bid will be successful given
a request for additional information.
3. A local bank reviewed its credit card policy with the intention of recalling
some of its credit cards. In the past approximately 5% of cardholders de-
faulted, leaving the bank unable to collect the outstanding balance. Hence,
management established a prior probability of .05 that any particular card-
holder will default. The bank also found that the probability of missing a
monthly payment is .20 for customers who do not default. Of course, the
probability of missing a monthly payment for those who default is 1.
(a) Given that a customer missed one or more monthly payments, com-
pute the posterior probability that the customer will default.
(b) The bank would like to recall its card if the probability that a cus-
tomer will default is greater than .20. Should the bank recall its card if the
customer misses a monthly pay- ment? Why or why not?
5 — Probability Distributions
S = {1, 2, 3, 4, 5, 6}
Let the random variable X denotes the outcomes ’ a number greater than 2
occurs’. Then the random variable can assume the values 3, 4, 5 or 6
P(x) ≥ 0 (5.1)
∑ P(x) = 1 (5.2)
Solution: Since the sample space is 1, 2, 3, 4, 5, 6 and each outcome has a proba-
1
bility of , the distribution is as shown.
6
Outcomes X 1 2 3 4 5 6
1 1 1 1 1 1
Probability P(X)
6 6 6 6 6 6
When probability distributions are shown graphically, the values of X are placed
on the x axis and the probabilities P(X) on the y axis. These graphs are help-
ful in determining the shape of the distribution (right-skewed, left-skewed, or
symmetric).
Example 5.3 Represent graphically the probability distribution for the sample
space for tossing three coins.
Note that for visual appearances, it is not necessary to start with 0 at the ori-
gin.
Variance
The mean does not describe the amount of spread or variation of a distribuition.
The variance and standard devation allows us to compare the variation in two
Example 5.4 A car dealer has established the following probability distribu-
tion for the number of cars he expects to sell on a particular Saturday. Find the
variance and standard devation.
Number of car sold X 0 1 2 3 4
Probability P(X) 0.1 0.2 0.3 0.3 0.1
(d) The probability of a success must remain the same for each trial. So
does the probability of a failure. This implies that the probability of
failure of any trial is q = 1 − probability of sucesses = 1 − p.
The word success does not imply that something good or positive has occurred. For
example, in a probability experiment, we might want to select 10 people and let S
represent the number of people who were in an automobile accident in the last six
months. In this case, a success would not be a positive or good thing.
Solution:
a. No. There are five possible outcomes: freshman, sophomore, junior, senior,
and graduate student.
b. Yes. All four requirements are met.
c. No. Since the cards are not replaced, the events are not independent.
d. Yes. All four requirements are met.
e. No. There can be more than two categories for the answers.
n!
P(r) = pr qn−r
(n − r)!r!
where
• P(r) = The probability of success
• p = The numerical probability of success
• q = The numerical probability of a failure
• n = The number of trials
• r = The number of successes in n trials
In binomial experiments, the outcomes are usually classified as successes or fail-
ures.
For example, the correct answer to a multiple-choice item can be classified as a
success, but any of the other choices would be incorrect and hence classified as
a failure. The notation that is commonly used for binomial experiments and the
binomial distribution is defined now.
Example 5.6 Suppose that 40% of all customers who enter a department store
make a purchase. What is the probability that 2 of the next 3 customers will
make a purchase?
1
a. none out of four correct? P(0) = = 0.0625
16
4
b. one out of four correct? P(1) = = 0.25
16
Example 5.9 Suppose that 40% of the people entering a store make a
purchase. If 10 people enter the store, find the expected number of people
making a purchase?
Solution: µ = np = 10(0.4) = 4
µ r e−µ 45 (2.71828)−4
P(r = 5) = = = 0.1563
r! 5!
Example 5.11 Suppose in Tekure Ambessa Hospital, the average new born
female baby in every 24 hour is 7. What is the probability that
text (a) No female babies are born in a day?
(b) Only three female babies are born per day?
(c) 2 female babies are born in 12 hours?
Example 5.12 The avarage number of traffic accidents in Addis Ababa city is
2 per week. Find the probability of
(a) No accident during the first week period?
(b) At most three during a 2 week period?
20 e−2
Solution: (a). µ = 2 =⇒ P(r = 0) = = e−2 = 0.135335283
0!
(b) µ = 4. “At most 3 accidents” means 0, 1, 2, or 3 accidents. Hence
R N −R
r n−r
RC (N−R)C r n−r
P(r) = n
=
NC N
n
where
r = the number of successes
n = the number of trials
P(r) = the probability of x successes in n trials
N = the number of elements in the population
R = the number of elements in the population labeled success
Example 5.13 Ten people apply for a job as assistant manager of a restaurant.
Five have completed college and five have not. If the manager selects 3 applicants
at random, find the probability that all 3 are college graduates.
Example 5.14 A recent study found that 2 out of every 10 houses in a neigh-
borhood have no insurance. If 5 houses are selected from 10 houses, find the
probability that exactly 1 will be uninsured.
5
Solution:P(r) =
9
9. The propability that a random variable will have a value between two points
is equal to the area under the curve between these two points.
x−µ
z=
σ
Emperical: The area under the part of a normal curve that lies
1. within 1 standard deviation of the mean is approximately 0.68, or 68%;
2. within 2 standard deviations, about 0.95, or 95%; and
3. within 3 standard deviations, about 0.997, or 99.7%;
See Figure below, which also shows the area in each region.
Finding Areas Under the Standard Normal Distribution Curve
Step 1 Draw the normal distribution curve and shade the area.
Step 2 Find the appropriate figure in the Procedure Table and follow the directions
given.
Areas under the standard normal distribution curve have been tabulated in various
ways. The most common ones are the areas between Z = 0 and a positive value
of Z Given a normal distributed random variable X with Mean µ and standard
deviation σ
a−µ x−µ b−µ
P(a < x < b) = P < <
σ σ σ
a−µ b−µ
=⇒ P(a < x < b) = P <z<
σ σ
Example 5.15 1. The life time of a certain kind of electronic devices have
a mean of 300hr and standard deviation of 25hr. Assuming that the distri-
bution is normally distributed, what percentages of the electronic device
will have life of ;
(a) Between 300hrs and 310hrs.
(b) Between 290hrs and 300hrs.
(c) Less than 310hrs.
(d) Less than 290hrs.
(e) Between 290hrs and 310hrs.
(f) Between 250hrs and 290hrs.
2. A life test on large number of batteries revealed that the mean life time
of batteries before failure is 19hrs, The useful life of the battery follows
normal distribution with a standard deviation of 1.2hrs.
Required
(a) About what two values that 68% of the batteries fail?
(b) About what two values that 95% of the batteries fail?
(c) About what two values that all of the batteries fail?
3. The daily demand for coca-cola in a certain cafeteria is normally distributed
with mean of 200 and standard deviation of 20.
Required
(a) What is the probability that the daily demand on the given day is
i. 200 and 230 bottles?
ii. 190 and 200 bottles?
iii. Greater than 230 bottles?
iv. Fewer than 190 bottles?
v. Between 190 and 230 bottles?
(b) About what two values that 68% of daily demand lie?
(c) About what two values that 95% of daily demand lie?
(d) About what two values that all of the daily demands expects to lie?
4. Find the area under the standard normal distribution which lies a) Between
Z = 0 and Z = 0.96
b) Between Z = −1.45 and Z = 0
c) To the right of Z = −0.35
5. Find the value of Z if
a) The normal curve area between 0 and z(positive) is 0.4726
b) The area to the left of z is 0.9868